numam-dpdk

Author	SHA1	Message	Date
Jiawen Wu	dee93977a6	net/ngbe: support MAC filters Add MAC addresses to filter incoming packets, support to set multicast addresses to filter. And support to set unicast table array. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	506abd4a8b	net/ngbe: support FW version query Add firmware version get operation. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	b83372a030	net/ngbe: support device promiscuous and allmulticast mode Support to enable/disable promiscuous and allmulticast mode for a port. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	07baabb6a5	net/ngbe: support MTU set Support updating port MTU. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	8b433d04ad	net/ngbe: support device xstats Add device extended stats get from reading hardware registers. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	fdb1e85197	net/ngbe: support basic statistics Support to read and clear basic statistics. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	59b46438fd	net/ngbe: support VLAN offload and VLAN filter Support to set VLAN and QinQ offload, and filter of a VLAN tag identifier. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	586e602837	net/ngbe: support jumbo frame Add to support Rx jumbo frames. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	64b36e4af1	net/ngbe: support CRC offload Support to strip or keep CRC in Rx path. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	d148a87e69	net/ngbe: support Rx/Tx burst mode info Support to get Rx/Tx burst mode info. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	9f32061402	net/ngbe: support TSO Add transmit datapath with offloads, and support TCP segmentation offload. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	ffc959f5b3	net/ngbe: support Rx checksum offload Support IP/L4 checksum on Rx, and convert it to mbuf flags. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	79f3128d4d	net/ngbe: support scattered Rx Add scattered Rx function to support receiving segmented mbufs. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Jiawen Wu	f6aef1dacf	net/ngbe: support packet type query Add packet type macro definition and convert ptype to ptid. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-30 00:53:19 +02:00
Miao Li	327fcd2d38	net/vhost: support power monitor According to current semantics of power monitor, this commit adds a callback function to decide whether aborts the sleep by checking current value against the expected value and vhost_get_monitor_addr to provide address to monitor. When no packet come in, the value of address will not be changed and the running core will sleep. Once packets arrive, the value of address will be changed and the running core will wakeup. Signed-off-by: Miao Li <miao.li@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: David Hunt <david.hunt@intel.com>	2021-10-29 12:32:29 +02:00
Miao Li	34fd4373ce	vhost: add power monitor API This commit defines rte_vhost_power_monitor_cond which is used to pass some information to vhost driver. The information is including the address to monitor, the expected value, the mask to extract value read from 'addr', the value size of monitor address, the match flag used to distinguish the value used to match something or not match something. Vhost driver can use these information to fill rte_power_monitor_cond. Signed-off-by: Miao Li <miao.li@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: David Hunt <david.hunt@intel.com>	2021-10-29 12:32:29 +02:00
Miao Li	64ac7e08f6	net/virtio: support power monitor According to current semantics of power monitor, this commit adds a callback function to decide whether aborts the sleep by checking current value against the expected value and virtio_get_monitor_addr to provide address to monitor. When no packet come in, the value of address will not be changed and the running core will sleep. Once packets arrive, the value of address will be changed and the running core will wakeup. Signed-off-by: Miao Li <miao.li@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: David Hunt <david.hunt@intel.com>	2021-10-29 12:32:29 +02:00
Maxime Coquelin	0c9d662070	net/virtio: support RSS Provide the capability to update the hash key, hash types and RETA table on the fly (without needing to stop/start the device). However, the key length and the number of RETA entries are fixed to 40B and 128 entries respectively. This is done in order to simplify the design, but may be revisited later as the Virtio spec provides this flexibility. Note that only VIRTIO_NET_F_RSS support is implemented, VIRTIO_NET_F_HASH_REPORT, which would enable reporting the packet RSS hash calculated by the device into mbuf.rss, is not yet supported. Regarding the default RSS configuration, it has been chosen to use the default Intel ixgbe key as default key, and default RETA is a simple modulo between the hash and the number of Rx queues. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-29 11:23:10 +02:00
Ajit Khaparde	9446d7fcd9	doc: remove obsolete option from bnxt guide host-based-truflow devarg is not used anymore to enable host based flow table management functionality TruFlow. Instead this feature is now driven by a capability indicated by the firmware. TruFlow is not in tech preview anymore. Update the doc accordingly. Fixes: `da3731e2ea` ("net/bnxt: check FW capability to support TRUFLOW") Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-28 19:58:54 +02:00
Ajit Khaparde	0e7bdac71d	doc: update NIC feature matrix for bnxt Support for runtime Rx/Tx queue setup and inner RSS is not updated. Update feature matrix for bnxt PMD. Fixes: `7ed45b1a7c` ("net/bnxt: support RSS hash selection") Fixes: `0105ea1296` ("net/bnxt: support runtime queue setup") Cc: stable@dpdk.org Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-28 19:58:54 +02:00
Radu Nicolau	6bc987ecb8	net/iavf: support IPsec inline crypto Add support for inline crypto for IPsec, for ESP transport and tunnel over IPv4 and IPv6, as well as supporting the offload for ESP over UDP, and in conjunction with TSO for UDP and TCP flows. Implement support for rte_security packet metadata Add definition for IPsec descriptors, extend support for offload in data and context descriptor to support Add support to virtual channel mailbox for IPsec Crypto request operations. IPsec Crypto requests receive an initial acknowledgment from physical function driver of receipt of request and then an asynchronous response with success/failure of request including any response data. Add enhanced descriptor debugging Refactor of scalar tx burst function to support integration of offload Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Abhijit Sinha <abhijit.sinha@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Reviewed-by: Jingjing Wu <jingjing.wu@intel.com>	2021-10-29 04:22:04 +02:00
Rongwei Liu	7299ab6822	net/mlx5: support socket direct mode bonding In socket direct mode, it's possible to bind any two (maybe four in future) PCIe devices with IDs like xxxx:xx:xx.x and yyyy:yy:yy.y. Bonding member interfaces are unnecessary to have the same PCIe domain/bus/device ID anymore, Kernel driver uses "system_image_guid" to identify if devices can be bound together or not. Sysfs "phys_switch_id" is used to get "system_image_guid" of each network interface. OFED 5.4+ is required to support "phys_switch_id". Signed-off-by: Rongwei Liu <rongweil@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-26 13:24:20 +02:00
Sean Zhang	80a3233197	app/flow-perf: add destination ports parameter Add optional destination ports parameter for port-id action. The parameter is not must, and the value is 1 by default as before if the parameter not provided. For example: $ dpdk-test-flow-perf -w 08:00.0,representor=[0,1] -- --transfer \ --ingress --transfer --ether --portmask=0x2 --vxlan-encap \ --port-id=0 This command means the rule created on representor 0 with port 0 as destination, since the portmask is 0x2 and dst-ports is 0: $ dpdk-test-flow-perf -w 08:00.0,representor=[0,1] \ -w 08:00.1,representor=[0,1]-- --transfer --ingress --transfer \ --ether --portmask=0x12 --vxlan-encap --port-id=0,3 This command means the rules created on both representor 0 of PF 0 and PF 1, the destination port for the first representor is PF 0, and the destination port for the other one is PF 1. Signed-off-by: Sean Zhang <xiazhang@nvidia.com> Reviewed-by: Wisam Jaddo <wisamm@nvidia.com>	2021-11-05 10:58:25 +01:00
Wojciech Liguzinski	44c730b0e3	sched: add PIE based congestion management Implement PIE based congestion management based on rfc8033. The Proportional Integral Controller Enhanced (PIE) algorithm works by proactively dropping packets randomly. PIE is implemented as more advanced queue management is required to address the bufferbloat problem and provide desirable quality of service to users. Tests for PIE code added to test application. Added PIE related information to documentation. Signed-off-by: Wojciech Liguzinski <wojciechx.liguzinski@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Acked-by: Jasvinder Singh <jasvinder.singh@intel.com>	2021-11-04 15:41:49 +01:00
Vladimir Medvedkin	11c5b9b51a	fib: add RIB extension size parameter This patch adds a new parameter to the FIB configuration to specify the size of the extension for internal RIB structure. Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Tested-by: Conor Walsh <conor.walsh@intel.com>	2021-11-04 12:38:03 +01:00
Vladimir Medvedkin	31d7c06947	hash: add bulk Toeplitz hash implementation This patch adds a bulk version for the Toeplitz hash implemented with Galios Fields New Instructions (GFNI). Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-11-04 11:19:10 +01:00
Vladimir Medvedkin	4fd8c4cb0d	hash: add new Toeplitz hash implementation This patch add a new Toeplitz hash implementation using Galios Fields New Instructions (GFNI). Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-11-04 11:19:10 +01:00
Ady Agbarih	9fa82d287f	regex/mlx5: move RXP to CrSpace Add patch for programming the regex database through ROF file, using the firmware instead of manually through the software. No need to setup the DB anymore, the regex-daemon is responsible for that always. In the new flow the regex driver only has to program ROF rules by using set params DevX cmd, requires ROF mkey creation. The rules file has to be read into 4KB aligned memory. Signed-off-by: Ady Agbarih <adypodoman@gmail.com> Acked-by: Ori Kam <orika@nvidia.com>	2021-11-03 23:14:48 +01:00
Zhihong Peng	6cc51b1293	mem: instrument allocator for ASan This patch adds necessary hooks in the memory allocator for ASan. This feature is currently available in DPDK only on Linux x86_64. If other OS/architectures want to support it, ASAN_SHADOW_OFFSET must be defined and RTE_MALLOC_ASAN must be set accordingly in meson. Signed-off-by: Xueqin Lin <xueqin.lin@intel.com> Signed-off-by: Zhihong Peng <zhihongx.peng@intel.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2021-10-29 16:25:03 +02:00
Zhihong Peng	6e0290250d	build: enable AddressSanitizer AddressSanitizer [1] a.k.a. ASan is a widely-used debugging tool to detect memory access errors. It helps to detect issues like use-after-free, various kinds of buffer overruns in C/C++ programs, and other similar errors, as well as printing out detailed debug information whenever an error is detected. ASan is integrated with gcc and clang and can be enabled via a meson option: -Db_sanitize=address See the documentation for details (especially regarding clang). Enabling ASan has an impact on performance since additional checks are added to generated binaries. Enabling ASan with Windows is currently not supported in DPDK. 1: https://github.com/google/sanitizers/wiki/AddressSanitizer Signed-off-by: Xueqin Lin <xueqin.lin@intel.com> Signed-off-by: Zhihong Peng <zhihongx.peng@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2021-10-29 15:25:34 +02:00
Anatoly Burakov	efb57dd5bb	doc: clarify SRIOV activation with built-in VFIO Currently, the documentation only contains instructions for enabling SRIOV support for VFIO compiled as a module, but doesn't have any instructions on how to do the same for cases where VFIO is built-in. Add these instructions. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2021-10-28 10:14:58 +02:00
Kevin Laatz	bb4141dbe5	examples/dma: rename ioat application example Since the APIs have been updated from rawdev to dmadev, the application should also be renamed to match. This patch also includes the documentation updates for the renaming. Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-27 16:53:39 +02:00
Kevin Laatz	5a5f46be19	examples/ioat: add option to control stats print interval Add a command line option to control the interval between stats prints. Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-27 16:53:39 +02:00
Konstantin Ananyev	8a256e97fb	examples/ioat: add option to control maximum frame size Add command line option for setting the max frame size. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-27 16:53:39 +02:00
Konstantin Ananyev	198db0c9e2	examples/ioat: add option to control DMA batch size Add a commandline options to control the HW copy batch size in the application. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-27 16:53:39 +02:00
Konstantin Ananyev	81ce0906ea	examples/ioat: use always same lcore for enqueue/dequeue Few changes in ioat sample behaviour: - Always do SW copy for packet metadata (mbuf fields) - Always use same lcore for both DMA requests enqueue and dequeue Main reasons for that: a) it is safer, as idxd PMD doesn't support MT safe enqueue/dequeue (yet). b) sort of more apples to apples comparison with sw copy. c) from my testing things are faster that way. Documentation updates to reflect these changes are also included. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-27 16:53:39 +02:00
Sarosh Arif	b54403fd08	examples/l2fwd: add promiscuous mode option The default behaviour of l2fwd is to exit if we are unable to turn promiscuous mode on. On some aws instances turning promiscuous mode on is not permitted. In such cases there should be a way to run the application without promiscuous mode. This patch allows user to turn promiscuous mode on via command line parameter. l3fwd has a similar option available. Signed-off-by: Sarosh Arif <sarosh.arif@emumba.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-10-25 22:31:53 +02:00
Wisam Jaddo	741bb850ee	app/flow-perf: export some config as runtime options Some options are needed in the runtime many times, so leaving it during compilation is not correct. As a result some options has been exported into command line options to be used at run time. The options exported are: --txq=N --rxq=N --txd=N --rxd=N --mbuf-size=N --mbuf-cache-size=N --total-mbuf-count=N Signed-off-by: Wisam Jaddo <wisamm@nvidia.com> Reviewed-by: Alexander Kozyrev <akozyrev@nvidia.com>	2021-10-25 22:31:53 +02:00
Honnappa Nagarahalli	f6c6c686f1	eal: remove FINISHED lcore state FINISHED state seems to be used to indicate that the worker's update of the 'state' is not visible to other threads. There seems to be no requirement to have such a state. Since the FINISHED state is removed, the API rte_eal_wait_lcore is updated to always return the status of the last function that ran in the worker core. Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Reviewed-by: Ola Liljedahl <ola.liljedahl@arm.com> Reviewed-by: Feifei Wang <feifei.wang2@arm.com>	2021-10-25 18:20:59 +02:00
David Marchand	65c2bbf41f	doc: fix default mempool option in guides This option should be prefixed with -- for consistency with others. Fixes: `a103a97e71` ("eal: allow user to override default mempool driver") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Olivier Matz <olivier.matz@6wind.com>	2021-10-25 17:12:52 +02:00
Olivier Matz	daa02b5cdd	mbuf: add namespace to offload flags Fix the mbuf offload flags namespace by adding an RTE_ prefix to the name. The old flags remain usable, but a deprecation warning is issued at compilation. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-24 13:37:43 +02:00
Anoob Joseph	fd390896f4	crypto/cnxk: allow different cores in pending queue Rework pending queue to allow producer and consumer cores to be different. Signed-off-by: Anoob Joseph <anoobj@marvell.com>	2021-10-20 15:56:46 +02:00
Akhil Goyal	92cb130919	cryptodev: move device-specific structures The device specific structures - rte_cryptodev and rte_cryptodev_data are moved to cryptodev_pmd.h to hide it from the applications. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Tested-by: Rebecca Troy <rebecca.troy@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-10-20 15:33:16 +02:00
Kai Ji	418c1a5499	test/crypto: enable chacha_poly PMD An autotest is added for the new chacha20_poly1305 PMD. A new test case is also added for SGL test. Signed-off-by: Kai Ji <kai.ji@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 15:33:16 +02:00
Kai Ji	f166628854	crypto/ipsec_mb: add chacha_poly PMD Add in new chacha20_poly1305 PMD to the ipsec_mb framework. Signed-off-by: Kai Ji <kai.ji@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 15:33:16 +02:00
Piotr Bronowski	cde8df1bda	crypto/ipsec_mb: move zuc PMD This patch removes the crypto/zuc folder and gathers all zuc PMD implementation specific details into two files, pmd_zuc.c and pmd_zuc_priv.h in the crypto/ipsec_mb folder. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 15:32:36 +02:00
Piotr Bronowski	5208d68d30	crypto/ipsec_mb: support snow3g digest appended ops This patch enables out-of-place auth-cipher operations where digest should be encrypted along with the rest of raw data. It also adds support for partially encrypted digest when using auth-cipher operations. Signed-off-by: Damian Nowak <damianx.nowak@intel.com> Signed-off-by: Kai Ji <kai.ji@intel.com> Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	4f1cfda59a	crypto/ipsec_mb: move snow3g PMD This patch removes the crypto/snow3g folder and gathers all snow3g PMD implementation specific details into a single file, pmd_snow3g.c in the crypto/ipsec_mb folder. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	bc9ef81c42	crypto/ipsec_mb: move kasumi PMD This patch removes the crypto/kasumi folder and gathers all kasumi PMD implementation specific details into a single file, pmd_kasumi.c in the crypto/ipsec_mb folder. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	746825e5c0	crypto/ipsec_mb: move aesni_gcm PMD This patch removes the crypto/aesni_gcm folder and gathers all aesni-gcm PMD implementation specific details into a single file, pmd_aesni_gcm.c in the crypto/ipsec_mb folder. A redundant check for iv length is removed. GCM ops are stored in the queue pair for multi process support, they are updated during queue pair setup for both primary and secondary processes. GCM ops are also set per lcore for the CPU crypto mode. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Pablo de Lara	8c835018de	crypto/ipsec_mb: support ZUC-256 for aesni_mb Add support for ZUC-EEA3-256 and ZUC-EIA3-256. Only 4-byte tags supported for now. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	918fd2f146	crypto/ipsec_mb: move aesni_mb PMD This patch removes the crypto/aesni_mb folder and gathers all aesni-mb PMD implementation specific details into a single file, pmd_aesni_mb.c in crypto/ipsec_mb. Now that intel-ipsec-mb v1.0 is the minimum supported version, old macros can be replaced with the newer macros supported by this version. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Ciara Power	72a169278a	crypto/ipsec_mb: support multi-process The ipsec_mb SW PMD now has multiprocess support. The queue-pair IMB_MGR is stored in a memzone instead of being allocated externally by the Intel IPSec MB library, when v1.1 is used. If v1.0 is used, multi process is not supported, and allocation is done as before. The secondary process needs to reconfigure the queue-pair to allow for IMB_MGR function pointers be updated. Intel IPsec MB library version 1.1 is required for this support. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Fan Zhang	c75542ae42	crypto/ipsec_mb: introduce IPsec_mb framework This patch introduces the new framework to share common code between the SW crypto PMDs that depend on the intel-ipsec-mb library. This change helps to reduce future effort on the code maintenance and feature updates. The PMDs that will be added to this framework in subsequent patches are: - AESNI MB - AESNI GCM - CHACHA20_POLY1305 - KASUMI - SNOW3G - ZUC The use of these PMDs will not change, they will still be supported for x86, and will use the same EAL args as before. The minimum required version for the intel-ipsec-mb library is now v1.0. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Ferruh Yigit	295968d174	ethdev: add namespace Add 'RTE_ETH' namespace to all enums & macros in a backward compatible way. The macros for backward compatibility can be removed in next LTS. Also updated some struct names to have 'rte_eth' prefix. All internal components switched to using new names. Syntax fixed on lines that this patch touches. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Tyler Retzlaff <roretzla@linux.microsoft.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Wisam Jaddo <wisamm@nvidia.com> Acked-by: Rosen Xu <rosen.xu@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-22 18:15:38 +02:00
Ferruh Yigit	1aca4fdb00	doc: remove jumbo offload feature Jumbo offload is no more announced as capability, and 'DEV_RX_OFFLOAD_JUMBO_FRAME' offload flag is removed. This patch is also removing 'Jumbo frame' feature from documentation. Fixes: `b563c14212` ("ethdev: remove jumbo offload flag") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-22 17:26:07 +02:00
Chengchang Tang	2fc3e696a7	net/hns3: add runtime config for mailbox limit time Current, the max waiting time for MBX response is 500ms, but in some scenarios, it is not enough. Since it depends on the response of the kernel mode driver, and its response time is related to the scheduling of the system. In this special scenario, most of the cores are isolated, and only a few cores are used for system scheduling. When a large number of services are started, the scheduling of the system will be very busy, and the reply of the mbx message will time out, which will cause our PMD initialization to fail. This patch add a runtime config to set the max wait time. For the above scenes, users can adjust the waiting time to a suitable value by themselves. Fixes: `463e748964` ("net/hns3: support mailbox") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-10-22 04:11:43 +02:00
Xueming Li	5984037501	app/testpmd: add forwarding engine for shared Rx queue To support shared Rx queue, this patch introduces dedicate forwarding engine. The engine groups received packets by mbuf->port into sub-group, updates stream statistics and simply frees packets. Signed-off-by: Xueming Li <xuemingl@nvidia.com>	2021-10-22 00:09:19 +02:00
Xueming Li	f4d178c13b	app/testpmd: add parameter for shared Rx queue Adds "--rxq-share=X" parameter to enable shared RxQ. Rx queue is shared if device supports, otherwise fallback to standard RxQ. Shared Rx queues are grouped per X ports. X defaults to UINT32_MAX, implies all ports join share group 1. Queue ID is mapped equally with shared Rx queue ID. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-22 00:09:07 +02:00
Xueming Li	dd22740cc2	ethdev: introduce shared Rx queue In current DPDK framework, each Rx queue is pre-loaded with mbufs to save incoming packets. For some PMDs, when number of representors scale out in a switch domain, the memory consumption became significant. Polling all ports also leads to high cache miss, high latency and low throughput. This patch introduces shared Rx queue. Ports in same Rx domain and switch domain could share Rx queue set by specifying non-zero sharing group in Rx queue configuration. Shared Rx queue is identified by share_rxq field of Rx queue configuration. Port A RxQ X can share RxQ with Port B RxQ Y by using same shared Rx queue ID. No special API is defined to receive packets from shared Rx queue. Polling any member port of a shared Rx queue receives packets of that queue for all member ports, port_id is identified by mbuf->port. PMD is responsible to resolve shared Rx queue from device and queue data. Shared Rx queue must be polled in same thread or core, polling a queue ID of any member port is essentially same. Multiple share groups are supported. PMD should support mixed configuration by allowing multiple share groups and non-shared Rx queue on one port. Example grouping and polling model to reflect service priority: Group1, 2 shared Rx queues per port: PF, rep0, rep1 Group2, 1 shared Rx queue per port: rep2, rep3, ... rep127 Core0: poll PF queue0 Core1: poll PF queue1 Core2: poll rep2 queue0 PMD advertise shared Rx queue capability via RTE_ETH_DEV_CAPA_RXQ_SHARE. PMD is responsible for shared Rx queue consistency checks to avoid member port's configuration contradict each other. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-22 00:08:50 +02:00
Satheesh Paul	00ea15e7a3	net/cnxk: support port ID flow action This patch adds support for rte flow action type port_id to enable directing packets from an input port PF to an output port which is a VF of the input port PF. Signed-off-by: Satheesh Paul <psatheesh@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 18:59:40 +02:00
William Tu	d1c7029a52	net/e1000: build on Windows This patch enables building the e1000 driver for Windows. I tested using two Windows VM on top of VMware Fusion, creating two e1000 devices with device ID 0x10D3 (8274L), verifying rx/tx works correctly using dpdk-testpmd.exe rxonly and txonly mode. Signed-off-by: William Tu <u9012063@gmail.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Pallavi Kadam <pallavi.kadam@intel.com> Tested-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Pallavi Kadam <pallavi.kadam@intel.com>	2021-10-21 04:58:40 +02:00
Rongwei Liu	a89f6433aa	net/mlx5: set Tx queue affinity in round-robin Previously, we set txq affinity to 0 and let firmware to perform round-robin when bonding. Firmware uses a global counter to assign txq affinity to different physical ports accord to remainder after division. There are three dis-advantages: 1. The global counter is shared between kernel and dpdk. 2. After restarting pmd or port, the previous counter value is reused, so the new affinity is unpredictable. 3. There is no way to get what affinity is set by firmware. In this update, we will create several TISs up to the number of bonding ports and bind each TIS to one PF port. For each port, it will start to pick up TIS using its port index. Upper layer application can quickly calculate each txq's affinity without querying. At DPDK layer, when creating txq with 2 bonding ports, the affinity is set like: port 0: 1-->2-->1-->2 port 1: 2-->1-->2-->1 port 2: 1-->2-->1-->2 Note: Only applicable to DevX api. This affinity subjects to HW hash. Signed-off-by: Rongwei Liu <rongweil@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 12:37:00 +02:00
Jie Wang	748530f035	app/testpmd: support L2TPv2 and PPP protocol pattern Add support for test-pmd to parse protocol pattern L2TPv2 and PPP. Signed-off-by: Wenjun Wu <wenjun1.wu@intel.com> Signed-off-by: Jie Wang <jie1x.wang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-21 14:15:59 +02:00
Jie Wang	f30157d988	net/iavf: support PPPoL2TPv2oUDP RSS Hash Add support for PPP over L2TPv2 over UDP protocol RSS Hash based on inner IP src/dst address and TCP/UDP src/dst port. Patterns are listed below: eth/ipv4(6)/udp/l2tpv2/ppp/ipv4(6) eth/ipv4(6)/udp/l2tpv2/ppp/ipv4(6)/udp eth/ipv4(6)/udp/l2tpv2/ppp/ipv4(6)/tcp Signed-off-by: Wenjun Wu <wenjun1.wu@intel.com> Signed-off-by: Jie Wang <jie1x.wang@intel.com> Acked-by: Beilei Xing <beilei.xing@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-21 14:15:59 +02:00
Jie Wang	3a929df1f2	ethdev: support L2TPv2 and PPP procotol Added flow pattern items and header formats of L2TPv2 and PPP. Signed-off-by: Wenjun Wu <wenjun1.wu@intel.com> Signed-off-by: Jie Wang <jie1x.wang@intel.com> Acked-by: Ori Kam <orika@nvidia.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-21 14:15:59 +02:00
Zhihong Peng	6ad06203a5	cmdline: free on exit Malloc cl in the cmdline_stdin_new function, so release in the cmdline_stdin_exit function is logical, so that cl will not be released alone. Fixes: `af75078fec` ("first public release") Signed-off-by: Zhihong Peng <zhihongx.peng@intel.com> Reviewed-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Tested-by: Zhihong Peng <zhihongx.peng@intel.com>	2021-10-22 23:32:00 +02:00
Dmitry Kozlyuk	f8f8dc2890	cmdline: make struct rdline opaque Hide struct rdline definition and some RDLINE_* constants in order to be able to change internal buffer sizes transparently to the user. Add new functions: * rdline_new(): allocate and initialize struct rdline. This function replaces rdline_init() and takes an extra parameter: opaque user data for the callbacks. * rdline_free(): deallocate struct rdline. * rdline_get_history_buffer_size(): for use in tests. * rdline_get_opaque(): to obtain user data in callback functions. Remove rdline_init() function from library headers and export list, because using it requires the knowledge of sizeof(struct rdline). Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Narcisa Vasile <navasile@linux.microsoft.com>	2021-10-22 23:23:45 +02:00
Dmitry Kozlyuk	f43809d28c	cmdline: make struct cmdline opaque Remove the definition of `struct cmdline` from public header. Deprecation notice: https://mails.dpdk.org/archives/dev/2020-September/183310.html Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Narcisa Vasile <navasile@linux.microsoft.com>	2021-10-22 22:44:18 +02:00
Conor Walsh	a4d11e0386	raw/ioat: deprecate rawdev driver Deprecate the rawdev IOAT driver as both IOAT and IDXD drivers have moved to dmadev. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Acked-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2021-10-22 22:40:59 +02:00
Conor Walsh	9f46de12c4	dma/ioat: add data path completion Add the data path functions for gathering completed operations from IOAT devices. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	3d4b027324	dma/ioat: add data path job submission Add data path functions for enqueuing and submitting operations to IOAT devices. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	583f046dd4	dma/ioat: add start and stop Add start, stop and recover functions for IOAT devices. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	a19a2dceda	dma/ioat: add configuration Add functions for device configuration. The info_get and close functions are included here also. info_get can be useful for checking successful configuration and close is used by the dmadev api when releasing a configured device. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	866e46bcd8	dma/ioat: add device probing and removal Add the basic device probe/remove skeleton code and initial documentation for new IOAT DMA driver. Maintainers update is also included in this patch. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	280c3ca02c	dma/idxd: add operation statistic tracking Add statistic tracking for DSA devices. The dmadev library documentation is also updated to add a generic section for using the library's statistics APIs. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	97aeed5637	dma/idxd: add data path job completion Add the data path functions for gathering completed operations. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	3d36a0a1c7	dma/idxd: add data path job submission Add data path functions for enqueuing and submitting operations to DSA devices. Documentation updates are included for dmadev library and IDXD driver docs as appropriate. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	a42ac7e31e	dma/idxd: add start and stop for PCI devices Add device start/stop functions for DSA devices bound to vfio. For devices bound to the IDXD kernel driver, these are not required since the IDXD kernel driver takes care of this. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	2f7d42c6e1	dma/idxd: add configure and info Add functions for device configuration. The info_get function is included here since it can be useful for checking successful configuration. Documentation is also updated to add device configuration usage info. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	e888bb1278	dma/idxd: add bus device probing Add the basic device probing for DSA devices bound to the IDXD kernel driver. These devices can be configured via sysfs and made available to DPDK if they are found during bus scan. Relevant documentation is included. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:58 +02:00
Kevin Laatz	e33ad06eae	dma/idxd: add skeleton for VFIO based DSA device Add the basic device probe/remove skeleton code for DSA device bound to the vfio pci driver. Relevant documentation and MAINTAINERS update also included. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:58 +02:00
Bruce Richardson	65bcddfc66	raw/ioat: build only if dmadev not present Only build the rawdev IDXD/IOAT drivers if the dmadev drivers are not present. This change requires the dependencies to be reordered in drivers/meson.build so that rawdev can use the "RTE_DMA_* build macros to check for the presence of the equivalent dmadev driver. A note is also added to the documentation to inform users of this change. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:58 +02:00
Stephen Hemminger	cbb44143be	app/dumpcap: add new packet capture application This is a new packet capture application to replace existing pdump. The new application works like Wireshark dumpcap program and supports the pdump API features. It is not complete yet some features such as filtering are not implemented. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-10-22 22:40:58 +02:00
Stephen Hemminger	10f726efe2	pdump: support pcapng and filtering This enhances the DPDK pdump library to support new pcapng format and filtering via BPF. The internal client/server protocol is changed to support two versions: the original pdump basic version and a new pcapng version. The internal version number (not part of exposed API or ABI) is intentionally increased to cause any attempt to try mismatched primary/secondary process to fail. Add new API to do allow filtering of captured packets with DPDK BPF (eBPF) filter program. It keeps statistics on packets captured, filtered, and missed (because ring was full). Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Reshma Pattan <reshma.pattan@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-10-22 22:07:48 +02:00
Stephen Hemminger	8d23ce8f5e	pcapng: add new library for writing pcapng files This is utility library for writing pcapng format files used by Wireshark family of utilities. Older tcpdump also knows how to read (but not write) this format. See https://github.com/pcapng/pcapng/ Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Reshma Pattan <reshma.pattan@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-10-22 17:19:07 +02:00
Shijith Thotton	796b07e9c6	examples/l2fwd-event: support event vector Added changes to receive packets as event vector. By default this is disabled and can be enabled using the option --event-vector. Vector size and timeout to form the vector can be configured using options --event-vector-size and --event-vector-tmo. Example: dpdk-l2fwd-event -l 0-3 -n 4 -- -p 0x03 --mode=eventdev \ --eventq-sched=ordered --event-vector --event-vector-size 16 Signed-off-by: Shijith Thotton <sthotton@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:16:00 +02:00
Shijith Thotton	e8adca1951	examples/l3fwd: support event vector Added changes to receive packets as event vector. By default this is disabled and can be enabled using the option --event-vector. Vector size and timeout to form the vector can be configured using options --event-vector-size and --event-vector-tmo. Example: dpdk-l3fwd -l 0-3 -n 4 -- -p 0x03 --mode=eventdev \ --eventq-sched=ordered --event-vector --event-vector-size 16 Signed-off-by: Shijith Thotton <sthotton@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:16:00 +02:00
Rashmi Shetty	20841a2551	app/eventdev: support burst enqueue Introduce a new command line option prod_enq_burst_sz to set burst size for eventdev enqueue at producer in perf_queue test. The newly added function perf_producer_burst is called when prod_enq_burst_sz is greater than 1. Signed-off-by: Rashmi Shetty <rashmi.shetty@intel.com> Acked-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:16:00 +02:00
Pavan Nikhilesh	f3f3a91788	eventdev/timer: move adapters memory to hugepage Move memory used by timer adapters to hugepage. Allocate memory on the first adapter create or lookup to address both primary and secondary process usecases. This will prevent TLB misses if any and aligns to memory structure of other subsystems. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com>	2021-10-21 10:16:00 +02:00
Pavan Nikhilesh	1dcd67ba1e	eventdev/timer: rearrange struct fields Rearrange fields in rte_event_timer data structure to remove holes. Also, remove use of volatile from rte_event_timer. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com>	2021-10-21 10:14:50 +02:00
Pavan Nikhilesh	d35e61322d	eventdev: move inline APIs into separate structure Move fastpath inline function pointers from rte_eventdev into a separate structure accessed via a flat array. The intention is to make rte_eventdev and related structures private to avoid future API/ABI breakages.` Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-10-21 10:14:50 +02:00
Ganapati Kundapura	814d017093	eventdev/eth_rx: support telemetry Added telemetry callbacks to get Rx adapter stats, reset stats and to get Rx queue config information. Signed-off-by: Ganapati Kundapura <ganapati.kundapura@intel.com> Acked-by: Jay Jayatheerthan <jay.jayatheerthan@intel.com> Acked-by: Naga Harish K S V <s.v.naga.harish.k@intel.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:14:50 +02:00
Naga Harish K S V	b06bca69b7	eventdev/eth_rx: add per-queue event buffer Added per queue buffer. To configure per queue event buffer size, application sets rte_event_eth_rx_adapter_params::use_queue_event_buf flag as true while using rte_event_eth_rx_adapter_create_with_params(). The per queue event buffer size is populated in rte_event_eth_rx_adapter_queue_conf::event_buf_size and passed to rte_event_eth_rx_adapter_queue_add(). Signed-off-by: Naga Harish K S V <s.v.naga.harish.k@intel.com> Acked-by: Jay Jayatheerthan <jay.jayatheerthan@intel.com>	2021-10-21 10:14:50 +02:00
Naga Harish K S V	bc0df25c83	eventdev/eth_rx: add event buffer size configurability Currently event buffer is static array with a default size defined internally. To configure event buffer size from application, rte_event_eth_rx_adapter_create_with_params() API is added which takes struct rte_event_eth_rx_adapter_params to configure event buffer size in addition other params. The event buffer size is rounded up for better buffer utilization and performance. In case of NULL params argument, default event buffer size is used. Signed-off-by: Naga Harish K S V <s.v.naga.harish.k@intel.com> Signed-off-by: Ganapati Kundapura <ganapati.kundapura@intel.com> Acked-by: Jay Jayatheerthan <jay.jayatheerthan@intel.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:14:50 +02:00
Ganapati Kundapura	da781e6488	eventdev/eth_rx: support Rx queue config get Added rte_event_eth_rx_adapter_queue_conf_get() API to get rx queue information - event queue identifier, flags for handling received packets, scheduler type, event priority, polling frequency of the receive queue and flow identifier in rte_event_eth_rx_adapter_queue_conf structure Signed-off-by: Ganapati Kundapura <ganapati.kundapura@intel.com> Acked-by: Jay Jayatheerthan <jay.jayatheerthan@intel.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:14:50 +02:00
Pavan Nikhilesh	929ebdd543	eventdev/eth_rx: simplify event vector config Include vector configuration into the structure ``rte_event_eth_rx_adapter_queue_conf`` that is used to configure Rx adapter ethernet device Rx queue parameters. This simplifies event vector configuration as it avoids splitting configuration per Rx queue. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jay Jayatheerthan <jay.jayatheerthan@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:14:50 +02:00
David Marchand	1752b08781	test: rely on EAL detection for core list Cores count has a direct impact on the time needed to complete unit tests. Currently, the core list used for unit test is enforced to "all cores on the system" with no way for (CI) users to adapt it. On the other hand, EAL default behavior (when no -c/-l option gets passed) is to start threads on as many cores available in the process cpu affinity. Remove logic from meson: users can then select where to run the tests by either running meson with a custom cpu affinity (using taskset/cpuset depending on OS) or by passing a --test-args option to meson. Example: $ sudo meson test -C build --suite fast-tests -t 3 --test-args "-l 0-3" Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Aaron Conole <aconole@redhat.com>	2021-10-21 17:48:04 +02:00
Gregory Etelson	59f3a8acbc	app/testpmd: add flex item commands Network port hardware is shipped with fixed number of supported network protocols. If application must work with a protocol that is not included in the port hardware by default, it can try to add the new protocol to port hardware. Flex item or flex parser is port infrastructure that allows application to add support for a custom network header and offload flows to match the header elements. Application must complete the following tasks to create a flow rule that matches custom header: 1. Create flow item object in port hardware. Application must provide custom header configuration to PMD. PMD will use that configuration to create flex item object in port hardware. 2. Create flex patterns to match. Flex pattern has a spec and a mask components, like a regular flow item. Combined together, spec and mask can target unique data sequence or a number of data sequences in the custom header. Flex patterns of the same flex item can have different lengths. Flex pattern is identified by unique handler value. 3. Create a flow rule with a flex flow item that references flow pattern. Testpmd flex CLI commands are: testpmd> flow flex_item create <port> <flex_id> <filename> testpmd> set flex_pattern <pattern_id> \ spec <spec data> mask <mask data> testpmd> set flex_pattern <pattern_id> is <spec_data> testpmd> flow create <port> ... \ / flex item is <flex_id> pattern is <pattern_id> / ... The patch works with the jansson library API. A new optional dependency on jansson library is added for testpmd. If jansson not detected the flex item functionality is disabled. Jansson development files must be present: jansson.pc, jansson.h libjansson.[a,so] Signed-off-by: Gregory Etelson <getelson@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-20 19:00:26 +02:00
Viacheslav Ovsiienko	dc4d860e8a	ethdev: introduce configurable flexible item 1. Introduction and Retrospective Nowadays the networks are evolving fast and wide, the network structures are getting more and more complicated, the new application areas are emerging. To address these challenges the new network protocols are continuously being developed, considered by technical communities, adopted by industry and, eventually implemented in hardware and software. The DPDK framework follows the common trends and if we bother to glance at the RTE Flow API header we see the multiple new items were introduced during the last years since the initial release. The new protocol adoption and implementation process is not straightforward and takes time, the new protocol passes development, consideration, adoption, and implementation phases. The industry tries to mitigate and address the forthcoming network protocols, for example, many hardware vendors are implementing flexible and configurable network protocol parsers. As DPDK developers, could we anticipate the near future in the same fashion and introduce the similar flexibility in RTE Flow API? Let's check what we already have merged in our project, and we see the nice raw item (rte_flow_item_raw). At the first glance, it looks superior and we can try to implement a flow matching on the header of some relatively new tunnel protocol, say on the GENEVE header with variable length options. And, under further consideration, we run into the raw item limitations: - only fixed size network header can be represented - the entire network header pattern of fixed format (header field offsets are fixed) must be provided - the search for patterns is not robust (the wrong matches might be triggered), and actually is not supported by existing PMDs - no explicitly specified relations with preceding and following items - no tunnel hint support As the result, implementing the support for tunnel protocols like aforementioned GENEVE with variable extra protocol option with flow raw item becomes very complicated and would require multiple flows and multiple raw items chained in the same flow (by the way, there is no support found for chained raw items in implemented drivers). This RFC introduces the dedicated flex item (rte_flow_item_flex) to handle matches with existing and new network protocol headers in a unified fashion. 2. Flex Item Life Cycle Let's assume there are the requirements to support the new network protocol with RTE Flows. What is given within protocol specification: - header format - header length, (can be variable, depending on options) - potential presence of extra options following or included in the header the header - the relations with preceding protocols. For example, the GENEVE follows UDP, eCPRI can follow either UDP or L2 header - the relations with following protocols. For example, the next layer after tunnel header can be L2 or L3 - whether the new protocol is a tunnel and the header is a splitting point between outer and inner layers The supposed way to operate with flex item: - application defines the header structures according to protocol specification - application calls rte_flow_flex_item_create() with desired configuration according to the protocol specification, it creates the flex item object over specified ethernet device and prepares PMD and underlying hardware to handle flex item. On item creation call PMD backing the specified ethernet device returns the opaque handle identifying the object has been created - application uses the rte_flow_item_flex with obtained handle in the flows, the values/masks to match with fields in the header are specified in the flex item per flow as for regular items (except that pattern buffer combines all fields) - flows with flex items match with packets in a regular fashion, the values and masks for the new protocol header match are taken from the flex items in the flows - application destroys flows with flex items - application calls rte_flow_flex_item_release() as part of ethernet device API and destroys the flex item object in PMD and releases the engaged hardware resources 3. Flex Item Structure The flex item structure is intended to be used as part of the flow pattern like regular RTE flow items and provides the mask and value to match with fields of the protocol item was configured for. struct rte_flow_item_flex { void handle; uint32_t length; const uint8_t pattern; }; The handle is some opaque object maintained on per device basis by underlying driver. The protocol header fields are considered as bit fields, all offsets and widths are expressed in bits. The pattern is the buffer containing the bit concatenation of all the fields presented at item configuration time, in the same order and same amount. If byte boundary alignment is needed an application can use a dummy type field, this is just some kind of gap filler. The length field specifies the pattern buffer length in bytes and is needed to allow rte_flow_copy() operations. The approach of multiple pattern pointers and lengths (per field) was considered and found clumsy - it seems to be much suitable for the application to maintain the single structure within the single pattern buffer. 4. Flex Item Configuration The flex item configuration consists of the following parts: - header field descriptors: - next header - next protocol - sample to match - input link descriptors - output link descriptors The field descriptors tell the driver and hardware what data should be extracted from the packet and then control the packet handling in the flow engine. Besides this, sample fields can be presented to match with patterns in the flows. Each field is a bit pattern. It has width, offset from the header beginning, mode of offset calculation, and offset related parameters. The next header field is special, no data are actually taken from the packet, but its offset is used as a pointer to the next header in the packet, in other words the next header offset specifies the size of the header being parsed by flex item. There is one more special field - next protocol, it specifies where the next protocol identifier is contained and packet data sampled from this field will be used to determine the next protocol header type to continue packet parsing. The next protocol field is like eth_type field in MAC2, or proto field in IPv4/v6 headers. The sample fields are used to represent the data be sampled from the packet and then matched with established flows. There are several methods supposed to calculate field offset in runtime depending on configuration and packet content: - FIELD_MODE_FIXED - fixed offset. The bit offset from header beginning is permanent and defined by field_base configuration parameter. - FIELD_MODE_OFFSET - the field bit offset is extracted from other header field (indirect offset field). The resulting field offset to match is calculated from as: field_base + (offset_base & offset_mask) << offset_shift This mode is useful to sample some extra options following the main header with field containing main header length. Also, this mode can be used to calculate offset to the next protocol header, for example - IPv4 header contains the 4-bit field with IPv4 header length expressed in dwords. One more example - this mode would allow us to skip GENEVE header variable length options. - FIELD_MODE_BITMASK - the field bit offset is extracted from other header field (indirect offset field), the latter is considered as bitmask containing some number of one bits, the resulting field offset to match is calculated as: field_base + bitcount(offset_base & offset_mask) << offset_shift This mode would be useful to skip the GTP header and its extra options with specified flags. - FIELD_MODE_DUMMY - dummy field, optionally used for byte boundary alignment in pattern. Pattern mask and data are ignored in the match. All configuration parameters besides field size and offset are ignored. Note: "" - means the indirect field offset is calculated and actual data are extracted from the packet by this offset (like data are fetched by pointer p from memory). The offset mode list can be extended by vendors according to hardware supported options. The input link configuration section tells the driver after what protocols and at what conditions the flex item can follow. Input link specified the preceding header pattern, for example for GENEVE it can be UDP item specifying match on destination port with value 6081. The flex item can follow multiple header types and multiple input links should be specified. At flow creation time the item with one of the input link types should precede the flex item and driver will select the correct flex item settings, depending on the actual flow pattern. The output link configuration section tells the driver how to continue packet parsing after the flex item protocol. If multiple protocols can follow the flex item header the flex item should contain the field with the next protocol identifier and the parsing will be continued depending on the data contained in this field in the actual packet. The flex item fields can participate in RSS hash calculation, the dedicated flag is present in the field description to specify what fields should be provided for hashing. 5. Flex Item Chaining If there are multiple protocols supposed to be supported with flex items in chained fashion - two or more flex items within the same flow and these ones might be neighbors in the pattern, it means the flex items are mutual referencing. In this case, the item that occurred first should be created with empty output link list or with the list including existing items, and then the second flex item should be created referencing the first flex item as input arc, drivers should adjust the item configuration. Also, the hardware resources used by flex items to handle the packet can be limited. If there are multiple flex items that are supposed to be used within the same flow it would be nice to provide some hint for the driver that these two or more flex items are intended for simultaneous usage. The fields of items should be assigned with hint indices and these indices from two or more flex items supposed to be provided within the same flow should be the same as well. In other words, the field hint index specifies the group of fields that can be matched simultaneously within a single flow. If hint indices are specified, the driver will try to engage not overlapping hardware resources and provide independent handling of the field groups with unique indices. If the hint index is zero the driver assigns resources on its own. 6. Example of New Protocol Handling Let's suppose we have the requirements to handle the new tunnel protocol that follows UDP header with destination port 0xFADE and is followed by MAC header. Let the new protocol header format be like this: struct new_protocol_header { rte_be32 header_length; /* length in dwords, including options / rte_be32 specific0; / some protocol data, no intention / rte_be32 specific1; / to match in flows on these fields / rte_be32 crucial; / data of interest, match is needed / rte_be32 options[0]; / optional protocol data, variable length / }; The supposed flex item configuration: struct rte_flow_item_flex_field field0 = { .field_mode = FIELD_MODE_DUMMY, / Affects match pattern only / .field_size = 96, / three dwords from the beginning / }; struct rte_flow_item_flex_field field1 = { .field_mode = FIELD_MODE_FIXED, .field_size = 32, / Field size is one dword / .field_base = 96, / Skip three dwords from the beginning / }; struct rte_flow_item_udp spec0 = { .hdr = { .dst_port = RTE_BE16(0xFADE), } }; struct rte_flow_item_udp mask0 = { .hdr = { .dst_port = RTE_BE16(0xFFFF), } }; struct rte_flow_item_flex_link link0 = { .item = { .type = RTE_FLOW_ITEM_TYPE_UDP, .spec = &spec0, .mask = &mask0, }; struct rte_flow_item_flex_conf conf = { .next_header = { .tunnel = FLEX_TUNNEL_MODE_SINGLE, .field_mode = FIELD_MODE_OFFSET, .field_base = 0, .offset_base = 0, .offset_mask = 0xFFFFFFFF, .offset_shift = 2 / Expressed in dwords, shift left by 2 */ }, .sample = { &field0, &field1, }, .nb_samples = 2, .input_link[0] = &link0, .nb_inputs = 1 }; Let's suppose we have created the flex item successfully, and PMD returned the handle 0x123456789A. We can use the following item pattern to match the crucial field in the packet with value 0x00112233: struct new_protocol_header spec_pattern = { .crucial = RTE_BE32(0x00112233), }; struct new_protocol_header mask_pattern = { .crucial = RTE_BE32(0xFFFFFFFF), }; struct rte_flow_item_flex spec_flex = { .handle = 0x123456789A .length = sizeiof(struct new_protocol_header), .pattern = &spec_pattern, }; struct rte_flow_item_flex mask_flex = { .length = sizeof(struct new_protocol_header), .pattern = &mask_pattern, }; struct rte_flow_item item_to_match = { .type = RTE_FLOW_ITEM_TYPE_FLEX, .spec = &spec_flex, .mask = &mask_flex, }; Signed-off-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2021-10-20 18:58:54 +02:00

1 2 3 4 5 ...

5168 Commits