numam-dpdk

Author	SHA1	Message	Date
Elena Agostini	8b8036a66e	gpudev: introduce GPU device class library In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. The new library gpudev is for dealing with GPGPU computing devices from a DPDK application running on the CPU. The infrastructure is prepared to welcome drivers in drivers/gpu/. Signed-off-by: Elena Agostini <eagostini@nvidia.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-11-08 17:20:52 +01:00
Anatoly Burakov	4fd15c6af0	vfio: set errno on unsupported OS Currently, when code is running on FreeBSD or Windows, there is no way to distinguish between a geniune error and a "VFIO is unsupported" error. Fix the dummy implementations to also set the rte_errno flag. Fixes: `279b581c89` ("vfio: expose functions") Fixes: `c564a2a200` ("vfio: expose clear group function for internal usages") Fixes: `964b2f3bfb` ("vfio: export some internal functions") Fixes: `ea2dc10668` ("vfio: add multi container support") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-11-08 16:45:28 +01:00
Anatoly Burakov	da6e4cdca1	vfio: fix FreeBSD documentation On FreeBSD, `rte_vfio_is_enabled()` and `rte_vfio_noiommu_is_enabled()` API calls will not return error, and will instead return 0. This is intentional, because the caller of this API does not care whether VFIO is supported at all, and will instead be interested in whether VFIO is enabled or not. However, the doxygen comments for these functions state that they will return an error on FreeBSD, which is incorrect. Fix the doxygen comment to call out the fact that these functions are only relevant on Linux, but remove the reference to returning errors. Fixes: `279b581c89` ("vfio: expose functions") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-08 16:42:55 +01:00
Anatoly Burakov	bf8b792f3b	vfio: fix FreeBSD clear group stub On FreeBSD, `rte_vfio_clear_group()` was returning 0 even though this function is not valid for FreeBSD, and is called out to return error in doxygen comments. Fix the return value to match documentation. Fixes: `c564a2a200` ("vfio: expose clear group function for internal usages") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2021-11-08 16:42:44 +01:00
Anatoly Burakov	84e03bde1c	vfio: drop fallback Linux implementation Currently, VFIO support for Linux is compiled unconditionally, and supported kernel versions start with 4.4, so VFIO is assumed to always be enabled. There is no way of disabling VFIO support at compile time anyway, so just drop the "VFIO not available" fallback code altogether. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-08 16:27:15 +01:00
Chengwen Feng	b1f4933ef3	kni: check error code of allmulticast mode switch Some drivers may return errcode when switch allmulticast mode, so it's necessary to check the return code. Fixes: `b34801d1aa` ("kni: support allmulticast mode set") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-11-08 11:56:13 +01:00
Ferruh Yigit	e6cbfd9bf3	kni: update kernel API to set random MAC address Previously used 'random_ether_addr()' API is removed in upstream kernel with commit Commit ba530fea8ca1 ("ethernet: remove random_ether_addr()") Replacement API 'eth_random_addr()' is around since v3.6 [1], so simply switching to this API without any version checks. [1] 0a4dd594982a ("etherdevice: Rename random_ether_addr to eth_random_addr") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-11-08 11:51:39 +01:00
Satheesh Paul	fda3750de6	app/flow-perf: add random priority option Added support to create flows with priority attribute set randomly between 0 and a user supplied maximum value. This is useful to measure performance on NICs which may have to rearrange flows to honor flow priority. Removed the lower limit of 100000 flows per batch. Signed-off-by: Satheesh Paul <psatheesh@marvell.com> Acked-by: Wisam Jaddo <wisamm@nvidia.com>	2021-11-08 10:33:08 +01:00
Raja Zidane	f66898ebd0	common/mlx5: fix MMO configuration in DevX queue pair The QP extension valid bit was not set in the QP creation for MMO configuration. That caused the QP not to be connected to the GGA MMO engines, and any MMO WQE job got CQE with an error. Set the QP ext bit when MMO is configured. Fixes: `ddda000618` ("common/mlx5: add MMO configuration for DevX queue pair") Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 09:36:27 +01:00
Raja Zidane	4b99fe0577	common/mlx5: fix HCA capabilities PRM alignment 0x20 reserved bytes were missed in the HCA cap PRM structure before the newly added fields for MMO QP capabilities. That caused reading MMO QP caps incorrectly. Add the reserved fields in the HCA cap structure. Fixes: `cbc4c13a25` ("common/mlx5: update MMO HCA capabilities") Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 09:29:36 +01:00
Juraj Linkeš	d0152e1f1a	config/arm: split aarch32 options Aarch32 config got overlooked when splitting march in a previous patch. Fixes: `95e0f23022` ("config/arm: split -march into arch and features") Signed-off-by: Juraj Linkeš <juraj.linkes@pantheon.tech> Acked-by: Ruifeng Wang <ruifeng.wang@arm.com>	2021-11-08 09:17:06 +01:00
Radha Mohan Chintakuntla	ebc539271e	dma/cnxk: add statistics Add the stats function to get the DMA statistics. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	3340c3e227	dma/cnxk: add scatter-gather copy Add the copy_sg function that will do the multiple DMA transfers of different sizes and different source/destination as well. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	b56f1e2dad	dma/cnxk: add channel operations Add functions for the dmadev vchan setup and DMA operations. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	53f6d7328b	dma/cnxk: create and initialize device on PCI probing This patch creates and initializes a dmadev device on pci probe. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	b6e395692b	common/cnxk: add DPI DMA support Add base support as ROC(Rest of Chip) API which will be used by PMD dmadev driver. This patch adds routines to init, fini, configure the DPI DMA device found in Marvell's CN9k or CN10k SoC families. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-07 23:29:58 +01:00
Chengwen Feng	fb73eb2fb1	usertools/devbind: add Kunpeng DMA Add Kunpeng DMA device ID to dmadev category. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:04:28 +01:00
Chengwen Feng	569e850b4b	dma/hisilicon: support multi-process This patch add multi-process support for Kunpeng DMA devices. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:02:27 +01:00
Chengwen Feng	2db4f0b823	dma/hisilicon: add data path This patch add data path functions for Kunpeng DMA devices. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:02:27 +01:00
Chengwen Feng	3c5f5f03a0	dma/hisilicon: add control path This patch add control path functions for Kunpeng DMA devices. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:02:24 +01:00
Chengwen Feng	9e16317a38	dma/hisilicon: add probing This patch add dmadev instances create during the PCI probe, and destroy them during the PCI remove. Internal structures and HW definitions was also included. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:01:52 +01:00
Chengwen Feng	4d0d4cf327	dma/hisilicon: introduce driver skeleton Add the basic device probe and remove functions and initial documentation for new hisilicon DMA drivers. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 19:54:19 +01:00
Ali Alnubani	948be25cdb	sched: fix debug build Compare pkt_len to 0 instead of NULL to avoid the following build failure with debug mode enabled: ../lib/sched/rte_pie.h: In function 'rte_pie_enqueue_empty': ../lib/sched/rte_pie.h:125:21: error: comparison between pointer and integer [-Werror] RTE_ASSERT(pkt_len != NULL); Bugzilla ID: 878 Fixes: `44c730b0e3` ("sched: add PIE based congestion management") Signed-off-by: Ali Alnubani <alialnu@nvidia.com>	2021-11-07 18:52:51 +01:00
David Marchand	0eb62bf295	app: fix external dependency linking ext_deps was not used in app/meson.build so testpmd dependency on jansson was ignored. testpmd currently can be linked because metrics library is pulling the dependency on libjansson. Fixes: `59f3a8acbc` ("app/testpmd: add flex item commands") Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Gregory Etelson <getelson@nvidia.com>	2021-11-07 17:16:36 +01:00
Michael Baum	5dfa003db5	common/mlx5: fix post doorbell barrier The rdma-core library can map doorbell register in two ways, depending on the environment variable "MLX5_SHUT_UP_BF": - as regular cached memory, the variable is either missing or set to zero. This type of mapping may cause the significant doorbell register writing latency and requires an explicit memory write barrier to mitigate this issue and prevent write combining. - as non-cached memory, the variable is present and set to not "0" value. This type of mapping may cause performance impact under heavy loading conditions but the explicit write memory barrier is not required and it may improve core performance. The UAR creation function maps a doorbell in one of the above ways according to the system. In run time, it always adds an explicit memory barrier after writing to. In cases where the doorbell was mapped as non-cached memory, the explicit memory barrier is unnecessary and may impair performance. The commit [1] solved this problem for a Tx queue. In run time, it checks the mapping type and provides the memory barrier after writing to a Tx doorbell register if it is needed. The mapping type is extracted directly from the uar_mmap_offset field in the queue properties. This patch shares this code between the drivers and extends the above solution for each of them. [1] commit `8409a28573` ("net/mlx5: control transmit doorbell register mapping") Fixes: `f8c97babc9` ("compress/mlx5: add data-path functions") Fixes: `8e196c08ab` ("crypto/mlx5: support enqueue/dequeue operations") Fixes: `4d4e245ad6` ("regex/mlx5: support enqueue") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	b6e9c33c82	net/mlx5: remove duplicated reference of Tx doorbell The Tx doorbell has different virtual addresses per process. The secondary process takes the UAR physical page ID of the primary and mmap it to its own virtual address. The primary doorbell references were saved in two shared memory locations: the TxQ structure and a dedicated doorbell array. Remove the doorbell reference from the TxQ structure and move the primary processes to take the UAR information from the primary doorbell array. Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	b4371d3d56	common/mlx5: fix doorbell mapping configuration UAR mapping type can be affected by the devarg tx_db_nc, which can cause setting the environment variable MLX5_SHUT_UP_BF. So, the MLX5_SHUT_UP_BF value and the UAR mapping parameter affect the UAR cache mode. Wrongly, the devarg was considered for the MLX5_SHUT_UP_BF but not for the UAR mapping parameter in all the drivers except the net. Take the tx_db_nc devarg into account for all the drivers. Fixes: `ca1418ce39` ("common/mlx5: share device context object") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	3f0e54fe00	common/mlx5: fix UAR allocation diagnostics messages Depending on kernel capabilities and rdma-core version the mapping of UAR (User Access Region) of desired memory caching type (non-cached or write combining) might fail. The PMD implements the flexible strategy of UAR mapping, alternating the type of caching to succeed. During this process the failure diagnostics messages are emitted. These messages are merely diagnostics ones and the logging level should be adjusted to DEBUG. Fixes: `9cc0e99c81` ("common/mlx5: share UAR allocation routine") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	d1325200ac	common/mlx5: remove unreachable branch in UAR allocation The User Access Region (UAR) provides access to the hardware resources like Doorbell Register from userspace. It means the resources should be mapped by the kernel to some virtual address range. There two types of memory mapping are supported by mlx5 kernel driver: MLX5DV_UAR_ALLOC_TYPE_NC - non-cached, all writes promoted directly to hardware. MLX5DV_UAR_ALLOC_TYPE_BF - "BlueFlame", all writes might be cached by CPU, and will be flushed to hardware explicitly with memory barriers. The supported mapping types depend on the platform (x86/ARM/etc), kernel version, driver version, virtualization environment (hypervisor), etc. In UAR allocation, if the system supports the allocation with non-cached mapping, the first attempt is performed with MLX5DV_UAR_ALLOC_TYPE_NC. Then, if this fails, the next attempt is done with MLX5DV_UAR_ALLOC_TYPE_BF. However, the function adds a condition for the case where the first attempt was performed with MLX5DV_UAR_ALLOC_TYPE_BF, a condition that is unattainable since the first attempt was always performed with MLX5DV_UAR_ALLOC_TYPE_NC. Remove the unreachable code. Fixes: `9cc0e99c81` ("common/mlx5: share UAR allocation routine") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	801b4885c5	crypto/mlx5: fix login release in probing and removal The probe function creates DevX object named login and saves pointer to it in priv structure. The remove function releases first the priv structure and then releases the login object. However, the pointer to login object is field of priv structure, which is invalid. Release the login object and then release the priv structure. Fixes: `debb27ea34` ("crypto/mlx5: create login object using DevX") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:20:35 +01:00
Michael Baum	204891763c	common/mlx5: make multi-process MR management port-agnostic In the multi-process mechanism, there are things that the secondary process does not perform itself but asks the primary process to perform for it. There is a special API for communication between the processes that receives parameters necessary for the specific action required as well as a special structure called mp_id that contains the port number of the processes through which the initial process finds the relevant ETH device for the processes. One of the operations performed through this mechanism is the creation of a memory region, where the secondary process sends the virtual address as a parameter and the mp_id structure with the port number inside it. However, once the memory area management is shared between the drivers and either port number or ETH device is no longer relevant to them, it seems unnecessary to continue communicating between the processes through the mp_id variable. In this patch we will remove the use of the above structure for all MR management, and add to the specific parameter of operations a pointer to the common device that contains everything needed to create/register MR. Fixes: `9f1d636f3e` ("common/mlx5: share MR management") Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Reviewed-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 14:12:08 +01:00
Michael Baum	334ed198ab	common/mlx5: remove redundant parameter in MR search Memory region management has recently been shared between drivers, including the search for caches in the data plane. The initial search in the local linear cache of the queue, usually yields a result and one should not continue searching in the next level caches. The function that searches in the local cache gets the pointer to a device as a parameter, that is not necessary for its operation but for subsequent searches (which, as mentioned, usually do not happen). Transferring the device to a function and maintaining it, takes some time and causes some impact on performance. Add the pointer to the device as a field of the mr_ctrl structure. The field will be updated during control path and will be used only when needed in the search. Fixes: `fc59a1ec55` ("common/mlx5: share MR mempool registration") Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Reviewed-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 14:11:16 +01:00
Michael Baum	6a4e438576	common/mlx5: fix MR search inlining Memory region management has recently been shared between drivers, including the search for caches in the data plane. The initial search in the local linear cache of the queue, usually yields a result and one should not continue searching in the next layer caches. Prior to cache sharing the local linear cache lookup function was defined with "static inline" attributes, those were missed in routine commoditizing step and this caused performance degradation. Set the common function as static inline. Fixes: `fc59a1ec55` ("common/mlx5: share MR mempool registration") Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Reviewed-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 14:04:47 +01:00
David Marchand	fe629897de	app/testpmd: remove double dependency on bitrate lib No need for double dependency, once is enough. While at it, sort alphabetically. Fixes: `fac83b3ef8` ("app: fix missing dependencies") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-11-06 00:46:00 +01:00
David Marchand	6cff0deff1	app/testpmd: remove unneeded dependency on meter lib testpmd depends on ethdev, which itself depends on meter. No need for an explicit dependency, since no testpmd code directly calls in the meter library. Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-11-06 00:46:00 +01:00
Gregory Etelson	2d3d840135	app/testpmd: fix flex item flush Testpmd provides 2 sets of flex item create and destroy functions One for hosts with JSON library. These functions parse flex item configuration stored in JSON file and create or destroy flex item object. The second functions set is for hosts without JSON library for compilation compatibility. On hosts without JSON library, current implementation issues "no JSON library" notification on port close. The notification was triggered by port destructors that include flex items flush routine. The patch introduces single implementation for testpmd flex item destroy. Fixes: `59f3a8acbc` ("app/testpmd: add flex item commands") Signed-off-by: Gregory Etelson <getelson@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-05 22:19:38 +01:00
Andrew Rybchenko	5e973b3fa1	common/sfc_efx: fix debug compilation control efsys.h belongs to common/sfc_efx and common driver debug toggle should be used instead of net/sfc toggle. Fixes: `5e111ed879` ("net/sfc: introduce common driver library") Cc: stable@dpdk.org Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-11-05 22:04:55 +01:00
Bing Zhao	e848218741	net/mlx5: check delay drop settings in kernel driver The delay drop is the common feature managed on per device basis and the kernel driver is responsible one for the initialization and rearming. By default, the timeout value is set to activate the delay drop when the driver is loaded. A private flag "dropless_rq" is used to control the rearming. Only when it is on, the rearming will be handled once received a timeout event. Or else, the delay drop will be deactivated after the first timeout occurs and all the Rx queues won't have this feature. The PMD is trying to query this flag and warn the application when some queues are created with delay drop but the flag is off. Signed-off-by: Bing Zhao <bingz@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-05 17:04:53 +01:00
Bing Zhao	febcac7b46	net/mlx5: support Rx queue delay drop For the Ethernet RQs, if there all receiving descriptors are exhausted, the packets being received will be dropped. This behavior prevents slow or malicious software entities at the host from affecting the network. While for hairpin cases, even if there is no software involved during the packet forwarding from Rx to Tx side, some hiccup in the hardware or back pressure from Tx side may still cause the descriptors to be exhausted. In certain scenarios it may be preferred to configure the device to avoid such packet drops, assuming the posting of descriptors will resume shortly. To support this, a new devarg "delay_drop" is introduced. By default, the delay drop is enabled for hairpin Rx queues and disabled for standard Rx queues. This value is used as a bit mask: - bit 0: enablement of standard Rx queue - bit 1: enablement of hairpin Rx queue And this attribute will be applied to all Rx queues of a device. The "rq_delay_drop" capability in the HCA_CAP is checked before creating any queue. If the hardware capabilities do not support this delay drop, all the Rx queues will still be created without this attribute, and the devarg setting will be ignored even if it is specified explicitly. A warning log is used to notify the application when this occurs. Signed-off-by: Bing Zhao <bingz@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-05 17:04:53 +01:00
Ferruh Yigit	b7ade5d31a	ethdev: fix crash on owner delete 'eth_dev->data' can be null before ethdev allocated. The API walks through all eth devices, at least for some data can be null. Adding 'eth_dev->data' null check before accessing it. Fixes: `33c73aae32` ("ethdev: allow ownership operations on unused port") Cc: stable@dpdk.org Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-11-05 15:35:57 +01:00
Jiawen Wu	b4ce1520c9	net/txgbe: fix link process in KR mode Set the 'present' parameter to 0 by default. It is configured by hardware, users can set it to 1 for manual configuration. Fixes: `f611dada1a` ("net/txgbe: update link setup process of backplane NICs") Cc: stable@dpdk.org Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-11-05 15:10:21 +01:00
Jie Wang	8cc79a1636	net/i40e: fix forward outer IPv6 VXLAN Testpmd forwards packets in checksum mode that it need to calculate the checksum of each layer's protocol. Then it will fill flags and header length into mbuf. In process_outer_cksums, HW calculates the outer checksum if tx_offloads contains outer UDP checksum otherwise SW calculates the outer checksum. When tx_offloads contains outer UDP checksum or outer IPv4 checksum, mbuf will be filled with correct header length. This patch added outer UDP checksum in tx_offload_capa and I40E_TX_OFFLOAD_MASK, when we set csum hw outer-udp on that the engine can forward outer IPv6 VXLAN packets. Fixes: `7497d3e2f7` ("net/i40e: convert to new Tx offloads API") Cc: stable@dpdk.org Signed-off-by: Jie Wang <jie1x.wang@intel.com> Acked-by: Beilei Xing <beilei.xing@intel.com>	2021-11-05 05:31:22 +01:00
Viacheslav Ovsiienko	25ed2ebff1	net/mlx5: support shared Rx queue port data path When receive packet, mlx5 PMD saves mbuf port number from RxQ data. To support shared RxQ, save port number into RQ context as user index. Received packet resolve port number from CQE user index which derived from RQ context. Legacy Verbs API doesn't support RQ user index setting, still read from RxQ port number. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Signed-off-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:51 +01:00
Xueming Li	09c2555303	net/mlx5: support shared Rx queue This patch introduces shared RxQ. All shared Rx queues with same group and queue ID share the same rxq_ctrl. Rxq_ctrl and rxq_data are shared, all queues from different member port share same WQ and CQ, essentially one Rx WQ, mbufs are filled into this singleton WQ. Shared rxq_data is set into device Rx queues of all member ports as RxQ object, used for receiving packets. Polling queue of any member ports returns packets of any member, mbuf->port is used to identify source port. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:50 +01:00
Xueming Li	5cf0707fc7	net/mlx5: remove Rx queue data list from device Rx queue data list(priv->rxqs) can be replaced by Rx queue list(priv->rxq_privs), removes it and replaces with universal wrapper API. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:49 +01:00
Xueming Li	5ceb3a02b0	net/mlx5: move Rx queue DevX resource To support shared RX queue, moves DevX RQ which is per queue resource to Rx queue private data. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:48 +01:00
Xueming Li	5db77fef78	net/mlx5: remove port info from shareable Rx queue To prepare for shared Rx queue, removes port info from shareable Rx queue control. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:47 +01:00
Xueming Li	44126bd9d0	net/mlx5: move Rx queue hairpin info to private data Hairpin info of Rx queue can't be shared, moves to private queue data. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:47 +01:00
Xueming Li	0cedf34da7	net/mlx5: move Rx queue reference count Rx queue reference count is counter of RQ, used to count reference to RQ object. To prepare for shared Rx queue, this patch moves it from rxq_ctrl to Rx queue private data. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:46 +01:00
Xueming Li	4cda06c3c3	net/mlx5: split Rx queue into shareable and private To prepare shared Rx queue, splits RxQ data into shareable and private. Struct mlx5_rxq_priv is per queue data. Struct mlx5_rxq_ctrl is shared queue resources and data. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-11-04 22:55:45 +01:00

1 2 3 4 5 ...

30818 Commits