numam-dpdk

Author	SHA1	Message	Date
Raja Zidane	ba707cdb6d	crypto/mlx5: fix queue size configuration The DevX interface for QP creation expects the number of WQEBBs. Wrongly, the number of descriptors was provided to the QP creation. In addition, the QP size must be a power of 2 what was not guaranteed. Provide the number of WQEBBs to the QP creation API. Round up the SQ size to a power of 2. Rename (sq/rq)_size to num_of_(send/receive)_wqes. Fixes: `6152534e21` ("crypto/mlx5: support queue pairs operations") Cc: stable@dpdk.org Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Acked-by: Tal Shnaiderman <talshn@nvidia.com>	2021-11-08 19:46:28 +01:00
Raja Zidane	56faca3bc1	crypto/mlx5: fix freeing on probing failure When calling device close, unset dek is called which destroys a hash list. In case of error during dev probe, close is called when dek hlist is not initialized. Ensure non null list destroy. Fixes: `90646d6c6e` ("crypto/mlx5: support basic operations") Cc: stable@dpdk.org Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 19:46:28 +01:00
Raja Zidane	54feeab1c0	common/mlx5: fix DevX queue size overflow The HW QP/SQ/RQ/CQ queue sizes may be bigger than 64KB. The width of the variable handled the queue size is 16 bits which cannot contain the maximum queue size. Replace the size type to be uint32_t. Fixes: `9dab4d62b4` ("common/mlx5: share DevX CQ creation") Fixes: `38f537635c` ("common/mlx5: share DevX SQ creation") Fixes: `f9213ab12c` ("common/mlx5: share DevX queue pair operations") Cc: stable@dpdk.org Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 19:46:28 +01:00
Maciej Szwed	aeed570a21	interrupt: fix request notifier interrupt processing We should call read() on RTE_INTR_HANDLE_VFIO_REQ event to confirm that event. Fixes: `0eb8a1c4c7` ("vfio: add request notifier interrupt") Cc: stable@dpdk.org Signed-off-by: Maciej Szwed <maciej.szwed@intel.com>	2021-11-08 18:26:07 +01:00
Harman Kalra	49fdb0ae0d	net/mlx4: fix crash on allocation failure This patch fixes coverity issue by adding a NULL check. Coverity issue: 373687 Fixes: `d61138d4f0` ("drivers: remove direct access to interrupt handle") Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-08 17:32:42 +01:00
Harman Kalra	aedd054c5c	drivers: check interrupt file descriptor validity This patch fixes coverity issue by adding a check for negative value to avoid bad bit shift operation and other invalid use of file descriptors. Coverity issue: 373717, 373697, 373685 Coverity issue: 373723, 373720, 373719, 373718, 373715, 373714, 373713 Coverity issue: 373710, 373707, 373706, 373705, 373704, 373701, 373700 Coverity issue: 373698, 373695, 373692, 373690, 373689 Coverity issue: 373722, 373721, 373709, 373702, 373696 Fixes: `d61138d4f0` ("drivers: remove direct access to interrupt handle") Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-08 17:32:42 +01:00
Harman Kalra	7e2083e462	eal/linux: check interrupt file descriptor validity This patch fixes coverity issue by adding a check for negative event fd value. Coverity issue: 373711, 373694 Fixes: `c2bd9367e1` ("lib: remove direct access to interrupt handle") Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-08 17:32:42 +01:00
Harman Kalra	3fcca9fac6	interrupts: check file descriptor validity This patch fixes coverity issues by adding a check for negative event fd value. Coverity issue: 373716, 373699, 373693, 373688 Fixes: `bbbac4cd6e` ("interrupts: remove direct access to interrupt handle") Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-08 17:32:42 +01:00
Elena Agostini	3a99464456	doc: add CUDA example in GPU guide Add a pseudo-code example to show how to use gpudev API with a CUDA application. Signed-off-by: Elena Agostini <eagostini@nvidia.com>	2021-11-08 17:20:53 +01:00
Elena Agostini	c7ebd65c13	gpudev: add communication list In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. When mixing network activity with task processing there may be the need to put in communication the CPU with the device in order to synchronize operations. An example could be a receive-and-process application where CPU is responsible for receiving packets in multiple mbufs and the GPU is responsible for processing the content of those packets. The purpose of this list is to provide a buffer in CPU memory visible from the GPU that can be treated as a circular buffer to let the CPU provide fondamental info of received packets to the GPU. A possible use-case is described below. CPU: - Trigger some task on the GPU - in a loop: - receive a number of packets - provide packets info to the GPU GPU: - Do some pre-processing - Wait to receive a new set of packet to be processed Layout of a communication list would be: ------- \| 0 \| => pkt_list \| status \| \| #pkts \| ------- \| 1 \| => pkt_list \| status \| \| #pkts \| ------- \| 2 \| => pkt_list \| status \| \| #pkts \| ------- \| .... \| => pkt_list ------- Signed-off-by: Elena Agostini <eagostini@nvidia.com>	2021-11-08 17:20:53 +01:00
Elena Agostini	f56160a255	gpudev: add communication flag In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. When mixing network activity with task processing there may be the need to put in communication the CPU with the device in order to synchronize operations. The purpose of this flag is to allow the CPU and the GPU to exchange ACKs. A possible use-case is described below. CPU: - Trigger some task on the GPU - Prepare some data - Signal to the GPU the data is ready updating the communication flag GPU: - Do some pre-processing - Wait for more data from the CPU polling on the communication flag - Consume the data prepared by the CPU Signed-off-by: Elena Agostini <eagostini@nvidia.com>	2021-11-08 17:20:53 +01:00
Elena Agostini	2d61b429cf	gpudev: add memory barrier Add a function for the application to ensure the coherency of the writes executed by another device into the GPU memory. Signed-off-by: Elena Agostini <eagostini@nvidia.com>	2021-11-08 17:20:53 +01:00
Elena Agostini	e818c4e2bf	gpudev: add memory API In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. Such workload distribution can be achieved by sharing some memory. As a first step, the features are focused on memory management. A function allows to allocate memory inside the device, or in the main (CPU) memory while making it visible for the device. This memory may be used to save packets or for synchronization data. The next step should focus on GPU processing task control. Signed-off-by: Elena Agostini <eagostini@nvidia.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-11-08 17:20:53 +01:00
Thomas Monjalon	a9af048aba	gpudev: support multi-process The device data shared between processes are moved in a struct allocated in a shared memory (a new memzone for all GPUs). The main struct rte_gpu references the shared memory via the pointer mpshared. The API function rte_gpu_attach() is added to attach a device from the secondary process. The function rte_gpu_allocate() can be used only by primary process. Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-11-08 17:20:53 +01:00
Thomas Monjalon	82e5f6b658	gpudev: add child device representing a device context The computing device may operate in some isolated contexts. Memory and processing are isolated in a silo represented by a child device. The context is provided as an opaque by the caller of rte_gpu_add_child(). Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-11-08 17:20:52 +01:00
Thomas Monjalon	18cb075631	gpudev: add event notification Callback functions may be registered for a device event. Callback management is per-process and not thread-safe. The events RTE_GPU_EVENT_NEW and RTE_GPU_EVENT_DEL are notified respectively after creation and before removal of a device, as part of the library functions. Some future events may be emitted from drivers. Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-11-08 17:20:52 +01:00
Elena Agostini	8b8036a66e	gpudev: introduce GPU device class library In heterogeneous computing system, processing is not only in the CPU. Some tasks can be delegated to devices working in parallel. The new library gpudev is for dealing with GPGPU computing devices from a DPDK application running on the CPU. The infrastructure is prepared to welcome drivers in drivers/gpu/. Signed-off-by: Elena Agostini <eagostini@nvidia.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-11-08 17:20:52 +01:00
Anatoly Burakov	4fd15c6af0	vfio: set errno on unsupported OS Currently, when code is running on FreeBSD or Windows, there is no way to distinguish between a geniune error and a "VFIO is unsupported" error. Fix the dummy implementations to also set the rte_errno flag. Fixes: `279b581c89` ("vfio: expose functions") Fixes: `c564a2a200` ("vfio: expose clear group function for internal usages") Fixes: `964b2f3bfb` ("vfio: export some internal functions") Fixes: `ea2dc10668` ("vfio: add multi container support") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-11-08 16:45:28 +01:00
Anatoly Burakov	da6e4cdca1	vfio: fix FreeBSD documentation On FreeBSD, `rte_vfio_is_enabled()` and `rte_vfio_noiommu_is_enabled()` API calls will not return error, and will instead return 0. This is intentional, because the caller of this API does not care whether VFIO is supported at all, and will instead be interested in whether VFIO is enabled or not. However, the doxygen comments for these functions state that they will return an error on FreeBSD, which is incorrect. Fix the doxygen comment to call out the fact that these functions are only relevant on Linux, but remove the reference to returning errors. Fixes: `279b581c89` ("vfio: expose functions") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-08 16:42:55 +01:00
Anatoly Burakov	bf8b792f3b	vfio: fix FreeBSD clear group stub On FreeBSD, `rte_vfio_clear_group()` was returning 0 even though this function is not valid for FreeBSD, and is called out to return error in doxygen comments. Fix the return value to match documentation. Fixes: `c564a2a200` ("vfio: expose clear group function for internal usages") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2021-11-08 16:42:44 +01:00
Anatoly Burakov	84e03bde1c	vfio: drop fallback Linux implementation Currently, VFIO support for Linux is compiled unconditionally, and supported kernel versions start with 4.4, so VFIO is assumed to always be enabled. There is no way of disabling VFIO support at compile time anyway, so just drop the "VFIO not available" fallback code altogether. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-08 16:27:15 +01:00
Chengwen Feng	b1f4933ef3	kni: check error code of allmulticast mode switch Some drivers may return errcode when switch allmulticast mode, so it's necessary to check the return code. Fixes: `b34801d1aa` ("kni: support allmulticast mode set") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-11-08 11:56:13 +01:00
Ferruh Yigit	e6cbfd9bf3	kni: update kernel API to set random MAC address Previously used 'random_ether_addr()' API is removed in upstream kernel with commit Commit ba530fea8ca1 ("ethernet: remove random_ether_addr()") Replacement API 'eth_random_addr()' is around since v3.6 [1], so simply switching to this API without any version checks. [1] 0a4dd594982a ("etherdevice: Rename random_ether_addr to eth_random_addr") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-11-08 11:51:39 +01:00
Satheesh Paul	fda3750de6	app/flow-perf: add random priority option Added support to create flows with priority attribute set randomly between 0 and a user supplied maximum value. This is useful to measure performance on NICs which may have to rearrange flows to honor flow priority. Removed the lower limit of 100000 flows per batch. Signed-off-by: Satheesh Paul <psatheesh@marvell.com> Acked-by: Wisam Jaddo <wisamm@nvidia.com>	2021-11-08 10:33:08 +01:00
Raja Zidane	f66898ebd0	common/mlx5: fix MMO configuration in DevX queue pair The QP extension valid bit was not set in the QP creation for MMO configuration. That caused the QP not to be connected to the GGA MMO engines, and any MMO WQE job got CQE with an error. Set the QP ext bit when MMO is configured. Fixes: `ddda000618` ("common/mlx5: add MMO configuration for DevX queue pair") Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 09:36:27 +01:00
Raja Zidane	4b99fe0577	common/mlx5: fix HCA capabilities PRM alignment 0x20 reserved bytes were missed in the HCA cap PRM structure before the newly added fields for MMO QP capabilities. That caused reading MMO QP caps incorrectly. Add the reserved fields in the HCA cap structure. Fixes: `cbc4c13a25` ("common/mlx5: update MMO HCA capabilities") Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 09:29:36 +01:00
Juraj Linkeš	d0152e1f1a	config/arm: split aarch32 options Aarch32 config got overlooked when splitting march in a previous patch. Fixes: `95e0f23022` ("config/arm: split -march into arch and features") Signed-off-by: Juraj Linkeš <juraj.linkes@pantheon.tech> Acked-by: Ruifeng Wang <ruifeng.wang@arm.com>	2021-11-08 09:17:06 +01:00
Radha Mohan Chintakuntla	ebc539271e	dma/cnxk: add statistics Add the stats function to get the DMA statistics. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	3340c3e227	dma/cnxk: add scatter-gather copy Add the copy_sg function that will do the multiple DMA transfers of different sizes and different source/destination as well. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	b56f1e2dad	dma/cnxk: add channel operations Add functions for the dmadev vchan setup and DMA operations. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	53f6d7328b	dma/cnxk: create and initialize device on PCI probing This patch creates and initializes a dmadev device on pci probe. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-08 00:08:45 +01:00
Radha Mohan Chintakuntla	b6e395692b	common/cnxk: add DPI DMA support Add base support as ROC(Rest of Chip) API which will be used by PMD dmadev driver. This patch adds routines to init, fini, configure the DPI DMA device found in Marvell's CN9k or CN10k SoC families. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com>	2021-11-07 23:29:58 +01:00
Chengwen Feng	fb73eb2fb1	usertools/devbind: add Kunpeng DMA Add Kunpeng DMA device ID to dmadev category. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:04:28 +01:00
Chengwen Feng	569e850b4b	dma/hisilicon: support multi-process This patch add multi-process support for Kunpeng DMA devices. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:02:27 +01:00
Chengwen Feng	2db4f0b823	dma/hisilicon: add data path This patch add data path functions for Kunpeng DMA devices. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:02:27 +01:00
Chengwen Feng	3c5f5f03a0	dma/hisilicon: add control path This patch add control path functions for Kunpeng DMA devices. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:02:24 +01:00
Chengwen Feng	9e16317a38	dma/hisilicon: add probing This patch add dmadev instances create during the PCI probe, and destroy them during the PCI remove. Internal structures and HW definitions was also included. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 20:01:52 +01:00
Chengwen Feng	4d0d4cf327	dma/hisilicon: introduce driver skeleton Add the basic device probe and remove functions and initial documentation for new hisilicon DMA drivers. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2021-11-07 19:54:19 +01:00
Ali Alnubani	948be25cdb	sched: fix debug build Compare pkt_len to 0 instead of NULL to avoid the following build failure with debug mode enabled: ../lib/sched/rte_pie.h: In function 'rte_pie_enqueue_empty': ../lib/sched/rte_pie.h:125:21: error: comparison between pointer and integer [-Werror] RTE_ASSERT(pkt_len != NULL); Bugzilla ID: 878 Fixes: `44c730b0e3` ("sched: add PIE based congestion management") Signed-off-by: Ali Alnubani <alialnu@nvidia.com>	2021-11-07 18:52:51 +01:00
David Marchand	0eb62bf295	app: fix external dependency linking ext_deps was not used in app/meson.build so testpmd dependency on jansson was ignored. testpmd currently can be linked because metrics library is pulling the dependency on libjansson. Fixes: `59f3a8acbc` ("app/testpmd: add flex item commands") Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Gregory Etelson <getelson@nvidia.com>	2021-11-07 17:16:36 +01:00
Michael Baum	5dfa003db5	common/mlx5: fix post doorbell barrier The rdma-core library can map doorbell register in two ways, depending on the environment variable "MLX5_SHUT_UP_BF": - as regular cached memory, the variable is either missing or set to zero. This type of mapping may cause the significant doorbell register writing latency and requires an explicit memory write barrier to mitigate this issue and prevent write combining. - as non-cached memory, the variable is present and set to not "0" value. This type of mapping may cause performance impact under heavy loading conditions but the explicit write memory barrier is not required and it may improve core performance. The UAR creation function maps a doorbell in one of the above ways according to the system. In run time, it always adds an explicit memory barrier after writing to. In cases where the doorbell was mapped as non-cached memory, the explicit memory barrier is unnecessary and may impair performance. The commit [1] solved this problem for a Tx queue. In run time, it checks the mapping type and provides the memory barrier after writing to a Tx doorbell register if it is needed. The mapping type is extracted directly from the uar_mmap_offset field in the queue properties. This patch shares this code between the drivers and extends the above solution for each of them. [1] commit `8409a28573` ("net/mlx5: control transmit doorbell register mapping") Fixes: `f8c97babc9` ("compress/mlx5: add data-path functions") Fixes: `8e196c08ab` ("crypto/mlx5: support enqueue/dequeue operations") Fixes: `4d4e245ad6` ("regex/mlx5: support enqueue") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	b6e9c33c82	net/mlx5: remove duplicated reference of Tx doorbell The Tx doorbell has different virtual addresses per process. The secondary process takes the UAR physical page ID of the primary and mmap it to its own virtual address. The primary doorbell references were saved in two shared memory locations: the TxQ structure and a dedicated doorbell array. Remove the doorbell reference from the TxQ structure and move the primary processes to take the UAR information from the primary doorbell array. Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	b4371d3d56	common/mlx5: fix doorbell mapping configuration UAR mapping type can be affected by the devarg tx_db_nc, which can cause setting the environment variable MLX5_SHUT_UP_BF. So, the MLX5_SHUT_UP_BF value and the UAR mapping parameter affect the UAR cache mode. Wrongly, the devarg was considered for the MLX5_SHUT_UP_BF but not for the UAR mapping parameter in all the drivers except the net. Take the tx_db_nc devarg into account for all the drivers. Fixes: `ca1418ce39` ("common/mlx5: share device context object") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	3f0e54fe00	common/mlx5: fix UAR allocation diagnostics messages Depending on kernel capabilities and rdma-core version the mapping of UAR (User Access Region) of desired memory caching type (non-cached or write combining) might fail. The PMD implements the flexible strategy of UAR mapping, alternating the type of caching to succeed. During this process the failure diagnostics messages are emitted. These messages are merely diagnostics ones and the logging level should be adjusted to DEBUG. Fixes: `9cc0e99c81` ("common/mlx5: share UAR allocation routine") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	d1325200ac	common/mlx5: remove unreachable branch in UAR allocation The User Access Region (UAR) provides access to the hardware resources like Doorbell Register from userspace. It means the resources should be mapped by the kernel to some virtual address range. There two types of memory mapping are supported by mlx5 kernel driver: MLX5DV_UAR_ALLOC_TYPE_NC - non-cached, all writes promoted directly to hardware. MLX5DV_UAR_ALLOC_TYPE_BF - "BlueFlame", all writes might be cached by CPU, and will be flushed to hardware explicitly with memory barriers. The supported mapping types depend on the platform (x86/ARM/etc), kernel version, driver version, virtualization environment (hypervisor), etc. In UAR allocation, if the system supports the allocation with non-cached mapping, the first attempt is performed with MLX5DV_UAR_ALLOC_TYPE_NC. Then, if this fails, the next attempt is done with MLX5DV_UAR_ALLOC_TYPE_BF. However, the function adds a condition for the case where the first attempt was performed with MLX5DV_UAR_ALLOC_TYPE_BF, a condition that is unattainable since the first attempt was always performed with MLX5DV_UAR_ALLOC_TYPE_NC. Remove the unreachable code. Fixes: `9cc0e99c81` ("common/mlx5: share UAR allocation routine") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	801b4885c5	crypto/mlx5: fix login release in probing and removal The probe function creates DevX object named login and saves pointer to it in priv structure. The remove function releases first the priv structure and then releases the login object. However, the pointer to login object is field of priv structure, which is invalid. Release the login object and then release the priv structure. Fixes: `debb27ea34` ("crypto/mlx5: create login object using DevX") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:20:35 +01:00
Michael Baum	204891763c	common/mlx5: make multi-process MR management port-agnostic In the multi-process mechanism, there are things that the secondary process does not perform itself but asks the primary process to perform for it. There is a special API for communication between the processes that receives parameters necessary for the specific action required as well as a special structure called mp_id that contains the port number of the processes through which the initial process finds the relevant ETH device for the processes. One of the operations performed through this mechanism is the creation of a memory region, where the secondary process sends the virtual address as a parameter and the mp_id structure with the port number inside it. However, once the memory area management is shared between the drivers and either port number or ETH device is no longer relevant to them, it seems unnecessary to continue communicating between the processes through the mp_id variable. In this patch we will remove the use of the above structure for all MR management, and add to the specific parameter of operations a pointer to the common device that contains everything needed to create/register MR. Fixes: `9f1d636f3e` ("common/mlx5: share MR management") Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Reviewed-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 14:12:08 +01:00
Michael Baum	334ed198ab	common/mlx5: remove redundant parameter in MR search Memory region management has recently been shared between drivers, including the search for caches in the data plane. The initial search in the local linear cache of the queue, usually yields a result and one should not continue searching in the next level caches. The function that searches in the local cache gets the pointer to a device as a parameter, that is not necessary for its operation but for subsequent searches (which, as mentioned, usually do not happen). Transferring the device to a function and maintaining it, takes some time and causes some impact on performance. Add the pointer to the device as a field of the mr_ctrl structure. The field will be updated during control path and will be used only when needed in the search. Fixes: `fc59a1ec55` ("common/mlx5: share MR mempool registration") Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Reviewed-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 14:11:16 +01:00
Michael Baum	6a4e438576	common/mlx5: fix MR search inlining Memory region management has recently been shared between drivers, including the search for caches in the data plane. The initial search in the local linear cache of the queue, usually yields a result and one should not continue searching in the next layer caches. Prior to cache sharing the local linear cache lookup function was defined with "static inline" attributes, those were missed in routine commoditizing step and this caused performance degradation. Set the common function as static inline. Fixes: `fc59a1ec55` ("common/mlx5: share MR mempool registration") Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Reviewed-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 14:04:47 +01:00
David Marchand	fe629897de	app/testpmd: remove double dependency on bitrate lib No need for double dependency, once is enough. While at it, sort alphabetically. Fixes: `fac83b3ef8` ("app: fix missing dependencies") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-11-06 00:46:00 +01:00

1 2 3 4 5 ...

30834 Commits