numam-dpdk

Author	SHA1	Message	Date
Abdullah Ömer Yamaç	7dde9c844a	drivers: omit symbol map when unneeded In this patch, we removed the necessity of the version files and you don't need to update these files for each release, you can just remove them. Suggested-by: Ferruh Yigit <ferruh.yigit@amd.com> Signed-off-by: Abdullah Ömer Yamaç <omer.yamac@ceng.metu.edu.tr> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Tested-by: Ferruh Yigit <ferruh.yigit@amd.com>	2022-11-14 15:22:46 +01:00
Thomas Monjalon	e9cc7c7abc	common/mlx5: use build configuration dictionary A recent commit added an explicit dependency check on common/mlx5. For consistency, query dpdk_conf instead of the list of common drivers. The lists *_drivers should be used only for printing. Fixes: `3df380f617` ("common/mlx5: fix disabling build") Suggested-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-11-14 11:28:49 +01:00
Thomas Monjalon	3df380f617	common/mlx5: fix disabling build If the dependency common/mlx5 is explicitly disabled, but net/mlx5 is not explicitly disabled, Meson will read the full recipe of net/mlx5 and will fail when accessing a variable from common/mlx5: drivers/net/mlx5/meson.build:76:4: ERROR: Unknown variable "mlx5_config". The solution is to stop parsing net/mlx5 if common/mlx5 is disabled. The deps array must be defined before stopping, in order to automatically disable the build of net/mlx5 and print the reason. The same protection is applied to other mlx5 drivers, so it will allow using the variable mlx5_config in future. Fixes: `22681deead` ("net/mlx5/hws: enable hardware steering") Reported-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Tested-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: David Marchand <david.marchand@redhat.com> Acked-by: Matan Azrad <matan@nvidia.com> Acked-by: Alex Vesker <valex@nvidia.com>	2022-10-30 15:55:10 +01:00
David Marchand	1f37cb2bb4	bus/pci: make driver-only headers private The pci bus interface is for drivers only. Mark as internal and move the header in the driver headers list. While at it, cleanup the code: - fix indentation, - remove unneeded reference to bus specific singleton object, - remove unneeded list head structure type, - reorder the definitions and macro manipulating the bus singleton object, - remove inclusion of rte_bus.h and fix the code that relied on implicit inclusion, Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Rosen Xu <rosen.xu@intel.com>	2022-09-23 16:14:34 +02:00
David Marchand	72206323a5	version: 22.11-rc0 Start a new release cycle with empty release notes. The ABI version becomes 23.0. The map files are updated to the new ABI major number (23). The ABI exceptions are dropped and CI ABI checks are disabled because compatibility is not preserved. Special handling of removed drivers is also dropped in check-abi.sh and a note has been added in libabigail.abignore as a reminder. Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2022-07-21 12:13:48 +02:00
David Marchand	ea2810fc21	vdpa/mlx5: fix leak on event thread creation As stated in the manual, pthread_attr_init return value should be checked. Besides, a pthread_attr_t should be destroyed once unused. In practice, we may have no leak (from what I read in glibc current code), but this may change in the future. Stick to a correct use of the API. Fixes: `5cf3fd3af4` ("vdpa/mlx5: add CPU core parameter to bind polling thread") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Matan Azrad <matan@nvidia.com>	2022-07-08 11:15:32 +02:00
Spike Du	95ff465009	vdpa/mlx5: use common interrupt management Replace vDPA interrupt handle creation logic with mlx5-common interrupt management function. Signed-off-by: Spike Du <spiked@nvidia.com>	2022-07-05 20:15:28 +02:00
Wisam Jaddo	2cf6f9aac9	vdpa/mlx5: add ConnectX-6 LX device ID This adds ConnectX-6 LX to the list of supported Mellanox devices that run the MLX5 vdpa PMD. Signed-off-by: Wisam Jaddo <wisamm@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-07-01 15:49:49 +02:00
Stephen Hemminger	64e14b8b07	remove unnecessary null checks Found by nullfree.cocci. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> [David: for lpm parts:] Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> [David: for vdpa/mlx5 parts:] Acked-by: Matan Azrad <matan@nvidia.com> [David: for dma/dpaa2, raw/ifpga, vdpa/mlx5:] Acked-by: Tyler Retzlaff <roretzla@linux.microsoft.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com> [David: reran cocci.sh and updated common/mlx5 and cryptodev asym test] Signed-off-by: David Marchand <david.marchand@redhat.com>	2022-06-24 14:51:09 +02:00
Li Zhang	cac75b2d2a	vdpa/mlx5: prepare virtqueue resource creation Split the virtqs virt-queue resource between the configuration threads. Pre-created virt-queue resources are also needed after virtq destruction. This accelerates the LM process and reduces its time by 30%. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:16 +02:00
Li Zhang	91edbbfbb4	vdpa/mlx5: add virtq sub-resources creation Pre-create virt-queue sub-resources in the device probe stage, then modify the virtqueue in the device config stage. The steer table also needs to support a dummy virt-queue. This accelerates the LM process and reduces its time by 40%. Signed-off-by: Li Zhang <lizh@nvidia.com> Signed-off-by: Yajun Wu <yajunw@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:16 +02:00
Li Zhang	6ebb02b44b	vdpa/mlx5: add device close task Split the virtqs device close tasks after stopping virt-queue between the configuration threads. This accelerates the LM process and reduces its time by 50%. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Li Zhang	0d9d28974d	vdpa/mlx5: add virtq live migration log task Split the virtqs LM log between the configuration threads. This accelerates the LM process and reduces its time by 20%. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Li Zhang	8e72e6bded	vdpa/mlx5: add virtq creation task The virtq object and all its sub-resources use a lot of FW commands and can be accelerated by the MT management. Split the virtqs creation between the configuration threads. This accelerates the LM process and reduces its time by 20%. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Li Zhang	06ebaaea20	vdpa/mlx5: add VM memory registration task The driver creates a direct MR object of the HW for each VM memory region, which maps the VM physical address to the actual physical address. Later, after all the MRs are ready, the driver creates an indirect MR to group all the direct MRs into one virtual space from the HW perspective. Create direct MRs in parallel using the MT mechanism. After completion, the primary thread creates the indirect MR needed for the following virtqs configurations. This optimization accelerates the LM process and reduces its time by 5%. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Li Zhang	69e07f43a2	vdpa/mlx5: add task ring for multi-thread management The configuration threads' tasks need a container to support multiple tasks assigned to a thread in parallel. Use an rte_ring container per thread to manage the thread tasks without locks. The caller thread from the user context opens a task to a thread and enqueues it to the thread ring. The thread polls its ring and dequeues tasks. That’s why the ring should be in multi-producer and single consumer mode. An atomic counter manages the tasks completion notification. The threads report errors to the caller by a dedicated error counter per task. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Li Zhang	67b070936d	vdpa/mlx5: add multi-thread management for configuration The LM process includes a lot of object creations and destructions in the source and the destination servers. The longer the LM takes, the more VM packets are dropped. To improve LM time, the configurations for mlx5 FW need to run in parallel. Add internal multi-thread management in the driver for it. A new devarg defines the number of threads and their CPU. The management is shared between all the devices of the driver. Since the event_core also affects the datapath events thread, reduce the priority of the datapath event thread to allow fast configuration of the devices doing the LM. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Li Zhang	057f7d2084	vdpa/mlx5: optimize datapath-control synchronization The driver used a single global lock for any synchronization needed for the datapath and control path. It is better to group the critical sections with the other ones that should be synchronized. Replace the global lock with the following locks: 1.virtq locks(per virtq) synchronize datapath polling and parallel configurations on the same virtq. 2.A doorbell lock synchronizes doorbell update, which is shared for all the virtqs in the device. 3.A steering lock for the shared steering objects updates. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Li Zhang	7f2de21244	vdpa/mlx5: pre-create virtq at probing time dev_config operation is called in LM progress. LM time is very critical because all the VM packets are dropped directly at that time. Move the virtq creation to probe time and only modify the configuration later in the dev_config stage using the new ability to modify virtq. This optimization accelerates the LM process and reduces its time by 70%. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Yajun Wu	24969c7b62	vdpa/mlx5: reuse event queues To speed up queue creation time, the event QP and CQ are created only once. Each virtq creation reuses the same event QP and CQ. Because FW sets the event QP to error state during virtq destroy, the event QP must be modified to RESET state and then to RTS state as usual. This can save about 1.5ms for each virtq creation. After SW QP reset, QP pi/ci all become 0 while CQ pi/ci keep their previous values. Add new variable qp_ci to save SW QP ci. Move QP pi independently of CQ ci. Add new function mlx5_vdpa_drain_cq to drain CQ CQE after virtq release. Signed-off-by: Yajun Wu <yajunw@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:17:41 +02:00
Yajun Wu	42a8fc7daa	vdpa/mlx5: support pre-creation of virtq resource The motivation of this change is to reduce vDPA device queue creation time by creating some queue resources in the vDPA device probe stage. In the VM live migration scenario, this can save 0.8ms for each queue creation, thus reducing LM network downtime. To create queue resources (umem/counter) in advance, we need to know the virtio queue depth and the max number of queues the VM will use. Introduce two new devargs: queues (max queue pair number) and queue_size (queue depth). Both arguments must be provided; if only one is provided, it is ignored and no pre-creation happens. The queues and queue_size must also match the vhost configuration the driver receives later. Otherwise the pre-created resources are either wasted or missing, or must be destroyed and recreated (in case of a queue_size mismatch). The pre-created umem/counter stay alive until vDPA device removal. Signed-off-by: Yajun Wu <yajunw@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:17:41 +02:00
Li Zhang	6f065d1539	vdpa/mlx5: fix maximum number of virtqs The driver wrongly takes the capability value for the number of virtq pairs instead of just the number of virtqs. Adjust all the usages of it to be the number of virtqs. Fixes: `c2eb33aaf9` ("vdpa/mlx5: manage virtqs by array") Cc: stable@dpdk.org Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:17:41 +02:00
Yajun Wu	95af59b7ad	vdpa/mlx5: workaround VAR offset within page vDPA driver first uses kernel driver to allocate doorbell (VAR) area for each device. Then uses var->mmap_off and var->length to mmap uverbs device file as doorbell userspace virtual address. Current kernel driver provides var->mmap_off equal to page start of VAR. It's fine with x86 4K page server, because VAR physical address is only 4K aligned thus locate in 4K page start. But with aarch64 64K page server, the actual VAR physical address has offset within page (not located in 64K page start). So the vDPA driver needs to add this within page offset (caps.doorbell_bar_offset) to get the right VAR virtual address. Fixes: `62c813706e` ("vdpa/mlx5: map doorbell") Cc: stable@dpdk.org Signed-off-by: Yajun Wu <yajunw@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-17 15:34:25 +02:00
Xueming Li	476048d546	vdpa/mlx5: make statistics counter persistent In order to speed-up the device suspend and resume, make the statistics counters persistent in reconfiguration until the device gets removed. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Xueming Li	d7e5d5a7e5	vdpa/mlx5: support device cleanup callback This patch supports device cleanup callback API which is called when the device is disconnected from the VM. Cached resources like VM MR and VQ memory are released. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Xueming Li	934ef2b666	vdpa/mlx5: cache and reuse hardware resources During device suspend and resume, resources are not changed normally. When huge resources were allocated to VM, like huge memory size or lots of queues, time spent on release and recreate became significant. To speed up, this patch reuses resources like VM MR and VirtQ memory if not changed. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Xueming Li	5fe068bf7a	vdpa/mlx5: reuse resources in reconfiguration To speed up device resume, create reuseable resources during device probe state, release when device is removed. Reused resources includes TIS, TD, VAR Doorbell mmap, error handling event channel and interrupt handler, UAR, Rx event channel, NULL MR, steer domain and table. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Xueming Li	b19cc62caf	vdpa/mlx5: avoid kick handling during shutdown When Qemu suspends a VM, HW notifier is un-mmapped while vCPU thread may still be active and write notifier through kick socket. PMD kick handler thread tries to install HW notifier through client socket. In such case, it will timeout and slow down device close. This patch skips HW notifier install if VQ or device in middle of shutdown. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Xueming Li	301ef4a185	vdpa/mlx5: fix dead loop when process interrupted In Ctrl+C handling, sometimes the kick handling thread gets endless EAGAIN errors and falls into a dead loop. Kick happens frequently in real system due to busy traffic or retry mechanism. This patch simplifies the handling: kick firmware anyway and skip setting the hardware notifier on a potential device error; the notifier can be set in the next successful kick request. Fixes: `62c813706e` ("vdpa/mlx5: map doorbell") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Xueming Li	66a439c5d7	vdpa/mlx5: fix interrupt trash that leads to crash Disable interrupt unregister timeout to avoid invalid FD caused interrupt thread segment fault. Fixes: `62c813706e` ("vdpa/mlx5: map doorbell") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Michael Baum	a729d2f093	common/mlx5: refactor devargs management Improve the devargs handling in two aspects: - Parse the devargs string only once. - Return error and report for unknown keys. The common driver parses once the devargs string into a dictionary, then provides it to all the drivers' probe. Each driver updates within it which keys it has used, then common driver receives the updated dictionary and reports about unknown devargs. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2022-02-21 11:36:56 +01:00
Stephen Hemminger	06c047b680	remove unnecessary null checks Functions like free, rte_free, and rte_mempool_free already handle NULL pointer so the checks here are not necessary. Remove redundant NULL pointer checks before free functions found by nullfree.cocci Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2022-02-12 12:07:48 +01:00
Matan Azrad	b5e51edfbe	vdpa/mlx5: workaround queue stop with traffic When the event thread polls traffic and a virtq is stopping, the FW loses synchronization in the virtq indexes. It causes LM failure on synchronization between the HOST indexes to the GUEST indexes. Unset the event thread before the queue stop in the LM process. Fixes: `31b9c29c86` ("vdpa/mlx5: support close and config operations") Cc: stable@dpdk.org Signed-off-by: Matan Azrad <matan@nvidia.com> Acked-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-01-27 05:44:49 +01:00
Josh Soref	7be78d0279	fix spelling in comments and strings The tool comes from https://github.com/jsoref Signed-off-by: Josh Soref <jsoref@gmail.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2022-01-11 12:16:53 +01:00
Bing Zhao	e9511a26e1	vdpa/mlx5: fix mkey creation check The return value of "mlx5_os_wrapped_mkey_create" is checked in the caller. A zero means success without any error. The typo in the if-condition should be fixed in case there is a misjudgment. Fixes: `398ea8450c` ("vdpa/mlx5: workaround dirty bitmap MR creation") Cc: stable@dpdk.org Signed-off-by: Bing Zhao <bingz@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-16 11:21:18 +01:00
Michael Baum	04b4e4cbc0	vdpa/mlx5: workaround guest MR registrations Due to kernel issue in direct MKEY creation using the DevX API, this patch replaces the virtio MR creation to use Verbs API. Fixes: `cc07a42da2` ("vdpa/mlx5: prepare memory regions") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Signed-off-by: Matan Azrad <matan@nvidia.com>	2021-11-10 15:50:35 +01:00
Matan Azrad	398ea8450c	vdpa/mlx5: workaround dirty bitmap MR creation Due to kernel driver/FW issues in direct MKEY creation using the DevX API, this patch replaces the dirty bitmap MR creation to use wrapped mkey instead. Fixes: `9d39e57f21` ("vdpa/mlx5: support live migration") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Signed-off-by: Matan Azrad <matan@nvidia.com>	2021-11-10 15:50:26 +01:00
Raja Zidane	bba8281d2e	common/mlx5: fix queue size in DevX queue pair creation The number of WQEBBs was provided to QP create, and QP size was calculated by multiplying the number of WQEBBs by 64, which is the send WQE size. When creating RQ in the QP (i.e., vdpa driver), the queue size was bigger because the receive WQE size is 16. Provide queue size to QP create instead of the number of WQEBBs. Fixes: `f9213ab12c` ("common/mlx5: share DevX queue pair operations") Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 19:46:28 +01:00
Raja Zidane	ba707cdb6d	crypto/mlx5: fix queue size configuration The DevX interface for QP creation expects the number of WQEBBs. Wrongly, the number of descriptors was provided to the QP creation. In addition, the QP size must be a power of 2, which was not guaranteed. Provide the number of WQEBBs to the QP creation API. Round up the SQ size to a power of 2. Rename (sq/rq)_size to num_of_(send/receive)_wqes. Fixes: `6152534e21` ("crypto/mlx5: support queue pairs operations") Cc: stable@dpdk.org Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Acked-by: Tal Shnaiderman <talshn@nvidia.com>	2021-11-08 19:46:28 +01:00
Harman Kalra	aedd054c5c	drivers: check interrupt file descriptor validity This patch fixes coverity issue by adding a check for negative value to avoid bad bit shift operation and other invalid use of file descriptors. Coverity issue: 373717, 373697, 373685 Coverity issue: 373723, 373720, 373719, 373718, 373715, 373714, 373713 Coverity issue: 373710, 373707, 373706, 373705, 373704, 373701, 373700 Coverity issue: 373698, 373695, 373692, 373690, 373689 Coverity issue: 373722, 373721, 373709, 373702, 373696 Fixes: `d61138d4f0` ("drivers: remove direct access to interrupt handle") Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-08 17:32:42 +01:00
Michael Baum	5dfa003db5	common/mlx5: fix post doorbell barrier The rdma-core library can map doorbell register in two ways, depending on the environment variable "MLX5_SHUT_UP_BF": - as regular cached memory, the variable is either missing or set to zero. This type of mapping may cause the significant doorbell register writing latency and requires an explicit memory write barrier to mitigate this issue and prevent write combining. - as non-cached memory, the variable is present and set to not "0" value. This type of mapping may cause performance impact under heavy loading conditions but the explicit write memory barrier is not required and it may improve core performance. The UAR creation function maps a doorbell in one of the above ways according to the system. In run time, it always adds an explicit memory barrier after writing to. In cases where the doorbell was mapped as non-cached memory, the explicit memory barrier is unnecessary and may impair performance. The commit [1] solved this problem for a Tx queue. In run time, it checks the mapping type and provides the memory barrier after writing to a Tx doorbell register if it is needed. The mapping type is extracted directly from the uar_mmap_offset field in the queue properties. This patch shares this code between the drivers and extends the above solution for each of them. [1] commit `8409a28573` ("net/mlx5: control transmit doorbell register mapping") Fixes: `f8c97babc9` ("compress/mlx5: add data-path functions") Fixes: `8e196c08ab` ("crypto/mlx5: support enqueue/dequeue operations") Fixes: `4d4e245ad6` ("regex/mlx5: support enqueue") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	b4371d3d56	common/mlx5: fix doorbell mapping configuration UAR mapping type can be affected by the devarg tx_db_nc, which can cause setting the environment variable MLX5_SHUT_UP_BF. So, the MLX5_SHUT_UP_BF value and the UAR mapping parameter affect the UAR cache mode. Wrongly, the devarg was considered for the MLX5_SHUT_UP_BF but not for the UAR mapping parameter in all the drivers except the net. Take the tx_db_nc devarg into account for all the drivers. Fixes: `ca1418ce39` ("common/mlx5: share device context object") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Maxime Coquelin	94c16e89d7	vhost: mark vDPA driver API as internal This patch marks the vDPA driver APIs as internal and rename the corresponding header file to vdpa_driver.h. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-03 09:11:34 +01:00
Harman Kalra	d61138d4f0	drivers: remove direct access to interrupt handle Removing direct access to interrupt handle structure fields, rather use respective get set APIs for the same. Making changes to all the drivers access the interrupt handle fields. Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com> Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Raslan Darawsheh <rasland@nvidia.com>	2021-10-25 21:20:12 +02:00
Xueming Li	8011a09add	vdpa/mlx5: retry VAR allocation during vDPA restart VAR is the device memory space for the virtio queues doorbells; Qemu could mmap it directly to speed up doorbell push. On a busy system, Qemu takes time to release VAR resources during driver shutdown. If vdpa restarted quickly, the VAR allocation failed with error 28 since the VAR is a singleton resource per device. This patch adds a retry mechanism for VAR allocation. Fixes: `4cae722c1b` ("vdpa/mlx5: move virtual doorbell alloc to probe") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Xueming Li	d38a53b175	vdpa/mlx5: workaround FW first completion in start After a vDPA application restart, Qemu restores VQ with used and available index, new incoming packet triggers virtio driver to handle buffers. Under heavy traffic, no available buffer for firmware to receive new packets, no Rx interrupts generated, driver is stuck on endless interrupt waiting. As a firmware workaround, this patch sends a notification after VQ setup to ask driver handling buffers and filling new buffers. Fixes: `bff7350110` ("vdpa/mlx5: prepare virtio queues") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Michael Baum	fe46b20c96	common/mlx5: share HCA capabilities handle Add HCA attributes structure as a field of device config structure. It is queried in common probing, which also updates the timestamp format fields. Each driver uses the HCA attributes from the common device config structure, instead of querying them itself. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	e35ccf243b	common/mlx5: share protection domain object Create shared Protection Domain in common area and add it and its PDN as fields of common device structure. Use this Protection Domain in all drivers and remove the PD and PDN fields from their private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	662d0dc671	common/mlx5: disable RoCE in device context creation Add option to get IB device after disabling RoCE. It is relevant if there is vDPA class in device arguments list. Use common device context in vDPA driver and remove the ctx field from its private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	7af08c8f1a	common/mlx5: share basic probing with internal drivers Create common probing structure that includes, for now, basic probing information detected by the common driver and share it with all the internal drivers. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:38:46 +02:00


152 Commits