numam-dpdk

Author	SHA1	Message	Date
Bing Zhao	e9511a26e1	vdpa/mlx5: fix mkey creation check The return value of "mlx5_os_wrapped_mkey_create" is checked in the caller. A zero means success without any error. The typo in the if-condition should be fixed in case there is a misjudgment. Fixes: `398ea8450c` ("vdpa/mlx5: workaround dirty bitmap MR creation") Cc: stable@dpdk.org Signed-off-by: Bing Zhao <bingz@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-16 11:21:18 +01:00
Michael Baum	04b4e4cbc0	vdpa/mlx5: workaround guest MR registrations Due to kernel issue in direct MKEY creation using the DevX API, this patch replaces the virtio MR creation to use Verbs API. Fixes: `cc07a42da2` ("vdpa/mlx5: prepare memory regions") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Signed-off-by: Matan Azrad <matan@nvidia.com>	2021-11-10 15:50:35 +01:00
Matan Azrad	398ea8450c	vdpa/mlx5: workaround dirty bitmap MR creation Due to kernel driver/FW issues in direct MKEY creation using the DevX API, this patch replaces the dirty bitmap MR creation to use wrapped mkey instead. Fixes: `9d39e57f21` ("vdpa/mlx5: support live migration") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Signed-off-by: Matan Azrad <matan@nvidia.com>	2021-11-10 15:50:26 +01:00
Raja Zidane	bba8281d2e	common/mlx5: fix queue size in DevX queue pair creation The number of WQEBBs was provided to QP create, and QP size was calculated by multiplying the number of WQEBBs by 64, which is the send WQE size. When creating RQ in the QP (i.e., vdpa driver), the queue size was bigger because the receive WQE size is 16. Provide queue size to QP create instead of the number of WQEBBs. Fixes: `f9213ab12c` ("common/mlx5: share DevX queue pair operations") Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-08 19:46:28 +01:00
Raja Zidane	ba707cdb6d	crypto/mlx5: fix queue size configuration The DevX interface for QP creation expects the number of WQEBBs. Wrongly, the number of descriptors was provided to the QP creation. In addition, the QP size must be a power of 2 what was not guaranteed. Provide the number of WQEBBs to the QP creation API. Round up the SQ size to a power of 2. Rename (sq/rq)_size to num_of_(send/receive)_wqes. Fixes: `6152534e21` ("crypto/mlx5: support queue pairs operations") Cc: stable@dpdk.org Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Acked-by: Tal Shnaiderman <talshn@nvidia.com>	2021-11-08 19:46:28 +01:00
Harman Kalra	aedd054c5c	drivers: check interrupt file descriptor validity This patch fixes coverity issue by adding a check for negative value to avoid bad bit shift operation and other invalid use of file descriptors. Coverity issue: 373717, 373697, 373685 Coverity issue: 373723, 373720, 373719, 373718, 373715, 373714, 373713 Coverity issue: 373710, 373707, 373706, 373705, 373704, 373701, 373700 Coverity issue: 373698, 373695, 373692, 373690, 373689 Coverity issue: 373722, 373721, 373709, 373702, 373696 Fixes: `d61138d4f0` ("drivers: remove direct access to interrupt handle") Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-08 17:32:42 +01:00
Michael Baum	5dfa003db5	common/mlx5: fix post doorbell barrier The rdma-core library can map doorbell register in two ways, depending on the environment variable "MLX5_SHUT_UP_BF": - as regular cached memory, the variable is either missing or set to zero. This type of mapping may cause the significant doorbell register writing latency and requires an explicit memory write barrier to mitigate this issue and prevent write combining. - as non-cached memory, the variable is present and set to not "0" value. This type of mapping may cause performance impact under heavy loading conditions but the explicit write memory barrier is not required and it may improve core performance. The UAR creation function maps a doorbell in one of the above ways according to the system. In run time, it always adds an explicit memory barrier after writing to. In cases where the doorbell was mapped as non-cached memory, the explicit memory barrier is unnecessary and may impair performance. The commit [1] solved this problem for a Tx queue. In run time, it checks the mapping type and provides the memory barrier after writing to a Tx doorbell register if it is needed. The mapping type is extracted directly from the uar_mmap_offset field in the queue properties. This patch shares this code between the drivers and extends the above solution for each of them. [1] commit `8409a28573` ("net/mlx5: control transmit doorbell register mapping") Fixes: `f8c97babc9` ("compress/mlx5: add data-path functions") Fixes: `8e196c08ab` ("crypto/mlx5: support enqueue/dequeue operations") Fixes: `4d4e245ad6` ("regex/mlx5: support enqueue") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Michael Baum	b4371d3d56	common/mlx5: fix doorbell mapping configuration UAR mapping type can be affected by the devarg tx_db_nc, which can cause setting the environment variable MLX5_SHUT_UP_BF. So, the MLX5_SHUT_UP_BF value and the UAR mapping parameter affect the UAR cache mode. Wrongly, the devarg was considered for the MLX5_SHUT_UP_BF but not for the UAR mapping parameter in all the drivers except the net. Take the tx_db_nc devarg into account for all the drivers. Fixes: `ca1418ce39` ("common/mlx5: share device context object") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-11-07 16:21:03 +01:00
Maxime Coquelin	94c16e89d7	vhost: mark vDPA driver API as internal This patch marks the vDPA driver APIs as internal and rename the corresponding header file to vdpa_driver.h. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-11-03 09:11:34 +01:00
Harman Kalra	d61138d4f0	drivers: remove direct access to interrupt handle Removing direct access to interrupt handle structure fields, rather use respective get set APIs for the same. Making changes to all the drivers access the interrupt handle fields. Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com> Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Raslan Darawsheh <rasland@nvidia.com>	2021-10-25 21:20:12 +02:00
Xueming Li	8011a09add	vdpa/mlx5: retry VAR allocation during vDPA restart VAR is the device memory space for the virtio queues doorbells, Qemu could mmap it to directly to speed up doorbell push. On a busy system, Qemu takes time to release VAR resources during driver shutdown. If vdpa restarted quickly, the VAR allocation failed with error 28 since the VAR is singleton resource per device. This patch adds retry mechanism for VAR allocation. Fixes: `4cae722c1b` ("vdpa/mlx5: move virtual doorbell alloc to probe") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Xueming Li	d38a53b175	vdpa/mlx5: workaround FW first completion in start After a vDPA application restart, Qemu restores VQ with used and available index, new incoming packet triggers virtio driver to handle buffers. Under heavy traffic, no available buffer for firmware to receive new packets, no Rx interrupts generated, driver is stuck on endless interrupt waiting. As a firmware workaround, this patch sends a notification after VQ setup to ask driver handling buffers and filling new buffers. Fixes: `bff7350110` ("vdpa/mlx5: prepare virtio queues") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Michael Baum	fe46b20c96	common/mlx5: share HCA capabilities handle Add HCA attributes structure as a field of device config structure. It query in common probing, and updates the timestamp format fields. Each driver use HCA attributes from common device config structure, instead of query it for itself. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	e35ccf243b	common/mlx5: share protection domain object Create shared Protection Domain in common area and add it and its PDN as fields of common device structure. Use this Protection Domain in all drivers and remove the PD and PDN fields from their private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	662d0dc671	common/mlx5: disable RoCE in device context creation Add option to get IB device after disabling RoCE. It is relevant if there is vDPA class in device arguments list. Use common device context in vDPA driver and remove the ctx field from its private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	7af08c8f1a	common/mlx5: share basic probing with internal drivers Create common probing structure that includes, for now, basic probing information detected by the common driver and share it with all the internal drivers. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:38:46 +02:00
Raja Zidane	f9213ab12c	common/mlx5: share DevX queue pair operations Currently drivers using QP (vDPA, crypto and compress, regex soon) manage their memory, creation, modification and destruction of the QP, in almost identical code. Move QP memory management, creation and destruction to common. Add common function to change QP state to RTS. Add user_index attribute to QP creation. It's for better code maintenance and reuse. Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-05 18:15:40 +02:00
Xueming Li	6e914454d5	vdpa/mlx5: fix large VM memory region registration When VM size is larger than 4G (u32) and memory region is larger than 4G, the 32-bit GCD function overflowed and returned wrong value that resulted in memory registration failure. This patch calls 64-bit GCD function to avoid overflow. Fixes: `cc07a42da2` ("vdpa/mlx5: prepare memory regions") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com>	2021-09-27 17:24:22 +02:00
Thomas Monjalon	5dd12566f1	vdpa/mlx5: fix minsize build Error occurs when configuring meson with --buildtype=minsize with GCC 11.1.0: drivers/vdpa/mlx5/mlx5_vdpa_mem.c: In function ‘mlx5_vdpa_mem_register’: drivers/vdpa/mlx5/mlx5_vdpa_mem.c:183:24: error: initialization of ‘uint64_t’ {aka ‘long unsigned int’} from ‘void *’ makes integer from pointer without a cast [-Werror=int-conversion] \| uint64_t gcd = NULL; \| ^~~~ drivers/vdpa/mlx5/mlx5_vdpa_mem.c:244:75: error: ‘mode’ may be used uninitialized in this function [-Werror=maybe-uninitialized] \| klm_size = mode == MLX5_MKC_ACCESS_MODE_KLM ? \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \| KLM_SIZE_MAX_ALIGN(empty_region_sz) : gcd; \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~ Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com>	2021-09-15 17:12:29 +02:00
Thomas Monjalon	fdab8f2e17	version: 21.11-rc0 Start a new release cycle with empty release notes. The ABI version becomes 22.0. The map files are updated to the new ABI major number (22). The ABI exceptions are dropped and CI ABI checks are disabled because compatibility is not preserved. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-08-17 08:37:52 +02:00
Michael Baum	c6b552e4c0	vdpa/mlx5: fix overflow in queue attribute The mlx5_vdpa_event_qp_create function makes shifting to the numeric constant 1, then multiplies it by another constant and finally assigns it into a uint64_t variable. The numeric constant type is an int with a 32-bit sign. if after shifting , its MSB (bit of sign) will change, the uint64 variable will get into it a different value than what the function intended it to get. Set the numeric constant 1 to be uint64_t in the first place. Fixes: `8395927cdf` ("vdpa/mlx5: prepare HW queues") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-07-22 14:48:07 +02:00
Thomas Monjalon	cf8a8a8f48	vdpa/mlx5: support Sub-Function RoCE disabling requirement is based on PCI address. In order to support Sub-Function, a conversion is needed in the case of an auxiliary device. SF device can be probed with such devargs string: auxiliary:mlx5_core.sf.<id>,class=vdpa Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-07-22 00:11:14 +02:00
Thomas Monjalon	d599bf8209	vdpa/mlx5: migrate to bus-agnostic common interface Replace PCI-specific handling with bus-agnostic structures. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-07-22 00:11:14 +02:00
Thomas Monjalon	bb060bb545	vdpa/mlx5: define driver name as macro Use a macro for the PMD driver name. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-07-22 00:11:14 +02:00
Rongwei Liu	630a587bfb	net/mlx5: support matching on VXLAN reserved field This adds matching on the reserved field of VXLAN header (the last 8-bits). The capability from rdma-core is detected by creating a dummy matcher using misc5 when the device is probed. For non-zero groups and FDB domain, the capability is detected from rdma-core, meanwhile for NIC domain group zero it's relying on the HCA_CAP from FW. Signed-off-by: Rongwei Liu <rongweil@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Raslan Darawsheh <rasland@nvidia.com>	2021-07-13 15:06:43 +02:00
Xueming Li	ff09f80697	vdpa/mlx5: fix TSO offload without checksum Packet was corrupted when TSO requested without CSUM update. Enables CSUM automatically if only TSO requested. Fixes: `2aa8444b00` ("vdpa/mlx5: support stateless offloads") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:39:23 +02:00
Xueming Li	35d4f17b3d	devargs: add common key definition Add common devargs key definition for "bus", "class" and "driver". Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-07-05 16:33:18 +02:00
Matan Azrad	2838aa76ba	vdpa/mlx5: fix device unplug The vDPA PCI device unplug process should release all the private device resources and also to unregister the device. The device unregistration was missed what remained the device data invalid in the rte_vhost library. Unregister the device in unplug process via the remove operation. Fixes: `95276abaaf` ("vdpa/mlx5: introduce Mellanox vDPA driver") Cc: stable@dpdk.org Reported-by: Eli Britstein <elibr@nvidia.com> Signed-off-by: Matan Azrad <matan@nvidia.com> Tested-by: Eli Britstein <elibr@nvidia.com> Acked-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-05-18 10:15:19 +02:00
Shiri Kuzin	9f39076b71	common/mlx5: fix mkey attributes initialization The crypto driver added new fields to the mkey attributes struct: crypto_en and set_remote_rw. The entire mkey struct was not initialized, only specific fields in it, which caused the new added fields not to be initialized resulting in a mkey creation error. This is fixed by initializing the entire mkey attributes struct to 0 which will prevent this issue from reoccurring if any fields are added to the mkey struct in the future. Fixes: `0111a74e13` ("common/mlx5: adjust DevX mkey fields for crypto") Signed-off-by: Shiri Kuzin <shirik@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-05-09 09:06:31 +02:00
David Marchand	eeded2044a	log: register with standardized names Let's try to enforce the convention where most drivers use a pmd. logtype with their class reflected in it, and libraries use a lib. logtype. Introduce two new macros: - RTE_LOG_REGISTER_DEFAULT can be used when a single logtype is used in a component. It is associated to the default name provided by the build system, - RTE_LOG_REGISTER_SUFFIX can be used when multiple logtypes are used, and then the passed name is appended to the default name, RTE_LOG_REGISTER is left untouched for existing external users and for components that do not comply with the convention. There is a new Meson variable log_prefix to adapt the default name for baseband (pmd.bb.), bus (no pmd.) and mempool (no pmd.) classes. Note: achieved with below commands + reverted change on net/bonding + edits on crypto/virtio, compress/mlx5, regex/mlx5 $ git grep -l RTE_LOG_REGISTER drivers/ \| while read file; do pattern=${file##drivers/}; class=${pattern%%/}; pattern=${pattern#$class/}; drv=${pattern%%/}; case "$class" in baseband) pattern=pmd.bb.$drv;; bus) pattern=bus.$drv;; mempool) pattern=mempool.$drv;; ) pattern=pmd.$class.$drv;; esac sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern',/RTE_LOG_REGISTER_DEFAULT(\1,/' $file; sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern'\.$.$,/RTE_LOG_REGISTER_SUFFIX(\1, \2,/' $file; done $ git grep -l RTE_LOG_REGISTER lib/ \| while read file; do pattern=${file##lib/}; pattern=lib.${pattern%%/}; sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern',/RTE_LOG_REGISTER_DEFAULT(\1,/' $file; sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern'\.$.$,/RTE_LOG_REGISTER_SUFFIX(\1, \2,/' $file; done Signed-off-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-05-11 15:17:55 +02:00
Matan Azrad	99f9d799ce	vdpa/mlx5: improve interrupt management The driver should notify the guest for each traffic burst detected by CQ polling. The CQ polling trigger is defined by `event_mode` device argument, either by busy polling on all the CQs or by blocked call to HW completion event using DevX channel. Also, the polling event modes can move to blocked call when the traffic rate is low. The current blocked call uses the EAL interrupt API suffering a lot of overhead in the API management and serve all the drivers and libraries using only single thread. Use blocking FD of the DevX channel in order to do blocked call directly by the DevX channel FD mechanism. Signed-off-by: Matan Azrad <matan@nvidia.com> Acked-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-05-04 10:22:17 +02:00
Thomas Monjalon	91d7e76462	vdpa/mlx5: improve portability of thread naming The function pthread_setname_np is non-portable, so it may be unavailable in old glibc or other systems. The function rte_thread_setname is workarounding portability issues. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-04-28 05:13:49 +02:00
Shiri Kuzin	c31f3f7f7b	common/mlx5: share Verbs device match function The get_ib_device_match function iterates over the list of ib devices returned by the get_device_list glue function and returns the ib device matching the provided address. Since this function is in use by several drivers, in this patch we share the function in common part. Signed-off-by: Shiri Kuzin <shirik@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-05-04 22:49:37 +02:00
Bruce Richardson	4ad4b20a79	drivers: change indentation in build files Switch from using tabs to 4 spaces for meson.build indentation. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2021-04-21 14:04:09 +02:00
Matan Azrad	846ec2ea75	vdpa/mlx5: fix virtq cleaning The HW virtq object can be destroyed either when the device is closed or when the state of the virtq becomes disabled. Some parameters of the virtq should continue to be managed when the virtq state is changed but all of them must be initialized when the device is closed. Wrongly, the enable parameter stayed on when the device is closed what might cause creation of invalid virtq in the next time a device is assigned to the driver. Clean all the virtqs memory when the device is closed. Fixes: `c47d6e8333` ("vdpa/mlx5: support queue update") Cc: stable@dpdk.org Signed-off-by: Matan Azrad <matan@nvidia.com> Acked-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-03-31 10:37:10 +02:00
Thomas Monjalon	41b5a7a849	vdpa/mlx5: replace pthread functions unavailable in musl 1/ The function pthread_yield() does not exist in musl libc, and can be replaced with sched_yield() after including sched.h. 2/ The function pthread_attr_setaffinity_np() does not exist in musl libc, and can be replaced with pthread_setaffinity_np() after pthread_create(). Fixes: `b7fa0bf4d5` ("vdpa/mlx5: fix polling threads scheduling") Fixes: `5cf3fd3af4` ("vdpa/mlx5: add CPU core parameter to bind polling thread") Cc: stable@dpdk.org Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Matan Azrad <matan@nvidia.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: David Marchand <david.marchand@redhat.com>	2021-03-23 08:41:05 +01:00
Viacheslav Ovsiienko	044423c4db	vdpa/mlx5: support timestamp format This patch adds support for the timestamp format settings for the receive and send queues. If the firmware version x.30.1000 or above is installed and the NIC timestamps are configured with the real-time format, the default zero values for newly added fields cause the queue creation to fail. The patch queries the timestamp formats supported by the hardware and sets the configuration values in queue context accordingly. Fixes: `95276abaaf` ("vdpa/mlx5: introduce Mellanox vDPA driver") Cc: stable@dpdk.org Signed-off-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2021-03-16 10:05:36 +01:00
Thomas Monjalon	1b9e9826ad	common/mlx5: remove extra line feed in log messages The macro DRV_LOG already includes a terminating line feed character defined in PMD_DRV_LOG_. The extra line feeds added in some messages are removed. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Matan Azrad <matan@nvidia.com>	2021-03-15 14:30:57 +01:00
Matan Azrad	b7fa0bf4d5	vdpa/mlx5: fix polling threads scheduling When the event mode is with 0 fixed delay, the polling-thread will never give-up CPU. So, when multi-polling-threads are active, the context-switch between them will be managed by the system which may affect latency according to the time-out decided by the system. In order to fix multi-devices polling thread scheduling, this patch forces rescheduling for each CQ poll iteration. Move the polling thread to SCHED_RR mode with maximum priority to complete the fairness. Fixes: `6956a48cab` ("vdpa/mlx5: set polling mode default delay to zero") Signed-off-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Xueming Li <xuemingl@nvidia.com>	2021-02-10 22:17:47 +01:00
Matan Azrad	f00e5a15af	vdpa/mlx5: fix configuration mutex cleanup When the vDPA device is closed, the driver polling thread is canceled. The polling thread locks the configuration mutex while it polls the CQs. When the cancellation happens, it may terminate the thread inside the critical section what remains the configuration mutex locked. After device close, the driver may be configured again, in this case, for example, when the first queue state is updated, the driver tries to lock the mutex again and deadlock appears. Initialize the mutex after the polling thread cancellation. Fixes: `99abbd62c2` ("vdpa/mlx5: fix queue update synchronization") Cc: stable@dpdk.org Signed-off-by: Matan Azrad <matan@nvidia.com> Acked-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-29 18:16:10 +01:00
Bruce Richardson	762bfccc8a	config: remove compatibility build defines As announced in the deprecation note, remove all compatibility build defines from previous make/meson versions and use only the standardized ones - RTE_LIB_<name> for libraries, and RTE_<CLASS>_<NAME> for drivers. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-01-20 01:43:25 +01:00
Michael Baum	0e41abd198	vdpa/mlx5: move DevX CQ creation to common Using common function for DevX CQ creation. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-01-14 10:12:36 +01:00
Xueming Li	1f93bee4e7	vdpa/mlx5: add hardware queue moderation The next parameters control the HW queue moderation feature. This feature helps to control the traffic performance and latency trade-off. Each packet completion report from HW to SW requires CQ processing by SW and triggers interrupt for the guest driver. Interrupt report and handling cost CPU cycles and time and the amount of this affects directly on packet performance and latency. hw_latency_mode parameters [int] 0, HW default. 1, Latency is counted from the first packet completion report. 2, Latency is counted from the last packet completion. hw_max_latency_us parameters [int] 0 - 4095, The maximum time in microseconds that packet completion report can be delayed. hw_max_pending_comp parameter [int] 0 - 65535, The maximum number of pending packets completions in an HW queue. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:56 +01:00
Xueming Li	05421ec938	vdpa/mlx5: set default event mode to polling For better performance and latency, this patch sets default event handling mode to polling mode which uses dedicate thread per device to poll and process event. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Xueming Li	5cf3fd3af4	vdpa/mlx5: add CPU core parameter to bind polling thread This patch adds new device argument to specify cpu core affinity to event polling thread for better latency and throughput. The thread could be also located by name "vDPA-mlx5-<id>". Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Xueming Li	c9a189f4ea	vdpa/mlx5: default polling mode delay time to zero To improve performance and latency, this patch sets Rx polling mode default delay time to zero. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Xueming Li	6956a48cab	vdpa/mlx5: set polling mode default delay to zero To improve throughput and latency, this patch allows Rx polling timer delay to 0us. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Tal Shnaiderman	981746264e	common/mlx5: wrap event channel functions per OS Wrap the API to create/destroy event channel and to subscribe an event with OS calls. In Linux those calls are implemented by glue functions while in Windows they are not supported. Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-01-08 16:03:07 +01:00
Raslan Darawsheh	3ea12cad71	common/mlx5: fix name for ConnectX VF device ID Starting ConnectX-6 Dx, the VF device ID is generic and not per chip. https://pci-ids.ucw.cz/v2.2/pci.ids 101e ConnectX Family mlx5Gen Virtual Function This means that all will have the same VF device ID. Fixes: `5fc66630be` ("net/mlx5: add ConnectX6-DX device ID") Cc: stable@dpdk.org Signed-off-by: Raslan Darawsheh <rasland@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2020-11-20 21:10:05 +01:00
Viacheslav Ovsiienko	b9aa4ba7ce	vdpa/mlx5: fix UAR allocation This patch provides the UAR allocation workaround for the hosts where UAR allocation with Write-Combining memory mapping type fails. Fixes: `8395927cdf` ("vdpa/mlx5: prepare HW queues") Cc: stable@dpdk.org Signed-off-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2020-11-14 10:56:30 +01:00

1 2 3

118 Commits