numam-dpdk

Author	SHA1	Message	Date
Li Zhang	06ebaaea20	vdpa/mlx5: add VM memory registration task The driver creates a direct MR object of the HW for each VM memory region, which maps the VM physical address to the actual physical address. Later, after all the MRs are ready, the driver creates an indirect MR to group all the direct MRs into one virtual space from the HW perspective. Create direct MRs in parallel using the MT mechanism. After completion, the primary thread creates the indirect MR needed for the following virtqs configurations. This optimization accelerrate the LM process and reduce its time by 5%. Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-06-21 11:18:15 +02:00
Xueming Li	934ef2b666	vdpa/mlx5: cache and reuse hardware resources During device suspend and resume, resources are not changed normally. When huge resources were allocated to VM, like huge memory size or lots of queues, time spent on release and recreate became significant. To speed up, this patch reuses resources like VM MR and VirtQ memory if not changed. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Xueming Li	5fe068bf7a	vdpa/mlx5: reuse resources in reconfiguration To speed up device resume, create reuseable resources during device probe state, release when device is removed. Reused resources includes TIS, TD, VAR Doorbell mmap, error handling event channel and interrupt handler, UAR, Rx event channel, NULL MR, steer domain and table. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2022-05-09 21:39:58 +02:00
Stephen Hemminger	06c047b680	remove unnecessary null checks Functions like free, rte_free, and rte_mempool_free already handle NULL pointer so the checks here are not necessary. Remove redundant NULL pointer checks before free functions found by nullfree.cocci Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2022-02-12 12:07:48 +01:00
Josh Soref	7be78d0279	fix spelling in comments and strings The tool comes from https://github.com/jsoref Signed-off-by: Josh Soref <jsoref@gmail.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2022-01-11 12:16:53 +01:00
Michael Baum	04b4e4cbc0	vdpa/mlx5: workaround guest MR registrations Due to kernel issue in direct MKEY creation using the DevX API, this patch replaces the virtio MR creation to use Verbs API. Fixes: `cc07a42da2` ("vdpa/mlx5: prepare memory regions") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Signed-off-by: Matan Azrad <matan@nvidia.com>	2021-11-10 15:50:35 +01:00
Matan Azrad	398ea8450c	vdpa/mlx5: workaround dirty bitmap MR creation Due to kernel driver/FW issues in direct MKEY creation using the DevX API, this patch replaces the dirty bitmap MR creation to use wrapped mkey instead. Fixes: `9d39e57f21` ("vdpa/mlx5: support live migration") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Signed-off-by: Matan Azrad <matan@nvidia.com>	2021-11-10 15:50:26 +01:00
Michael Baum	e35ccf243b	common/mlx5: share protection domain object Create shared Protection Domain in common area and add it and its PDN as fields of common device structure. Use this Protection Domain in all drivers and remove the PD and PDN fields from their private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	662d0dc671	common/mlx5: disable RoCE in device context creation Add option to get IB device after disabling RoCE. It is relevant if there is vDPA class in device arguments list. Use common device context in vDPA driver and remove the ctx field from its private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Xueming Li	6e914454d5	vdpa/mlx5: fix large VM memory region registration When VM size is larger than 4G (u32) and memory region is larger than 4G, the 32-bit GCD function overflowed and returned wrong value that resulted in memory registration failure. This patch calls 64-bit GCD function to avoid overflow. Fixes: `cc07a42da2` ("vdpa/mlx5: prepare memory regions") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com>	2021-09-27 17:24:22 +02:00
Thomas Monjalon	5dd12566f1	vdpa/mlx5: fix minsize build Error occurs when configuring meson with --buildtype=minsize with GCC 11.1.0: drivers/vdpa/mlx5/mlx5_vdpa_mem.c: In function ‘mlx5_vdpa_mem_register’: drivers/vdpa/mlx5/mlx5_vdpa_mem.c:183:24: error: initialization of ‘uint64_t’ {aka ‘long unsigned int’} from ‘void *’ makes integer from pointer without a cast [-Werror=int-conversion] \| uint64_t gcd = NULL; \| ^~~~ drivers/vdpa/mlx5/mlx5_vdpa_mem.c:244:75: error: ‘mode’ may be used uninitialized in this function [-Werror=maybe-uninitialized] \| klm_size = mode == MLX5_MKC_ACCESS_MODE_KLM ? \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \| KLM_SIZE_MAX_ALIGN(empty_region_sz) : gcd; \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~ Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com>	2021-09-15 17:12:29 +02:00
Shiri Kuzin	9f39076b71	common/mlx5: fix mkey attributes initialization The crypto driver added new fields to the mkey attributes struct: crypto_en and set_remote_rw. The entire mkey struct was not initialized, only specific fields in it, which caused the new added fields not to be initialized resulting in a mkey creation error. This is fixed by initializing the entire mkey attributes struct to 0 which will prevent this issue from reoccurring if any fields are added to the mkey struct in the future. Fixes: `0111a74e13` ("common/mlx5: adjust DevX mkey fields for crypto") Signed-off-by: Shiri Kuzin <shirik@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-05-09 09:06:31 +02:00
Tal Shnaiderman	e82ddd28e3	common/mlx5: split PCI relaxed ordering for read and write The current DevX implementation of the relaxed ordering feature is enabling relaxed ordering usage only if both relaxed ordering read AND write are supported. In that case both relaxed ordering read and write are activated. This commit will optimize the usage of relaxed ordering by enabling it when the read OR write features are supported. Each relaxed ordering type will be activated according to its own capability bit. This will align the DevX flow with the verbs implementation of ibv_reg_mr when using the flag IBV_ACCESS_RELAXED_ORDERING Fixes: `53ac93f71a` ("net/mlx5: create relaxed ordering memory regions") Cc: stable@dpdk.org Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2020-11-04 19:16:24 +01:00
Matan Azrad	04e7beeb12	vdpa/mlx5: adjust virtio queue protection domain In other to fill the new requirement for virtq configuration, set the single PD managed by the driver for all the virtqs. Cc: stable@dpdk.org Signed-off-by: Matan Azrad <matan@mellanox.com> Signed-off-by: Xueming Li <xuemingl@mellanox.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-06-30 14:52:29 +02:00
Shiri Kuzin	53ac93f71a	net/mlx5: create relaxed ordering memory regions In the current state, when preforming read/write transactions we must wait for a completion in order to run the next transaction, and all transactions are performed by order. Relaxed Ordering is a PCI optimization which by enabling it we allow the system to perform read/writes in a different order without having to wait for completion and improve the performance in that matter. This commit introduces the creation of relaxed ordering memory regions in mlx5. As relaxed ordering is an optimization, drivers that do not support it can simply ignore it and therefore it is enabled by default. Signed-off-by: Shiri Kuzin <shirik@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:05 +02:00
Matan Azrad	cc07a42da2	vdpa/mlx5: prepare memory regions In order to map the guest physical addresses used by the virtio device guest side to the host physical addresses used by the HW as the host side, memory regions are created. By this way, for example, the HW can translate the addresses of the packets posted by the guest and to take the packets from the correct place. The design is to work with single MR which will be configured to the virtio queues in the HW, hence a lot of direct MRs are grouped to single indirect MR. Create functions to prepare and release MRs with all the related resources that are required for it. Create a new file mlx5_vdpa_mem.c to manage all the MR related code in the driver. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00

16 Commits