numam-dpdk

Author	SHA1	Message	Date
Raja Zidane	2700326085	regex/mlx5: refactor HW queue objects The mlx5 PMD for regex class uses an MMO WQE operated by the GGA engine in BF devices. Currently, all the MMO WQEs are managed by the SQ object. Starting from BF3, the queue of the MMO WQEs should be connected to the GGA engine using a new configuration, MMO, that will be supported only in the QP object. The FW introduced new capabilities to define whether the MMO configuration should be configured for the GGA queue. Replace all the GGA queue objects to QP, set MMO configuration according to the new FW capabilities. Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-05 18:15:40 +02:00
Thomas Monjalon	51d7396440	regex/mlx5: fix minsize build Error occurs when configuring meson with --buildtype=minsize with GCC 11.1.0: drivers/regex/mlx5/mlx5_regex_fastpath.c:398:17: error: ‘len’ may be used uninitialized in this function [-Werror=maybe-uninitialized] \| complete_umr_wqe(qp, sq, &qp->jobs[mkey_job_id], sq->pi, \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \| klm_num, len); \| ~~~~~~~~~~~~~ drivers/regex/mlx5/mlx5_regex_fastpath.c:315:31: note: ‘len’ was declared here \| uint32_t klm_num = 0, len; \| ^~~ Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com>	2021-09-15 17:12:29 +02:00
Michael Baum	29ca3215f3	regex/mlx5: fix memory region unregistration The issue can cause illegal physical address access while a huge-page A is released and huge-page B is allocated on the same virtual address. The old MR can be matched using the virtual address of huge-page B but the HW will access the physical address of huge-page A which is no more part of the DPDK process. Register a driver callback for memory event in order to free out all the MRs of memory that is going to be freed from the DPDK process. Fixes: cda883bbb655 ("regex/mlx5: add dynamic memory registration to datapath") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2021-07-22 15:19:30 +02:00
Michael Baum	423719a367	regex/mlx5: fix size of setup constants The constant representing the size of the metadata is defined as an unsigned int variable with 32-bit. Similarly the constant representing the maximal output is also defined as an unsigned int variable with 32-bit. There is potentially overflowing expression when those constants are evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type size_t that might be 64-bit. Change the size of the above constants to size_t. Fixes: 30d604bb1504 ("regex/mlx5: fix type of setup constants") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-07-22 14:47:10 +02:00
Suanming Mou	330a70b773	regex/mlx5: add data path scattered mbuf process UMR (User-Mode Memory Registration) WQE can present data buffers scattered within multiple mbufs with single indirect mkey. Take advantage of the UMR WQE, scattered mbuf in one operation can be presented to an indirect mkey. The RegEx which only accepts one mkey can now process the whole scattered mbuf in one operation. The maximum scattered mbuf can be supported in one UMR WQE is now defined as 64. The mbufs from multiple operations can be combined into one UMR WQE as well if there is enough space in the KLM array, since the operations can address their own mbuf's content by the mkey's address and length. However, one operation's scattered mbuf's can't be placed in two different UMR WQE's KLM array, if the UMR WQE's KLM does not has enough free space for one operation, the extra UMR WQE will be engaged. In case the UMR WQE's indirect mkey will be over wrapped by the SQ's WQE move, the mkey's index used by the UMR WQE should be the index of last the RegEX WQE in the operations. As one operation consumes one WQE set, build the RegEx WQE by reverse helps address the mkey more efficiently. Once the operations in one burst consumes multiple mkeys, when the mkey KLM array is full, the reverse WQE set index will always be the last of the new mkey's for the new UMR WQE. In GGA mode, the SQ WQE's memory layout becomes UMR/NOP and RegEx WQE by interleave. The UMR and RegEx WQE can be called as WQE set. The SQ's pi and ci will also be increased as WQE set not as WQE. For operations don't have scattered mbuf, uses the mbuf's mkey directly, the WQE set combination is NOP + RegEx. For operations have scattered mbuf but share the UMR WQE with others, the WQE set combination is NOP + RegEx. For operations complete the UMR WQE, the WQE set combination is UMR + RegEx. Signed-off-by: John Hurley <jhurley@nvidia.com> Signed-off-by: Suanming Mou <suanmingm@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2021-04-08 22:52:55 +02:00
Ori Kam	88e2a46d62	regex/mlx5: support priority match The high priority match request flags means that the RegEx engine should stop on the first match. This commit add this flag check to the RegEx engine. Signed-off-by: Ori Kam <orika@nvidia.com>	2021-01-19 18:06:05 +01:00
Ori Kam	2cace110ea	regex/mlx5: fix support for group id In order to know which groups in the RegEx engine should be used there is a need to check the req_flags. This commit adds the missing check. Fixes: 4d4e245ad637 ("regex/mlx5: support enqueue") Cc: stable@dpdk.org Signed-off-by: Ori Kam <orika@nvidia.com>	2021-01-19 18:04:33 +01:00
Michael Baum	9de7b16015	regex/mlx5: move DevX SQ creation to common Using common function for DevX SQ creation. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-01-14 10:12:36 +01:00
Michael Baum	3ddf57069b	regex/mlx5: move DevX CQ creation to common Using common function for DevX CQ creation. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-01-14 10:12:36 +01:00
Ori Kam	9b27a37b84	regex/mlx5: add response flags This commit propagate the response flags from the regex engine. Signed-off-by: Francis Kelly <fkelly@nvidia.com> Signed-off-by: Ori Kam <orika@nvidia.com>	2021-01-12 23:32:04 +01:00
Michael Baum	30d604bb15	regex/mlx5: fix type of setup constants The constant representing the size of the metadata is defined as a regular number (32-bit signed), even though all of its uses request an unsigned int variable. Similarly the constant representing the maximal output is also defined as a regular number, even though all of its uses request an unsigned int variable. Change the type of the above constants to unsigned. Fixes: 5f41b66d12cd ("regex/mlx5: setup fast path") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2020-11-22 15:05:08 +01:00
Yuval Avnery	cda883bbb6	regex/mlx5: add dynamic memory registration to datapath Currently job data is being copied to pre-registered buffer. To avoid memcpy on the datapath, use dynamic memory registration. This change will reduce latency when sending regex jobs. The first few jobs may have high latency due to registration, but assuming all following mbufs will arrive from the same mempool/hugepage, there will be no further memory registration. Signed-off-by: Yuval Avnery <yuvalav@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2020-10-06 01:11:45 +02:00
Phil Yang	f0f5d844d1	eal: remove deprecated coherent IO memory barriers Since the 20.08 release deprecated rte_cio_mb APIs because these APIs provide the same functionality as rte_io_mb APIs on all platforms, so remove them and use rte_io_*mb instead. Signed-off-by: Phil Yang <phil.yang@arm.com> Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-09-23 13:40:26 +02:00
Yuval Avnery	54fa1f6a67	regex/mlx5: add teardown for fastpath buffers Added missing code to free Input/Output buffers and memory registration. Also added calls to this code in case of error in the qp setup procedure. The rollback code itself did not handle rollback properly and did not check return value from the fastpath setup. Signed-off-by: Yuval Avnery <yuvalav@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-09-09 00:27:41 +02:00
Yuval Avnery	76e821a303	regex/mlx5: fix overrun on enqueueing When enqueueing a buffer the PMD check if there is room in its send queue (SQ). The current implementation did not take into account that queue indices are wrapping around, which may result in consumer index (sq->ci) can have bigger value than than the producer index (sq->pi). Fixes: 4d4e245ad637 ("regex/mlx5: support enqueue") Signed-off-by: Yuval Avnery <yuvalav@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-07-29 16:49:58 +02:00
Yuval Avnery	0db041e71e	regex/mlx5: support dequeue Implement dequeue function for the regex API. Signed-off-by: Yuval Avnery <yuvalav@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-07-21 19:04:05 +02:00
Yuval Avnery	4d4e245ad6	regex/mlx5: support enqueue Will look for a free SQ to send the job on. doorbell will be given when sq is full, or no more jobs on the burst. Signed-off-by: Yuval Avnery <yuvalav@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-07-21 19:04:05 +02:00
Yuval Avnery	5f41b66d12	regex/mlx5: setup fast path Allocated and register input/output buffers and metadata. Signed-off-by: Yuval Avnery <yuvalav@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-07-21 19:04:05 +02:00

18 Commits