numam-dpdk

Author	SHA1	Message	Date
Tyler Retzlaff	4b81c145ae	eal: change return type of bsf/fls functions The function return type is changed to fixed width uint32_t to be consistent with what appears to be the original authors intent. It doesn't make much sense to return signed integers for these functions. Signed-off-by: Tyler Retzlaff <roretzla@linux.microsoft.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Morten Brørup <mb@smartsharesystems.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2022-10-05 11:15:50 +02:00
Akhil Goyal	2973dbf93b	security: hide session structure Structure rte_security_session is moved to internal headers which are not visible to applications. The only field which should be used by app is opaque_data. This field can now be accessed via set/get APIs added in this patch. Subsequent changes in app and lib are made to compile the code. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Tested-by: Gagandeep Singh <g.singh@nxp.com> Tested-by: David Coyle <david.coyle@intel.com> Tested-by: Kevin O'Sullivan <kevin.osullivan@intel.com>	2022-10-04 22:37:54 +02:00
Akhil Goyal	66837861d3	drivers/crypto: support security session size query Added the support for rte_security_op.session_get_size() in all the PMDs which support rte_security sessions and the op was not supported. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Acked-by: Kai Ji <kai.ji@intel.com> Tested-by: Gagandeep Singh <g.singh@nxp.com> Tested-by: David Coyle <david.coyle@intel.com> Tested-by: Kevin O'Sullivan <kevin.osullivan@intel.com>	2022-10-04 22:37:06 +02:00
Akhil Goyal	3f3fc3308b	security: remove private mempool usage As per current design, rte_security_session_create() unnecessarily use 2 mempool objects for a single session. To address this, the API will now take only 1 mempool object instead of 2. With this change, the library layer will get the object from mempool and session priv data is stored contiguously in the same mempool object. User need to ensure that the mempool created in application is big enough for session private data as well. This can be ensured if the pool is created after getting size of session priv data using API rte_security_session_get_size(). Since set and get pkt metadata for security sessions are now made inline for Inline crypto/proto mode, a new member fast_mdata is added to the rte_security_session. To access opaque data and fast_mdata will be accessed via inline APIs which can do pointer manipulations inside library from session_private_data pointer coming from application. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Tested-by: Gagandeep Singh <g.singh@nxp.com> Tested-by: David Coyle <david.coyle@intel.com> Tested-by: Kevin O'Sullivan <kevin.osullivan@intel.com>	2022-10-04 22:37:00 +02:00
Akhil Goyal	2a440d6ab3	cryptodev: hide symmetric session structure Structure rte_cryptodev_sym_session is moved to internal headers which are not visible to applications. The only field which should be used by app is opaque_data. This field can now be accessed via set/get APIs added in this patch. Subsequent changes in app and lib are made to compile the code. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Kai Ji <kai.ji@intel.com> Tested-by: Gagandeep Singh <g.singh@nxp.com> Tested-by: David Coyle <david.coyle@intel.com> Tested-by: Kevin O'Sullivan <kevin.osullivan@intel.com>	2022-10-04 22:29:01 +02:00
Fan Zhang	6812b9bf47	crypto/scheduler: use unified session This patch updates the scheduler PMD to use unified session data structure. Previously thanks to the private session array in cryptodev sym session there are no necessary change needed for scheduler PMD other than the way ops are enqueued/dequeued. The patch inherits the same design in the original session data structure to the scheduler PMD so the cryptodev sym session can be as a linear buffer for both session header and driver private data. With the change there are inevitable extra cost on both memory (64 bytes per session per driver type) and cycle count (set the correct session for each cop based on the worker before enqueue, and retrieve the original session after dequeue). Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Signed-off-by: Akhil Goyal <gakhil@marvell.com> Acked-by: Kai Ji <kai.ji@intel.com> Tested-by: Gagandeep Singh <g.singh@nxp.com> Tested-by: David Coyle <david.coyle@intel.com> Tested-by: Kevin O'Sullivan <kevin.osullivan@intel.com>	2022-10-04 22:05:18 +02:00
Akhil Goyal	bdce2564db	cryptodev: rework session framework As per current design, rte_cryptodev_sym_session_create() and rte_cryptodev_sym_session_init() use separate mempool objects for a single session. And structure rte_cryptodev_sym_session is not directly used by the application, it may cause ABI breakage if the structure is modified in future. To address these two issues, the rte_cryptodev_sym_session_create will take one mempool object that the session and session private data are virtually/physically contiguous, and initializes both fields. The API rte_cryptodev_sym_session_init is removed. rte_cryptodev_sym_session_create will now return an opaque session pointer which will be used by the app and other APIs. In data path, opaque session pointer is attached to rte_crypto_op and the PMD can call an internal library API to get the session private data pointer based on the driver id. Note: currently single session may be used by different device drivers, given it is initialized by them. After the change the session created by one device driver cannot be used or reinitialized by another driver. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Kai Ji <kai.ji@intel.com> Tested-by: Gagandeep Singh <g.singh@nxp.com> Tested-by: David Coyle <david.coyle@intel.com> Tested-by: Kevin O'Sullivan <kevin.osullivan@intel.com>	2022-10-04 22:04:59 +02:00
Kevin Laatz	1cab1a40ea	bus: cleanup devices on shutdown During EAL init, all buses are probed and the devices found are initialized. On eal_cleanup(), the inverse does not happen, meaning any allocated memory and other configuration will not be cleaned up appropriately on exit. Currently, in order for device cleanup to take place, applications must call the driver-relevant functions to ensure proper cleanup is done before the application exits. Since initialization occurs for all devices on the bus, not just the devices used by an application, it requires a) application awareness of all bus devices that could have been probed on the system, and b) code duplication across applications to ensure cleanup is performed. An example of this is rte_eth_dev_close() which is commonly used across the example applications. This patch proposes adding bus cleanup to the eal_cleanup() to make EAL's init/exit more symmetrical, ensuring all bus devices are cleaned up appropriately without the application needing to be aware of all bus types that may have been probed during initialization. Contained in this patch are the changes required to perform cleanup for devices on the PCI bus and VDEV bus during eal_cleanup(). There would be an ask for bus maintainers to add the relevant cleanup for their buses since they have the domain expertise. Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Morten Brørup <mb@smartsharesystems.com> Reviewed-by: Bruce Richardson <bruce.richardson@intel.com>	2022-10-04 21:20:15 +02:00
Olivier Matz	d5262b521d	mem: fix API doc about allocation on secondary processes Since 10 years, memzone allocation is allowed on secondary processes. Now it's time to update the documentation accordingly. At the same time, fix mempool, mbuf and ring documentation which rely on memzones internally. Bugzilla ID: 1074 Fixes: `916e4f4f4e` ("memory: fix for multi process support") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2022-10-04 13:36:13 +02:00
David Marchand	bb7e1f17a6	net/bnxt: fix build with GCC 13 GCC 13 complains with: ../../../dpdk/drivers/net/bnxt/tf_ulp/ulp_flow_db.c:962:1: error: conflicting types for ‘ulp_flow_db_flush_flows’ due to enum/integer mismatch; have ‘int32_t(struct bnxt_ulp_context , enum bnxt_ulp_fdb_type)’ {aka ‘int(struct bnxt_ulp_context , enum bnxt_ulp_fdb_type)’} [-Werror=enum-int-mismatch] 962 \| ulp_flow_db_flush_flows(struct bnxt_ulp_context ulp_ctx, \| ^~~~~~~~~~~~~~~~~~~~~~~ In file included from ../../../dpdk/drivers/net/bnxt/tf_ulp/ulp_flow_db.c:12: ../../../dpdk/drivers/net/bnxt/tf_ulp/ulp_flow_db.h:211:1: note: previous declaration of ‘ulp_flow_db_flush_flows’ with type ‘int32_t(struct bnxt_ulp_context , uint32_t)’ {aka ‘int(struct bnxt_ulp_context , unsigned int)’} 211 \| ulp_flow_db_flush_flows(struct bnxt_ulp_context ulp_ctx, \| ^~~~~~~~~~~~~~~~~~~~~~~ cc1: all warnings being treated as errors Fixes: `30683082a8` ("net/bnxt: combine default and regular flows") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2022-10-04 02:16:37 +02:00
Kalesh AP	5fdf25bd55	net/bnxt: fix representor info freeing Driver allocates "bp->rep_info" inside bnxt_init_rep_info() which is invoked from bnxt_rep_port_probe(). But the memory is freed inside bnxt_uninit_resources(), which is wrong. As a result, after error recovery bp->rep_info will be NULL. The memory should have freed inside bnxt_drv_uninit() to maintain symmetry of calls. Fixes: `6dc83230b4` ("net/bnxt: support port representor data path") Cc: stable@dpdk.org Signed-off-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Reviewed-by: Somnath Kotur <somnath.kotur@broadcom.com>	2022-10-04 02:14:31 +02:00
Kalesh AP	aaa44cb125	net/bnxt: remove unnecessary check We are not invoking rte_eth_switch_domain_free currently owing to an unnecessary check. This patch fixes that. Fixes: `e2895305a5` ("net/bnxt: fix resource cleanup") Cc: stable@dpdk.org Signed-off-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Reviewed-by: Somnath Kotur <somnath.kotur@broadcom.com>	2022-10-04 02:14:30 +02:00
Benjamin Le Berre	09b5b2880d	net/bnxt: fix error code during MTU change When the BNXT PMD was made to disallow MTU changes on active ports, the error code chosen for the case in bnxt_set_mtu_op() was -EPERM. The doc comment for rte_eth_dev_set_mtu() in lib/ethdev/rte_ethdev.h lists -EBUSY as the value to be used if the port must be stopped before applying an MTU change and does not list -EPERM as a possible return value. This patch makes bnxt_set_mtu_op() return -EBUSY instead of -EPERM so that rte_eth_dev_set_mtu() behaves as expected. Fixes: `a42ab1eb33` ("net/bnxt: disallow MTU change when device is started") Cc: stable@dpdk.org Signed-off-by: Benjamin Le Berre <benjamin.le_berre@6wind.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2022-10-04 02:12:53 +02:00
Mao YingMing	4bfb499829	net/bnxt: fix null pointer dereference in LED config For VFs, bp->leds is uninitialized, check bp->leds is not null before checking for bp->leds->num_leds. segfault backtrace in trex program when use VF: 11: bnxt_hwrm_port_led_cfg (bp=0x23ffb2140, led_on=true) 10: bnxt_dev_led_on_op (dev=0x22d7780 <rte_eth_devices>) 9: rte_eth_led_on (port_id=0) 8: DpdkTRexPortAttr::set_led (this=0x23b6ce0, on=true) 7: DpdkTRexPortAttr::DpdkTRexPortAttr 6: CTRexExtendedDriverBnxt::create_port_attr 5: CPhyEthIF::Create 4: CGlobalTRex::device_start 3: CGlobalTRex::Create 2: main_test 1: main Fixes: `d4d5a04` ("net/bnxt: fix unnecessary memory allocation") Cc: stable@dpdk.org Signed-off-by: Mao YingMing <maoyingming@baidu.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2022-10-04 02:10:36 +02:00
David Marchand	6277523c04	malloc: remove unused function to set limit This function was never implemented and has been deprecated for a long time. We can remove it. Signed-off-by: David Marchand <david.marchand@redhat.com>	2022-10-03 19:37:54 +02:00
Sean Morrissey	fe1a5a9b42	dma/idxd: add completion status for page fault Add a status for page faults to be used when getting the completion status of an operation. Signed-off-by: Sean Morrissey <sean.morrissey@intel.com>	2022-10-03 19:32:36 +02:00
Radha Mohan Chintakuntla	681851b347	dma/cnxk: support CN10K DMA engine Added support for CN10K SoC DMA engine to dmadev. Signed-off-by: Radha Mohan Chintakuntla <radhac@marvell.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com>	2022-10-03 19:22:30 +02:00
Chengwen Feng	fbf006f7fe	examples/dma: support dequeue when no packet received Currently the example using DMA in asynchronous mode, which are: nb_rx = rte_eth_rx_burst(); if (nb_rx == 0) continue; ... dma_enqueue(); // enqueue the received packets copy request nb_cpl = dma_dequeue(); // get copy completed packets ... There are no waiting inside dma_dequeue(), and this is why it's called asynchronus. If there are no packet received, it won't call dma_dequeue(), but some packets may still in the DMA queue which enqueued in last cycle. As a result, when the traffic is stopped, the sent packets and received packets are unbalanced from the perspective of the traffic generator. The patch supports DMA dequeue when no packet received, it helps to judge the test result by comparing the sent packets with the received packets on traffic generator sides. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Kevin Laatz <kevin.laatz@intel.com>	2022-10-03 18:24:24 +02:00
Bruce Richardson	3aaa46977d	examples/kni: remove deprecated example As part of the agreed process for deprecating KNI in DPDK, the example app is scheduled for removal as part of the 22.11 release. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2022-10-03 18:24:24 +02:00
Thomas Monjalon	0f91f952be	replace Mellanox with NVIDIA NVIDIA acquired Mellanox Technologies in 2020. The DPDK documentation and code might still include instances of or references to Mellanox trademarks (like BlueField and ConnectX) that are now NVIDIA trademarks. The PCI IDs and copyrights are unchanged. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Gal Cohen <galco@nvidia.com>	2022-10-03 16:01:56 +02:00
Volodymyr Fialko	96798dcfc2	crypto/cnxk: support event vectorization Add support for vector aggregation of crypto operations for cn10k. Crypto operations will be grouped by sub event type, flow id, scheduler type and queue id fields from rte_event_crypto_metadata::response_info. Signed-off-by: Volodymyr Fialko <vfialko@marvell.com>	2022-10-02 20:33:25 +02:00
Volodymyr Fialko	c1749bc5ee	eventdev: introduce event cryptodev vector type Introduce ability to aggregate crypto operations processed by event crypto adapter into single event containing rte_event_vector whose event type is RTE_EVENT_TYPE_CRYPTODEV_VECTOR. Application should set RTE_EVENT_CRYPTO_ADAPTER_EVENT_VECTOR in rte_event_crypto_adapter_queue_conf::flag and provide vector configuration with respect of rte_event_crypto_adapter_vector_limits, which could be obtained by calling rte_event_crypto_adapter_vector_limits_get, to enable vectorization. The event crypto adapter would be responsible for vectorizing the crypto operations based on provided response information in rte_event_crypto_metadata::response_info. Updated drivers and tests accordingly to new API. Signed-off-by: Volodymyr Fialko <vfialko@marvell.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2022-10-02 20:33:24 +02:00
Olivier Matz	a0a17e2a3e	cryptodev: fix unduly newlines in logs The CDEV_LOG_* macros already add a '\n' at the end of the line. Remove it from format strings to avoid duplicated newlines. Fixes: `9e6edea418` ("cryptodev: add APIs to assist PMD initialisation") Fixes: `e764cd72a9` ("cryptodev: update symmetric session structure") Fixes: `1d6f89885e` ("cryptodev: add sym session mempool create") Fixes: `1f1e4b7cba` ("cryptodev: use single mempool for asymmetric session") Fixes: `757f40e28e` ("cryptodev: modify return value for asym session create") Fixes: `cea66374dc` ("cryptodev: support asymmetric operations") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: David Marchand <david.marchand@redhat.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2022-10-02 20:33:24 +02:00
Amit Prakash Shukla	3ebb587e53	cryptodev: add trace points Add trace points for cryptodev functions. Some of the APIs are restructured to add traces and return appropriately as needed. Signed-off-by: Amit Prakash Shukla <amitprakashs@marvell.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2022-10-02 20:33:24 +02:00
Gowrishankar Muthukrishnan	46a1584649	cryptodev: add elliptic curve fixed point multiplication Add enumeration in EC xform for FPM (fixed point multiplication). Crypto driver would need this to xform point multiplication based on given type of EC curve. Signed-off-by: Kiran Kumar K <kirankumark@marvell.com> Signed-off-by: Gowrishankar Muthukrishnan <gmuthukrishn@marvell.com> Acked-by: Kai Ji <kai.ji@intel.com>	2022-10-02 20:33:24 +02:00
Arek Kusztal	75fd4bbc94	crypto/qat: support SM3 hash algorithm Added support for ShangMi 3 (SM3) hash algorithm. Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com>	2022-10-02 20:33:24 +02:00
Arek Kusztal	92522c84e4	crypto/qat: support SM4 encryption algorithm Added support for ShangMi 4 (SM4) encryption algorithms. Supported modes: ECB, CBC, CTR. Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com>	2022-10-02 20:33:24 +02:00
Arek Kusztal	35ffc5b095	cryptodev: add SM3 hash algorithm ShangMi 3 (SM3) is a cryptographic hash function used in the Chinese National Standard. - Added SM3 algorithm Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Kai Ji <kai.ji@intel.com> Acked-by: Anoob Joseph <anoobj@marvell.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2022-10-02 20:33:24 +02:00
Arek Kusztal	515cd4a488	cryptodev: add SM4 encryption algorithm ShangMi 4 (SM4) is a block cipher used in the Chinese National Standard for Wireless LAN WAPI and also used with Transport Layer Security. Added SM4 encryption algorithm in ECB, CBC and CTR modes. Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Kai Ji <kai.ji@intel.com> Acked-by: Anoob Joseph <anoobj@marvell.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2022-10-02 20:33:24 +02:00
Srujana Challa	68d25915d2	security: remove user data get API The API rte_security_get_userdata() was being unused by most of the drivers and it was retrieving userdata from mbuf dynamic field. Hence, the API was removed and the application can directly get the userdata from dynamic field. This helps in removing extra checks in datapath. Signed-off-by: Srujana Challa <schalla@marvell.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2022-10-02 20:33:24 +02:00
Stephen Hemminger	832cecc03d	rwlock: prevent readers from starving writers Modify reader/writer lock to avoid starvation of writer. The previous implementation would cause a writer to get starved if readers kept acquiring the lock. The new version uses an additional bit to indicate that a writer is waiting and which keeps readers from starving the writer. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2022-10-03 12:03:36 +02:00
Chengwen Feng	8af559f94c	ethdev: support telemetry private dump This patch supports telemetry private dump a ethdev port. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2022-10-03 12:03:36 +02:00
Chengwen Feng	e915d404eb	rawdev: support telemetry dump rawdev This patch supports telemetry dump rawdev. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2022-10-03 12:03:36 +02:00
Chengwen Feng	a3b7b476d7	eventdev: support telemetry dump eventdev This patch supports telemetry dump eventdev. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2022-10-03 12:03:36 +02:00
Chengwen Feng	94043b0421	dmadev: support telemetry dump dmadev This patch supports telemetry dump dmadev. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2022-10-03 12:03:36 +02:00
Bruce Richardson	6d03ef606b	metrics: return error code on initialization failures DPDK libraries should never call rte_exit on failure, so change the function return type of rte_metrics_init to "int" to allow returning an error code to the application rather than exiting the whole app on init failure. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2022-10-03 12:03:36 +02:00
Bruce Richardson	9856af4044	eal: remove panic on remote launch failure Library functions should not cause the app to exit or panic. Replace the existing panic call in the EAL remote launch functions with an error code return instead. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-10-03 12:03:11 +02:00
Pavan Nikhilesh	dbc964c904	doc: remove deprecation notice for event vector Deprecation notice targeted for v22.11 of event vector has been merged in the following commits, remove deprecation notices. Fixes: `0fbb55efa5` ("eventdev: add element offset to event vector") Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2022-09-30 12:13:15 +02:00
Pavan Nikhilesh	5fa63911e4	eventdev: replace padding type in event vector Replace u64s with u64s in rte_event_vector structure as the ptrs already serves the purpose of holding pointers and the intention of u64s is to hold array of uint64_t values. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2022-09-30 12:13:15 +02:00
Pavan Nikhilesh	d986276f9b	eventdev: add prefix to public symbol Add `rte` prefix to stop flush callback function pointer declaration to avoid conflicts with application functions, ``eventdev_stop_flush_t`` is renamed to ``rte_eventdev_stop_flush_t``. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2022-09-30 12:13:15 +02:00
Abdullah Sevincer	9c9e72326b	event/dlb2: handle enqueuing more than maximum depth This patch addresses an issue of enqueuing more than max_enq_depth and not able to dequeuing events equal to max_cq_depth in a single call of rte_event_enqueue_burst and rte_event_dequeue_burst. Apply fix for restricting enqueue of events to max_enq_depth so that in a single rte_event_enqueue_burst() call at most max_enq_depth events are enqueued. Also set per port and domain history list sizes based on cq_depth. This results in dequeuing correct number of events as set by max_cq_depth. Fixes: `f3cad285bb` ("event/dlb2: add infos get and configure") Cc: stable@dpdk.org Signed-off-by: Abdullah Sevincer <abdullah.sevincer@intel.com>	2022-09-30 10:57:40 +02:00
Abdullah Sevincer	bdd0b609a1	event/dlb2: optimize credit allocations This commit implements the changes required for using suggested port type hint feature. Each port uses different credit quanta based on port type specified using port configuration flags. Each port has separate quanta defined in dlb2_priv.h Producer and consumer ports will need larger quanta value to reduce number of credit calls they make. Workers can use small quanta as they mostly work out of locally cached credits and don't request/return credits often. Signed-off-by: Abdullah Sevincer <abdullah.sevincer@intel.com>	2022-09-30 10:26:42 +02:00
Abdullah Sevincer	d8c16de5df	event/dlb2: add fence bypass option for producer ports If producer thread is only acting as a bridge between NIC and DLB, then performance can be greatly improved by bypassing the fence instruction. DLB enqueue API calls memory fence once per enqueue burst. If producer thread is just reading from NIC and sending to DLB without updating the read buffers or buffer headers OR producer is not writing to data structures with dependencies on the enqueue write order, then fencing can be safely disabled. Signed-off-by: Abdullah Sevincer <abdullah.sevincer@intel.com>	2022-09-30 10:25:43 +02:00
Abdullah Sevincer	8d1d9070bb	event/dlb2: optimize producer port probing For best performance, applications running on certain cores should use the DLB device locally available on the same tile along with other resources. To allocate optimal resources, probing is done for each producer port (PP) for a given CPU and the best performing ports are allocated to producers. The CPU used for probing is either the first core of producer coremask (if present) or the second core of EAL coremask. This will be extended later to probe for all CPUs in the producer coremask or EAL coremask. Producer coremask can be passed along with the BDF of the DLB devices. "-a xx:y.z,producer_coremask=<core_mask>" Applications also need to pass RTE_EVENT_PORT_CFG_HINT_PRODUCER during rte_event_port_setup() for producer ports for optimal port allocation. For optimal load balancing ports that map to one or more QIDs in common should not be in numerical sequence. The port->QID mapping is application dependent, but the driver interleaves port IDs as much as possible to reduce the likelihood of sequential ports mapping to the same QID(s). Hence, DLB uses an initial allocation of Port IDs to maximize the average distance between an ID and its immediate neighbors. Using the initialport allocation option can be passed through devarg "default_port_allocation=y(or Y)". When events are dropped by workers or consumers that use LDB ports, completions are sent which are just ENQs and may impact the latency. To address this, probing is done for LDB ports as well. Probing is done on ports per 'cos'. When default cos is used, ports will be allocated from best ports from the best 'cos', else from best ports of the specific cos. Signed-off-by: Abdullah Sevincer <abdullah.sevincer@intel.com>	2022-09-30 10:24:36 +02:00
Pavan Nikhilesh	edbb4c09c5	event/cnxk: fix missing xstats operations Fix missing xstats ops registration when initializing event device. Fixes: `b5a52c9d97` ("event/cnxk: add event port and queue xstats") Cc: stable@dpdk.org Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com>	2022-09-30 09:15:48 +02:00
Pavan Nikhilesh	6dc1d293d3	app/eventdev: fix cleanup flush During cleanup `rte_event_port_quiesce` should be called irrespective of whether an event has been dequeued or not to flush any prefetched events. Fixes: `7da008df0c` ("app/eventdev: use port quiescing") Cc: stable@dpdk.org Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com>	2022-09-30 09:15:48 +02:00
Ivan Malov	cd8e2d8292	net/sfc: clarify Rx buffer size calculation The user has the right to supply pools with excessively large buffers, regardless of the maximum supported Rx packet length reported by the adapter. However, in this PMD, on EF10 boards, a Rx descriptor has only 14 bits to specify the buffer length. To avoid potential problems, use this information accordingly. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2022-09-28 11:51:39 +02:00
Ivan Malov	61b3e9e79a	common/sfc_efx/base: report maximum Rx data count Such information is useful to client drivers which deal with large Rx pool buffers (16-bit wide data count) and thus need to avoid overflow when setting EF10's 14-bit wide data count. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2022-09-28 11:51:39 +02:00
Ivan Malov	79b0c19261	common/sfc_efx/base: fix maximum Tx data count Maximum data count of a Tx descriptor is advertised to users, however, this value is mistakenly derived from the Rx define. Use the Tx one instead. The resulting value will be the same. Fixes: `1e43fe3cb4` ("net/sfc/base: separate limitations on Tx DMA descriptors") Cc: stable@dpdk.org Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2022-09-28 11:51:39 +02:00
Suanming Mou	c9dc038408	ethdev: add indirect action async query As rte_flow_action_handle_create/destroy/update() have their own asynchronous rte_flow_async_action_handle_create/destroy/update() version functions to accelerate the indirect action operations in queue based flow engine. Currently, the asynchronous version query function for indirect action was missing. Add rte_flow_async_action_handle_query() function corresponding to rte_flow_action_handle_query(). The new asynchronous version function enables enqueue the query to the hardware similar as asynchronous flow management does and returns immediately to free the CPU for other tasks. Application can get the query results from rte_flow_pull() when the hardware completes its work. Signed-off-by: Suanming Mou <suanmingm@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2022-09-28 10:47:34 +02:00

1 2 3 4 5 ...

33582 Commits