numam-dpdk

Author	SHA1	Message	Date
Dariusz Sosnowski	f1fecffa88	net/mlx5: support Direct Rules action template API This patch adapts mlx5 PMD to changes in mlx5dr API regarding the action templates. It changes the following: 1. Actions template creation: - Flow actions types are translated to mlx5dr action types in order to create mlx5dr_action_template object. - An offset is assigned to each flow action. This offset is used to predetermine the action's location in the rule_acts array passed on the rule creation. 2. Template table creation: - Fixed actions are created and put in the rule_acts cache using predetermined offsets - mlx5dr matcher is parametrized by action templates bound to template table. - mlx5dr matcher is configured to optimize rule creation based on passed rule indices. 3. Flow rule creation: - mlx5dr rule is parametrized by the action template on which these rule's actions are based. - Rule index hint is provided to mlx5dr. Signed-off-by: Dariusz Sosnowski <dsosnowski@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:40 +02:00
Xiaoyu Min	4d368e1da3	net/mlx5: support flow counter action for HWS This commit adds HW steering counter action support. The pool mechanism is the basic data structure for the HW steering counter. The HW steering's counter pool is based on the rte_ring of zero-copy variation. There are two global rte_rings: 1. free_list: Store the counters indexes, which are ready for use. 2. wait_reset_list: Store the counters indexes, which are just freed from the user and need to query the hardware counter to get the reset value before this counter can be reused again. The counter pool also supports cache per HW steering's queues, which are also based on the rte_ring of zero-copy variation. The cache can be configured in size, preload, threshold, and fetch size, they are all exposed via device args. The main operations of the counter pool are as follows: - Get one counter from the pool: 1. The user call _get_* API. 2. If the cache is enabled, dequeue one counter index from the local cache: 2. A: if the dequeued one from the local cache is still in reset status (counter's query_gen_when_free is equal to pool's query gen): I. Flush all counters in the local cache back to global wait_reset_list. II. Fetch _fetch_sz_ counters into the cache from the global free list. III. Fetch one counter from the cache. 3. If the cache is empty, fetch _fetch_sz_ counters from the global free list into the cache and fetch one counter from the cache. - Free one counter into the pool: 1. The user calls _put_* API. 2. Put the counter into the local cache. 3. If the local cache is full: A: Write back all counters above _threshold_ into the global wait_reset_list. B: Also, write back this counter into the global wait_reset_list. When the local cache is disabled, _get_/_put_ cache directly from/into global list. Signed-off-by: Xiaoyu Min <jackmin@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:39 +02:00
Alexander Kozyrev	24865366e4	net/mlx5: support flow meter action for HWS This commit adds meter action for HWS steering. HW steering meter is based on ASO. The number of meters will be used by flows should be specified in advance in the flow configure API. Signed-off-by: Alexander Kozyrev <akozyrev@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:39 +02:00
Bing Zhao	ddb68e4733	net/mlx5: add extended metadata mode for HWS The new mode 4 of devarg "dv_xmeta_en" is added for HWS only. In this mode, the Rx / Tx metadata with 32b width copy between FDB and NIC is supported. The mark is only supported in NIC and there is no copy supported. Signed-off-by: Bing Zhao <bingz@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:38 +02:00
Dariusz Sosnowski	1939eb6f66	net/mlx5: support flow port action with HWS This patch implements creating and caching of port action for use with HW Steering FDB flows. Actions are created on flow template API configuration and created only on the port designated as the master. Attaching and detaching ports in the same switching domain causes an update to the port actions cache by, respectively, creating and destroying actions. A new devarg fdb_def_rule_en is being added and it's used to control the default dedicated E-Switch rules that are created by the PMD implicitly or not, and PMD sets this value to 1 by default. If set to 0, the default E-Switch rule will not be created and the user can create the specific E-Switch rules on the root table if needed. Signed-off-by: Dariusz Sosnowski <dsosnowski@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:38 +02:00
Suanming Mou	0f4aa72b99	net/mlx5: support flow modify field with HWS This patch introduces support for modify_field rte_flow actions in HWS mode that includes: - Ingress and egress domains, - SET and ADD operations, - usage of arbitrary bit offsets and widths for packet and metadata fields. This is implemented in two phases: 1. On flow table creation the hardware commands are generated, based on rte_flow action templates, and stored alongside action template. 2. On flow rule creation/queueing the hardware commands are updated with values provided by the user. Any masks over immediate values, provided in action templates, are applied to these values before enqueueing rules for creation. Signed-off-by: Dariusz Sosnowski <dsosnowski@nvidia.com> Signed-off-by: Suanming Mou <suanmingm@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:38 +02:00
Suanming Mou	7f6daa490d	net/mlx5: add shared header reformat As the rte_flow_async API defines the action mask with a field value not being 0 means the action will be used as shared in all the flows in the table. The header reformat action with the action mask field not being 0 will be created as constant shared action. For encapsulation header reformat action, there are two kinds of encapsulation data, raw_encap_data and rte_flow_item encap_data. Both of these two kinds of data can be identified from the action mask conf as constant or not. Examples: 1. VXLAN encap (encap_data: rte_flow_item) action conf (eth/ipv4/udp/vxlan_hdr) a. action mask conf (eth/ipv4/udp/vxlan_hdr) - items are constant. b. action mask conf (NULL) - items will change. 2. RAW encap (encap_data: raw) action conf (raw_data) a. action mask conf (not NULL) - encap_data constant. b. action mask conf (NULL) - encap_data will change. Signed-off-by: Suanming Mou <suanmingm@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:37 +02:00
Suanming Mou	b206c558f7	net/mlx5: fix IPv6 and TCP RSS hash fields In the flow_dv_hashfields_set() function, while item_flags was 0, the code went directly to the first if and the else case would never have a chance to be checked. This caused the IPv6 and TCP hash fields in the else case would never be set. This commit adds the dedicated HW steering hash field set function to generate the RSS hash fields. Fixes: `3a2f674b6a` ("net/mlx5: add queue and RSS HW steering action") Cc: stable@dpdk.org Signed-off-by: Suanming Mou <suanmingm@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:37 +02:00
Suanming Mou	4fb0ef2976	net/mlx5: fix steering engine type check In the function flow_get_drv_type(), attr will be read in non-HWS mode. In case the user calls the HWS API in SWS mode, the attr should be placed in HWS functions or it will cause a crash. Fixes: `c40c061a02` ("net/mlx5: add basic flow queue operation") Cc: stable@dpdk.org Signed-off-by: Suanming Mou <suanmingm@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:36 +02:00
Alex Vesker	22681deead	net/mlx5/hws: enable hardware steering Replace stub implementation of HWS with mlx5dr code. Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:36 +02:00
Hamdan Igbaria	78580cf4e7	net/mlx5/hws: add debug layer The debug layer is used to generate a debug CSV file containing details of the context, table, matcher, rules and other useful debug information. Signed-off-by: Hamdan Igbaria <hamdani@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:36 +02:00
Erez Shitrit	f8c8a6d844	net/mlx5/hws: add action object Action objects are used for executing different HW actions over packets. Each action contains the HW resources and parameters needed for action use over the HW when creating a rule. Signed-off-by: Erez Shitrit <erezsh@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:35 +02:00
Alex Vesker	405242c52d	net/mlx5/hws: add rule object HWS rule objects reside under the matcher, each rule holds the configuration for the packet fields to match and the set of actions to execute over the packet that has the requested fields. Rules can be created asynchronously in parallel over multiple queues to different matchers with each rule configured to the HW. Signed-off-by: Erez Shitrit <erezsh@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:35 +02:00
Alex Vesker	c467608215	net/mlx5/hws: add matcher object HWS matcher resides under the table object, each table can have multiple chained matches with different attributes. Each matcher represents a combination of match and action templates, and can contain multiple configurations based on the templates. Packets are steered from the table to the matcher and from there to other objects. The matcher allows efficient HW packet field matching and action execution based on the configuration done to it. Signed-off-by: Alex Vesker <valex@nvidia.com> Signed-off-by: Erez Shitrit <erezsh@nvidia.com>	2022-10-26 13:33:34 +02:00
Alex Vesker	394cc7ba40	net/mlx5/hws: add table object HWS table resides under the context object, each context can have multiple tables with different steering types RX/TX/FDB. The table is not only a logical object but it is also represented in the HW, packets can be steered to the table, and from there to other tables. Signed-off-by: Erez Shitrit <erezsh@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:34 +02:00
Alex Vesker	b0290e56dd	net/mlx5/hws: add context object Context is the first mlx5dr object created, all sub objects table, matcher, rule, and action are created using the context. The context holds the capabilities and the send queues used for configuring the offloads to the HW. Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:34 +02:00
Alex Vesker	c55c2bf353	net/mlx5/hws: add definer layer Definers are HW objects that are used for matching, rte items are translated to definers, each definer holds the fields and bit-masks used for HW flow matching. The definer layer is used for finding the most efficient definer for each set of items. In addition to definer creation we also calculate the field copy (fc) array used for efficient items to WQE conversion. Signed-off-by: Mark Bloch <mbloch@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:33 +02:00
Alex Vesker	3eb748869d	net/mlx5/hws: add send layer HWS configures flows to the HW using a QP, each WQE has the details of the flow we want to offload. The send layer allocates the resources needed to send the request to the HW as well as managing the queues, getting completions and handling failures. Signed-off-by: Mark Bloch <mbloch@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:33 +02:00
Erez Shitrit	b4dd7bcb0d	net/mlx5/hws: add pool and buddy HWS needs to manage different types of device memory in an efficient and quick way. For this, memory pools are being used. Signed-off-by: Erez Shitrit <erezsh@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:33 +02:00
Erez Shitrit	365cdf5f8c	net/mlx5/hws: add command layer This adds the command layer which is used to communicate with the FW, to query capabilities and allocate FW resources needed for HWS. Signed-off-by: Erez Shitrit <erezsh@nvidia.com> Signed-off-by: Alex Vesker <valex@nvidia.com>	2022-10-26 13:33:32 +02:00
Bing Zhao	8a89038f40	net/mlx5: provide available tag registers This stores the available tags that can be used by the application in a global array that will be used to transfer the TAG item directly from the ID to the REG_C_x since these can't be changed after startup. Signed-off-by: Bing Zhao <bingz@nvidia.com>	2022-10-26 13:33:31 +02:00
Dariusz Sosnowski	5bd0e3e671	net/mlx5: add port to metadata conversion This adds conversion functions between both ethdev port_id and IB context to internal corresponding tag/mask values. Signed-off-by: Dariusz Sosnowski <dsosnowski@nvidia.com>	2022-10-26 13:33:31 +02:00
Suanming Mou	75a00812b1	net/mlx5: add hardware steering item translation This provides shared item tranlsation code for hardware steering root table flows as they still work under FW steering mode. Signed-off-by: Suanming Mou <suanmingm@nvidia.com>	2022-10-26 13:33:30 +02:00
Suanming Mou	cd4ab74206	net/mlx5: split flow item matcher and value translation This split the item matcher and value translation to make the code reusable for the new steering mode. Signed-off-by: Suanming Mou <suanmingm@nvidia.com>	2022-10-26 13:33:30 +02:00
Suanming Mou	e64fd460b7	net/mlx5: split flow item translation This splits flow item translation code to a dedicated function to share the item translation code with hardware steering mode. Signed-off-by: Suanming Mou <suanmingm@nvidia.com>	2022-10-26 13:33:29 +02:00
Michael Savisko	7f6e276b02	net/mlx5: support flow action send to kernel Introduce mlx5_get_send_to_kernel_priority() function which returns value of priority which must be used to jump back to table 0 in order to send traffic to kernel. This function returns lowest priority. Add flow_dv_translate_action_send_to_kernel() function which will allocate rdma-core send_to_kernel action object. Called from flow_dv_translate(). Fail translation of RTE_FLOW_ACTION_TYPE_SEND_TO_KERNEL action in HW steering. Signed-off-by: Michael Savisko <michaelsav@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2022-10-26 13:33:29 +02:00
Michael Savisko	f31a141e64	net/mlx5: add send to kernel action resource holder Add new structure mlx5_send_to_kernel_action which will hold together allocated action resource and a reference to used table. A new structure member of this type added to struct mlx5_dev_ctx_shared. The member will be initialized upon first created send_to_kernel action and will be reused for all future actions of this type. Release of these resources will be done when all shared DR resources are being released in mlx5_os_free_shared_dr(). Change function flow_dv_tbl_resource_release() from static to external. Signed-off-by: Michael Savisko <michaelsav@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2022-10-26 13:33:29 +02:00
Michael Savisko	25c4d6dfae	net/mlx5: add flow action stub for send to kernel Add new mlx5 action flag MLX5_FLOW_ACTION_SEND_TO_KERNEL. Add element MLX5_FLOW_FATE_SEND_TO_KERNEL in enum mlx5_flow_fate_type. For that purpose field 'fate_action' in structure mlx5_flow_handle must be expanded from 3 bits to 4 bits. Signed-off-by: Michael Savisko <michaelsav@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2022-10-26 13:33:28 +02:00
Michael Savisko	80f998da1d	common/mlx5: add send to kernel flow action Add new glue callback dr_create_flow_action_send_to_kernel. Default callback invokes mlx5dv_dr_action_create_dest_root_table(). Add static inline mlx5_flow_os_create_flow_action_send_to_kernel(), which calls dr_create_flow_action_send_to_kernel glue callback. Define HAVE_MLX5DV_DR_ACTION_CREATE_DEST_ROOT_TABLE macro if function mlx5dv_dr_action_create_dest_root_table exists in infiniband/mlx5dv.h Signed-off-by: Michael Savisko <michaelsav@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2022-10-26 13:33:28 +02:00
Dong Zhou	4df7f801ff	net/mlx5: fix thread workspace memory leak The thread workspace push/pop should be paired. In the "flow_list_create" routine, if error happened the workspace pop was missed. This patch shares the workspace pop for all return paths. Fixes: `0064bf4318` ("net/mlx5: fix nested flow creation") Cc: stable@dpdk.org Signed-off-by: Dong Zhou <dongzhou@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-26 13:33:27 +02:00
Olivier Matz	d118ff6051	net/virtio: remove declaration of undefined function This function is not defined, remove its declaration. Fixes: `c1f86306a0` ("virtio: add new driver") Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2022-10-26 11:22:41 +02:00
Olivier Matz	974f216bf9	net/virtio: add queue and port ID in some logs Add the queue id and/or the port id in some logs, so it is easier to understand what happens. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2022-10-26 10:50:04 +02:00
Ferruh Yigit	578e37908d	net/mana: disable driver by default Driver is disabled by default because its dependencies are not upstreamed yet, code is available for development and investigation. When all dependencies are upstreamed, driver can be enabled back. Fixes: `517ed6e2d5` ("net/mana: add basic driver with build environment") Signed-off-by: Ferruh Yigit <ferruh.yigit@amd.com> Acked-by: Long Li <longli@microsoft.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2022-10-27 10:54:53 +02:00
Junfeng Guo	a46583cf43	net/gve: support Rx/Tx Add Rx/Tx of GQI_QPL queue format and GQI_RDA queue format. Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Junfeng Guo	4bec2d0b55	net/gve: support queue operations Add support for queue operations: - setup rx/tx queue - release rx/tx queue - start rx/tx queues - stop rx/tx queues Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Junfeng Guo	71dea04cdf	net/gve: support device info and configure Add dev_ops dev_infos_get. Complete dev_configure with RX offloads force enabling. Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Junfeng Guo	f19c864e4d	net/gve: support MTU setting Support dev_ops mtu_set. Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Junfeng Guo	440f551dd0	net/gve: support link update Support dev_ops link_update. Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Junfeng Guo	457967cd2b	net/gve: support device initialization Support device init and add following devops skeleton: - dev_configure - dev_start - dev_stop - dev_close Note that build system (including doc) is also added in this patch. Signed-off-by: Haiyue Wang <haiyue.wang@intel.com> Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Junfeng Guo	c9ba2caf63	net/gve/base: add OS-specific implementation Add some MACRO definitions and memory operations which are specific for DPDK. Signed-off-by: Haiyue Wang <haiyue.wang@intel.com> Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Junfeng Guo	f86e5ed867	net/gve/base: introduce Google Virtual Ethernet base The following base code is based on Google Virtual Ethernet (gve) driver v1.3.0 under MIT license. - gve_adminq.c - gve_adminq.h - gve_desc.h - gve_desc_dqo.h - gve_register.h - gve.h The original code is in: https://github.com/GoogleCloudPlatform/compute-virtual-ethernet-linux/\ tree/v1.3.0/google/gve Note that these code are not Intel files and they come from the kernel community. The base code there has the statement of SPDX-License-Identifier: (GPL-2.0 OR MIT). Here we just follow the required MIT license as an exception to DPDK. Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Signed-off-by: Haiyue Wang <haiyue.wang@intel.com> Signed-off-by: Junfeng Guo <junfeng.guo@intel.com>	2022-10-25 14:30:28 +02:00
Chaoyong He	9082336048	net/nfp: support new solution for tunnel decap action The new version of flower firmware application add the support of a new tunnel decap action solution. It changes the structure of tunnel neighbor, and use a feature flag to indicate which tunnel decap action solution is used. Add the logic of read extra features from firmware, and store it in the app private structure. Adjust the data structure and related logic to make the PMD support both version of tunnel decap action solutions. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	52a83356a6	net/nfp: support IPv6 NVGRE decap flow action Add the offload support of decap action of IPv6 NVGRE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	3f7aa85b6f	net/nfp: support IPv4 NVGRE decap flow action Add the offload support of decap action of IPv4 NVGRE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	cb69acd0bb	net/nfp: support IPv6 NVGRE flow item Add the corresponding logics to support the offload of IPv6 NVGRE item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	162fc0de03	net/nfp: support IPv4 NVGRE flow item Add the corresponding logics to support the offload of IPv4 NVGRE item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	5f5e836005	net/nfp: prepare for IPv6 GRE tunnel decap flow action Add the related data structure and logics, prepare for the decap action of IPv6 GRE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	8891d6700c	net/nfp: prepare for IPv4 GRE tunnel decap flow action Add the related data structure and logics, prepare for the decap action of IPv4 GRE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	fff680eef7	net/nfp: support IPv6 NVGRE encap flow action Add the offload support of encap action of IPv6 NVGRE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	88cce05380	net/nfp: support IPv4 NVGRE encap flow action Add the offload support of encap action of IPv4 NVGRE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	590ec35723	net/nfp: support IPv6 GENEVE decap flow action Add the offload support of decap action for IPv6 GENEVE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	646a8653b5	net/nfp: support IPv4 GENEVE decap flow action Add the offload support of decap action for IPv4 GENEVE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	dc1171dd03	net/nfp: support IPv6 GENEVE flow item Add the corresponding logics to support the offload of IPv6 GENEVE item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	d59b2b0b9d	net/nfp: support IPv4 GENEVE flow item Add the corresponding logics to support the offload of IPv4 GENEVE item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	98fa36eccc	net/nfp: support IPv6 GENEVE encap flow action Add the offload support of encap action for IPv6 GENEVE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	a2a1153b6f	net/nfp: support IPv4 GENEVE encap flow action Add the offload support of encap action for IPv4 GENEVE tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	e0843562c5	net/nfp: support IPv6 VXLAN decap flow action Add the offload support of decap action for IPv6 VXLAN tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	60bc1764a9	net/nfp: support IPv4 VXLAN decap flow action Add the offload support of decap action for IPv4 VXLAN tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	8ee18bad76	net/nfp: prepare for IPv6 UDP tunnel decap flow action Add the related data structure and functions, prepare for the decap action of IPv6 UDP tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	d3c33bdf1f	net/nfp: prepare for IPv4 UDP tunnel decap flow action Add the related data structure and functions, prepare for the decap action of IPv4 UDP tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	c3b7254093	net/nfp: support IPv6 VXLAN encap flow action Add the offload support of encap action for IPv6 VXLAN tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	724662b4ce	net/nfp: support IPv4 VXLAN encap flow action Add the offload support of encap action for IPv4 VXLAN tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	36fb222db9	net/nfp: prepare for IPv6 tunnel encap flow action Add the related data structure and functions, prepare for the encap action of IPv6 tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	f382761cd2	net/nfp: prepare for IPv4 tunnel encap flow action Add the related data structure and functions, prepare for the encap action of IPv4 tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	ea6f2fa073	net/nfp: support IPv6 VXLAN flow item Add the corresponding data structure and logics, to support the offload of IPv6 VXLAN item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	d617b75a71	net/nfp: support IPv4 VXLAN flow item Add the corresponding data structure and logics, to support the offload of IPv4 VXLAN item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	a5b876a5aa	net/nfp: fix CPP bridge service exit The CPP (Command Pull Push) bridge service is needed for some debug tools, but if no debug tools has ever been run, the initial logic of CPP bridge service will block in accept() function call, and the DPDK app can't exit normally. Fixes: `bab0e6f48b` ("net/nfp: fix infinite loop") Signed-off-by: Chaoyong He <chaoyong.he@corigine.com>	2022-10-25 10:53:33 +02:00
Chaoyong He	39d82d2098	net/nfp: fix promiscuous mode for representor port The original functions of promiscuous mode can't process the representor port rightly, revise the logic to do that. Fixes: `e1124c4f8a` ("net/nfp: add flower representor framework") Signed-off-by: Chaoyong He <chaoyong.he@corigine.com>	2022-10-24 14:30:43 +02:00
Chaoyong He	3202b003be	net/nfp: support IPv6 DSCP flow action Add the corresponding logics to support the offload of set IPv6 DSCP action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	9c665d70f7	net/nfp: support IPv4 DSCP flow action Add the corresponding logics to support the offload of set IPv4 DSCP action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	ac12e126c4	net/nfp: support TTL flow action Add the corresponding data structure and logics, to support the offload of set TTL action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	87986df09d	net/nfp: support TP destination flow action Add the corresponding logics to support the offload of set TP dest port action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	fc185097bb	net/nfp: support TP source flow action Add the corresponding data structure and logics, to support the offload of set TP source port action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	51384f79b2	net/nfp: support IPv6 destination flow action Add the corresponding logics to support the offload of set dest IPv6 address action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	596ae22172	net/nfp: support IPv6 source flow action Add the corresponding data structure and logics, to support the offload of set source IPv6 address action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	f85c1e05bf	net/nfp: support IPv4 destination flow action Add the corresponding logics to support the offload of set dest IPv4 address action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	ab80975b0a	net/nfp: support IPv4 source flow action Add the corresponding data structure and logics, to support the offload of set source IPv4 address action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	eb9277cb2f	net/nfp: support VLAN push flow action Add the corresponding data structure and logics, to support the offload of push_vlan action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	0203474d75	net/nfp: support VLAN pop flow action Add the corresponding data structure and logics, to support the offload of pop_vlan action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	eecc7ca308	net/nfp: support MAC destination flow action Add the corresponding logics to support the offload of set dest MAC action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	4f69831545	net/nfp: support MAC source flow action Add the corresponding data structure and logics, to support the offload of set source MAC action. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	cdb4743541	net/nfp: support SCTP flow item Add the corresponding logics to support the offload of SCTP item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	17956c4e6f	net/nfp: support UDP flow item Add the corresponding logics to support the offload of UDP item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	4b768cc836	net/nfp: support TCP flow item Add the corresponding data structure and logics, to support the offload of TCP item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	d965074044	net/nfp: support IPv6 flow item Add the corresponding data structure and logics, to support the offload of IPv6 item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	e8abcff52e	net/nfp: support IPv4 flow item Add the corresponding data structure and logics, to support the offload of IPv4 item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	397fa00bf8	net/nfp: support VLAN flow item Add the corresponding data structure and logics, to support the offload of VLAN item. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	4d946034bf	net/nfp: support basic flow actions Add the offload support of very basic actions: count, drop and output. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	8ad9ffa2ce	net/nfp: support basic flow items Add the offload support of very basic items: ethernet and port id. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	30ecce5227	net/nfp: support flow API Add the flow validate/create/query/destroy/flush API of nfp PMD. The flow create API construct a control cmsg and send it to firmware, then add this flow to the hash table. The flow query API get flow stats from the flow_priv structure. Note there exist an rte_spin_lock to prevent the update and query action occur at the same time. The flow destroy API construct a control cmsg and send it to firmware, then adelete this flow from the hash table. The flow flush API just iterate the flows in hash table and call the flow destroy API. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	5c30647de8	net/nfp: add stats logic in ctrl VNIC service Add the flow stats process logic in the ctrl VNIC service. The flower firmware pass the flow stats to nfp driver through control message, we store them in the flow_priv structure. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	ac09376096	net/nfp: add structures and functions for flow offload Add the structures and functions to process mask table, flow table, and flow stats id, which are used in the rte_flow offload logics. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	bab0e6f48b	net/nfp: fix infinite loop The services don't have a method to break the infinite loop, and this will cause the DPDK app can't end normally. Fixes: `a36634e87e` ("net/nfp: add flower ctrl VNIC Rx/Tx") Signed-off-by: Chaoyong He <chaoyong.he@corigine.com>	2022-10-21 15:00:25 +02:00
Chaoyong He	dee23e6c36	net/nfp: fix CPP bridge service requirement The CPP (Command Pull Push) bridge service is needed for some debug tools, and should be optional, so remove the mandatory requirement of service lcore parameter. Fixes: `b188042195` ("net/nfp: add initial flower firmware support") Signed-off-by: Chaoyong He <chaoyong.he@corigine.com>	2022-10-21 15:00:25 +02:00
Peng Zhang	dbad6f64f9	net/nfp: fix internal buffer size and MTU check When MTU is bigger than hw->flbufsz, it can't work. hw->flbufsz is set in the nfp_net_rx_queue_setup(). At first, in the nfp_net_configure(), the hw->flbufsz isn't set the value, it just judge the initialized value and MTU, it is unreasonable. Now, it just check the MTU can't be more than the NFP_FRAME_SIZE_MAX in the nfp_net_configure(), when hw->flbufsz is set the value, in the nfp_net_start(), judge the hw->flbufsz and MTU. Fixes: `5c305e218f` ("net/nfp: fix initialization") Cc: stable@dpdk.org Signed-off-by: Peng Zhang <peng.zhang@corigine.com> Reviewed-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-21 11:56:59 +02:00
Chengwen Feng	c622735dd3	net/bonding: call Tx prepare before Tx burst Normally, to use the HW offloads capability (e.g. checksum and TSO) in the Tx direction, the application needs to call rte_eth_tx_prepare() to do some adjustment with the packets before sending them. But the tx_prepare callback of the bonding driver is not implemented. Therefore, the sent packets may have errors (e.g. checksum errors). However, it is difficult to design the tx_prepare callback for bonding driver. Because when a bonded device sends packets, the bonded device allocates the packets to different slave devices based on the real-time link status and bonding mode. That is, it is very difficult for the bonded device to determine which slave device's prepare function should be invoked. So in this patch, the tx_prepare callback of bonding driver is not implemented. Instead, the rte_eth_tx_prepare() will be called before rte_eth_tx_burst(). In this way, all tx_offloads can be processed correctly for all NIC devices. Note: because it is rara that bond different PMDs together, so just call tx-prepare once in broadcast bonding mode. Also the following description was added to the rte_eth_tx_burst() function: "@note This function must not modify mbufs (including packets data) unless the refcnt is 1. The exception is the bonding PMD, which does not have tx-prepare function, in this case, mbufs maybe modified." Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Reviewed-by: Min Hu (Connor) <humin29@huawei.com> Acked-by: Chas Williams <3chas3@gmail.com>	2022-10-20 08:36:34 +02:00
Ivan Malov	df810d1b6e	net/bonding: fix flow flush order on close The current code first removes all back-end devices of the bonded device and then invokes flush operation to remove flows in such back-end devices, which makes no sense. Fix that by re-ordering the steps accordingly. Fixes: `49dad9028e` ("net/bonding: support flow API") Cc: stable@dpdk.org Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2022-10-20 08:30:22 +02:00
Ivan Malov	339f1ba513	net/bonding: make configure method re-entrant According to the documentation, rte_eth_dev_configure() can be invoked repeatedly while in stopped state. The current implementation in the bonding driver allows for that (technically), but the user sees warnings which say that back-end devices have already been harnessed. Re-factor the code to have cleanup before each (re-)configure. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Chas Williams <3chas3@gmail.com>	2022-10-19 10:23:40 +02:00
Andrew Boyer	a5b1ffd880	net/ionic: add watchdogs to protect each queue type Ring the doorbell again for the following scenarios: * No receives posted but Rx queue not empty after deadline * No transmits posted but Tx work still pending after deadline * Admin queue work still pending after deadline This will help the queues recover in the extremely rare case that a doorbell is missed by the FW. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	13133a2883	net/ionic: retry init commands up to five times In some configurations, the FW may return EAGAIN if it is not able to respond to commands immediately. Retry the init commands in this case to prevent errors from reaching the client. Fix up some return-code stuff while here, for clarity. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	15770e9866	net/ionic: use helper variable for page size This improves readability. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	e86a6fcc7c	net/ionic: add optimized non-scattered Rx/Tx The code is very similar, but the simple case can skip a few branches in the hot path. This improves PPS when 10KB mbufs are used. S/G is enabled on the Rx side by offload DEV_RX_OFFLOAD_SCATTER. S/G is enabled on the Tx side by offload DEV_TX_OFFLOAD_MULTI_SEGS. S/G is automatically enabled on the Rx side if the provided mbufs are too small to hold the maximum possible frame. To enable S/G in testpmd, add these args: --rx-offloads=0x2000 --tx-offloads=0x8000 Signed-off-by: Andrew Boyer <andrew.boyer@amd.com> Signed-off-by: R Mohamed Shah <mohamedshah.r@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	07512941de	net/ionic: allow to specify Tx free threshold Some clients have opinions about how often to flush the transmit ring. The default value is the number of Tx descriptors minus the default Tx burst size. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	c5d0bb79fd	net/ionic: add alignment and socket info in allocations This will avoid memory access penalties on NUMA systems. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	c663c7ecb0	net/ionic: use calloc for array allocations Use rte_calloc() where appropriate. This makes the code clearer. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	9de21005e2	net/ionic: add Q-in-CMB option When 'ionic_cmb' is set to '1', queue memory will be allocated from the device's onboard memory (Controller Memory Buffer). In some configurations, this will dramatically reduce packet latency and increase PPS. Add the WC_ACTIVATE flag to the PCI driver flags. Write combining must be enabled to achieve the maximum PPS. When the queue is in the CMB, descriptors cannot be prefetched. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com> Signed-off-by: Neel Patel <neel.patel@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	60625147e0	net/ionic: support Tx descriptor status This may be useful for clients. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com> Signed-off-by: Allen Hubbe <allen.hubbe@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	0983a74aee	net/ionic: support Rx descriptor status These may be useful for clients. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com> Signed-off-by: Allen Hubbe <allen.hubbe@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	b5b56afdd3	net/ionic: advertise supported packet types This improves performance, since clients may be able to skip SW packet classification. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	bbdf955ddc	net/ionic: add lookup table for checksum flags This improves performance by reducing branching. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	73b1c67e0c	net/ionic: add lookup table for packet type This improves performance by reducing branching. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	218afd825b	net/ionic: do bulk allocations of Rx mbufs Do bulk allocations to improve performance. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	9ac234ee8b	net/ionic: support mbuf fast free Use a put() rather than a free() in the optimized case. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	b4beb84af9	net/ionic: overhaul Tx for performance Linearize Tx mbuf chains in the info array. This avoids walking the mbuf chain during flush. Move a few branches out of the hot path. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	7b20fc2f3c	net/ionic: overhaul Rx for performance Linearize RX mbuf chains in the expanded info array. Clean one and fill one per CQE (completions are not coalesced). Touch the mbufs as little as possible in the fill stage. When touching the mbuf in the clean stage, use the rearm_data unions. Ring the doorbell once at the end of the bulk clean/fill. Signed-off-by: Neel Patel <neel.patel@amd.com> Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:26 +02:00
Andrew Boyer	7b2eb674e1	net/ionic: do one-time init of Rx descriptors These fields don't need to be set in the hot path. This improves performance. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	c35f08f75e	net/ionic: use helper variable in Tx This improves readability. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	d5850081d2	net/ionic: precalculate segment lengths on Rx side The first (header) segment includes the standard headroom. Subsequent segments do not. Store the fragment counts in the queue structure. Precalculating improves performance by reducing how much work must be done in the hot path. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	e7222f947e	net/ionic: free all buffers during Rx queue stop Free all of the mbufs in the receive queue when the queue is stopped. This will allow them to be resized when the MTU is changed. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	14f534be40	net/ionic: replace void pointer with actual type This makes the code safer by helping the compiler catch errors. Rename the variables, too; they're not callbacks anymore. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	0a00bdaf0b	net/ionic: request notifyq interrupt only if supported Enable the interrupt if the platform & device support it. This prevents spurious interrupts on virtual platforms. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	8eaafff38f	net/ionic: move PCI-specific code to separate file For future support of virtual devices, move the PCI code to its own file. Create a new device interface, struct ionic_dev_intf, to plug in to common code. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com> Signed-off-by: Neel Patel <neel.patel@amd.com> Signed-off-by: R Mohamed Shah <mohamedshah.r@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	e670eedcc0	net/ionic: allocate interrupts only if required There is no need to allocate the interrupt vector list if datapath packet interrupts are not enabled. This conserves resources. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	2233544b59	net/ionic: remove unused identifiers These bits are not used. Remove them to simplify the code. Fix the spacing on the IONIC_ALIGN #define. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	84cf25bd7a	net/ionic: remove doorbell map helper There is only one doorbell page in DPDK configurations, so no helper function is needed. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	b671e69ae4	net/ionic: update MTU calculations Test min and max MTU against values read from firmware, for correctness. Update the firmware field name, for clarity. The device must be stopped before changing MTU, for correctness. Store the calculated frame size in the queue, for performance. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com> Signed-off-by: R Mohamed Shah <mohamedshah.r@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	766687540c	net/ionic: update license terms to remove GPL Remove GPL2 and leave only BSD-3-Clause. This is more in line with the norms of the DPDK community. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	a5205992d9	net/ionic: update documentation and copyrights Pensando Systems has been acquired by AMD. Update all copyright strings and email addresses. Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	26cc5dc29a	net/ionic: fix reported error stats Report descriptor errors in ierrors instead of imissed. Don't report rx_queue_empty or rx_queue_disabled in imissed, since those packet errors are already included in the rx_*_drop_packets counters. This makes the reported stats correct. Fixes: `3cdfd90579` ("net/ionic: add stats") Cc: stable@dpdk.org Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	5ab2983c80	net/ionic: fix Rx filter save Check the match variable after copying cmd info, or else there can be unexpected results. Fixes: `a27e0e96ab` ("net/ionic: observe endianness in Rx filter") Cc: stable@dpdk.org Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	d8fad46a3f	net/ionic: fix adapter name for logging Otherwise the log messages will be garbled. Fixes: `4ae96cb88f` ("net/ionic: do minor logging fixups") Cc: stable@dpdk.org Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	7506961a48	net/ionic: fix endianness for RSS This field needs to be LE when talking to the FW. Fixes: `22e7171bc6` ("net/ionic: support RSS") Cc: stable@dpdk.org Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Andrew Boyer	4a73559990	net/ionic: fix endianness for Rx and Tx These fields all need to be LE when talking to the FW. Fixes: `a27d901331` ("net/ionic: add Rx and Tx handling") Cc: stable@dpdk.org Signed-off-by: Andrew Boyer <andrew.boyer@amd.com>	2022-10-18 23:38:25 +02:00
Ivan Malov	d03c0e83cc	net/bonding: fix descriptor limit reporting Commit `5be3b40fea` ("net/bonding: fix values of descriptor limits") breaks reporting of "nb_min" and "nb_align" values obtained from back-end devices' descriptor limits. This means that work done by eth_bond_slave_inherit_desc_lim_first() as well as eth_bond_slave_inherit_desc_lim_next() gets dismissed. Revert the offending commit and use proper workaround for the test case mentioned in the said commit. Meanwhile, the test case itself might be poorly constructed. It tries to run a bond with no back-end devices attached, but, according to [1] ("Requirements / Limitations"), at least one back-end device must be attached. [1] doc/guides/prog_guide/link_bonding_poll_mode_drv_lib.rst Fixes: `5be3b40fea` ("net/bonding: fix values of descriptor limits") Cc: stable@dpdk.org Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Chas Williams <3chas3@gmail.com>	2022-10-17 16:06:06 +02:00
Chaoyong He	c55abf6141	net/nfp: support RSS on VXLAN inner layer Add the logics to support the feature of RSS packets based on the inner layer of VXLAN tunnel. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-17 14:11:39 +02:00
Usman Tanveer	f66323717e	net/bonding: fix mode 4 with dedicated queues When dedicated queues are enable with bonding mode 4 (mlx5), the application sets the flow, which cannot be set if the device is not started. This fixed the issue by starting the device just before setting the flow. Because device should be started to set the flow. Also it does not effect other driver codes (I have tried on ixgbe). Bugzilla ID: 759 Signed-off-by: Usman Tanveer <usman.tanveer@emumba.com> Acked-by: Chas Williams <3chas3@gmail.com>	2022-10-17 10:26:17 +02:00
Huisong Li	e5f185512d	net/bonding: add link speeds configuration This patch adds link speeds configuration. Signed-off-by: Huisong Li <lihuisong@huawei.com> Acked-by: Chas Williams <3chas3@gmail.com>	2022-10-17 10:20:30 +02:00
Kalesh AP	e11052f3a4	net/bnxt: support proactive error handling mode Signed-off-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com> Signed-off-by: Somnath Kotur <somnath.kotur@broadcom.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2022-10-17 08:34:05 +02:00
Chengwen Feng	53688fc909	net/hns3: support proactive error handling mode Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Acked-by: Dongdong Liu <liudongdong3@huawei.com>	2022-10-17 08:34:05 +02:00
Chengwen Feng	0d5c38bac7	ethdev: add error handling mode to device info Currently, the defined error handling modes include: 1) NONE: it means no error handling modes are supported by this port. 2) PASSIVE: passive error handling, after the PMD detect that a reset is required, the PMD reports RTE_ETH_EVENT_INTR_RESET event, and application invoke rte_eth_dev_reset() to recover the port. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2022-10-17 08:26:36 +02:00
Thomas Monjalon	df1982df8b	net/mana: fix dependencies The reason for not building is updated to be consistent with other drivers. The libibverbs was not detected through pkg-config. The method dependency() needs to be used first. The support in rdma-core and Linux is not released yet, so the documentation is updated. Fixes: `517ed6e2d5` ("net/mana: add basic driver with build environment") Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2022-10-11 01:34:07 +02:00
Yuan Wang	629dad3ef3	net/ice: support buffer split in scalar Rx Add support for protocol based buffer split in normal Rx data paths. When the Rx queue is configured with specific protocol type, packets received will be directly split into protocol header and payload parts. And the two parts will be put into different mempools. Currently, protocol based buffer split is not supported in vectorized paths. A new API ice_buffer_split_supported_hdr_ptypes_get() has been introduced, it will return the supported header protocols of ice PMD to app for splitting. Signed-off-by: Yuan Wang <yuanx.wang@intel.com> Signed-off-by: Xuan Ding <xuan.ding@intel.com> Signed-off-by: Wenxuan Wu <wenxuanx.wu@intel.com>	2022-10-09 16:41:49 +02:00
Jie Wang	336af7822e	net/iavf: fix memory leak in flow subscription When creating flow subscription pattern that it might cause a memory leak. This patch fix the error by adding a free memory code. And some typos have also been fixed. Coverity issue: 381130 Fixes: `6d42380e59` ("net/iavf: add flow subscrption supported pattern") Fixes: `7b902af499` ("net/iavf: support flow subscription rule") Signed-off-by: Jie Wang <jie1x.wang@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2022-10-09 15:38:37 +02:00
Jun Qiu	8168328854	net/bonding: fix Tx hash for TCP In the following two cases, tcp_hdr + sizeof(*tcp_hdr) == pkt_end, and the TCP port is not taken into account in calculating the HASH value of TCP packets. TCP connections with the same source and destination IP addresses will be hashed to the same slave port, which may cause load imbalance. 1. TCP Pure ACK packets with no options, The header length is 20 and there is no data. 2. A TCP packet contains data, but the first seg of the mbuf contains only the header information (ETH, IP, TCP), and the data is in subsequent segs, which is usually the case in the indirect mbuf used for zero-copy. Fixes: `726158060d` ("net/bonding: fix potential out of bounds read") Cc: stable@dpdk.org Signed-off-by: Jun Qiu <jun.qiu@jaguarmicro.com> Acked-by: Min Hu (Connor) <humin29@huawei.com>	2022-10-09 19:44:19 +02:00
Shijith Thotton	b1ae367ab8	drivers: mark SW PMDs to support disabling IOVA as PA Enabled software PMDs in IOVA as PA disabled build as they work with IOVA as VA. Signed-off-by: Shijith Thotton <sthotton@marvell.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2022-10-09 13:14:57 +02:00
Shijith Thotton	6771216c2f	drivers: mark cnxk to support disabling IOVA as PA Enabled the flag pmd_supports_disable_iova_as_pa in cnxk driver build files as they work with IOVA as VA. Updated cn9k and cn10k soc build configurations to disable the IOVA as PA build by default. Signed-off-by: Shijith Thotton <sthotton@marvell.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2022-10-09 13:14:57 +02:00
Shijith Thotton	a986c2b797	build: add option to configure IOVA mode as PA IOVA mode in DPDK is either PA or VA. The new build option enable_iova_as_pa configures the mode to PA at compile time. By default, this option is enabled. If the option is disabled, only drivers which support it are enabled. Supported driver can set the flag pmd_supports_disable_iova_as_pa in its build file. mbuf structure holds the physical (PA) and virtual address (VA). If IOVA as PA is disabled at compile time, PA field (buf_iova) of mbuf is redundant as it is the same as VA and is replaced by a dummy field. Signed-off-by: Shijith Thotton <sthotton@marvell.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2022-10-09 13:14:52 +02:00
Hanumanth Pothula	4daf12f442	net/cnxk: support mulitiple mbuf pools per Rx queue Presently, HW is programmed only to receive packets from LPB pool. Making all packets received from LPB pool. But, CNXK HW supports two pools, - SPB -> packets with smaller size (less than 4K) - LPB -> packets with bigger size (greater than 4K) Patch enables multiple mempool capability, pool is selected based on the packet's length. So, basically, PMD programs HW for receiving packets from both SPB and LPB pools based on the packet's length. This is achieved by enabling rx multiple mempool offload, RTE_ETH_RX_OFFLOAD_MUL_MEMPOOL. This allows the application to send more than one pool(in our case two) to the driver, with different segment(packet) lengths, which helps the driver to configure both pools based on segment lengths. This is often useful for saving the memory where the application can create a different pool to steer the specific size of the packet, thus enabling effective use of memory. Signed-off-by: Hanumanth Pothula <hpothula@marvell.com>	2022-10-08 22:37:45 +02:00
Dariusz Sosnowski	f2d43ff54d	net/mlx5: allow hairpin Rx queue in locked memory This patch adds a capability to place hairpin Rx queue in locked device memory. This capability is equivalent to storing hairpin RQ's data buffers in locked internal device memory. Hairpin Rx queue creation is extended with requesting that RQ is allocated in locked internal device memory. If allocation fails and force_memory hairpin configuration is set, then hairpin queue creation (and, as a result, device start) fails. If force_memory is unset, then PMD will fallback to allocating memory for hairpin RQ in unlocked internal device memory. To allow such allocation, the user must set HAIRPIN_DATA_BUFFER_LOCK flag in FW using mlxconfig tool. Signed-off-by: Dariusz Sosnowski <dsosnowski@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-08 18:30:50 +02:00
Dariusz Sosnowski	7274b41756	net/mlx5: allow hairpin Tx queue in host memory This patch adds a capability to place hairpin Tx queue in host memory managed by DPDK. This capability is equivalent to storing hairpin SQ's WQ buffer in host memory. Hairpin Tx queue creation is extended with allocating a memory buffer of proper size (calculated from required number of packets and WQE BB size advertised in HCA capabilities). force_memory flag of hairpin queue configuration is also supported. If it is set and: - allocation of memory buffer fails, - or hairpin SQ creation fails, then device start will fail. If it is unset, PMD will fallback to creating the hairpin SQ with WQ buffer located in unlocked device memory. Signed-off-by: Dariusz Sosnowski <dsosnowski@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-10-08 18:30:50 +02:00
Zhichao Zeng	70888c61d9	net/iavf: fix tunnel TSO offload This patch is to fix the tunnel TSO not enabling issue, simplify the logic of calculating 'Tx Buffer Size' of data descriptor with IPSec, and fix handling that the mbuf size exceeds the TX descriptor hardware limit(1B-16KB) which causes malicious behavior to the NIC. Fixes: `1e728b0112` ("net/iavf: rework Tx path") Signed-off-by: Zhichao Zeng <zhichaox.zeng@intel.com> Tested-by: Ke Xu <ke1.xu@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2022-10-08 18:04:46 +02:00
Qi Zhang	27b7bdae1d	net/ice: fix DDP package init ICE_DDP_PKG_SAME_VERSION_ALREADY_LOADED and ICE_DDP_PKG_COMPATIBLE_ALREADY_LOADED should not be treated as a DDP package init failure. Use ice_is_init_pkg_successful to check return value of ice_copy_and_init_pkg. Signed-off-by: Qi Zhang <qi.z.zhang@intel.com> Tested-by: Zhimin Huang <zhiminx.huang@intel.com>	2022-10-08 16:23:14 +02:00
Ciara Loftus	1eb1846b1a	net/af_xdp: make compatible with libbpf 0.8.0 libbpf v0.8.0 deprecates the bpf_get_link_xdp_id() and bpf_set_link_xdp_fd() functions. Use meson to detect if bpf_xdp_attach() is available and if so, use the recommended replacement functions bpf_xdp_query_id(), bpf_xdp_attach() and bpf_xdp_detach(). Signed-off-by: Ciara Loftus <ciara.loftus@intel.com> Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@xilinx.com>	2022-10-07 19:37:24 +02:00
Andrew Rybchenko	5ff3dbe6ce	net/af_xdp: add log on XDP program removal failures Make it visible in logs if something goes wrong on XDP program removal failure. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@xilinx.com>	2022-10-07 19:37:24 +02:00
Andrew Rybchenko	0ed0bc3834	net/af_xdp: avoid version-based check for program load Version-based checks are bad. It is better to check for required functions. Check for bpf_object__next_program() in this case since it appears last in libbpf among functions used to load program without bpf_prog_load() which is deprecated in libbpf v0.7.0. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@xilinx.com>	2022-10-07 19:37:24 +02:00
Andrew Rybchenko	e024c7e838	net/af_xdp: avoid version-based check for shared UMEM Check for xsk_socket__create_shared() function instead. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@xilinx.com>	2022-10-07 19:37:24 +02:00
Andrew Rybchenko	f76dc44ded	net/af_xdp: make clear which libxdp version is required Include checked libxdp version in driver build skip reason. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@xilinx.com>	2022-10-07 19:37:24 +02:00
Andrew Rybchenko	50b855fc47	net/af_xdp: move XDP library presence flag setting RTE_NET_AF_XDP_LIBXDP is a conditional to include xdp/xsk.h and should be set as soon as we know that the header is present. RTE_NET_AF_XDP_SHARED_UMEM is one of conditions to use xsk_socket__create_shared(). Both do not depend on libbpf and bpf/bpf.h presence. Since else branch below returns error, there is no functional changes, just style which will help on further rework. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@xilinx.com>	2022-10-07 19:37:24 +02:00
Gagandeep Singh	0bf99a02cc	net/dpaa: fix buffer freeing in slow path If there is any error in packet or taildrop feature is enabled, HW can reject those packets and put them in error queue. Driver poll this error queue to free the buffers. DPAA driver has an issue while freeing these rejected buffers. In case of scatter gather packets, it is preparing the mbuf SG list by scanning the HW descriptors and once the mbuf SG list prepared, it free only first segment of the mbuf SG list by calling the API rte_pktmbuf_free_seg(), This will leak the memory of other segments and mempool can be empty. Also there is one more issue, external buffer's memory may not belong to mempool so driver itself free the external buffer after successfully send the packet to HW to transmit instead of let the HW to free it. So transmit function free all the external buffers. But driver has no check for external buffers while freeing the rejected buffers and this can do double free the memory which can corrupt the user pool and crashes and undefined behaviour of system can be seen. This patch fixes the above mentioned issue by checking each and every segment and freeing all the segments except external. Fixes: `9124e65dd3` ("net/dpaa: enable Tx queue taildrop") Cc: stable@dpdk.org Signed-off-by: Gagandeep Singh <g.singh@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Gagandeep Singh	8716c0ec06	net/dpaa: fix buffer freeing on SG Tx When using SG list to TX with external and direct buffers, HW free direct buffers and driver free external buffers. Software scans the complete SG mbuf list to find the external buffers to free, but this is wrong as hardware can free the direct buffers if any present in the list and same can be re-allocated for other purpose in multi thread or high speed running traffic environment with new data in it. So the software which is scanning the SG mbuf list, if that list has any direct buffer present then that direct buffer's next pointer can give wrong pointer value, if already freed by hardware which can do the mempool corruption or memory leak. In this patch instead of relying on user given SG mbuf list we are storing the buffers in an internal list which will be scanned by driver after transmit to free non-direct buffers. This patch also fixes below issues. Driver is freeing complete SG list by checking external buffer flag in first segment only, but external buffer can be attached to any of the segment. Because of this, driver either can double free buffers or there can be memory leak. In case of indirect buffers, driver is modifying the original buffer list to free the indirect buffers but this original buffer list is being used by driver even after transmit packets for non-direct buffer cleanup. This can cause the buffer leak issue. Fixes: `f191d5abda` ("net/dpaa: support external buffers in Tx") Cc: stable@dpdk.org Signed-off-by: Gagandeep Singh <g.singh@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Gagandeep Singh	533c31cc83	net/dpaa: use internal mempool for SG table Creating and using driver's mempool for allocating the SG table memory required for FD creation. Signed-off-by: Gagandeep Singh <g.singh@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Rohit Raj	65afdda04b	net/dpaa: fix jumbo packet Rx in case of VSP For packet length of size more than 2K bytes, segmented packets were being received in DPDK even if mbuf size was greater than packet length. This is due to the configuration in VSP. This patch fixes the issue by configuring the VSP according to the mbuf size configured during mempool configuration. Fixes: `e4abd4ff18` ("net/dpaa: support virtual storage profile") Cc: stable@dpdk.org Signed-off-by: Rohit Raj <rohit.raj@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Gagandeep Singh	b0074a7ba1	net/dpaa2: fix buffer freeing on SG Tx When using SG list to TX with external and direct buffers, HW free the direct buffers and driver free the external buffers. Software scans the complete SG mbuf list to find the external buffers to free, but this is wrong as hardware can free the direct buffers if any present in the list and same can be re-allocated for other purpose in multi thread or high speed running traffic environment with new data in it. So the software which is scanning the SG mbuf list, if that list has any direct buffer present then that direct buffer's next pointer can give wrong pointer value, if already freed by hardware which can do the mempool corruption or memory leak. In this patch instead of relying on user given SG mbuf list we are storing the buffers in an internal list which will be scanned by driver after transmit to free non-direct buffers. This patch also fixes 2 more memory leak issues. Driver is freeing complete SG list by checking external buffer flag in first segment only, but external buffer can be attached to any of the segment. Because of which driver either can double free buffers or there can be memory leak. In case of indirect buffers, driver is modifying the original buffer list to free the indirect buffers but this original buffer list is being used even after transmit packets for software buffer cleanup. This can cause the buffer leak issue. Fixes: `6bfbafe18d` ("net/dpaa2: support external buffers in Tx") Cc: stable@dpdk.org Signed-off-by: Gagandeep Singh <g.singh@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Gagandeep Singh	75e2a1d473	net/dpaa2: use internal mempool for SG table Creating and using driver's mempool for allocating the SG table memory required for FD creation instead of relying on user mempool. Signed-off-by: Gagandeep Singh <g.singh@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Gagandeep Singh	e7524271c3	net/dpaa: support ESP type in packet parsing Add support of ESP packet type in packet receive path. Signed-off-by: Gagandeep Singh <g.singh@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Brick Yang	fb2790a535	net/dpaa2: check free enqueue descriptors before Tx Check if there exists free enqueue descriptors before enqueuing Tx packet. Also try to free enqueue descriptors in case they are not free. Fixes: `ed1cdbed6a` ("net/dpaa2: support multiple Tx queues enqueue for ordered") Cc: stable@dpdk.org Signed-off-by: Brick Yang <brick.yang@nxp.com> Signed-off-by: Rohit Raj <rohit.raj@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Vanshika Shukla	e45956ce18	net/dpaa2: fix DPDMUX error behaviour Driver is giving the wrong interface ID while setting the error behaviour. This patch fixes the issue by passing the correct MAC interface index value to the API. Fixes: `3d43972b1b` ("net/dpaa2: do not drop parse error packets by dpdmux") Cc: stable@dpdk.org Signed-off-by: Vanshika Shukla <vanshika.shukla@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Apeksha Gupta	fe10f6cc1c	net/enetfec: fix buffer leak Driver has no proper handling to free unused allocated mbufs in case of error or when the rx processing complete because of which mempool can be empty after some time. This patch fixes this issue by moving the buffer allocation code to the right place in driver. Fixes: `ecae71571b` ("net/enetfec: support Rx/Tx") Cc: stable@dpdk.org Signed-off-by: Apeksha Gupta <apeksha.gupta@nxp.com> Signed-off-by: Sachin Saxena <sachin.saxena@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:03 +02:00
Apeksha Gupta	d64e9cfe97	net/enetfec: fix restart Queue reset is missing in restart because of which IO cannot work on device restart. This patch fixes the issue by resetting the queues on device restart. Fixes: `b84fdd3963` ("net/enetfec: support UIO") Cc: stable@dpdk.org Signed-off-by: Apeksha Gupta <apeksha.gupta@nxp.com> Signed-off-by: Sachin Saxena <sachin.saxena@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:02 +02:00
Vanshika Shukla	05500852af	bus/dpaa: open QMAN interrupt file as non-blocking This patch sets qman portal file descriptors used for interrupts IO processing in non-blocking mode to avoid any unwanted blocks while IO operations over the FD. Signed-off-by: Vanshika Shukla <vanshika.shukla@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2022-10-07 17:19:02 +02:00
Dongdong Liu	fbb7a43a36	net/hns3: support Rx/Tx descriptor dump This patch support query HW descriptor from hns3 device. HW descriptor is also called BD (buffer description) which is shared memory between software and hardware. Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Signed-off-by: Dongdong Liu <liudongdong3@huawei.com> Acked-by: Ferruh Yigit <ferruh.yigit@xilinx.com>	2022-10-06 18:38:48 +02:00
Long Li	afd5d17072	net/mana: support Rx interrupts mana can receive Rx interrupts from kernel through RDMA verbs interface. Implement Rx interrupts in the driver. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	e350b56889	net/mana: report queue statistics Report packet statistics. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	37544700e1	net/mana: start/stop device Add support for starting/stopping the device. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	7f322844f4	net/mana: send packets With all the TX queues created, MANA can send packets over those queues. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	eb9994dd16	net/mana: receive packets With all the RX queues created, MANA can use those queues to receive packets. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	5f705ac262	net/mana: start/stop Rx queues MANA allocates device queues through the IB layer when starting Rx queues. When device is stopped all the queues are unmapped and freed. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	410333da8b	net/mana: start/stop Tx queues MANA allocate device queues through the IB layer when starting Tx queues. When device is stopped all the queues are unmapped and freed. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	56dd45c0ce	net/mana: implement hardware layer operations The hardware layer of MANA understands the device queue and doorbell formats. Those functions are implemented for use by packet RX/TX code. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	0f5db3c68b	net/mana: implement memory registration MANA hardware has iommu built-in, that provides hardware safe access to user memory through memory registration. Since memory registration is an expensive operation, this patch implements a two level memory registration cache mechanisum for each queue and for each port. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	f7dc479a13	net/mana: configure Tx queues Tx hardware queue is allocated when starting the queue, this is for pre configuration. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	0c63c005f6	net/mana: configure Rx queues Rx hardware queue is allocated when starting the queue. This function is for queue configuration pre starting. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	a382177c13	net/mana: configure RSS Currently this PMD supports RSS configuration when the device is stopped. Configuring RSS in running state will be supported in the future. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	d878cb0950	net/mana: report device info Add the function to get device info. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	bd15f237f2	net/mana: support device removal interrupts MANA supports PCI hot plug events. Add this interrupt to DPDK core so its parent PMD can detect device removal during Azure servicing or live migration. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	21958568c4	net/mana: support link update The carrier state is managed by the Azure host. MANA runs as a VF and always reports "up". Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	d9679c3a85	net/mana: report supported packet types Report supported protocol types. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	0dbfecfe23	net/mana: add device configuration and stop MANA defines its memory allocation functions to override IB layer default functions to allocate device queues. This patch adds the code for device configuration and stop. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Li	517ed6e2d5	net/mana: add basic driver with build environment MANA is a PCI device. It uses IB verbs to access hardware through the kernel RDMA layer. This patch introduces build environment and basic device probe functions. Signed-off-by: Long Li <longli@microsoft.com>	2022-10-06 10:49:00 +02:00
Long Wu	bb340f56fc	net/nfp: fix memory leak in Rx nfp_net_recv_pkts() should not return a value that less than 0 and the inappropriate return value in receive loop also causes the memory leak. Modify code to avoid return a value less than 0. Furthermore, When nfp_net_recv_pkts() break out from the receive loop because of packet problems, a rte_mbuf will not be freed and it will cause memory leak. Free the rte_mbuf before break out. Fixes: `b812daadad` ("nfp: add Rx and Tx") Cc: stable@dpdk.org Signed-off-by: Long Wu <long.wu@corigine.com> Signed-off-by: Niklas Söderlund <niklas.soderlund@corigine.com> Reviewed-by: Chaoyong He <chaoyong.he@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	ad83f5ee71	net/nfp: add representor port Rx/Tx For the Rx logic, the representor port decap packet from the corresponding ring. For the Tx logic, the representor port prepend the metadata into packet, and send to firmware through the queue 0 of pf vNIC. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	cf559c2a1d	net/nfp: add flower PF Rx/Tx For the Rx logic, fallback packets are multiplexed to the correct representor port based on the prepended metadata. For the Tx logic, because fallback packets are prepended metadata, the start of the packet has to be adjusted for in the Tx descriptor. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	a5f377d8f5	net/nfp: move Rx/Tx functions to header file The flower firmware application makes use of the same Rx and Tx checksum logic as the normal PMD. Expose it so that flower firmware application also can make use of it. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Signed-off-by: Heinrich Kuhn <heinrich.kuhn@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	831c44ab78	net/nfp: add flower PF related routines Adds the start/stop/close routine of the flower PF vNIC. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	e1124c4f8a	net/nfp: add flower representor framework Adds the framework to support flower representors. The number of VF representors are parsed from the command line. For physical port representors the current logic aims to create a representor for each physical port present on the hardware. An eth_dev is created for each physical port and VF, and flower firmware requires a MAC repr cmsg to be transmitted to firmware with info about the number of physical ports configured. Reify messages are sent to hardware for each physical port representor. An rte_ring is also created per representor so that traffic can be pushed and pulled to this interface. To up and down the real device represented by a flower representor port a port mod message is used to convey that info to the firmware. This message will be used in the dev_ops callbacks of flower representors. Each cmsg generated by the driver is prepended with a cmsg header. This commit also adds the logic to fill in the header of cmsgs. Also add the Rx and Tx path for flower representors. For Rx packets are dequeued from the representor ring and passed to the eth_dev. For Tx the first queue of the PF vNIC is used. Metadata about the representor is added before the packet is sent down to firmware. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	a36634e87e	net/nfp: add flower ctrl VNIC Rx/Tx Adds the Rx and Tx function for the ctrl VNIC. The logic is mostly identical to the normal Rx and Tx functionality of the NFP PMD. Make use of the ctrl VNIC service logic to service the ctrl vNIC Rx path. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Signed-off-by: Heinrich Kuhn <heinrich.kuhn@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	d981d315ac	net/nfp: move common Rx/Tx functions for flower use Move some common Rx and Tx logic to the header file so that they can be re-used by flower Tx and Rx logic. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Signed-off-by: Heinrich Kuhn <heinrich.kuhn@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	945441ebdb	net/nfp: add flower ctrl VNIC Adds the setup/start logic for the ctrl vNIC. This vNIC is used by the PMD and flower firmware application as a communication channel between driver and firmware. In the case of OVS it is also used to communicate flow statistics from hardware to the driver. A rte_eth device is not exposed to DPDK for this vNIC as it is strictly used internally by flower logic. Because of the add of ctrl vNIC, a new PCItoCPPBar is needed. Modify the related logics. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	5e13debf44	net/nfp: add flower PF setup Adds the vNIC initialization logic for the flower PF vNIC. The flower firmware application exposes this vNIC for the purposes of fallback traffic in the switchdev use-case. Adds minimal dev_ops for this PF vNIC device. Because the device is being exposed externally to DPDK it needs to implements a minimal set of dev_ops. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00
Chaoyong He	b188042195	net/nfp: add initial flower firmware support Adds the basic probing infrastructure to support the flower firmware application. Adds the cpp service, used for some user tools. Signed-off-by: Chaoyong He <chaoyong.he@corigine.com> Signed-off-by: Heinrich Kuhn <heinrich.kuhn@corigine.com> Reviewed-by: Niklas Söderlund <niklas.soderlund@corigine.com>	2022-10-05 18:09:21 +02:00

... 2 3 4 5 6 ...

15021 Commits