numam-dpdk

Author	SHA1	Message	Date
Harman Kalra	d61138d4f0	drivers: remove direct access to interrupt handle Removing direct access to interrupt handle structure fields, rather use respective get set APIs for the same. Making changes to all the drivers access the interrupt handle fields. Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com> Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Raslan Darawsheh <rasland@nvidia.com>	2021-10-25 21:20:12 +02:00
Honnappa Nagarahalli	f6c6c686f1	eal: remove FINISHED lcore state FINISHED state seems to be used to indicate that the worker's update of the 'state' is not visible to other threads. There seems to be no requirement to have such a state. Since the FINISHED state is removed, the API rte_eal_wait_lcore is updated to always return the status of the last function that ran in the worker core. Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Reviewed-by: Ola Liljedahl <ola.liljedahl@arm.com> Reviewed-by: Feifei Wang <feifei.wang2@arm.com>	2021-10-25 18:20:59 +02:00
Olivier Matz	daa02b5cdd	mbuf: add namespace to offload flags Fix the mbuf offload flags namespace by adding an RTE_ prefix to the name. The old flags remain usable, but a deprecation warning is issued at compilation. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-24 13:37:43 +02:00
Olivier Matz	5b63493241	mbuf: mark old VLAN offload flags as deprecated The flags PKT_TX_VLAN_PKT and PKT_TX_QINQ_PKT are marked as deprecated since commit `380a7aab1a` ("mbuf: rename deprecated VLAN flags") (2017). But they were not using the RTE_DEPRECATED macro, because it did not exist at this time. Add it, and replace usage of these flags. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-24 13:30:40 +02:00
Raja Zidane	2efd265445	compress/mlx5: support partial transformation Currently compress, decompress and dma are allowed only when all 3 capabilities are on. A case where the user wants decompress offload, if decompress capability is on but one of compress, dma is off, is not allowed. Split compress/decompress/dma support check to allow partial transformations. Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-20 16:01:45 +02:00
Anoob Joseph	fd390896f4	crypto/cnxk: allow different cores in pending queue Rework pending queue to allow producer and consumer cores to be different. Signed-off-by: Anoob Joseph <anoobj@marvell.com>	2021-10-20 15:56:46 +02:00
Anoob Joseph	a455fd869c	common/cnxk: align CPT queue depth to power of 2 Use CPT LF queue depth as power of 2 to aid in masked checks for pending queue. Signed-off-by: Anoob Joseph <anoobj@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-20 15:56:46 +02:00
Akhil Goyal	92cb130919	cryptodev: move device-specific structures The device specific structures - rte_cryptodev and rte_cryptodev_data are moved to cryptodev_pmd.h to hide it from the applications. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Tested-by: Rebecca Troy <rebecca.troy@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-10-20 15:33:16 +02:00
Akhil Goyal	d54c72ec15	drivers/crypto: invoke probing finish function Invoke event_dev_probing_finish() function at the end of probing, this function sets the function pointers in the fp_ops flat array in case of secondary process. For primary process, fp_ops is updated in rte_cryptodev_start(). Signed-off-by: Akhil Goyal <gakhil@marvell.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-20 15:33:16 +02:00
Akhil Goyal	beb4c305b3	crypto/scheduler: use proper API for device start/stop The worker PMDs were using direct device start/stop functions rather than rte_cryptodev_start(), so rte_crypto_fp_ops never get set. This patch calls the rte_cryptodev_start and stop APIs which start and stop devices properly and fp_ops get set. Reported-by: Ciara Power <ciara.power@intel.com> Signed-off-by: Akhil Goyal <gakhil@marvell.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-10-20 15:33:16 +02:00
Kai Ji	f166628854	crypto/ipsec_mb: add chacha_poly PMD Add in new chacha20_poly1305 PMD to the ipsec_mb framework. Signed-off-by: Kai Ji <kai.ji@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 15:33:16 +02:00
Piotr Bronowski	cde8df1bda	crypto/ipsec_mb: move zuc PMD This patch removes the crypto/zuc folder and gathers all zuc PMD implementation specific details into two files, pmd_zuc.c and pmd_zuc_priv.h in the crypto/ipsec_mb folder. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 15:32:36 +02:00
Piotr Bronowski	5208d68d30	crypto/ipsec_mb: support snow3g digest appended ops This patch enables out-of-place auth-cipher operations where digest should be encrypted along with the rest of raw data. It also adds support for partially encrypted digest when using auth-cipher operations. Signed-off-by: Damian Nowak <damianx.nowak@intel.com> Signed-off-by: Kai Ji <kai.ji@intel.com> Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	4f1cfda59a	crypto/ipsec_mb: move snow3g PMD This patch removes the crypto/snow3g folder and gathers all snow3g PMD implementation specific details into a single file, pmd_snow3g.c in the crypto/ipsec_mb folder. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	bc9ef81c42	crypto/ipsec_mb: move kasumi PMD This patch removes the crypto/kasumi folder and gathers all kasumi PMD implementation specific details into a single file, pmd_kasumi.c in the crypto/ipsec_mb folder. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	746825e5c0	crypto/ipsec_mb: move aesni_gcm PMD This patch removes the crypto/aesni_gcm folder and gathers all aesni-gcm PMD implementation specific details into a single file, pmd_aesni_gcm.c in the crypto/ipsec_mb folder. A redundant check for iv length is removed. GCM ops are stored in the queue pair for multi process support, they are updated during queue pair setup for both primary and secondary processes. GCM ops are also set per lcore for the CPU crypto mode. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Pablo de Lara	8c835018de	crypto/ipsec_mb: support ZUC-256 for aesni_mb Add support for ZUC-EEA3-256 and ZUC-EIA3-256. Only 4-byte tags supported for now. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Piotr Bronowski	918fd2f146	crypto/ipsec_mb: move aesni_mb PMD This patch removes the crypto/aesni_mb folder and gathers all aesni-mb PMD implementation specific details into a single file, pmd_aesni_mb.c in crypto/ipsec_mb. Now that intel-ipsec-mb v1.0 is the minimum supported version, old macros can be replaced with the newer macros supported by this version. Signed-off-by: Piotr Bronowski <piotrx.bronowski@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Ciara Power	72a169278a	crypto/ipsec_mb: support multi-process The ipsec_mb SW PMD now has multiprocess support. The queue-pair IMB_MGR is stored in a memzone instead of being allocated externally by the Intel IPSec MB library, when v1.1 is used. If v1.0 is used, multi process is not supported, and allocation is done as before. The secondary process needs to reconfigure the queue-pair to allow for IMB_MGR function pointers be updated. Intel IPsec MB library version 1.1 is required for this support. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Fan Zhang	c75542ae42	crypto/ipsec_mb: introduce IPsec_mb framework This patch introduces the new framework to share common code between the SW crypto PMDs that depend on the intel-ipsec-mb library. This change helps to reduce future effort on the code maintenance and feature updates. The PMDs that will be added to this framework in subsequent patches are: - AESNI MB - AESNI GCM - CHACHA20_POLY1305 - KASUMI - SNOW3G - ZUC The use of these PMDs will not change, they will still be supported for x86, and will use the same EAL args as before. The minimum required version for the intel-ipsec-mb library is now v1.0. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-10-20 12:06:01 +02:00
Ferruh Yigit	295968d174	ethdev: add namespace Add 'RTE_ETH' namespace to all enums & macros in a backward compatible way. The macros for backward compatibility can be removed in next LTS. Also updated some struct names to have 'rte_eth' prefix. All internal components switched to using new names. Syntax fixed on lines that this patch touches. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Tyler Retzlaff <roretzla@linux.microsoft.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Wisam Jaddo <wisamm@nvidia.com> Acked-by: Rosen Xu <rosen.xu@intel.com> Acked-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-22 18:15:38 +02:00
Ferruh Yigit	ede6356582	drivers/net: fix removing jumbo offload flag After DEV_RX_OFFLOAD_JUMBO_FRAME flag removed, drivers give jumbo frame decisions based on MTU value checks, but some of the checks were wrong by mistake, causing device initialization to fail, fixing them. Fixes: `b563c14212` ("ethdev: remove jumbo offload flag") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Yu Jiang <yux.jiang@intel.com>	2021-10-22 17:44:18 +02:00
Ciara Loftus	985e7673c0	net/af_xdp: fix max Rx packet length Commit `1bb4a528c4` ("ethdev: fix max Rx packet length") clarified the expected usage of the max_rx_pktlen and max_mtu values and implemented some extra checks on these values to ensure they are sane. After this, the AF_XDP PMD fails to initialise. The value for max_rx_pktlen which represents the max size of the Ethernet frame was set to ETH_FRAME_LEN (1514) and the max_mtu which represents the size of the payload was set to the max size of the Ethernet frame. This did not make sense, as naturally the maximum frame size should be greater than the payload size. Fix this by setting the max_rx_pktlen equal to the max size of the Ethernet frame as expected, and the max MTU equal to the max_rx_pktlen less the overhead which is set to the size of an Ethernet header plus CRC. Fixes: `1bb4a528c4` ("ethdev: fix max Rx packet length") Signed-off-by: Ciara Loftus <ciara.loftus@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-22 17:12:50 +02:00
Chengchang Tang	2fc3e696a7	net/hns3: add runtime config for mailbox limit time Current, the max waiting time for MBX response is 500ms, but in some scenarios, it is not enough. Since it depends on the response of the kernel mode driver, and its response time is related to the scheduling of the system. In this special scenario, most of the cores are isolated, and only a few cores are used for system scheduling. When a large number of services are started, the scheduling of the system will be very busy, and the reply of the mbx message will time out, which will cause our PMD initialization to fail. This patch add a runtime config to set the max wait time. For the above scenes, users can adjust the waiting time to a suitable value by themselves. Fixes: `463e748964` ("net/hns3: support mailbox") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-10-22 04:11:43 +02:00
Satheesh Paul	00ea15e7a3	net/cnxk: support port ID flow action This patch adds support for rte flow action type port_id to enable directing packets from an input port PF to an output port which is a VF of the input port PF. Signed-off-by: Satheesh Paul <psatheesh@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 18:59:40 +02:00
Satheesh Paul	15f0b8a5b9	common/cnxk: support port ID action This patch adds ROC API to support flow port ID action type. Signed-off-by: Satheesh Paul <psatheesh@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 18:58:50 +02:00
Xuan Ding	ad6f01945a	net/virtio: fix avail descriptor ID Vhost will update desc’s Buffer ID advance to next used descriptor when VIRTIO_F_IN_ORDER feature negotiated. When virtio reuses the descriptor, the Buffer ID should be restored even VIRTQ_DESC_F_INDIRECT feature negotiated. Fixes: `b473061b0e` ("net/virtio: fix indirect descriptors in packed datapaths") Cc: stable@dpdk.org Signed-off-by: Xuan Ding <xuan.ding@intel.com> Signed-off-by: Yong Liu <yong.liu@intel.com> Signed-off-by: Miao Li <miao.li@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Gaoxiang Liu	028f06e8be	net/vhost: merge stats loop in datapath To improve performance in vhost Tx/Rx, merge vhost stats loop. eth_vhost_tx has 2 loop of send num iteraion. It can be merge into one. eth_vhost_rx has the same issue as Tx. Signed-off-by: Gaoxiang Liu <liugaoxiang@huawei.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Xuan Ding	04bcc80204	net/virtio: fix indirect descriptor reconnection Add initialization for packed ring indirect descriptors in reconnection path. Fixes: `381f39ebb7` ("net/virtio: fix packed ring indirect descricptors setup") Cc: stable@dpdk.org Signed-off-by: Xuan Ding <xuan.ding@intel.com> Tested-by: Yinan Wang <yinan.wang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Ivan Malov	6474b59448	net/virtio: fix Tx checksum for tunnel packets Tx prepare method calls rte_net_intel_cksum_prepare(), which handles tunnel packets correctly, but Tx burst path does not take tunnel presence into account when computing the offsets. Fixes: `58169a9c81` ("net/virtio: support Tx checksum offload") Cc: stable@dpdk.org Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: Olivier Matz <olivier.matz@6wind.com>	2021-10-21 14:24:21 +02:00
Marvin Liu	99ebada2d6	net/virtio: fix oversized packets in vectorized Rx If packed ring size is not power of two, it is possible that remained number less than one batch and meanwhile batch operation can pass. This will cause incorrect remained number calculation and then lead to receiving oversized packets. The patch fixed the issue by added remained number check before batch operation. Fixes: `77d66da838` ("net/virtio: add vectorized packed ring Rx") Cc: stable@dpdk.org Signed-off-by: Marvin Liu <yong.liu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Xueming Li	8011a09add	vdpa/mlx5: retry VAR allocation during vDPA restart VAR is the device memory space for the virtio queues doorbells, Qemu could mmap it to directly to speed up doorbell push. On a busy system, Qemu takes time to release VAR resources during driver shutdown. If vdpa restarted quickly, the VAR allocation failed with error 28 since the VAR is singleton resource per device. This patch adds retry mechanism for VAR allocation. Fixes: `4cae722c1b` ("vdpa/mlx5: move virtual doorbell alloc to probe") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Xueming Li	d38a53b175	vdpa/mlx5: workaround FW first completion in start After a vDPA application restart, Qemu restores VQ with used and available index, new incoming packet triggers virtio driver to handle buffers. Under heavy traffic, no available buffer for firmware to receive new packets, no Rx interrupts generated, driver is stuck on endless interrupt waiting. As a firmware workaround, this patch sends a notification after VQ setup to ask driver handling buffers and filling new buffers. Fixes: `bff7350110` ("vdpa/mlx5: prepare virtio queues") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Matan Azrad <matan@nvidia.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:21 +02:00
Zhihong Peng	84cc857b5d	net/virtio: fix check scatter on all Rx queues This patch fixes the wrong way to obtain virtqueue. The end of virtqueue cannot be judged based on whether the array is NULL. Fixes: `4e8169eb0d` ("net/virtio: fix Rx scatter offload") Cc: stable@dpdk.org Signed-off-by: Zhihong Peng <zhihongx.peng@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-10-21 14:24:13 +02:00
Ting Xu	f5ec6a3a19	net/ice: fix TM hierarchy commit flag reset After DCF commits TM hierarchy configuration, the commit flag is set to avoid duplicated commit. But the flag is not reset after device stop, which prevents the update of hierarchy configuration unless close the device. It is not reasonable. This patch fix to reset the commit flag after device stop. Then users can delete and add nodes to commit a new TM hierarchy configuration. Fixes: `3a6bfc37ea` ("net/ice: support QoS config VF bandwidth in DCF") Cc: stable@dpdk.org Signed-off-by: Ting Xu <ting.xu@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-21 13:32:26 +02:00
William Tu	d1c7029a52	net/e1000: build on Windows This patch enables building the e1000 driver for Windows. I tested using two Windows VM on top of VMware Fusion, creating two e1000 devices with device ID 0x10D3 (8274L), verifying rx/tx works correctly using dpdk-testpmd.exe rxonly and txonly mode. Signed-off-by: William Tu <u9012063@gmail.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Pallavi Kadam <pallavi.kadam@intel.com> Tested-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Pallavi Kadam <pallavi.kadam@intel.com>	2021-10-21 04:58:40 +02:00
Tudor Cornea	2108930be1	net/ixgbe: fix port initialization if MTU config fails On a VMware ESXi 6.0 setup with an Intel 82599 NIC the ports don't seem to initialize anymore, while running testpmd. Configuring Port 0 (socket 0) ixgbevf_dev_rx_init(): Set max packet length to 1518 failed. ixgbevf_dev_start(): Unable to initialize RX hardware (-22) Fail to start port 0: Invalid argument Configuring Port 1 (socket 0) ixgbevf_dev_rx_init(): Set max packet length to 1518 failed. ixgbevf_dev_start(): Unable to initialize RX hardware (-22) Fail to start port 1: Invalid argument Please stop the ports first If the call to ixgbevf_rlpml_set_vf fails and we return prematurely, we will not be able to initialize the ports correctly. The behavior seems to have changed since the following commit: Fixes: `c77866a169` ("net/ixgbe: detect failed VF MTU set") Cc: stable@dpdk.org We can make this particular use case work correctly if we don't return an error, which seems to be consistent with the overall kernel ixgbevf implementation. [1] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/ drivers/net/ethernet/intel/ixgbevf/ixgbevf_main.c?h=v5.14#n2015 Signed-off-by: Tudor Cornea <tudor.cornea@gmail.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com>	2021-10-21 04:56:06 +02:00
Rongwei Liu	a89f6433aa	net/mlx5: set Tx queue affinity in round-robin Previously, we set txq affinity to 0 and let firmware to perform round-robin when bonding. Firmware uses a global counter to assign txq affinity to different physical ports accord to remainder after division. There are three dis-advantages: 1. The global counter is shared between kernel and dpdk. 2. After restarting pmd or port, the previous counter value is reused, so the new affinity is unpredictable. 3. There is no way to get what affinity is set by firmware. In this update, we will create several TISs up to the number of bonding ports and bind each TIS to one PF port. For each port, it will start to pick up TIS using its port index. Upper layer application can quickly calculate each txq's affinity without querying. At DPDK layer, when creating txq with 2 bonding ports, the affinity is set like: port 0: 1-->2-->1-->2 port 1: 2-->1-->2-->1 port 2: 1-->2-->1-->2 Note: Only applicable to DevX api. This affinity subjects to HW hash. Signed-off-by: Rongwei Liu <rongweil@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 12:37:00 +02:00
Rongwei Liu	cf5ac38d51	common/mlx5: add LAG context query Added a new function mlx5_devx_cmd_query_lag() to query LAG property from firmware including state/affinity/mode etc. Signed-off-by: Jiawei Wang <jiaweiw@nvidia.com> Signed-off-by: Rongwei Liu <rongweil@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 12:36:57 +02:00
Dmitry Kozlyuk	ea823b2c51	net/mlx5: close tools socket with last device MLX5 PMD exposes a socket for external tools to dump port state. Socket events are listened using an interrupt source of EXT type. The socket was closed and the interrupt callback was unregistered at program exit, which is incorrect because DPDK could be already shut down at this point. Move actions performed at program exit to the moment the last MLX5 port is closed. The socket will be opened again if later a new MLX5 device is plugged in and probed. Also fix comments that were decisively talking about secondary processes instead of external tools. Fixes: `e6cdc54cc0` ("net/mlx5: add socket server for external tools") Cc: stable@dpdk.org Reported-by: Harman Kalra <hkalra@marvell.com> Signed-off-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2021-10-21 10:31:53 +02:00
Dmitry Kozlyuk	9ec1ceab76	net/mlx5: fix Rx queue resource cleanup mlx5_rxq_start() allocates rxq_ctrl->obj and frees it on failure, but did not set it to NULL. Later mlx5_rxq_release() could not recognize this object is already freed and attempted to release its resources, resulting in a crash: Configuring Port 0 (socket 0) mlx5_common: Failed to create RQ using DevX mlx5_common: Can't create DevX RQ object. mlx5_net: Port 0 Rx queue 0 RQ creation failure. Segmentation fault Set rxq_ctrl->obj to NULL after it is freed to skip resource release. Fixes: `1260a87b28` ("net/mlx5: share Rx control code") Cc: stable@dpdk.org Signed-off-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 09:31:17 +02:00
Bing Zhao	273b09376c	net/mlx5: fix meter yellow policy with RSS action The RSS configuration in a policy action container was a pointer inside a union, and the pointer area could be used as other fate action. In the current implementation, the RSS of the green color was prior to that of the yellow color. There was a high possibility the pointer was considered as the RSS and result in a error flow expansion when only the yellow color had the RSS action. The check of the fate action type should also be done to get rid of the misjudgment. Fixes: `b38a12272b` ("net/mlx5: split meter color policy handling") Cc: stable@dpdk.org Signed-off-by: Bing Zhao <bingz@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 09:31:15 +02:00
Xueming Li	614966c2fa	net/mlx5: check DevX to support more Verbs ports Verbs API doesn't support device port number larger than 255 by design. To support more VF or SubFunction port representors, forces DevX API check when max Verbs device link ports larger than 255. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:14 +02:00
Xueming Li	686d05b60d	net/mlx5: enable DevX Tx queue creation Verbs API does not support Infiniband device port number larger 255 by design. To support more representors on a single Infiniband device DevX API should be engaged. While creating Send Queue (SQ) object with Verbs API, the PMD assigned IB device port attribute and kernel created the default miss flows in FDB domain, to redirect egress traffic from the queue being created to representor appropriate peer (wire, HPF, VF or SF). With DevX API there is no IB-device port attribute (it is merely kernel one, DevX operates in PRM terms) and PMD must create default miss flows in FDB explicitly. PMD did not provide this and using DevX API for E-Switch configurations was disabled. The default miss FDB flow matches E-Switch manager vport (to make sure the source is some representor) and SQn (Send Queue number - device internal queue index). The root flow table managed by kernel/firmware and it does not support vport redirect action, we have to split the default miss flow into two ones: - flow with lowest priority in the root table that matches E-Switch manager vport ID and jump to group 1. - flow in group 1 that matches E-Switch manager vport ID and SQn and forwards packet to peer vport Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:13 +02:00
Xueming Li	ebe9afedc7	net/mlx5: fix internal root table flow priority When creating internal transfer flow on root table with lowest priority, the flow was created with max UINT32_MAX priority. It is wrong since the flow is created in kernel and max priority supported is 16. This patch fixes this by adding internal flow check. Fixes: `5f8ae44dd4` ("net/mlx5: enlarge maximal flow priority") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:12 +02:00
Xueming Li	d9020f2577	net/mlx5: support flow item of normal Tx queue Extends txq flow pattern to support both hairpin and regular txq. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:11 +02:00
Xueming Li	a564038699	net/mlx5: support E-Switch manager egress traffic match For egress packet on representor, the vport ID in transport domain is E-Switch manager vport ID since representor shares resources of E-Switch manager. E-Switch manager vport ID and Tx queue internal device index are used to match representor egress packet. This patch adds flow item port ID match on E-Switch manager. E-Switch manager vport ID is 0xfffe on BlueField, 0 otherwise. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:10 +02:00
Xueming Li	1d47e9335e	net/mlx5: improve Verbs flow priority discovery To detect number flow Verbs flow priorities, PMD try to create Verbs flows in different priority. While Verbs is not designed to support ports larger than 255. When DevX supported by kernel driver, 16 Verbs priorities must be supported, no need to create Verbs flows. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:09 +02:00
Xueming Li	3fd2961efa	net/mlx5: use Netlink when IB port greater than 255 IB spec doesn't allow 255 ports on a single HCA, port number of 256 was cast to u8 value 0 which invalid to ibv_query_port() This patch invokes Netlink API to query port state when port number greater than 255. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:08 +02:00
Xueming Li	227813f28a	common/mlx5: get RDMA port state via Netlink Introduce netlink API to get RDMA port state. Port state is retrieved based on RDMA device name and port index. Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-21 09:31:03 +02:00
Jie Wang	f30157d988	net/iavf: support PPPoL2TPv2oUDP RSS Hash Add support for PPP over L2TPv2 over UDP protocol RSS Hash based on inner IP src/dst address and TCP/UDP src/dst port. Patterns are listed below: eth/ipv4(6)/udp/l2tpv2/ppp/ipv4(6) eth/ipv4(6)/udp/l2tpv2/ppp/ipv4(6)/udp eth/ipv4(6)/udp/l2tpv2/ppp/ipv4(6)/tcp Signed-off-by: Wenjun Wu <wenjun1.wu@intel.com> Signed-off-by: Jie Wang <jie1x.wang@intel.com> Acked-by: Beilei Xing <beilei.xing@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-21 14:15:59 +02:00
Tomasz Duszynski	77140af0b8	common/cnxk: add new PCI IDs to supported devices CNF10KA does not differ it terms of RVU resources from CN10KA platform hence add it to list of devices respective drivers support. Otherwise devices on CNF10KA are not probed even though compatible drivers exist. Signed-off-by: Tomasz Duszynski <tduszynski@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 13:43:56 +02:00
David Marchand	6353ff43a7	dma/idxd: fix build on Windows Windows compilation gives us a splat: In file included from ../drivers/dma/idxd/idxd_pci.c:10: In file included from ..\drivers\dma\idxd/idxd_internal.h:11: ..\drivers\dma\idxd/idxd_hw_defs.h:46:21: error: expected member name or ';' after declaration specifiers uint16_t __reserved[13]; ~~~~~~~~ ^ 1 error generated. Ironically, __reserved is probably a reserved token. Some drivers that build fine on Windows have structs with a "reserved" field, let's go with this. Fixes: `82147042d0` ("dma/idxd: add datapath structures") Signed-off-by: David Marchand <david.marchand@redhat.com>	2021-10-23 08:52:25 +02:00
Bruce Richardson	fed600889d	dmadev: enable build on Windows The dmadev library was not added to the list of libraries built on Windows, meaning it was skipped in those builds and also that none of the drivers were being considered for build. Adding dmadev to the list fixes this, and also enables the skeleton dmadev driver to be built - all-be-it with a small fix necessary. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com> Tested-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	96d7833393	dma/ioat: add burst capacity Adds the ability to find the remaining space in the IOAT ring. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	912c204424	dma/ioat: add vchan status Add support for the rte_dmadev_vchan_status API call. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	f526394174	dma/ioat: add statistics Add statistic tracking for operations in IOAT. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	9f46de12c4	dma/ioat: add data path completion Add the data path functions for gathering completed operations from IOAT devices. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	3d4b027324	dma/ioat: add data path job submission Add data path functions for enqueuing and submitting operations to IOAT devices. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	583f046dd4	dma/ioat: add start and stop Add start, stop and recover functions for IOAT devices. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	a19a2dceda	dma/ioat: add configuration Add functions for device configuration. The info_get and close functions are included here also. info_get can be useful for checking successful configuration and close is used by the dmadev api when releasing a configured device. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	4ffd224742	dma/ioat: add datapath structures Add data structures required for the data path of IOAT devices. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	13859ab773	dma/ioat: create dmadev instances on PCI probe When a suitable device is found during the PCI probe, create a dmadev instance for each channel. Internal structures and HW definitions required for device creation are also included. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	866e46bcd8	dma/ioat: add device probing and removal Add the basic device probe/remove skeleton code and initial documentation for new IOAT DMA driver. Maintainers update is also included in this patch. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Conor Walsh	8bedc51583	dma/idxd: move config script from raw driver Move the example script for configuring IDXD devices bound to the IDXD kernel driver from raw to dma, and create a symlink to still allow use from raw. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	9459de4edc	dma/idxd: add burst capacity Add support for the burst capacity API. This API will provide the calling application with the remaining capacity of the current burst (limited by max HW batch size). Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	5a23df349a	dma/idxd: add vchan status When testing dmadev drivers, it is useful to have the HW device in a known state. This patch adds the implementation of the function which will wait for the device to be idle (all jobs completed) before proceeding. Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	280c3ca02c	dma/idxd: add operation statistic tracking Add statistic tracking for DSA devices. The dmadev library documentation is also updated to add a generic section for using the library's statistics APIs. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	97aeed5637	dma/idxd: add data path job completion Add the data path functions for gathering completed operations. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	3d36a0a1c7	dma/idxd: add data path job submission Add data path functions for enqueuing and submitting operations to DSA devices. Documentation updates are included for dmadev library and IDXD driver docs as appropriate. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	a42ac7e31e	dma/idxd: add start and stop for PCI devices Add device start/stop functions for DSA devices bound to vfio. For devices bound to the IDXD kernel driver, these are not required since the IDXD kernel driver takes care of this. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	2f7d42c6e1	dma/idxd: add configure and info Add functions for device configuration. The info_get function is included here since it can be useful for checking successful configuration. Documentation is also updated to add device configuration usage info. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	82147042d0	dma/idxd: add datapath structures Add data structures required for the data path for IDXD devices. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com> Reviewed-by: Chengwen Feng <fengchengwen@huawei.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	9449330a84	dma/idxd: create dmadev instances on PCI probe When a suitable device is found during the PCI probe, create a dmadev instance for each HW queue. HW definitions required are also included. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:59 +02:00
Kevin Laatz	55dc0f6096	dma/idxd: create dmadev instances on bus probe When a suitable device is found during the bus scan/probe, create a dmadev instance for each HW queue. Internal structures required for device creation are also added. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:58 +02:00
Kevin Laatz	e888bb1278	dma/idxd: add bus device probing Add the basic device probing for DSA devices bound to the IDXD kernel driver. These devices can be configured via sysfs and made available to DPDK if they are found during bus scan. Relevant documentation is included. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:58 +02:00
Kevin Laatz	e33ad06eae	dma/idxd: add skeleton for VFIO based DSA device Add the basic device probe/remove skeleton code for DSA device bound to the vfio pci driver. Relevant documentation and MAINTAINERS update also included. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:58 +02:00
Bruce Richardson	65bcddfc66	raw/ioat: build only if dmadev not present Only build the rawdev IDXD/IOAT drivers if the dmadev drivers are not present. This change requires the dependencies to be reordered in drivers/meson.build so that rawdev can use the "RTE_DMA_* build macros to check for the presence of the equivalent dmadev driver. A note is also added to the documentation to inform users of this change. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Reviewed-by: Conor Walsh <conor.walsh@intel.com>	2021-10-22 22:40:58 +02:00
Pravin Pathak	e4869c0b47	event/dlb2: optimize credit allocations using port This commit implements the changes required for using suggested port type hint feature. Each port uses different credit quanta based on port type specified using port configuration flags. Each port has separate quanta defined in dlb2_priv.h Producer and consumer ports will need larger quanta value to reduce number of credit calls they make. Workers can use small quanta as they mostly work out of locally cached credits and don't request/return credits often. Signed-off-by: Pravin Pathak <pravin.pathak@intel.com>	2021-10-21 10:16:00 +02:00
Pavan Nikhilesh	53548ad300	eventdev: hide timer adapter PMD file Hide rte_event_timer_adapter_pmd.h file as it is an internal file. Remove rte_ prefix from rte_event_timer_adapter_ops structure. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com>	2021-10-21 10:14:50 +02:00
Pavan Nikhilesh	295c053f90	eventdev: hide event device related structures Move rte_eventdev, rte_eventdev_data structures to eventdev_pmd.h. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Harman Kalra <hkalra@marvell.com>	2021-10-21 10:14:50 +02:00
Pavan Nikhilesh	85be99719a	drivers/event: invoke probing finish function Invoke event_dev_probing_finish() function at the end of probing, this function sets the function pointers in the fp_ops flat array. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2021-10-21 10:14:50 +02:00
Pavan Nikhilesh	23d06e3766	eventdev: make driver interface as internal Mark all the driver specific functions as internal, remove `rte` prefix from `struct rte_eventdev_ops`. Remove experimental tag from internal functions. Remove `eventdev_pmd.h` from non-internal header files. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2021-10-21 10:14:50 +02:00
Pavan Nikhilesh	929ebdd543	eventdev/eth_rx: simplify event vector config Include vector configuration into the structure ``rte_event_eth_rx_adapter_queue_conf`` that is used to configure Rx adapter ethernet device Rx queue parameters. This simplifies event vector configuration as it avoids splitting configuration per Rx queue. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jay Jayatheerthan <jay.jayatheerthan@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-21 10:14:50 +02:00
Shijith Thotton	08d681e9c4	event/cnxk: fix max timer chunk pool cache size Reduced max chunk pool cache size from RTE_MEMPOOL_CACHE_MAX_SIZE(512) to 128. If chunk pool cache is empty, it gets filled during arm. Filling 512 entries at a time will fail arm if timeout is shorter, hence reduce the pool cache size. Fixes: `0e792433d0` ("event/cnxk: create and free timer adapter") Cc: stable@dpdk.org Signed-off-by: Shijith Thotton <sthotton@marvell.com>	2021-10-21 10:14:50 +02:00
Shijith Thotton	5fb651552c	event/cnxk: fix SSO and TIM argument parsing Type of kvargs value and handler function argument should match to avoid spilling memory. Fixes: `7ffa737996` ("event/cnxk: add option to configure getwork mode") Cc: stable@dpdk.org Signed-off-by: Shijith Thotton <sthotton@marvell.com>	2021-10-21 10:14:50 +02:00
Michael Baum	fc59a1ec55	common/mlx5: share MR mempool registration Expand the use of mempool registration to MR management for other drivers. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:58:00 +02:00
Michael Baum	a5d06c9006	common/mlx5: support device DMA map and unmap Since MR management has moved to the common area, there is no longer a need for the DMA map and unmap function for each driver. This patch share those functions. For most drivers it supports these operations for the first time. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:58:00 +02:00
Michael Baum	9f1d636f3e	common/mlx5: share MR management Add global shared MR cache as a field of common device structure. Move MR management to use this global cache for all drivers. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:57:58 +02:00
Michael Baum	fb690f71bd	common/mlx5: share MR top-half search function Add function to search in local liniar cache and use it in the drivers instead of their functions. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:57:28 +02:00
Michael Baum	5fbc75ace1	common/mlx5: add global MR cache create function Add function for global shared MR cache structure initialization. This function include: - btree initialization. - set callbacks for reg and dereg MR. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:57:24 +02:00
Michael Baum	85c7005e84	common/mlx5: add MR control initialization Add function for MR control structure initialization. This function include: - btree initialization. - dev_gen_ptr initialization. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:54:32 +02:00
Michael Baum	05fa53d6a0	net/mlx5: remove redundancy in MR file This patch remove two redundant things from MR file: 1. mr_find_contig_memsegs_data structure which is moved to common file before. 2. External memory mechanism - mlx5_tx_update_ext_mp function. Since commit [1] which added support for DMA map and unmap, external mem must be configured by the user using rte_mem_map function and no need to handle this in pmd. [1] commit `989e999d93` ("net/mlx5: support PCI device DMA map and unmap") Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	fe46b20c96	common/mlx5: share HCA capabilities handle Add HCA attributes structure as a field of device config structure. It query in common probing, and updates the timestamp format fields. Each driver use HCA attributes from common device config structure, instead of query it for itself. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	e35ccf243b	common/mlx5: share protection domain object Create shared Protection Domain in common area and add it and its PDN as fields of common device structure. Use this Protection Domain in all drivers and remove the PD and PDN fields from their private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	662d0dc671	common/mlx5: disable RoCE in device context creation Add option to get IB device after disabling RoCE. It is relevant if there is vDPA class in device arguments list. Use common device context in vDPA driver and remove the ctx field from its private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:46 +02:00
Michael Baum	ca1418ce39	common/mlx5: share device context object Create shared context device in common area and add it as a field of common device. Use this context device in all drivers and remove the ctx field from their private structure. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:44 +02:00
Michael Baum	5bc38358b5	net/mlx5: remove redundant flag in device config Device configure structure has flag named devx as same as SH structure with the same meaning. Remove the flag from the configuration structure and move all the usages to the SH flag. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:36 +02:00
Michael Baum	887183effa	common/mlx5: move basic probing functions to common Move open IBV/DevX device function to common. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:53:32 +02:00
Michael Baum	5021ce2085	net/mlx5: rearrange probing functions for Windows Rearrange device detection code. Rearrange configuration structures filling. Remove unneeded variables. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:39:24 +02:00
Michael Baum	8520992403	common/mlx5: share memory related devargs Add device configure structure and function to parse user device arguments into it. Move parsing and management of relevant device arguments to common. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:39:04 +02:00
Michael Baum	a77bedf255	common/mlx5: share common definitions Create MACRO definitions file in the common driver as preparation for MR and basic probe sharing. Move relevant definitions from the net driver to the above file. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:39:01 +02:00
Michael Baum	7af08c8f1a	common/mlx5: share basic probing with internal drivers Create common probing structure that includes, for now, basic probing information detected by the common driver and share it with all the internal drivers. Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:38:46 +02:00
Michael Baum	620be7f27b	net/mlx5: register memory event callback in Windows In device initialization, the driver registers to free hugepages events. When hugepage is released, this callback frees all its related MRs. In Windows initialization, this callback is not registered what may cause to use invalid memory. This patch adds memory event callback registration in Windows initialization. Fixes: `980826dc6f` ("net/mlx5: probe on Windows") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-21 15:37:23 +02:00
Xueming Li	5adef306da	devargs: make bus optional Global devargs syntax is used as device iteration filter like "class=vdpa", a devargs without bus args is valid from parsing perspective. This patch makes bus args optional. Fixes: `d2a66ad794` ("bus: add device arguments name parsing") Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Gaetan Rivet <grive@u256.net>	2021-10-21 11:32:44 +02:00
Dapeng Yu	c8c6997325	net/ice: fix sideband queue initialization Sideband queue need to be initialized when device is initialized. Otherwise the calling to function "ice_init_ctrlq" may fail. This patch fixes it. Fixes: `97f4f78bbd` ("net/ice/base: add functions for device clock control") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-20 03:37:37 +02:00
Satheesh Paul	1f66919817	common/cnxk: improve MCAM entries management This patch removes the MCAM preallocation scheme. The free entry cache is removed and for every flow created, an MCAM allocation request is made to the kernel. Each priority level has a list of MCAM entries. For every flow rule added, the MCAM entry obtained from kernel is checked if it is at the correct user specified priority. If not, the existing rules are moved across MCAM entries so that the user specified priority is maintained. Signed-off-by: Satheesh Paul <psatheesh@marvell.com> Reviewed-by: Kiran Kumar K <kirankumark@marvell.com>	2021-10-19 21:00:11 +02:00
Gowrishankar Muthukrishnan	5ea354a1f2	net/cnxk: support telemetry Add telemetry endpoints to ethdev. Signed-off-by: Gowrishankar Muthukrishnan <gmuthukrishn@marvell.com> Reviewed-by: Harman Kalra <hkalra@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 18:37:48 +02:00
Gowrishankar Muthukrishnan	e8c954a369	mempool/cnxk: support telemetry Adding telemetry endpoints to cnxk mempool driver. Signed-off-by: Gowrishankar Muthukrishnan <gmuthukrishn@marvell.com> Reviewed-by: Harman Kalra <hkalra@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 18:37:15 +02:00
Gowrishankar Muthukrishnan	af75aac789	common/cnxk: support telemetry for NIX Add telemetry endpoints to NIX. Signed-off-by: Gowrishankar Muthukrishnan <gmuthukrishn@marvell.com> Reviewed-by: Harman Kalra <hkalra@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 18:37:05 +02:00
Gowrishankar Muthukrishnan	349cca75f6	common/cnxk: support telemetry for network pool allocator Add telemetry endpoints to NPA. Signed-off-by: Gowrishankar Muthukrishnan <gmuthukrishn@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 18:36:27 +02:00
Sunil Kumar Kori	6af19a9d89	net/cnxk: support meter action to flow destroy Meters are configured per flow using rte_flow_create API. Patch adds support for destroy operation for meter action applied on the flow. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:36 +02:00
Sunil Kumar Kori	58397fedc6	net/cnxk: support meter action to flow create Meters are configured per flow using rte_flow_create API. Implement support for meter action applied on the flow. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:31 +02:00
Sunil Kumar Kori	6c6ca3fecc	net/cnxk: support to read/update meter stats Implement API to read and update stats corresponding to given meter instance for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:28 +02:00
Sunil Kumar Kori	e10be75d19	net/cnxk: support to update precolor DSCP table Implement API to update DSCP table for pre-coloring for incoming packet per nixlf for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:25 +02:00
Sunil Kumar Kori	175166a8f4	net/cnxk: support to enable/disable meter Implement API to enable or disable meter instance for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:21 +02:00
Sunil Kumar Kori	71ec9409a4	net/cnxk: support to delete meter Implement API to delete meter instance for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:18 +02:00
Sunil Kumar Kori	d0ea0beb6a	net/cnxk: support to create meter Implement API to create meter instance for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:15 +02:00
Sunil Kumar Kori	b94fbf8b9a	net/cnxk: support to delete meter policy Implement API to delete meter policy for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:11 +02:00
Sunil Kumar Kori	ffee183e97	net/cnxk: support to create meter policy Implement API to add meter policy for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:25:08 +02:00
Sunil Kumar Kori	26b034f78c	net/cnxk: support to validate meter policy Implement API to validate meter policy for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:24:59 +02:00
Sunil Kumar Kori	d28eff65b2	net/cnxk: support to delete meter profile Implement API to delete meter profile for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:24:51 +02:00
Sunil Kumar Kori	a83db6b3c1	net/cnxk: support to create meter profile Implement API to add meter profile for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:24:47 +02:00
Sunil Kumar Kori	329d978e4a	net/cnxk: support to get meter capabilities Implement ethdev operation to get meter capabilities for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:24:43 +02:00
Sunil Kumar Kori	a7c236b894	net/cnxk: support meter ops get To enable support for ingress meter, supported operations are exposed for CNXK platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Signed-off-by: Rakesh Kudurumalla <rkudurumalla@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:24:34 +02:00
Sunil Kumar Kori	0ca63f1b4c	common/cnxk: support meter in action list Meter action is added in supported action list. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:24:26 +02:00
Sunil Kumar Kori	bea4a81b34	common/cnxk: support profile stats reset Implement RoC API to reset stats per bandwidth profile or per NIXLF. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:24:02 +02:00
Sunil Kumar Kori	0fa36bc288	common/cnxk: support profile statistics CN10K platform provides statistics per bandwidth profile and per nixlf. Implement RoC API to read stats for given bandwidth profile. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:23:53 +02:00
Sunil Kumar Kori	c8881e6ef0	common/cnxk: support bandwidth profile stats to index CN10K platform supports different stats for HW bandwidth profiles. Implement RoC API to get index for given stats type. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:23:32 +02:00
Sunil Kumar Kori	b609507b7c	common/cnxk: support bandwidth profiles connection To maintain chain of bandwidth profiles, they needs to be connected. Implement RoC API to connect two bandwidth profiles at different levels. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:23:18 +02:00
Sunil Kumar Kori	3396110111	common/cnxk: support precolor table setup For initial coloring of input packet, CN10K platform maintains precolor table for VLAN, DSCP and Generic. Implement RoC interface to setup pre color table. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:23:03 +02:00
Sunil Kumar Kori	ab706fa825	common/cnxk: support bandwidth profile dump Implement RoC API to dump bandwidth profile on CN10K platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:22:50 +02:00
Sunil Kumar Kori	52511cd2ba	common/cnxk: support profile state toggle Implement RoC API to enable or disable HW bandwidth profiles on CN10K platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:22:30 +02:00
Sunil Kumar Kori	4ad8bc2fc7	common/cnxk: support bandwidth profile configure Implement RoC API to configure HW bandwidth profile for CN10K platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:22:16 +02:00
Sunil Kumar Kori	bf7290c65f	common/cnxk: support bandwidth profiles free Implement RoC interface to free HW bandwidth profiles on CN10K platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:22:04 +02:00
Sunil Kumar Kori	7a63d75ecb	common/cnxk: support bandwidth profiles allocation Implement RoC API to allocate HW resources i.e. bandwidth profiles for policer processing on CN10K platform. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:21:40 +02:00
Sunil Kumar Kori	05a944fea3	common/cnxk: support to get profile count Implement interface to get available profile count for given NIXLF. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:21:25 +02:00
Sunil Kumar Kori	b7cb2203eb	common/cnxk: support to get policer level to index CN10K platform supports policer up to 3 level of hierarchy. Implement RoC API to get corresponding index for given level. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:21:14 +02:00
Sunil Kumar Kori	cf8f6aa12a	common/cnxk: update policer mbox API and HW definitions To support ingress policer on CN10K, MBOX interfaces and HW definitions updated. Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-19 16:20:29 +02:00
Tejasree Kondoj	206c9d5d92	net/octeontx2: use fast udata and mdata flags Using fast metadata and userdata flags instead of driver callbacks for set_pkt_metadata and get_userdata in inline IPsec. Signed-off-by: Tejasree Kondoj <ktejasree@marvell.com> Acked-by: Anoob Joseph <anoobj@marvell.com>	2021-10-19 16:19:52 +02:00
Lior Margalit	0c3fa68396	net/mlx5: fix RSS expansion for L2/L3 VXLAN The RSS expansion algorithm is using a graph to find the possible expansion paths. The current implementation does not differentiate between standard (L2) VXLAN and L3 VXLAN. As result the flow is expanded with all possible paths. For example: testpmd> flow create... / vxlan / end actions rss level 2 / end It is currently expanded to the following paths: ETH IPV4 UDP VXLAN END ETH IPV4 UDP VXLAN ETH IPV4 END ETH IPV4 UDP VXLAN ETH IPV6 END ETH IPV4 UDP VXLAN IPV4 END ETH IPV4 UDP VXLAN IPV6 END The fix is to adjust the expansion according to the outer UDP destination port. In case flow pattern defines a match on the standard udp port, 4789, or does not define a match on the destination port, which also implies setting the standard one, the expansion for the above example will be: ETH IPV4 UDP VXLAN END ETH IPV4 UDP VXLAN ETH IPV4 END ETH IPV4 UDP VXLAN ETH IPV6 END Otherwise, the expansion will be: ETH IPV4 UDP VXLAN END ETH IPV4 UDP VXLAN IPV4 END ETH IPV4 UDP VXLAN IPV6 END Fixes: `f4f06e3615` ("net/mlx5: add flow VXLAN item") Cc: stable@dpdk.org Signed-off-by: Lior Margalit <lmargalit@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-18 09:12:42 +02:00
Ruifeng Wang	778602fe57	net/i40e: fix risk in descriptor read in NEON Rx Rx descriptor is 16B/32B in size. If the DD bit is set, it indicates that the rest of the descriptor words have valid values. Hence, the word containing DD bit must be read first before reading the rest of the descriptor words. In NEON vector PMD, vector load loads two contiguous 8B of descriptor data into vector register. Given vector load ensures no 16B atomicity, read of the word that includes DD field could be reordered after read of other words. In this case, some words could contain invalid data. Read barrier is added after read of qword1 that includes DD field. And qword0 is reloaded to update vector register. This ensures that the fetched data is correct. Testpmd single core test on N1SDP/ThunderX2 showed no performance drop. Fixes: `ae0eb310f2` ("net/i40e: implement vector PMD for ARM") Cc: stable@dpdk.org Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2021-10-19 13:13:55 +02:00
Alvin Zhang	1506c90029	net/i40e: fix IPv6 fragment RSS offload type in flow To keep flow format uniform with ice, this patch adds support for this RSS rule: flow create 0 ingress pattern eth / ipv6 / ipv6_frag_ext / end \ actions rss types ipv6-frag end queues end queues end / end Fixes: `ef4c16fd91` ("net/i40e: refactor RSS flow") Cc: stable@dpdk.org Signed-off-by: Alvin Zhang <alvinx.zhang@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-19 13:06:42 +02:00
Leyi Rong	0d989ff9ca	net/ice: fix generic build on FreeBSD The common header file for vectorization is included in multiple files, and so must use macros for the current compilation unit, rather than the compiler-capability flag set for the whole driver. With the current, incorrect, macro, the AVX512 or AVX2 flags may be set when compiling up SSE code, leading to compilation errors. Changing from "CC_AVX_SUPPORT" to the compiler-defined "__AVX__" macros fixes this issue. In addition, splitting AVX-specific code into the new ice_rxtx_common_avx.h header file to avoid such bugs. Bugzilla ID: 788 Fixes: `a4e480de26` ("net/ice: optimize Tx by using AVX512") Fixes: `20daa1c978` ("net/ice: fix crash in AVX512") Cc: stable@dpdk.org Signed-off-by: Leyi Rong <leyi.rong@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-19 13:02:37 +02:00
Leyi Rong	c454435d88	net/i40e: fix generic build on FreeBSD The common header file for vectorization is included in multiple files, and so must use macros for the current compilation unit, rather than the compiler-capability flag set for the whole driver. With the current, incorrect, macro, the AVX512 or AVX2 flags may be set when compiling up SSE code, leading to compilation errors. Changing from "CC_AVX_SUPPORT" to the compiler-defined "__AVX__" macros fixes this issue. In addition, splitting AVX-specific code into the new i40e_rxtx_common_avx.h header file to avoid such bugs. Bugzilla ID: 788 Fixes: `0604b1f220` ("net/i40e: fix crash in AVX512") Cc: stable@dpdk.org Signed-off-by: Leyi Rong <leyi.rong@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-19 13:01:56 +02:00
Eli Britstein	292be511d2	net/mlx5: support more tunnel types Accept RTE_FLOW_ITEM_TYPE_GRE, RTE_FLOW_ITEM_TYPE_NVGRE and RTE_FLOW_ITEM_TYPE_GENEVE as valid tunnel types. Fixes: `4ec6360de3` ("net/mlx5: implement tunnel offload") Cc: stable@dpdk.org Signed-off-by: Eli Britstein <elibr@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-19 23:51:10 +02:00
Dapeng Yu	287ca31bea	net/softnic: fix memory leak of meter policy After the meter policies are created, they are not freed on device close. This patch fixes it. Fixes: `5f0d54f372` ("ethdev: add pre-defined meter policy API") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Jasvinder Singh <jasvinder.singh@intel.com>	2021-10-19 22:45:19 +02:00
Michal Krawczyk	ba94dad4e0	net/ena: update version to 2.5.0 This version update contains: * Fix for verification of the offload capabilities (especially for IPv6 packets). * Support for Tx and Rx free threshold values. * Fixes for per-queue offload capabilities. * Announce support of the scattered Rx offload. * NUMA aware allocations. * Check for the missing Tx completions. Signed-off-by: Michal Krawczyk <mk@semihalf.com>	2021-10-19 15:04:17 +02:00
Michal Krawczyk	f93e20e516	net/ena: check missing Tx completions In some cases Tx descriptors may be uncompleted by the HW and as a result they will never be released. This patch adds checking for the missing Tx completions to the ENA timer service, so in order to use this feature, the application must call the function rte_timer_manage(). Missing Tx completion reset threshold is determined dynamically, by taking into consideration ring size and the default value. Tx cleanup is associated with the Tx burst function. As DPDK applications can call Tx burst function dynamically, time when last cleanup was called must be traced to avoid false detection of the missing Tx completion. Signed-off-by: Michal Krawczyk <mk@semihalf.com> Reviewed-by: Igor Chauskin <igorch@amazon.com> Reviewed-by: Shai Brandes <shaibran@amazon.com>	2021-10-19 15:04:17 +02:00
Michal Krawczyk	08180833cb	net/ena: add NUMA-aware allocations Only the IO rings memory was allocated with taking the socket ID into the respect, while the other structures was allocated using the regular rte_zmalloc() API. Ring specific structures are now being allocated using the ring's socket ID. Signed-off-by: Michal Krawczyk <mk@semihalf.com> Reviewed-by: Igor Chauskin <igorch@amazon.com> Reviewed-by: Shai Brandes <shaibran@amazon.com>	2021-10-19 15:04:17 +02:00
Michal Krawczyk	e2a6d08bef	net/ena: advertise scattered Rx capability ENA can't be forced to always pass single descriptor for the Rx packet. Even if the passed buffer size is big enough to hold the data, we can't make assumption that the HW won't use extra descriptor because of internal optimizations. This assumption may be true, but only for some of the FW revisions, which may differ depending on the used AWS instance type. As the scattered Rx support on the Rx path already exists, the driver just needs to announce DEV_RX_OFFLOAD_SCATTER capability by turning on the rte_eth_dev_data::scattered_rx option. Fixes: `1173fca25a` ("ena: add polling-mode driver") Cc: stable@dpdk.org Signed-off-by: Michal Krawczyk <mk@semihalf.com> Reviewed-by: Igor Chauskin <igorch@amazon.com> Reviewed-by: Shai Brandes <shaibran@amazon.com>	2021-10-19 15:04:17 +02:00
Michal Krawczyk	3a822d79c5	net/ena: fix per-queue offload capabilities As ENA currently doesn't support offloads which could be configured per-queue, only per-port flags should be set. In addition, to make the code cleaner, parsing appropriate offload flags is encapsulated into helper functions, in a similar matter it's done by the other PMDs. [1] https://doc.dpdk.org/guides/prog_guide/ poll_mode_drv.html?highlight=offloads#hardware-offload Fixes: `7369f88f88` ("net/ena: convert to new Rx offloads API") Fixes: `56b8b9b7e5` ("net/ena: convert to new Tx offloads API") Cc: stable@dpdk.org Signed-off-by: Michal Krawczyk <mk@semihalf.com> Reviewed-by: Igor Chauskin <igorch@amazon.com> Reviewed-by: Shai Brandes <shaibran@amazon.com>	2021-10-19 15:04:17 +02:00
Michal Krawczyk	005064e505	net/ena: support Tx/Rx free thresholds The caller can pass Tx or Rx free threshold value to the configuration structure for each ring. It determines when the Tx/Rx function should start cleaning up/refilling the descriptors. ENA was ignoring this value and doing it's own calculations. Now the user can configure ENA's behavior using this parameter and if this variable won't be set, the ENA will continue with the old behavior and will use it's own threshold value. The default value is not provided by the ENA in the ena_infos_get(), as it's being determined dynamically, depending on the requested ring size. Note that NULL check for Tx conf was removed from the function ena_tx_queue_setup(), as at this place the configuration will be either provided by the user or the default config will be used and it's handled by the upper (rte_ethdev) layer. Tx threshold shouldn't be used for the Tx cleanup budget as it can be inadequate to the used burst. Now the PMD tries to release mbufs for the ring until it will be depleted. Signed-off-by: Michal Krawczyk <mk@semihalf.com> Reviewed-by: Igor Chauskin <igorch@amazon.com> Reviewed-by: Shai Brandes <shaibran@amazon.com>	2021-10-19 15:04:17 +02:00
Michal Krawczyk	e8c838fde9	net/ena: fix offload capabilities verification ENA PMD has multiple checksum offload flags, which are more discrete than the DPDK offload capabilities flags. As the driver wasn't storing it's internal checksum offload capabilities and was relying only on the DPDK capabilities, not all scenarios could be properly covered (like when to prepare pseudo header checksum and when not). Moreover, the user could request offload capability, which isn't supported by the HW and the PMD would quietly ignore the issue. This commit reworks eth_ena_prep_pkts() function to perform additional checks and to properly reflect the HW requirements. With the RTE_LIBRTE_ETHDEV_DEBUG enabled, the function will do even more verifications, to help the user find any issues with the mbuf configuration. Fixes: `b3fc5a1ae1` ("net/ena: add Tx preparation") Cc: stable@dpdk.org Signed-off-by: Michal Krawczyk <mk@semihalf.com> Reviewed-by: Igor Chauskin <igorch@amazon.com> Reviewed-by: Shai Brandes <shaibran@amazon.com>	2021-10-19 15:04:17 +02:00
Viacheslav Galaktionov	26706314d4	net/sfc: implement transfer proxy port callback In sfc, MAE admin serves as a transfer proxy. In order to track which ethdev is privileged, augment every independent switch port structure with information about its MAE privilege. Signed-off-by: Viacheslav Galaktionov <viacheslav.galaktionov@oktetlabs.ru> Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Andy Moreton <amoreton@xilinx.com>	2021-10-18 20:56:02 +02:00
Viacheslav Galaktionov	2f577f0ea1	net/sfc: allow ports without MAE privilege Register unprivileged ports in the switch domain registry in order to allow redirecting traffic to them. Differentiate between different levels of MAE support, update all MAE status checks. Signed-off-by: Viacheslav Galaktionov <viacheslav.galaktionov@oktetlabs.ru> Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Andy Moreton <amoreton@xilinx.com>	2021-10-18 20:56:02 +02:00
Viacheslav Galaktionov	40ccb31158	common/sfc_efx/base: support unprivileged MAE clients In order to differentiate between privileged and unprivileged MAE clients, add a separate boolean flag to represent a NIC's MAE privilege level. Allow initializing unprivileged MAE clients by avoiding calls to functions that can only be called by the admin NIC. Signed-off-by: Viacheslav Galaktionov <viacheslav.galaktionov@oktetlabs.ru> Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Andy Moreton <amoreton@xilinx.com>	2021-10-18 20:56:02 +02:00
Ferruh Yigit	b563c14212	ethdev: remove jumbo offload flag Removing 'DEV_RX_OFFLOAD_JUMBO_FRAME' offload flag. Instead of drivers announce this capability, application can deduct the capability by checking reported 'dev_info.max_mtu' or 'dev_info.max_rx_pktlen'. And instead of application setting this flag explicitly to enable jumbo frames, this can be deduced by driver by comparing requested 'mtu' to 'RTE_ETHER_MTU'. Removing this additional configuration for simplification. Suggested-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Rosen Xu <rosen.xu@intel.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Huisong Li <lihuisong@huawei.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com> Acked-by: Michal Krawczyk <mk@semihalf.com>	2021-10-18 19:20:21 +02:00
Ferruh Yigit	f7e04f57ad	ethdev: move MTU set check to library Move requested MTU value check to the API to prevent the duplicated code. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Rosen Xu <rosen.xu@intel.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-10-18 19:20:21 +02:00
Ferruh Yigit	dd4e429c95	ethdev: move jumbo frame offload check to library Setting MTU bigger than RTE_ETHER_MTU requires the jumbo frame support, and application should enable the jumbo frame offload support for it. When jumbo frame offload is not enabled by application, but MTU bigger than RTE_ETHER_MTU is requested there are two options, either fail or enable jumbo frame offload implicitly. Enabling jumbo frame offload implicitly is selected by many drivers since setting a big MTU value already implies it, and this increases usability. This patch moves this logic from drivers to the library, both to reduce the duplicated code in the drivers and to make behaviour more visible. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Rosen Xu <rosen.xu@intel.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Huisong Li <lihuisong@huawei.com>	2021-10-18 19:20:21 +02:00
Ferruh Yigit	1bb4a528c4	ethdev: fix max Rx packet length There is a confusion on setting max Rx packet length, this patch aims to clarify it. 'rte_eth_dev_configure()' API accepts max Rx packet size via 'uint32_t max_rx_pkt_len' field of the config struct 'struct rte_eth_conf'. Also 'rte_eth_dev_set_mtu()' API can be used to set the MTU, and result stored into '(struct rte_eth_dev)->data->mtu'. These two APIs are related but they work in a disconnected way, they store the set values in different variables which makes hard to figure out which one to use, also having two different method for a related functionality is confusing for the users. Other issues causing confusion is: * maximum transmission unit (MTU) is payload of the Ethernet frame. And 'max_rx_pkt_len' is the size of the Ethernet frame. Difference is Ethernet frame overhead, and this overhead may be different from device to device based on what device supports, like VLAN and QinQ. * 'max_rx_pkt_len' is only valid when application requested jumbo frame, which adds additional confusion and some APIs and PMDs already discards this documented behavior. * For the jumbo frame enabled case, 'max_rx_pkt_len' is an mandatory field, this adds configuration complexity for application. As solution, both APIs gets MTU as parameter, and both saves the result in same variable '(struct rte_eth_dev)->data->mtu'. For this 'max_rx_pkt_len' updated as 'mtu', and it is always valid independent from jumbo frame. For 'rte_eth_dev_configure()', 'dev->data->dev_conf.rxmode.mtu' is user request and it should be used only within configure function and result should be stored to '(struct rte_eth_dev)->data->mtu'. After that point both application and PMD uses MTU from this variable. When application doesn't provide an MTU during 'rte_eth_dev_configure()' default 'RTE_ETHER_MTU' value is used. Additional clarification done on scattered Rx configuration, in relation to MTU and Rx buffer size. MTU is used to configure the device for physical Rx/Tx size limitation, Rx buffer is where to store Rx packets, many PMDs use mbuf data buffer size as Rx buffer size. PMDs compare MTU against Rx buffer size to decide enabling scattered Rx or not. If scattered Rx is not supported by device, MTU bigger than Rx buffer size should fail. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com> Acked-by: Huisong Li <lihuisong@huawei.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Rosen Xu <rosen.xu@intel.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com>	2021-10-18 19:20:20 +02:00
Ferruh Yigit	54fe0cf1b8	net/enic: fix build with GCC 7.5 Build error: ../drivers/net/enic/enic_fm_flow.c: In function 'enic_fm_flow_parse': ../drivers/net/enic/enic_fm_flow.c:1467:24: error: 'dev' may be used uninitialized in this function [-Werror=maybe-uninitialized] struct rte_eth_dev dev; ^~~ ../drivers/net/enic/enic_fm_flow.c:1580:24: error: 'dev' may be used uninitialized in this function [-Werror=maybe-uninitialized] struct rte_eth_dev dev; ^~~ ../drivers/net/enic/enic_fm_flow.c:1599:24: error: 'dev' may be used uninitialized in this function [-Werror=maybe-uninitialized] struct rte_eth_dev *dev; ^~~ Build error looks like false positive, but to silence the compiler initializing the pointer with NULL. Bugzilla ID: 812 Fixes: `54bd4ebe8b` ("net/enic: support meta flow actions to overrule destinations") Reported-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-10-18 17:53:17 +02:00
Pavan Nikhilesh	ac6deebb58	common/octeontx2: enable build only on 64-bit Linux Since AARCH32 extension is not implemented on octeontx2 family, only enable build for 64bit. Due to Linux kernel AF(Admin Function) driver dependency, only enable build for 64-bit Linux. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-15 19:13:13 +02:00
Pavan Nikhilesh	e1369718f5	common/octeontx: enable build only on 64-bit Linux Since AARCH32 extension is not implemented on octeontx family, only enable build for 64bit. Due to Linux kernel AF(Admin function) driver dependency, only enable build for 64-bit Linux. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-15 19:13:13 +02:00
Pavan Nikhilesh	9ec67c12bd	net/thunderx: enable build only on 64-bit Linux Since AARCH32 extension is not implemented on thunderx family, only enable build for 64bit. Due to Linux kernel AF(Admin function) driver dependency, only enable build for Linux. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-10-15 19:13:13 +02:00
Xueming Li	51c1b8f8a0	net/bonding: fix Tx queue release When release Tx queue, Rx queue data got freed because wrong Tx queue data located. This patch fixes the wrong Tx queue data location. Fixes: `7483341ae5` ("ethdev: change queue release callback") Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-15 12:45:58 +02:00
Li Zhang	771253ea8f	net/mlx5: fix domains selection for meter policy Fate actions are different per domain. When all the domains, ingress, egress and FDB (transfer), can support all the policy actions, i.e. [SET_TAG], the policy prepares resources for all the domains and failure happens if one of the domains misses its fate action in the policy action list. Remove the domains missing their fate action from the meter policy preparation. Now, the policy will prepare a domain only when the domain supports all the actions and when one of the domain fate actions is on the list. Fixes: `afb4aa4f12` ("net/mlx5: support meter policy operations") Cc: stable@dpdk.org Signed-off-by: Li Zhang <lizh@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-14 10:48:33 +02:00
Simei Su	9f8c4cf02d	net/ice: fix dereferenced null pointer This patch fixes coverity issue by avoiding use of null pointer in taking false branch. Coverity issue: 373360 Fixes: `437dbd2fd4` ("net/ice: support 1PPS") Signed-off-by: Simei Su <simei.su@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-14 08:26:43 +02:00
Dapeng Yu	fff4914bd9	net/ice: fix freeing queues on DCF device reset In function ice_dcf_stop_queues(), RX queues and TX queues are actually not freed, so their pointers shall not be set to NULL when queues are stopped. This patch adds function call to free queues on DCF device close, which also set the RX and TX queues' pointers to NULL on freeing queues, and avoids referring to the released resource when device is started again. Fixes: `1a86f4dbdf` ("net/ice: support DCF device reset") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-13 12:58:04 +02:00
Dapeng Yu	cc2f82b96a	net/ice: fix deadlock on flow redirect If flow redirect failed, the spinlock will not be unlocked. This patch fixes it. Fixes: `bc9201388d` ("net/ice: support flow redirect") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-13 12:53:28 +02:00
Simei Su	250e2ed8d8	net/ice: fix build when Rx descriptor size is 16 The Timestamp Overlay feature is available only in 32B Flex Descriptors. This patch adds compile option when in 16B Flex Descriptors. Fixes: `953e74e6b7` ("net/ice: enable Rx timestamp on flex descriptor") Fixes: `646dcbe6c7` ("net/ice: support IEEE 1588 PTP") Signed-off-by: Simei Su <simei.su@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-13 12:33:17 +02:00
Junfeng Guo	c2450e933f	net/ice/base: fix null pointer dereferences for parser Null-checking "p" suggests that it may be null, but it has already been dereferenced on all paths leading to the check. Thus correct the code lines and remove the redundant line. Fixes: `c84f8aa210` ("net/ice/base: add parser runtime skeleton") Signed-off-by: Junfeng Guo <junfeng.guo@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-13 12:29:54 +02:00
Robin Zhang	baf1d4cd8a	net/i40e: upgrade AQ command of MAC/VLAN remove Firmware 8.4+ will return I40E_AQ_RC_ENOENT when try to delete non-existent MAC/VLAN addresses from the HW filtering, this should not be considered as an Admin Queue error. But in i40e_asq_send_command, it will return I40E_ERR_ADMIN_QUEUE_ERROR if the return value of Admin Queue command processed by Firmware is not I40E_AQ_RC_OK or I40E_AQ_RC_EBUSY. Use i40e_aq_remove_macvlan_v2 instead so that we can get the corresponding Admin Queue status, and not report as an error in DPDK when Firmware return I40E_AQ_RC_ENOENT, and this also not break with an old firmware. Signed-off-by: Robin Zhang <robinx.zhang@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-13 04:22:19 +02:00
Junfeng Guo	c99174deba	net/ice/base: fix parser runtime reset Adjust the code line order of the parser runtime reset, since the struct rt->psr is used in function _rt_flag_set before assignment. Fixes: `c84f8aa210` ("net/ice/base: add parser runtime skeleton") Cc: stable@dpdk.org Signed-off-by: Junfeng Guo <junfeng.guo@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-13 04:10:10 +02:00
Andrew Rybchenko	64c952ec78	drivers/net: remove queue xstats auto-fill flag Some drivers do not provide per-queue statistics. So, there is no point to have these misleading zeros in xstats. Fixes: `f30e69b41f` ("ethdev: add device flag to bypass auto-filled queue xstats") Cc: stable@dpdk.org Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-15 00:17:44 +02:00
Jiawen Wu	e5ece1f467	net/txgbe: fix VXLAN-GPE packet checksum Parse inner L2 length to set correct packet type, and ensure that hardware can compute the checksum successfully. Fixes: `b950203be7` ("net/txgbe: support VXLAN-GPE") Cc: stable@dpdk.org Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-14 16:42:02 +02:00
Jiawen Wu	196f0e123b	net/txgbe: set fixed flag for exact link speed Setting exact link speed makes sense if auto-negotiation is disabled. Fixed flag is required to disable auto-negotiation. Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-14 16:42:02 +02:00
Jiawen Wu	ef6427a3f2	net/txgbe: fix to get interrupt status It's necessary to set 1 on TXGBE_PX_INTA register to get interrupts normally, when legacy interrupt mode is used. Fixes: `2fc745e6b6` ("net/txgbe: add interrupt operation") Cc: stable@dpdk.org Signed-off-by: Jiawen Wu <jiawenwu@trustnetic.com>	2021-10-14 16:42:02 +02:00
Viacheslav Ovsiienko	40c8fb1fd3	net/mlx5: update modify field action Update immediate value/pointer source operand support for modify field RTE Flow action: - source operand data can be presented by byte buffer (instead of former uint64_t) or by pointer - no host byte ordering is assumed anymore for immediate data buffer (not uint64_t anymore) - no immediate value offset is expected (the source subfield is located at the same offset as in destination) Signed-off-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-14 14:34:31 +02:00
Ferruh Yigit	cf21c2d3c8	net/bnxt: fix build with debug macro Setting 'RTE_LIBRTE_BNXT_TRUFLOW_DEBUG' macro cause build error, removing it. Also with meson build system compile time debug macros should be documented in driver documentation, since there is no other way to figure out their existence. Fixes: `ad9eed0248` ("net/bnxt: support flow template for Thor") Cc: stable@dpdk.org Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-14 00:55:05 +02:00
Ferruh Yigit	675a6c1874	net/softnic: fix useless address check Reported by "gcc (GCC) 12.0.0 20211003 (experimental)": ./drivers/net/softnic/rte_eth_softnic_cli.c: In function ‘tmgr_hierarchy_default’: ./drivers/net/softnic/rte_eth_softnic_cli.c:634:73: error: the comparison will always evaluate as ‘true’ for the address of ‘tc_valid’ will never be NULL [-Werror=address] 634 \| (&params->shared_shaper_id.tc_valid[0]) ? 1 : 0, \| ^ Fixing it by removing useless check. Fixes: `1af2dc5111` ("net/softnic: add command for default tmgr hierarchy") Fixes: `5eb676d74f` ("net/softnic: add config flexibility to TM") Cc: stable@dpdk.org Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Jasvinder Singh <jasvinder.singh@intel.com>	2021-10-13 23:37:17 +02:00
Andrew Rybchenko	f55b61cec9	net/sfc: support port representor flow item Add support for item PORT_REPRESENTOR which should be used instead of ambiguous item PORT_ID. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 22:59:26 +02:00
Andrew Rybchenko	8d13351d4c	net/octeontx2: support port representor flow action Action PORT_ID implementation assumes ingress only. Its semantics suggests that support for equal action PORT_REPRESENTOR be added. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 22:59:26 +02:00
Andrew Rybchenko	d35dd287a2	net/mlx5: support represented port flow action Semantics of the existing support for action PORT_ID suggests that support for equal action REPRESENTED_PORT be implemented. Helper functions keep port_id suffix since action MLX5_FLOW_ACTION_PORT_ID is still used internally. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 22:59:26 +02:00
Andrew Rybchenko	54bd4ebe8b	net/enic: support meta flow actions to overrule destinations Add support for actions PORT_REPRESENTOR and REPRESENTED_PORT based on the existing support for action PORT_ID. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Hyong Youb Kim <hyonkim@cisco.com>	2021-10-13 22:59:26 +02:00
Andrew Rybchenko	640b44aa5c	net/bnxt: support meta flow actions to overrule destinations Add support for actions PORT_REPRESENTOR and REPRESENTED_PORT based on the existing support for action PORT_ID. Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 22:59:26 +02:00
Andrew Rybchenko	a8321e0979	net/bnxt: support meta flow items to match on traffic source Add support for items PORT_REPRESENTOR and REPRESENTED_PORT based on the existing support for item PORT_ID. The use of item PORT_ID depends on the specified direction attribute. Items PORT_REPRESENTOR and REPRESENTED_PORT, in turn, define traffic direction themselves. The former matches traffic from the driver's vNIC. The latter matches packets from either a v-port (network) or a VF's vNIC (if the driver's port is a VF representor). Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 22:59:26 +02:00
Konstantin Ananyev	f9bdee267a	ethdev: hide internal structures Move rte_eth_dev, rte_eth_dev_data, rte_eth_rxtx_callback and related data into private header (ethdev_driver.h). Few minor changes to keep DPDK building after that. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Feifei Wang <feifei.wang2@arm.com>	2021-10-13 22:14:59 +02:00
Konstantin Ananyev	8d7d4fcdca	ethdev: change input parameters for Rx queue count Currently majority of fast-path ethdev ops take pointers to internal queue data structures as an input parameter. While eth_rx_queue_count() takes a pointer to rte_eth_dev and queue index. For future work to hide rte_eth_devices[] and friends it would be plausible to unify parameters list of all fast-path ethdev ops. This patch changes eth_rx_queue_count() to accept pointer to internal queue data as input parameter. While this change is transparent to user, it still counts as an ABI change, as eth_rx_queue_count_t is used by ethdev public inline function rte_eth_rx_queue_count(). Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Feifei Wang <feifei.wang2@arm.com>	2021-10-13 22:14:58 +02:00
Ivan Malov	0ead098383	net/sfc: refine pattern of group flows in tunnel offload By design, in a GROUP flow, outer match criteria go to "ENC" fields of the action rule match specification. The current HW/FW hasn't got support for these fields (except the VXLAN VNI) yet. As a workaround, start parsing the pattern from the tunnel item. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	9df2d8f5cc	net/sfc: support counters in tunnel offload jump rules Such a counter will only report the number of hits, which is actually a sum of two contributions (the JUMP rule's own counter + indirect increments issued by counters of the associated GROUP rules. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	8cd7725169	net/sfc: use action rules in tunnel offload jump rules By design, JUMP flows should be represented solely by the outer rules. But the HW/FW hasn't got support for setting Rx mark from RECIRC_ID on outer rule lookup yet. Neither does it support outer rule counters. As a workaround, an action rule of lower priority is used to do the job. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	8efb2f537e	net/sfc: override match fields in tunnel offload jump rules The current HW/FW doesn't allow to match on MAC addresses in outer rules. One day this will change for sure, but right now a workaround is needed. Match on VLAN presence in outer rules is also unsupported. Ignore it. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	7e5b479803	net/sfc: implement control path operations in tunnel offload Support generic callbacks which callers will invoke to get PMD-specific actions and items used to produce JUMP and GROUP flows and to detect tunnel information. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	012bf708c2	net/sfc: support group flows in tunnel offload GROUP is an in-house term for so-called "tunnel_match" flows. On parsing, they are detected by virtue of PMD-internal item MARK. It associates a given flow with its tunnel context. Such a flow is represented by a MAE action rule which is chained with the corresponding JUMP rule's outer rule by virtue of matching on its recirculation ID. GROUP flows do narrower match than JUMP flows do and decapsulate matching packets (full offload). Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	3a73dcfdb2	common/sfc_efx/base: match on recirc ID in action rules Currently, there is an API for setting recirculation ID in outer rules. Add an API to let action rules match on it. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	93de39f50a	net/sfc: support jump flows in tunnel offload JUMP is an in-house term for so-called "tunnel_set" flows. On parsing, they are identified by virtue of actions MARK (PMD-internal) and JUMP. The action MARK associates a given flow with its tunnel context. Such a flow is represented by a MAE outer rule (OR) which has its recirculation ID set. This ID is also associated with the tunnel context. The OR is supposed to set this ID in 8 high bits of Rx mark in matching packets. It also counts the packets. Packets that hit the OR but miss in action rule (AR) table, should go to MAE admin PF (that is, to DPDK) by default. Support for the use of action COUNT in JUMP flows will be introduced by later patches. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-13 21:30:13 +02:00
Ivan Malov	5cf153e79c	common/sfc_efx/base: support recirculation ID in outer rules When an outer rule is hit, it can pass recirculation ID down to action rule lookup, and action rules can match on this ID instead of matching on the outer rule allocation handle. By default, recirculation ID is assumed to be zero. Add an API to set recirculation ID in outer rules. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-10-13 16:39:11 +02:00
Ivan Malov	53a8051264	net/sfc: fence off 8 bits in Rx mark for tunnel offload Later patches add support for tunnel offload on Riverhead (EF100). A board can host at most 254 tunnels. Partially offloaded (missed) tunnel packets are identified by virtue of 8 high bits in Rx mark. Add basic definitions of the upcoming tunnel offload support and take care of the dedicated bits in Rx mark across the driver. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Andy Moreton <amoreton@xilinx.com>	2021-10-13 16:39:11 +02:00
Hyong Youb Kim	fb92745461	net/enic: fix filter mode detection vnic_dev_capable_filter_mode() currently fails when CMD_CAPABILITY(CMD_ADD_FILTER) returns ERR_EPERM. In turn, this failure causes the driver initialization to fail. But, firmware may legitimately return ERR_EPERM. For example, VF vNIC returns ERR_EPERM when it does not support filtering at all. So, treat ERR_EPERM as "no filtering available" instead of an unexpected error. Fixes: `322b355f21` ("net/enic/base: bring NIC interface functions up to date") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2021-10-13 15:40:50 +02:00

... 2 3 4 5 6 ...

16817 Commits