numam-dpdk

Author	SHA1	Message	Date
Ajit Khaparde	9446d7fcd9	doc: remove obsolete option from bnxt guide host-based-truflow devarg is not used anymore to enable host based flow table management functionality TruFlow. Instead this feature is now driven by a capability indicated by the firmware. TruFlow is not in tech preview anymore. Update the doc accordingly. Fixes: `da3731e2ea` ("net/bnxt: check FW capability to support TRUFLOW") Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-28 19:58:54 +02:00
Ajit Khaparde	0e7bdac71d	doc: update NIC feature matrix for bnxt Support for runtime Rx/Tx queue setup and inner RSS is not updated. Update feature matrix for bnxt PMD. Fixes: `7ed45b1a7c` ("net/bnxt: support RSS hash selection") Fixes: `0105ea1296` ("net/bnxt: support runtime queue setup") Cc: stable@dpdk.org Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-28 19:58:54 +02:00
Ajit Khaparde	ff5d251f7c	net/bnxt: remove stale compilation option Remove a stale compile option from meson build file. RTE_LIBRTE_BNXT_TF sneaked in incorrectly. Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-10-28 19:58:54 +02:00
Dapeng Yu	63741c99a6	net/ice: remove VSI update on DCF reset by PF After DCF is reset by PF, the VSI update service is unable to be completed since the DCF resource is invalid. This patch removes the call to service that updates VSI since it is useless and output too many error messages. Fixes: `c7e1a1a3bf` ("net/ice: refactor DCF VLAN handling") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-29 10:38:21 +02:00
Radu Nicolau	4bcfaf7316	net/iavf: add watchdog for VF FLR Add watchdog to iAVF PMD which support monitoring the VFLR register. If the device is not already in reset then if a VF reset in progress is detected then notify user through callback and set into reset state. If the device is already in reset then poll for completion of reset. The watchdog is disabled by default, to enable it set IAVF_DEV_WATCHDOG_PERIOD to a non zero value (microseconds) Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2021-10-29 04:22:25 +02:00
Radu Nicolau	ccb49b834c	net/iavf: support xstats for inline IPsec crypto Add per queue counters for maintaining statistics for inline IPsec crypto offload, which can be retrieved through the rte_security_session_stats_get() with more detailed errors through the rte_ethdev xstats. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2021-10-29 04:22:15 +02:00
Radu Nicolau	6bc987ecb8	net/iavf: support IPsec inline crypto Add support for inline crypto for IPsec, for ESP transport and tunnel over IPv4 and IPv6, as well as supporting the offload for ESP over UDP, and in conjunction with TSO for UDP and TCP flows. Implement support for rte_security packet metadata Add definition for IPsec descriptors, extend support for offload in data and context descriptor to support Add support to virtual channel mailbox for IPsec Crypto request operations. IPsec Crypto requests receive an initial acknowledgment from physical function driver of receipt of request and then an asynchronous response with success/failure of request including any response data. Add enhanced descriptor debugging Refactor of scalar tx burst function to support integration of offload Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Abhijit Sinha <abhijit.sinha@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Reviewed-by: Jingjing Wu <jingjing.wu@intel.com>	2021-10-29 04:22:04 +02:00
Radu Nicolau	8410842505	net/iavf: support asynchronous virtual channel message Add support for asynchronous virtual channel messages, specifically for inline IPsec messages. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Abhijit Sinha <abhijit.sinha@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2021-10-29 04:19:57 +02:00
Radu Nicolau	1e728b0112	net/iavf: rework Tx path Rework the Tx path and Tx descriptor usage in order to allow for better use of offload flags and to facilitate enabling of inline crypto offload feature. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Abhijit Sinha <abhijit.sinha@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2021-10-29 04:19:28 +02:00
Radu Nicolau	993f0d4d62	common/iavf: support IPsec inline crypto Add support for inline crypto for IPsec. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Abhijit Sinha <abhijit.sinha@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-29 04:15:31 +02:00
David Marchand	e7c727c307	net: fix build with sparse on L2TPv2 bitfields An external project that wants to do additional checks on fields endianness can remap rte_beXX types to instrumented types and use sparse. The current code breaks OVS build with sparse: ../../lib/ofp-packet.c: note: in included file (through .../ovs/dpdk-dir/build/include/rte_flow.h, ../../lib/netdev-dpdk.h, ../../lib/dp-packet.h): .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:92:37: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:93:37: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:94:40: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:95:37: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:96:40: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:97:37: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:98:37: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:99:40: error: invalid bitfield specifier for type restricted ovs_be16. .../ovs/dpdk-dir/build/include/rte_l2tpv2.h:100:39: error: invalid bitfield specifier for type restricted ovs_be16. make[3]: *** [lib/ofp-packet.lo] Error 1 Use simple uint16_t types for bitfields in L2TPv2 struct. Fixes: `3a929df1f2` ("ethdev: support L2TPv2 and PPP procotol") Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-28 20:28:01 +02:00
Andrew Rybchenko	4c96e084fd	app/testpmd: fix MTU configuration before device start There is no point to do rte_eth_dev_mtu_set() before configure since set MTU value is overwritten on configure anyway. So, setting of MTU before configure is rejected now on ethdev level. If testpmd is going to do configure (e.g. just after testpmd start with disabled devices start up or any configuration changes in stopped state which require reconfigure), just save requested MTU in device config to be applied on reconfigure. Fixes: `1bb4a528c4` ("ethdev: fix max Rx packet length") Fixes: `b26bee10ee` ("ethdev: forbid MTU set before device configure") Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-28 20:21:30 +02:00
Jie Wang	684366134c	app/testpmd: fix L2TPv2 message type In "msg_type \|= 0xc800", wider "51200" has high-order bits (0xc800) that don't affect the narrower left-hand side. This patch fixes coverity issue by changing the definition type of "msg_type" from uint8_t to uint16_t. Coverity issue: 373651 Fixes: `748530f035` ("app/testpmd: support L2TPv2 and PPP protocol pattern") Signed-off-by: Jie Wang <jie1x.wang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-10-28 20:04:03 +02:00
Tejasree Kondoj	af5c990935	common/cnxk: fix build with -O1 Fixing build failure with EXTRA_CFLAGS='-O1'. Fixes: `d85f9749f9` ("common/cnxk: add hash generation API") Reported-by: Longfeng Liang <longfengx.liang@intel.com> Signed-off-by: Tejasree Kondoj <ktejasree@marvell.com> Acked-by: Anoob Joseph <anoobj@marvell.com>	2021-10-28 14:54:59 +02:00
Kalesh AP	26ba9e7b91	net/bnxt: fix flow RSS failure handling With commit `239695f754` ("net/bnxt: enhance RSS action support"), when bnxt_hwrm_vnic_rss_cfg() call fails, driver was not setting flow error using "rte_flow_error_set". Fixes: `239695f754` ("net/bnxt: enhance RSS action support") Signed-off-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Reviewed-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-28 06:23:11 +02:00
Ajit Khaparde	43e7d2a30d	net/bnxt: refactor Rx ring cleanup for representors Rx ring for representors does not use aggregation rings for Rx. Instead they use simple software buffers for handling Rx packets. So there is no need to use the same cleanup routine as done by the non-representor code path. Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-27 03:52:02 +02:00
Ajit Khaparde	df07aa22d1	net/bnxt: fix RSS action parser Minor fixes are needed in the RTE_FLOW RSS action parser. 1. Update the comment in the parser to indicate RSS level 1 implies RSS on outer header. 2. RSS action will not be supported if level is > 1. 3. RSS action will not be supported if user or application specifies MARK or COUNT action. 4. If RSS types is not specified i.e., is 0, the best effort RSS should use IPv4 and IPv6 headers. Currently we are considering only IPv4. Fixes: `239695f754` ("net/bnxt: enhance RSS action support") Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-27 03:50:39 +02:00
Kalesh AP	e046deb244	net/bnxt: fix RSS behavior on Thor Move the Rx queue state update before bnxt_setup_one_vnic() is called. For Thor, rxq->rx_started and eth_dev->data->rx_queue_state[] needs to be set for all queues before bnxt_hwrm_vnic_cfg() or bnxt_vnic_rss_configure() are called. Fixes: `0105ea1296` ("net/bnxt: support runtime queue setup") Cc: stable@dpdk.org Signed-off-by: Lance Richardson <lance.richardson@broadcom.com> Signed-off-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Somnath Kotur <somnath.kotur@broadcom.com>	2021-10-27 03:46:43 +02:00
Gregory Etelson	23b0a8b298	net/mlx5: fix integrity item validation and translation Integrity item validation and translation must verify that integrity item bits match L3 and L4 items in flow rule pattern. For cases when integrity item was positioned before L3 header, such verification must be split into two stages. The first stage detects integrity flow item and makes initializations for the second stage. The second stage is activated after PMD completes processing of all flow items in rule pattern. PMD accumulates information about flow items in flow pattern. When all pattern flow items were processed, PMD can apply that data to complete integrity item validation and translation. Fixes: `79f8952783` ("net/mlx5: support integrity flow item") Cc: stable@dpdk.org Signed-off-by: Gregory Etelson <getelson@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-28 10:14:39 +02:00
Gregory Etelson	06741117ec	net/mlx5: fix integrity match on inner and outer headers MLX5 PMD can match on integrity bits for inner and outer headers in a single flow. That means a single flow rule can reference both inner and outer integrity bits. That is implemented by adding 2 flow integrity items to a rule - one item for outer integrity bits and other for inner integrity bits. Integrity item `level` parameter specifies what part is being targeted. Current PMD treated integrity items for outer and inner headers as the same. The patch separates PMD verifications for inner and outer integrity items. Fixes: `79f8952783` ("net/mlx5: support integrity flow item") Cc: stable@dpdk.org Signed-off-by: Gregory Etelson <getelson@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-28 10:14:38 +02:00
Haifei Luo	a7ac7fae49	net/mlx5: enhance flow dump Multiple rules could use the same encap_decap/modify_hdr/counter action. The flow dump data could be duplicated. To avoid redundancy, flow dump value is based on the actions' pointer instead of previous rules' pointer. For counter, the data is stored in cmng of priv->sh. For encap_decap/modify_hdr, the data stored in encaps_decaps/modify_cmds. Traverse the fields and get action's pointer and information. Formats are same for information in the dump except "id" stands for actions' pointer: Counter: rec_type,id,hits,bytes Modify_hdr: rec_type,id,actions_number,actions Encap_decap: rec_type,id,buf Signed-off-by: Haifei Luo <haifeil@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-28 10:14:21 +02:00
Jiawei Wang	3c4338a421	net/mlx5: optimize device spawn time with representors During the device spawn process, mlx5 PMD queried the available flow priorities by calling mlx5_flow_discover_priorities, queried if the DR drop action was supported on the root table by calling the mlx5_flow_discover_dr_action_support routine, and queried the availability of metadata register C by calling mlx5_flow_discover_mreg_c These functions created the test flows to get the supported fields, and at the end destroyed the test flows. The test flows in the first two functions was created on the root table. If the device was spawned with multiple representors, these test flows were created and destroyed on each representor as well. The above operations took a significant amount of init time during the device spawn. This patch optimizes the device discover functions, if there is the device with multiple representors (VF/SF) being spawned, the priority and drop action and metadata register support check can be done only ones and check results can be shared for all representors. Signed-off-by: Jiawei Wang <jiaweiw@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-27 14:04:39 +02:00
Sean Zhang	53712685d7	common/mlx5: optimize debug log Remove debug log inside of mlx5_list_init to avoid flooding debug messages when creating hash list with large actual size. Fixes: `9c373c524b` ("common/mlx5: move list utility from net driver") Cc: stable@dpdk.org Signed-off-by: Sean Zhang <xiazhang@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2021-10-26 17:16:17 +02:00
Rongwei Liu	7299ab6822	net/mlx5: support socket direct mode bonding In socket direct mode, it's possible to bind any two (maybe four in future) PCIe devices with IDs like xxxx:xx:xx.x and yyyy:yy:yy.y. Bonding member interfaces are unnecessary to have the same PCIe domain/bus/device ID anymore, Kernel driver uses "system_image_guid" to identify if devices can be bound together or not. Sysfs "phys_switch_id" is used to get "system_image_guid" of each network interface. OFED 5.4+ is required to support "phys_switch_id". Signed-off-by: Rongwei Liu <rongweil@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-26 13:24:20 +02:00
Rongwei Liu	4c74ad3e16	common/mlx5: support PCIe device GUID query sysfs entry "phys_switch_id" holds each PCIe device' guid. The devices which reside in the same physical NIC should have the same guid. Signed-off-by: Rongwei Liu <rongweil@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2021-10-26 11:26:14 +02:00
Dapeng Yu	627b3c5a39	net/iavf: fix shared data in multi-process The shared pointer is initialized to a static local array defined in the primary process and it shall not be accessed in the secondary process. This patch copies the local data to shared data, to avoid data access violation. Fixes: `040b44551f` ("net/iavf: unify Rx packet type table") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-28 01:52:05 +02:00
Dapeng Yu	20b631efe7	net/ice: fix function pointer in multi-process This patch uses the index value to call the function, instead of the function pointer assignment to save the selection of Receive Flex Descriptor profile ID. Otherwise the secondary process will run with wrong function address from primary process. Fixes: `7a340b0b4e` ("net/ice: refactor Rx FlexiMD handling") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com>	2021-10-27 05:29:39 +02:00
Dapeng Yu	b4f0d4ab66	net/ice: workaround DCF reset failure After DCF is reset by PF, the DCF device un-initialization cannot function normally, ignore the failure does not help since the kernel does not clean up resource. The patch workaround the issue by triggering an additional DCF enable/ disable cycle when a passive reset is detected. Fixes: `1a86f4dbdf` ("net/ice: support DCF device reset") Cc: stable@dpdk.org Signed-off-by: Dapeng Yu <dapengx.yu@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2021-10-27 05:25:25 +02:00
David Marchand	41f2f05574	ethdev: warn once when using port not ready Warning continuously is a pain when developping or if a unit test is/gets broken. It could also be a problem if application behaves badly only in some corner cases and a DoS results of those logs being continuously displayed. Let's warn once per port and per rx/tx. Getting such a log is scary, but let's make it more eye catching by dumping a backtrace with it. Tested by introducing a bug in testpmd: static int eth_dev_start_mp(uint16_t port_id) { - if (is_proc_primary()) + if (!is_proc_primary()) return rte_eth_dev_start(port_id); return 0; Then, running a basic null test: $ ./devtools/test-null.sh ... Start automatic packet forwarding io packet forwarding - ports=2 - cores=1 - streams=2 - NUMA support enabled, MP allocation mode: native Logical Core 1 (socket 0) forwards packets on 2 streams: RX P=0/Q=0 (socket 0) -> TX P=1/Q=0 (socket 0) peer=02:00:00:00:00:01 RX P=1/Q=0 (socket 0) -> TX P=0/Q=0 (socket 0) peer=02:00:00:00:00:00 lcore 0 called rx_pkt_burst for not ready port 0 8: [build/app/dpdk-testpmd() [0x59e839]] 7: [/lib64/libc.so.6(__libc_start_main+0xf5) [0x7ff481b69555]] 6: [build/app/dpdk-testpmd(main+0x54b) [0x662d24]] 5: [build/app/dpdk-testpmd(start_packet_forwarding+0x263) [0x65e795]] 4: [build/app/dpdk-testpmd() [0x65e1be]] 3: [build/app/dpdk-testpmd() [0x65a996]] 2: [build/app/dpdk-testpmd() [0xa6cbc7]] 1: [build/app/dpdk-testpmd(rte_dump_stack+0x27) [0xaee796]] lcore 0 called rx_pkt_burst for not ready port 1 8: [build/app/dpdk-testpmd() [0x59e839]] 7: [/lib64/libc.so.6(__libc_start_main+0xf5) [0x7ff481b69555]] 6: [build/app/dpdk-testpmd(main+0x54b) [0x662d24]] 5: [build/app/dpdk-testpmd(start_packet_forwarding+0x263) [0x65e795]] 4: [build/app/dpdk-testpmd() [0x65e1be]] 3: [build/app/dpdk-testpmd() [0x65a996]] 2: [build/app/dpdk-testpmd() [0xa6cbc7]] 1: [build/app/dpdk-testpmd(rte_dump_stack+0x27) [0xaee796]] io packet forwarding packets/burst=32 nb forwarding cores=1 - nb forwarding ports=2 port 0: RX queue number: 1 Tx queue number: 1 Rx offloads=0x0 Tx offloads=0x0 Fixes: `c87d435a4d` ("ethdev: copy fast-path API into separate structure") Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Tested-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2021-10-27 19:28:45 +02:00
Ferruh Yigit	411878ba25	net/memif: fix driver init with default MTU Driver is using 'ETH_FRAME_LEN' Linux defined value as max frame length, which doesn't include FCS (4 bytes CRC). But ethdev by default uses frame size with FCS when application doesn't define any explicit value. As a result device configuration fails because device is tried to be configured with a frame size length that is bigger than what device reported as supported. Device reports as max supported frame size is 1514 but configured value is 1518. Instead use DPDK macro, 'RTE_ETHER_MAX_LEN', that includes FCS in the driver to report the max supported frame size, this matches to the initial intention. Fixes: `1bb4a528c4` ("ethdev: fix max Rx packet length") Reported-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: David Christensen <drc@linux.vnet.ibm.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-27 17:48:51 +02:00
Ferruh Yigit	4e8a910719	net/af_packet: fix driver init with default MTU Driver is using 'ETH_FRAME_LEN' Linux defined value as max frame length, which doesn't include FCS (4 bytes CRC). But ethdev by default uses frame size with FCS when application doesn't define any explicit value. As a result device configuration fails because device is tried to be configured with a frame size length that is bigger than what device reported as supported. Device reports as max supported frame size is 1514 but configured value is 1518. Instead use DPDK macro, 'RTE_ETHER_MAX_LEN', that includes FCS in the driver to report the max supported frame size, this matches to the initial intention. Fixes: `1bb4a528c4` ("ethdev: fix max Rx packet length") Reported-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-10-27 17:44:49 +02:00
Olivier Matz	9bffc92850	mem: fix dynamic hugepage mapping in container Since its introduction in 2018, the SIGBUS handler was never registered, and all related functions were unused. A SIGBUS can be received by the application when accessing to hugepages even if mmap() was successful, This happens especially when running inside containers when there is not enough hugepages. In this case, we need to recover. A similar scheme can be found in eal_memory.c. Fixes: `582bed1e1d` ("mem: support mapping hugepages at runtime") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-05 15:28:55 +01:00
Ilyes Ben Hamouda	770d41bf33	malloc: fix allocation with unknown socket ID When using rte_malloc() from a thread which is not bound to a numa socket (the typical case is a control thread, but it can also happen on a dataplane thread if its cpu affinity is on cores attached to several sockets), the used heap is the one from numa socket 0, which may not have available memory. Fix this by selecting the first socket which has available memory. Note: malloc_get_numa_socket() is only used from one .c file, so move it there, and remove the inline keyword. Fixes: `b94580d688` ("malloc: avoid unknown socket id") Cc: stable@dpdk.org Signed-off-by: Ilyes Ben Hamouda <ilyes.ben_hamouda@6wind.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-11-05 15:28:49 +01:00
David Hunt	bb0bd346d5	eal: suggest using --lcores option If the user requests to use an lcore above 128 using -l, the eal will exit with "EAL: invalid core list syntax" and very little else useful information. This patch adds some extra information suggesting to use --lcores so that physical cores above RTE_MAX_LCORE (default 128) can be used. This is achieved by using the --lcores option by mapping the logical cores in the application to physical cores. For example, if "-l 12-16,130,132" is used, we see the following additional output on the command line: EAL: lcore 132 >= RTE_MAX_LCORE (128) EAL: lcore 133 >= RTE_MAX_LCORE (128) EAL: To use high physical core ids, please use --lcores to map them to lcore ids below RTE_MAX_LCORE, EAL: e.g. --lcores 0@12,1@13,2@14,3@15,4@16,5@132,6@133 The same is added to -c option parsing. For example, if "-c 0x300000000000000000000000000000000" is used, we see the following additional output on the command line: EAL: lcore 128 >= RTE_MAX_LCORE (128) EAL: lcore 129 >= RTE_MAX_LCORE (128) EAL: To use high physical core ids, please use --lcores to map them to lcore ids below RTE_MAX_LCORE, EAL: e.g. --lcores 0@128,1@129 Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-11-05 14:39:37 +01:00
Sean Zhang	80a3233197	app/flow-perf: add destination ports parameter Add optional destination ports parameter for port-id action. The parameter is not must, and the value is 1 by default as before if the parameter not provided. For example: $ dpdk-test-flow-perf -w 08:00.0,representor=[0,1] -- --transfer \ --ingress --transfer --ether --portmask=0x2 --vxlan-encap \ --port-id=0 This command means the rule created on representor 0 with port 0 as destination, since the portmask is 0x2 and dst-ports is 0: $ dpdk-test-flow-perf -w 08:00.0,representor=[0,1] \ -w 08:00.1,representor=[0,1]-- --transfer --ingress --transfer \ --ether --portmask=0x12 --vxlan-encap --port-id=0,3 This command means the rules created on both representor 0 of PF 0 and PF 1, the destination port for the first representor is PF 0, and the destination port for the other one is PF 1. Signed-off-by: Sean Zhang <xiazhang@nvidia.com> Reviewed-by: Wisam Jaddo <wisamm@nvidia.com>	2021-11-05 10:58:25 +01:00
David Marchand	f5fa0e110f	eal: promote non-EAL lcore API as stable This API has been around for more than a year (and is in LTS 20.11). It did not receive negative feedback and will be used in a next OVS release. Mark it stable. Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Kevin Traynor <ktraynor@redhat.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2021-11-04 22:57:58 +01:00
Konstantin Ananyev	65d9b7c664	bpf: fix convert API when libpcap missing rte_bpf_convert() implementation depends on libpcap. Right now it is defined only when this library is installed and RTE_PORT_PCAP is defined. Fix that by providing for such case stub rte_bpf_convert() implementation that will always return an error. To draw user attention, if proper implementation is disabled, warning will be thrown at meson configure stage. Also move stub for another function (rte_bpf_elf_load) into the same place (bpf_stub.c). Fixes: `2eccf6afbe` ("bpf: add function to convert classic BPF to DPDK BPF") Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-11-04 19:56:20 +01:00
Konstantin Ananyev	7b0a120157	bpf: fix doxygen comment Fix typo in doxygen comments for rte_bpf_convert(). Fixes: `2eccf6afbe` ("bpf: add function to convert classic BPF to DPDK BPF") Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-11-04 19:56:14 +01:00
David Marchand	54abd300d5	pipeline: remove unreachable branch A previous change blamed it on compiler/ASan, while this is a real (yet minor) issue. This return -EINVAL is never reached since we test all combinations of fidx and fcin booleans. All branches end up with a return 0, factorize them. Fixes: `84f5ac9418` ("pipeline: fix build with ASan") Fixes: `f38913b7fb` ("pipeline: add meter array to SWX") Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-11-04 18:11:08 +01:00
Yogesh Jangra	2ce3ccbe44	pipeline: fix dead code Fix minor dead code issue reported by Coverity. Coverity issue: 373653 Fixes: e9d870 ("pipeline: add SWX pipeline tables") Signed-off-by: Yogesh Jangra <yogesh.jangra@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-11-04 16:43:27 +01:00
Wojciech Liguzinski	5dbbc2ab4a	examples/ip_pipeline: support PIE congestion management Adding the PIE support for IP Pipeline Signed-off-by: Wojciech Liguzinski <wojciechx.liguzinski@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Acked-by: Jasvinder Singh <jasvinder.singh@intel.com>	2021-11-04 15:43:43 +01:00
Wojciech Liguzinski	06135957c4	examples/qos_sched: support PIE congestion management patch add support enable PIE or RED by parsing config file. Signed-off-by: Wojciech Liguzinski <wojciechx.liguzinski@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Acked-by: Jasvinder Singh <jasvinder.singh@intel.com>	2021-11-04 15:43:24 +01:00
Wojciech Liguzinski	44c730b0e3	sched: add PIE based congestion management Implement PIE based congestion management based on rfc8033. The Proportional Integral Controller Enhanced (PIE) algorithm works by proactively dropping packets randomly. PIE is implemented as more advanced queue management is required to address the bufferbloat problem and provide desirable quality of service to users. Tests for PIE code added to test application. Added PIE related information to documentation. Signed-off-by: Wojciech Liguzinski <wojciechx.liguzinski@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Acked-by: Jasvinder Singh <jasvinder.singh@intel.com>	2021-11-04 15:41:49 +01:00
David Marchand	f2777b53b1	bus/pci: fix use after free on unplug rte_pci_unmap_device() needs intr_handle objects to unregister callbacks. Bugzilla ID: 845 Fixes: `d61138d4f0` ("drivers: remove direct access to interrupt handle") Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Yan Xia <yanx.xia@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-11-04 15:13:41 +01:00
David Marchand	5633173341	eal/linux: fix device hotplug The device event interrupt handler was always freed. Bugzilla ID: 845 Fixes: `c2bd9367e1` ("lib: remove direct access to interrupt handle") Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Yan Xia <yanx.xia@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-11-04 15:13:41 +01:00
David Marchand	4847122aab	eal/linux: fix uevent message parsing Caught with ASan: ==9727==ERROR: AddressSanitizer: stack-buffer-overflow on address 0x7f0daa2fc0d0 at pc 0x7f0daeefacb2 bp 0x7f0daa2fadd0 sp 0x7f0daa2fa578 READ of size 1 at 0x7f0daa2fc0d0 thread T1 #0 0x7f0daeefacb1 (/lib64/libasan.so.5+0xbacb1) #1 0x115eba1 in dev_uev_parse ../lib/eal/linux/eal_dev.c:167 #2 0x115f281 in dev_uev_handler ../lib/eal/linux/eal_dev.c:248 #3 0x1169b91 in eal_intr_process_interrupts ../lib/eal/linux/eal_interrupts.c:1026 #4 0x116a3a2 in eal_intr_handle_interrupts ../lib/eal/linux/eal_interrupts.c:1100 #5 0x116a7f0 in eal_intr_thread_main ../lib/eal/linux/eal_interrupts.c:1172 #6 0x112640a in ctrl_thread_init ../lib/eal/common/eal_common_thread.c:202 #7 0x7f0dade27159 in start_thread (/lib64/libpthread.so.0+0x8159) #8 0x7f0dadb58f72 in clone (/lib64/libc.so.6+0xfcf72) Address 0x7f0daa2fc0d0 is located in stack of thread T1 at offset 4192 in frame #0 0x115f0c9 in dev_uev_handler ../lib/eal/linux/eal_dev.c:226 This frame has 2 object(s): [32, 48) 'uevent' [96, 4192) 'buf' <== Memory access at offset 4192 overflows this variable HINT: this may be a false positive if your program uses some custom stack unwind mechanism or swapcontext (longjmp and C++ exceptions are supported) Thread T1 created by T0 here: #0 0x7f0daee92ea3 in __interceptor_pthread_create (/lib64/libasan.so.5+0x52ea3) #1 0x1126542 in rte_ctrl_thread_create ../lib/eal/common/eal_common_thread.c:228 #2 0x116a8b5 in rte_eal_intr_init ../lib/eal/linux/eal_interrupts.c:1200 #3 0x1159dd1 in rte_eal_init ../lib/eal/linux/eal.c:1044 #4 0x7a22f8 in main ../app/test-pmd/testpmd.c:4105 #5 0x7f0dada7f802 in __libc_start_main (/lib64/libc.so.6+0x23802) Bugzilla ID: 792 Fixes: `0d0f478d04` ("eal/linux: add uevent parse and process") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Yan Xia <yanx.xia@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-11-04 15:13:41 +01:00
Jim Harris	628bac7df1	eal/linux: remove unused variable for socket memory clang-13 rightfully complains that the total_mem variable in eal_parse_socket_arg is set but not used, since the final accumulated total_mem result isn't used anywhere. So just remove the total_mem variable. Fixes: `0a703f0f36` ("eal/linux: fix parsing zero socket memory and limits") Signed-off-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-11-04 13:27:18 +01:00
Vladimir Medvedkin	11c5b9b51a	fib: add RIB extension size parameter This patch adds a new parameter to the FIB configuration to specify the size of the extension for internal RIB structure. Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Tested-by: Conor Walsh <conor.walsh@intel.com>	2021-11-04 12:38:03 +01:00
Xueming Li	fc382022c6	eal: fix device iterator when no bus is selected Devargs used in device iterator initialization wasn't set to zero, random data like bus string lead to invalid address access. This patch initializes devargs. Bugzilla ID: 862 Fixes: `c99a2d4c6b` ("eal: implement device iteration initialization") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@nvidia.com>	2021-11-04 11:44:49 +01:00
Vladimir Medvedkin	adeca6685f	hash: fix use after free in Toeplitz hash This patch fixes use after free in thash library, reported by ASAN. Bugzilla ID: 868 Fixes: `28ebff11c2` ("hash: add predictable RSS") Cc: stable@dpdk.org Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-11-04 11:43:20 +01:00


30603 Commits