numam-dpdk

Author	SHA1	Message	Date
Bruce Richardson	2b403e027b	eal: fix thread header include rte_thread.h was missing the compat header to get the __rte_experimental macro definition. Fixes: `b1fd151267` ("eal: add generic thread-local-storage functions") Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-01-21 10:21:40 +01:00
Bruce Richardson	762bfccc8a	config: remove compatibility build defines As announced in the deprecation note, remove all compatibility build defines from previous make/meson versions and use only the standardized ones - RTE_LIB_<name> for libraries, and RTE_<CLASS>_<NAME> for drivers. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-01-20 01:43:25 +01:00
Alexander Kozyrev	73b68f4c54	ethdev: introduce generic modify flow action Implement the generic modify flow API to allow manipulations on an arbitrary header field (as well as mark, metadata or tag) using data from another field or a user-specified value. This generic modify mechanism removes the necessity to implement a separate RTE Flow action every time we need to modify a new packet field in the future. Supported operation are: - set: copy data from source to destination. - add: integer addition, stores the result in destination. - sub: integer subtraction, stores the result in destination. The field ID is used to specify the desired source/destination packet field in order to simplify the API for various encapsulation models. Specifying the packet field ID with the needed encapsulation level is able to quickly get a packet field for any inner packet header. Alternatively, the special ID (ITEM_START) can be used to point to the very beginning of a packet. This ID in conjunction with the offset parameter provides great flexibility to copy/modify any part of a packet as needed. The number of bits to use from a source as well as the offset can be be specified to allow a partial copy or dividing a big packet field into multiple small fields (e.g. copying 128 bits of IPv6 to 4 tags). An immediate value (or a pointer to it) can be specified instead of the level and the offset for the special FIELD_VALUE ID (or FIELD_POINTER). Can be used as a source only. Signed-off-by: Alexander Kozyrev <akozyrev@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-01-19 03:30:32 +01:00
Bruce Richardson	26fe208ad8	ethdev: avoid blocking telemetry for link status When querying the link status via telemetry interface, we don't want the client to have to wait for multiple seconds for a reply. Therefore use "rte_eth_link_get_nowait()" rather than "rte_eth_link_get()" in the telemetry callback. Fixes: `c190daedb9` ("ethdev: add telemetry callbacks") Cc: stable@dpdk.org Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Ciara Power <ciara.power@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2021-01-19 03:30:32 +01:00
Shiri Kuzin	2b4c72b4d1	ethdev: introduce GENEVE header TLV option item The Geneve tunneling protocol is designed to allow the user to specify some data context on the packet. The GENEVE TLV (Type-Length-Variable) Option is the mean intended to present the user data. In order to support GENEVE TLV Option the new rte_flow item "rte_flow_item_geneve_opt" is added. The new item contains the values and masks for the following fields: -option class -option type -length -data New item will be added to testpmd to support match and raw encap/decap actions. Signed-off-by: Shiri Kuzin <shirik@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-01-19 03:30:15 +01:00
Steve Yang	bf0f90d92d	ethdev: fix max Rx packet length check Ethdev is using default Ethernet overhead to decide if provided 'max_rx_pkt_len' value is bigger than max (non jumbo) MTU value, and limits it to MAX if it is. Since the application/driver used Ethernet overhead is different than the ethdev one, check result is wrong. If the driver is using Ethernet overhead bigger than the default one, the provided 'max_rx_pkt_len' is trimmed down, and in the driver when correct Ethernet overhead is used to convert back, the resulting MTU is less than the intended one, causing some packets to be dropped. Like, app -> max_rx_pkt_len = 1500/mtu/ + 22/overhead/ = 1522 ethdev -> 1522 > 1518/MAX/; max_rx_pkt_len = 1518 driver -> MTU = 1518 - 22 = 1496 Packets with size 1497-1500 are dropped although intention is to be able to send/receive them. The fix is to make ethdev use the correct Ethernet overhead for port, instead of default one. Fixes: `59d0ecdbf0` ("ethdev: MTU accessors") Cc: stable@dpdk.org Signed-off-by: Steve Yang <stevex.yang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-01-19 03:30:14 +01:00
Jeff Guo	759364892a	ethdev: add eCPRI tunnel type Add type of RTE_TUNNEL_TYPE_ECPRI into the enum of ethdev tunnel type. Signed-off-by: Jeff Guo <jia.guo@intel.com> Reviewed-by: Qi Zhang <qi.z.zhang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-01-19 03:30:13 +01:00
Viacheslav Ovsiienko	1d94d8042a	ethdev: update GTP headers This patch introduces the GTP header individual flag bit fields and the header optional word with N-PDU number, Sequence Number and Next Extension Header type. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-01-19 03:30:13 +01:00
Abhinandan Gujjar	1c3ffb9559	cryptodev: add enqueue and dequeue callbacks This patch adds APIs to add/remove callback functions on crypto enqueue/dequeue burst. The callback function will be called for each burst of crypto ops received/sent on a given crypto device queue pair. Signed-off-by: Abhinandan Gujjar <abhinandan.gujjar@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2021-01-19 18:05:44 +01:00
Stephen Hemminger	ce7b11ad99	pdump: cleanup logs and variables Checkpatch prefers 'unsigned int' over bare 'unsigned'. Reword the error messages for brevity and clarity so they don't have to be split across multiple lines. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-01-19 15:24:46 +01:00
Stephen Hemminger	2d30d42c0f	pdump: replace constant for device name size The device string has an existing size in rte_dev.h use that instead of defining our own. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-01-19 15:24:46 +01:00
Stephen Hemminger	f82067c4d5	pdump: free mbuf in bulk Use rte_pktmbuf_free_bulk instead of loop when freeing packets. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2021-01-19 15:24:46 +01:00
Anatoly Burakov	27ff8384de	fbarray: fix overlap check When we're attaching fbarrays in secondary processes, we check for whether the intended memory address for the fbarray is already in use by some other, local fbarray. However, the check for end-overlap (i.e. to see if our memory area's end overlaps with some other fbarray) is incorrectly counting end offset as part of the overlap. Fix the check. Fixes: `5b61c62cfd` ("fbarray: add internal tailq for mapped areas") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Tested-by: Zhihong Peng <zhihongx.peng@intel.com>	2021-01-19 13:12:59 +01:00
Anatoly Burakov	c691506798	eal/linux: improve no hugepages logging When no hugepages are found, we log a message about it, but we never specify on which node. We also implicitly declare the page size based on the directory name, but that's not very user friendly. Fix both by changing the text of the message to note the NUMA node (if applicable) and explicitly mention page size in kilobytes. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-01-19 13:12:59 +01:00
Liang Ma	1fe3eef5e9	ethdev: add simple power management API Add a simple API to allow getting the monitor conditions for power-optimized monitoring of the Rx queues from the PMD, as well as release notes information. Signed-off-by: Liang Ma <liang.j.ma@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-01-19 00:00:04 +01:00
Anatoly Burakov	55243435f5	eal: add monitor wakeup function Now that we have everything in a C file, we can store the information about our sleep, and have a native mechanism to wake up the sleeping core. This mechanism would however only wake up a core that's sleeping while monitoring - waking up from `rte_power_pause` won't work. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-01-18 23:59:42 +01:00
Anatoly Burakov	1b36fc280a	eal: remove sync version of power monitor Currently, the "sync" version of power monitor intrinsic is supposed to be used for purposes of waking up a sleeping core. However, there are better ways to achieve the same result, so remove the unneeded function. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-01-18 23:59:30 +01:00
Anatoly Burakov	6a17919b0e	eal: change power intrinsics API Instead of passing around pointers and integers, collect everything into struct. This makes API design around these intrinsics much easier. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-01-18 23:59:12 +01:00
Anatoly Burakov	68fbbb8369	eal: avoid invalid power intrinsics API usage Currently, the API documentation mandates that if the user wants to use the power management intrinsics, they need to call the `rte_cpu_get_intrinsics_support` API and check support for specific intrinsics. However, if the user does not do that, it is possible to get illegal instruction error because we're using raw instruction opcodes, which may or may not be supported at runtime. Now that we have everything in a C file, we can check for support at startup and prevent the user from possibly encountering illegal instruction errors. We also add return values to the API's as well, because why not. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-01-18 23:58:24 +01:00
Anatoly Burakov	56833cbd35	eal: uninline power intrinsics Currently, power intrinsics are inline functions. Make them part of the ABI so that we can have various internal data associated with them without exposing said data to the outside world. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-01-18 23:57:38 +01:00
Narcisa Vasile	e03d2c3c70	cfgfile: build on Windows The librte_cfgfile lib is functional on Windows. Enable compilation of this lib for Windows. Signed-off-by: Narcisa Vasile <navasile@linux.microsoft.com>	2021-01-17 23:21:14 +01:00
Nithin Dabilpuram	d70e87907a	bitmap: support 128-byte cacheline in empty check Currently bitmap line not empty check API assumes cache line of 64B and only checks 8 slabs. Since in 128B cacheline, we have 16 slabs per cacheline, rte_bitmap_clear() will mark complete line as empty as soon as 8 slabs are empty thereby breaking bitmap scan functionality. Fix it by defining new __rte_bitmap_line_not_empty() for 128B cacheline platform. Signed-off-by: Nithin Dabilpuram <ndabilpuram@marvell.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-01-17 22:40:15 +01:00
Long Li	1fef6ced07	eal/linux: allow multiple starts of event monitor In some cases, a device or infrastructure may want to enable hotplug but application may also try and start hotplug as well. Therefore change the monitor_started from a boolean into a reference count. Signed-off-by: Long Li <longli@microsoft.com>	2021-01-17 22:37:28 +01:00
Tyler Retzlaff	56446c913f	eal/windows: fix C++ compatibility Explicitly cast void * to type * so that EAL headers may be compiled as C or C++. Fixes: `e8428a9d89` ("eal/windows: add some basic functions and macros") Cc: stable@dpdk.org Signed-off-by: Tyler Retzlaff <roretzla@linux.microsoft.com>	2021-01-17 18:27:48 +01:00
Olivier Matz	de6aede17b	service: propagate init error in EAL Currently, when rte_service_init() fails at initialization, the application always gets a ENOEXEC error code. For example, with testpmd, this is displayed as: Cannot init EAL: Exec format error This error code does not describe the real issue. Instead, use the error code returned by the function. Fixes: `e398245008` ("service: initialize with EAL") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2021-01-15 16:32:19 +01:00
Tyler Retzlaff	aab8be4497	eal/windows: build reciprocal division functions Build rte_reciprocal.c and export the following functions on windows: * rte_reciprocal_value * rte_reciprocal_value_u64 Signed-off-by: Tyler Retzlaff <roretzla@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-01-15 15:04:03 +01:00
Yicai Lu	324242fb51	ip_frag: remove padding length of fragment In some situations, we would get several ip fragments, which total data length is less than min_ip_len(64) and padding with zeros. We simulated intermediate fragments by modifying the MTU. To illustrate the problem, we simplify the packet format and ignore the impact of the packet header.In namespace2, a packet whose data length is 1520 is sent. When the packet passes tap2, the packet is divided into two fragments: fragment A and B, similar to (1520 = 1510 + 10). When the packet passes tap3, the larger fragment packet A is divided into two fragments A1 and A2, similar to (1510 = 1500 + 10). Finally, the bond interface receives three fragments: A1, A2, and B (1520 = 1500 + 10 + 10). One fragmented packet A2 is smaller than the minimum Ethernet frame length, so it needs to be padded. \|---------------------------------------------------\| \| HOST \| \| \|--------------\| \|----------------------------\| \| \| \| ns2 \| \| \|--------------\| \| \| \| \| \|--------\| \| \| \|--------\| \|--------\| \| \| \| \| \| tap1 \| \| \| \| tap2 \| ns1\| tap3 \| \| \| \| \| \|mtu=1510\| \| \| \|mtu=1510\| \|mtu=1500\| \| \| \| \|--\|1.1.1.1 \|--\| \|--\|1.1.1.2 \|----\|2.1.1.1 \|--\| \| \| \|--------\| \|--------\| \|--------\| \| \| \| \| \| \| \| \|-----------------\| \| \| \| \| \| \| \|--------\| \| \| \| bond \| \| \|--------------------------------------\|mtu=1500\|---\| \|--------\| When processing the preceding packets above, DPDK would aggregate fragmented packets A2 and B. And error packets are generated, which padding(zero) is displayed in the middle of the packet. A2 + B: 0000 fa 16 3e 9f fb 82 fa 47 b2 57 dc 20 08 00 45 00 0010 00 33 b4 66 00 ba 3f 01 c1 a5 01 01 01 01 02 01 0020 01 02 c0 c1 c2 c3 c4 c5 c6 c7 00 00 00 00 00 00 0030 00 00 00 00 00 00 00 00 00 00 00 00 c8 c9 ca cb 0040 cc cd ce cf d0 d1 d2 d3 d4 d5 d6 d7 d8 d9 da db 0050 dc dd de df e0 e1 e2 e3 e4 e5 e6 So, we would calculate the length of padding, and remove the padding in pkt_len and data_len before aggregation. And also we have the fix for both ipv4 and ipv6. Fixes: `7f0983ee33` ("ip_frag: check fragment length of incoming packet") Cc: stable@dpdk.org Signed-off-by: Yicai Lu <luyicai@huawei.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-01-15 11:31:28 +01:00
Yi Yang	76f093948f	gso: support VXLAN UDP/IPv4 As most NICs do not support segmentation for VXLAN-encapsulated UDP/IPv4 packets, this patch adds VXLAN UDP/IPv4 GSO support. Signed-off-by: Yi Yang <yangyi01@inspur.com> Acked-by: Jiayu Hu <jiayu.hu@intel.com>	2021-01-15 11:31:28 +01:00
Pallavi Kadam	edd66d57d5	eal/windows: add random function The file rte_random.c is required to build i40e PMD on Windows. Add rte_rand variable to export file. Redefine _m_prefetchw for Clang toolchain due to following error with respect to conflicting types: FAILED: lib/76b5a35@@rte_eal@sta/librte_eal_common_rte_random.c.obj clang @lib/76b5a35@@rte_eal@sta/librte_eal_common_rte_random.c.obj.rsp In file included from ../lib/librte_eal/common/rte_random.c:13: In file included from ..\lib/librte_eal/include\rte_eal.h:20: In file included from ..\lib/librte_eal/include\rte_per_lcore.h:25: In file included from ..\lib/librte_eal/windows/include\pthread.h:21: In file included from ..\lib/librte_eal/windows/include\rte_windows.h:27: In file included from C:\Program Files (x86)\Windows Kits\10\include\ 10.0.18362.0\um\windows.h:171: In file included from C:\Program Files (x86)\Windows Kits\10\include\ 10.0.18362.0\shared\windef.h:24: In file included from C:\Program Files (x86)\Windows Kits\10\include\ 10.0.18362.0\shared\minwindef.h:182: C:\Program Files (x86)\Windows Kits\10\include\10.0.18362.0\um\ winnt.h:3324:1: error: conflicting types for '_m_prefetchw' _m_prefetchw ( ^ C:\Program Files\LLVM\lib\clang\10.0.0\include\prfchwintrin.h:50:1: note: previous definition is here _m_prefetchw(void *__P) ^ 1 error generated. Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com>	2021-01-14 23:21:40 +01:00
Jiayu Hu	1b7b24389c	vhost: enhance async enqueue for small packets Async enqueue offloads large copies to DMA devices, and small copies are still performed by the CPU. However, it requires users to get enqueue completed packets by rte_vhost_poll_enqueue_completed(), even if they are completed by the CPU when rte_vhost_submit_enqueue_burst() returns. This design incurs extra overheads of tracking completed pktmbufs and function calls, thus degrading performance on small packets. This patch enhances async enqueue for small packets by enabling rte_vhost_submit_enqueue_burst() to return completed packets. Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Tested-by: Yinan Wang <yinan.wang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-13 18:51:58 +01:00
Jiayu Hu	9ab57ef196	vhost: cleanup async enqueue This patch removes unnecessary check and function calls, and it changes appropriate types for internal variables and fixes typos. Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Tested-by: Yinan Wang <yinan.wang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-13 18:51:58 +01:00
Ruifeng Wang	67b68824a8	lpm/arm: support SVE Added new path to do lpm4 lookup by using scalable vector extension. The SVE path will be selected if compiler has flag SVE set. Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com>	2021-01-14 16:42:25 +01:00
Ruifeng Wang	5702b7bf1c	lpm: fix vector IPv4 lookup rte_lpm_lookupx4 could return wrong next hop when more than 256 tbl8 groups are created. This is caused by incorrect type casting of tbl8 group index that been stored in tbl24 entry. The casting caused group index truncation and hence wrong tbl8 group been searched. Issue fixed by applying proper mask to tbl24 entry to get tbl8 group index. Fixes: `dc81ebbaca` ("lpm: extend IPv4 next hop field") Fixes: `cbc2f1dccf` ("lpm/arm: support NEON") Fixes: `d2cc795934` ("lpm: add AltiVec for ppc64") Cc: stable@dpdk.org Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Tested-by: David Christensen <drc@linux.vnet.ibm.com> Acked-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com>	2021-01-14 14:19:57 +01:00
Vladimir Medvedkin	6e4d4a6381	fib6: improve AVX512 lookup performance Improved performance for AVX512 FIB6 lookup by doubling the number of flows being processed Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-01-13 22:13:37 +01:00
Ori Kam	1922db13bf	regexdev: add resource limit reached flag When scanning a buffer it is possible that the scan will abort due to some internal resource limit. This commit adds such response flag, so application can handle such cases. Signed-off-by: Francis Kelly <fkelly@nvidia.com> Signed-off-by: Ori Kam <orika@nvidia.com>	2021-01-12 23:31:39 +01:00
Tal Shnaiderman	b1fd151267	eal: add generic thread-local-storage functions Add support for TLS functionality in EAL. The following functions are added: rte_thread_tls_key_create - create a TLS data key. rte_thread_tls_key_delete - delete a TLS data key. rte_thread_tls_value_set - set value bound to the TLS key rte_thread_tls_value_get - get value bound to the TLS key TLS key is defined by the new type rte_tls_key. The API allocates the thread local storage (TLS) key. Any thread of the process can subsequently use this key to store and retrieve values that are local to the thread. Those functions are added in addition to TLS capability in rte_per_lcore.h to allow abstraction of the pthread layer for all operating systems. Windows implementation is under librte_eal/windows and implemented using WIN32 API for Windows only. Unix implementation is under librte_eal/unix and implemented using pthread for UNIX compilation. Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2021-01-11 23:28:12 +01:00
Tal Shnaiderman	d136fae560	eal: move thread affinity functions to new file Move the definition of the functions rte_thread_set_affinity and rte_thread_get_affinity to new file, rte_thread.h The file will implement generic threading functionality and will only host threading functions which do not reference pthread API. Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-01-11 23:27:39 +01:00
Maxime Coquelin	be1525c6b4	vhost: refactor memory regions mapping This patch moves memory region mmaping and related preparation in a dedicated function in order to simplify VHOST_USER_SET_MEM_TABLE request handling function. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-01-08 18:07:56 +01:00
Maxime Coquelin	761ea501ce	vhost: refactor postcopy registration This patch moves the registration of postcopy to a dedicated function, with the goal of simplifying VHOST_USER_SET_MEM_TABLE request handling function. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-01-08 18:07:56 +01:00
Maxime Coquelin	fc2225dbc5	vhost: refactor postcopy region registration This patch moves the registration of memory regions to userfaultfd to a dedicated function, with the goal of simplifying VHOST_USER_SET_MEM_TABLE request handling function. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-01-08 18:07:56 +01:00
Joyce Kong	a33c3584f3	vhost: replace SMP with thread fence for control path Simply replace the smp barriers with atomic thread fence for vhost control path, if there are no synchronization points. Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:56 +01:00
Joyce Kong	5faf0a9c54	vhost: replace SMP with thread fence for packed vring Simply replace smp barriers with atomic thread fence for virtio packed vring. Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Joyce Kong	10b8c36af0	vhost: relax full barriers for used idx Used idx can be synchronized by one-way barrier instead of full write barrier for split vring. Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Joyce Kong	9253c34cfb	vhost: relax full barriers for desc flags Relax the full read barrier to one-way barrier for desc flags in packed vring. Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Joyce Kong	2d031675b2	vhost: remove unnecessary SMP barrier for avail idx The ordering between avail index and desc reads has been enforced by load-acquire for split vring, so smp_rmb barrier is not needed behind it. Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Joyce Kong	8fc9eaaac7	vhost: remove unnecessary SMP barrier for desc flags As function desc_is_avail performs a load-acquire barrier to enforce the ordering between desc flags and desc content, it is unnecessary to add a rte_smp_rmb barrier around the trace which follows desc_is_avail. Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-01-08 18:07:55 +01:00
Joyce Kong	4aae2397ad	rcu: use EAL memory barrier API Use rte_atomic_thread_fence wrapper which has been provided for __atomic_thread_fence builtins to support optimized code for __ATOMIC_SEQ_CST memory order on x86 platforms. Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2021-01-11 15:34:21 +01:00
Ashish Sadanandan	397fb6a8d9	mbuf: add C++ include guard for dynamic fields header The header was missing the extern "C" directive which causes name mangling of functions by C++ compilers, leading to linker errors complaining of undefined references to these functions. Fixes: `4958ca3a44` ("mbuf: support dynamic fields and flags") Cc: stable@dpdk.org Signed-off-by: Ashish Sadanandan <ashish.sadanandan@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2021-01-11 15:34:21 +01:00
Yunjian Wang	e3e9c87c0f	eal/linux: fix handling of error events from epoll The "rev->epdata.event" assigned to "events.epdata.event" directly, which was wrong in case of epoll events. It should be set to the "evs.events". Fixes: `9efe9c6cdc` ("eal/linux: add epoll wrappers") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Harman Kalra <hkalra@marvell.com>	2021-01-11 15:34:21 +01:00
Pallavi Kadam	aacc29dacd	eal/windows: add interrupt functions stub Add some missing interrupt implementations on Windows. Also add respective functions to export file. Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Narcisa Vasile <navasile@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-01-05 22:45:00 +01:00
Vladimir Medvedkin	e682b02084	rib: fix insertion in some cases According to GCC documentation for __builtin_clz: Returns the number of leading 0-bits in x, starting at the most significant bit position. If x is 0, the result is undefined. __builtin_clz will be called with 0 if the existing prefix address matches the one we want to insert. Fixes: `5a5793a5ff` ("rib: add RIB library") Cc: stable@dpdk.org Reported-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com>	2020-12-15 10:08:39 +01:00
Nick Connolly	498ec3c6da	eal/windows: fix vfprintf warning with clang When building with clang (11.0,--buildtype=debug), eal_lcore.c produces a -Wformat-nonliteral warning from the vfprintf call in log_early. Add __rte_format_printf annotation. Fixes: `b8a36b0866` ("eal/windows: improve CPU and NUMA node detection") Cc: stable@dpdk.org Suggested-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Pallavi Kadam <pallavi.kadam@intel.com>	2020-12-07 21:24:57 +01:00
Nick Connolly	a7288328a9	eal/windows: fix debug build with MinGW Compiling with MinGW in --buildtype=debug produces a redefinition error for strncasecmp. The root cause is that rte_os.h shouldn't be injecting POSIX definitions into the environment. It is the applications responsibility to decide how to handle missing functionality. Resolving this properly will require further work, but in the meantime wrap all such definitions with #ifndef/#endif. This resolves the specific issue with strncasecmp and handles similar issues that applications may encounter. Fixes: `e8428a9d89` ("eal/windows: add some basic functions and macros") Cc: stable@dpdk.org Reported-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Nick Connolly <nick.connolly@mayadata.io>	2020-12-07 20:46:33 +01:00
Dmitry Kozlyuk	93bf432dd5	eal/windows: fix build with MinGW-w64 8 MinGW-w64 above 8.0.0 exposes VirtualAlloc2() API in headers, but lacks it in import libraries. Hence, availability of this API at compile-time can't be used to choose between locating VirtualAlloc2() manually or relying on the dynamic linker. Fix redefinition compile-time errors. Always link VirtualAlloc2() when using GCC. Fixes: `2a5d547a4a` ("eal/windows: implement basic memory management") Cc: stable@dpdk.org Reported-by: Thomas Monjalon <thomas@monjalon.net> Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Thomas Monjalon <thomas@monjalon.net>	2020-12-07 14:00:22 +01:00
Andrew Rybchenko	8ca9bf26f5	ethdev: deprecate shared counters using action attribute A new generic shared actions API may be used to create shared counter. There is no point to keep duplicate COUNT action specific capability to create shared counters. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Ori Kam <orika@nvidia.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-11-27 19:16:45 +01:00
Ruifeng Wang	d123fd111a	eal/arm: fix build with gcc optimization level 0 GCC build with '-O0' on platforms with RTE_ARM_FEATURE_ATOMICS set failed for: ../lib/librte_efd/rte_efd.c Assembler messages: 3866: Error: selected processor does not support `crc32cb w0,w0,w1' 3890: Error: selected processor does not support `crc32ch w0,w0,w1' 3914: Error: selected processor does not support `crc32cw w0,w0,w1' 3938: Error: selected processor does not support `crc32cx w0,w0,x1' This was caused by an architecture specifier added for Clang. Unlike Clang, GCC considers each inline assembly block to be dependent and therefore, the architecture specifier impacts assemble of some blocks require certain extension support. Removed the architecture for GCC to fix the issue. Fixes: `8fce34cd0a` ("eal/arm: fix clang build of native target") Cc: stable@dpdk.org Reported-by: Feifei Wang <feifei.wang2@arm.com> Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2020-11-27 16:51:46 +01:00
Olivier Matz	44d00a1d12	doc: add missing network layers in API index Add missing files in doxy-api-index.md and add a short description for files that hadn't one. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-27 01:51:27 +01:00
Viacheslav Ovsiienko	0d6ce665ee	net: fix eCPRI header generic data field There was a typo in eCPRI header definition. Fixes: `d164c609e7` ("ethdev: add eCPRI key fields to flow API") Cc: stable@dpdk.org Reported-by: Rani Sharoni <ranish@nvidia.com> Signed-off-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Reviewed-by: Bing Zhao <bingz@nvidia.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-26 01:14:11 +01:00
Timothy Redaelli	c57f6e5c60	eal: fix plugin loading Commit `49b536fc30` ("eal: load only shared libs from driver plugin directories") introduced a check that any shared library must ends with .so, but it can't work, at least, on Fedora/CentOS/RHEL since .so symlinks are not installed when you install dpdk package, but only when you install dpdk-devel package. This commit adds also a check for .so.ABI_VERSION to check for shared lib. See Fedora Packaging Guidelines for more information: https://docs.fedoraproject.org/en-US/packaging-guidelines/#_devel_packages Fixes: `49b536fc30` ("eal: load only shared libs from driver plugin directories") Cc: stable@dpdk.org Signed-off-by: Timothy Redaelli <tredaelli@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com> Tested-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Ali Alnubani <alialnu@nvidia.com>	2020-11-26 00:00:06 +01:00
Timothy Redaelli	7781950f4d	eal: fix shared lib mode detection Commit `06c7871dde` ("eal: restrict default plugin path to shared lib mode") introduced a check that enabled shared lib mode when librte_eal.so can be loaded, but it can't work, at least, on Fedora/CentOS/RHEL since .so symlinks are not installed when you install dpdk package, but only when you install dpdk-devel package. This commit uses librte_eal.so.ABI_VERSION to check for shared lib, since it exists on any linux distributions. See Fedora Packaging Guidelines for more information: https://docs.fedoraproject.org/en-US/packaging-guidelines/#_devel_packages Fixes: `06c7871dde` ("eal: restrict default plugin path to shared lib mode") Cc: stable@dpdk.org Signed-off-by: Timothy Redaelli <tredaelli@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com> Tested-by: Ali Alnubani <alialnu@nvidia.com>	2020-11-25 23:54:12 +01:00
Diogo Behrens	021b698eb5	mcslock: fix hang in weak memory model The initialization me->locked=1 in lock() must happen before next->locked=0 in unlock(), otherwise a thread may hang forever, waiting me->locked become 0. On weak memory systems (such as ARMv8), the current implementation allows me->locked=1 to be reordered with announcing the node (pred->next=me) and, consequently, to be reordered with next->locked=0 in unlock(). This fix adds a release barrier to pred->next=me, forcing me->locked=1 to happen before this operation. Fixes: `2173f3333b` ("mcslock: add MCS queued lock implementation") Cc: stable@dpdk.org Signed-off-by: Diogo Behrens <diogo.behrens@huawei.com> Acked-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2020-11-25 17:30:04 +01:00
Nick Connolly	141492be99	eal/windows: fix linkage with MinGW Linking with the 'pci' driver when building with MinGW on Windows fails with undefined symbol 'GUID_DEVCLASS_NET'. This occurs because devguid.h is included in rte_windows.h before INITGUID is defined. Move the include of devguid.h after the definition of INITGUID. Fixes: `b762221ac2` ("bus/pci: support Windows with bifurcated drivers") Cc: stable@dpdk.org Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Reviewed-by: Tal Shnaiderman <talshn@nvidia.com>	2020-11-22 18:55:01 +01:00
Yunjian Wang	a99d2521a3	malloc: fix style in free list index computation Cleanup code style issue reported by kernel checkpatch. As follows: * ERROR:CODE_INDENT: code indent should use tabs where possible * ERROR:SPACING: spaces required around that '?' (ctx:VxE) * WARNING:INDENTED_LABEL: labels should not be indented Fixes: `b0489e7bca` ("malloc: fix linear complexity") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2020-11-22 18:52:07 +01:00
Thomas Monjalon	dc328d1c55	ethdev: rename a flow shared action error code In the experimental function rte_flow_shared_action_destroy() introduced in DPDK 20.11, the errno ETOOMANYREFS was used. This errno is not always available on Windows, so it is preferred using EBUSY instead. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Tal Shnaiderman <talshn@nvidia.com> Tested-by: Tal Shnaiderman <talshn@nvidia.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-20 21:10:05 +01:00
Simei Su	46914aa1c7	ethdev: add eCPRI RSS offload type This patch defines new RSS offload types for eCPRI. For eCPRI with Message Type 0, the hash field is physical channel ID. Signed-off-by: Simei Su <simei.su@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-20 21:10:05 +01:00
Thomas Monjalon	135155a836	build: align wording of non-support reasons Reasons for building not supported generally start with lowercase because printed as the second part of a line. Other changes: - "linux" should be "Linux" with a capital letter. - ARCH_X86_64 may be simply x86_64. - aarch64 is preferred over arm64. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com>	2020-11-20 16:05:35 +01:00
Tal Shnaiderman	ba5b133e33	eal/windows: remove definition of ETOOMANYREFS The definition of ETOOMANYREFS is reverted as it breaks build of external applications already defining it. Fixes: `c917b54b0c` ("eal/windows: add definition of ETOOMANYREFS") Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Reviewed-by: Nick Connolly <nick.connolly@mayadata.io> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2020-11-20 15:59:02 +01:00
Tal Shnaiderman	c917b54b0c	eal/windows: add definition of ETOOMANYREFS The ETOOMANYREFS errno was missing from the Windows build. It is used in initialization of flow error structures. It is defined with the same error code used by WSAETOOMANYREFS. Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2020-11-16 00:14:07 +01:00
Stephen Hemminger	db27370b57	eal: replace blacklist/whitelist options Replace -w / --pci-whitelist with -a / --allow options and --pci-blacklist with --block. The -b short option remains unchanged. Allow the old options for now, but print a nag warning since old options are deprecated. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Luca Boccassi <bluca@debian.org> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2020-11-16 00:11:22 +01:00
Stephen Hemminger	a65a34a85e	eal: replace usage of blacklist/whitelist in enums Rename the enum values in the EAL include files. As a backward compatible temporary migration tool, define a replacement mapping for old values. The old names relating to blacklist and whitelist are replaced by block list and allow list, but applications may be using the older compatibility macros, marked as deprecated. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Luca Boccassi <bluca@debian.org> Acked-by: Gaetan Rivet <grive@u256.net> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2020-11-16 00:11:22 +01:00
Cristian Dumitrescu	27cc077922	pipeline: fix multiple SWX emit pattern detection Fix the detection of instruction pattern with multiple emits followed by TX. Once detected, this is one of the instruction patterns that is internally replaced with a single optimized instruction, as long as none of the instructions to be replaced is referenced by a jump instruction. The fix enforces this check for the TX instruction of the pattern. Fixes: `31035e87b2` ("pipeline: add SWX instruction optimizer") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-11-15 16:46:37 +01:00
Maxime Coquelin	bc900f86aa	vhost: fix fd leak in kick setup This patch fixes a file descriptor leak which happens in the error path of vhost_user_set_vring_kick(). Fixes: `4796ad63ba` ("examples/vhost: import userspace vhost application") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: Xueming Li <xuemingl@nvidia.com>	2020-11-13 19:43:27 +01:00
Maxime Coquelin	6dc3f119ce	vhost: fix fd leak in dirty logging setup This patch fixes a file descriptor leak which happens in the error path of vhost_user_set_log_base(). Fixes: `4796ad63ba` ("examples/vhost: import userspace vhost application") Cc: stable@dpdk.org Reported-by: Xuan Ding <xuan.ding@intel.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: Xueming Li <xuemingl@nvidia.com>	2020-11-13 19:43:26 +01:00
Maxime Coquelin	726a14eb83	vhost: fix error path when setting memory tables If an error is encountered before the memory regions are parsed, the file descriptors for these shared buffers are leaked. This patch fixes this by closing the message file descriptors on error, taking care of avoiding double closing of the file descriptors. guest_pages is also freed, even though it was not leaked as its pointer was not overridden on subsequent function calls. Fixes: `8f972312b8` ("vhost: support vhost-user") Cc: stable@dpdk.org Reported-by: Xuan Ding <xuan.ding@intel.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: Xueming Li <xuemingl@nvidia.com>	2020-11-13 19:43:26 +01:00
Maxime Coquelin	7804bbd13a	vhost: fix virtqueue initialization This patches fixes virtqueue initialization issue causing segfault or file descriptor being closed unexpectedly. The wrong index was passed to init_vring_queue() by alloc_vring_queue() when a hole in the virtqueue array was met. Fixes: `8acd7c2133` ("vhost: fix virtqueues metadata allocation") Cc: stable@dpdk.org Reported-by: Yu Jiang <yux.jiang@intel.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: David Marchand <david.marchand@redhat.com> Tested-by: Yu Jiang <yux.jiang@intel.com>	2020-11-13 19:43:25 +01:00
Patrick Fu	45ba914134	vhost: fix async inflight packet counter Async inflight packet counter should take failed packets into account. Failed packets will be deducted in the error handling logic. Fixes: `6b3c81db8b` ("vhost: simplify async copy completion") Fixes: `cd6760da10` ("vhost: introduce async enqueue for split ring") Cc: stable@dpdk.org Signed-off-by: Patrick Fu <patrick.fu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-11-13 19:43:25 +01:00
Yi Yang	b605df71be	gro: fix packet type detection with IPv6 tunnel For VxLAN packets, GRO will mistakenly reassemble them if inner L3 is IPv6, inner L4 is TCP or UDP, and outer L3 is IPv4 because the value of IS_IPV4_VXLAN_TCP4/UDP4_PKT is true for them. This fix makes sure IS_IPV4_TCP_PKT, IS_IPV4_UDP_PKT, IS_IPV4_VXLAN_TCP4_PKT and IS_IPV4_VXLAN_UDP4_PKT can make decision precisely. Fixes: `e2d8110636` ("gro: support VXLAN UDP/IPv4") Fixes: `1ca5e67408` ("gro: support UDP/IPv4") Fixes: `9e0b9d2ec0` ("gro: support VxLAN GRO") Fixes: `0d2cbe59b7` ("lib/gro: support TCP/IPv4") Cc: stable@dpdk.org Signed-off-by: Yi Yang <yangyi01@inspur.com> Acked-by: Jiayu Hu <jiayu.hu@intel.com>	2020-11-14 10:56:30 +01:00
Nick Connolly	52a7cb0ad0	build: fix MS linker flag with meson 0.54 Meson versions >= 0.54.0 include support for handling /implib with msvc link. Specifying it explicitly causes failures when linking against the dll. Tested using Link 14.27.29112.0 and Clang 11.0.0. There were a number of changes to the way that import libraries are handled between 0.47.1 and 0.54.0. Only make the change for >= 0.54.0, leaving the behaviour unchanged for earlier versions. Fixes: `77cca7ccec` ("build: fix drivers library path on Windows") Cc: stable@dpdk.org Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Tested-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Khoa To <khot@microsoft.com>	2020-11-13 15:13:16 +01:00
Cristian Dumitrescu	5725870c8c	table: fix exact match SWX table lookup Fix for the exact match lookup function. Fixes: `d0a0096661` ("table: add exact match SWX table") Signed-off-by: Churchill Khangar <churchill.khangar@intel.com> Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-11-13 13:55:07 +01:00
Ruifeng Wang	8fce34cd0a	eal/arm: fix clang build of native target When doing Clang build with '-mcpu=native' on N1 platform, build failed with: ../lib/librte_eal/arm/include/rte_atomic_64.h:76:39: error: instruction requires: lse __ATOMIC128_CAS_OP(__cas_128_release, "caspl") This is because native detection for Neoverse N1 was added in Clang-11. Prior version of Clang's assembler doesn't know LSE support on hardware. Fixed this for Clang earlier than version 11 by specifying architecture for assembler. Referred to [1] for this fix. Fixes: `7e2c3e17fe` ("eal/arm64: add 128-bit atomic compare exchange") Cc: stable@dpdk.org [1] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=e0d5896bd356cd577f9710a02d7a474cdf58426b Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2020-11-13 10:30:48 +01:00
David Christensen	2d0a972cd6	vfio: use static window sizing for sPAPR IOMMU The SPAPR IOMMU requires that a DMA window size be defined before memory can be mapped for DMA. Current code dynamically modifies the DMA window size in response to every new memory allocation which is potentially dangerous because all existing mappings need to be unmapped/remapped in order to resize the DMA window, leaving hardware holding IOVA addresses that are temporarily unmapped. The new SPAPR code statically assigns the DMA window size on first use, using the largest physical memory memory address when IOVA=PA and the highest existing memseg virtual address when IOVA=VA. Signed-off-by: David Christensen <drc@linux.vnet.ibm.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2020-11-13 09:35:18 +01:00
Thomas Monjalon	4630290af4	mbuf: move pool pointer in first half According to the Technical Board decision (http://mails.dpdk.org/archives/dev/2020-November/191859.html), the mempool pointer in the mbuf struct is moved from the second to the first half. It may increase performance in some cases on systems having 64-byte cache line, i.e. mbuf split in two cache lines. Due to this change, all fields after "pool" are moved up. Hopefully no vector data path is impacted. Moving this field gives more space to dynfield1 while dropping the temporary dynfield0. This is how the mbuf layout looks like (pahole-style): word type name byte size 0 void * buf_addr; /* 0 + 8 / 1 rte_iova_t buf_iova / 8 + 8 / / --- RTE_MARKER64 rearm_data; / 2 uint16_t data_off; / 16 + 2 / uint16_t refcnt; / 18 + 2 / uint16_t nb_segs; / 20 + 2 / uint16_t port; / 22 + 2 / 3 uint64_t ol_flags; / 24 + 8 / / --- RTE_MARKER rx_descriptor_fields1; / 4 uint32_t union packet_type; / 32 + 4 / uint32_t pkt_len; / 36 + 4 / 5 uint16_t data_len; / 40 + 2 / uint16_t vlan_tci; / 42 + 2 / 5.5 uint64_t union hash; / 44 + 8 / 6.5 uint16_t vlan_tci_outer; / 52 + 2 / uint16_t buf_len; / 54 + 2 / 7 struct rte_mempool pool; /* 56 + 8 / / --- RTE_MARKER cacheline1; / 8 struct rte_mbuf next; /* 64 + 8 / 9 uint64_t union tx_offload; / 72 + 8 / 10 struct rte_mbuf_ext_shared_info shinfo; /* 80 + 8 / 11 uint16_t priv_size; / 88 + 2 / uint16_t timesync; / 90 + 2 / 11.5 uint32_t dynfield1[9]; / 92 + 36 / 16 / --- END 128 */ Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Morten Brørup <mb@smartsharesystems.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2020-11-12 16:39:10 +01:00
Morten Brørup	3a35c1c0f6	mbuf: clean up comments and prefix The mbuf header files had some commenting style errors that affected the API documentation. Also, the RTE_ prefix was missing on a macro and a definition. Note: This patch does not touch the offload and attachment flags that are also missing the RTE_ prefix. Signed-off-by: Morten Brørup <mb@smartsharesystems.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-05 17:53:15 +01:00
Dmitry Kozlyuk	c2341bb671	cmdline: avoid name clash with Windows system types cmdline_numtype member names clash with Windows system identifiers. Add RTE_ prefix to cmdline constants to avoid this and possible future conflicts. Suggested-by: Ranjit Menon <ranjit.menon@intel.com> Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Jie Zhou <jizh@microsoft.com> Tested-by: Jie Zhou <jizh@microsoft.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-05 17:49:00 +01:00
Olivier Matz	f8e90bcc1d	eal: fix MCS lock and ticketlock headers install Add missing arch-specific headers in meson.build. Fixes: `2173f3333b` ("mcslock: add MCS queued lock implementation") Fixes: `ca49b92079` ("ticketlock: enable generic ticketlock on all arch") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: David Christensen <drc@linux.vnet.ibm.com> Acked-by: Ruifeng Wang <ruifeng.wang@arm.com>	2020-11-05 12:08:19 +01:00
Stephen Hemminger	0429a2e1a4	mbuf: fix dynamic fields and flags with multiprocess The dynamic flag management is broken if rte_mbuf_dynflag_lookup() is done in a secondary process because the local pointer to the memzone is not ever initialized. Fix it by using the same checks as dynfield_register(). I.e if shared memory zone has not been looked up already, then discover it. Fixes: `4958ca3a44` ("mbuf: support dynamic fields and flags") Cc: stable@dpdk.org Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-04 20:47:14 +01:00
Yunjian Wang	4bb02a6f5b	ethdev: fix data type for port id The ethdev port id is 16 bits now. This patch fixes the data type of the variable for 'pid', which changing from uint32_t to uint16_t. RTE_MAX_ETHPORTS is the maximum number of ports, which customized by the user. To avoid 16-bit unsigned integer overflow, the valid value of RTE_MAX_ETHPORTS should be set from 0 to UINT16_MAX, and it is safer to cut one more port from space. So we use RTE_BUILD_BUG_ON() to ensure that RTE_MAX_ETHPORTS is less to UINT16_MAX. Fixes: `5b7ba31148` ("ethdev: add port ownership") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-04 14:48:45 +01:00
Yunjian Wang	b9b67d0674	ethdev: fix using Rx split config before null check Coverity flags that 'rx_conf' variable is used before it's checked for NULL. This patch fixes this issue. Coverity issue: 363570 Fixes: `4ff702b5df` ("ethdev: introduce Rx buffer split") Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-04 12:21:06 +01:00
Ivan Malov	9ff91c0d81	ethdev: introduce transfer attribute to shared action conf In a flow rule, attribute "transfer" means operation level at which both traffic is matched and actions are conducted. Add the very same attribute to shared action configuration. If a driver needs to prepare HW resources in two different ways, depending on the operation level, in order to set up an action, then this new attribute will indicate the level. Also, when handling a flow rule insertion, the driver will be able to turn down a shared action if its level is unfit. Signed-off-by: Ivan Malov <ivan.malov@oktetlabs.ru> Acked-by: Ori Kam <orika@nvidia.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Andrey Vesnovaty <andreyv@nvidia.com>	2020-11-03 23:35:07 +01:00
Morten Brørup	4def9a8281	ethdev: document Rx packet number requirement for vector Rx Updated description of rte_eth_rx_burst() to reflect what drivers, when using vector instructions, expect from nb_pkts. Also discussed on the mailing list here: http://inbox.dpdk.org/dev/98CBD80474FA8B44BF855DF32C47DC35C61257@smartserver.smartshare.dk/ Signed-off-by: Morten Brørup <mb@smartsharesystems.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-11-03 23:35:06 +01:00
Andrew Rybchenko	8fb8efd4e4	ethdev: move L2 tunnel config structure to ixgbe driver net/ixgbe driver is the only user of the struct rte_eth_l2_tunnel_conf. Move it to the driver and use ixgbe_ prefix instead of rte_eth_. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:06 +01:00
Andrew Rybchenko	cf47acc0f9	ethdev: remove L2 tunnel offload control API Remove rte_eth_dev_l2_tunnel_offload_set() and corresponding ethdev driver operation. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Jeff Guo <jia.guo@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:06 +01:00
Andrew Rybchenko	99a1b6895f	ethdev: remove API to config L2 tunnel EtherType Remove rte_eth_dev_l2_tunnel_eth_type_conf() and corresponding ethdev driver operation. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Jeff Guo <jia.guo@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:06 +01:00
Andrew Rybchenko	0b46e9b411	ethdev: remove legacy filter API functions The legacy filter API, including rte_eth_dev_filter_supported() and rte_eth_dev_filter_ctrl() is removed. Flow API should be used. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	1be514fbce	ethdev: remove legacy FDIR filter type support Instead of FDIR filters RTE flow API should be used. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	0cd2ff37c3	ethdev: remove legacy global filter configuration support Global filter configuration request was supported by net/i40e driver only to configure GRE key length. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	81db321dae	ethdev: remove legacy HASH filter type support Instead of HASH filter RTE flow API should be used. Preserve RTE_ETH_FILTER_HASH since it is used in drivers internally in RTE flow API support. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	251baec367	ethdev: remove legacy tunnel filter type support Instead of TUNNEL filter RTE flow API should be used. Move corresponding defines and helper structure to ethdev driver interface since it is still used by drivers internally. Preserve RTE_ETH_FILTER_TUNNEL because of usage in drivers. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	92067db05f	ethdev: remove legacy N-tuple filter type support Instead of N-tuple filter RTE flow API should be used. Preserve struct rte_eth_ntuple_filter in ethdev API since the structure and related defines are used in flow classify library and a number of drivers. Preserve RTE_ETH_FILTER_NTUPLE because of usage in drivers. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	ae42875d6e	ethdev: remove legacy SYN filter type support Instead of SYN filter RTE flow API should be used. Move corresponding definitions to ethdev internal driver API since it is used by drivers internally. Preserve RTE_ETH_FILTER_SYN because of it as well. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	8d1a709b6f	ethdev: move flexible filter type to e1000 driver net/e1000 driver is the only user of the struct rte_eth_flex_filter and helper defines. Move it to the driver and use igb_ prefix instead of rte_eth_. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	35b1c68af2	ethdev: remove legacy EtherType filter type support Instead of EtherType filter RTE flow API should be used. Move corresponding definitions to ethdev internal driver API since it is used by drivers internally. Preserve RTE_ETH_FILTER_ETHERTYPE because of it as well. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	a1444986e4	ethdev: move MAC filter type to i40e driver net/i40e driver is the only user of the enum rte_mac_filter_type. Move the define to the driver and use i40e_ prefix instead of rte_. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Andrew Rybchenko	9f3b3a96de	ethdev: remove legacy MACVLAN filter type support Instead of MACVLAN filter RTE flow API should be used. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:35:05 +01:00
Patrick Fu	5b4a1e7ee4	vhost: fix uninitialized local variable This patch initializes a local parameter in async data path to avoid compiler warnings. Fixes: `cd6760da10` ("vhost: introduce async enqueue for split ring") Cc: stable@dpdk.org Signed-off-by: Patrick Fu <patrick.fu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-11-03 23:35:05 +01:00
Patrick Fu	fb4b7a131c	vhost: fix guest/host physical address conversion gpa_to_hpa() function almost always fails due to the wrong setup of the binary tree search key. Since there has already been a similar function gpa_to_first_hpa() available in the vhost, instead of fixing the issue in its original logic, gpa_to_hpa() function is rewritten to be a wrapper of the gpa_to_first_hpa() to avoid code redundancy. Fixes: `e246896178` ("vhost: get guest/host physical address mappings") Fixes: `faa9867c4d` ("vhost: use binary search in address conversion") Cc: stable@dpdk.org Signed-off-by: Patrick Fu <patrick.fu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-11-03 23:35:05 +01:00
Thomas Monjalon	e9ef7ec12b	ethdev: move non-offload capabilities The definitions of RTE_ETH_DEV_CAPA_RUNTIME_RX_QUEUE_SETUP and RTE_ETH_DEV_CAPA_RUNTIME_TX_QUEUE_SETUP were inserted before the last comment of Tx offloads. It is moved in a better place, with comments moved to be before the definition. A group comment is added to better describe device capabilities. Fixes: `cac923cfea` ("ethdev: support runtime queue setup") Cc: stable@dpdk.org Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2020-11-03 23:35:02 +01:00
Lijun Ou	1848b117cc	app/testpmd: fix RSS key for flow API RSS rule When a flow API RSS rule is issued in testpmd, device RSS key is changed unexpectedly, device RSS key is changed to the testpmd default RSS key. Consider the following usage with testpmd: 1. first, startup testpmd: testpmd> show port 0 rss-hash key RSS functions: all ipv4-frag ipv4-other ipv6-frag ipv6-other ip RSS key: 6D5A56DA255B0EC24167253D43A38FB0D0CA2BCBAE7B30B477CB2DA38030F 20C6A42B73BBEAC01FA 2. create a rss rule testpmd> flow create 0 ingress pattern eth / ipv4 / udp / end \ actions rss types ipv4-udp end queues end / end 3. show rss-hash key testpmd> show port 0 rss-hash key RSS functions: all ipv4-udp udp RSS key: 74657374706D6427732064656661756C74205253532068617368206B65792 C206F76657272696465 This is because testpmd always sends a key with the RSS rule, if user provides a key as part of the rule that key is used, if user doesn't provide a key, testpmd default key is sent to the PMDs, which is causing device programmed RSS key to be changed. There was a previous attempt to fix the same issue [1], but it has been reverted back [2] because of the crash when 'key_len' is provided without 'key'. This patch follows the same approach with the initial fix [1] but also addresses the crash. After change, testpmd RSS key is 'NULL' by default, if user provides a key as part of rule it is used, if not no key is sent to the PMDs at all [1] Commit `a4391f8bae` ("app/testpmd: set default RSS key as null") [2] Commit `f3698c3d09` ("app/testpmd: revert setting default RSS") Fixes: `d0ad8648b1` ("app/testpmd: fix RSS flow action configuration") Cc: stable@dpdk.org Signed-off-by: Lijun Ou <oulijun@huawei.com> Signed-off-by: Ophir Munk <ophirmu@mellanox.com> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-11-03 23:24:26 +01:00
Patrick Fu	f56b6d450b	vhost: remove fallback in async enqueue API By design, async enqueue API should return directly if async device is not registered. This patch removes the corrupted implementation of the enqueue fallback from async mode to sync mode. Fixes: `cd6760da10` ("vhost: introduce async enqueue for split ring") Cc: stable@dpdk.org Signed-off-by: Patrick Fu <patrick.fu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-11-03 23:24:26 +01:00
Maxime Coquelin	60db6ddf62	vhost: check virtqueue metadata pointer This patch checks whether the virtqueue metadata pointer is valid before dereferencing it. It is not considered a fix as earlier patch ensures there are no holes in the array of virtqueue metadata pointers. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2020-11-03 23:24:26 +01:00
Maxime Coquelin	c59898131b	vhost: validate index in async API This patch validates the queue index parameter, in order to ensure no out-of-bound accesses happen. Fixes: `9eed6bfd2e` ("vhost: allow to enable or disable features") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2020-11-03 23:24:26 +01:00
Maxime Coquelin	d2475e8903	vhost: validate index in inflight API This patch validates the queue index parameter, in order to ensure neither out-of-bound accesses nor NULL pointer dereferencing happen. Fixes: `4d891f77dd` ("vhost: add APIs to get inflight ring") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2020-11-03 23:24:26 +01:00
Maxime Coquelin	943daec05c	vhost: validate index in live-migration API This patch validates the queue index parameter, in order to ensure no out-of-bound accesses happen. Fixes: `bd2e0c3fe5` ("vhost: add APIs for live migration") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2020-11-03 23:24:26 +01:00
Maxime Coquelin	366374054b	vhost: validate index in guest notification API This patch validates the queue index parameter, in order to ensure neither out-of-bound accesses nor NULL pointer dereferencing happen. Fixes: `9eed6bfd2e` ("vhost: allow to enable or disable features") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2020-11-03 23:24:26 +01:00
Maxime Coquelin	8c042f8191	vhost: validate index in available entries API This patch validates the queue index parameter, in order to ensure neither out-of-bound accesses nor NULL pointer dereferencing happen. Fixes: `a67f286a65` ("vhost: export queue free entries") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2020-11-03 23:24:26 +01:00
Maxime Coquelin	8acd7c2133	vhost: fix virtqueues metadata allocation The Vhost-user backend implementation assumes there will be no holes in the device's array of virtqueues metadata pointers. It can happen though, and would cause segmentation faults, memory leaks or undefined behaviour. This patch keep the assumption that there is no holes in this array, and allocate all uninitialized virtqueues metadata up to requested index. Fixes: `160cbc815b` ("vhost: remove a hack on queue allocation") Cc: stable@dpdk.org Suggested-by: Adrian Moreno <amorenoz@redhat.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2020-11-03 23:24:26 +01:00
Ciara Power	a5c79ad0f0	metrics: fix memory leak on allocation failure If an error occurred when allocating memory for metrics or names, the function returned without freeing allocated memory. This is now fixed to avoid the resource leak in the case that either metrics or names had been successfully allocated memory. Coverity issue: 362053 Fixes: `c5b7197f66` ("telemetry: move some functions to metrics library") Cc: stable@dpdk.org Reported-by: Gaurav Singh <gaurav1086@gmail.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2020-11-03 22:45:24 +01:00
Anatoly Burakov	fc61d9a89b	eal: fix power intrinsics API description Currently, the intrinsics documentation refers to `rte_cpu_get_features` as a check for whether these intrinsics are supported at runtime. This is incorrect, because actually the user should use the `rte_cpu_get_intrinsics_support` API to do said check. Fix the typo. Fixes: `1280214212` ("eal: add intrinsics support check infrastructure") Reported-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Liang Ma <liang.j.ma@intel.com>	2020-11-03 22:45:24 +01:00
Yi Yang	c0d002aed9	gso: fix mbuf freeing responsibility rte_gso_segment decreased refcnt of pkt by one, but it is wrong if pkt is external mbuf, pkt won't be freed because of incorrect refcnt, the result is application can't allocate mbuf from mempool because mbufs in mempool are run out of. One correct way is application should call rte_pktmbuf_free after calling rte_gso_segment to free pkt explicitly. rte_gso_segment must not handle it, this should be responsibility of application. This commit changed rte_gso_segment in functional behavior and return value, so the application must take appropriate actions according to return values, "ret < 0" means it should free and drop 'pkt', "ret == 0" means 'pkt' isn't GSOed but 'pkt' can be transmitted as a normal packet, "ret > 0" means 'pkt' has been GSOed into two or multiple segments, it should use "pkts_out" to transmit these segments. The application must free 'pkt' after call rte_gso_segment when return value isn't equal to 0. Fixes: `119583797b` ("gso: support TCP/IPv4 GSO") Cc: stable@dpdk.org Signed-off-by: Yi Yang <yangyi01@inspur.com> Acked-by: Jiayu Hu <jiayu.hu@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-11-03 22:45:02 +01:00
Dmitry Kozlyuk	bd117d62a6	eal/windows: fix deadlock when setting alarm Windows alarms are both armed and executed from the interrupt thread. rte_eal_alarm_set() dispatched alarm-arming code to that thread and waited for its completion via a spinlock. However, if called from alarm callback (i.e. from the interrupt thread), this caused a deadlock, because arming could not be run until its dispatcher exits, but it could only exit after it finished waiting for arming to complete. Call arming code directly when running in the interrupt thread. Fixes: `f4cbdbc7fb` ("eal/windows: implement alarm API") Reported-by: Pallavi Kadam <pallavi.kadam@intel.com> Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Pallavi Kadam <pallavi.kadam@intel.com>	2020-11-03 22:45:02 +01:00
Pallavi Kadam	a2b0471739	eal/windows: allow running as non-admin Currently, since there is no runtime directory set, the code tries to create a file in C:\ which is only writable with administrator privileges. As a result, if the user is not admin, the application will fail. So, forcing no_shconf to 1 to prevent the code having to create files in the runtime directory. Suggested-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Narcisa Vasile <navasile@linux.microsoft.com>	2020-11-03 22:45:02 +01:00
Thomas Monjalon	af270529ad	ethdev: include mbuf registration in Tx timestamp API Previously, the Tx timestamp field and flag were registered in testpmd, as described in mlx5 guide. For consistency between Rx and Tx timestamps, managing mbuf registrations inside the driver, as properly documented, is a simpler expectation. The only driver to support this feature (mlx5) is updated as well as the testpmd application. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-03 16:21:15 +01:00
Thomas Monjalon	26fb26f6f5	mbuf: add Tx timestamp registration helper The function rte_mbuf_dyn_tx_timestamp_register() can be used to register the required field and flag. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-03 16:21:15 +01:00
Thomas Monjalon	c7e857e16f	mbuf: remove deprecated timestamp field As announced in the deprecation note, the field timestamp is removed to give more space to the dynamic fields. The related offload flag PKT_RX_TIMESTAMP is also removed. This is how the mbuf layout looks like (pahole-style): word type name byte size 0 void * buf_addr; /* 0 + 8 / 1 rte_iova_t buf_iova / 8 + 8 / / --- RTE_MARKER64 rearm_data; / 2 uint16_t data_off; / 16 + 2 / uint16_t refcnt; / 18 + 2 / uint16_t nb_segs; / 20 + 2 / uint16_t port; / 22 + 2 / 3 uint64_t ol_flags; / 24 + 8 / / --- RTE_MARKER rx_descriptor_fields1; / 4 uint32_t union packet_type; / 32 + 4 / uint32_t pkt_len; / 36 + 4 / 5 uint16_t data_len; / 40 + 2 / uint16_t vlan_tci; / 42 + 2 / 5.5 uint64_t union hash; / 44 + 8 / 6.5 uint16_t vlan_tci_outer; / 52 + 2 / uint16_t buf_len; / 54 + 2 / 7 uint64_t dynfield0[1]; / 56 + 8 / / --- RTE_MARKER cacheline1; / 8 struct rte_mempool pool; /* 64 + 8 / 9 struct rte_mbuf next; /* 72 + 8 / 10 uint64_t union tx_offload; / 80 + 8 / 11 struct rte_mbuf_ext_shared_info shinfo; /* 88 + 8 / 12 uint16_t priv_size; / 96 + 2 / uint16_t timesync; / 98 + 2 / 12.5 uint32_t dynfield1[7]; / 100 + 28 / 16 / --- END 128 */ Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-03 16:21:15 +01:00
Thomas Monjalon	c2fb882be2	ethdev: add doxygen comment for Rx timestamp API The offload flag DEV_RX_OFFLOAD_TIMESTAMP had no documentation. After switching to dynamic mbuf flag and field, it becomes even more important to explicit the feature behaviour. A doxygen comment for the timesync API was mentioning the deprecated timestamp field, so it is also updated. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-03 16:21:15 +01:00
Thomas Monjalon	d0c34e99ca	latency: switch Rx timestamp to dynamic mbuf field The mbuf timestamp is moved to a dynamic field in order to allow removal of the deprecated static field. The related mbuf flag is also replaced with the dynamic one. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-03 16:21:15 +01:00
Thomas Monjalon	9db924cc21	mbuf: add Rx timestamp flag and helpers There is already a dynamic field for timestamp, used only for Tx scheduling with the dedicated Tx offload flag. The same field can be used for Rx timestamp filled by drivers. A new dynamic flag is defined for Rx usage. A new function wraps the registration of both field and Rx flag. The type rte_mbuf_timestamp_t is defined for the API users. After migrating all Rx timestamp usages, it will be possible to remove the deprecated timestamp field. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-03 16:21:14 +01:00
Thomas Monjalon	52bf2010c9	eventdev: remove software Rx timestamp This a revert of the commit `569758758d` ("eventdev: add Rx timestamp"). If the Rx timestamp is not configured on the ethdev port, there is no reason to set one. Also the accuracy of the timestamp was bad because set at a late stage. Anyway there is no trace of the usage of this timestamp. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-11-03 15:28:26 +01:00
David Marchand	98ee07c20d	eventdev: check input parameter for dump op Rather than have drivers check for this, let's ensure the passed FILE * is not NULL. Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Timothy McDaniel <timothy.mcdaniel@intel.com>	2020-11-02 14:46:51 +01:00
Guy Kaneti	666c4c62b7	regexdev: add out-of-order scan capability Add out of order scan capability to check PMD support for OOS. Signed-off-by: Guy Kaneti <guyk@marvell.com> Acked-by: Ori Kam <orika@nvidia.com>	2020-11-03 02:03:25 +01:00
Venkata Suresh Kumar P	2ceb0974ee	pipeline: increase SWX immediate operand size This patch increases the immediate operand size from 32 to 64 bits. Signed-off-by: Venkata Suresh Kumar P <venkata.suresh.kumar.p@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-11-02 21:38:27 +01:00
David Marchand	79d69c6dcf	mbuf: remove seqn field As announced in the deprecation note, the field seqn is removed to give more space to the dynamic fields. This is how the mbuf layout looks like (pahole-style): word type name byte size 0 void * buf_addr; /* 0 + 8 / 1 rte_iova_t buf_iova / 8 + 8 / / --- RTE_MARKER64 rearm_data; / 2 uint16_t data_off; / 16 + 2 / uint16_t refcnt; / 18 + 2 / uint16_t nb_segs; / 20 + 2 / uint16_t port; / 22 + 2 / 3 uint64_t ol_flags; / 24 + 8 / / --- RTE_MARKER rx_descriptor_fields1; / 4 uint32_t union packet_type; / 32 + 4 / uint32_t pkt_len; / 36 + 4 / 5 uint16_t data_len; / 40 + 2 / uint16_t vlan_tci; / 42 + 2 / 5.5 uint64_t union hash; / 44 + 8 / 6.5 uint16_t vlan_tci_outer; / 52 + 2 / uint16_t buf_len; / 54 + 2 / 7 uint64_t timestamp; / 56 + 8 / / --- RTE_MARKER cacheline1; / 8 struct rte_mempool pool; /* 64 + 8 / 9 struct rte_mbuf next; /* 72 + 8 / 10 uint64_t union tx_offload; / 80 + 8 / 11 struct rte_mbuf_ext_shared_info shinfo; /* 88 + 8 / 12 uint16_t priv_size; / 96 + 2 / uint16_t timesync; / 98 + 2 / 12.5 uint32_t dynfield1[7]; / 100 + 28 / 16 / --- END 128 */ Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2020-10-31 22:14:44 +01:00
David Marchand	ca4355e4c7	eventdev: switch sequence number to dynamic mbuf field The eventdev drivers have been hacking the deprecated field seqn for internal test usage. It is moved to a dynamic mbuf field in order to allow removal of seqn. Signed-off-by: David Marchand <david.marchand@redhat.com>	2020-10-31 22:14:42 +01:00
David Marchand	01f3496695	reorder: switch sequence number to dynamic mbuf field The reorder library used sequence numbers stored in the deprecated field seqn. It is moved to a dynamic mbuf field in order to allow removal of seqn. Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2020-10-31 22:14:30 +01:00
Thomas Monjalon	5284adad3e	mbuf: remove userdata field As announced in the deprecation note, the field userdata / udata64 is removed to give more space to the dynamic fields. This is how the mbuf layout looks like (pahole-style): word type name byte size 0 void * buf_addr; /* 0 + 8 / 1 rte_iova_t buf_iova / 8 + 8 / / --- RTE_MARKER64 rearm_data; / 2 uint16_t data_off; / 16 + 2 / uint16_t refcnt; / 18 + 2 / uint16_t nb_segs; / 20 + 2 / uint16_t port; / 22 + 2 / 3 uint64_t ol_flags; / 24 + 8 / / --- RTE_MARKER rx_descriptor_fields1; / 4 uint32_t union packet_type; / 32 + 4 / uint32_t pkt_len; / 36 + 4 / 5 uint16_t data_len; / 40 + 2 / uint16_t vlan_tci; / 42 + 2 / 5.5 uint64_t union hash; / 44 + 8 / 6.5 uint16_t vlan_tci_outer; / 52 + 2 / uint16_t buf_len; / 54 + 2 / 7 uint64_t timestamp; / 56 + 8 / / --- RTE_MARKER cacheline1; / 8 struct rte_mempool pool; /* 64 + 8 / 9 struct rte_mbuf next; /* 72 + 8 / 10 uint64_t union tx_offload; / 80 + 8 / 11 uint16_t priv_size; / 88 + 2 / uint16_t timesync; / 90 + 2 / uint32_t seqn; / 92 + 4 / 12 struct rte_mbuf_ext_shared_info shinfo; /* 96 + 8 / 13 uint64_t dynfield1[3]; / 104 + 24 / 16 / --- END 128 */ Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-31 16:13:11 +01:00
Thomas Monjalon	614af75489	security: switch metadata to dynamic mbuf field The device-specific metadata was stored in the deprecated field udata64. It is moved to a dynamic mbuf field in order to allow removal of udata64. The name rte_security_dynfield is not very descriptive but it should be replaced later by separate fields for each type of data that drivers pass to the upper layer. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Haiyue Wang <haiyue.wang@intel.com>	2020-10-31 16:13:11 +01:00
Nithin Dabilpuram	b8d737462e	node: switch IPv4 metadata to dynamic mbuf field The node_mbuf_priv1 was stored in the deprecated mbuf field udata64. It is moved to a dynamic field in order to allow removal of udata64. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Signed-off-by: Nithin Dabilpuram <ndabilpuram@marvell.com>	2020-10-31 16:13:10 +01:00
Thomas Monjalon	34d26d26b7	mbuf: fix typo in dynamic field convention note Replace "in a in PMD" with "in a PMD". Fixes: `4958ca3a44` ("mbuf: support dynamic fields and flags") Cc: stable@dpdk.org Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2020-10-31 16:13:10 +01:00
Thomas Monjalon	905592f4c4	kni: move header file from EAL Since the kernel module is not part of EAL anymore, there is no need to have the common KNI header file in EAL. The file rte_kni_common.h is moved to librte_kni. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-10-31 16:13:10 +01:00
David Marchand	f4d2ad3aa9	trace: make CTF metadata prettier This is simply a cosmetic change. Before: event { id = 17; name = "lib.eal.alarm.set"; fields := struct { uint64_t us;uintptr_t cb_fn;uintptr_t cb_arg;int32_t rc; }; }; After: event { id = 17; name = "lib.eal.alarm.set"; fields := struct { uint64_t us; uintptr_t cb_fn; uintptr_t cb_arg; int32_t rc; }; }; Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Sunil Kumar Kori <skori@mavell.com>	2020-10-29 22:49:22 +01:00
David Marchand	fd43b50113	trace: fix metadata dump The ctf metadata is written to the metadata file without any check for length, so this string must be null terminated. Fixes: `f1a099f5b1` ("trace: create CTF TDSL metadata in memory") Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Sunil Kumar Kori <skori@mavell.com>	2020-10-29 22:49:22 +01:00
David Marchand	721cfcaf04	trace: remove size limit on CTF event description Rework registration so that it uses dynamic allocations and has no size limit. Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Sunil Kumar Kori <skori@mavell.com>	2020-10-29 22:49:22 +01:00
David Marchand	d992fa555d	trace: fixup CTF event description at registration CTF event description is currently built by appending all fields in a single string at trace point registration. When dumping the metadata, this string is split again and inspected to fixup reserved keywords and special tokens like "." or "->". Move this fixup per field at trace point registration time so that there is no need for inspecting / string parsing when dumping metadata. Use dynamic allocations to remove an artificial size limit on the CTF event description manipulations. Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Sunil Kumar Kori <skori@mavell.com>	2020-10-29 22:49:22 +01:00
Liang Ma	1280214212	eal: add intrinsics support check infrastructure Currently, it is not possible to check support for intrinsics that are platform-specific, cannot be abstracted in a generic way, or do not have support on all architectures. The CPUID flags can be used to some extent, but they are only defined for their platform, while intrinsics will be available to all code as they are in generic headers. This patch introduces infrastructure to check support for certain platform-specific intrinsics, and adds support for checking support for IA power management-related intrinsics for UMWAIT/UMONITOR and TPAUSE. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Signed-off-by: Liang Ma <liang.j.ma@intel.com> Acked-by: David Christensen <drc@linux.vnet.ibm.com> Acked-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2020-10-29 22:46:31 +01:00
Liang Ma	cda57d9388	eal: add power management intrinsics Add two new power management intrinsics, and provide an implementation in eal/x86 based on UMONITOR/UMWAIT instructions. The instructions are implemented as raw byte opcodes because there is not yet widespread compiler support for these instructions. The power management instructions provide an architecture-specific function to either wait until a specified TSC timestamp is reached, or optionally wait until either a TSC timestamp is reached or a memory location is written to. The monitor function also provides an optional comparison, to avoid sleeping when the expected write has already happened, and no more writes are expected. For more details, please refer to Intel(R) 64 and IA-32 Architectures Software Developer's Manual, Volume 2. Signed-off-by: Liang Ma <liang.j.ma@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: David Christensen <drc@linux.vnet.ibm.com> Acked-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Ruifeng Wang <ruifeng.wang@arm.com>	2020-10-29 22:46:31 +01:00
Liang Ma	e448a5a9ed	eal/x86: add CPU flag for WAITPKG Add a new CPUID flag indicating processor support for UMONITOR/UMWAIT and TPAUSE instructions instruction. Signed-off-by: Liang Ma <liang.j.ma@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-29 22:46:27 +01:00
Cristian Dumitrescu	0dde44843b	pipeline: fix string copy into fixed size buffer Fix potential buffer overflows by string copy into fixed size buffer. Coverity issue: 362732, 362736, 362760, 362772, 362775, 362784 Coverity issue: 362800, 362803, 362806, 362811, 362814, 362816 Coverity issue: 362834, 362837, 362844, 362845, 362857, 362861 Coverity issue: 362868, 362890, 362893, 362904, 362905 Fixes: `56492fd536` ("pipeline: add new SWX pipeline type") Fixes: `1e4c88caea` ("pipeline: add SWX extern objects and funcs") Fixes: `e9d870dd93` ("pipeline: add SWX pipeline tables") Fixes: `a1711f948d` ("pipeline: add SWX Rx and extract instructions") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-29 17:32:49 +01:00
Yunjian Wang	eb7fa475f7	hash: fix check of parameter Coverity flags that 'h' variable is used before it's checked for NULL. This patch fixes this issue. Coverity issue: 363625 Fixes: `769b2de7fb` ("hash: implement RCU resources reclamation") Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Reviewed-by: Dharmik Thakkar <dharmik.thakkar@arm.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-10-29 16:45:17 +01:00
Yunjian Wang	39e2961a09	eal: fix interrupt trace point This patch fixes (dereference after null check) coverity issue. For this reason, we should add null check at the beginning of the function and return error directly if the 'intr_handle' is null. Coverity issue: 357695, 357751 Fixes: `05c4105738` ("trace: add interrupt tracepoints") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Reviewed-by: Harman Kalra <hkalra@marvell.com>	2020-10-29 16:30:49 +01:00
Honnappa Nagarahalli	47bec9a5ca	ring: add zero copy API Add zero-copy APIs. These APIs provide the capability to copy the data to/from the ring memory directly, without having a temporary copy (for ex: an array of mbufs on the stack). Use cases that involve copying large amount of data to/from the ring can benefit from these APIs. Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Reviewed-by: Dharmik Thakkar <dharmik.thakkar@arm.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-29 14:13:31 +01:00
Vladimir Medvedkin	1e5630e40d	fib6: add AVX512 lookup Add new lookup implementation for FIB6 trie algorithm using AVX512 instruction set Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-28 21:29:13 +01:00
Vladimir Medvedkin	d2ab1e0130	fib6: move lookup definition to header Move trie table layout and lookup definition into the private header file. This is necessary for implementing a vectorized lookup function in a separate .с file. Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-28 21:29:13 +01:00
Vladimir Medvedkin	6f53fcc7b2	fib6: add lookup runtime selection Add type argument to trie_get_lookup_fn() Now it only supports RTE_FIB6_LOOKUP_TRIE_SCALAR Add new rte_fib6_select_lookup() - user can change lookup function type runtime. Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-28 21:29:11 +01:00
Vladimir Medvedkin	b3509fa365	fib: add AVX512 lookup Add new lookup implementation for DIR24_8 algorithm using AVX512 instruction set Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-28 21:29:11 +01:00
Vladimir Medvedkin	a19791f570	fib: move lookup definition to header Move dir24_8 table layout and lookup definition into the private header file. This is necessary for implementing a vectorized lookup function in a separate .с file. Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-28 21:29:11 +01:00
Vladimir Medvedkin	a5b0d25d81	fib: add lookup runtime selection Add type argument to dir24_8_get_lookup_fn() Now it supports 3 different lookup implementations: RTE_FIB_LOOKUP_DIR24_8_SCALAR_MACRO RTE_FIB_LOOKUP_DIR24_8_SCALAR_INLINE RTE_FIB_LOOKUP_DIR24_8_SCALAR_UNI Add new rte_fib_select_lookup() - user can change lookup function type runtime. Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-28 21:29:03 +01:00
Vladimir Medvedkin	4f66d3be56	fib: remove maximum type enums FIB type RTE_FIB_TYPE_MAX is used only for sanity checks, remove it to prevent applications start using it. The same is for FIB6's RTE_FIB6_TYPE_MAX. Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-28 21:23:11 +01:00
David Marchand	c3afd1cba6	service: separate statistics dump and reset No functional change intended. service_dump_calls_per_lcore() was always called with a 0 reset flag. service_dump_one() was called with either a 0 reset flag or a NULL FILE pointer. We can split the code for readability sake. Note: there is no path to resetting calls_per_service[], this is left as is. Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2020-10-27 13:21:01 +01:00
Ruifeng Wang	ced5a6ce24	lpm: hide internal data Fields except tbl24 and tbl8 in rte_lpm structure have no need to be exposed to the user. Hide the unneeded exposure of structure fields for better ABI maintainability. Suggested-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Acked-by: Kevin Traynor <ktraynor@redhat.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Signed-off-by: David Marchand <david.marchand@redhat.com>	2020-10-24 19:08:06 +02:00
Ruifeng Wang	0e8aa9970c	lpm: fix free of data structure The container structure should be freed instead of rte_lpm structure after wrapping rte_lpm into internal structure __rte_lpm. Fixes: `8a9f8564e9` ("lpm: implement RCU rule reclamation") Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Phil Yang <phil.yang@arm.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Kevin Traynor <ktraynor@redhat.com>	2020-10-24 19:08:06 +02:00
Dharmik Thakkar	769b2de7fb	hash: implement RCU resources reclamation Currently, users have to use external RCU mechanisms to free resources when using lock free hash algorithm. Integrate RCU QSBR process to make it easier for the applications to use lock free algorithm. Refer to RCU documentation to understand various aspects of integrating RCU library into other libraries. Suggested-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Signed-off-by: Dharmik Thakkar <dharmik.thakkar@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-10-24 09:25:13 +02:00
Dharmik Thakkar	f28202adff	rcu: build on Windows Build the lib for Windows. Signed-off-by: Dharmik Thakkar <dharmik.thakkar@arm.com> Tested-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Pallavi Kadam <pallavi.kadam@intel.com>	2020-10-24 08:54:03 +02:00
Thomas Monjalon	19653eed40	remove config prefix used with make The config options CONFIG_RTE_* are simple RTE_* defines with meson. Now that make support is dropped, update the names in logs and comments. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-23 19:25:21 +02:00
Thomas Monjalon	a796c922d2	mem: fix config name in error logs When introducing the new option CONFIG_RTE_MAX_MEM_MB_PER_TYPE, some logs were referencing a wrong name: CONFIG_RTE_MAX_MEM_PER_TYPE. Fixes: `66cc45e293` ("mem: replace memseg with memseg lists") Cc: stable@dpdk.org Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-23 19:25:21 +02:00
Thomas Monjalon	4d8d68abdc	eal: remove comment about old partition option The main initialization function (rte_eal_init) has documentation about a feature from another era: memory partition. Curiously, this lost treasure is found only now, suggesting there may be other interesting things to discover in the doc. To all aspiring Indiana Jones: the hunt is open! Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-23 19:25:21 +02:00
Yunjian Wang	1cdb89c177	eal: report duplicate device event callback We should return an error value, when the callback is already exist. Fixes: `a753e53d51` ("eal: add device event monitor framework") Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-23 13:35:56 +02:00
Yunjian Wang	3f01891391	eal: return errors on device event callback unregister Fix return value, using -EAGAIN instead of 0 when the callback is busy and using -ENOENT instead of 0 when the callback is not found. Fixes: `a753e53d51` ("eal: add device event monitor framework") Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Jeff Guo <jia.guo@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-23 13:35:50 +02:00
Yunjian Wang	c78bd27d4b	eal: fix leak on device event callback unregister The event_cb->dev_name is not freed when freeing event_cb, and this causes a memory leak. Fixes: `a753e53d51` ("eal: add device event monitor framework") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Jeff Guo <jia.guo@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-23 13:35:48 +02:00
Yunjian Wang	c2402fcaf9	efd: fix tailq entry leak in error path In rte_efd_create() allocated memory for tailq entry, we should free it when error happens, otherwise it will lead to memory leak. Fixes: `56b6ef874f` ("efd: new Elastic Flow Distributor library") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-10-22 22:07:15 +02:00
David Marchand	5a1c7b6ddd	hash: use x86 common flag for jhash jhash has been forgotten when factorising the x86 arch check. Fixes: `dbf17d44f3` ("hash: use common x86 flag") Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-10-22 22:07:15 +02:00
David Marchand	a5c369d486	bpf: use helper to install headers Libraries can use the headers variable to install headers. Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-10-22 14:15:19 +02:00
David Marchand	6b3848e211	build: fix version map file references in documentation Fixes: `63b3907833` ("build: remove library name from version map file name") Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-10-22 14:11:49 +02:00
Yunjian Wang	01072d52af	eal/linux: fix memory leak in uevent handling When the memory for uevent.devname is allocated in dev_uev_parse(). It is not freed when parse the subsystem layer fails in dev_uev_parse(). Before return, it is also not freed in dev_uev_handler(). These cause a memory leak. Fixes: `0d0f478d04` ("eal/linux: add uevent parse and process") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2020-10-20 16:01:37 +02:00
Tal Shnaiderman	ddfaa718b7	eal/windows: add missing stdint include Following the addition of the in_addr/in6_addr structs to in.h the header file must have stdint.h included for the definitions of the uint8_t/uint32_t types used within the new structs. Not having it could results in the following errors in places where in.h is included: in.h:30:2: error: unknown type name 'uint32_t' uint32_t s_addr; in.h:34:2: error: unknown type name 'uint8_t' uint8_t s6_addr[16]; Fixes: `f40a74cfcf` ("eal/windows: improve compatibility networking headers") Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2020-10-20 13:46:32 +02:00
Stephen Hemminger	cb056611a8	eal: rename lcore master and slave Replace master lcore with main lcore and replace slave lcore with worker lcore. Keep the old functions and macros but mark them as deprecated for this release. The "--master-lcore" command line option is also deprecated and any usage will print a warning and use "--main-lcore" as replacement. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2020-10-20 13:17:08 +02:00
Stephen Hemminger	0303192581	eal: add macro to mark macros as deprecated Add a macro that causes GCC and CLANG to emit a warning when a deprecated macro is used. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-10-20 11:42:29 +02:00
Gregory Etelson	174db36812	ethdev: rename tunnel flow offload callbacks Rename new rte_flow ops callbacks to emphasize relation to tunnel offload API. Signed-off-by: Gregory Etelson <getelson@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2020-10-19 23:28:22 +02:00
Stephen Hemminger	5b183ff611	ipc: fix spelling in log and comment Fixes spelling in comment and message about thread error. Found while looking at checkpatch complaints about "thead" Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-10-19 23:25:06 +02:00
Bruce Richardson	a8d0d473a0	build: replace use of old build macros Use the newer macros defined by meson in all DPDK source code, to ensure there are no errors when the old non-standard macros are removed. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Luca Boccassi <bluca@debian.org> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Rosen Xu <rosen.xu@intel.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2020-10-19 22:15:44 +02:00
Bruce Richardson	a20b2c01a7	build: standardize component names and defines As discussed on the dpdk-dev mailing list[1], we can make some easy improvements in standardizing the naming of the various components in DPDK, and their associated feature-enabled macros. Following this patch, each library will have the name in format, 'librte_<name>.so', and the macro indicating that library is enabled in the build will have the form 'RTE_LIB_<NAME>'. Similarly, for libraries, the equivalent name formats and macros are: 'librte_<class>_<name>.so' and 'RTE_<CLASS>_<NAME>', where class is the device type taken from the relevant driver subdirectory name, i.e. 'net', 'crypto' etc. To avoid too many changes at once for end applications, the old macro names will still be provided in the build in this release, but will be removed subsequently. [1] http://inbox.dpdk.org/dev/ef7c1a87-79ab-e405-4202-39b7ad6b0c71@solarflare.com/t/#u Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Luca Boccassi <bluca@debian.org> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Rosen Xu <rosen.xu@intel.com>	2020-10-19 22:15:34 +02:00
Bruce Richardson	63b3907833	build: remove library name from version map file name Since each version map file is contained in the subdirectory of the library it refers to, there is no need to include the library name in the filename. This makes things simpler in case of library renaming. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Luca Boccassi <bluca@debian.org> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Rosen Xu <rosen.xu@intel.com>	2020-10-19 22:13:59 +02:00
Ciara Power	1e6a661302	acl: check max SIMD bitwidth When choosing a vector path to take, an extra condition must be satisfied to ensure the max SIMD bitwidth allows for the CPU enabled path. These checks are added in the check alg helper functions. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Tested-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-19 16:45:02 +02:00
Ciara Power	13facf47d6	node: choose vector path at runtime When choosing the vector path, max SIMD bitwidth is now checked to ensure the vector path is suitable. To do this, the scalar function is chosen by default in the struct, but at node initialisation time, this function pointer is updated to the vector version if supported, and if it is within the max SIMD bitwidth limit. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Nithin Dabilpuram <ndabilpuram@marvell.com>	2020-10-19 16:45:02 +02:00
Ciara Power	209fd1984a	net: check max SIMD bitwidth When choosing a vector path to take, an extra condition must be satisfied to ensure the max SIMD bitwidth allows for the CPU enabled path. The vector path was initially chosen in RTE_INIT, however this is no longer suitable as we cannot check the max SIMD bitwidth at that time. Default handlers are now chosen on initialisation, these default handlers are used the first time the crc calc is called, and they set the suitable handlers to be used going forward. Suggested-by: Jasvinder Singh <jasvinder.singh@intel.com> Suggested-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Jasvinder Singh <jasvinder.singh@intel.com>	2020-10-19 16:45:02 +02:00
Ciara Power	ad2ffbf6d4	efd: check max SIMD bitwidth When choosing a vector path to take, an extra condition must be satisfied to ensure the max SIMD bitwidth allows for the CPU enabled path. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-10-19 16:45:02 +02:00
Ciara Power	c9cd5806f7	member: check max SIMD bitwidth When choosing a vector path to take, an extra condition must be satisfied to ensure the max SIMD bitwidth allows for the CPU enabled path. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-10-19 16:45:02 +02:00
Ciara Power	eec67546aa	distributor: check max SIMD bitwidth When choosing a vector path to take, an extra condition must be satisfied to ensure the max SIMD bitwidth allows for the CPU enabled path. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 16:45:02 +02:00
Ciara Power	580af30dd6	eal: control max SIMD bitwidth This patch adds a max SIMD bitwidth EAL configuration. The API allows for an app to set this value. It can also be set using EAL argument --force-max-simd-bitwidth, which will lock the value and override any modifications made by the app. Each arch has a define for the default SIMD bitwidth value, this is used on EAL init to set the config max SIMD bitwidth. Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2020-10-19 16:45:02 +02:00
Stephen Hemminger	17b347dab7	malloc: add alloc_size attribute to functions By using the alloc_size() attribute the compiler can optimize better and detect errors at compile time. For example, Gcc will fail one of the invalid allocation examples in app/test/test_malloc.c because the allocation is outside the limits of memory. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-10-19 16:25:43 +02:00
Hemant Agrawal	f030bff72f	bitrate: add free function This patch adds support for free function. Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-10-19 16:08:36 +02:00
Stephen Hemminger	bb548625c6	eal/linux: add function to allow interruptible epoll The existing definition of rte_epoll_wait retries if interrupted by a signal. This behavior makes it hard to use rte_epoll_wait for applications that want to use signals do do things like exit polling loop and shutdown. Since changing existing semantic might break applications, add a new rte_epoll_wait_interruptible() function that does the same thing as rte_epoll_wait but will return -1 and errno of EINTR if it receives a signal. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Reviewed-by: Harman Kalra <hkalra@marvell.com>	2020-10-19 12:17:25 +02:00
Lukasz Wojciechowski	20fa39d230	distributor: fix clearing returns buffer The patch clears distributors returns buffer in clear_returns() by setting start and count to 0. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	91d6b8235e	distributor: fix flushing in flight packets rte_distributor_flush() is using total_outstanding() function to calculate if it should still wait for processing packets. However in burst mode only backlog packets were counted. This patch fixes that issue by counting also in flight packets. There are also sum fixes to properly keep count of in flight packets for each worker in bufs[].count. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	626ceefbf4	distributor: fix scalar matching Fix improper indexes while comparing tags. In the find_match_scalar() function: * j iterates over flow tags of following packets; * w iterates over backlog or in flight tags positions. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	ed4be82d9e	distributor: fix API documentation After introducing burst API there were some artefacts in the API documentation from legacy single API. Also the rte_distributor_poll_pkt() function return values mismatched the implementation. Fixes: `c0de0eb82e` ("distributor: switch over to new API") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	f25fe0d5e3	distributor: fix return pkt calls in single mode In the single legacy version of the distributor synchronization requires continues exchange of buffers between distributor and workers. Empty buffers are sent if only handshake synchronization is required. However calls to the rte_distributor_return_pkt() with 0 buffers in single mode were ignored and not passed to the legacy algorithm implementation causing lack of synchronization. This patch fixes this issue by passing NULL as buffer which is a valid way of sending just synchronization handshakes in single mode. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	480d5a7c81	distributor: handle worker shutdown in burst mode The burst version of distributor implementation was missing proper handling of worker shutdown. A worker processing packets received from distributor can call rte_distributor_return_pkt() function informing distributor that it want no more packets. Further calls to rte_distributor_request_pkt() or rte_distributor_get_pkt() however should inform distributor that new packets are requested again. Lack of the proper implementation has caused that even after worker informed about returning last packets, new packets were still sent from distributor causing deadlocks as no one could get them on worker side. This patch adds handling shutdown of the worker in following way: 1) It fixes usage of RTE_DISTRIB_VALID_BUF handshake flag. This flag was formerly unused in burst implementation and now it is used for marking valid packets in retptr64 replacing invalid use of RTE_DISTRIB_RETURN_BUF flag. 2) Uses RTE_DISTRIB_RETURN_BUF as a worker to distributor handshake in retptr64 to indicate that worker has shutdown. 3) Worker that shuts down blocks also bufptr for itself with RTE_DISTRIB_RETURN_BUF flag allowing distributor to retrieve any in flight packets. 4) When distributor receives information about shutdown of a worker, it: marks worker as not active; retrieves any in flight and backlog packets and process them to different workers; unlocks bufptr64 by clearing RTE_DISTRIB_RETURN_BUF flag and allowing use in the future if worker requests any new packets. 5) Do not allow to: send or add to backlog any packets for not active workers. Such workers are also ignored if matched. 6) Adjust calls to handle_returns() and tags matching procedure to react for possible activation deactivation of workers. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	6bd951b482	distributor: fix buffer use after free rte_distributor_request_pkt and rte_distributor_get_pkt dereferenced oldpkt parameter when in RTE_DIST_ALG_SINGLE even if number of returned buffers from worker to distributor was 0. This patch passes NULL to the legacy API when number of returned buffers is 0. This allows passing NULL as oldpkt parameter. Distributor tests are also updated passing NULL as oldpkt and 0 as number of returned packets, where packets are not returned. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	5acce079e7	distributor: fix handshake deadlock Synchronization of data exchange between distributor and worker cores is based on 2 handshakes: retptr64 for returning mbufs from workers to distributor and bufptr64 for passing mbufs to workers. Without proper order of verifying those 2 handshakes a deadlock may occur. This can happen when worker core wants to return back mbufs and waits for retptr handshake to be cleared while distributor core waits for bufptr to send mbufs to worker. This can happen as worker core first returns mbufs to distributor and later gets new mbufs, while distributor first releases mbufs to worker and later handle returning packets. This patch fixes possibility of the deadlock by always taking care of returning packets first on the distributor side and handling packets while waiting to release new. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com>	2020-10-19 10:57:17 +02:00
Lukasz Wojciechowski	bea84d5592	distributor: fix handshake synchronization rte_distributor_return_pkt function which is run on worker cores must wait for distributor core to clear handshake on retptr64 before using those buffers. While the handshake is set distributor core controls buffers and any operations on worker side might overwrite buffers which are unread yet. Same situation appears in the legacy single distributor. Function rte_distributor_return_pkt_single shouldn't modify the bufptr64 until handshake on it is cleared by distributor lcore. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: Lukasz Wojciechowski <l.wojciechow@partner.samsung.com> Acked-by: David Hunt <david.hunt@intel.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2020-10-19 10:57:17 +02:00

... 2 3 4 5 6 ...

6926 Commits