numam-dpdk

Author	SHA1	Message	Date
Pallavi Kadam	1812f86efa	eal: disable syslog and dlopen on Windows Excluding syslog/ dlfcn definitions and parameters from Windows by adding #ifndef RTE_EXEC_ENV_WINDOWS. Note: This is a temporary change. In future, separate 'unix' directory will be created for unix specific functions. Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Reviewed-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com> Acked-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com>	2020-02-12 22:50:29 +01:00
Pallavi Kadam	208393d67e	eal/x86: include SSE4 support on Windows Modified common/include/arch/x86/rte_vect.h to include SSE4 header for Windows. Signed-off-by: Antara Ganesh Kolar <antara.ganesh.kolar@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Reviewed-by: Keith Wiles <keith.wiles@intel.com> Reviewed-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com> Acked-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com>	2020-02-12 22:50:29 +01:00
Pallavi Kadam	8938998a29	eal/windows: detect process type Adding a function to detect process type, also included header files to contain suitable function declarations and to support extra warning flags. Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Signed-off-by: Antara Ganesh Kolar <antara.ganesh.kolar@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Reviewed-by: Keith Wiles <keith.wiles@intel.com> Reviewed-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com> Acked-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com>	2020-02-12 22:50:29 +01:00
Pallavi Kadam	5e373e456e	eal/windows: add getopt implementation Adding getopt files to support parsing option on Windows. The original contribution is under BSD-2 license. https://github.com/greenplum-db/libusual/blob/master/usual/getopt.c https://github.com/greenplum-db/libusual/blob/master/usual/getopt.h Signed-off-by: Antara Ganesh Kolar <antara.ganesh.kolar@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Reviewed-by: Keith Wiles <keith.wiles@intel.com> Reviewed-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com> Acked-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com>	2020-02-12 22:50:29 +01:00
Pallavi Kadam	e8428a9d89	eal/windows: add some basic functions and macros Adding additional function definitions for pthread, cpuset implementation, asprintf implementation, in order to support common code. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Reviewed-by: Keith Wiles <keith.wiles@intel.com> Reviewed-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com> Acked-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com>	2020-02-12 22:50:29 +01:00
Pallavi Kadam	6e1ed4cbbe	eal/windows: add dirent implementation Adding dirent.h on Windows to support common code. eal_common_options.c includes this file. The original contribution is under MIT license. https://github.com/tronkko/dirent Signed-off-by: Antara Ganesh Kolar <antara.ganesh.kolar@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Reviewed-by: Keith Wiles <keith.wiles@intel.com> Reviewed-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com> Acked-by: Narcisa Ana Maria Vasile <narcisa.vasile@microsoft.com>	2020-02-12 22:50:29 +01:00
Thomas Monjalon	43e34a229d	build: remove redundant config include The header file rte_config.h is always included by make or meson. If required in an exported API header file, it must be included in the public header file for external applications. In the internal files, explicit include of rte_config.h is useless, and can be removed. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Matan Azrad <matan@mellanox.com> Acked-by: David Marchand <david.marchand@redhat.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2020-02-11 16:50:59 +01:00
Harman Kalra	9058afaa26	eal: check if running in interrupt context Added an API to check if current execution is in interrupt context. This will be helpful to handle nested interrupt cases. Signed-off-by: Harman Kalra <hkalra@marvell.com> Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com>	2020-02-06 16:35:40 +01:00
Vladimir Medvedkin	247a38c520	fib: fix possible integer overflow This commit fixes possible integer overflow for prev_idx in build_common_root() CID 350596 and tbl8_idx in write_edge() CID 350597 Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) overflow_before_widen: Potentially overflowing expression tbl8_idx * 256 with type int (32 bits, signed) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type uint64_t (64 bits, unsigned). Coverity issue: 350596, 350597 Fixes: `c3e12e0f03` ("fib: add dataplane algorithm for IPv6") Cc: stable@dpdk.org Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com>	2020-02-06 16:17:14 +01:00
Viacheslav Ovsiienko	545fa736b9	mbuf: fix pinned memory function style Minor style issue is fixed. Fixes: `6c8e50c2e5` ("mbuf: create pool with external memory buffers") Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-02-06 16:17:14 +01:00
Stephen Hemminger	5b6eaea8ea	mbuf: display more fields in dump The rte_pktmbuf_dump should display offset, refcount, and vlan info since these are often useful during debugging. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-02-06 16:17:14 +01:00
Stephen Hemminger	292f02b58c	mem: fix munmap in error unwind The loop to unwind existing mmaps was only unmapping the first segment and the error paths after mmap() were not doing munmap of the current segment. Fixes: `66cc45e293` ("mem: replace memseg with memseg lists") Cc: stable@dpdk.org Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2020-02-06 15:39:30 +01:00
Thomas Monjalon	03d24a16f7	ring: fix namespace prefix of inline functions When adding custom element size feature, some internal inline functions were added in a public header without rte_ prefix. It is fixed by adding __rte_ring_. Fixes: `cc4b218790` ("ring: support configurable element size") Reported-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-02-06 11:48:32 +01:00
Junxiao Shi	0e8d1ea327	bpf: fix headers install with meson Previously, when librte_bpf is built with meson+ninja, its headers such as bpf_def is not installed to the system. This commit fixes this problem. Fixes: `94972f35a0` ("bpf: add BPF loading and execution framework") Cc: stable@dpdk.org Signed-off-by: Junxiao Shi <git@mail1.yoursunny.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-02-06 09:20:07 +01:00
Marcin Smoczynski	957394f726	ipsec: support CPU crypto mode Update library to handle CPU cypto security mode which utilizes cryptodev's synchronous, CPU accelerated crypto operations. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Marcin Smoczynski <marcinx.smoczynski@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-02-05 15:29:59 +01:00
Marcin Smoczynski	5d6d7e443d	security: add CPU crypto action type Introduce CPU crypto action type allowing to differentiate between regular async 'none security' and synchronous, CPU crypto accelerated sessions. This mode is similar to ACTION_TYPE_NONE but crypto processing is performed synchronously on a CPU. Signed-off-by: Marcin Smoczynski <marcinx.smoczynski@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-02-05 15:29:49 +01:00
Marcin Smoczynski	7adf992fb9	cryptodev: introduce CPU crypto API Add new API allowing to process crypto operations in a synchronous manner. Operations are performed on a set of SG arrays. Cryptodevs which allows CPU crypto operation mode have to use RTE_CRYPTODEV_FF_SYM_CPU_CRYPTO capability. Add a helper method to easily convert mbufs to a SGL form. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Marcin Smoczynski <marcinx.smoczynski@intel.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-02-05 15:29:14 +01:00
Vladimir Medvedkin	a9f31a90bb	ipsec: move SAD name length Move IPSEC_SAD_NAMESIZE into public header and rename it to RTE_IPSEC_SAD_NAMESIZE Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com> Acked-by: Anoob Joseph <anoobj@marvell.com>	2020-02-05 15:20:51 +01:00
Maxime Coquelin	c6420a3632	vhost: catch overflow causing mmap of size 0 This patch catches an overflow that could happen if an invalid region size or page alignment is provided by the guest via the VHOST_USER_SET_MEM_TABLE request. If the sum of the size to mmap and the alignment overflows uint64_t, then RTE_ALIGN_CEIL(mmap_size, alignment) macro will return 0. This value was passed as is as size argument to mmap(). While kernel handling of mmap() syscall returns an error if size is 0, it is better to catch it earlier and provide a meaningful error log. Fixes: `ec09c280b8` ("vhost: fix mmap not aligned with hugepage size") Cc: stable@dpdk.org Reported-by: Ilja Van Sprundel <ivansprundel@ioactive.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Tiwei Bie <tiwei.bie@intel.com>	2020-02-05 11:47:18 +01:00
Adrian Moreno	c5a910dd92	vhost: fix packed virtqueue ready condition Consider a virtqueue ready when, apart from the descriptor area, both event suppression areas have been mapped. Fixes: `2d1541e2b6` ("vhost: add vring address setup for packed queues") Cc: stable@dpdk.org Signed-off-by: Adrian Moreno <amorenoz@redhat.com> Reviewed-by: Tiwei Bie <tiwei.bie@intel.com>	2020-02-05 11:47:18 +01:00
Fan Zhang	03df3c7473	vhost/crypto: fix fetch size This patch fixes the incorrect rte_vhost_crypto_fetch_requests return value. Coverity issue: 343401 Fixes: `3bb595ecd6` ("vhost/crypto: add request handler") Cc: stable@dpdk.org Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 11:47:18 +01:00
Eugenio Pérez	cdf1dc5e6a	vhost: flush shadow Tx if no more packets The current implementation of vhost_net in packed vring tries to fill the shadow vector before send any actual changes to the guest. While this can be beneficial for the throughput, it conflicts with some bufferfloats methods like the linux kernel napi, that stops transmitting packets if there are too much bytes/buffers in the driver. To solve it, we flush the shadow packets at the end of virtio_dev_tx_packed if we have starved the vring, i.e. the next buffer is not available for the device. Since this last check can be expensive because of the atomic, we only check it if we have not obtained the expected "count" packets. If it happens to obtain "count" packets and there is no more available packets the caller needs to keep call virtio_dev_tx_packed again. Fixes: `31d6c6a5b8` ("vhost: optimize packed ring dequeue") Cc: stable@dpdk.org Signed-off-by: Eugenio Pérez <eperezma@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 11:47:18 +01:00
Vitaliy Mysak	bedf87c521	vhost: do not treat empty socket message as error According to recvmsg() specification, 0 is a valid return code when client is disconnecting. Therefore, it should not be reported as error, unless there are other dependencies that require message to not be empty. But there are none, since the next immediate caller of recvmsg() reports "vhost peer closed" info (not error) when message is empty. This patch changes return code check for recvmsg() so that misleading error message is not printed when the code is 0. Fixes: `8f972312b8` ("vhost: support vhost-user") Cc: stable@dpdk.org Signed-off-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-by: Tiwei Bie <tiwei.bie@intel.com>	2020-02-05 11:47:18 +01:00
Zhike Wang	499fd8e5b8	vhost: fix crash on port deletion The vhost_user_read_cb() and rte_vhost_driver_unregister() can be called at the same time by 2 threads. Eg thread1 calls vhost_user_read_cb() and removes the vsocket from conn_list, then thread2 calls rte_vhost_driver_unregister() and frees the vsocket since it is NOT in the conn_list. So thread1 will access invalid memory when trying to reconnect. The fix is to move the "removing of vsocket from conn_list" to end of the vhost_user_read_cb(), then avoid the race condition. The core trace is: Program terminated with signal 11, Segmentation fault. Fixes: `af14759181` ("vhost: introduce API to start a specific driver") Cc: stable@dpdk.org Signed-off-by: Zhike Wang <wangzhike@jd.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 11:47:18 +01:00
Takeshi Yoshimura	986f2134c3	vfio: fix mapping failures in ppc64le ppc64le failed when using large physical memory. I found problems in my two commits in the past. In commit `e072d16f89` ("vfio: fix expanding DMA area in ppc64le"), I added a sanity check using a mapped address to resolve an issue around expanding IOMMU window, but this was not enough, since memory allocation can return memory anywhere dependent on memory fragmentation. DPDK may still skip DMA mapping and attempts to unmap non-mapped DMA during expanding IOMMU window. As a result, SPDK apps using large physical memory frequently failed to proceed the communication with NVMe and/or went into an infinite loop. The root cause of the bug was in a gap between memory segments managed by DPDK and firmware-level DMA mapping. DPDK's memory segments don't contain the state of DMA mapping, and so, the memesg_walk cannot determine if an iterated memory segment is mapped or not. This resulted in incorrect DMA maps and unmaps. At this time, I added the code to avoid iterating non-mapped memory segments during DMA mapping. The memseg_walk iterates over memory segments marked as "used", and so, the code sets memory segments that will be mapped or unmapped as "free" transiently. The commit `db90b4969e` ("vfio: retry creating sPAPR DMA window") allows retring different page levels and sizes to create DMA window. However, this allows page sizes different from hugepage sizes. This inconsistency caused failures at the time of DMA mapping after the window creation. This patch fixes to retry only different page levels. Fixes: `e072d16f89` ("vfio: fix expanding DMA area in ppc64le") Fixes: `db90b4969e` ("vfio: retry creating sPAPR DMA window") Cc: stable@dpdk.org Signed-off-by: Takeshi Yoshimura <tyos@jp.ibm.com> Reviewed-by: David Christensen <drc@linux.vnet.ibm.com>	2020-02-05 21:57:21 +01:00
Andrzej Ostruszka	f2318b73c2	build: remove unneeded function versioning Timer, LPM and Distributor libraries no longer use function versioning and therefore do not need separate build for static and shared version of libraries. This patch removes use_function_versioning from their meson build files and corresponding include from the sources. Fixes: `f2fb215843` ("timer: remove deprecated code") Fixes: `6e5b516761` ("distributor: remove deprecated code") Fixes: `c381a8d554` ("lpm: remove deprecated code") Cc: stable@dpdk.org Signed-off-by: Andrzej Ostruszka <aostruszka@marvell.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-02-05 21:27:29 +01:00
Stephen Hemminger	dd80d40897	pdump: use dynamic log type The logtype USER1 should not be overloaded for library function. Instead use a dynamic log type. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Reshma Pattan <reshma.pattan@intel.com>	2020-02-05 19:42:24 +01:00
Stephen Hemminger	8f6a3ebcbe	pdump: use mbuf copy function The rte_pktmbuf_copy handles varying size mbuf pools correctly. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Reshma Pattan <reshma.pattan@intel.com>	2020-02-05 19:42:24 +01:00
Honnappa Nagarahalli	f3822eb6fb	hash: fix lock-free flag doxygen Lock-free extendable table is supported. Correct the comments. Fixes: `f401363d98` ("hash: support lock-free extendable bucket") Cc: stable@dpdk.org Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2020-02-05 19:42:24 +01:00
Thomas Monjalon	f5862ae99e	cryptodev: revert Chacha20-Poly1305 AEAD algorithm API makes think that rte_cryptodev_info_get() cannot return a value >= 3 (RTE_CRYPTO_AEAD_LIST_END in 19.11). 20.02-rc1 was returning 3 (RTE_CRYPTO_AEAD_CHACHA20_POLY1305). So the ABI compatibility contract was broken. It could be solved with some function versioning, but because a lack of time, the feature is reverted for now. This reverts following commits: - `6c9f3b347e` ("cryptodev: add Chacha20-Poly1305 AEAD algorithm") - `2c512e64d6` ("crypto/qat: support Chacha Poly") - `d55e01f579` ("test/crypto: add Chacha Poly cases") Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2020-02-05 15:14:46 +01:00
David Marchand	251d69a26e	hash: fix meson headers packaging Those headers are internal and should not be distributed. Fixes: `5b9656b157` ("lib: build with meson") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Luca Boccassi <bluca@debian.org>	2020-02-05 15:14:46 +01:00
Pavan Nikhilesh	9196db904b	lib: use common macro RTE_DIM Use RTE_DIM to calculate array size. Suggested-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Acked-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-02-05 14:37:41 +01:00
David Marchand	8674b203f1	eal: remove limitation on cpuset with --lcores Contrary to the -c/-l options, where a logical core runs on the same physical core in a 1:1 fashion (example: lcore 0 runs on core 0, lcore 16 runs on core 16), the --lcores option makes it possible to select the physical cores on which runs a logical core. However the current parsing code still limits the cpuset to the [0, RTE_MAX_LCORE] range. Example, before the patch, on a 24 cores system with RTE_MAX_LCORE == 16: $ ./master/app/testpmd --no-huge --no-pci -m 512 --log-level :debug \ --lcores 0@16,1@17 -- -i --total-num-mbufs 2048 EAL: Detected lcore 0 as core 0 on socket 0 EAL: Detected lcore 1 as core 1 on socket 0 EAL: Detected lcore 2 as core 2 on socket 0 EAL: Detected lcore 3 as core 3 on socket 0 EAL: Detected lcore 4 as core 4 on socket 0 EAL: Detected lcore 5 as core 5 on socket 0 EAL: Detected lcore 6 as core 6 on socket 0 EAL: Detected lcore 7 as core 8 on socket 0 EAL: Detected lcore 8 as core 9 on socket 0 EAL: Detected lcore 9 as core 10 on socket 0 EAL: Detected lcore 10 as core 11 on socket 0 EAL: Detected lcore 11 as core 12 on socket 0 EAL: Detected lcore 12 as core 13 on socket 0 EAL: Detected lcore 13 as core 14 on socket 0 EAL: Detected lcore 14 as core 0 on socket 0 EAL: Detected lcore 15 as core 1 on socket 0 EAL: Skipped lcore 16 as core 2 on socket 0 EAL: Skipped lcore 17 as core 3 on socket 0 EAL: Skipped lcore 18 as core 4 on socket 0 EAL: Skipped lcore 19 as core 5 on socket 0 EAL: Skipped lcore 20 as core 6 on socket 0 EAL: Skipped lcore 21 as core 8 on socket 0 EAL: Skipped lcore 22 as core 9 on socket 0 EAL: Skipped lcore 23 as core 10 on socket 0 EAL: Skipped lcore 24 as core 11 on socket 0 EAL: Skipped lcore 25 as core 12 on socket 0 EAL: Skipped lcore 26 as core 13 on socket 0 EAL: Skipped lcore 27 as core 14 on socket 0 EAL: Support maximum 16 logical core(s) by configuration. EAL: Detected 16 lcore(s) EAL: Detected 1 NUMA nodes EAL: invalid parameter for --lcores We can remove this limitation by using a cpuset_t (which is a more natural type since this is what gets passed to pthread_setaffinity in the end). After the patch: $ ./master/app/testpmd --no-huge --no-pci -m 512 --log-level *:debug \ --lcores 0@16,1@17 -- -i --total-num-mbufs 2048 [...] EAL: Master lcore 0 is ready (tid=7f94217bbc00;cpuset=[16]) EAL: lcore 1 is ready (tid=7f941f491700;cpuset=[17]) Signed-off-by: David Marchand <david.marchand@redhat.com>	2020-01-21 01:22:33 +01:00
David Marchand	927b5499c5	eal: log all detected cores on startup Add debug logs to have a trace of unused cores for -c/-l options on systems with more cores than RTE_MAX_LCORE. Signed-off-by: David Marchand <david.marchand@redhat.com>	2020-01-21 01:22:16 +01:00
David Marchand	9f1d16c3b7	eal: do not cache lcore detection state We use this state in control path only for services cores and -c/-l options. The value is not updated when using --lcores. Use the internal helper where needed. Signed-off-by: David Marchand <david.marchand@redhat.com>	2020-01-21 00:55:49 +01:00
David Marchand	be67d05759	eal/windows: fix cpuset macro name Fix the name of CPU_SETSIZE in hope we can reuse it in other parts of the dpdk manipulating some rte_cpuset_t. Fixes: `4dc2b4d2a4` ("eal/windows: add headers for compatibility") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com>	2020-01-21 00:39:21 +01:00
Viacheslav Ovsiienko	6c8e50c2e5	mbuf: create pool with external memory buffers The dedicated routine rte_pktmbuf_pool_create_extbuf() is provided to create mbuf pool with data buffers located in the pinned external memory. The application provides the external memory description and routine initializes each mbuf with appropriate virtual and physical buffer address. It is entirely application responsibility to register external memory with rte_extmem_register() API, map this memory, etc. The new introduced flag RTE_PKTMBUF_POOL_F_PINNED_EXT_BUF is set in private pool structure, specifying the new special pool type. The allocated mbufs from pool of this kind will have the EXT_ATTACHED_MBUF flag set and initialiazed shared info structure, allowing cloning with regular mbufs (without attached external buffers of any kind). Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-01-20 23:36:38 +01:00
Viacheslav Ovsiienko	6ef1107ad4	mbuf: detach mbuf with pinned external buffer Update detach routine to check the mbuf pool type. Introduce the special internal version of detach routine to handle the special case of pinned external bufferon mbuf freeing. Signed-off-by: Shahaf Shuler <shahafs@mellanox.com> Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-01-20 23:36:15 +01:00
Viacheslav Ovsiienko	c621a9b66b	mbuf: introduce routine to get private mbuf pool flags The routine rte_pktmbuf_priv_flags is introduced to fetch the flags from the mbuf memory pool private structure in unified fashion. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-01-20 23:30:38 +01:00
Jerin Jacob	3e6181b070	mbuf: use structure marker from EAL Use new marker typedef available in EAL and remove private marker typedef. Signed-off-by: Jerin Jacob <jerinj@marvell.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-01-20 21:17:35 +01:00
Jerin Jacob	2b393160a4	eal: introduce structure markers Introduce EAL typedef for structure 1B, 2B, 4B, 8B alignment marking and a generic marker for a point in a structure. Signed-off-by: Jerin Jacob <jerinj@marvell.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-01-20 21:17:35 +01:00
Artur Trybula	ec40fe669f	mem: improve log message for too low memzone segments In case of too low number of memzone segments user notification was misleading. This patch improves the description by providing better explanation about the cause. Signed-off-by: Artur Trybula <arturx.trybula@intel.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2020-01-20 18:52:52 +01:00
Bruce Richardson	9ed3385f71	eal/freebsd: update CPU macro for FreeBSD 13 In (currently unreleased) FreeBSD 13, the CPU_NAND macro has been renamed to CPU_ANDNOT, so we need to use different DPDK-specific macros depending on what system-defined ones are present. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2020-01-20 17:11:48 +01:00
Eelco Chaudron	30512af820	meter: remove experimental flag from RFC4115 trTCM API Moved RFC4115 APIs to non-experimental as they have been there since 19.02. Also, these APIs are the same as the non RFC4115 APIs. Signed-off-by: Eelco Chaudron <echaudro@redhat.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-01-20 16:37:27 +01:00
Jörg Thalheim	a18e79cd3c	mbuf: improve API doc for attaching external buffer Enhance API documentation of rte_pktmbuf_attach_extbuf() to explain that the attached mbuf is initialized with length = 0. Link: https://bugs.dpdk.org/show_bug.cgi?id=362 Signed-off-by: Jörg Thalheim <joerg@thalheim.io> Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2020-01-20 16:37:27 +01:00
Jerin Jacob	3f2d6766e3	mempool: remove memory wastage on non-x86 The existing optimize_object_size() function address the memory object alignment constraint on x86 for better performance. Different (micro) architecture may have different memory alignment constraint for better performance and it not the same as the existing optimize_object_size(). Some use, XOR(kind of CRC) scheme to enable DRAM channel distribution based on the address and some may have a different formula. Introducing arch_mem_object_align() function to abstract the difference between different (micro) architectures to avoid wasting memory for mempool object alignment for the architecture that it is not required to do so. Details on the amount of memory saving: Currently, arm64 based architectures use the default (nchan=4, nrank=1). The worst case is for an object whose size (including mempool header) is 2 cache lines, where it is optimized to 3 cache lines (+50%). Examples for cache lines size = 64: orig optimized 64 -> 64 +0% 128 -> 192 +50% 192 -> 192 +0% 256 -> 320 +25% 320 -> 320 +0% 384 -> 448 +16% ... 2304 -> 2368 +2.7% (~mbuf size) Additional details: https://www.mail-archive.com/dev@dpdk.org/msg149157.html Signed-off-by: Jerin Jacob <jerinj@marvell.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-01-20 16:37:27 +01:00
Olivier Matz	43503c59ad	mempool: fix populate with small virtual chunks To populate a mempool with a virtual area, the mempool code calls rte_mempool_populate_iova() for each iova-contiguous area. It happens (rarely) that this area is too small to store one object. In this case, rte_mempool_populate_iova() returns an error, which is forwarded by rte_mempool_populate_virt(). This case should not throw an error in rte_mempool_populate_virt(). Instead, the area that is too small should just be ignored. To fix this issue, change the return value of rte_mempool_populate_iova() to 0 when no object can be populated, so it can be ignored by the caller. As this would be an API/ABI change, only do this modification internally for now. Fixes: `354788b60c` ("mempool: allow populating with unaligned virtual area") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Tested-by: Anatoly Burakov <anatoly.burakov@intel.com> Tested-by: Alvin Zhang <alvinx.zhang@intel.com>	2020-01-20 16:34:54 +01:00
Olivier Matz	3a3d0c75b4	mempool: fix slow allocation of large pools When allocating a mempool which is larger than the largest available area, it can take a lot of time: a- the mempool calculate the required memory size, and tries to allocate it, it fails b- then it tries to allocate the largest available area (this does not request new huge pages) c- add this zone to the mempool, this triggers the allocation of a mem hdr, which request a new huge page d- back to a- until mempool is populated or until there is no more memory This can take a lot of time to finally fail (several minutes): in step a- it takes all available hugepages on the system, then release them after it fails. The problem appeared with commit `eba11e3646` ("mempool: reduce wasted space on populate"), because smaller chunks are now allowed. Previously, it had to be at least one page size, which is not the case in step b-. To fix this, implement our own way to allocate the largest available area instead of using the feature from memzone: if an allocation fails, try to divide the size by 2 and retry. When the requested size falls below min_chunk_size, stop and return an error. Fixes: `eba11e3646` ("mempool: reduce wasted space on populate") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Reviewed-by: Andrew Rybchenko <arybchenko@solarflare.com> Tested-by: Ali Alnubani <alialnu@mellanox.com>	2020-01-20 12:12:20 +01:00
Olivier Matz	f159c61c35	mempool: fix anonymous populate The documentation says that a negative errno is returned on error, but in most places that's not the case. Fix the documentation and the exceptions in code. The second one (return from populate_virt) also fixes a memory leak. Note that testpmd was using the function correctly. Fixes: `aa10457eb4` ("mempool: make mempool populate and free api public") Fixes: `6780f72fb8` ("mempool: populate with anonymous memory") Fixes: `66e7ba0bad` ("mempool: ensure mempool is initialized before populating") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2020-01-20 12:12:20 +01:00
Anoob Joseph	725e515ed1	ethdev: allow multiple security sessions to use flow rule The rte_security API which enables inline protocol/crypto feature mandates that for every security session an rte_flow is created. This would internally translate to a rule in the hardware which would do packet classification. In rte_security, one SA would be one security session. And if an rte_flow need to be created for every session, the number of SAs supported by an inline implementation would be limited by the number of rte_flows the PMD would be able to support. If the fields SPI & IP addresses are allowed to be a range, then this limitation can be overcome. Multiple flows will be able to use one rule for SECURITY processing. In this case, the security session provided as conf would be NULL. Application should do an rte_flow_validate() to make sure the flow is supported on the PMD. Signed-off-by: Anoob Joseph <anoobj@marvell.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Ori Kam <orika@mellanox.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com>	2020-01-20 12:12:20 +01:00
Stephen Hemminger	cd7c59dc04	timer: add API to query ticks until the next timer It is useful to know when the next timer will expire when using rte_epoll_wait (or sleep when idle). This experimental API provides a hook to query the number of ticks remaining. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Erik Gabriel Carrillo <erik.g.carrillo@intel.com>	2020-01-20 01:55:19 +01:00
Reshma Pattan	2a178702c0	latency: fix calculation for multi-thread Make latency calculation multithread safe by using spinlock. Fixes: `5cd3cac9ed` ("latency: added new library for latency stats") Cc: stable@dpdk.org Signed-off-by: Reshma Pattan <reshma.pattan@intel.com>	2020-01-20 01:32:50 +01:00
Kumar Amber	f6320e3c11	hash: add max key id query API Adding new API function to query the maximum key ID that could possibly be returned by rte_hash_add_key and rte_hash_add_key_with_hash. When RTE_HASH_EXTRA_FLAGS_MULTI_WRITER_ADD is set, the maximum key id is larger than the entry count specified by the user. Signed-off-by: Kumar Amber <kumar.amber@intel.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-01-20 01:14:57 +01:00
Dharmik Thakkar	cd723fc70c	hash: remove unnecessary locks in lock-free Remove __hash_rw_reader_unlock() calls from lock free hash lookup Signed-off-by: Dharmik Thakkar <dharmik.thakkar@arm.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-01-20 00:47:46 +01:00
Liron Himi	94f73b5b6e	cfgfile: fix symbols map rte_cfgfile_section_num_entries_by_index was missing from the map file. meson build failed when calling this function, due to linking a binary to cfgfile built as a shared library. Fixes: `3d2e0448eb` ("cfgfile: add section number of entries by index") Cc: stable@dpdk.org Signed-off-by: Liron Himi <lironh@marvell.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-01-20 00:29:49 +01:00
Ali Alnubani	e8a17faa5e	eal/linux: fix build when VFIO is disabled The header linux/version.h isn't included when CONFIG_RTE_EAL_VFIO is explicitly disabled. LINUX_VERSION_CODE and KERNEL_VERSION are therefore undefined, causing the build failure: lib/librte_eal/linux/eal/eal.c: In function ‘rte_eal_init’: lib/librte_eal/linux/eal/eal.c:1076:32: error: "LINUX_VERSION_CODE" is not defined, evaluates to 0 [-Werror=undef] Fixes: `a0dede62a5` ("eal/linux: remove KNI restriction on IOVA") Cc: stable@dpdk.org Signed-off-by: Ali Alnubani <alialnu@mellanox.com>	2020-01-20 00:08:53 +01:00
Xiaoyu Min	12e6e3e78f	ethdev: add API to dump device internal flow info Introduce an API which dump the device's internal representation information of rte flows in hardware. Signed-off-by: Xiaoyu Min <jackmin@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-01-17 19:59:19 +01:00
Fang TongHao	99ef90317d	ethdev: fix secondary process memory overwrite Avoid overwriting device flags and other information in device data stored in shared memory when a secondary process probes PCI device. Fixes: `494adb7f63` ("ethdev: add device fields from PCI layer") Cc: stable@dpdk.org Signed-off-by: Fang TongHao <fangtonghao@sangfor.com.cn> Reviewed-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-01-17 19:59:19 +01:00
Wei Hu (Xavier)	2e02a2afff	ethdev: fix VLAN offloads set if no driver callback Currently, there is a potential problem that changing the content of dev->data->dev_conf.rxmode.offloads even when there is no vlan_offload_set driver callback. It is a good idea that prevent the side effect and make the API return success if no change requested. This patch fixes the problem, the detail information as below: - keep possibility to do dummy set even if there is no driver callback - do not touch Rx mode offloads in device data before checking the driver callback availability - ensure that Rx mode offloads are rolled back correctly if driver callback returns error Fixes: `81f9db8ecc` ("ethdev: add vlan offload support") Cc: stable@dpdk.org Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chunsong Feng <fengchunsong@huawei.com> Signed-off-by: Min Wang (Jushui) <wangmin3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Reviewed-by: Andrew Rybchenko <arybchenko@solarflare.com>	2020-01-17 19:59:19 +01:00
Viacheslav Ovsiienko	7e9165b1be	ethdev: fix switching domain allocation The maximum amount of unique swutching domain is supposed to be equal RTE_MAX_ETHPORTS. Current implementation allows to allocate only RTE_MAX_ETHPORTS-1 domains. The definition of RTE_ETH_DEV_SWITCH_DOMAIN_ID_INVALID is changed from 0 to UINT16_MAX, the rte_eth_dev_info_get is updated to initialize dev_ibfo structure accordingly. Fixes: `ce92504063` ("ethdev: add switch domain allocator") Cc: stable@dpdk.org Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-01-17 19:59:19 +01:00
Maxime Coquelin	5efb18e85f	vhost: fix deadlock on port deletion If the vhost-user application (e.g. OVS) deletes the vhost-user port while Qemu sends a vhost-user request, a deadlock can happen if the request handler tries to acquire vhost-user's global mutex, which is also locked by the vhost-user port deletion API (rte_vhost_driver_unregister). This patch prevents the deadlock by making rte_vhost_driver_unregister() to release the mutex and try again if a request is being handled to give a chance to the request handler to complete. Fixes: `8b4b949144` ("vhost: fix dead lock on closing in server mode") Fixes: `5fbb3941da` ("vhost: introduce driver features related APIs") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Tiwei Bie <tiwei.bie@intel.com> Acked-by: Eelco Chaudron <echaudro@redhat.com>	2020-01-17 19:46:26 +01:00
Li Feng	109c38b2e9	vhost: support config change slave message This msg is used to notify qemu that should get the config of backend. For example, vhost-user-blk uses this msg to notify guest OS the capacity of backend has changed. The need_reply flag is not mandatory because it will block the sender thread and master process will send get_config message to fetch the configuration, this need an extra thread to process the vhost message. Signed-off-by: Li Feng <fengli@smartx.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-01-17 19:46:26 +01:00
Rory Sexton	65388f4c4c	ethdev: add L2TPv3 over IP header to flow API This patch adds the new flow item RTE_FLOW_ITEM_TYPE_L2TPV3OIP to flow API to match a L2TPv3 over IP header. This patch supports only L2TPv3 over IP header format which is different to L2TPv2/L2TPv3 over UDP. The difference in header formats between L2TPv3 over IP and L2TP over UDP require a separate implementation for each. Signed-off-by: Rory Sexton <rory.sexton@intel.com> Signed-off-by: Dariusz Jagus <dariuszx.jagus@intel.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-01-17 19:46:26 +01:00
Ricardo Roldan	ba1e69f121	ethdev: fix callback unregister with wildcard argument list The function was checking -1 against the callback data instead of the given cb_arg parameter. Fixes: `af75078fec` ("first public release") Cc: stable@dpdk.org Signed-off-by: Ricardo Roldan <rroldan@bequant.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-01-17 19:46:02 +01:00
Suanming Mou	8482ffe4b6	ethdev: add IPv4/IPv6 DSCP rewrite action For some overlay network, such as VXLAN, the DSCP field in the new outer IP header after VXLAN decapsulation may need to be updated accordingly. This commit introduce the DSCP modify action for IPv4 and IPv6. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-01-17 19:46:02 +01:00
Xiao Wang	bf4fd5ba3e	vhost: fix socket initial value By default, a vhost socket is created without attaching VDPA device, this patch fixes the initial value of vdpa_dev_id. Fixes: `b4953225ce` ("vhost: add APIs for datapath configuration") Cc: stable@dpdk.org Signed-off-by: Xiao Wang <xiao.w.wang@intel.com> Reviewed-by: Tiwei Bie <tiwei.bie@intel.com>	2020-01-17 19:46:02 +01:00
Adrian Moreno	74f45f872c	vhost: add dynamic logging system Currently there are a couple of limitations on the logging system: Most of the logs are compiled out and both datapath and controlpath logs share the same loglevel. This patch tries to help fix that situation by: - Splitting control plane and data plane logs - Making control plane logs dynamic while keeping data plane logs compiled out by default for log levels lower than the INFO. As a result, two macros are introduced: - VHOST_LOG_CONFIG(LEVEL, ...): Config path logging. Level can be dynamically controlled by "lib.vhost.config" - VHOST_LOG_DATA(LEVEL, ...): Data path logging. Level can be dynamically controlled by "lib.vhost.data". Every log macro with a level lower than RTE_LOG_DP_LEVEL (which defaults to RTE_LOG_INFO) will be compiled out. Signed-off-by: Adrian Moreno <amorenoz@redhat.com> Acked-by: Tiwei Bie <tiwei.bie@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-01-17 19:46:01 +01:00
Stephen Hemminger	ebdc7bb960	ethdev: fix flow API doxygen comment Missing asterisk so that comment is not seen by doxygen. Fixes: `9a2f44c762` ("ethdev: add flow tag") Cc: stable@dpdk.org Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-01-17 19:46:01 +01:00
Honnappa Nagarahalli	867b6d6180	eventdev: use custom element size rings Use custom element size ring APIs to replace event ring implementation. This avoids code duplication. Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Ola Liljedahl <ola.liljedahl@arm.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Mattias Rönnblom <mattias.ronnblom@ericsson.com>	2020-01-19 19:32:50 +01:00
Honnappa Nagarahalli	fbfe568103	hash: use 32-bit elements rings to save memory The freelist and external bucket indices are 32b. Using rings that use 32b element sizes will save memory. Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Ola Liljedahl <ola.liljedahl@arm.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2020-01-19 19:32:50 +01:00
Honnappa Nagarahalli	cc4b218790	ring: support configurable element size Current APIs assume ring elements to be pointers. However, in many use cases, the size can be different. Add new APIs to support configurable ring element sizes. Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Reviewed-by: Dharmik Thakkar <dharmik.thakkar@arm.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-01-19 19:32:48 +01:00
Balakrishna Bhamidipati	e98dc331e9	cryptodev: support ECPM Asymmetric crypto library is extended to add ECPM (Elliptic Curve Point Multiplication). The required xform type and op parameters are introduced. Signed-off-by: Anoob Joseph <anoobj@marvell.com> Signed-off-by: Balakrishna Bhamidipati <bbhamidipati@marvell.com> Signed-off-by: Sunila Sahu <ssahu@marvell.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-01-15 15:01:55 +01:00
Ayuj Verma	7bb4ea3246	cryptodev: support ECDSA Asymmetric crypto library is extended to add ECDSA. Elliptic curve xform and ECDSA op params are introduced. Signed-off-by: Anoob Joseph <anoobj@marvell.com> Signed-off-by: Ayuj Verma <ayverma@marvell.com> Signed-off-by: Sunila Sahu <ssahu@marvell.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-01-15 15:01:56 +01:00
Arek Kusztal	6c9f3b347e	cryptodev: add Chacha20-Poly1305 AEAD algorithm This patch adds Chacha20-Poly1305 AEAD algorithm to Cryptodev. Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com> Acked-by: Anoob Joseph <anoobj@marvell.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-01-15 13:34:02 +01:00
Gavin Hu	b13a21890a	ticketlock: use new API to reduce contention on aarch64 While using ticket lock, cores repeatedly poll the lock variable. This is replaced by rte_wait_until_equal API. Running ticketlock_autotest on ThunderX2, Ampere eMAG80, and Arm N1SDP[1], there were variances between runs, but no notable performance gain or degradation were seen with and without this patch. [1] https://community.arm.com/developer/tools-software/oss-platforms/w/\ docs/440/neoverse-n1-sdp Signed-off-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Tested-by: Phil Yang <phil.yang@arm.com> Tested-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com>	2020-01-17 12:02:21 +01:00
Gavin Hu	1be7855d77	eal: add wait until equal API The rte_wait_until_equal_xx APIs abstract the functionality of 'polling for a memory location to become equal to a given value'. Add the RTE_ARM_USE_WFE configuration entry for aarch64, disabled by default. When it is enabled, the above APIs will call WFE instruction to save CPU cycles and power. From a VM, when calling this API on aarch64, it may trap in and out to release vCPUs whereas cause high exit latency. Since kernel 4.18.20 an adaptive trapping mechanism is introduced to balance the latency and workload. Signed-off-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Steve Capper <steve.capper@arm.com> Reviewed-by: Ola Liljedahl <ola.liljedahl@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Reviewed-by: Phil Yang <phil.yang@arm.com> Acked-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: David Marchand <david.marchand@redhat.com>	2020-01-17 12:02:21 +01:00
Aaron Conole	2e088e6f94	service: don't walk out of bounds when checking services The service_valid call is used without properly bounds checking the input parameter. Almost all instances of the service_valid call are inside a for() loop that prevents excessive walks, but some of the public APIs don't bounds check and will pass invalid arguments. Prevent this by using SERVICE_GET_OR_ERR_RET where it makes sense, and adding a bounds check to one service_valid() use. Fixes: `8d39d3e237` ("service: fix race in service on app lcore function") Fixes: `e9139a32f6` ("service: add function to run on app lcore") Fixes: `e30dd31847` ("service: add mechanism for quiescing") Cc: stable@dpdk.org Signed-off-by: Aaron Conole <aconole@redhat.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2019-12-20 15:09:35 +01:00
David Marchand	30a0df64aa	test/common: fix log2 check We recently started to get random failures on the common_autotest ut with clang on Ubuntu 16.04.6. Example: https://travis-ci.com/DPDK/dpdk/jobs/263177424 Wrong rte_log2_u64(0) val 0, expected ffffffff Test Failed The ut passes 0 to log2() to get an expected value. Quoting log2 / log(3) manual: If x is zero, then a pole error occurs, and the functions return -HUGE_VAL, -HUGE_VALF, or -HUGE_VALL, respectively. rte_log2_uXX helpers handle 0 as a special value and return 0. Let's have dedicated tests for this case. Fixes: `05c4345ef5` ("test: add unit test for integer log2 function") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Aaron Conole <aconole@redhat.com>	2019-12-20 15:05:41 +01:00
Bruce Richardson	f26c2b39b2	build: fix soname info for 19.11 compatibility The soname for each stable ABI version should be just the ABI version major number without the minor number. Unfortunately both major and minor were used causing version 20.1 to be incompatible with 20.0. This patch fixes the issue by switching from 2-part to 3-part ABI version numbers so that we can keep 20.0 as soname and using the final digits to identify the 20.x releases which are ABI compatible. This requires changes to both make and meson builds to handle the three-digit version and shrink it to 2-digit for soname. The final fix needed in this patch is to adjust the library version number for the ethtool example library, which needs to be upped to 2-digits, as external libraries using the DPDK build system also use the logic in this file. Fixes: `cba806e07d` ("build: change ABI versioning to global") Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Tested-by: Ray Kinsella <mdr@ashroe.eu> Tested-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Kevin Laatz <kevin.laatz@intel.com> Tested-by: David Marchand <david.marchand@redhat.com>	2019-12-19 16:18:21 +01:00
David Marchand	aef1d07331	eal/linux: fix build error on RHEL 7.6 Previous fix gives hiccups to gcc on RHEL 7.6: == Build lib/librte_eal/linux/eal CC eal_interrupts.o ...lib/librte_eal/linux/eal/eal_interrupts.c: In function ‘eal_intr_thread_main’: ...lib/librte_eal/linux/eal/eal_interrupts.c:1048:9: error: missing initializer for field ‘events’ of ‘struct epoll_event’ [-Werror=missing-field-initializers] struct epoll_event ev = { }; ^ In file included from ...lib/librte_eal/linux/eal/eal_interrupts.c:15:0: /usr/include/sys/epoll.h:89:12: note: ‘events’ declared here uint32_t events; /* Epoll events */ ^ ...lib/librte_eal/linux/eal/eal_interrupts.c: At top level: cc1: error: unrecognized command line option "-Wno-address-of-packed-member" [-Werror] cc1: all warnings being treated as errors Fixes: `e0ab8020ac` ("eal/linux: fix uninitialized data valgrind warning") Cc: stable@dpdk.org Reported-by: Andrew Rybchenko <arybchenko@solarflare.com> Signed-off-by: David Marchand <david.marchand@redhat.com>	2019-12-04 14:22:02 +01:00
Stephen Hemminger	e0ab8020ac	eal/linux: fix uninitialized data valgrind warning Valgrind reports that eal interrupt thread is calling epoll_ctl with uninitialized data. This is a false positive, because the kernel is not going to care about the unused bits in the union but trivial to fix by initializing it. Fixes: `af75078fec` ("first public release") Cc: stable@dpdk.org Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: David Marchand <david.marchand@redhat.com>	2019-12-04 10:05:05 +01:00
Ferruh Yigit	de480bbf13	kni: fix build with Linux 4.9.x The 'get_user_pages_remote()' API is updated in kernel 4.10.0 [1], but the check added as > 4.9.0, this logic is broken for kernels 4.9.x, because they justify > 4.9.0 check but have the old API. Fixing the check as >= 4.10.0 [1] commit 5b56d49fc31d ("mm: add locked parameter to get_user_pages_remote()") Fixes: `d965af9e8a` ("kni: increase kernel version requirement for VA") Reported-by: Andrew Rybchenko <arybchenko@solarflare.com> Suggested-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2019-11-28 14:48:24 +01:00
Stephen Hemminger	797f07b1fe	eal/windows: remove tail queue license boilerplate The BSD license is already handled by SPDX tag. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2019-11-28 03:12:55 +01:00
Stephen Hemminger	0ab9d1cfd0	eal: remove reciprocal divide license boilerplate No need for extra language, covered by SPDX tag. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2019-11-28 03:12:55 +01:00
Stephen Hemminger	c6e606e2ab	eal: remove uuid license boilerplate License type is already clear from SPDX tag. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2019-11-28 03:12:55 +01:00
Xiaolong Ye	7751efa9bf	port: replace license text with SPDX tag Signed-off-by: Xiaolong Ye <xiaolong.ye@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Acked-by: Ethan Zhuang <zhuangwj@gmail.com>	2019-11-28 03:02:51 +01:00
Thomas Monjalon	20bbb9e045	ethdev: limit maximum number of queues A buffer overflow happens in testpmd with some drivers since the queue arrays are limited to RTE_MAX_QUEUES_PER_PORT. The advertised capabilities of mlx4, mlx5 and softnic for the number of queues were the maximum number: UINT16_MAX. They must be limited by the configured RTE_MAX_QUEUES_PER_PORT that applications expect to be respected. The limitation is applied at ethdev level (function rte_eth_dev_info_get), in order to force the configured limit for all drivers. Fixes: `14b53e27b3` ("ethdev: fix crash with multiprocess") Cc: stable@dpdk.org Reported-by: Raslan Darawsheh <rasland@mellanox.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2019-11-27 16:04:40 +01:00
Matan Azrad	64edb05e29	ethdev: fix item expansion for RSS flow When the last item in flow pattern includes "next protocol" field which is relevant for RSS flow expansion, a new item is added to the pattern according to the "next protocol" field. This field is called missed field. The missed field wrongly was not initialized what caused to some of the flow item fields to contain garbage values. As a result, the PMDs internal flow engine may crash. For example, the spec value may include garbage pointer and to cause crash. Initialize the missed field with zeroes. Fixes: `fc2dd8dd49` ("ethdev: fix expand RSS flows") Cc: stable@dpdk.org Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com>	2019-11-26 18:05:15 +01:00
Ali Alnubani	64a525aad9	eal: fix header file install with meson The header file 'rte_vfio.h' might be required by some external apps. This patch adds it to the list of common_headers so that it's installed by meson. Fixes: `610beca42e` ("build: remove library special cases") Cc: stable@dpdk.org Signed-off-by: Ali Alnubani <alialnu@mellanox.com> Reviewed-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2019-11-26 17:39:38 +01:00
Xueming Li	2808a12cc0	malloc: fix memory element size in case of padding This patch fixes wrong inner memory element size when joining two elements. Fixes: `af75078fec` ("first public release") Cc: stable@dpdk.org Signed-off-by: Xueming Li <xuemingl@mellanox.com> Reviewed-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-11-26 16:24:08 +01:00
Anatoly Burakov	bdc993fa3d	mem: clarify documentation of virt2iova behaviour It may not be immediately clear that rte_mem_virt2iova does not actually check the internal memseg table, and will instead either return VA (in IOVA as VA mode), or will fall back to kernel page table walk (in IOVA as PA mode). Add a note to API documentation indicating the above. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Reviewed-by: Olivier Matz <olivier.matz@6wind.com>	2019-11-26 00:30:51 +01:00
David Hunt	19802aaf7c	power: fix error log on guest message polling Should be passing errno rather than ret, which could be negative. Coverity issue: 350362 Fixes: `9dc843eb27` ("power: extend guest channel API for reading") Cc: stable@dpdk.org Signed-off-by: David Hunt <david.hunt@intel.com>	2019-11-26 00:29:24 +01:00
Stephen Hemminger	06710448c9	remove blank lines at end of file Remove trailing blank lines. They serve no purpose and are just editor leftovers. These can cause git to complain about whitespace errors during merges. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2019-11-26 00:12:08 +01:00
Shahaf Shuler	39a19ae03d	mbuf: extend mbuf pool private structure With the API and ABI freeze ahead, it will be good to reserve some bits on the private structure for future use. Otherwise we will potentially need to maintain two different private structure during 2020 period. There is already one use case for those reserved bits[1] The reserved field should be set to 0 by the user. [1] https://patches.dpdk.org/patch/63077/ Signed-off-by: Shahaf Shuler <shahafs@mellanox.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2019-11-25 22:44:46 +01:00
Mattias Rönnblom	388c4c03ec	power: handle frequency increase with turbo disabled Calling pstate's or acpi's rte_power_freq_up() when on the highest non-turbo frequency results in an error, if turbo is enabled in the BIOS, but disabled via the power library. The error is in the form of a return code and a RTE_LOG() entry on the ERR level. According to the API documentation, the frequency is scaled up "according to the available frequencies". In case turbo is disabled, that frequency is not available. This patch's rte_power_freq_up() behaviour is also consistent with how rte_power_freq_max() is implemented (i.e. the highest non-turbo frequency is set, in case turbo is disabled). Fixes: `445c6528b5` ("power: common interface for guest and host") Fixes: `e6c6dc0f96` ("power: add p-state driver compatibility") Cc: stable@dpdk.org Signed-off-by: Mattias Rönnblom <mattias.ronnblom@ericsson.com> Tested-by: David Hunt <david.hunt@intel.com> Acked-by: David Hunt <david.hunt@intel.com> Reviewed-by: Liang Ma <liang.j.ma@intel.com>	2019-11-21 00:52:31 +01:00
Ferruh Yigit	d965af9e8a	kni: increase kernel version requirement for VA A build error reported related to the selected 'get_user_pages_remote()' kernel API: .../kernel/linux/kni/kni_dev.h:113:8: error: too few arguments to function ‘get_user_pages_remote’ ret = get_user_pages_remote(tsk, tsk->mm, iova, 1 ^~~~~~~~~~~~~~~~~~~~~ Currently there are three versions of the 'get_user_pages_remote()' supported, based on kernel version < 4.9, = 4.9, > 4.9. These version based checks are not working fine with the distro kernels which is the cause of reported build error. The error reported by the kernel version 4.8, but it is using API defined in > 4.9. To be able to take control of this, and possible more, related build error, increasing the minimum supported kernel version for iova=va with KNI to kernel version 4.9. This leaves us with single version of the kernel API and more manageable. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2019-11-21 00:18:02 +01:00
Ruifeng Wang	bb2a9973c0	bpf/arm: fix clang build Clang has different prototype for __builtin___clear_cache(). It requires 'char ' parameters while gcc requires 'void '. Clang version 8.0 was used. Warning messages during build: ../lib/librte_bpf/bpf_jit_arm64.c:1438:26: warning: incompatible pointer types passing 'uint32_t ' (aka 'unsigned int ') to parameter of type 'char ' [-Wincompatible-pointer-types] __builtin___clear_cache(ctx.ins, ctx.ins + ctx.idx); ^~~~~~~ ../lib/librte_bpf/bpf_jit_arm64.c:1438:35: warning: incompatible pointer types passing 'uint32_t ' (aka 'unsigned int ') to parameter of type 'char ' [-Wincompatible-pointer-types] __builtin___clear_cache(ctx.ins, ctx.ins + ctx.idx); ^~~~~~~~~~~~~~~~~ Fixes: `f3e5167724` ("bpf/arm: add prologue and epilogue") Cc: jerinj@marvell.com Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Phil Yang <phil.yang@arm.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2019-11-21 00:30:39 +01:00
Pawel Modrak	85ff364f3b	build: align symbols with global ABI version Merge all versions in linker version script files to DPDK_20.0. This commit was generated by running the following command: :~/DPDK$ buildtools/update-abi.sh 20.0 Signed-off-by: Pawel Modrak <pawelx.modrak@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2019-11-20 23:05:39 +01:00
Marcin Baran	4ab92c53ed	distributor: rename v2.0 ABI to _single suffix The original ABI versioning was slightly misleading in that the DPDK 2.0 ABI was really a single mode for the distributor, and is used as such throughout the distributor code. Fix this by renaming all _v20 API's to _single API's, and remove symbol versioning. Signed-off-by: Marcin Baran <marcinx.baran@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: David Hunt <david.hunt@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2019-11-20 23:05:39 +01:00
Marcin Baran	6e5b516761	distributor: remove deprecated code Remove code for old ABI versions ahead of ABI version bump. Signed-off-by: Marcin Baran <marcinx.baran@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: David Hunt <david.hunt@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2019-11-20 23:05:39 +01:00

1 2 3 4 5 ...

5918 Commits