numam-dpdk

Author	SHA1	Message	Date
Dmitry Kozlyuk	2edd037c09	mem: add dirty malloc element support EAL malloc layer assumed all free elements content is filled with zeros ("clean"), as opposed to uninitialized ("dirty"). This assumption was ensured in two ways: 1. EAL memalloc layer always returned clean memory. 2. Freed memory was cleared before returning into the heap. Clearing the memory can be as slow as around 14 GiB/s. To save doing so, memalloc layer is allowed to return dirty memory. Such segments being marked with RTE_MEMSEG_FLAG_DIRTY. The allocator tracks elements that contain dirty memory using the new flag in the element header. When clean memory is requested via rte_zmalloc*() and the suitable element is dirty, it is cleared on allocation. When memory is deallocated, the freed element is joined with adjacent free elements, and the dirty flag is updated: a) If the joint element contains dirty parts, it is dirty: dirty + freed + dirty = dirty => no need to clean freed + dirty = dirty the freed memory Dirty parts may be large (e.g. initial allocation), so clearing them could create unpredictable slowdown. b) If the only dirty part of the joint element is the freed memory, the joint element can be made clean: clean + freed + clean = clean => freed memory clean + freed = clean must be cleared freed + clean = clean freed = clean This logic naturally reproduces the old behavior and always applies in modes when EAL memalloc layer returns only clean segments. As a result, memory is either cleared on free, as before, or it will be cleared on allocation if need be, but never twice. Signed-off-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Reviewed-by: Anatoly Burakov <anatoly.burakov@intel.com>	2022-02-08 21:32:53 +01:00
Dmitry Kozlyuk	c62b318a9f	app/test: add allocator performance benchmark Memory allocator performance is crucial to applications that deal with large amount of memory or allocate frequently. DPDK allocator performance is affected by EAL options, API used and, at least, allocation size. New autotest is intended to be run with different EAL options. It measures performance with a range of sizes for dirrerent APIs: rte_malloc, rte_zmalloc, and rte_memzone_reserve. Work distribution between allocation and deallocation depends on EAL options. The test prints both times and total time to ease comparison. Memory can be filled with zeroes at different points of allocation path, but it always takes considerable fraction of overall timing. This is why the test measures filling speed and prints how long clearing takes for each size as a reference (for rte_memzone_reserve estimations are printed). Signed-off-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Aaron Conole <aconole@redhat.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2022-02-08 21:32:53 +01:00
Dmitry Kozlyuk	1ba4f6735b	doc: add hugepage mapping details Hugepage mapping is a layer of EAL malloc builds upon. There were implicit references to its details, like mentions of segment file descriptors, but no explicit description of its modes and operation. Add an overview of mechanics used on ech supported OS. Convert memory management subsections from list items to level 4 headers: they are big and important enough. Signed-off-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2022-02-08 21:04:42 +01:00
Weiguo Li	7750099d2c	eventdev: remove useless C++ include guard This private header contains an incomplete cplusplus guard, just remove it. Fixes: `d35e61322d` ("eventdev: move inline APIs into separate structure") Signed-off-by: Weiguo Li <liwg06@foxmail.com>	2022-02-08 17:15:47 +01:00
Weiguo Li	0ae7844fcd	eal/windows: remove useless C++ include guard Remove the incomplete cplusplus guard in internal header. Fixes: `6e1ed4cbbe` ("eal/windows: add dirent implementation") Signed-off-by: Weiguo Li <liwg06@foxmail.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Pallavi Kadam <pallavi.kadam@intel.com>	2022-02-08 17:13:50 +01:00
Weiguo Li	9df62f8574	net/dpaa2: remove useless C++ include guard Remove the incomplete cplusplus guard in internal headers. Fixes: `72ec7a678e` ("net/dpaa2: add soft parser driver") Signed-off-by: Weiguo Li <liwg06@foxmail.com>	2022-02-08 17:13:24 +01:00
Weiguo Li	073ab38ff7	net/cxgbe: remove useless C++ include guard Remove the incomplete cplusplus guard in internal header. Fixes: `3bd122eef2` ("cxgbe/base: add hardware API for Chelsio T5 series adapters") Signed-off-by: Weiguo Li <liwg06@foxmail.com>	2022-02-08 17:12:54 +01:00
Weiguo Li	3cacda8ff0	common/mlx5: remove useless C++ include guard Remove the incomplete cplusplus guard in internal headers. Fixes: `7525ebd8eb` ("common/mlx5: add glue functions on Windows") Signed-off-by: Weiguo Li <liwg06@foxmail.com>	2022-02-08 17:10:54 +01:00
Weiguo Li	1785d9eb62	bus/dpaa: fix C++ include guard Supplement the missing half of braces for the extern "C" block, or remove the incomplete guard in internal header. Fixes: `6d6b4f49a1` ("bus/dpaa: add FMAN hardware operations") Fixes: `919eeaccb2` ("bus/dpaa: introduce NXP DPAA bus driver skeleton") Signed-off-by: Weiguo Li <liwg06@foxmail.com>	2022-02-08 16:59:23 +01:00
Jie Zhou	0edc1f408a	test: enable subset of tests on Windows Enable a subset of unit tests for Windows CI - For driver tests, driver owners should enable corresponding tests when enabling driver for Windows. - For dump tests, currently the tests hang on Windows which require further investigation. - For telemetry tests, it has POSIX socket specific codes which require replacement for Windows. Will investigate and work on a separate patch. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	9713eae1d8	test: replace shell script with Python - Add python script to check if system supports hugepages - Remove corresponding .sh script - Replace calling of .sh with corresponding .py in meson.build Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	3c60274c09	test: skip unsupported tests on Windows Skip tests which are not yet supported for Windows: - The libraries that tests depend on are not enabled on Windows yet - The tests can compile but with issue still under investigation * test_func_reentrancy: Windows EAL has no protection against repeated calls. * test_lcores: Execution enters an infinite loops, requires investigation. * test_rcu_qsbr_perf: Execution hangs on Windows, requires investigation. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Tyler Retzlaff <roretzla@linux.microsoft.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	f684672947	test: resolve name collision on Windows Add prefix to resolve name collision on Windows. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	a089d32033	test/alarm: disable bad time cases on Windows Remove two alarm_autotest test cases which do bogus range check on Windows. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	e14f1744d6	eal: differentiate strerror message on Windows On Windows, strerror returns just "Unknown error" for errnum greater than MAX_ERRNO, while linux and freebsd returns "Unknown error <num>", which is the current expectation for errno_autotest. Differentiate the error string on Windows to remove a "duplicate error code" failure. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	1570ab1129	test/log: skip regex on Windows DPDK logs_autotest on Windows failed at "dynamic log types" tests. The failures are on 2 test cases for rte_log_set_level_regexp API, due to regular expression is not supported on Windows in DPDK yet and regcomp/regexec are just stubs on Windows (in regex.h). In app/test/test_logs.c, ifndef these two test cases, and for the rte_log_set_level_pattern validation case following these two cases, differentiate the expected log level passed into macro CHECK_LEVELS Now logs_autotest completes for all dynamic log types and static log types. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	d18eb79740	test/interrupts: skip on Windows Even though test_interrupts.c can compile on Windows, skip interrupt tests for now since majority of eal_interrupt on Windows are stubs. Will remove the skip after interrupt being fully enabled on Windows. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	068cdfae1f	test/mem: fix error check Fix incorrect errno variable used in memory autotest. Use rte_errno instead. Fixes: `086d426406` ("test/mem: fix memory autotests on FreeBSD") Cc: stable@dpdk.org Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	987d40a057	test: remove POSIX-specific code - Replace POSIX-specific code with DPDK equivalents or conditionally disable it on Windows - Use NUL on Windows as /dev/null for Unix - Exclude tests not supported on Windows yet * multi-process * PMD performance statistics display on signal Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Tyler Retzlaff <roretzla@linux.microsoft.com>	2022-02-08 14:19:40 +01:00
Jie Zhou	7e71c4dce3	eal/windows: fix error code for not supported API UT memory_autotest on Windows has 2 failed cases on EAL APIs eal_memalloc_get_seg_fd and eal_memalloc_get_seg_fd_offset. These 2 APIs are not supported on Windows yet. Should return ENOTSUP such that in test_memory.c these 2 ENOTSUP cases will not be marked as failures, same as other ENOTSUP cases. Fixes: `2a5d547a4a` ("eal/windows: implement basic memory management") Cc: stable@dpdk.org Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2022-02-08 14:19:40 +01:00
Zhihong Wang	0e4dc6af06	ring: fix overflow in memory size calculation Parameters count and esize are both unsigned int, and their product can legaly exceed unsigned int and lead to runtime access violation. Fixes: `cc4b218790` ("ring: support configurable element size") Cc: stable@dpdk.org Signed-off-by: Zhihong Wang <wangzhihong.wzh@bytedance.com> Reviewed-by: Liang Ma <liangma@liangbit.com> Reviewed-by: Morten Brørup <mb@smartsharesystems.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2022-02-05 18:15:33 +01:00
Robert Sanford	436e82b78c	ring: update ring size doxygen comments - Add RING_F_EXACT_SZ description to rte_ring_init and rte_ring_create param comments. - Fix ring size comments. Signed-off-by: Robert Sanford <rsanford@akamai.com> Acked-by: Morten Brørup <mb@smartsharesystems.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2022-02-05 18:03:07 +01:00
Yunjian Wang	074717be3e	ring: fix error code when creating ring The error value returned by rte_ring_create_elem() should be positive integers. However, if the rte_ring_get_memsize_elem() function fails, a negative number is returned and is directly used as the return value. As a result, this will cause the external call to check the return value to fail(like called by rte_mempool_create()). Fixes: `a182620042` ("ring: get size in memory") Cc: stable@dpdk.org Reported-by: Nan Zhou <zhounan14@huawei.com> Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2022-02-05 17:53:27 +01:00
Andrzej Ostruszka	97ed4cb6fb	ring: optimize corner case for enqueue/dequeue When enqueueing/dequeueing to/from the ring we try to optimize by manual loop unrolling. The check for this optimization looks like: if (likely(idx + n < size)) { where 'idx' points to the first usable element (empty slot for enqueue, data for dequeue). The correct comparison here should be '<=' instead of '<'. This is not a functional error since we fall back to the loop with correct checks on indexes. Just a minor suboptimal behaviour for the case when we want to enqueue/dequeue exactly the number of elements that we have in the ring before wrapping to its beginning. Fixes: `cc4b218790` ("ring: support configurable element size") Fixes: `286bd05bf7` ("ring: optimisations") Signed-off-by: Andrzej Ostruszka <amo@semihalf.com> Reviewed-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Reviewed-by: Morten Brørup <mb@smartsharesystems.com>	2022-02-04 15:19:06 +01:00
Pallavi Kadam	416c1bef9d	eal/windows: set worker thread affinity at init Sometimes OS tries to switch the core. So, bind the lcore thread to a fixed core. Implement affinity call on Windows similar to Linux. Signed-off-by: Qiao Liu <qiao.liu@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Acked-by: Narcisa Vasile <navasile@linux.microsoft.com> Acked-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Tal Shnaiderman <talshn@nvidia.com> Tested-by: Idan Hackmon <idanhac@nvidia.com>	2022-02-02 23:44:05 +01:00
Morten Brørup	ed579e50c6	mempool: test performance with constant n "What gets measured gets done." This patch adds mempool performance tests where the number of objects to put and get is constant at compile time, which may significantly improve the performance of these functions. [] Also, it is ensured that the array holding the object used for testing is cache line aligned, for maximum performance. And finally, the following entries are added to the list of tests: - Number of kept objects: 512 - Number of objects to get and to put: The number of pointers fitting into a cache line, i.e. 8 or 16 [] Some example performance test (with cache) results: get_bulk=4 put_bulk=4 keep=128 constant_n=false rate_persec=280480972 get_bulk=4 put_bulk=4 keep=128 constant_n=true rate_persec=622159462 get_bulk=8 put_bulk=8 keep=128 constant_n=false rate_persec=477967155 get_bulk=8 put_bulk=8 keep=128 constant_n=true rate_persec=917582643 get_bulk=32 put_bulk=32 keep=32 constant_n=false rate_persec=871248691 get_bulk=32 put_bulk=32 keep=32 constant_n=true rate_persec=1134021836 Signed-off-by: Morten Brørup <mb@smartsharesystems.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2022-02-02 22:06:14 +01:00
Haiyue Wang	08c724b327	doc: fix KNI PMD name typo The KNI PMD name should be "net_kni". Fixes: `75e2bc54c0` ("net/kni: add KNI PMD") Cc: stable@dpdk.org Signed-off-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2022-02-02 21:54:21 +01:00
Markus Theil	f1b2991c3c	kni: fix ioctl signature Fix kni's ioctl signature to correctly match the kernel's structs. This shaves off the (void) casts and uses struct file instead of struct inode*. With the correct signature, control flow integrity checkers are no longer confused at this point. Signed-off-by: Markus Theil <markus.theil@secunet.com> Tested-by: Michael Pfeiffer <michael.pfeiffer@tu-ilmenau.de> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2022-02-02 20:55:05 +01:00
Tudor Cornea	5569dd7d90	kni: allow configuring thread granularity The Kni kthreads seem to be re-scheduled at a granularity of roughly 1 millisecond right now, which seems to be insufficient for performing tests involving a lot of control plane traffic. Even if KNI_KTHREAD_RESCHEDULE_INTERVAL is set to 5 microseconds, it seems that the existing code cannot reschedule at the desired granularily, due to precision constraints of schedule_timeout_interruptible(). In our use case, we leverage the Linux Kernel for control plane, and it is not uncommon to have 60K - 100K pps for some signaling protocols. Since we are not in atomic context, the usleep_range() function seems to be more appropriate for being able to introduce smaller controlled delays, in the range of 5-10 microseconds. Upon reading the existing code, it would seem that this was the original intent. Adding sub-millisecond delays, seems unfeasible with a call to schedule_timeout_interruptible(). KNI_KTHREAD_RESCHEDULE_INTERVAL 5 /* us */ schedule_timeout_interruptible( usecs_to_jiffies(KNI_KTHREAD_RESCHEDULE_INTERVAL)); Below, we attempted a brief comparison between the existing implementation, which uses schedule_timeout_interruptible() and usleep_range(). We attempt to measure the CPU usage, and RTT between two Kni interfaces, which are created on top of vmxnet3 adapters, connected by a vSwitch. insmod rte_kni.ko kthread_mode=single carrier=on schedule_timeout_interruptible(usecs_to_jiffies(5)) kni_single CPU Usage: 2-4 % [root@localhost ~]# ping 1.1.1.2 -I eth1 PING 1.1.1.2 (1.1.1.2) from 1.1.1.1 eth1: 56(84) bytes of data. 64 bytes from 1.1.1.2: icmp_seq=1 ttl=64 time=2.70 ms 64 bytes from 1.1.1.2: icmp_seq=2 ttl=64 time=1.00 ms 64 bytes from 1.1.1.2: icmp_seq=3 ttl=64 time=1.99 ms 64 bytes from 1.1.1.2: icmp_seq=4 ttl=64 time=0.985 ms 64 bytes from 1.1.1.2: icmp_seq=5 ttl=64 time=1.00 ms usleep_range(5, 10) kni_single CPU usage: 50% 64 bytes from 1.1.1.2: icmp_seq=1 ttl=64 time=0.338 ms 64 bytes from 1.1.1.2: icmp_seq=2 ttl=64 time=0.150 ms 64 bytes from 1.1.1.2: icmp_seq=3 ttl=64 time=0.123 ms 64 bytes from 1.1.1.2: icmp_seq=4 ttl=64 time=0.139 ms 64 bytes from 1.1.1.2: icmp_seq=5 ttl=64 time=0.159 ms usleep_range(20, 50) kni_single CPU usage: 24% 64 bytes from 1.1.1.2: icmp_seq=1 ttl=64 time=0.202 ms 64 bytes from 1.1.1.2: icmp_seq=2 ttl=64 time=0.170 ms 64 bytes from 1.1.1.2: icmp_seq=3 ttl=64 time=0.171 ms 64 bytes from 1.1.1.2: icmp_seq=4 ttl=64 time=0.248 ms 64 bytes from 1.1.1.2: icmp_seq=5 ttl=64 time=0.185 ms usleep_range(50, 100) kni_single CPU usage: 13% 64 bytes from 1.1.1.2: icmp_seq=1 ttl=64 time=0.537 ms 64 bytes from 1.1.1.2: icmp_seq=2 ttl=64 time=0.257 ms 64 bytes from 1.1.1.2: icmp_seq=3 ttl=64 time=0.231 ms 64 bytes from 1.1.1.2: icmp_seq=4 ttl=64 time=0.143 ms 64 bytes from 1.1.1.2: icmp_seq=5 ttl=64 time=0.200 ms usleep_range(100, 200) kni_single CPU usage: 7% 64 bytes from 1.1.1.2: icmp_seq=1 ttl=64 time=0.716 ms 64 bytes from 1.1.1.2: icmp_seq=2 ttl=64 time=0.167 ms 64 bytes from 1.1.1.2: icmp_seq=3 ttl=64 time=0.459 ms 64 bytes from 1.1.1.2: icmp_seq=4 ttl=64 time=0.455 ms 64 bytes from 1.1.1.2: icmp_seq=5 ttl=64 time=0.252 ms usleep_range(1000, 1100) kni_single CPU usage: 2% 64 bytes from 1.1.1.2: icmp_seq=1 ttl=64 time=2.22 ms 64 bytes from 1.1.1.2: icmp_seq=2 ttl=64 time=1.17 ms 64 bytes from 1.1.1.2: icmp_seq=3 ttl=64 time=1.17 ms 64 bytes from 1.1.1.2: icmp_seq=4 ttl=64 time=1.17 ms 64 bytes from 1.1.1.2: icmp_seq=5 ttl=64 time=1.15 ms Upon testing, usleep_range(1000, 1100) seems roughly equivalent in latency and cpu usage to the variant with schedule_timeout_interruptible(), while usleep_range(100, 200) seems to give a decent tradeoff between latency and cpu usage, while allowing users to tweak the limits for improved precision if they have such use cases. Disabling RTE_KNI_PREEMPT_DEFAULT, interestingly seems to lead to a softlockup on my kernel. Kernel panic - not syncing: softlockup: hung tasks CPU: 0 PID: 1226 Comm: kni_single Tainted: G W O 3.10 #1 <IRQ> [<ffffffff814f84de>] dump_stack+0x19/0x1b [<ffffffff814f7891>] panic+0xcd/0x1e0 [<ffffffff810993b0>] watchdog_timer_fn+0x160/0x160 [<ffffffff810644b2>] __run_hrtimer.isra.4+0x42/0xd0 [<ffffffff81064b57>] hrtimer_interrupt+0xe7/0x1f0 [<ffffffff8102cd57>] smp_apic_timer_interrupt+0x67/0xa0 [<ffffffff8150321d>] apic_timer_interrupt+0x6d/0x80 This patch also attempts to remove this option. References: [1] https://www.kernel.org/doc/Documentation/timers/timers-howto.txt Signed-off-by: Tudor Cornea <tudor.cornea@gmail.com> Acked-by: Padraig Connolly <Padraig.J.Connolly@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2022-02-02 20:45:18 +01:00
Bruce Richardson	e16b972b1a	build: remove deprecated Meson functions Starting in meson 0.56, the functions meson.source_root() and meson.build_root() are deprecated and to be replaced by the [more descriptive] functions: project_source_root()/global_source_root() and project_build_root()/global_build_root(). Unfortunately, these new replacement functions were only added in 0.56 release too, so to use them we would need version checks for old/new functions to remove the deprecation warnings. However, the functions "current_build_dir()" and "current_source_dir()" remain unaffected by all this, so we can bypass the versioning problem, by saving off these values to "dpdk_source_root" and "dpdk_build_root" in the top-level meson.build file Bugzilla ID: 926 Cc: stable@dpdk.org Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Tested-by: Jerin Jacob <jerinj@marvell.com>	2022-02-02 18:46:53 +01:00
Bruce Richardson	d832326ae9	build: fix warning about using -Wextra flag Each build, meson would issue a warning reporting that the "warning_level" setting should be used in place of adding -Wextra directly to our build commands. Testing with meson 0.61 shows that the only difference for gcc and clang builds between warning levels 1 and 2 is the addition of -Wextra, so we can remove the warning by deleting our explicit set of Wextra and changing the build defaults to warning_level 2. Fixes: `524a0d5d66` ("build: enable extra warnings with meson") Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Luca Boccassi <bluca@debian.org>	2022-02-02 15:56:45 +01:00
Bruce Richardson	ecb904cc45	build: fix warnings when running external commands Meson 0.61.1 is giving warnings that the calls to run_command do not always explicitly specify if the result is to be checked or not, i.e. there is a missing "check" parameter. This is because the default behaviour without the parameter is due to change in the future. We can fix these warnings by explicitly adding into each call whether the result should be checked by meson or not. This patch therefore adds in "check: false" to each run_command call where the result is being checked by the DPDK meson.build code afterwards, and adds in "check: true" to any calls where the result is currently unchecked. Bugzilla ID: 921 Cc: stable@dpdk.org Reported-by: Jerin Jacob <jerinj@marvell.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Tested-by: Jerin Jacob <jerinj@marvell.com>	2022-02-02 15:44:12 +01:00
Martijn Bakker	44f44d8298	pflock: fix header file installation The generic header file was missing in the list of files to install. Fixes: `9667d97c25` ("pflock: add phase-fair reader writer locks") Cc: stable@dpdk.org Signed-off-by: Martijn Bakker <gladdyu@gmail.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2022-02-02 14:34:11 +01:00
Qi Zhang	affa9de474	doc: update matching versions in ice guide Add recommended matching list for ice PMD in DPDK 21.08 and DPDK 21.11. Cc: stable@dpdk.org Signed-off-by: Qi Zhang <qi.z.zhang@intel.com> Acked-by: Junfeng Guo <junfeng.guo@intel.com>	2022-01-28 09:55:25 +01:00
Feifei Wang	bf44442612	net/i40e: remove redundant reset operation For free buffer operation in i40e vector path, it is unnecessary to store 'NULL' into txep.mbuf. This is because when putting mbuf into Tx queue, tx_tail is the sentinel. And when doing tx_free, tx_next_dd is the sentinel. In all processes, mbuf==NULL is not a condition in check. Thus reset of mbuf is unnecessary and can be omitted. Signed-off-by: Feifei Wang <feifei.wang2@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2022-01-28 07:56:20 +01:00
Xiaoyu Min	87b26522f7	net/mlx5: reject jump to root table Currently root table as destination is not supported. The jump action which finally be translated to underlying root table in rdma-core should be rejected. Fixes: `f78f747f41` ("net/mlx5: allow jump to group lower than current") Cc: stable@dpdk.org Signed-off-by: Xiaoyu Min <jackmin@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-01-26 17:41:12 +01:00
Bing Zhao	b4a4159d4e	common/mlx5: fix probing failure code While probing the device with unsupported class, the process should fail because no appropriate driver was found. After traversing all the drivers, an error value should be returned for the case. In the previous implementation, zero value indicating probing success was wrongly returned. Fixes: `ad435d3204` ("common/mlx5: add bus-agnostic layer") Cc: stable@dpdk.org Signed-off-by: Bing Zhao <bingz@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-01-26 17:41:12 +01:00
Raja Zidane	082becbf1f	net/mlx5: fix mark enabling for Rx To optimize datapath, the mlx5 pmd checked for mark action on flow creation, and flagged possible destination rxqs (through queue/RSS actions), then it enabled the mark action logic only for flagged rxqs. Mark action didn't work if no queue/rss action was in the same flow, even when the user use multi-group logic to manage the flows. So, if mark action is performed in group X and the packet is moved to group Y > X when the packet is forwarded to Rx queues, SW did not get the mark ID to the mbuf. Flag Rx datapath to report mark action for any queue when the driver detects the first mark action after dev_start operation. Fixes: `8e61555657` ("net/mlx5: fix shared RSS and mark actions combination") Cc: stable@dpdk.org Signed-off-by: Raja Zidane <rzidane@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2022-01-26 17:41:11 +01:00
Dmitry Kozlyuk	2eb92b0fbb	common/mlx5: fix MR lookup for non-contiguous mempool Memory region (MR) lookup by address inside mempool MRs was not accounting for the upper bound of an MR. For mempools covered by multiple MRs this could return a wrong MR LKey, typically resulting in an unrecoverable TxQ failure: mlx5_net: Cannot change Tx QP state to INIT Invalid argument Corresponding message from /var/log/dpdk_mlx5_port_X_txq_Y_index_Z*: Unexpected CQE error syndrome 0x04 CQN = 128 SQN = 4848 wqe_counter = 0 wq_ci = 9 cq_ci = 122 This is likely to happen with --legacy-mem and IOVA-as-PA, because EAL intentionally maps pages at non-adjacent PA to non-adjacent VA in this mode, and MLX5 PMD works with VA. Fixes: `690b2a88c2` ("common/mlx5: add mempool registration facilities") Cc: stable@dpdk.org Reported-by: Wang Yunjian <wangyunjian@huawei.com> Signed-off-by: Dmitry Kozlyuk <dkozlyuk@nvidia.com> Reviewed-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2022-01-26 17:41:10 +01:00
Maxime Coquelin	afd857cba9	vhost: use proper logging type for data path This patch changes type from config to data for functions called in the datapath. Suggested-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:46:02 +01:00
Maxime Coquelin	d84723029c	vhost: differentiate IOTLB logs Same logging messages were used for both IOTLB cache insertion failure and IOTLB pending insertion failure. This patch differentiate them to ease logs analysis. Suggested-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:45:55 +01:00
Maxime Coquelin	5b03016509	vhost: remove multi-line logs This patch replaces multi-lines logs in multiple single- line logs in order to ease logs filtering based on their socket path. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:45:49 +01:00
Maxime Coquelin	02798b0735	vhost: improve virtio-net layer logs This patch standardizes logging done in Virtio-net, so that the Vhost-user socket path is always prepended to the logs. It will ease log analysis when multiple Vhost-user ports are in use. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:45:43 +01:00
Maxime Coquelin	c85c35b1d4	vhost: improve socket layer logs This patch adds the Vhost socket path whenever possible in order to make debugging possible when multiple Vhost devices are in use. Some vhost-user layer functions are modified to pass the device path down to the socket layer. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:45:36 +01:00
Maxime Coquelin	b53c9d20fa	vhost: improve vhost-user layer logs This patch adds the Vhost-user socket path to Vhost-user layer logs in order to ease logs filtering. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:45:29 +01:00
Maxime Coquelin	ccf1ebd611	vhost: improve vhost layer logs This patch prepends Vhost logs with the Vhost-user socket path when available to ease filtering logs for a given port. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:45:11 +01:00
Maxime Coquelin	843d8e3fde	vhost: improve vDPA registration failure log This patch adds name of the device failing vDPA registration. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:45:01 +01:00
Maxime Coquelin	2d1c05c9ac	vhost: improve IOTLB logs This patch adds IOTLB mempool name when logging debug or error messages, and also prepends the socket path. to all the logs. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2022-01-27 09:44:31 +01:00
Andy Pei	8974b6c730	vhost: add log when setting vring base This patch adds log for vring related info in handling of vhost message VHOST_USER_SET_VRING_BASE, which will be useful in live migration case. Signed-off-by: Andy Pei <andy.pei@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2022-01-27 09:39:08 +01:00
Yunjian Wang	0f7438e6d4	net/virtio: fix uninitialized RSS key This patch fixes an issue that uninitialized old_rss_key is used for restoring the rss_key. Coverity issue: 373866 Fixes: `0c9d662070` ("net/virtio: support RSS") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2022-01-27 06:11:47 +01:00

1 2 3 4 5 ...

31411 Commits