numam-dpdk

Author	SHA1	Message	Date
Nicolas Chautru	99c886d3e3	app/bbdev: fix test vector symlink 5G DL default symlink was pointing to a 4G vector. Fixes: d762705308c4 ("app/bbdev: add test vectors for 5GNR") Cc: stable@dpdk.org Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Aidan Goddard <aidan.goddard@accelercomm.com> Acked-by: Dave Burley <dave.burley@accelercomm.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com>	2020-10-14 21:32:11 +02:00
Adam Dybkowski	85b00824ae	crypto/scheduler: rename slave to worker This patch replaces the usage of the word 'slave' with more appropriate word 'worker' in QAT PMD and Scheduler PMD as well as in their docs. Also the test app was modified to use the new wording. The Scheduler PMD's public API was modified according to the previous deprecation notice: rte_cryptodev_scheduler_slave_attach is now called rte_cryptodev_scheduler_worker_attach, rte_cryptodev_scheduler_slave_detach is rte_cryptodev_scheduler_worker_detach, rte_cryptodev_scheduler_slaves_get is rte_cryptodev_scheduler_workers_get. Also, the configuration value RTE_CRYPTODEV_SCHEDULER_MAX_NB_SLAVES was renamed to RTE_CRYPTODEV_SCHEDULER_MAX_NB_WORKERS. Signed-off-by: Adam Dybkowski <adamx.dybkowski@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-10-14 21:31:46 +02:00
Yunjian Wang	8839c8a1f7	crypto/dpaa_sec: fix a null pointer dereference This patch fixes a null pointer dereference after null check detected by coverity scan. Coverity issue: 349904 Fixes: 6a0c9d364afc ("crypto/dpaax_sec: support HFN override") Cc: stable@dpdk.org Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-10-14 21:30:10 +02:00
Nagadheeraj Rottela	ce96577b4d	test/crypto: replace NITROX PMD specific test suite Replace NITROX PMD specific tests with generic test suite. Signed-off-by: Nagadheeraj Rottela <rnagadheeraj@marvell.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-10-14 21:30:10 +02:00
Nicolas Chautru	a5bdedf999	doc: remove orphan bbdev PMD feature table Removing a feature table referring erroneously to a PMD not present in DPDK. Fixes: 65f1eec ("doc: add feature matrix table for bbdev") Cc: stable@dpdk.org Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-10-14 21:30:10 +02:00
Maxime Coquelin	e6925585d9	baseband/fpga_lte_fec: fix API naming DPDK APIs have to be prefixed with "rte_" in order to avoid namespace pollution. Let's fix it while fpga_lte_fec API is still experimental. Fixes: efd453698c49 ("baseband/fpga_lte_fec: add driver for FEC on FPGA") Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Tom Rix <trix@redhat.com>	2020-10-14 21:29:59 +02:00
Maxime Coquelin	7adbb468fb	baseband/fpga_5gnr_fec: fix API naming DPDK APIs have to be prefixed with "rte_" in order to avoid namespace pollution. Let's fix it while fpga_5gnr_fec API is still experimental. Fixes: 2d4306438c92 ("baseband/fpga_5gnr_fec: add configure function") Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Tom Rix <trix@redhat.com>	2020-10-14 21:28:31 +02:00
Conor Walsh	a748d24d79	ipsec: promote library as stable Since librte_ipsec was first introduced in 19.02 and there were no changes in it's public API since 19.11, it should be considered mature enough to remove the 'experimental' tag from it. The RTE_SATP_LOG2_NUM enum is also being dropped from rte_ipsec_sa.h to avoid possible ABI problems in the future. Signed-off-by: Conor Walsh <conor.walsh@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2020-10-14 21:26:36 +02:00
Nicolas Chautru	b17d70922d	baseband/acc100: add configure function Add configure function to configure the PF from within the bbdev-test itself without external application configuration the device. Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:26:25 +02:00
Nicolas Chautru	3bfc5f6040	baseband/acc100: add debug function to validate input Debug functions to validate the input API from user Only enabled in DEBUG mode at build time Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Reviewed-by: Tom Rix <trix@redhat.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	0653146415	baseband/acc100: support interrupt Adding capability and functions to support MSI interrupts, call backs and inforing. Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	f404dfe35c	baseband/acc100: support 4G processing Adding capability for 4G encode and decoder processing Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	48e17d2ffd	baseband/acc100: support HARQ loopback Additional support for HARQ memory loopback Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Reviewed-by: Tom Rix <trix@redhat.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	5ad5060f8f	baseband/acc100: add LDPC processing functions Adding LDPC decode and encode processing operations Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Dave Burley <dave.burley@accelercomm.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	060e767293	baseband/acc100: add queue configuration Adding function to create and configure queues for the device. Still no capability. Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Reviewed-by: Rosen Xu <rosen.xu@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	9200ffa5cd	baseband/acc100: add info get function Add in the "info_get" function to the driver, to allow us to query the device. No processing capability are available yet. Linking bbdev-test to support the PMD with null capability. Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	4cf9007979	baseband/acc100: add HW register definitions Add in the list of registers for the device and related HW specs definitions. Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Reviewed-by: Rosen Xu <rosen.xu@intel.com> Reviewed-by: Tom Rix <trix@redhat.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Nicolas Chautru	db7949bde4	baseband/acc100: introduce PMD for ACC100 Add stubs for the ACC100 PMD Signed-off-by: Nicolas Chautru <nicolas.chautru@intel.com> Reviewed-by: Tom Rix <trix@redhat.com> Acked-by: Liu Tianjiao <tianjiao.liu@intel.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-10-14 21:06:56 +02:00
Ankur Dwivedi	ff56727b4f	test/crypto: fix device number In testsuite_setup(), ts_params is configured for first valid device. The same device should be used as valid device in test_device_configure_invalid_dev_id test case. Fixes: 202d375c60bc ("app/test: add cryptodev unit and performance tests") Cc: stable@dpdk.org Signed-off-by: Ankur Dwivedi <adwivedi@marvell.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com>	2020-10-14 21:06:56 +02:00
Vladimir Medvedkin	d26ec0d90e	app/test-sad: fix uninitialized variable Coverity issue: 362055 Fixes: 908be0651a5a ("app/test-sad: add test application for IPsec SAD") Cc: stable@dpdk.org Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com>	2020-10-14 21:06:56 +02:00
Ankur Dwivedi	4a35a46409	crypto/octeontx2: fix session-less mode A temporary session is created for sessionless crypto operations. rte_cryptodev_sym_session_create() should be used for creating the temporary session as it initializes the session structure in the correct way. Also the session should be set to 0 before freeing it. Fixes: 17ac2a72191b ("crypto/octeontx2: add enqueue/dequeue ops") Cc: stable@dpdk.org Signed-off-by: Ankur Dwivedi <adwivedi@marvell.com> Acked-by: Anoob Joseph <anoobj@marvell.com>	2020-10-14 21:06:56 +02:00
Thomas Monjalon	3cd73a1a1c	eal: simplify exit functions The option RTE_EAL_ALWAYS_PANIC_ON_ERROR was off by default, and not customizable with meson. It is completely removed. The function rte_dump_registers is a trace of the bare metal support era, and was not supported in userland. It is completely removed. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Kevin Traynor <ktraynor@redhat.com> Acked-by: David Marchand <david.marchand@redhat.com>	2020-10-15 22:33:47 +02:00
Harry van Haaren	31f83163cf	eal: add new prefetch write variants This commit adds new rte_prefetchX_write() variants, that suggest to the compiler to use a prefetch instruction with intention to write. As a compiler builtin, the compiler can choose based on compilation target what the best implementation for this instruction is. Three versions are provided, targeting the different levels of cache. Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com>	2020-10-15 21:49:59 +02:00
Eli Britstein	057d9a92f0	eal: fix build with conflicting libc variable memory_order The cited commit introduced functions with 'int memory_order' argument. The C11 standard section 7.17.1.4 defines 'memory_order' as the "enumerated type whose enumerators identify memory ordering constraints". A compilation error occurs: error: declaration of 'memory_order' shadows a global declaration [-Werror=shadow] rte_atomic_thread_fence(int memory_order) This issue was hit when trying to compile OVS with gcc 4.8.5. This compiler version does not provide stdatomic.h, so enum memory_order is redefined in OVS code. In another case, if the compiler does provide stdatomic.h header, passing -Wsystem-headers in the CFLAGS will also cause that failure. Fix it by changing the argument name 'memory_order' to 'memorder'. Fixes: 672a15056380 ("eal: add wrapper for C11 atomic thread fence") Signed-off-by: Eli Britstein <elibr@nvidia.com> Reviewed-by: Asaf Penso <asafp@nvidia.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2020-10-15 18:49:53 +02:00
Konstantin Ananyev	e0a1cd7a62	acl: fix build with gcc 5.4.0 gcc 5.4 fails with: ../lib/librte_acl/acl_run_avx512x8.h: In function 'match_process_avx512x8': ../lib/librte_acl/acl_run_avx512x8.h:382:31: error: pointer targets in passing argument 1 of '_mm256_mask_i32scatter_epi32' differ in signedness [-Werror=pointer-sign] Later gcc versions work fine, as for them parameter type was changed to 'void *'. Fixed by applying explicit cast for offending argument. Bugzilla ID: 556 Fixes: b64c2295f7fc ("acl: add 256-bit AVX512 classify method") Fixes: 45da22e42ec3 ("acl: add 512-bit AVX512 classify method") Reported-by: Ali Alnubani <alialnu@nvidia.com> Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Tested-by: Ali Alnubani <alialnu@nvidia.com>	2020-10-15 14:31:46 +02:00
David Marchand	0c0d0d9df7	eal: add experimental tags for write combining store Only marking the doxygen declarations is not enough. Arch specific implementations must be tagged as well since there is no common declaration of those inlines. Fixes: 8a00dfc738fe ("eal: add write combining store") Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-15 08:45:30 +02:00
Savinay Dharmappa	bf32a357e2	sched: remove redundant subport parameters Remove redundant data structure fields. Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:14:28 +02:00
Savinay Dharmappa	b393ad5f56	test/sched: update subport rate dynamically Modify the test_sched application to build the hierarchical scheduler with default subport bandwidth profile. It also allows to update a subport with different subport rates dynamically Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:14:21 +02:00
Savinay Dharmappa	b5dfa6703d	net/softnic: update subport rate dynamically Modify the softnic drivers to build the hierarchical scheduler with default subport bandwidth profile. It also allows to update a subport with different subport rates dynamically. Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:14:13 +02:00
Savinay Dharmappa	54a298e5f7	examples/ip_pipeline: update subport rate dynamically Modify the ip_pipeline application to build the hierarchical scheduler with default subport bandwidth profile. It also allows to update a subport with different subport rates dynamically Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:14:09 +02:00
Savinay Dharmappa	802d214dc8	examples/qos_sched: update subport rate dynamically Modify the qos_sched application to build the hierarchical scheduler with default subport bandwidth profile. It also allows to update a subport with different subport rates dynamically. Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:13:59 +02:00
Savinay Dharmappa	ac6fcb841b	sched: update subport rate dynamically Add support to update subport rate dynamically. Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:13:08 +02:00
Savinay Dharmappa	5f757d8fcc	sched: introduce subport profile add function API to add new subport bandwidth profile. Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Signed-off-by: Jasvinder Singh <jasvinder.singh@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:11:55 +02:00
Savinay Dharmappa	0ea4c6afca	sched: add subport profile table Add subport profile table to internal port data structure and update the port config function. Signed-off-by: Savinay Dharmappa <savinay.dharmappa@intel.com> Signed-off-by: Jasvinder Singh <jasvinder.singh@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-15 02:11:50 +02:00
Dmitry Kozlyuk	4fa3a084be	examples/cmdline: build on Windows Enable cmdline sample application as all dependencies are met. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-15 00:39:10 +02:00
Dmitry Kozlyuk	841dfdd06d	cmdline: support Windows Implement terminal handling, input polling, and vdprintf() for Windows. Because Windows I/O model differs fundamentally from Unix and there is no concept of character device, polling is simulated depending on the underlying input device. Supporting non-terminal input is useful for automated testing. Windows emulation of VT100 uses "ESC [ E" for newline instead of standard "ESC E", so add a workaround. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-15 00:39:10 +02:00
Dmitry Kozlyuk	f40a74cfcf	eal/windows: improve compatibility networking headers Extend compatibility header system to support librte_cmdline. pthread.h has to include windows.h, which exposes struct in_addr, etc. conflicting with compatibility headers. WIN32_LEAN_AND_MEAN macro is required to disable this behavior. Use rte_windows.h to define WIN32_LEAN_AND_MEAN for pthread library. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-15 00:39:10 +02:00
Dmitry Kozlyuk	b5741b5704	cmdline: add internal wrapper for vdprintf Add internal wrapper for vdprintf(3) that is only available on Unix. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-15 00:39:10 +02:00
Dmitry Kozlyuk	9251cd97a6	cmdline: add internal wrappers for character input poll(3) is a purely Unix facility, so it cannot be directly used by common code. read(2) is limited in device support outside of Unix. Create wrapper functions and implement them for Unix. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-15 00:39:10 +02:00
Dmitry Kozlyuk	7b5f4e1d30	cmdline: add internal wrappers for terminal handling Add functions that set up, save, and restore terminal parameters. Use existing code as Unix implementation. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-15 00:39:10 +02:00
Dmitry Kozlyuk	51fcb6a1fe	cmdline: make implementation logically opaque struct cmdline exposes platform-specific members it contains, most notably struct termios that is only available on Unix. While ABI considerations prevent from hinding the definition on already supported platforms, struct cmdline is considered logically opaque from now on. Add a deprecation notice targeted at 20.11. * Remove tests checking struct cmdline content as meaningless. * Fix missing cmdline_free() in unit test. * Add cmdline_get_rdline() to access history buffer indirectly. The new function is currently used only in tests. Suggested-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-10-15 00:39:10 +02:00
Dmitry Kozlyuk	f4cbdbc7fb	eal/windows: implement alarm API Implementation is based on waitable timers Win32 API. When timer is set, a callback and its argument are supplied to the OS, while timer handle is stored in EAL alarm list. When timer expires, OS wakes up the interrupt thread and runs the callback. Upon completion it removes the alarm. Waitable timers must be set from the thread their callback will run in, eal_intr_thread_schedule() provides a way to schedule asyncronuous code execution in the interrupt thread. Alarm module builds synchronous timer setup on top of it. Windows alarms are not a type of DPDK interrupt handle and do not interact with interrupt module beyond executing in the same thread. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Narcisa Vasile <navasile@linux.microsoft.com>	2020-10-14 22:54:04 +02:00
Dmitry Kozlyuk	5c016fc020	eal/windows: add interrupt thread skeleton Windows interrupt support is based on IO completion ports (IOCP). Interrupt thread would send the devices requests to notify about interrupts and then wait for any request completion. Add skeleton code of this model without any hardware support. Another way to wake up the interrupt thread is APC (asynchronous procedure call), scheduled by any other thread via eal_intr_thread_schedule(). This internal API is intended for alarm implementation. Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Narcisa Vasile <navasile@linux.microsoft.com>	2020-10-14 22:48:38 +02:00
Pallavi Kadam	c76ec01b45	bus/pci: support netuio on Windows This patch adds implementations to probe PCI devices bound to netuio with the help of "netuio" class device changes. Now Windows will support both "netuio" and "net" device class and can set kernel driver type based on the device class selection. Note: Few definitions and structures have been copied from netuio_interface.h file from ("[v5] windows/netuio: add Windows NetUIO kernel driver") series and this will be fixed once the exact path for netuio source code is known. Signed-off-by: John Alexander <john.alexander@datapath.co.uk> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Reviewed-by: Tal Shnaiderman <talshn@nvidia.com> Reviewed-by: Narcisa Vasile <navasile@linux.microsoft.com>	2020-10-14 22:28:42 +02:00
Ting Xu	99541c3028	table: fix hash for 32-bit When create softnic hash table with 16 keys, it failed on 32-bit environment, because the pointer field in structure rte_bucket_4_16 is only 32 bits. Add a padding field in 32-bit environment to keep the structure to a multiple of 64 bytes. Apply this to 8-byte and 32-byte key hash function as well. Fixes: 8aa327214c ("table: hash") Cc: stable@dpdk.org Signed-off-by: Ting Xu <ting.xu@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-14 14:42:29 +02:00
Konstantin Ananyev	c5cf148d89	acl: deduplicate AVX512 code Current rte_acl_classify_avx512x32() and rte_acl_classify_avx512x16() code paths are very similar. The only differences are due to 256/512 register/instrincts naming conventions. So to deduplicate the code: - Move common code into “acl_run_avx512_common.h” - Use macros to hide difference in naming conventions Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-14 14:37:51 +02:00
Konstantin Ananyev	6fba1c8ba0	acl: optimize AVX512 classify with 4 bytes loads With current ACL implementation first field in the rule definition has always to be one byte long. Though for optimising classify implementation it might be useful to do 4B reads (as we do for rest of the fields). So at build phase, check user provided field definitions to determine is it safe to do 4B loads for first ACL field. Then at run-time this information can be used to choose classify behavior. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-14 14:23:01 +02:00
Konstantin Ananyev	45da22e42e	acl: add 512-bit AVX512 classify method Introduce classify implementation that uses AVX512 specific ISA. rte_acl_classify_avx512x32() is able to process up to 32 flows in parallel. It uses 512-bit width registers/instructions and provides higher performance then rte_acl_classify_avx512x16(), but can cause frequency level change. Note that for now only 64-bit version is supported. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-14 14:23:01 +02:00
Konstantin Ananyev	867d0d3649	acl: select 256-bit AVX512 classify method by default On supported platforms, set RTE_ACL_CLASSIFY_AVX512X16 as default ACL classify algorithm. Note that AVX512X16 implementation uses 256-bit registers/instincts only to avoid possibility of frequency drop. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-14 14:23:01 +02:00
Konstantin Ananyev	b64c2295f7	acl: add 256-bit AVX512 classify method Introduce classify implementation that uses AVX512 specific ISA. rte_acl_classify_avx512x16() is able to process up to 16 flows in parallel. It uses 256-bit width registers/instructions only (to avoid frequency level change). Note that for now only 64-bit version is supported. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2020-10-14 14:23:00 +02:00

1 2 3 4 5 ...

24750 Commits