numam-dpdk

Author	SHA1	Message	Date
Cristian Dumitrescu	dfa9491a18	pipeline: introduce custom instructions For better performance, the option to create custom instructions when the program is translated and add them on-the-fly to the pipeline is now provided. Multiple regular instructions can now be consolidated into a single C function optimized by the C compiler directly. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:09:13 +02:00
Cristian Dumitrescu	5dc6a5f2e7	pipeline: introduce action functions For better performance, the option to run a single function per action is now provided, which requires a single function call per action that can be better optimized by the C compiler, as opposed to one function call per instruction. Special table lookup instructions are added to to support this feature. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:09:11 +02:00
Cristian Dumitrescu	4bd025dc98	pipeline: enable persistent instruction meta-data Save the instruction meta-data for later use instead of freeing it up once the instruction translation is completed. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:23 +02:00
Cristian Dumitrescu	40baf712ef	pipeline: create inline functions for instruction operands Create inline functions to get the instruction operands. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:20 +02:00
Cristian Dumitrescu	0d5910ddcf	pipeline: create inline functions for meter instructions Create inline functions for the meter instructions. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:18 +02:00
Cristian Dumitrescu	c5d03ffda7	pipeline: create inline functions for register instructions Create inline functions for the register instructions. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:16 +02:00
Cristian Dumitrescu	ed7567c9d7	pipeline: create inline functions for ALU instructions Create inline functions for the ALU instructions. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:15 +02:00
Cristian Dumitrescu	fae7b2baa3	pipeline: create inline functions for DMA instruction Create inline functions for the DMA instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:13 +02:00
Cristian Dumitrescu	b82733ab25	pipeline: create inline functions for move instruction Create inline functions for the move instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:12 +02:00
Cristian Dumitrescu	4884264b17	pipeline: create inline functions for extern instruction Create inline functions for the extern instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:10 +02:00
Cristian Dumitrescu	d1a58ada1a	pipeline: create inline functions for learn instruction Create inline functions for the learn and forget instructions. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:09 +02:00
Cristian Dumitrescu	4565d7db70	pipeline: create inline functions for validate instruction Create inline functions for the validate and invalidate instructions. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:07 +02:00
Cristian Dumitrescu	d60dbdc88a	pipeline: create inline functions for emit instruction Create inline functions for the emit instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 12:03:05 +02:00
Cristian Dumitrescu	2574fd607e	pipeline: create inline functions for extract instruction Create inline functions for the extract instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 11:59:47 +02:00
Cristian Dumitrescu	fcb03ae09e	pipeline: create inline functions for Tx instruction Create inline functions for the Tx instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 11:59:37 +02:00
Cristian Dumitrescu	101d7f09bf	pipeline: create inline functions for Rx instruction Create inline functions for the Rx instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 11:59:28 +02:00
Cristian Dumitrescu	c693add3bf	pipeline: move thread inline functions to header file Move the thread inline functions to the internal header file. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 11:59:07 +02:00
Cristian Dumitrescu	97b8278ad9	pipeline: move data structures to internal header file Start to consolidate the data structures and inline functions required by the pipeline instructions into an internal header file. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 11:43:36 +02:00
Cristian Dumitrescu	4f59d37261	pipeline: support learner tables Add pipeline level support for learner tables. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 09:52:04 +02:00
Cristian Dumitrescu	0c06fa3bfa	table: support learner tables A learner table is typically used for learning or connection tracking, where it allows for the implementation of the "add on miss" scenario: whenever the lookup key is not found in the table (lookup miss), the data plane can decide to add this key to the table with a given action with no control plane intervention. Likewise, the table keys expire based on a configurable timeout and are automatically deleted from the table with no control plane intervention. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 09:30:41 +02:00
Cristian Dumitrescu	220d419b86	pipeline: add header look-ahead instruction Added look-ahead instruction to read a header from the input packet without advancing the extraction pointer. This is typically used in correlation with the special extract instruction to extract variable size headers from the input packet: the first few header fields are read without advancing the extraction pointer, just enough to detect the actual length of the header (e.g. IPv4 IHL field); then the full header is extracted. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 09:15:07 +02:00
Cristian Dumitrescu	5972f82936	pipeline: add variable size headers extract instruction Added a mechanism to extract variable size headers through a special flavor of the extract instruction. The length of the last struct field which has variable size is passed as argument to the instruction. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 09:14:56 +02:00
Cristian Dumitrescu	cef3896928	pipeline: support variable size headers Added support for variable size headers. The last field of a struct type can now have a variable size between 0 and N bytes. Useful to accommodate IPv4 packets with options, etc. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 09:14:45 +02:00
Cristian Dumitrescu	5f3e610422	pipeline: prepare for variable size headers The emit instruction that is responsible for pushing headers into the output packet is now reading the header length from internal run-time structures as opposed to constant value from the instruction opcode. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-09-27 09:14:41 +02:00
Harman Kalra	3f156ec284	metrics: promote deinitialize API Remove experimental flag from rte_metrics_deinit(). This API was introduced in 19.11 release. Signed-off-by: Harman Kalra <hkalra@marvell.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-09-23 19:21:47 +02:00
Tal Shnaiderman	826476bc70	eal/windows: fix debug build When building DPDK on Windows in debug mode the following warning appear: warning: token pasting of ',' and __VA_ARGS__ is a GNU extension [-Wgnu-zero-variadic-macro-arguments] #define open(path, flags, ...) _open(path, flags, ##__VA_ARGS__) Modify the 'open' macro to avoid it. Fixes: `45d62067c2` ("eal: make OS shims internal") Cc: stable@dpdk.org Signed-off-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-09-23 19:16:27 +02:00
Radu Nicolau	d2671e642a	telemetry: support dict of dicts Add support for dicts of dicts to telemetry library. Increase the max string size to 128. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Radu Nicolau <radu.nicolau@intel.com> Acked-by: Ciara Power <ciara.power@intel.com>	2021-09-23 14:15:29 +02:00
Thomas Monjalon	ca0c25bbc9	eal: reword logs for CPU and NUMA counts Some logs about cores and nodes were using hypotetic plural (s) form. A fixed plural form with value at the end is preferred. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-09-23 08:55:20 +02:00
Thomas Monjalon	7a33572057	lib: remove C++ include guard from private headers The private headers are compiled internally with a C compiler. Thus extern "C" declaration is useless in such files. Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-09-22 22:00:17 +02:00
Chenbo Xia	945ef8a040	vhost: promote some APIs to stable As reported by symbol bot, APIs listed in this patch have been experimental for more than two years. This patch promotes these 18 APIs to stable. Signed-off-by: Chenbo Xia <chenbo.xia@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Kevin Traynor <ktraynor@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-09-14 13:21:57 +02:00
Gaoxiang Liu	e53084d84b	vhost: log socket path on adding connection Add log print of socket path in vhost_user_add_connection. It's useful when adding a mass of socket connections, because the information of every connection is clearer. Fixes: `8f972312b8` ("vhost: support vhost-user") Cc: stable@dpdk.org Signed-off-by: Gaoxiang Liu <liugaoxiang@huawei.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-09-14 13:21:57 +02:00
Gaoxiang Liu	451dc0fad8	vhost: fix crash on port deletion The rte_vhost_driver_unregister() and vhost_user_read_cb() can be called at the same time by 2 threads. when memory of vsocket is freed in rte_vhost_driver_unregister(), the invalid memory of vsocket is accessed in vhost_user_read_cb(). It's a bug of both mode for vhost as server or client. E.g., vhostuser port is created as server. Thread1 calls rte_vhost_driver_unregister(). Before the listen fd is deleted from poll waiting fds, "vhost-events" thread then calls vhost_user_server_new_connection(), then a new conn fd is added in fdset when trying to reconnect. "vhost-events" thread then calls vhost_user_read_cb() and accesses invalid memory of socket while thread1 frees the memory of vsocket. E.g., vhostuser port is created as client. Thread1 calls rte_vhost_driver_unregister(). Before vsocket of reconn is deleted from reconn list, "vhost_reconn" thread then calls vhost_user_add_connection() then a new conn fd is added in fdset when trying to reconnect. "vhost-events" thread then calls vhost_user_read_cb() and accesses invalid memory of socket while thread1 frees the memory of vsocket. The fix is to move the "fdset_try_del" in front of free memory of conn, then avoid the race condition. The core trace is: Program terminated with signal 11, Segmentation fault. Fixes: `52d874dc67` ("vhost: fix crash on closing in client mode") Cc: stable@dpdk.org Signed-off-by: Gaoxiang Liu <liugaoxiang@huawei.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-09-14 13:21:57 +02:00
Jiayu Hu	abeb865255	vhost: remove copy threshold for async path Copy threshold has been introduced in async vhost data path to select the appropriate copy engine to do copies for higher efficiency. However, it may cause packets ordering issues and also introduces performance unpredictability. Therefore, this patch removes copy threshold support in async vhost data path. Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-09-14 13:21:55 +02:00
Mohsin Kazmi	818ce1132a	net: fix checksum offload for outer IPv4 Preparation of the headers for the hardware offload misses the outer IPv4 checksum offload. It results in bad checksum computed by hardware NIC. This patch fixes the issue by setting the outer IPv4 checksum field to 0. Fixes: `4fb7e803eb` ("ethdev: add Tx preparation") Cc: stable@dpdk.org Signed-off-by: Mohsin Kazmi <mohsin.kazmi14@gmail.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2021-09-15 12:51:49 +02:00
David Marchand	b37ed6def3	ethdev: promote sibling iterators to stable This API saw no update since its introduction and will help applications like OVS ([1] and [2]) that currently look at rte_eth_devices[] to achieve the same. 1: https://github.com/openvswitch/ovs/blob/master/lib/netdev-dpdk.c#L1285 2: https://github.com/openvswitch/ovs/blob/master/lib/netdev-dpdk.c#L1476 Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-09-15 11:48:54 +02:00
Haiyue Wang	9fecac6c3a	ethdev: promote burst mode API The DPDK Symbol Bot reports: Please note the symbols listed below have expired. In line with the DPDK ABI policy, they should be scheduled for removal, in the next DPDK release. Symbol rte_eth_rx_burst_mode_get rte_eth_tx_burst_mode_get Signed-off-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-09-15 10:46:00 +02:00
Joyce Kong	97d7b92772	ethdev: fix typo in Rx queue setup API comment Fix a typo that mb_pool was misspelt as mp_pool. Fixes: `4ff702b5df` ("ethdev: introduce Rx buffer split") Cc: stable@dpdk.org Signed-off-by: Joyce Kong <joyce.kong@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-09-15 10:38:43 +02:00
Pavan Nikhilesh	2def522abc	ethdev: promote API to set packet types Remove experimental tag from rte_eth_dev_set_ptypes(). Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-09-15 09:54:04 +02:00
Xiaoyun Li	dbd34bee98	ethdev: promote API to get interrupt FD per queue Remove the experimental tag for rte_eth_dev_rx_intr_ctl_q_get_fd API that was introduced in 18.11 and have been around for 11 releases. Signed-off-by: Xiaoyun Li <xiaoyun.li@intel.com> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu> Acked-by: David Marchand <david.marchand@redhat.com>	2021-09-14 18:18:06 +02:00
Conor Walsh	4777674c44	eal: fix memory leak when saving arguments This patch fixes a memleak which was reported in Bugzilla within the eal_save_args function. This was caused by the function mistakenly adding -- to the eal args instead of breaking beforehand. Bugzilla ID: 722 Fixes: `293c53d8b2` ("eal: add telemetry callbacks") Reported-by: Zhihong Peng <zhihongx.peng@intel.com> Signed-off-by: Conor Walsh <conor.walsh@intel.com> Signed-off-by: Conor Fogarty <conor.fogarty@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-09-16 22:24:54 +02:00
Stephen Hemminger	7df485eb3d	eal: remove deprecated noninclusive API New API for these were added in 20.11 and the old API was retained but marked deprecated. Since 21.11 is the next LTS, it is time to remove the deprecated ones. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-09-16 17:21:22 +02:00
Dmitry Kozlyuk	5ffa86a2b4	build: propagate Windows system dependencies to pkg-config Windows EAL depends on some system libraries. They were linked using add_project_link_arguments('-l<LIB>'), which prevented meson from adding them to Libs.private of pkg-config file. As a result, applications using pkg-config to find DPDK hit link errors, for example: librte_eal.a(eal_windows_eal_debug.c.obj) : error LNK2019: unresolved external symbol __imp_SymInitialize referenced in function rte_dump_stack Reference required libraries in EAL using ext_deps meson variable. bus/pci and net/pcap depend on lib/eal and will pull them automatically. Drop advapi32 dependency, as MinGW locates VirtualAlloc2() dynamically. Fixes: `2a5d547a4a` ("eal/windows: implement basic memory management") Fixes: `c91717eb75` ("eal/windows: support exit and panic") Cc: stable@dpdk.org Reported-by: William Tu <u9012063@gmail.com> Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: William Tu <u9012063@gmail.com>	2021-09-14 15:58:34 +02:00
Thomas Monjalon	557610a8ff	cryptodev: fix indent in Meson file Fixes: `af668035f7` ("cryptodev: expose driver interface as internal") Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-09-14 15:58:32 +02:00
Aman Deep Singh	a7db3afce7	net: add macro to extract MAC address bytes Added macros to simplify print of MAC address. The six bytes of a MAC address are extracted in a macro here, to improve code readablity. Signed-off-by: Aman Deep Singh <aman.deep.singh@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-09-07 19:08:05 +02:00
Aman Deep Singh	c2c4f87b12	net: add macro for MAC address print Added macro to print six bytes of MAC address. The MAC addresses will be printed in upper case hexadecimal format. In case there is a specific check for lower case MAC address, the user may need to make a change in such test case after this patch. Signed-off-by: Aman Deep Singh <aman.deep.singh@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-09-07 19:07:46 +02:00
Hemant Agrawal	864c1a40d7	security: support PDCP short MAC-I This patch add support to handle PDCP short MAC-I domain along with standard control and data domains as it has to be treaty as special case with PDCP protocol offload support. ShortMAC-I is the 16 least significant bits of calculated MAC-I. Usually when a RRC message is exchanged between UE and eNodeB it is integrity & ciphered protected. MAC-I = f(key, varShortMAC-I, count, bearer, direction). Here varShortMAC-I is prepared by using (current cellId, pci of source cell and C-RNTI of old cell). Other parameters like count, bearer and direction set to all 1. crypto-perf app is updated to take short MAC as input mode. Signed-off-by: Gagandeep Singh <g.singh@nxp.com> Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-09-08 16:54:37 +02:00
Akhil Goyal	af668035f7	cryptodev: expose driver interface as internal The rte_cryptodev_pmd.* files are for drivers only and should be private to DPDK, and not installed for app use. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Acked-by: Matan Azrad <matan@nvidia.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2021-09-08 09:35:12 +02:00
Akhil Goyal	e74abd4843	cryptodev: rename function to check device validity The API rte_cryptodev_pmd_is_valid_dev, can be used by the application as well as PMD to check whether the device is valid or not. Hence, _pmd is removed from the API. The applications and drivers which use this API are also updated. Signed-off-by: Akhil Goyal <gakhil@marvell.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2021-09-08 09:21:10 +02:00
David Christensen	c13e617739	eal/ppc: ignore GCC 10 stringop-overflow warnings Suppress gcc warning "warning: writing 16 bytes into a region of size 0" for users of the POWER rte_memcpy() function. Existing rte_memcpy() code takes different code paths based on the actual size of the move so the warning is already addressed. See also commit `b5b3ea803e` ("eal/x86: ignore gcc 10 stringop-overflow warnings") Cc: stable@dpdk.org Signed-off-by: David Christensen <drc@linux.vnet.ibm.com>	2021-09-13 09:18:06 +02:00
Xueming Li	b344eb5d94	devargs: parse global device syntax When parsing a devargs, try to parse using the global device syntax first. Fallback on legacy syntax on error. Example of new global device syntax: -a bus=pci,addr=82:00.0/class=eth/driver=mlx5,dv_flow_en=1 Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Gaetan Rivet <grive@u256.net>	2021-09-02 17:02:27 +02:00
Xueming Li	d2a66ad794	bus: add device arguments name parsing For device probe and iterator, devargs name was key information, parsed by rte_devargs_parse. In legacy parser, devargs name was extracted after bus name: bus:name,kv_arguments,,, Example: pci:83:00.0,arguments,... vdev:pcap0,... To be compatible with legacy parser, this patch introduces new bus driver API devargs_parse to parse devargs and update devargs name. If devargs_parse not implemented by bus driver, the new syntax parser rte_devargs_layers_parse default will resolve devargs name from bus's "name" argument. Different bus driver might choose different keys from arguments with unified format. The PCI bus implementation fills the devargs name with the "addr" argument, example: -a bus=pci,addr=83:00.0/class=eth/driver=mlx5,... name: 0000:03:00.0 -a bus=vdev,name=pcap0/class=eth/driver=pcap,... name:pcap0 Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: Gaetan Rivet <grive@u256.net>	2021-09-02 16:58:19 +02:00
Thomas Monjalon	fdab8f2e17	version: 21.11-rc0 Start a new release cycle with empty release notes. The ABI version becomes 22.0. The map files are updated to the new ABI major number (22). The ABI exceptions are dropped and CI ABI checks are disabled because compatibility is not preserved. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: David Marchand <david.marchand@redhat.com>	2021-08-17 08:37:52 +02:00
Churchill Khangar	69fa4a61ae	pipeline: fix table statistics This patch fixes the memcpy function call which was incorrect and led to memory corruption for tables with more that just a few actions. Fixes: `742b0a57f5` ("pipeline: add table statistics to SWX") Cc: stable@dpdk.org Signed-off-by: Churchill Khangar <churchill.khangar@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-08-04 14:45:13 +02:00
Ciara Power	eeaeca82b8	cryptodev: fix freeing after device release The PMD destroy function was calling the release function, which frees cryptodev->data, and then tries to free cryptodev->data->dev_private, which causes the heap use after free issue. A temporary pointer is set before the free of cryptodev->data, which can then be used afterwards to free dev_private. The free cannot be moved to before the release function is called, as dev_private is used in the PMD close function while being released. Fixes: `9e6edea418` ("cryptodev: add APIs to assist PMD initialisation") Cc: stable@dpdk.org Reported-by: Zhihong Peng <zhihongx.peng@intel.com> Signed-off-by: Ciara Power <ciara.power@intel.com> Acked-by: Akhil Goyal <gakhil@marvell.com>	2021-07-30 21:08:12 +02:00
Dmitry Kozlyuk	23ce9e0a19	eal/windows: cleanup virt2phys handle eal_mem_virt2phys_init() opens a handle for use by rte_mem_virt2phy(). Close this handle on EAL cleanup. Fixes: `2a5d547a4a` ("eal/windows: implement basic memory management") Cc: stable@dpdk.org Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Ranjit Menon <ranjit.menon@intel.com>	2021-07-30 18:56:01 +02:00
Naga Harish K S V	6922655cad	eventdev: fix event port setup in Tx adapter The event port config set by application in rte_event_eth_tx_adapter_create API is modified in default configuration callback function. This patch removes this hardcode to use application provided event port config value. Fixes: `a3bbf2e097` ("eventdev: add eth Tx adapter implementation") Cc: stable@dpdk.org Signed-off-by: Naga Harish K S V <s.v.naga.harish.k@intel.com>	2021-07-30 12:55:19 +02:00
Maxime Coquelin	3c929a0bb3	vhost: fix crash on reconnect When the vhost-user frontend like Virtio-user tries to reconnect to the restarted Vhost backend, the Vhost backend segfaults when multiqueue is enabled. This is caused by VHOST_USER_GET_VRING_BASE being called for a virtqueue that has not been created before, causing a NULL pointer dereferencing. This patch adds the VHOST_USER_GET_VRING_BASE requests to the list of requests that trigger queue pair allocations. Fixes: `160cbc815b` ("vhost: remove a hack on queue allocation") Cc: stable@dpdk.org Reported-by: Yinan Wang <yinan.wang@intel.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Tested-by: Yinan Wang <yinan.wang@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-07-28 08:27:53 +02:00
Huisong Li	c6ccd1e392	sched: rework configuration failure handling Currently, rte_sched_free_memory() is called multiple times by the exception handling code in rte_sched_subport_config() and rte_sched_pipe_config(). This patch optimizes them into a unified outlet to free memory. Fixes: `ac6fcb841b` ("sched: update subport rate dynamically") Fixes: `34a90f8665` ("sched: modify pipe functions for config flexibility") Fixes: `ce7c4fd7c2` ("sched: add pipe config to subport level") Cc: stable@dpdk.org Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-07-24 10:58:58 +02:00
Huisong Li	a042481ecd	sched: fix profile allocation failure handling This patch fixes return value judgment when allocate memory to store the subport profile, and releases memory of 'rte_sched_port' if code fails to apply for this memory. Fixes: `0ea4c6afca` ("sched: add subport profile table") Cc: stable@dpdk.org Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-07-24 10:58:39 +02:00
Richael Zhuang	d37462e56c	power: check frequencies count before filling array The freqs array size is RTE_MAX_LCORE_FREQS. Before filling the array with num_freqs elements, restrict the total num to RTE_MAX_LCORE_FREQS. This fix aims to fix the coverity scan issue like: Overrunning array "pi->freqs" of 256 bytes by passing it to a function which accesses it at byte offset 464. Coverity issue: 371913 Fixes: `ef1cc88f18` ("power: support cppc_cpufreq driver") Cc: stable@dpdk.org Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Acked-by: David Hunt <david.hunt@intel.com>	2021-07-24 10:09:58 +02:00
Stephen Hemminger	128c22b998	eal: fix argument in 32-bit safe BSF function The first argument to rte_bsf32_safe was incorrectly declared as a 64 bit value. The code only works on 32 bit values and the underlying function rte_bsf32 only accepts 32 bit values. This was a mistake introduced when the safe version was added and probably cause by copy/paste from the 64 bit version. The bug passed silently under the radar until some other code was built with -Wall and -Wextra in C++ and C++ complains about the missing cast. Yes, this is a API signature change, but the original code was wrong. It is an inline so not an ABI change. Fixes: `4e261f5519` ("eal: add 64-bit bsf and 32-bit safe bsf functions") Cc: stable@dpdk.org Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Tyler Retzlaff <roretzla@linux.microsoft.com>	2021-07-24 09:51:30 +02:00
Jiayu Hu	259caa21d7	vhost: handle memory hotplug for async vhost When the guest memory is hotplugged, the vhost application which enables DMA acceleration must stop DMA transfers before the vhost re-maps the guest memory. This patch is to notify the vhost application of stopping DMA transfers. Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-23 10:58:53 +02:00
Cheng Jiang	b737fd6139	vhost: add unsafe async API to clear packets Applications need to stop DMA transfers and finish all the inflight packets when in VM memory hot-plug case and async vhost is used. This patch is to provide an unsafe API to clear inflight packets which are submitted to DMA engine in vhost async data path. Update the program guide and release notes for virtqueue inflight packets clear API in vhost lib. Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-23 10:58:53 +02:00
Cheng Jiang	3f63c19b2b	vhost: fix async callbacks return type The async vhost callback ops should return negative value when there are something wrong in the callback, so the return type should be changed into int32_t. The issue in vhost example is also fixed. Fixes: `cd6760da10` ("vhost: introduce async enqueue for split ring") Fixes: `819a716858` ("vhost: fix async callback return type") Fixes: `6b3c81db8b` ("vhost: simplify async copy completion") Fixes: `abec60e711` ("examples/vhost: support vhost async data path") Fixes: `6e9a9d2a02` ("examples/vhost: fix ioat dependency") Fixes: `873e8dad6f` ("vhost: support packed ring in async datapath") Cc: stable@dpdk.org Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-23 10:58:53 +02:00
Jie Zhou	b8617fcc51	eal/windows: check callback parameter of alarm functions EAL functions rte_eal_alarm_set() and rte_eal_alarm_cancel() did not for invalid parameters in Windows implementation, which is caught by the unit test alarm_autotest. Enforce parameter check to fail fast for invalid parameters. Fixes: `f4cbdbc7fb` ("eal/windows: implement alarm API") Cc: stable@dpdk.org Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-07-22 22:06:27 +02:00
Anatoly Burakov	565d01226e	power: fix multi-queue scale mode Currently in scale mode, multi-queue initialization will attempt to initialize and de-initialize the per-lcore power library structures multiple times. Fix it to only do this whenever we either enabling first queue or disabling last queue. Fixes: `5dff9a72b0` ("power: support callbacks for multiple Rx queues") Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Tested-by: David Hunt <david.hunt@intel.com>	2021-07-22 21:36:30 +02:00
Jiayu Hu	fa51f1aa08	vhost: add thread-unsafe async registration This patch adds thread unsafe version for async register and unregister functions. Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-21 07:56:13 +02:00
Jiayu Hu	acbc38887b	vhost: rework async configuration structure This patch reworks the async configuration structure to improve code readability. In addition, add preserved padding fields on the structure for future usage. Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-21 07:56:13 +02:00
Jiayu Hu	7f31d4ea05	vhost: fix lock on device readiness notification The vhost notifies the application of device readiness via vhost_user_notify_queue_state(), but calling this function is not protected by the lock. This patch is to make this function call lock protected. Fixes: `d0fcc38f5f` ("vhost: improve device readiness notifications") Cc: stable@dpdk.org Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-21 07:56:13 +02:00
Maxime Coquelin	92ed77dce6	vhost: fix packed ring index wrapping Unlike split ring, packed ring does not mandate the ring size to be a power of 2. So we have to use a modulo operation when wrapping ring index. Fixes: `873e8dad6f` ("vhost: support packed ring in async datapath") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Cheng Jiang <cheng1.jiang@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-07-21 07:56:13 +02:00
Jiayu Hu	0c0935c5f7	vhost: allow to check in-flight packets for async vhost This patch allows to check the amount of in-flight packets for the vhost queue using async acceleration. Signed-off-by: Jiayu Hu <jiayu.hu@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-21 07:56:13 +02:00
Cheng Jiang	2e3f1ab0d8	vhost: fix async packed ring batch datapath We assume that in the sync path, if there is no buffer wrap in the avail descriptors fetched in a batch, there is no buffer wrap in the used descriptors which need to be written back in this batch, but this assumption is wrong in the async path since there are inflight descriptors which are processed by the DMA device. This patch refactors the batch copy code and adds used ring buffer wrap check as a batch copy condition to fix this issue. Fixes: `873e8dad6f` ("vhost: support packed ring in async datapath") Cc: stable@dpdk.org Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-21 07:56:13 +02:00
Cheng Jiang	8d2c1260af	vhost: fix index overflow for packed ring in async vhost We introduced some new indexes in packed ring of async vhost. They will eventually overflow and lead to errors if the ring size is not a power of 2. This patch is to check and keep these indexes within a reasonable range. Fixes: `873e8dad6f` ("vhost: support packed ring in async datapath") Cc: stable@dpdk.org Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-07-21 07:56:13 +02:00
Xiao Wang	706ba48665	vhost: check header for legacy dequeue offload When parsing the virtio net header and packet header for dequeue offload, we need to perform sanity check on the packet header to ensure: - No out-of-boundary memory access. - The packet header and virtio_net header are valid and aligned. Fixes: `d0cf91303d` ("vhost: add Tx offload capabilities") Cc: stable@dpdk.org Signed-off-by: Xiao Wang <xiao.w.wang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-21 07:56:13 +02:00
Cristian Dumitrescu	40d42de563	pipeline: fix selector freeing Due to a typo, the selector_free() function incorrectly takes an early return when the selectors array is non-NULL, as opposed to the other way around. Coverity issue: 371912 Fixes: `cdaa937d3e` ("pipeline: support selector table") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-07-21 13:51:17 +02:00
Anatoly Burakov	87fb608356	power: fix crash on error for intel_pstate Currently, the error paths can lead to attempts at dereferencing NULL pointers. Add the check to avoid attempts at dereferencing NULL pointers. Coverity issue: 371895 Coverity issue: 371889 Fixes: `06cffd468f` ("power: refactor ACPI and intel_pstate support") Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-07-20 17:24:00 +02:00
David Hunt	de8606bf73	distributor: fix 128-bit write alignment When the distributor sample app is built as a 32-bit app, the data buffer passed to find_match_vec can be unaligned, causing a segmentation fault due to writing a 128-bit value using _mm_store_si128(). 128-bit align the data being passed in so this does not happen. Fixes: `775003ad2f` ("distributor: add new burst-capable library") Cc: stable@dpdk.org Signed-off-by: David Hunt <david.hunt@intel.com>	2021-07-20 14:32:08 +02:00
Viacheslav Galaktionov	10eaf41d70	ethdev: keep count of representor ranges in API In its current state, the API can overflow the user-passed buffer if a new representor range appears between function calls. In order to solve this problem, augment the representor info structure with the numbers of allocated and initialized ranges. This way the users of this structure can be sure they will not overrun the buffer. Fixes: `85e1588ca7` ("ethdev: add API to get representor info") Cc: stable@dpdk.org Signed-off-by: Viacheslav Galaktionov <viacheslav.galaktionov@oktetlabs.ru> Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Reviewed-by: Xueming Li <xuemingl@nvidia.com>	2021-07-10 11:29:11 +02:00
Changpeng Liu	7bc7bc3516	eal: suppress error log on multi-process hotplug This is a normal case that the primary process already owned one device while the secondary process try to attach it, so suppress the error log here to exclude this case. Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>	2021-07-10 10:07:07 +02:00
Cristian Dumitrescu	a3ac0a4836	pipeline: support LPM lookup Add support for the Longest Prefix Match (LPM) lookup to the SWX pipeline. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Signed-off-by: Churchill Khangar <churchill.khangar@intel.com>	2021-07-10 08:30:59 +02:00
Cristian Dumitrescu	cdaa937d3e	pipeline: support selector table Add pipeline-level support for selector tables. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-07-10 08:26:12 +02:00
Cristian Dumitrescu	f7598a62d1	table: support selector table A selector table is made up of groups of weighted members, with a given member potentially part of several groups. The select operation returns a member ID by first selecting a group based on an input group ID and then selecting a member within that group based on hashing one or several input header/meta-data fields. It is very useful for implementing an ECMP/WCMP-enabled FIB or a load balancer. It is part of the action selector described by the P4 Portable Switch Architecture (PSA) specification. Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-07-09 23:31:54 +02:00
Cristian Dumitrescu	a57d92d73d	pipeline: fix table entry read The rte_swx_pipeline_table_entry_read() function is used to read from a character string a table entry that is to be added to the table, deleted from the table or set as the default entry of the table. Addition needs both the match and the part of the entry, deletion ignores the action part, while the default set ignores the match part, hence the need to make both the match and the action part optional. The logic for skipping the match or the action part was broken, hence the current fix. Fixes: `b32c0a2c5e` ("pipeline: add SWX table update high level API") Cc: stable@dpdk.org Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com> Signed-off-by: Venkata Suresh Kumar P <venkata.suresh.kumar.p@intel.com> Signed-off-by: Churchill Khangar <churchill.khangar@intel.com>	2021-07-09 22:52:19 +02:00
Thierry Herbelot	3fc2ddffde	table: fix bucket empty check Due to a typo, only 3 out of 4 keys in the bucket of the exact match table were considered, which can result in valid keys being incorrectly dropped from the table. Fixes: `d0a0096661` ("table: add exact match SWX table") Cc: stable@dpdk.org Signed-off-by: Thierry Herbelot <thierry.herbelot@6wind.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-07-09 22:42:24 +02:00
Chengwen Feng	5aa9189d74	config/arm: fix SVE build with GCC 8.3 If the target machine has SVE feature (e.g. "-march=armv8.2-a+sve'), and the compiler is gcc-8.3, it will produce this error: In file included from lib/eal/common/eal_common_options.c:38: lib/eal/arm/include/rte_vect.h:13:10: fatal error: arm_sve.h: No such file or directory #include <arm_sve.h> ^~~~~~~~~~~ The root cause is that gcc-8.3 supports SVE (the macro __ARM_FEATURE_SVE was 1), but it doesn't support SVE ACLE [1]. The solution: a) Detect compiler whether support SVE ACLE, if support then define RTE_HAS_SVE_ACLE macro. b) Use the RTE_HAS_SVE_ACLE macro to include SVE header file. [1] ACLE: Arm C Language Extensions, the SVE ACLE header file is <arm_sve.h>, user should include it when writing ACLE SVE code. Fixes: `67b68824a8` ("lpm/arm: support SVE") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Acked-by: Ruifeng Wang <ruifeng.wang@arm.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2021-07-09 22:25:24 +02:00
Ruifeng Wang	cac2a49b4a	ring: use WFE to wait for tail update on aarch64 Instead of polling for tail to be updated, use WFE instruction. Signed-off-by: Gavin Hu <gavin.hu@arm.com> Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Steve Capper <steve.capper@arm.com> Reviewed-by: Ola Liljedahl <ola.liljedahl@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-07-09 21:33:01 +02:00
Gavin Hu	fa6b488998	spinlock: use WFE to reduce contention on aarch64 In acquiring a spinlock, cores repeatedly poll the lock variable. This is replaced by rte_wait_until_equal API. Running micro benchmarking and testpmd and l3fwd traffic tests on ThunderX2, Ampere eMAG80 and Arm N1SDP, everything went well and no notable performance gain nor degradation was measured. Signed-off-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Phil Yang <phil.yang@arm.com> Reviewed-by: Steve Capper <steve.capper@arm.com> Reviewed-by: Ola Liljedahl <ola.liljedahl@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Tested-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2021-07-09 21:33:01 +02:00
Anatoly Burakov	f53fe635c1	power: support monitoring multiple Rx queues Use the new multi-monitor intrinsic to allow monitoring multiple ethdev Rx queues while entering the energy efficient power state. The multi version will be used unconditionally if supported, and the UMWAIT one will only be used when multi-monitor is not supported by the hardware. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Tested-by: David Hunt <david.hunt@intel.com>	2021-07-09 21:13:13 +02:00
Anatoly Burakov	5dff9a72b0	power: support callbacks for multiple Rx queues Currently, there is a hard limitation on the PMD power management support that only allows it to support a single queue per lcore. This is not ideal as most DPDK use cases will poll multiple queues per core. The PMD power management mechanism relies on ethdev Rx callbacks, so it is very difficult to implement such support because callbacks are effectively stateless and have no visibility into what the other ethdev devices are doing. This places limitations on what we can do within the framework of Rx callbacks, but the basics of this implementation are as follows: - Replace per-queue structures with per-lcore ones, so that any device polled from the same lcore can share data - Any queue that is going to be polled from a specific lcore has to be added to the list of queues to poll, so that the callback is aware of other queues being polled by the same lcore - Both the empty poll counter and the actual power saving mechanism is shared between all queues polled on a particular lcore, and is only activated when all queues in the list were polled and were determined to have no traffic. - The limitation on UMWAIT-based polling is not removed because UMWAIT is incapable of monitoring more than one address. Also, while we're at it, update and improve the docs. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Tested-by: David Hunt <david.hunt@intel.com>	2021-07-09 21:13:13 +02:00
Anatoly Burakov	209fd58545	power: make ethdev power management thread unsafe Currently, we expect that only one callback can be active at any given moment, for a particular queue configuration, which is relatively easy to implement in a thread-safe way. However, we're about to add support for multiple queues per lcore, which will greatly increase the possibility of various race conditions. We could have used something like an RCU for this use case, but absent of a pressing need for thread safety we'll go the easy way and just mandate that the API's are to be called when all affected ports are stopped, and document this limitation. This greatly simplifies the `rte_power_monitor`-related code. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Tested-by: David Hunt <david.hunt@intel.com>	2021-07-09 21:13:13 +02:00
Anatoly Burakov	66834f2974	eal: add power monitor for multiple events Use RTM and WAITPKG instructions to perform a wait-for-writes similar to what UMWAIT does, but without the limitation of having to listen for just one event. This works because the optimized power state used by the TPAUSE instruction will cause a wake up on RTM transaction abort, so if we add the addresses we're interested in to the read-set, any write to those addresses will wake us up. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Tested-by: David Hunt <david.hunt@intel.com>	2021-07-09 21:13:13 +02:00
Anatoly Burakov	6afc4baf4f	eal: use callbacks for power monitoring comparison Previously, the semantics of power monitor were such that we were checking current value against the expected value, and if they matched, then the sleep was aborted. This is somewhat inflexible, because it only allowed us to check for a specific value in a specific way. This commit replaces the comparison with a user callback mechanism, so that any PMD (or other code) using `rte_power_monitor()` can define their own comparison semantics and decision making on how to detect the need to abort the entering of power optimized state. Existing implementations are adjusted to follow the new semantics. Suggested-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Tested-by: David Hunt <david.hunt@intel.com> Acked-by: Timothy McDaniel <timothy.mcdaniel@intel.com>	2021-07-09 21:13:13 +02:00
Juraj Linkeš	845048c522	eal/arm: update CPU flags There are two execution states on armv8 architecture, aarch64 and aarch32. Add PLATFORM_STR for the latter and update RTE_ARCH_* flags according to `e9b9739264`. Signed-off-by: Juraj Linkeš <juraj.linkes@pantheon.tech>	2021-07-09 20:00:19 +02:00
Ferruh Yigit	b67f598e23	kni: update link only on change 'rte_kni_update_link()' updates virtual KNI interface link using kernel sysfs interface. If the requested link status is same as interface link status, do not update the link status but return with success. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2021-07-09 17:22:42 +02:00
Richael Zhuang	ef1cc88f18	power: support cppc_cpufreq driver Currently in DPDK only acpi_cpufreq and pstate_cpufreq drivers are supported, which are both not available on arm64 platforms. Add support for cppc_cpufreq driver which works on most arm64 platforms. Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Acked-by: David Hunt <david.hunt@intel.com>	2021-07-09 16:04:46 +02:00
Anatoly Burakov	06cffd468f	power: refactor ACPI and intel_pstate support Currently, ACPI and PSTATE modes have lots of code duplication, confusing logic, and a bunch of other issues that can, and have, led to various bugs and resource leaks. This commit factors out the common parts of sysfs reading/writing for ACPI and PSTATE drivers. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Signed-off-by: David Hunt <david.hunt@intel.com>	2021-07-08 22:32:13 +02:00
Anatoly Burakov	02a6d68311	power: fix namespace for internal struct Currently, ACPI code uses rte_power_info as the struct name, which gives the appearance that this is an externally visible API. Fix to use internal namespace. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: David Hunt <david.hunt@intel.com>	2021-07-08 22:32:13 +02:00
Huisong Li	02edbfab1e	ethdev: add dev configured flag Currently, if dev_configure is not called or fails to be called, users can still call dev_start successfully. So it is necessary to have a flag which indicates whether the device is configured, to control whether dev_start can be called and eliminate dependency on user invocation order. The flag stored in "struct rte_eth_dev_data" is more reasonable than "enum rte_eth_dev_state". "enum rte_eth_dev_state" is private to the primary and secondary processes, and can be independently controlled. However, the secondary process does not make resource allocations and does not call dev_configure(). These are done by the primary process and can be obtained or used by the secondary process. So this patch adds a "dev_configured" flag in "rte_eth_dev_data", like "dev_started". Signed-off-by: Huisong Li <lihuisong@huawei.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> libabigail raised a warning on this change. This change is fine wrt ABI as far as we understand, but we can't express an exception rule (see libabigail bug #28060) to waive the changes only in this part of the rte_eth_dev_data struct. The solution for now is to globally waive any change on the rte_eth_dev_data structure. Signed-off-by: David Marchand <david.marchand@redhat.com>	2021-07-08 13:05:55 +02:00
David Marchand	e7885281de	ipc: stop mp control thread on cleanup When calling rte_eal_cleanup, the mp channel cleanup routine only sets mp_fd to -1 leaving the rte_mp_handle control thread running. This control thread can spew warnings on reading on an invalid fd. This is especially noticed with ASAN enabled. To handle this situation, set mp_fd to -1 to signal the control thread it should exit, but since this thread might be sleeping on the socket, cancel the thread too. Fixes: `85d6815fa6` ("eal: close multi-process socket during cleanup") Cc: stable@dpdk.org Reported-by: Owen Hilyard <ohilyard@iol.unh.edu> Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-07-08 13:05:55 +02:00
Jan Viktorin	56912ddef2	ethdev: fix doc of flow action The struct rte_flow_action was missing from DPDK API documentation. Fixes: `3850cf0c8c` ("ethdev: add tunnel encap/decap actions") Cc: stable@dpdk.org Signed-off-by: Jan Viktorin <viktorin@cesnet.cz> Acked-by: Ori Kam <orika@nvidia.com> Acked-by: Aman Deep Singh <aman.deep.singh@intel.com>	2021-07-02 19:03:03 +02:00
Jie Zhou	799a5b9aca	eal/windows: add clock function Add clock_gettime() on Windows in rte_os_shim.h. Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-07-02 19:03:03 +02:00
Jie Zhou	3d2fcb0e0a	eal/windows: add device event stubs Add device event stubs in eal_dev.c for Windows Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-07-02 19:03:03 +02:00
Jie Zhou	22f463e181	eal/windows: add macros required by testpmd Add required macros by testpmd on Windows in rte_os_shim.h Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Tal Shnaiderman <talshn@nvidia.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-07-02 19:03:03 +02:00
Jie Zhou	786881d152	lib: build testpmd dependencies on Windows Enable building libraries that testpmd depends on for Windows Signed-off-by: Jie Zhou <jizh@linux.microsoft.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-07-02 19:03:03 +02:00
Olivier Matz	45a08ef55e	net: introduce functions to verify L4 checksums Since commit `d5df2ae042` ("net: fix unneeded replacement of TCP checksum 0"), the functions rte_ipv4_udptcp_cksum() and rte_ipv6_udptcp_cksum() can return either 0x0000 or 0xffff when used to verify a packet containing a valid checksum. Since these functions should be used to calculate the checksum to set in a packet, introduce 2 new helpers for checksum verification. They return 0 if the checksum is valid in the packet. Use this new helper in net/tap driver. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Morten Brørup <mb@smartsharesystems.com> Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-07-02 19:03:03 +02:00
David Marchand	40edb9c0d3	eal: handle compressed firmware Introduce an internal firmware loading helper to remove code duplication in our drivers and handle xz compressed firmware by calling libarchive. This helper tries to look for .xz suffixes so that drivers are not aware the firmware has been compressed. libarchive is set as an optional dependency: without libarchive, a runtime warning is emitted so that users know there is a compressed firmware. Windows implementation is left as an empty stub. Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Igor Russkikh <irusskikh@marvell.com> Acked-by: Aaron Conole <aconole@redhat.com> Tested-by: Haiyue Wang <haiyue.wang@intel.com>	2021-07-07 16:41:53 +02:00
Bruce Richardson	d5252f7d4b	telemetry: add extra log message on socket bind failure If the library fails to create the needed socket, add an additional check to report if the error is due to a missing DPDK runtime dir. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: David Marchand <david.marchand@redhat.com> Acked-by: Morten Brørup <mb@smartsharesystems.com> Acked-by: Ciara Power <ciara.power@intel.com>	2021-07-07 15:23:53 +02:00
Bruce Richardson	ce382fdddb	eal: create runtime dir even when shared data is not used When multi-process is not wanted and DPDK is run with the "no-shconf" flag, the telemetry library still needs a runtime directory to place the unix socket for telemetry connections. Therefore, rather than not creating the directory when this flag is set, we can change the code to attempt the creation anyway, but not error out if it fails. If it succeeds, then telemetry will be available, but if it fails, the rest of DPDK will run without telemetry. This ensures that the "in-memory" flag will allow DPDK to run even if the whole filesystem is read-only, for example. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Morten Brørup <mb@smartsharesystems.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-07-07 15:23:09 +02:00
Maxime Coquelin	b3d4a18b9c	vhost: use DPDK allocations for in-flight data Inflight metadata are allocated using glibc's calloc. This patch converts them to rte_zmalloc_socket to take care of the NUMA affinity. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:32:38 +02:00
Maxime Coquelin	b81c93466d	vhost: allocate all data on same node as virtqueue This patch saves the NUMA node the virtqueue is allocated on at init time, in order to allocate all other data on the same node. While most of the data are allocated before numa_realloc() is called and so the data will be reallocated properly, some data like the log cache are most likely allocated after. For the virtio device metadata, we decide to allocate them on the same node as the VQ 0. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:32:13 +02:00
Maxime Coquelin	97b4e3b1d0	vhost: improve NUMA reallocation This patch improves the numa_realloc() function by making use of rte_realloc_socket(), which takes care of the memory copy and freeing of the old data. Suggested-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:32:13 +02:00
Maxime Coquelin	6305dfeff4	vhost: fix NUMA reallocation with multi-queue Since the Vhost-user device initialization has been reworked, enabling the application to start using the device as soon as the first queue pair is ready, NUMA reallocation no more happened on queue pairs other than the first one since numa_realloc() was returning early if the device was running. This patch fixes this issue by reallocating the device metadata only if the device is running. For the virtqueues, a vring state change notification is sent to notify the application of its disablement. Since the callback is supposed to be blocking, it is safe to reallocate it afterwards. Fixes: `d0fcc38f5f` ("vhost: improve device readiness notifications") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:32:13 +02:00
Maxime Coquelin	eb40c50c17	vhost: fix missing cache logging NUMA realloc When the guest allocates virtqueues on a different NUMA node than the one the Vhost metadata are allocated, both the Vhost device struct and the virtqueues struct are reallocated. However, reallocating the log cache on the new NUMA node was not done. This patch fixes this by reallocating it if it has been allocated already, which means a live-migration is on-going. Fixes: `1818a63147` ("vhost: move dirty logging cache out of virtqueue") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:32:13 +02:00
Maxime Coquelin	57589cdfd7	vhost: fix missing guest pages table NUMA realloc When the guest allocates virtqueues on a different NUMA node than the one the Vhost metadata are allocated, both the Vhost device struct and the virtqueues struct are reallocated. However, reallocating the guest pages table was missing, which likely causes at least one cross-NUMA accesses for every burst of packets. This patch reallocates this table on the same NUMA node as the other metadata. Fixes: `e246896178` ("vhost: get guest/host physical address mappings") Cc: stable@dpdk.org Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:26:58 +02:00
Maxime Coquelin	8119ca9114	vhost: fix missing memory table NUMA realloc When the guest allocates virtqueues on a different NUMA node than the one the Vhost metadata are allocated, both the Vhost device struct and the virtqueues struct are reallocated. However, reallocating the Vhost memory table was missing, which likely causes at least one cross-NUMA accesses for every burst of packets. This patch reallocates this table on the same NUMA node as the other metadata. Fixes: `552e8fd3d2` ("vhost: simplify memory regions handling") Cc: stable@dpdk.org Reported-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-06-30 13:26:58 +02:00
Xueming Li	35d4f17b3d	devargs: add common key definition Add common devargs key definition for "bus", "class" and "driver". Signed-off-by: Xueming Li <xuemingl@nvidia.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>	2021-07-05 16:33:18 +02:00
Thomas Monjalon	dbba7c9efb	eal: save error in string copy The string copy api rte_strscpy() did not set rte_errno during failures, instead it just returned negative error number. Set rte_errrno if the destination buffer is too small. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Signed-off-by: Xueming Li <xuemingl@nvidia.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-07-05 15:11:30 +02:00
Ruifeng Wang	18f0b28eec	eal/arm: remove unused type Data types Elf32_auxv_t and Elf64_auxv_t are used by OS Linux auxiliary vector read, and not used by arch specific cpu flag API implementations. Hence remove them from Arm file. Reported-by: James Grant <j.grant@qub.ac.uk> Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2021-07-05 09:50:51 +02:00
Owen Hilyard	03b8372a9a	rib: fix max depth IPv6 lookup ASAN found a stack buffer overflow in lib/rib/rte_rib6.c:get_dir. The fix for the stack buffer overflow was to make sure depth was always < 128, since when depth = 128 it caused the index into the ip address to be 16, which read off the end of the array. While trying to solve the buffer overflow, I noticed that a few changes could be made to remove the for loop entirely. Fixes: `f7e861e21c` ("rib: support IPv6") Cc: stable@dpdk.org Signed-off-by: Owen Hilyard <ohilyard@iol.unh.edu> Acked-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com>	2021-06-24 15:34:45 +02:00
Owen Hilyard	016441e3c7	flow_classify: fix leaking rules on delete Rules in a classify table were not freed if the table had a delete function. Fixes: `be41ac2a33` ("flow_classify: introduce flow classify library") Cc: stable@dpdk.org Signed-off-by: Owen Hilyard <ohilyard@iol.unh.edu> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2021-06-24 15:34:45 +02:00
Yunjian Wang	0db3d5551a	kni: fix mbuf allocation for kernel side use In kni_allocate_mbufs(), we alloc mbuf for alloc_q as this code. allocq_free = (kni->alloc_q->read - kni->alloc_q->write - 1) \ & (MAX_MBUF_BURST_NUM - 1); The value of allocq_free maybe zero, for example : The ring size is 1024. After init, write = read = 0. Then we fill kni->alloc_q to full. At this time, write = 1023, read = 0. Then the kernel send 32 packets to userspace. At this time, write = 1023, read = 32. And then the userspace receive this 32 packets. Then fill the kni->alloc_q, (32 - 1023 - 1) & 31 = 0, fill nothing. ... Then the kernel send 32 packets to userspace. At this time, write = 1023, read = 992. And then the userspace receive this 32 packets. Then fill the kni->alloc_q, (992 - 1023 - 1) & 31 = 0, fill nothing. Then the kernel send 32 packets to userspace. The kni->alloc_q only has 31 mbufs and will drop one packet. Absolutely, this is a special scene. Normally, it will fill some mbufs everytime, but may not enough for the kernel to use. In this patch, we always keep the kni->alloc_q to full for the kernel to use. Fixes: `49da4e82cf` ("kni: allocate no more mbuf than empty slots in queue") Cc: stable@dpdk.org Signed-off-by: Cheng Liu <liucheng11@huawei.com> Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-06-24 09:42:37 +02:00
Balazs Nemeth	242695f612	vhost: allocate and free packets in bulk in Tx split Same idea as commit `a287ac2891` ("vhost: allocate and free packets in bulk in Tx packed"), allocate and free packets in bulk. Also remove the unused function virtio_dev_pktmbuf_alloc. Signed-off-by: Balazs Nemeth <bnemeth@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-06-23 09:55:34 +02:00
Thierry Herbelot	9cfbe67691	vhost/crypto: check request pointer before dereference Use vc_req only after it was checked not to be NULL. Fixes: `2d962bb736` ("vhost/crypto: fix possible TOCTOU attack") Cc: stable@dpdk.org Signed-off-by: Thierry Herbelot <thierry.herbelot@6wind.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-06-23 09:55:23 +02:00
Dmitry Kozlyuk	cfdaa678b3	eal/windows: cleanup interrupt resources Interrupt manager in Windows EAL allocates on IOCP and starts a control thread that runs indefinitely. At DPDK cleanup this thread was not stopped and IOCP handle was not closed. Gracefully stop interrupt-handling in rte_eal_cleanup(). The thread already closes IOCP handle before exiting. Fixes: `5c016fc020` ("eal/windows: add interrupt thread skeleton") Cc: stable@dpdk.org Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Jie Zhou <jizh@microsoft.com> Tested-by: Jie Zhou <jizh@microsoft.com>	2021-06-23 09:05:36 +02:00
Dmitry Kozlyuk	3888f31950	eal/windows: fix interrupt thread handle leakage Each time a work was scheduled in the interrupt thread, usually an alarm, a handle was opened but not closed. Opening a handle is a system call, which harms alarm precision. Instead of opening and closing a handle each time, open it when interrupt thread starts and close it when the thread finishes. Fixes: `5c016fc020` ("eal/windows: add interrupt thread skeleton") Cc: stable@dpdk.org Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Tested-by: Pallavi Kadam <pallavi.kadam@intel.com>	2021-06-23 09:04:28 +02:00
Dmitry Kozlyuk	35dff5d3b7	eal/windows: fix interrupt thread ID Interrupt thread ID retained its value after interrupt thread finish. Other interrupt routines could then operate on the wrong thread. Clear interrupt thread ID before thread termination. Fixes: `5c016fc020` ("eal/windows: add interrupt thread skeleton") Cc: stable@dpdk.org Signed-off-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Tyler Retzlaff <roretzla@linux.microsoft.com>	2021-06-23 09:03:14 +02:00
Christian Ehrhardt	31c5af644b	vfio: add stdbool include This became visible by backporting the following for the 19.11 stable tree: `c13ca4e8` "vfio: fix DMA mapping granularity for IOVA as VA" The usage of type bool in the vfio code would require "#include <stdbool.h>", but rte_vfio.h has no direct paths to stdbool.h. It happens that in eal_vfio_mp_sync.c it comes after "#include <rte_log.h>". And rte_log.h since 20.05 includes stdbool since this change: `241e67bfe` "log: add API to check if a logtype can log in a given level" and thereby mitigates the issue. It should be safe to include stdbool.h from rte_vfio.h itself to be present exactly when needed for the struct it defines using that type. Fixes: `c13ca4e81c` ("vfio: fix DMA mapping granularity for IOVA as VA") Cc: stable@dpdk.org Signed-off-by: Christian Ehrhardt <christian.ehrhardt@canonical.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2021-06-17 10:31:33 +02:00
Konstantin Ananyev	b3b36f0fbf	acl: fix build with GCC 6.3 --buildtype=debug with gcc 6.3 produces the following error: ../lib/librte_acl/acl_run_avx512_common.h: In function ‘resolve_match_idx_avx512x16’: ../lib/librte_acl/acl_run_avx512x16.h:33:18: error: the last argument must be an 8-bit immediate ^ ../lib/librte_acl/acl_run_avx512_common.h:373:9: note: in expansion of macro ‘_M_I_’ return _M_I_(slli_epi32)(mi, match_log); ^~~~~ Seems like gcc-6.3 complains about the following construct: static const uint32_t match_log = 5; ... _mm512_slli_epi32(mi, match_log); It can't substitute constant variable 'match_log' with its actual value. The fix replaces constant variable with its immediate value. Bugzilla ID: 717 Fixes: `b64c2295f7` ("acl: add 256-bit AVX512 classify method") Fixes: `45da22e42e` ("acl: add 512-bit AVX512 classify method") Cc: stable@dpdk.org Reported-by: Liang Ma <liangma@liangbit.com> Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2021-06-17 09:37:11 +02:00
David Marchand	2ca92f5441	malloc: fix size annotation for NUMA-aware realloc __rte_alloc_size is mapped to compiler alloc_size attribute. Quoting gcc documentation: """ alloc_size The alloc_size attribute is used to tell the compiler that the function return value points to memory, where the size is given by one or two of the functions parameters. GCC uses this information to improve the correctness of __builtin_object_size. The function parameter(s) denoting the allocated size are specified by one or two integer arguments supplied to the attribute. The allocated size is either the value of the single function argument specified or the product of the two function arguments specified. Argument numbering starts at one. """ In rte_realloc_socket case, only 'size' matters. Note: this has been spotted by Maxime trying to use rte_realloc_socket and compiling with gcc 11. Fixes: `17b347dab7` ("malloc: add alloc_size attribute to functions") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Tested-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2021-06-11 11:03:38 +02:00
Ivan Ilchenko	1ffd3bc125	bitmap: fix buffer overrun in bitmap init Bitmap initialization function is allowed to memset() caller-provided buffer with number of bytes exceeded this buffer size. This happens due to wrong comparison sign between buffer size and number of bytes required to initialize bitmap. Fixes: `602c9ca33a` ("sched: bitmap is now dynamically allocated") Cc: stable@dpdk.org Reported-by: Andy Moreton <amoreton@xilinx.com> Signed-off-by: Ivan Ilchenko <ivan.ilchenko@oktetlabs.ru> Reviewed-by: Andy Moreton <amoreton@xilinx.com> Signed-off-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2021-06-11 11:03:25 +02:00
Haiyue Wang	21f6adec07	bus/pci: configure PCI bus master Add the API to set 'Bus Master Enable' bit to be enabled or disabled in the PCI command register. Signed-off-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Ray Kinsella <mdr@ashroe.eu>	2021-06-04 09:38:08 +02:00
David Marchand	64307fad7d	telemetry: remove static limit on callbacks count This code is not performance sensitive and can be switched to dynamic allocations. Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Ciara Power <ciara.power@intel.com>	2021-06-03 18:36:03 +02:00
Hongbo Zheng	2d2bf7de1a	graph: fix null dereference in stats In function 'stats_mem_init', pointer 'stats' should be confirmed not null before memset it. Fixes: `af1ae8b6a3` ("graph: implement stats") Cc: stable@dpdk.org Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Reviewed-by: Jerin Jacob <jerinj@marvell.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-06-03 18:35:57 +02:00
Hongbo Zheng	3b47572fbe	graph: fix memory leak in stats Fix function 'stats_mem_populate' return without free dynamic memory referenced by 'stats'. Fixes: `af1ae8b6a3` ("graph: implement stats") Cc: stable@dpdk.org Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Reviewed-by: David Marchand <david.marchand@redhat.com>	2021-06-03 18:35:47 +02:00
Thomas Monjalon	0d025f019c	ethdev: fix comments of packet integrity flow item The Doxygen comments are placed before the related lines, but the markers were /< instead of / The struct rte_flow_item_integrity did not appear in Doxygen output because there was no general comment for the struct. Fixes: `b10a421a1f` ("ethdev: add packet integrity check flow rules") Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru> Acked-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2021-05-19 22:52:03 +02:00
David Marchand	f31ce483bc	vhost: restore IOTLB mempool allocation IOTLB messages will be sent when some queues are not enabled. If we initialize IOTLB in vhost_user_set_vring_num, it could happen that IOTLB update comes when IOTLB pool of disabled queues are not initialized. Fixes: `968bbc7e2e` ("vhost: avoid IOTLB mempool allocation while IOMMU disabled") Signed-off-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Chenbo Xia <chenbo.xia@intel.com>	2021-05-18 09:54:44 +02:00
Balazs Nemeth	3ad55b8e94	vhost: fix stored last used index The optimization introduced by commit `d18db8049c` ("vhost: read last used index once") didn't account for the fact that vhost_flush_enqueue_shadow_packed increments the last_used_idx. For this reason, store last_used_idx after the potential call to vhost_flush_enqueue_shadow_packed. Bugzilla ID: 699 Fixes: `d18db8049c` ("vhost: read last used index once") Signed-off-by: Balazs Nemeth <bnemeth@redhat.com> Reviewed-by: David Marchand <david.marchand@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Tested-by: Wei Ling <weix.ling@intel.com>	2021-05-18 09:43:35 +02:00
Cheng Jiang	35139e648a	vhost: fix sign extension in async packed ring Change the variable type in store_dma_desc_info_packed() to fix suspicious implicit sign extension. Coverity issue: 370608, 370610, 370612 Fixes: `873e8dad6f` ("vhost: support packed ring in async datapath") Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com>	2021-05-12 10:28:18 +02:00
Cheng Jiang	11a7cd8c92	vhost: fix sign extension in async split ring Change the variable type in store_dma_desc_info_split() to fix suspicious implicit sign extension. Coverity issue: 370604, 370607, 370609 Fixes: `3d6cb86b0d` ("vhost: refactor async split ring functions") Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com>	2021-05-12 10:28:08 +02:00
David Marchand	8eff201b00	net/ice: fix leak on thread termination A terminated pthread should be joined or detached so that its associated resources are released. The "ice-reset-<vf_id>" threads are used to service some reset task in the background, but they are never joined by the thread that created them. The easiest solution is to detach new threads. The Windows EAL did not provide a pthread_detach wrapper but there is no resource to release for Windows threads, so add an empty wrapper. Fixes: `3b3757bda3` ("net/ice: get VF hardware index in DCF") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Haiyue Wang <haiyue.wang@intel.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com>	2021-05-11 23:40:22 +02:00
Hongbo Zheng	1fe00fd358	power: fix sanity checks for guest channel read In function power_guest_channel_read_msg, 'lcore_id' is used before validity check, which may cause buffer 'global_fds' accessed by index 'lcore_id' overflow. This patch moves the validity check of 'lcore_id' before the 'lcore_id' being used for the first time. Fixes: `9dc843eb27` ("power: extend guest channel API for reading") Cc: stable@dpdk.org Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Reviewed-by: Reshma Pattan <reshma.pattan@intel.com> Acked-by: David Hunt <david.hunt@intel.com>	2021-05-12 17:18:38 +02:00
Chengwen Feng	cc994d3922	ipc: use monotonic clock Currently, the mp uses gettimeofday() API to get the time, and used as timeout parameter. But the time which gets from gettimeofday() API isn't monotonically increasing. The process may fail if the system time is changed. This fixes it by using clock_gettime() API with monotonic attribution. Fixes: `783b6e5497` ("eal: add synchronous multi-process communication") Fixes: `f05e26051c` ("eal: add IPC asynchronous request") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Acked-by: Morten Brørup <mb@smartsharesystems.com>	2021-05-12 16:49:08 +02:00
Lance Richardson	6beb2d2947	eal: fix memory mapping on 32-bit target For 32-bit targets, size_t is normally a 32-bit type and does not have sufficient range to represent 64-bit offsets that are needed when mapping PCI addresses. Use uint64_t instead. Found when attempting to run 32-bit Linux dpdk-testpmd using VFIO driver: EAL: pci_map_resource(): cannot map resource(63, 0xc0010000, \ 0x200000, 0x20000000000): Invalid argument ((nil)) Fixes: `c4b89ecb64` ("eal: introduce memory management wrappers") Cc: stable@dpdk.org Signed-off-by: Lance Richardson <lance.richardson@broadcom.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2021-05-11 23:01:06 +02:00
David Marchand	1657e1f871	net: fix header include order for FreeBSD Spotted by sparse in OVS build: ../../lib/netdev-dpdk.c: note: in included file (through /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_ip.h, /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h, ...): ../../include/sparse/arpa/inet.h:22:2: error: "Must include <netinet/in.h> before <arpa/inet.h> for FreeBSD support" This is a check enforced by OVS itself. See [1] for some context. 1: https://github.com/openvswitch/ovs/commit/b2befd5bb2db Fixes: `89813a522e` ("net: provide IP-related API on any OS") Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Dmitry Kozlyuk <dmitry.kozliuk@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2021-05-11 15:44:38 +02:00
David Marchand	dc2c712f72	net: add endianness annotations to ethernet headers Spotted by sparse in OVS build: /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:789:27: error: incorrect type in initializer (different base types) /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:789:27: expected unsigned short [usertype] ether_type /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:789:27: got restricted ovs_be16 [usertype] /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:829:25: error: incorrect type in initializer (different base types) /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:829:25: expected unsigned short [usertype] vlan_tci /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:829:25: got restricted ovs_be16 [usertype] /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:830:26: error: incorrect type in initializer (different base types) /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:830:26: expected unsigned short [usertype] eth_proto /home/runner/work/ovs/ovs/dpdk-dir/build/include/rte_flow.h:830:26: got restricted ovs_be16 [usertype] This was not caught before as no code in headers was using those fields. This changed with commit `6f2168b69a` ("ethdev: reuse ethernet header definition in flow item") and commit `a56a262e34` ("ethdev: reuse VLAN header definition in flow item"). Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2021-05-11 15:22:26 +02:00
David Marchand	eeded2044a	log: register with standardized names Let's try to enforce the convention where most drivers use a pmd. logtype with their class reflected in it, and libraries use a lib. logtype. Introduce two new macros: - RTE_LOG_REGISTER_DEFAULT can be used when a single logtype is used in a component. It is associated to the default name provided by the build system, - RTE_LOG_REGISTER_SUFFIX can be used when multiple logtypes are used, and then the passed name is appended to the default name, RTE_LOG_REGISTER is left untouched for existing external users and for components that do not comply with the convention. There is a new Meson variable log_prefix to adapt the default name for baseband (pmd.bb.), bus (no pmd.) and mempool (no pmd.) classes. Note: achieved with below commands + reverted change on net/bonding + edits on crypto/virtio, compress/mlx5, regex/mlx5 $ git grep -l RTE_LOG_REGISTER drivers/ \| while read file; do pattern=${file##drivers/}; class=${pattern%%/}; pattern=${pattern#$class/}; drv=${pattern%%/}; case "$class" in baseband) pattern=pmd.bb.$drv;; bus) pattern=bus.$drv;; mempool) pattern=mempool.$drv;; ) pattern=pmd.$class.$drv;; esac sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern',/RTE_LOG_REGISTER_DEFAULT(\1,/' $file; sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern'\.$.$,/RTE_LOG_REGISTER_SUFFIX(\1, \2,/' $file; done $ git grep -l RTE_LOG_REGISTER lib/ \| while read file; do pattern=${file##lib/}; pattern=lib.${pattern%%/}; sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern',/RTE_LOG_REGISTER_DEFAULT(\1,/' $file; sed -i -e 's/RTE_LOG_REGISTER($.$, '$pattern'\.$.$,/RTE_LOG_REGISTER_SUFFIX(\1, \2,/' $file; done Signed-off-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-05-11 15:17:55 +02:00
Vladimir Medvedkin	be81f77d80	hash: fix tuple adjustment rte_thash_adjust_tuple() uses random to generate a new subtuple if fn() callback reports about collision. In some cases random changes the subtuple in a way that after complementary bits are applied the original tuple is obtained. This patch replaces random with subtuple increment. Fixes: `28ebff11c2` ("hash: add predictable RSS") Cc: vladimir.medvedkin@intel.com Reported-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Vladimir Medvedkin <vladimir.medvedkin@intel.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com> Tested-by: Stanislaw Kardach <kda@semihalf.com> Reviewed-by: Stanislaw Kardach <kda@semihalf.com>	2021-05-10 15:31:42 +02:00
David Marchand	b81bf1efe3	eal: fix leak in shared lib mode detection This is reported by our internal covscan: 1. dpdk-20.11/lib/librte_eal/common/eal_common_options.c:508: alloc_fn: Storage is returned from allocation function "dlopen". 6. dpdk-20.11/lib/librte_eal/common/eal_common_options.c:508: leaked_storage: Failing to save or free storage allocated by "dlopen("librte_eal.so.21.0", 5)" leaks it. # 506\| * shared library is not already loaded i.e. it's # statically linked.) # 507\| / # 508\|-> if (dlopen("librte_eal.so."ABI_VERSION, RTLD_LAZY \| # RTLD_NOLOAD) != NULL && # 509\| default_solib_dir != '\0' && # 510\| stat(default_solib_dir, &sb) == 0 && This leak is not an issue per se, but on the other hand, this is easy to fix and I prefer not having to waive this warning later. Fixes: `06c7871dde` ("eal: restrict default plugin path to shared lib mode") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-05-10 15:31:42 +02:00
David Marchand	627c5b41bb	build: fix default drivers list without Python If no enable_drivers option is passed, the default is to build the drivers list by calling list-dir-globs.py. But if no Python interpreter is installed, no error is reported and all drivers end up being disabled. Example on a minimal FreeBSD VM: dpdk@freebsd:~/dpdk $ meson setup build ... drivers: common/cpt: not in enabled drivers build config common/dpaax: not in enabled drivers build config common/iavf: not in enabled drivers build config common/mvep: not in enabled drivers build config common/octeontx: not in enabled drivers build config common/octeontx2: not in enabled drivers build config bus/dpaa: not in enabled drivers build config bus/fslmc: not in enabled drivers build config ... dpdk@freebsd:~/dpdk $ cd drivers/ dpdk@freebsd:~/dpdk/drivers $ ~/dpdk/buildtools/list-dir-globs.py / env: python3: No such file or directory Rely on meson internal interpreter. Check return code when calling this script. Fixes: `ab9407c3ad` ("build: allow using wildcards to disable drivers") Fixes: `2e33309ebe` ("config: enable/disable drivers in Arm builds") Cc: stable@dpdk.org Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2021-05-07 15:41:45 +02:00
Chengwen Feng	97ca1e786b	eal: fix service core list parsing This patch adds checking for service core index validity when parsing service corelist. Fixes: `7dbd7a6413` ("service: add -S corelist option") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2021-05-05 23:19:23 +02:00

1 2 3 4 5 ...

7260 Commits