numam-dpdk

Author	SHA1	Message	Date
Yongseok Koh	f4efc0eb97	net/mlx4: add control of excessive memory pinning by kernel A new PMD parameter (mr_ext_memseg_en) is added to control extension of memseg when creating a MR. It is enabled by default. If enabled, mlx4_mr_create() tries to maximize the range of MR registration so that the LKey lookup tables on datapath become smalle and get the best performance. However, it may worsen memory utilization because registered memory is pinned by kernel driver. Even if a page in the extended chunk is freed, that doesn't become reusable until the entire memory is freed and the MR is destroyed. To make freed pages available immediately, this parameter has to be turned off but it could drop performance. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	c18cf501a7	net/mlx5: enable secondary process to register DMA memory The Memory Region (MR) for DMA memory can't be created from secondary process due to lib/driver limitation. Whenever it is needed, secondary process can make a request to primary process through the EAL IPC channel (rte_mp_msg) which is established on initialization. Once a MR is created by primary process, it is immediately visible to secondary process because the MR list is global per a device. Thus, secondary process can look up the list after the request is successfully returned. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	dceb502942	net/mlx5: add control of excessive memory pinning by kernel A new PMD parameter (mr_ext_memseg_en) is added to control extension of memseg when creating a MR. It is enabled by default. If enabled, mlx5_mr_create() tries to maximize the range of MR registration so that the LKey lookup tables on datapath become smaller and get the best performance. However, it may worsen memory utilization because registered memory is pinned by kernel driver. Even if a page in the extended chunk is freed, that doesn't become reusable until the entire memory is freed and the MR is destroyed. To make freed pages available immediately, this parameter has to be turned off but it could drop performance. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	207fe7ac72	net/mlx5: fix external memory registration Secondary process is not allowed to register MR due to a restriction of library and kernel driver. Fixes: 7e43a32ee060 ("net/mlx5: support externally allocated static memory") Cc: stable@dpdk.org Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	3d1f3c7c83	net/mlx: remove debug messages on datapath Cc: stable@dpdk.org Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	0203d33a10	net/mlx4: support secondary process In order to support secondary process, a few features are required. a) rdma-core library should allocate device resources using DPDK's memory allocator. b) UAR should be remapped for secondary processes. Currently, in order not to use different data structure for secondary processes, PMD tries to reserve identical virtual address space for both primary and secondary processes. c) IPC channel is necessary, which can be easily set with rte_mp APIs. Through the channel, Verbs command FD is delivered to the secondary process and the device stop/start event is also broadcast from primary process. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	8e49376400	net/mlx4: add external allocator for Verbs object To support secondary process, the memory allocated by library such as completion rings (CQ) and buffer rings (WQ) must be manageable by EAL, in order to share it with secondary processes. With new changes in rdma-core and kernel driver, it is possible to provide an external allocator to the library layer for this purpose. All such resources will now be allocated within DPDK framework. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	099c2c5376	net/mlx4: change device reference for secondary process rte_eth_devices[] is not shared between primary and secondary process, but a static array to each process. The reverse pointer of device (priv->dev) becomes invalid if mlx4 supports secondary process. Instead, priv has the pointer to shared data of the device, struct rte_eth_dev_data *dev_data; Two macros are added, #define PORT_ID(priv) ((priv)->dev_data->port_id) #define ETH_DEV(priv) (&rte_eth_devices[PORT_ID(priv)]) Cc: stable@dpdk.org Suggested-by: Raslan Darawsheh <rasland@mellanox.com> Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	2aac5b5d11	net/mlx5: sync stop/start with secondary process Rx/Tx burst function pointers are stored in the rte_eth_dev structure, which is local to a process. Even though primary process replaces the function pointers, secondary will not run the new ones. With rte_mp APIs, primary can easily broadcast a request to stop/start the datapath of secondary processes. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	7be600c8d8	net/mlx5: rework PMD global data init There's more need to have PMD global data structure. This should be initialized once per a process regardless of how many PMD instances are probed. mlx5_init_once() is called during probing and make sure all the init functions are called once per a process. Currently, such global data and its initialization functions are even scattered. Rather than 'extern'-ing such variables and calling such functions one by one making sure it is called only once by checking the validity of such variables, it will be better to have a global storage to hold such data and a consolidated function having all the initializations. The existing shared memory gets more extensively used for this purpose. As there could be multiple secondary processes, a static storage (local to process) is also added. As the reserved virtual address for UAR remap is a PMD global resource, this doesn't need to be stored in the device priv structure, but in the PMD global data. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	9a8ab29b84	net/mlx5: replace IPC socket with EAL API Socket API is used for IPC in order for secondary process to acquire Verb command file descriptor. The FD is used to remap UAR address. The multi-process APIs (rte_mp) in EAL are newly introduced. mlx5_socket.c is replaced with mlx5_mp.c, which uses the new APIs. As it is PMD global infrastructure, only one IPC channel is established. All the IPC message types may have port_id in the message if there is need to reference a specific device. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	3ebe658059	net/mlx5: fix memory event on secondary process As the memory event is propagated to secondary processes, the event is processed redundantly. This should be processed once because the data structure used for MR and the event is global across the processes. Fixes: 974f1e7ef146 ("net/mlx5: add new memory region support") Cc: stable@dpdk.org Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Dekel Peled	de90612f40	net/mlx5: fix errno typos in comments Correct typing mistake in several locations: ernno ==> errno Fixes: 23c1d42c7138 ("net/mlx5: split flow validation to dedicated function") Cc: stable@dpdk.org Signed-off-by: Dekel Peled <dekelp@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Yongseok Koh	9c55c6bd86	net/mlx5: revert mbuf address calculation for x86 When replenishing mbufs on Rx, buffer address (mbuf->buf_addr) should be loaded. non-x86 processors (mostly RISC such as ARM and Power) are more vulnerable to load stall. For x86, reducing the number of instructions seems to matter most. For x86, this is simply a load but for other architectures, it is calculated from the address of mbuf structure by rte_mbuf_buf_addr() without having to load the first cacheline of the mbuf. Fixes: 12d468a62bc1 ("net/mlx5: fix instruction hotspot on replenishing Rx buffer") Cc: stable@dpdk.org Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-04-05 17:45:22 +02:00
Hemant Agrawal	ccdb58c630	raw/dpaa2_qdma: support non prefetch mode This patch add support for non prefetch mode in Rx functions. Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2019-04-05 10:40:56 +02:00
Nipun Gupta	345c783b2d	raw/dpaa2: remove logs from datapath The runtime traces shall not be present in datapath Signed-off-by: Nipun Gupta <nipun.gupta@nxp.com>	2019-04-05 10:40:56 +02:00
Hemant Agrawal	4d9a3f2a01	raw/dpaa2_qdma: support RBP mode Add support for route by port mode. The route by port feature in HW helps in translating the PCI address of connected device. Signed-off-by: Minghuan Lian <minghuan.lian@nxp.com> Signed-off-by: Sachin Saxena <sachin.saxena@nxp.com> Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2019-04-05 10:40:43 +02:00
Hemant Agrawal	44889767af	raw/dpaa2_qdma: support burst mode This patch adds support the batch processing for the qdma jobs Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com> Signed-off-by: Yi Liu <yi.liu@nxp.com>	2019-04-05 01:05:56 +02:00
Shreyansh Jain	428fe6d4cb	raw/dpaa2_qdma: fix to support multiprocess execution Fixes: c22fab9a6c34 ("raw/dpaa2_qdma: support configuration APIs") Cc: stable@dpdk.org Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2019-04-05 01:05:25 +02:00
Hemant Agrawal	fb1a20331d	raw/dpaa2_qdma: remove experimental tag from APIs These APIs has been in the DPDK for few release now. This patch removes the experimental tags for the APIs. Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2019-04-05 01:04:31 +02:00
Shreyansh Jain	55984a9bb5	net/dpaa2: update MC firmware version for FSLMC bus MC firmware is the core component of FSLMC bus and DPAA2 devices. Prior to this patch, MC firmware supported 10.10.x version. This patch bumps the min supported version to 10.14.x. Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2019-04-04 23:42:15 +02:00
Shreyansh Jain	4eeb036c95	bus/fslmc: cleanup unused firmware code Removes some unused firmware code which was added in last bump of the firmware version. No current features uses these APIs. Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2019-04-04 23:42:15 +02:00
Bruce Richardson	6723c0fc72	replace snprintf with strlcpy Do a global replace of snprintf(..."%s",...) with strlcpy, adding in the rte_string_fns.h header if needed. The function changes in this patch were auto-generated via command: spatch --sp-file devtools/cocci/strlcpy.cocci --dir . --in-place and then the files edited using awk to add in the missing header: gawk -i inplace '/include <rte_/ && ! seen { \ print "#include <rte_string_fns.h>"; seen=1} {print}' Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2019-04-04 22:46:05 +02:00
Bruce Richardson	f9acaf84e9	replace snprintf with strlcpy without adding extra include For files that already have rte_string_fns.h included in them, we can do a straight replacement of snprintf(..."%s",...) with strlcpy. The changes in this patch were auto-generated via command: spatch --sp-file devtools/cocci/strlcpy-with-header.cocci --dir . --in-place Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2019-04-04 22:45:54 +02:00
Bruce Richardson	f4206d1642	net/bonding: fix buffer length when printing strings Using the size of the source string is incorrect when printing using snprintf. Instead pass in the buffer size to be used appropriately. Fixes: 457ecf2953fc ("bond: add debug info for mode 6") Cc: stable@dpdk.org Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2019-04-04 22:41:11 +02:00
David Marchand	27893e4eee	drivers: remove Linux EAL from include path None of those drivers require EAL linux specific headers. Signed-off-by: David Marchand <david.marchand@redhat.com>	2019-04-04 22:06:16 +02:00
Gage Eads	e75bc77f98	mempool/stack: add lock-free stack mempool handler This commit adds support for lock-free (linked list based) stack mempool handler. In mempool_perf_autotest the lock-based stack outperforms the lock-free handler for certain lcore/alloc count/free count combinations, however: - For applications with preemptible pthreads, a standard (lock-based) stack's worst-case performance (i.e. one thread being preempted while holding the spinlock) is much worse than the lock-free stack's. - Using per-thread mempool caches will largely mitigate the performance difference. Test setup: x86_64 build with default config, dual-socket Xeon E5-2699 v4, running on isolcpus cores with a tickless scheduler. The lock-based stack's rate_persec was 0.6x-3.5x the lock-free stack's. Signed-off-by: Gage Eads <gage.eads@intel.com> Reviewed-by: Olivier Matz <olivier.matz@6wind.com>	2019-04-04 22:06:16 +02:00
Gage Eads	734bdeb01c	mempool/stack: use stack library The new rte_stack library is derived from the mempool handler, so this commit removes duplicated code and simplifies the handler by migrating it to this new API. Signed-off-by: Gage Eads <gage.eads@intel.com> Reviewed-by: Olivier Matz <olivier.matz@6wind.com>	2019-04-04 22:06:16 +02:00
Tomasz Cel	bbbc39b2c2	compress/isal: fix getting information about CPU This patch adds query about CPU features Fixes: 53a9baa98c36 ("compress/isal: add basic PMD ops") Cc: stable@dpdk.org Signed-off-by: Tomasz Cel <tomaszx.cel@intel.com> Acked-by: Lee Daly <lee.daly@intel.com>	2019-04-02 16:50:24 +02:00
Tomasz Jozwiak	1e796b11fe	drivers/qat: fix queue pair NUMA node This patch assigns QAT queue pair resources to the correct NUMA nodes. Any DMA'able memory should use NUMA node of QAT device rather than socket_id of the initializing process. Fixes: 98c4a35c736f ("crypto/qat: move common qat files to common dir") Fixes: a795248d740b ("compress/qat: add configure and clear functions") Cc: stable@dpdk.org Signed-off-by: Tomasz Jozwiak <tomaszx.jozwiak@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2019-04-02 16:50:24 +02:00
Lee Daly	1aeb9fdb2e	compress/isal: add appropriate flag on overflow This patch will change the operation status when ISA-L returns because of a recoverable out of space error, rather than a just generic fail. Signed-off-by: Lee Daly <lee.daly@intel.com> Tested-by: Tomasz Cel <tomaszx.cel@intel.com>	2019-04-02 16:50:24 +02:00
Ayuj Verma	378e08eba8	crypto/openssl: set RSA private op feature flag openssl PMD support RSA private key operation using both qt and exp key type. Set rsa key type feature flag Signed-off-by: Ayuj Verma <ayverma@marvell.com> Signed-off-by: Shally Verma <shallyv@marvell.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Akhil Goyal	3b4757fc74	crypto/dpaa2_sec: support multi-process - fle pool allocations should be done for each process. - cryptodev->data is shared across muliple processes but cryptodev itself is allocated for each process. So any information which needs to be shared between processes, should be kept in cryptodev->data. Signed-off-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Akhil Goyal	e621d97000	crypto/dpaa_sec: fix session queue attach/detach session inq and qp are assigned for each core from which the packets arrive. This was not correctly handled while supporting multiple sessions per queue pair. This patch fixes the attach and detach of queues for each core. Fixes: e79416d10fa3 ("crypto/dpaa_sec: support multiple sessions per queue pair") Cc: stable@dpdk.org Signed-off-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Akhil Goyal	07a5efda06	crypto/dpaa2_sec: remove unnecessary flc configurations The removed fields are required in case the SEC block allocates the buffer from bman pool. Signed-off-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Akhil Goyal	7449390bb8	drivers/crypto: update inline desc for sharing mode SEC HW descriptor sharing mode can now be controlled during Session preparation by the respective drivers shared descriptors in case of non-protocol offload does not need any sync between the subsequent jobs. Thus, changing it to SHR_NEVER from SHR_SERIAL for cipher_only, auth_only, and gcm. Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com> Signed-off-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Akhil Goyal	a5e05ab643	crypto/dpaa2_sec: fix offset calculation for GCM In case of gcm, output buffer should have aad space before the actual buffer which needs to be written. CAAM will not write into the aad anything, it will skip auth_only_len (aad) and write the buffer afterwards. Fixes: 37f96eb01bce ("crypto/dpaa2_sec: support scatter gather") Cc: stable@dpdk.org Signed-off-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Akhil Goyal	fd4f22fbd8	crypto/dpaa2_sec: fix session clearing private data should be cleared instead of the complete session Fixes: 8d1f3a5d751b ("crypto/dpaa2_sec: support crypto operation") Cc: stable@dpdk.org Signed-off-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Tomasz Jozwiak	352332744c	compress/qat: add dynamic SGL allocation This patch adds dynamic SGL allocation instead of static one. The number of element in SGL can be adjusted in each operation depend of the request. Signed-off-by: Tomasz Jozwiak <tomaszx.jozwiak@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2019-04-02 16:50:24 +02:00
Fan Zhang	7b2d4706c9	crypto/aesni_mb: support newer library version only As stated in 19.02 deprecation notice, this patch updates the aesni_mb PMD to remove the support of older Intel-ipsec-mb library version earlier than 0.52. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2019-04-02 16:50:24 +02:00
Fan Zhang	2d0c29a37a	crypto/aesni_mb: enable out of place processing Add out-of-place processing, i.e. different source and destination m_bufs, plus related capability update, tests and documentation. Signed-off-by: Fiona Trahe <fiona.trahe@intel.com> Signed-off-by: Paul Luse <paul.e.luse@intel.com> Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2019-04-02 16:50:24 +02:00
Arek Kusztal	8245972c04	crypto/qat: add modular multiplicative inverse This commit adds modular multiplicative inverse to Intel QuickAssist Technology driver. For capabilities or limitations please refer to qat.rst or qat_asym_capabilities.h. Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2019-04-02 16:50:24 +02:00
Arek Kusztal	fb70b33b05	crypto/qat: add modular exponentiation This commit adds modular exponentiation to Intel QuickAssist Technology driver. For capabilities or limitations please refer to qat.rst or qat_asym_capabilities.h. Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2019-04-02 16:50:24 +02:00
Arek Kusztal	f81cbc208f	crypto/qat: add asymmetric crypto PMD This patch adds Poll Mode Driver for asymmetric crypto functions of Intel QuickAssist Technology hardware. It contains plain driver with no functions implemented, specific algorithms will be introduced in separate patches. This patch depends on a QAT PF driver for device initialization. See the file docs/guides/cryptodevs/qat.rst for configuration details. Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2019-04-02 16:50:24 +02:00
Arek Kusztal	0adc033f58	common/qat: add headers for asymmetric crypto This commit adds headers to be used in conjunction with asymmetric cryptography operations using Intel QuickAssist Technology driver Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2019-04-02 16:50:24 +02:00
Tomasz Cel	ae1374e240	compress/isal: fix compression stream initialization This patch fixes ISAL internal state fields initialization. Fixes: dc49e6aa4879 ("compress/isal: add ISA-L compression functionality") Cc: stable@dpdk.org Signed-off-by: Tomasz Cel <tomaszx.cel@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2019-04-02 16:50:24 +02:00
Harry van Haaren	1b03e29291	event/sw: fix enqueue checks in self-test This patch fixes a number of instances of the same return value mis-check, where previously we checked for a negative return value as error, however the API returns an unsigned integer, so these return value checks are invalid. The rte_event_enqueue_burst() API returns the number of events enqueued, so in order to identify the error case, we must check for != the number of intended enqueues. Fixes: cd1a9e3eab55 ("test/eventdev: add SW tests for load balancing") Cc: stable@dpdk.org Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com>	2019-04-02 03:10:47 +02:00
Anand Rawat	fa647c5722	build: add workarounds for Windows helloworld Added meson workarounds to build helloworld on Windows. Windows currently only supports kvargs and eal libraries. This change restricts the build flow to supported libraries only. Signed-off-by: Anand Rawat <anand.rawat@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:21:31 +02:00
Nemanja Marjanovic	d08b6845e4	net/softnic: support QinQ PPPoE encapsulation Add implementation of QinQ PPPoE packet encapsulation action. Signed-off-by: Nemanja Marjanovic <nemanja.marjanovic@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2019-03-29 20:54:36 +01:00
Ian Stokes	ae35fd61fc	net/e1000: set min and max MTU This commit sets the min and max supported MTU values for igb devices via the eth_igb_info_get() function. Min MTU supported is set to ETHER_MIN_MTU and max MTU is calculated as the max packet length supported minus the transport overhead. To aid in these calculations a new MACRO 'E1000_ETH_OVERHEAD' has been introduced to consolidate overhead calculation and avoid duplication. Signed-off-by: Ian Stokes <ian.stokes@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2019-03-29 19:00:35 +01:00

1 2 3 4 5 ...

8030 Commits