numam-dpdk

Author	SHA1	Message	Date
Konstantin Ananyev	8d9c2c3a1f	mbuf: add function to generate raw Tx offload value Operations to set/update bit-fields often cause compilers to generate suboptimal code. To help avoid such situation for tx_offload fields: introduce new enum for tx_offload bit-fields lengths and offsets, and new function to generate raw tx_offload value. Add new test-case into UT for introduced function. Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2019-04-02 16:50:24 +02:00
Ayuj Verma	398ba4c13f	cryptodev: add RSA private key feature flag Add feature flag to reflect RSA private key operation support using quintuple (crt) or exponent type key. if PMD support both, then it should set both. App should query cryptodev feature flag to check if Sign and Decryt with CRT keys or exponent is supported, thus call operation with relevant key type. Signed-off-by: Ayuj Verma <ayverma@marvell.com> Signed-off-by: Shally Verma <shallyv@marvell.com> Acked-by: Akhil Goyal <akhil.goyal@nxp.com>	2019-04-02 16:50:24 +02:00
Pavan Nikhilesh	d69d085883	eventdev: check timer adapter status before start Check if timer adapter is already started before starting it. Update the unit test accordingly. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com> Acked-by: Erik Gabriel Carrillo <erik.g.carrillo@intel.com>	2019-04-02 03:10:40 +02:00
Jerin Jacob	4b3997680a	eal: allow to override init macros per OS baremetal execution environments may have a different method to enable RTE_INIT instead of using compiler constructor and/or OS specific linker scheme. Allow an option to override RTE_INIT* macros using rte_os.h or appropriate header file. Signed-off-by: Jerin Jacob <jerinj@marvell.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2019-04-03 23:52:00 +02:00
Gage Eads	640c5f09ef	eal/x86: add 128-bit atomic compare exchange This operation can be used for non-blocking algorithms, such as a non-blocking stack or ring. It is available only for x86_64. Signed-off-by: Gage Eads <gage.eads@intel.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2019-04-03 21:59:46 +02:00
Dharmik Thakkar	f401363d98	hash: support lock-free extendable bucket This patch enables lock-free read-write concurrency support for extendable bucket feature. Suggested-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Signed-off-by: Dharmik Thakkar <dharmik.thakkar@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Acked-by: Yipeng Wang <yipeng1.wang@intel.com>	2019-04-03 20:52:35 +02:00
Shahaf Shuler	237060c4ad	mem: limit use of address hint The commit below added an address hint as starting address for 64-bit systems in case an explicit base virtual address was not set by the user. The justification for such hint was to help devices that work in VA mode and has a address range limitation to work smoothly with the eal memory subsystem. While the base address value selected may work fine for the eal initialization, it easily breaks when trying to register external memory using rte_extmem_register API. Trying to register anonymous memory on RH x86_64 machine took several minutes, during them the function eal_get_virtual_area repeatedly scanned for a good VA candidate. The attempt to guess which VA address will be free for mapping will always result in not portable, error prone code: * different application may use different libraries along w/ DPDK. One can never guess which library was called first and how much virtual memory it consumed. * external memory can be registered at any time in the application run time. In order not to break the existing secondary process design, this patch only limits the max number of tries that will be done with the address hint. When the number of tries exceeds the threshold the code will use the suggested address from kernel. Fixes: `1df2170287` ("mem: use address hint for mapping hugepages") Cc: stable@dpdk.org Signed-off-by: Shahaf Shuler <shahafs@mellanox.com> Tested-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Alejandro Lucero <alejandro.lucero@netronome.com>	2019-04-03 19:10:47 +02:00
Stephen Hemminger	c0c055f5f7	ring: fix an error message Log message should end with newline. Fixes: `4e32101f9b` ("ring: support freeing") Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Reviewed-by: Rami Rosen <ramirose@gmail.com>	2019-04-03 18:39:16 +02:00
Stephen Hemminger	d0885cb781	eal: align hexdump output This fixes the issue where if the length of the output is not a multiple of 16 the formatting was off. Before: 00000000: 45 00 00 1C 12 34 2C E0 40 06 B8 2E C0 A8 01 12 \| E....4,.@....... 00000010: C0 A8 01 37 \| \| \| \| \| \| \| \| \| \| \| \| \| ...7 After: 00000000: 45 00 00 1C 12 34 2C E0 40 06 B8 2E C0 A8 01 12 \| E....4,.@....... 00000010: C0 A8 01 37 \| ...7 Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2019-04-03 18:34:59 +02:00
Stephen Hemminger	779d9d0986	eal: clean formatting of hexdump functions The hexdump code obviously came from somewhere else originally. It is not formatted according to DPDK coding style. Also, drop the comment which is not useful the docbock comment is already in the rte_hexdump.h Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2019-04-03 18:32:42 +02:00
Stephen Hemminger	6d96b48af8	eal: make u64 reciprocal divisor const The divisor is not modified here. Doesn't really matter for optimizaton since the function is inline already; but helps with expressing intent. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2019-04-03 18:32:41 +02:00
Anand Rawat	fa647c5722	build: add workarounds for Windows helloworld Added meson workarounds to build helloworld on Windows. Windows currently only supports kvargs and eal libraries. This change restricts the build flow to supported libraries only. Signed-off-by: Anand Rawat <anand.rawat@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:21:31 +02:00
Anand Rawat	53ffd9f080	eal/windows: add minimum viable code Add Windows specific logic for eal.c, eal_lcore.c, eal_debug.c and eal_thread.c. Updated header files to contain suitable function declarations. Signed-off-by: Anand Rawat <anand.rawat@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:21:31 +02:00
Anand Rawat	4dc2b4d2a4	eal/windows: add headers for compatibility Added headers to support Windows environment for common source. These headers will have Windows specific implementions of the system library APIs provided in Linux and FreeBSD. Signed-off-by: Anand Rawat <anand.rawat@intel.com> Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:21:31 +02:00
Anand Rawat	846ff907ee	eal/windows: add sys/queue.h implementation copy Adding sys/queue.h on Windows for supporting common code. This implementation has BSD-3-Clause licensing. Signed-off-by: Ranjit Menon <ranjit.menon@intel.com> Signed-off-by: Anand Rawat <anand.rawat@intel.com> Reviewed-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:21:31 +02:00
Anand Rawat	82ba4416dd	build: add module definition files for Windows Updated lib/meson.build to create shared libraries on Windows. Added DEF files to list the exports for the eal and kvargs libraries. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Anand Rawat <anand.rawat@intel.com> Reviewed-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:21:31 +02:00
Anand Rawat	58836e93f5	eal/windows: add wrappers for string functions Updated rte_common.h to include rte_os.h to contain OS specific macros and functions. Updated rte_string_fns.h to include rte_common.h for rte_os.h Signed-off-by: Anand Rawat <anand.rawat@intel.com> Reviewed-by: Pallavi Kadam <pallavi.kadam@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:21:15 +02:00
Anand Rawat	428eb983f5	eal: add OS specific header file Added rte_os.h files to support OS specific functionality. Updated build system to contain OS headers in the include path. Signed-off-by: Anand Rawat <anand.rawat@intel.com> Reviewed-by: Pallavi Kadam <pallavi.kadam@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2019-04-03 01:11:56 +02:00
Anand Rawat	98edcbb5ab	eal/windows: introduce Windows support Added initial stub source files and required meson changes for Windows support. kernel/windows/meson is a stub file added to support Windows specific source in future releases. Signed-off-by: Pallavi Kadam <pallavi.kadam@intel.com> Signed-off-by: Anand Rawat <anand.rawat@intel.com> Reviewed-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Reviewed-by: Ranjit Menon <ranjit.menon@intel.com> Acked-by: Harini Ramakrishnan <harini.ramakrishnan@microsoft.com>	2019-04-03 01:06:01 +02:00
Thomas Monjalon	3c45889189	eal: remove exec-env directory Only one header file (rte_kni_common.h) was in the sub-directory include/exec-env/ This file was installed in a sub-directory of the same name in the makefile-based build. Source and install directories are moved as below: lib/librte_eal/linux/eal/include/exec-env/ -> lib/librte_eal/linux/eal/include/ build/include/exec-env/ -> build/include/ The consequence is to have a file hierarchy a bit more flat. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: David Marchand <david.marchand@redhat.com> Tested-by: David Marchand <david.marchand@redhat.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2019-04-02 21:49:35 +02:00
Andrew Rybchenko	dfc6b2fd8d	mbuf: remove Intel offload checks from generic API rte_validate_tx_offload() is used in Tx prepare callbacks (RTE_LIBRTE_ETHDEV_DEBUG only) to check Tx offloads consistency. Requirement that packet headers should not be fragmented is not documented and unclear where it comes from except rte_net_intel_cksum_prepare() functions which relies on it. It could be NIC vendor specific driver or hardware limitation, but, if so, it should be documented and checked in corresponding Tx prepare callbacks. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2019-04-02 16:42:52 +02:00
Anatoly Burakov	1e3380a2f4	mem: do not use lockfiles for single file segments mode Due to internal glibc limitations [1], DPDK may exhaust internal file descriptor limits when using smaller page sizes, which results in inability to use system calls such as select() by user applications. Single file segments option stores lock files per page to ensure that pages are deleted when there are no more users, however this is not necessary because the processes will be holding onto the pages anyway because of mmap(). Thus, removing pages from the filesystem is safe even though they may be used by some other secondary process. As a result, single file segments mode no longer stores inordinate amounts of segment fd's, and the above issue with fd limits is solved. However, this will not work for legacy mem mode. For that, simply document that using bigger page sizes is the only option. [1] https://mails.dpdk.org/archives/dev/2019-February/124386.html Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-04-02 16:07:25 +02:00
Anatoly Burakov	848cbff836	mem: refactor segment resizing function Currently, segment resizing code sits in one giant function which handles both in-memory and regular modes. Split them up into individual functions. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-04-02 16:07:13 +02:00
Darek Stojaczyk	ea4e3ab7bd	eal: initialize alarms early On Linux, we currently initialize rte_alarms after starting to listen for IPC hotplug requests, which gives us a data race window. Upon receiving such hotplug request we always try to set an alarm and this obviously doesn't work if the alarms weren't initialized yet. To fix it, we initialize alarms before starting to listen for IPC hotplug messages. Specifically, we move rte_eal_alarm_init() right after rte_eal_intr_init() as it makes some sense to keep those two close to each other. We update the BSD code as well to keep the initialization order the same in both EAL implementations. Fixes: `244d513071` ("eal: enable hotplug on multi-process") Cc: stable@dpdk.org Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-04-02 15:00:26 +02:00
Pavan Nikhilesh	e840cb3c2a	eal: increase max number of interrupt vectors MSI-X permits a device to allocate up to 2048 interrupts as per PCIe spec. Increase the max number of vectors to a reasonable value of 512. Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com>	2019-04-02 02:59:04 +02:00
Bruce Richardson	c12cefa379	telemetry: fix mapping of statistics If we have two NIC ports which have a different set of NIC stats we can end up having two different stats registered with xstats with the same name. [Since the stats are updated in bulk as a contiguous set, the second driver re-using the registration of the first is not possible.] This causes issues with the invalid stat for one driver being found due to a lookup by name which is unnecessary. Instead of getting stat names involved do the lookup by ID instead. Fixes: `1b756087db` ("telemetry: add parser for client socket messages") Cc: stable@dpdk.org Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Kevin Laatz <kevin.laatz@intel.com>	2019-04-02 02:30:53 +02:00
David Hunt	b60fd5f8b1	power: add bit for high frequency cores This patch adds a new bit in the capabilities mask that's returned by rte_power_get_capabilities(), allowing application to query which cores have the higher frequencies, and can then pin the workloads accordingly. Returned Bits: 0 - Turbo Boost enabled 1 - Higher core base_frequency Signed-off-by: Liang Ma <liang.j.ma@intel.com> Signed-off-by: David Hunt <david.hunt@intel.com> Reviewed-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-04-02 02:22:08 +02:00
David Hunt	08a710642d	power: fix governor storage to trim newlines Currently the Power Libray stores the governor name with an embedded newline read from the scaling_governor sysfs file. This patch strips it out. Fixes: `445c6528b5` ("power: common interface for guest and host") Cc: stable@dpdk.org Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-04-01 22:23:44 +02:00
Nemanja Marjanovic	a2fe4c1b38	net: add PPPoE ethertypes Add PPPoE ethertypes in to rte_ether.h. Signed-off-by: Nemanja Marjanovic <nemanja.marjanovic@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2019-03-29 20:55:25 +01:00
Nemanja Marjanovic	8692f6591e	pipeline: support QinQ PPPoE encapsulation Add support of QinQ PPPoE packet encapsulation action. Signed-off-by: Nemanja Marjanovic <nemanja.marjanovic@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2019-03-29 20:52:09 +01:00
Eelco Chaudron	ebe3a76991	meter: fix divide by zero for RFC4115 RFC 4115 allows a meter with either cir and/or eir configured. When only one is configured a divide by zero would occur. Fixes: `655796d2b5` ("meter: support RFC4115 trTCM") Signed-off-by: Eelco Chaudron <echaudro@redhat.com>	2019-03-29 20:51:25 +01:00
Stephen Hemminger	ad97ceece1	ethdev: add min/max MTU to device info This addresses the usability issue raised by OVS at DPDK Userspace summit. It adds general min/max MTU into device info. For compatibility, and to save space, it fits in a hole in existing structure. The initial version sets max MTU to normal Ethernet, it is up to PMD to set larger value if it supports Jumbo frames. Also remove the deprecation notice introduced in 18.11 regarding this change and bump ethdev ABI version. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Ian Stokes <ian.stokes@intel.com> Acked-by: Andrew Rybchenko <arybchenko@solarflare.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2019-03-29 18:57:42 +01:00
Fan Zhang	bc5560c15e	vhost/crypto: fix parens Coverity issue: 277214, 277220, 277233, 277236 Fixes: `cd1e8f03ab` ("vhost/crypto: fix packet copy in chaining mode") Cc: stable@dpdk.org Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2019-03-29 17:25:32 +01:00
Rami Rosen	a15b7a0e53	ethdev: fix a typo This patch fixes a trivial typo in rte_ethdev.h. retieve=>retrieve Fixes: `80a1deb4c7` ("ethdev: add API to retrieve queue information") Cc: stable@dpdk.org Signed-off-by: Rami Rosen <ramirose@gmail.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2019-03-29 17:25:31 +01:00
Natanael Copa	c2d82896ac	eal/linux: remove thread ID from debug message There is no guarantee that pthread_self() returns the thread ID or that pthread_t is an integer. The thread ID is not that useful so simply remove it. This fixes the following warning when building with musl libc: lib/librte_eal/linuxapp/eal/eal_dev.c: In function 'sigbus_handler': lib/librte_eal/linuxapp/eal/eal_dev.c:70:3: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast] (int)pthread_self(), info->si_addr); ^ Fixes: `0fc54536b1` ("eal: add failure handling for hot-unplug") Cc: stable@dpdk.org Signed-off-by: Natanael Copa <ncopa@alpinelinux.org>	2019-03-31 01:01:28 +01:00
Shahaf Shuler	c33a675b62	bus: introduce device level DMA memory mapping The DPDK APIs expose 3 different modes to work with memory used for DMA: 1. Use the DPDK owned memory (backed by the DPDK provided hugepages). This memory is allocated by the DPDK libraries, included in the DPDK memory system (memseg lists) and automatically DMA mapped by the DPDK layers. 2. Use memory allocated by the user and register to the DPDK memory systems. Upon registration of memory, the DPDK layers will DMA map it to all needed devices. After registration, allocation of this memory will be done with rte_malloc APIs. 3. Use memory allocated by the user and not registered to the DPDK memory system. This is for users who wants to have tight control on this memory (e.g. avoid the rte_malloc header). The user should create a memory, register it through rte_extmem_register API, and call DMA map function in order to register such memory to the different devices. The scope of the patch focus on #3 above. Currently the only way to map external memory is through VFIO (rte_vfio_dma_map). While VFIO is common, there are other vendors which use different ways to map memory (e.g. Mellanox and NXP). The work in this patch moves the DMA mapping to vendor agnostic APIs. Device level DMA map and unmap APIs were added. Implementation of those APIs was done currently only for PCI devices. For PCI bus devices, the pci driver can expose its own map and unmap functions to be used for the mapping. In case the driver doesn't provide any, the memory will be mapped, if possible, to IOMMU through VFIO APIs. Application usage with those APIs is quite simple: allocate memory * call rte_extmem_register on the memory chunk. * take a device, and query its rte_device. * call the device specific mapping function for this device. Future work will deprecate the rte_vfio_dma_map and rte_vfio_dma_unmap APIs, leaving the rte device APIs as the preferred option for the user. Signed-off-by: Shahaf Shuler <shahafs@mellanox.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Gaetan Rivet <gaetan.rivet@6wind.com>	2019-03-30 16:48:56 +01:00
Shahaf Shuler	0cbce3a167	vfio: skip DMA map failure if already mapped Currently vfio DMA map function will fail in case the same memory segment is mapped twice. This is too strict, as this is not an error to map the same memory twice. Instead, use the kernel return value to detect such state and have the DMA function to return as successful. For type1 mapping the kernel driver returns EEXISTS. For spapr mapping EBUSY is returned since kernel 4.10. Signed-off-by: Shahaf Shuler <shahafs@mellanox.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Gaetan Rivet <gaetan.rivet@6wind.com>	2019-03-30 16:48:55 +01:00
Shahaf Shuler	4106d89a18	vfio: allow DMA map to the default container Enable users the option to call rte_vfio_dma_map with request to map to the default vfio fd. Signed-off-by: Shahaf Shuler <shahafs@mellanox.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Gaetan Rivet <gaetan.rivet@6wind.com>	2019-03-30 16:47:54 +01:00
Anatoly Burakov	23d5455517	mem: warn user when running without NUMA support Running in non-legacy mode on a NUMA-enabled system without libnuma is unsupported, so explicitly print out a warning when trying to do so. Running in legacy mode without libnuma is still supported whether or not we are running with libnuma support enabled, so also fix init to allow that scenario. Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-30 00:13:04 +01:00
Kevin Traynor	e1e4dafbc7	power: fix frequency list buffer validation The frequency list buffer was already validated in power_acpi_cpufreq_freqs(), so the newly added check was redundant. To keep consistency with power_pstate_cpufreq_freqs(), remove the original check and update the log message. Fixes: `2e6ccdb4e0` ("power: fix frequency list to handle null buffer") Cc: stable@dpdk.org Signed-off-by: Kevin Traynor <ktraynor@redhat.com>	2019-03-29 14:58:27 +01:00
Anatoly Burakov	3660216ef1	malloc: fix IPC message initialization The memset size for an IPC message is set incorrectly. Fix it to cover the entire IPC message. Fixes: `07dcbfe010` ("malloc: support multiprocess memory hotplug") Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-29 12:55:07 +01:00
Anatoly Burakov	b8a86c83e0	fbarray: fix init unlock without lock Certain failure paths of rte_fbarray_init() will unlock the mem area lock without locking it first. Fix this by properly handling the failures. Fixes: `5b61c62cfd` ("fbarray: add internal tailq for mapped areas") Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-29 12:49:35 +01:00
Darek Stojaczyk	5a98bc5e83	fbarray: fix attach deadlock rte_fbarray_attach() currently locks its internal spinlock, but never releases it. Secondary processes won't even start if there is more than one fbarray to be attached to - the second rte_fbarray_attach() would be just stuck. Fix it by releasing the lock at the end of rte_fbarray_attach(). I believe this was the original intention. Fixes: `5b61c62cfd` ("fbarray: add internal tailq for mapped areas") Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Gavin Hu <gavin.hu@arm.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-29 12:49:35 +01:00
Anatoly Burakov	1fd3bcf3f9	vfio: document multiprocess limitation for container API Currently, there is no support for sharing custom VFIO containers between multiple processes, but it is not documented. Document this limitation. Cc: stable@dpdk.org Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-29 00:07:16 +01:00
Thomas Monjalon	3a1a885e03	eal: remove redundant atomic API description Atomic functions are described in doxygen of the file lib/librte_eal/common/include/generic/rte_atomic.h The copies in arch-specific files are redundant and confuse readers about the genericity of the API. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Shahaf Shuler <shahafs@mellanox.com>	2019-03-28 23:52:53 +01:00
Dekel Peled	8015c5593a	eal/ppc: fix global memory barrier From previous patch description: "to improve performance on PPC64, use light weight sync instruction instead of sync instruction." Excerpt from IBM doc [1], section "Memory barrier instructions": "The second form of the sync instruction is light-weight sync, or lwsync. This form is used to control ordering for storage accesses to system memory only. It does not create a memory barrier for accesses to device memory." This patch removes the use of lwsync, so calls to rte_wmb() and rte_rmb() will provide correct memory barrier to ensure order of accesses to system memory and device memory. [1] https://www.ibm.com/developerworks/systems/articles/powerpc.html Fixes: `d23a6bd04d` ("eal/ppc: fix memory barrier for IBM POWER") Cc: stable@dpdk.org Signed-off-by: Dekel Peled <dekelp@mellanox.com>	2019-03-28 23:48:28 +01:00
Michał Mirosław	a1c6b70786	mem: count overcommit hugepages as available With nr_overcommit_hugepages > 0 application may be able to allocate hugepages even when free_hugepages == 0. Take this into account when counting available hugepages. Signed-off-by: Michał Mirosław <michal.miroslaw@atendesoftware.pl> Reviewed-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-28 23:33:50 +01:00
Anatoly Burakov	034f1fb616	mem: attempt multiple hugepage allocations at init When requesting memory with ``-m`` or ``--socket-mem`` flags, currently the init will fail if the requested memory amount was bigger than any one memseg list, even if total amount of available memory was sufficient. Fix this by making EAL to attempt to allocate pages multiple times, until we either fulfill our memory requirements, or run out of hugepages to allocate. Bugzilla ID: 95 Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-28 23:28:58 +01:00
Anatoly Burakov	bec5625588	mem: improve best-effort allocation Previously, when using non-exact allocation, we were requesting N pages to be allocated, but allowed the memory subsystem to allocate less than requested. However, we were still expecting to see N contigous free pages in the memseg list. This presents a problem because there is no way to try and allocate as many pages as possible, even if there isn't enough contiguous free entries in the list. To address this, use the new "find biggest" fbarray API's when allocating non-exact number of pages. This way, we will first check how many entries in the list are actually available, and then try to allocate up to that number. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-28 23:28:54 +01:00
Anatoly Burakov	7353ee7344	fbarray: add API to find biggest used or free chunks Currently, while there is a way to find total amount of used/free space in an fbarray, there is no way to find biggest contiguous chunk. Add such API, as well as unit tests to test this API. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2019-03-28 23:28:52 +01:00

1 2 3 4 5 ...

5241 Commits