numam-dpdk

Author	SHA1	Message	Date
Aaron Conole	10f6c93cea	eal: do not panic on PCI failures Some devices may be inaccessible for a variety of reasons, or the PCI-bus may be unavailable causing the whole thing to fail. Still, better to continue attempts at probes. Since PCI isn't necessarily required, it may be possible to simply log the error and continue on letting the user check the logs and restart the application when things have failed. This will usually be an issue because of permissions. However, it could also be caused by OOM. In either case, errno will contain the underlying cause. For linux, it is safe to re-init the system here, so allow the application to take corrective action and reinit. For BSD, this is not the case, for other reasons, including hugepage allocation has already happened, and needs to be properly uninitialized. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:58:00 +02:00
Aaron Conole	4fe1d33987	eal: do not panic if plugins fail to init Plugins are useful and important. However, it seems crazy to abort everything just because they don't initialize properly. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:57:13 +02:00
Aaron Conole	c050e5abae	eal: do not panic on interrupt thread init There could be some confusion as to why the call failed - this change will always reflect the value of the error in rte_error. When initializing the interrupt thread, there are a number of possible reasons for failure - some of which are correctable by the application. Do not panic() needlessly, and give the application a chance to reflect this information to the user. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:56:59 +02:00
Aaron Conole	330bed86d3	eal: do not panic on timer init failure After code inspection, there is no way for eal_timer_init() to fail. It simply returns 0 in all cases. As such, this test could either go-away or stay here as 'future-proofing'. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:55:49 +02:00
Aaron Conole	7d5c430f69	eal: do not panic on a number of conditions When log initialization fails, it's generally because the fopencookie failed. While this is rare in practice, it could happen, and it is likely because of memory pressure. So, flag the error, and allow the user to retry. Memory init can only fail when access to hugepages (either as primary or secondary process) fails (and that is usually permissions). Since the manner of failure is not reversible, we cannot allow retry. There are some theoretical racy conditions in the system that _could_ cause early tailq init to fail; however, no need to panic the application. While it can't continue using DPDK, it could make better alerts to the user. rte_eal_alarm_init() call uses the linux timerfd framework to create a poll()-able timer using standard posix file operations. This could fail for a few reasons given in the man-pages, but many could be corrected by the user application. No need to panic. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:54:49 +02:00
Aaron Conole	8f113d9818	eal: set errno when exiting for already initialized Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:53:46 +02:00
Aaron Conole	ce3bede01e	eal: do not panic on memzone init failure When memzone initialization fails, report the error to the calling application rather than panic(). Without a good way of detaching / releasing hugepages, at this point the application will have to restart. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:53:06 +02:00
Aaron Conole	a0222a4679	eal: do not panic on argument parsing error It's possible that the application could take a corrective action here, and either prompt the user for different arguments, or at least perform a better logging. Exiting this early prevents any useful information gathering from the application layer. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:52:08 +02:00
Aaron Conole	547a61af71	eal: do not panic on hugepage info init When attempting to scan hugepages, signal to the eal that an error has occurred, rather than performing a panic. If we fail to acquire hugepage information, simply signal an error to the application. This clears the run_once counter, allowing the user or application to take a corrective action and retry. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:50:37 +02:00
Aaron Conole	37e97ad2c5	eal: do not panic when CPU is not supported This adds a new API to check for the eal cpu versions. It's now possible to gracefully exit the application, or for applications which support non-dpdk datapaths working in concert with DPDK datapaths, there no longer is the possibility of exiting for unsupported CPUs. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:50:09 +02:00
Aaron Conole	647644e51f	eal: do not panic on CPU detection There may be no way to gracefully recover, but the application should be notified that a failure happened, rather than completely aborting. This allows the user to proceed with a "slow-path" type solution. After this change, the EAL CPU NUMA node resolution step can no longer emit an rte_panic. This aligns with the code in rte_eal_init, which expects failures to return an error code. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:47:10 +02:00
Ben Walker	24a5357968	pci: fix device registration on FreeBSD The FreeBSD implementation wasn't registering new devices with the device framework on start up. However, common code attempts to unregister them on shutdown which causes a SEGFAULT. This fix makes the FreeBSD code do the same thing as the Linux code for registration. Fixes: 13a1317d3ba7 ("pci: create device list and fallback on its members") Cc: stable@dpdk.org Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-03-27 12:07:53 +02:00
Vladyslav Buslov	d89a5bce1d	lpm6: extend next hop field This patch extends the next_hop field from 8 bits to 21 bits in the LPM library for IPv6. Added versioning symbols to functions and updated library and applications that have a dependency on LPM library. Signed-off-by: Vladyslav Buslov <vladyslav.buslov@harmonicinc.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-15 18:49:41 +01:00
Matt Peters	b61befb48c	igb_uio: support devices with only I/O BAR Allow the BAR setup to succeed if a device has at least 1 BAR region defined. Previously, the device probe would only succeed if at least one memory BAR existed, but there are devices that have only port I/O BARs. For example, on Virtual Box a virtio device has only a single I/O BAR because by default MSI-X is not enabled. While in qemu/kvm the virtio device has MSI-X enabled and therefore has both an I/O and Memory BAR. The following are excerpts from "lspci -nnvvvv -s 00:09.0" on both types of systems. Virtual Box: Region 0: I/O ports at d260 [size=32] Capabilities: [80] #00 [0000] QEMU/KVM: Region 0: I/O ports at c060 [size=32] Region 1: Memory at febd1000 (32-bit, non-prefetchable) [size=4K] Expansion ROM at feb80000 [disabled] [size=256K] Capabilities: [40] MSI-X: Enable+ Count=3 Masked- Vector table: BAR=1 offset=00000000 PBA: BAR=1 offset=00000800 Signed-off-by: Matt Peters <matt.peters@windriver.com> Signed-off-by: Allain Legacy <allain.legacy@windriver.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-03-15 14:02:41 +01:00
Hemant Agrawal	5a11168d9b	mbuf: use pktmbuf helper to create the pool When possible, replace the uses of rte_mempool_create() with the helper provided in librte_mbuf: rte_pktmbuf_pool_create(). This is the preferred way to create a mbuf pool. This also updates the documentation. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-15 13:48:02 +01:00
Thomas Monjalon	31123211bd	remove unmaintained TILE-Gx architecture The TILE-Gx architecture and its driver mpipe are not maintained. The code is removed to avoid confusion. A last update has been done in 17.05 before removal. It can be built with the updated toolchain: http://www.mellanox.com/repository/solutions/tile-scm/ and libgxio: http://www.mellanox.com/repository/solutions/tile-scm/libgxio-1.0.tar.xz Quote from http://dpdk.org/ml/archives/dev/2017-February/057940.html " Mellanox agrees to remove TILE-Gx support from DPDK.org, but will continue to support customers using DPDK. Customer that needs support should contact Mellanox directly. " Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-03-15 11:40:57 +01:00
Olivier Matz	0ef850c4f6	ethdev: move a queue id check to generic layer The check of queue_id is done in all drivers implementing rte_eth_rx_queue_count(). Factorize this check in the generic function. Note that the nfp driver was doing the check differently, which could induce crashes if the queue index was too big. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-09 19:29:51 +01:00
Olivier Matz	44e93f4a34	ethdev: clarify API comments of Rx queue count The API comments are not consistent between each other. The function rte_eth_rx_queue_count() returns the number of used descriptors on a receive queue. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-03-09 19:27:40 +01:00
Gowrishankar Muthukrishnan	0fe9830b53	eal/ppc: support sPAPR IOMMU for vfio-pci The changes below add PCI probing support for vfio-pci devices on POWER8. Signed-off-by: Gowrishankar Muthukrishnan <gowrishankar.m@linux.vnet.ibm.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chao Zhu <chaozhu@linux.vnet.ibm.com>	2017-03-09 18:39:45 +01:00
Ben Walker	cdc242f260	eal/linux: support running as unprivileged user For Linux kernel 4.0 and newer, the ability to obtain physical page frame numbers for unprivileged users from /proc/self/pagemap was removed. Instead, when an IOMMU is present, simply choose our own DMA addresses instead. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com>	2017-03-09 17:08:46 +01:00
Bruce Richardson	03437f2947	ring: add a function to return the ring size Applications and other libraries should not be reading inside the rte_ring structure directly to get the ring size. Instead add a fn to allow it to be queried. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-08 16:05:19 +01:00
Jan Blunck	b2fba63690	eal: ensure constness of container_of target This adds a check to ensure that the container_of() macro is not used to cast away (remove) constness. Signed-off-by: Jan Blunck <jblunck@infradead.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-08 14:04:29 +01:00
Jan Blunck	7cfd280578	eal: fix container_of macro for const members This fixes the usage of structure members that are declared const to get a pointer to the embedding parent structure. Signed-off-by: Jan Blunck <jblunck@infradead.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-08 13:48:36 +01:00
Chris Metcalf	dd0eedb1cf	tile: fix build Re-enable CONFIG_RTE_LIBRTE_SCHED, since it is needed to build correctly. Fix a few warnings when compiling mpipe_tilegx.c. Remove an empty rte_cpu_feature_table[] array using a bogus type. Properly set RTE_OBJCOPY_{TARGET,ARCH} in mk/arch/tile/rte.vars.mk. Signed-off-by: Chris Metcalf <cmetcalf@mellanox.com>	2017-02-27 16:44:32 +01:00
Chris Metcalf	f80468b680	eal/tile: avoid use of non-upstreamed header It's trivial to directly invoke a read of the special-purpose register that holds the clock cycle counter, so just do that. Signed-off-by: Chris Metcalf <cmetcalf@mellanox.com>	2017-02-27 16:44:23 +01:00
Olivier Matz	93092a5610	mempool: remove deprecated get and put functions As announced in the deprecation notice, remove the functions for single/multi producer/consumer enqueue/dequeue. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-02-21 12:05:46 +01:00
Olivier Matz	f3bc028909	mempool: remove deprecated count functions As announced in the deprecation notice, remove these functions. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-02-21 12:05:46 +01:00
Thomas Monjalon	420195e6af	log: remove old symbols from map When removing log history functions, the map has not been updated. Fixes: d7e61ad3ae36 ("log: remove deprecated history dump") Reported-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-02-21 11:43:45 +01:00
Ferruh Yigit	aa0d7c2d32	kni: remove KNI vhost support Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-02-21 11:43:07 +01:00
Thomas Monjalon	d450914ab8	version: 17.05-rc0 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-02-17 12:17:39 +01:00
Thomas Monjalon	b9ebab26d9	version: 17.02.0 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-02-14 22:17:45 +01:00
Pablo de Lara	b09efeb9d5	doc: add thread-safety information about EFD library Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2017-02-14 21:48:36 +01:00
Dmitriy Yakovlev	e7ee2ca1c9	cfgfile: fix uninitialized variable on load error Uninitialized scalar variable. Using uninitialized value cfg->sections[curr_section]->num_entries when calling rte_cfgfile_close. And memory in variables cfg->sections[curr_section], sect->entries[curr_entry] maybe not equal NULL. We must decrement counters curr_section, curr_entry when failed to realloc. Fixes: eaafbad419bf ("cfgfile: library to interpret config files") Signed-off-by: Dmitriy Yakovlev <bombermag@gmail.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-02-14 18:13:48 +01:00
Qi Zhang	2eed820fd4	vfio: fix maximum number of interrupt for MSI-X The maximum number of interrupt requests may change after rte_intr_callback_register, so get_max_intr needs to check whether max_intr must be updated. Signed-off-by: Qi Zhang <qi.z.zhang@intel.com>	2017-02-13 22:25:04 +01:00
Andrew Rybchenko	549b3587f5	ethdev: fix typo in UDP tunnel API description Fixes: 1cbe755fef47 ("ethdev: rename UDP tunnel port functions") Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com>	2017-02-13 22:22:19 +01:00
Thomas Monjalon	47aa9d4e0d	version: 17.02-rc3 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-02-10 17:15:32 +01:00
Slawomir Mrozowicz	b75a76d354	cryptodev: fix crash when querying device by name This patch fixes a segmentation fault in function rte_cryptodev_devices_get(), due to incorrect driver name path. It reworks the function to use correct types and clean up for visibility. Coverity issue: 141067 Fixes: 38227c0e3ad2 ("cryptodev: retrieve device info") Signed-off-by: Slawomir Mrozowicz <slawomirx.mrozowicz@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-02-10 15:57:29 +01:00
Jingjing Wu	64c1375b83	mbuf: fix bitmask of Tx offload flags Add missed PKT_TX_MACSEC and PKT_TX_IEEE1588_TMST flags to bitmask of all supported packet Tx offload features flags. Fixes: 4fb7e803eb1a ("ethdev: add Tx preparation") Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-02-10 12:25:49 +01:00
Yong Wang	511a4c74b8	pci: fix UIO interrupt file descriptor check before close The "dev->intr_handle.fd" is possibly a negative value while it is passed as an argument to function "close". Fix the check to the fd. Fixes: 5a60a7ffc801 ("pci: introduce functions to alloc and free uio resource") Signed-off-by: Yong Wang <wang.yong19@zte.com.cn>	2017-02-10 14:23:27 +01:00
Alan Dewar	3b780b9e9e	sched: fix crash when freeing port Prevent a segmentation fault in rte_sched_port_free by only accessing the port structure after the NULL pointer check has been made. Fixes: 7b3c4f35 ("sched: fix releasing enqueued packets") Cc: stable@dpdk.org Signed-off-by: Alan Dewar <adewar@brocade.com> Signed-off-by: Jan Blunck <jblunck@infradead.org> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-02-09 18:46:52 +01:00
Patrick MacArthur	811b6b2506	vfio: fix file descriptor leak in multi-process When a secondary process wants access to the VFIO container file descriptor, the primary process calls vfio_get_container_fd() which always opens an entirely new file descriptor on /dev/vfio/vfio. However, once the file descriptor has been passed to the subprocess, it is effectively duplicated, meaning that the copy of the file descriptor in the primary process is no longer needed. However, the primary process does not close the duplicate fd, which results in a resource leak. This can be reproduced by starting a primary process with a small RLIMIT_NOFILE limit configured to use VFIO for at least one device, and repeatedly launching secondary processes until the file descriptor limit is exceeded. Fix the resource leak by closing the local vfio container file descriptor after passing it to the secondary process. Fixes: 2f4adfad0a69 ("vfio: add multiprocess support") Cc: stable@dpdk.org Signed-off-by: Patrick MacArthur <patrick@patrickmacarthur.net> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2017-02-09 18:39:30 +01:00
Thomas Monjalon	5b243cbab2	version: 17.02-rc2 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-30 23:47:11 +01:00
Emmanuel Roullit	68759bbe73	vhost: remove unneeded variable assignment Found with clang static analysis: lib/librte_vhost/vhost_user.c:996:3: warning: Value stored to 'ret' is never read ret = vhost_user_get_vring_base(dev, &msg.payload.state); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Signed-off-by: Emmanuel Roullit <emmanuel.roullit@gmail.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-30 13:47:20 +01:00
Emmanuel Roullit	5c1f70daaf	vhost: do not GSO when no header is present Found with clang static analysis: lib/librte_vhost/virtio_net.c:723:17: warning: Access to field 'data_off' results in a dereference of a null pointer (loaded from variable 'tcp_hdr') m->l4_len = (tcp_hdr->data_off & 0xf0) >> 2; ^~~~~~~~~~~~~~~~~ Fixes: d0cf91303d73 ("vhost: add Tx offload capabilities") Cc: stable@dpdk.org Signed-off-by: Emmanuel Roullit <emmanuel.roullit@gmail.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-30 13:46:57 +01:00
Yuanhan Liu	b8b992e93f	vhost: fix long stall of negotiation Setting up the mapping from GPA (guest physical address) to HPA (host physical address) could be very time consuming when the guest memory is backed with small pages (4K). The bigger the guest memory, the longer it takes. This could lead to a very long vhost-user negotiation. Since the mapping is only needed in zero copy mode so far, we could avoid such time consuming setup when zero copy is turned off (which is the default case). It's actually a workaround, a right fix might be to start a new thread, and hide the big latency there. Fixes: e246896178e6 ("vhost: get guest/host physical address mappings") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-01-28 14:25:23 +01:00
Yuanhan Liu	cc7301908c	vhost: fix dead loop in enqueue path If a malicious guest forges a dead loop desc chain (let desc->next point to itself) and desc->len is zero, this could lead to a dead loop in copy_mbuf_to_desc(following is a simplified code to show this issue clearly): while (mbuf_is_not_totally_consumed) { if (desc_avail == 0) { desc = &descs[desc->next]; desc_avail = desc->len; } COPY(desc, mbuf, desc_avail); } I have actually fixed a same issue before: commit a436f53ebfeb ("vhost: avoid dead loop chain"); it fixes the dequeue path though, leaving the enqueue path still vulnerable. The fix is the same. Add a var nr_desc to avoid the dead loop. Fixes: f1a519ad981c ("vhost: fix enqueue/dequeue to handle chained vring descriptors") Cc: stable@dpdk.org Reported-by: Xieming Katty <katty.xieming@huawei.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-01-28 14:25:23 +01:00
Slawomir Mrozowicz	38227c0e3a	cryptodev: retrieve device info This patch adds helper functions for new performance application which provide identifiers and number of crypto device and provide and check capabilities available for defined device and algorithm. The performance application can be used to measure throughput and latency of cryptography operation performed by crypto device. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Slawomir Mrozowicz <slawomirx.mrozowicz@intel.com> Signed-off-by: Marcin Kerlin <marcinx.kerlin@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-01-30 17:46:36 +01:00
Fan Zhang	547017d80a	cryptodev: add scheduler PMD name and type This patch adds the cryptodev scheduler PMD name and type identifier to librte_cryptodev. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-01-30 17:23:33 +01:00
Hemant Agrawal	3b84878c4b	cryptodev: decouple from PCI device This makes struct rte_cryptodev independent of struct rte_pci_device by replacing it with a pointer to the generic struct rte_device. This is inline with the recent changes in ethdev Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com> Acked-by: John Griffin <john.griffin@intel.com> Reviewed-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-01-30 17:23:33 +01:00
Declan Doherty	b815872e70	cryptodev: uninline some functions rte_cryptodev_pmd_get_dev, rte_cryptodev_pmd_get_named_dev, rte_cryptodev_pmd_is_valid_dev were incorrectly marked as inline and therefore not useable from crypto PMDs when built as shared libraries as they accessed the global rte_cryptodev_globals device structure. Fixes: d11b0f30 ("cryptodev: introduce API and framework for crypto devices") Signed-off-by: Declan Doherty <declan.doherty@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com>	2017-01-30 17:23:33 +01:00
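
Commit cc7301908c in the list above describes a descriptor chain whose `next` field points back to itself with a zero `len`, and a fix that counts visited descriptors in a variable named `nr_desc`. The following is a minimal sketch of that idea only, not the vhost code itself: `struct desc`, `copy_to_chain`, and the `ring_size` bound are hypothetical names chosen for illustration, and the real vring descriptor layout and copy routine differ.

```c
#include <stdint.h>

/* Hypothetical, simplified descriptor; not the real vring layout. */
struct desc {
    uint32_t len;   /* bytes of buffer space in this descriptor */
    uint16_t next;  /* index of the next descriptor in the chain */
};

/*
 * Pretend to copy `bytes_to_copy` bytes into the chain starting at `head`.
 * Returns 0 once all bytes are placed, or -1 if the chain is malformed:
 * an index is out of range, or more than `ring_size` descriptors were
 * visited, which can only happen if the chain loops.
 */
static int copy_to_chain(const struct desc *descs, uint16_t ring_size,
                         uint16_t head, uint32_t bytes_to_copy)
{
    uint16_t idx = head;
    uint32_t desc_avail;
    uint32_t nr_desc = 1;   /* descriptors visited so far */

    if (head >= ring_size)
        return -1;
    desc_avail = descs[idx].len;

    while (bytes_to_copy > 0) {
        if (desc_avail == 0) {
            /* Current descriptor is full: follow the chain, bounded. */
            idx = descs[idx].next;
            if (idx >= ring_size || ++nr_desc > ring_size)
                return -1;
            desc_avail = descs[idx].len;
            continue;   /* the new descriptor may be empty as well */
        }
        uint32_t n = desc_avail < bytes_to_copy ? desc_avail : bytes_to_copy;
        desc_avail -= n;        /* stands in for the actual memory copy */
        bytes_to_copy -= n;
    }
    return 0;
}
```

With a self-referencing, zero-length descriptor the unbounded loop from the commit message would never terminate; in this sketch the counter exceeds `ring_size` and the function returns -1 instead.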


2940 Commits
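
The "eal: do not panic" commits listed above share one pattern: an initialization step reports failure to the caller and records the cause, and, where the failure is reversible, the run-once guard is cleared so the application can retry. The sketch below illustrates that pattern in isolation under stated assumptions. `demo_init`, `scan_hugepages`, and the `simulate_failure` flag are hypothetical; plain `errno` stands in for whatever error variable the EAL actually sets, and the specific error codes are illustrative choices.

```c
#include <errno.h>
#include <stdbool.h>

/* Hypothetical stand-in for the init guard mentioned in the commits. */
static bool run_once;

/* Hypothetical init step that may fail, e.g. due to permissions. */
static int scan_hugepages(bool simulate_failure)
{
    if (simulate_failure) {
        errno = EACCES;
        return -1;
    }
    return 0;
}

/*
 * Returns 0 on success and -1 on failure with errno set, instead of
 * aborting the process. A recoverable failure clears the guard so the
 * caller may fix the problem and call again.
 */
static int demo_init(bool simulate_failure)
{
    if (run_once) {
        errno = EALREADY;   /* already initialized: report, do not abort */
        return -1;
    }
    run_once = true;

    if (scan_hugepages(simulate_failure) < 0) {
        int saved = errno;
        run_once = false;   /* reversible failure: allow a retry */
        errno = saved;
        return -1;
    }
    return 0;
}
```

A caller could check the return value, log `errno`, take corrective action, and call `demo_init` again. A second call after a successful one fails with `EALREADY` rather than terminating the application.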