numam-dpdk

Author	SHA1	Message	Date
Pablo de Lara	afe6050060	cryptodev: add algorithm string parsers Adds functions to get the cipher/authentication algorithm enums, given a string. This is useful for applications which gets the algorithm required from the user, to have a common string-enum mapping. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2017-04-06 00:17:44 +02:00
Pablo de Lara	e26690f4fc	cryptodev: add missing algorithm strings DES-CBC and AUTH NULL algorithms were missing in the array of algorithm strings. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com> Acked-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2017-04-06 00:17:44 +02:00
Ed Czeck	65d9e621fa	ring: fix C++ cast error build error: include/rte_ring.h:459:22: error: invalid conversion from ‘void’ to ‘void’ [-fpermissive] ENQUEUE_PTRS(r, &r[1], prod_head, obj_table, n, void ); Implicit casts of void* to void** are considered warnings in some compilers. E.g. g++ version 5.8. Cast directly to object types Fixes: `a6619414` ("ring: make struct and macros type agnostic") Signed-off-by: Ed Czeck <ed.czeck@atomicrules.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-06 17:55:54 +02:00
Bruce Richardson	8b2928c978	ethdev: fix missing function in version map file The function rte_eth_find_next is missing in the map file, which causes errors with shared library builds. .../test-pmd/testpmd.c:1693: undefined reference to `rte_eth_find_next' Adding function to map file fixes the issue. Fixes: `5588909af2` ("ethdev: add device iterator") Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2017-04-06 12:52:52 +02:00
Jasvinder Singh	986ff526fb	net: add CRC computation API APIs for selecting the architecure specific implementation and computing the crc (16-bit and 32-bit CRCs) are added. For CRCs calculation, scalar as well as x86 intrinsic(sse4.2) versions are implemented. The scalar version is based on generic Look-Up Table(LUT) algorithm, while x86 intrinsic version uses carry-less multiplication for fast CRC computation. Signed-off-by: Jasvinder Singh <jasvinder.singh@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-04-05 23:03:23 +02:00
Gaetan Rivet	5588909af2	ethdev: add device iterator This iterator helps applications iterate over the device list and skip holes caused by invalid or detached devices. Signed-off-by: Gaetan Rivet <gaetan.rivet@6wind.com>	2017-04-05 22:47:57 +02:00
Gaetan Rivet	d52268a8b2	ethdev: expose device states The hotplug API introduced multiple states for a device with possible values defined internally, while the related field in struct rte_eth_dev was made public. Exposing those states improves consistency because applications have to deal with the device list directly. "DEV_DETACHED" is renamed "RTE_ETH_DEV_UNUSED" to better reflect that the emptiness of a slot is not necessarily the result of detaching a device. Signed-off-by: Gaetan Rivet <gaetan.rivet@6wind.com>	2017-04-05 22:47:57 +02:00
Ferruh Yigit	3061d0d961	ring: fix build with icc build error: In file included from .../lib/librte_ring/rte_ring.c(90): .../lib/librte_ring/rte_ring.h(162): error #1366: a reduction in alignment without the "packed" attribute is ignored } __rte_cache_aligned; ^ Alignment attribute moved to first element of the struct Fixes: `a6619414e0` ("ring: make struct and macros type agnostic") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2017-04-05 18:20:42 +02:00
Reshma Pattan	5cd3cac9ed	latency: added new library for latency stats Add a library designed to calculate latency statistics and report them to the application when queried. The library measures minimum, average and maximum latencies, and jitter in nano seconds. The current implementation supports global latency stats, i.e. per application stats. Signed-off-by: Reshma Pattan <reshma.pattan@intel.com> Signed-off-by: Remy Horton <remy.horton@intel.com> Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com>	2017-04-05 18:00:42 +02:00
Remy Horton	2ad7ba9a65	bitrate: add bitrate statistics library This patch adds a library that calculates peak and average data-rate statistics. For ethernet devices. These statistics are reported using the metrics library. Signed-off-by: Remy Horton <remy.horton@intel.com>	2017-04-05 17:59:43 +02:00
Remy Horton	349950ddb9	metrics: add information metrics library This patch adds a new information metrics library. This Metrics library implements a mechanism by which producers can publish numeric information for later querying by consumers. Metrics themselves are statistics that are not generated by PMDs, and hence are not reported via ethdev extended statistics. Metric information is populated using a push model, where producers update the values contained within the metric library by calling an update function on the relevant metrics. Consumers receive metric information by querying the central metric data, which is held in shared memory. Signed-off-by: Remy Horton <remy.horton@intel.com>	2017-04-05 17:58:51 +02:00
Olivier Matz	4f0981e6ec	eal: deprecate log functions Deprecate the following functions: - rte_set_log_level(), replaced by rte_log_set_global_level() - rte_get_log_level(), replaced by rte_log_get_global_level() - rte_set_log_type(), replaced by rte_log_set_level() - rte_get_log_type(), replaced by rte_log_get_level() The new functions provide a better control of the per-type log level, and have a better name prefix (rte_log_). Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 13:48:40 +02:00
Olivier Matz	845afe51e4	eal: change specific log levels at startup Example of use: ./app/test-pmd --log-level='pmd\.i40e.*,8' This enables debug logs for all dynamic logs whose type starts with 'pmd.i40e'. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 13:37:17 +02:00
Olivier Matz	a5279180f5	eal: change several log levels matching a regexp Introduce a function to set the log level of several log types that match a regular expression. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 13:37:17 +02:00
Olivier Matz	432050bfd0	eal: dump registered log types Introduce a function to dump the global level and the registered log types. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 13:37:17 +02:00
Olivier Matz	c1b5fa94a4	eal: support dynamic log types Introduce 2 new functions to support dynamic log types: - rte_log_register(): register a log name, and return a log type id - rte_log_set_level(): set the log level of a given log type Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 13:37:17 +02:00
Olivier Matz	12e58e707c	mbuf: bump library version The reorganization of the mbuf structure induces an ABI breakage. Bump the library version, and update the documentation accordingly. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 13:37:17 +02:00
Olivier Matz	918ae9dc77	mbuf: add a timestamp field The field itself is not fully described yet, but this commit reserves the room in the mbuf. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Olivier Matz	a22659550c	mbuf: move sequence number in second cache line Move this field in the second cache line, since no driver use it in Rx path. The freed space will be used by a timestamp in next commit. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Olivier Matz	97cb466d65	mbuf: use 2 bytes for port and nb segments Change the size of m->port and m->nb_segs to 16 bits. It is now possible to reference a port identifier larger than 256 and have a mbuf chain larger than 256 segments. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Jerin Jacob	dc448dc460	mbuf: make rearm data address naturally aligned To avoid multiple stores on fast path, Ethernet drivers aggregate the writes to data_off, refcnt, nb_segs and port to an uint64_t data and write the data in one shot with uint64_t* at &mbuf->rearm_data address. Some of the non-IA platforms have store operation overhead if the store address is not naturally aligned.This patch fixes the performance issue on those targets. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Olivier Matz	8f094a9ac5	mbuf: set mbuf fields while in pool Set the value of m->refcnt to 1, m->nb_segs to 1 and m->next to NULL when the mbuf is stored inside the mempool (unused). This is done in rte_pktmbuf_prefree_seg(), before freeing or recycling a mbuf. Before this patch, the value of m->refcnt was expected to be 0 while in pool. The objectives are: - to avoid drivers to set m->next to NULL in the early Rx path, since this field is in the second 64B of the mbuf and its access could trigger a cache miss - rationalize the behavior of raw_alloc/raw_free: one is now the symmetric of the other, and refcnt is never changed in these functions. To optimize the freeing of the segments, we try try to only update m->refcnt, m->next, and m->nb_segs when it's required (idea from Konstantin Ananyev <konstantin.ananyev@intel.com>). Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Olivier Matz	1f88c0a22b	mbuf: make raw free function public Rename __rte_mbuf_raw_free() as rte_mbuf_raw_free() and make it public. The old function is kept for compat but is marked as deprecated. The next commit changes the behavior of rte_mbuf_raw_free() to make it more consistent with rte_mbuf_raw_alloc(). Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Olivier Matz	54e9290269	mbuf: make segment prefree function public Document the function and make it public, since it is used at several places in the drivers. The old one is marked as deprecated. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Harry van Haaren	d924782d84	eventdev: improve docs of start function This commit documents two error return values for the rte_event_dev_start() function. -ESTALE indicates not all ports are configured -ENOLINK indicates that not all queues are linked to ports. If an application enqueues to such a queue it can lead to deadlock Suggested-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com>	2017-04-04 19:19:51 +02:00
Gage Eads	406aed4e0d	eventdev: add errno-style return values This commit adds rte_errno return values to rte_event_enqueue_burst() and rte_event_dequeue_burst(). These return values allows user software to differentiate between an invalid argument (such as an invalid queue_id or sched_type in an enqueued event) and backpressure from the event device. The port and device ID checks are placed in RTE_LIBRTE_EVENTDEV_DEBUG header guards to avoid the performance hit in non-debug execution. Signed-off-by: Gage Eads <gage.eads@intel.com>	2017-04-04 19:19:51 +02:00
Bruce Richardson	3ed7fc039a	eventdev: add extended stats Add in APIs for extended stats so that eventdev implementations can report out information on their internal state. The APIs are based on, but not identical to, the equivalent ethdev functions. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:19:51 +02:00
Harry van Haaren	361e733645	eventdev: remove default queue overriding PMDs that only do a specific type of scheduling cannot provide CFG_ALL_TYPES, so the Eventdev infrastructure should not demand that every PMD supports CFG_ALL_TYPES. By not overriding the default configuration of the queue as suggested by the PMD, the eventdev_common unit tests can pass on all PMDs, regardless of their capabilities. RTE_EVENT_QUEUE_CFG_DEFAULT is no longer used by the eventdev layer it can be removed now. Applications should use CFG_ALL_TYPES if they require enqueue of all types a queue, or specify which type of queue they require. The CFG_DEFAULT value is changed to CFG_ALL_TYPES in event/skeleton, to not break the compile. A capability flag is added that indicates if the underlying PMD supports creating queues of ALL_TYPES. Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:19:51 +02:00
Jerin Jacob	836a9ddc3f	eventdev: return code in dequeue timeout conversion eventdev driver may return error on dequeue timeout tick conversion. Change the pmd callback interface to address the same. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2017-04-04 19:17:39 +02:00
Gage Eads	d6c40e22cd	eventdev: fix links map initialization for SW PMD This patch initializes the links_map array entries to EVENT_QUEUE_SERVICE_PRIORITY_INVALID, as expected by rte_event_port_links_get(). This is necessary for the sw eventdev PMD, which does not initialize links_map when rte_event_port_setup() calls rte_event_port_unlink(). Fixes: `4f0804bbdf` ("eventdev: implement the northbound APIs") Signed-off-by: Gage Eads <gage.eads@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:17:33 +02:00
Nipun Gupta	c163219950	eventdev: use generic device holder rte_device is a generic device which is available to the applications and EAL. This patch replaces rte_pci_device in 'struct rte_eventdev' and in 'struct rte_event_dev_info' with common rte_device. Signed-off-by: Nipun Gupta <nipun.gupta@nxp.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-04-04 19:17:33 +02:00
Harry van Haaren	23cabf3017	eventdev: improve API doc for timeout ticks Improve the documentation of the return values of the rte_event_dequeue_timeout_ticks() function, adding a -ENOTSUP value for eventdevs that do not support waiting. Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:17:33 +02:00
Harry van Haaren	dcd8f8bf3a	eventdev: increase size of enq/deq conf variables Large port enqueue sizes were not supported as the value it was stored in was a uint8_t. Using uint8_ts to save space in config apis makes no sense - increasing the 3 instances of uint8_t enqueue / dequeue depths to more appropriate values (based on the context around them). Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:17:33 +02:00
Nipun Gupta	7bf3729c5f	eventdev: amend timeout criteria comment for burst dequeue Signed-off-by: Nipun Gupta <nipun.gupta@nxp.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2017-04-04 19:17:33 +02:00
Gage Eads	4f9cf0bc40	eventdev: clarify some parameter descriptions This commit clarifies the usage of nb_links and nb_unlinks when passing a NULL pointer as the queues argument. Signed-off-by: Gage Eads <gage.eads@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:17:33 +02:00
Nipun Gupta	83afcdd1bc	eventdev: amend comments for events limit and threshold Updated the comments on 'nb_events_limit' of 'struct rte_event_dev_config' and 'new_event_threshold' of 'struct rte_event_port_conf'. Signed-off-by: Nipun Gupta <nipun.gupta@nxp.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2017-04-04 19:17:33 +02:00
Jerin Jacob	af316ecf58	eventdev: limit port link operation to configured queues On port_setup, the link_map is updated only for configured number of event queues. Limit the port_links_get scan only to configured number of event queues. Also, Limit the port link and unlink queue validation to configured number of event queues. Fixes: `4f0804bbdf` ("eventdev: implement the northbound APIs") Reported-by: Nipun Gupta <nipun.gupta@nxp.com> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Nipun Gupta <nipun.gupta@nxp.com>	2017-04-04 19:17:26 +02:00
Jerin Jacob	8b78c1991d	eventdev: support vdev uninit Added eventdev vdev uninit support to release the resources allocated in eventdev vdev init. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2017-04-04 19:17:26 +02:00
Jerin Jacob	7b37f6f7ad	eventdev: fix event driver name to eventdev lookup - Removed uninitialized max_devs value - Corrected dev assignment Fixes: `4f0804bbdf` ("eventdev: implement the northbound APIs") Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com>	2017-04-04 19:17:08 +02:00
Bruce Richardson	2ed019db7c	eventdev: remove unneeded dependencies Since eventdev uses event structures rather than working directly on mbufs, there is no actual dependencies on the mbuf library. The inclusion of an mbuf pointer element inside the event itself does not require the inclusion of the mbuf header file. Similarly the pci header is not needed, but following their removal, rte_memory.h is needed for the definition of the __rte_cache_aligned macro. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:12:00 +02:00
Nipun Gupta	d3e281a540	eventdev: update event port link and unlink callbacks Added a pointer to the rte_eventdev type in the event port link and unlink callbacks. This device shall be used by some of the event drivers to fetch queue related information. Also, update the skeleton eventdev driver with corresponding changes. Signed-off-by: Nipun Gupta <nipun.gupta@nxp.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 19:12:00 +02:00
Jerin Jacob	322d0345c2	eventdev: implement PMD registration functions This patch adds infrastructure for registering the vdev or the PCI based event device. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-04-04 19:12:00 +02:00
Jerin Jacob	4f0804bbdf	eventdev: implement the northbound APIs This patch implements northbound eventdev API interface using southbond driver interface Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-04-04 19:12:00 +02:00
Jerin Jacob	5223a1f3b8	eventdev: define southbound driver interface Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-04-04 19:12:00 +02:00
Jerin Jacob	71f2384328	eventdev: introduce event driven programming model In a polling model, lcores poll ethdev ports and associated rx queues directly to look for packet. In an event driven model, by contrast, lcores call the scheduler that selects packets for them based on programmer-specified criteria. Eventdev library adds support for event driven programming model, which offer applications automatic multicore scaling, dynamic load balancing, pipelining, packet ingress order maintenance and synchronization services to simplify application packet processing. By introducing event driven programming model, DPDK can support both polling and event driven programming models for packet processing, and applications are free to choose whatever model (or combination of the two) that best suits their needs. This patch adds the eventdev specification header file. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-04-04 19:12:00 +02:00
Yuanhan Liu	27052cd63f	vhost: do not destroy device on repeat mem table message It doesn't make any sense to invoke destroy_device() callback at while handling SET_MEM_TABLE message. From the vhost-user spec, it's the GET_VRING_BASE message indicates the end of a vhost device: the destroy_device() should be invoked from there (luckily, we already did that). Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	ea67f6ccf7	vhost: workaround the build dependency on mbuf header rte_mbuf struct is something more likely will be used only in vhost-user net driver, while we have made vhost-user generic enough that it can be used for implementing other drivers (such as vhost-user SCSI), they have also include <rte_mbuf.h>. Otherwise, the build will be broken. We could workaround it by using forward declaration, so that other non-net drivers won't need include <rte_mbuf.h>. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	a798beb47c	vhost: rename header file Rename "rte_virtio_net.h" to "rte_vhost.h", to not let it be virtio net specific. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	af14759181	vhost: introduce API to start a specific driver We used to use rte_vhost_driver_session_start() to trigger the vhost-user session. It takes no argument, thus it's a global trigger. And it could be problematic. The issue is, currently, rte_vhost_driver_register(path, flags) actually tries to put it into the session loop (by fdset_add). However, it needs a set of APIs to set a vhost-user driver properly: * rte_vhost_driver_register(path, flags); * rte_vhost_driver_set_features(path, features); * rte_vhost_driver_callback_register(path, vhost_device_ops); If a new vhost-user driver is registered after the trigger (think OVS-DPDK that could add a port dynamically from cmdline), the current code will effectively starts the session for the new driver just after the first API rte_vhost_driver_register() is invoked, leaving later calls taking no effect at all. To handle the case properly, this patch introduce a new API, rte_vhost_driver_start(path), to trigger a specific vhost-user driver. To do that, the rte_vhost_driver_register(path, flags) is simplified to create the socket only and let rte_vhost_driver_start(path) to actually put it into the session loop. Meanwhile, the rte_vhost_driver_session_start is removed: we could hide the session thread internally (create the thread if it has not been created). This would also simplify the application. NOTE: the API order in prog guide is slightly adjusted for showing the correct invoke order. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	52f8091f05	vhost: export APIs for live migration support Export few APIs for the vhost-user driver to log the guest memory writes, which is a must for live migration support. This patch basically moves vhost_log_write() and vhost_log_used_vring() into vhost.h and then add an wrapper (the public API) to them. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	abd53c16b6	vhost: add features changed callback Features could be changed after the feature negotiation. For example, VHOST_F_LOG_ALL will be set/cleared at the start/end of live migration, respecitively. Thus, we need a new callback to inform the application on such change. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	cb04355743	vhost: rename virtio-net to vhost Rename "virtio-net" to "vhost" in the API comments and vhost prog guide. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	7c12903746	vhost: rename device ops struct rename "virtio_net_device_ops" to "vhost_device_ops", to not let it be virtio-net specific. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	aca49772f6	vhost: do not include net specific headers Include it internally, at vhost.h. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	f53cf83980	vhost: drop the Rx and Tx queue macro They are virtio-net specific and should be defined inside the virtio-net driver. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	c0674b1bc8	vhost: move the device ready check at proper place Currently, we check vq->desc, vq->kickfd and vq->callfd to know whether a virtio device is ready or not. However, we only do it when handling SET_VRING_KICK message, which could be wrong if a vhost-user frontend send SET_VRING_KICK first and SET_VRING_CALL later. To work for all possible vhost-user frontend implementations, we could move the ready check at the end of vhost-user message handler. Meanwhile, since we do the check more often than before, the "virtio not ready" message is dropped, to not flood the screen. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	b50a203986	vhost: export the number of vrings We used to use rte_vhost_get_queue_num() for telling how many vrings. However, the return value is the number of "queue pairs", which is very virtio-net specific. To make it generic, we should return the number of vrings instead, and let the driver do the proper translation. Say, virtio-net driver could turn it to the number of queue pairs by dividing 2. Meanwhile, mark rte_vhost_get_queue_num as deprecated. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	ab4d7b9f1a	vhost: turn queue pair to vring The queue pair is very virtio-net specific, other devices don't have such concept. To make it generic, we should log the number of vrings instead of the number of queue pairs. This patch just does a simple convert, a later patch would export the number of vrings to applications. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	dba9bf127b	vhost: export API to translate gpa to vva Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	40ef286f23	vhost: export vhost vring info Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:42:44 +02:00
Yuanhan Liu	ca33faf9ef	vhost: introduce API to fetch negotiated features Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:41:54 +02:00
Yuanhan Liu	eb32247457	vhost: export guest memory regions Some vhost-user driver may need this info to setup its own page tables for GPA (guest physical addr) to HPA (host physical addr) translation. SPDK (Storage Performance Development Kit) is one example. Besides, by exporting this memory info, we could also export the gpa_to_vva() as an inline function, which helps for performance. Otherwise, it has to be referenced indirectly by a "vid". Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:40:13 +02:00
Yuanhan Liu	93433b639d	vhost: make notify ops per vhost driver Assume there is an application both support vhost-user net and vhost-user scsi, the callback should be different. Making notify ops per vhost driver allow application define different set of callbacks for different driver. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:40:13 +02:00
Yuanhan Liu	0917f9d1f0	vhost: use new APIs to handle features Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:40:13 +02:00
Yuanhan Liu	5fbb3941da	vhost: introduce driver features related APIs Introduce few APIs to set/get/enable/disable driver features. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 10:40:13 +02:00
Yuanhan Liu	65388b43f5	vhost: fix fd leaks for vhost-user server mode A vhost-user server socket could have many connections, thus many connfd. However, we currently just use one single int var to store it. Meaning, it will get overwritten every time a new connection is created. While this will not create fatal issue as it sounds (since the correct connfd is closured to the event loop thread by fdset_add), it may cause fd leaks if a user invokes rte_vhost_driver_unregister before shutting down all connections: it just closes the recent connfd. A simple example that should be able to reproduce this leaks issues is, del the ovs vhost-user port while the connected VMs are still alive. (Note that it's suggested to use one socket for one VM, which makes the issue not that fatal as it sounds again). Since we already use a struct "vhost_user_connection" to track all info about one connection, it's obvious that we should put the connfd there. Then we could build a connection list inside the vhost_user_socket struct, to represent all connections belong that socket file. Fixes: `164fd39678` ("vhost: fix unregistering in client mode") Cc: stable@dpdk.org Cc: Ilya Maximets <i.maximets@samsung.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 10:36:17 +02:00
Jianfeng Tan	99998feec9	eal/linux: add interrupt type for vdev A new interrupt type, RTE_INTR_HANDLE_VDEV, is added to support lsc and rxq interrupt for vdev. For lsc interrupt, except from original EPOLLIN events, we also listen for socket peer closed connection event (EPOLLRDHUP and EPOLLHUP). For rxq interrupt, add a precondition to avoid invoking any vfio and uio code. For intr_handle initialization, let each vdev driver to do that. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com>	2017-04-01 10:36:17 +02:00
Kevin Traynor	4e9474141e	vhost: fix false sharing The broadcast_rarp field in the virtio_net struct is checked in the dequeue datapath regardless of whether descriptors are available or not. As it is checked with cmpset leading to a write, false sharing on the virtio_net struct can happen between enqueue and dequeue datapaths regardless of whether a RARP is requested. In OVS, the issue can cause a uni-directional performance drop of up to 15%. Fix that by only performing the cmpset if a read of broadcast_rarp indicates that the cmpset is likely to succeed. Fixes: `a66bcad322` ("vhost: arrange struct fields for better cache sharing") Cc: stable@dpdk.org Signed-off-by: Kevin Traynor <ktraynor@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 10:36:17 +02:00
Maxime Coquelin	72e8543093	vhost: add API to get MTU value This patch implements the function for the application to get the MTU value. rte_vhost_get_mtu() fills the mtu parameter with the MTU value set in QEMU if VIRTIO_NET_F_MTU has been negotiated and returns 0, -ENOTSUP otherwise. The function returns -EAGAIN if Virtio feature negotiation didn't happened yet. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 10:36:17 +02:00
Maxime Coquelin	4c5d8459d2	vhost: add new ready status flag This patch adds a new status flag indicating the Virtio device is ready to operate. This is required to be able to call rte_vhost_mtu_get() in the .new_device() callback, as rte_vhost_mtu_get needs that the negotiation is done, but it is too early to rely on running status flag, which is set just after .new_device() returns. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 10:36:17 +02:00
Maxime Coquelin	23f1e756ca	vhost: support MTU protocol feature This patch implements the vhost-user MTU protocol feature support. When VIRTIO_NET_F_MTU is negotiated, QEMU notifies the vhost-user backend with the configured MTU if dedicated protocol feature is supported. The value can be used by the application to ensure consistency with value set by the user. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 10:36:17 +02:00
Maxime Coquelin	3d3c6590b5	vhost: enable virtio MTU feature This patch enables the new VIRTIO_NET_F_MTU feature, which makes possible for the host to advise the guest with its maximum supported MTU. MTU value is set via QEMU parameters, either via Libvirt XML, or directly in virtio-net device command line arguments. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 10:36:17 +02:00
Yuanhan Liu	160cbc815b	vhost: remove a hack on queue allocation We used to allocate queues based on the index from SET_VRING_CALL request: if corresponding queue hasn't been allocated, allocate it. Though it's pratically right (it's the first per-vring request we will get from QEMU for vhost-user negotiation), but it's not technically right: it's not documented in the vhost-user spec that it will always be the first per-vring request. For example, SET_VRING_ADDR could also be the first per-vring request. Thus, we should not depend the SET_VRING_CALL on queue allocation. Instead, we could catch all the per-vring messages at the entrance of request handler, and allocate one if it hasn't been allocated before. By that, we could remove a hack. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 10:36:06 +02:00
Yuanhan Liu	de8e1fdcec	vhost: fix max queues 0x8000 is the max virito-net queue pairs the virtio 1.0 spec claims to support. While for vhost-user, it's a different story: the max vring index could be passed by the vhost-user spec is 0xff, masked by the VHOST_USER_VRING_IDX_MASK. That said, the max queue pairs could vhost-user could supported is 0x80. If user are asking more, I think the vhost-user need be extended. Fixes: `b09b198bfb` ("vhost-user: announce queue number in message") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 08:58:54 +02:00
Yuanhan Liu	8d286dbeb8	vhost: fix multiple queue not enabled for old kernels Some macros (say VIRTIO_NET_F_MQ) are needed for enabling multiple queue, however they are introduced since kernel v3.8, meaning build error happens if we build DPDK vhost on those platforms. `71dfdbe66a` ("vhost: fix build with kernel < 3.8") meant to fix it, but in a wrong way: it completely disables the MQ features for those kernels. However, the MQ feature doesn't depend on the kernel at all (except the macros dependency stated above), that we could still enable the MQ feature even the host kernel has no such support. The right fix is to define the macro if it's not defined. Fixes: `71dfdbe66a` ("vhost: fix build with kernel < 3.8") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-04-01 08:58:54 +02:00
Ilya Maximets	29b851e8de	vhost: change log levels in client mode Inability to connect to socket is a normal situation in client mode because, in common case, server isn't started yet. RTE_LOG_WARNING should be suitable for the case of some unusual errors. Message about reconnection is not an error at all. Fixes: `e623e0c6d8` ("vhost: add reconnect ability") Cc: stable@dpdk.org Signed-off-by: Ilya Maximets <i.maximets@samsung.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-04-01 08:58:54 +02:00
Matthias Gatto	1b815b8959	vhost: try to shrink pfdset when fdset_add fails fdset_add increments pfdset->num, but fdset_del doesn't decrement pfdset->num, so if we call fdset_add then fdset_del in a loop without calling fdset_shrink, we can easily exceed MAX_FDS with only a few number of fds used. So my solution is simply to call fdset_shrink in fdset_add when it exceeds MAX_FDS. Because fdset_shrink and fdset_add locks pfdset->fd_mutex we can't call fdset_shrink inside fdset_add because that would cause a dead lock, so this patch split fdset_shrink in two, fdset_shrink and fdset_shrink_nolock. Fixes: `59317cef24` ("vhost: allow many vhost-user ports") Cc: stable@dpdk.org Signed-off-by: Matthias Gatto <matthias.gatto@outscale.com>	2017-04-01 08:58:54 +02:00
Yongseok Koh	6f60ca5e5e	ethdev: remove requirement of aligned RETA size In rte_eth_check_reta_mask(), it is required to align the size of the RETA table to RTE_RETA_GROUP_SIZE but as the size can be less than the limit, this should be removed. The change is also applied to a command of testpmd. Signed-off-by: Yongseok Koh <yskoh@mellanox.com>	2017-04-04 19:03:02 +02:00
Beilei Xing	7cd048321d	ethdev: add MPLS and GRE flow API items This patch adds MPLS and GRE items to generic rte flow. Signed-off-by: Beilei Xing <beilei.xing@intel.com> Acked-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-04-04 19:02:58 +02:00
Shahaf Shuler	49e2f374e4	eal/linux: support external Rx interrupt Prior to this patch only UIO/VFIO interrupt handlers types were supported. This patch adds support for the external interrupt handler type, allowing external drivers to set their own fds with specific interrupt handlers. Signed-off-by: Shahaf Shuler <shahafs@mellanox.com> Acked-by: Yongseok Koh <yskoh@mellanox.com>	2017-04-04 18:59:39 +02:00
Nirmoy Das	2972254ce1	kni: fix build on Suse 12 SP3 Add support for SLES12SP3, which uses kernel 4.4, but backported features from newer kernels. Signed-off-by: Nirmoy Das <ndas@suse.de> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-04-04 17:11:26 +02:00
Allain Legacy	216079fb1d	cfgfile: support empty value This commit adds support to the cfgfile library for parsing a key=value line that has no value string specified (e.g., "key="). This can be used to override a configuration attribute that has a default value or default list of values to set it back to an undefined value to disable functionality. Signed-off-by: Allain Legacy <allain.legacy@windriver.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-04-04 16:32:06 +02:00
Joseph Richard	3f3d51ebc8	cfgfile: fix parsing of long fields When parsing a ini file with a "key = value" line that has both "key" and "value" sized to the maximum allowed length causes a parsing failure. The internal "buffer" variable should be sized at least as large as the maximum for both fields. This commit updates the local array to be sized to hold the max name, max value, " = ", and the nul terminator. Signed-off-by: Allain Legacy <allain.legacy@windriver.com> Acked-by: Keith Wiles <keith.wiles@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-04-04 16:32:06 +02:00
Allain Legacy	8eaff74f22	cfgfile: constrain string search The call to memchr() uses the absolute length of the string buffer instead of the actual length of the string returned by fgets(). This causes the search to go beyond the '\n' character and find ';' characters in random garbage on the stack. This then causes the 'len' variable to be updated and the subsequent search for the '=' character to potentially find one beyond the first newline character. Since this bug relies on ';' and '=' characters appearing in random places in the 'buffer' variable it is intermittently reproducible at best. Signed-off-by: Allain Legacy <allain.legacy@windriver.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-04-04 16:32:06 +02:00
Allain Legacy	f3b1a6981f	cfgfile: support configurable comment character The current cfgfile comment character is hardcoded to ';'. This commit a new API to allow the user to specify which comment character to use while parsing the file. This is to ease adoption by applications that have an existing configuration file which may use a different comment character. For instance, an application may already have a configuration file that uses the '#' as the comment character. The approach of using a new API with an extensible parameters structure was used rather than simply adding a new argument to the existing API to allow for additional arguments to be introduced in the future. Signed-off-by: Allain Legacy <allain.legacy@windriver.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-04-04 16:32:06 +02:00
Allain Legacy	1a5efe7499	cfgfile: support global properties section The current implementation of the cfgfile library requires that all key=value pairs be within [SECTION] definitions. The ini file standard allows for key=value pairs in an unnamed section. https://en.wikipedia.org/wiki/INI_file#Global_properties This commit adds the capability of parsing key=value pairs from such an unnamed section. The CFG_FLAG_GLOBAL_SECTION flag must be passed to the rte_cfgfile_load() API to enable this functionality. Any key=value pairs found before the first section can be accessed in the section named "GLOBAL". Signed-off-by: Allain Legacy <allain.legacy@windriver.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-04-04 16:32:06 +02:00
David Hunt	7c37da5a9a	distributor: fix creation error checks Coverity issue 143258: not freeing distributor instance Coverity issue 143254: not checking return code from malloc Fixes: `775003ad2f` ("distributor: add new burst-capable library") Signed-off-by: David Hunt <david.hunt@intel.com>	2017-04-04 14:58:49 +02:00
Jerin Jacob	9256eed78a	eal/linux: fix build with glibc 2.25 glibc 2.25 is warning about if applications depend on sys/types.h for makedev macro, it expects to be included from <sys/sysmacros.h> Found this error while testing with GCC 6.3.1 on archlinux. lib/librte_eal/linuxapp/eal/eal_pci_uio.c: In function ‘pci_mknod_uio_dev’: lib/librte_eal/linuxapp/eal/eal_pci_uio.c:134:13: error: In the GNU C Library, "makedev" is defined by <sys/sysmacros.h>. For historical compatibility, it is currently defined by <sys/types.h> as well, but we plan to remove this soon. To use "makedev", include <sys/sysmacros.h> directly. If you did not intend to use a system-defined macro "makedev", you should undefine it after including <sys/types.h>. [-Werror] dev = makedev(major, minor); ^~~~~~~~~~~~~~~~~ Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-04-04 14:52:06 +02:00
Bruce Richardson	8415429e43	nic_uio: fix device binding at boot When loading nic_uio from /boot/loader.conf as specified in the Getting Started Guide doc, the NIC devices were not bound at boot. Unloading the nic_uio driver and reloading it would cause them to be bound, however. The root cause appears to be the fact that when the module is loaded at boot, the call to find the pci device when parsing the b:d:f parameter fails to return the device. That means that later on when the device is probed as part of a PCI scan, no action is taken as it's not recorded as a device to be used. We fix this by having the b:d:f string parsed again on probe if the initial check to see if it's an already-known device fails. In my tests, this causes the NIC devices to be successfully bound at boot time, as well as leaving things working as before in the case the module is loaded post-boot. Fixes: `764bf26873` ("add FreeBSD support") Cc: stable@dpdk.org Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2017-04-04 12:28:03 +02:00
Jianfeng Tan	dd18a2f0b2	vfio: fix secondary process start When binding with vfio-pci, secondary process cannot be started with an error message: cannot find TAILQ entry for PCI device. It's due to: struct rte_pci_addr is padded with 1 byte for alignment by compiler. Then below comparison in commit `2f4adfad0a` ("vfio: add multiprocess support") will fail if the last byte is not initialized. memcmp(&vfio_res->pci_addr, &dev->addr, sizeof(dev->addr) And commit `cdc242f260` ("eal/linux: support running as unprivileged user") just triggers this bug by using a stack un-initialized variable. The fix is to use rte_eal_compare_pci_addr() for pci addr comparison. Fixes: `2f4adfad0a` ("vfio: add multiprocess support") Fixes: `cdc242f260` ("eal/linux: support running as unprivileged user") Cc: stable@dpdk.org Reported-by: Pawel Rutkowski <pawelx.rutkowski@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com>	2017-04-04 11:58:57 +02:00
Anatoly Burakov	9fa5993f10	vfio: fix build Some compilers require definition of vfio_iommu_spapr_tce_ddw_info before its use in vfio_iommu_spapr_tce_info, so move tce_info definition below tce_ddw_info. Fixes: `468f42cc26` ("vfio: fix build on old kernel") Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2017-04-03 20:00:23 +02:00
Ferruh Yigit	21be6fb6af	igb_uio: fix build with kernel < 3.2 Recently added "dma_zalloc_coherent()" call is causing build error for Linux kernels < 3.2. compile error: lib/librte_eal/linuxapp/igb_uio/igb_uio.c: In function ‘igbuio_pci_probe’: lib/librte_eal/linuxapp/igb_uio/igb_uio.c:434:2: error: implicit declaration of function ‘dma_zalloc_coherent’ [-Werror=implicit-function-declaration] map_addr = dma_zalloc_coherent(&dev->dev, 1024, ^ dma_zalloc_coherent() introduced with Linux kernel 3.2, with commit Linux: 842fa69f3e0c ("include/linux/dma-mapping.h: add dma_zalloc_coherent()") Since it does not exist for older kernels, causing a build error. Switched to dma_alloc_coherent() API to prevent build error. Fixes: `d287e4d41b` ("igb_uio: map dummy DMA forcing IOMMU domain attachment") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-04-03 19:49:58 +02:00
Shreyansh Jain	1263b426ff	mempool: move stack handler as a driver Moved from lib/librte_mempool, stack mempool handler is an independent driver. Shared builds would now require to link in librte_mempool_stack for "stack" mempool handler. Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-03 19:45:45 +02:00
Shreyansh Jain	9a8e9b57f5	mempool: move ring handler as a driver Moved from lib/librte_mempool, ring mempool is now an independent driver. Shared builds would now need to add librte_mempool_ring for: * ring_mp_mc * ring_sp_sc * ring_sp_mc * ring_mp_sc Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-03 19:45:45 +02:00
Shreyansh Jain	44cebef721	mempool: fix crash when handler not found In case the stack or ring mempool handler are compiled as shared library and not linked in with test binary, segfault is reported. This is because return value of rte_mempool_set_ops_byname is not being checked in rte_mempool_ops_alloc. This patch handles error returned from rte_mempool_set_ops_byname when a mempool is not found. Fixes: `449c49b93a` ("mempool: support handler operations") Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-03 18:53:10 +02:00
Andriy Berestovskyy	240e992f74	mempool: fix typos Signed-off-by: Andriy Berestovskyy <andriy.berestovskyy@caviumnetworks.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-03 18:53:10 +02:00
Gage Eads	8477f2f50d	mempool: update non-EAL thread note Commit `30e6399892` ("mempool: support non-EAL thread") added the capability for non-EAL threads to use the mempool library. This commit removes the note indicating that the mempool library cannot be used safely by non-EAL threads, and replaces it with a more up-to-date note. Signed-off-by: Gage Eads <gage.eads@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-03 18:53:10 +02:00
David Su	f0d1896fa1	igb_uio: use non-threaded ISR This eliminates the overhead of a task switch when an interrupt arrives. Signed-off-by: David Su <david.w.su@intel.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-03-30 22:26:07 +02:00
Alejandro Lucero	d287e4d41b	igb_uio: map dummy DMA forcing IOMMU domain attachment For using a DPDK app when iommu is enabled, it requires to add iommu=pt to the kernel command line. But using igb_uio driver makes DMAR errors because the device has not an IOMMU domain. Since kernel 3.15, iommu=pt requires to use the internal kernel DMA API for attaching the device to the IOMMU 1:1 mapping, aka si_domain. Previous versions did attach the device to that domain when intel iommu notifier was called. This is not a problem if the driver does later some call to the DMA API because the mapping can be done then. But DPDK apps do not use that DMA API at all. Doing this dma map and unmap is harmless even when iommu is not enabled at all. Signed-off-by: Alejandro Lucero <alejandro.lucero@netronome.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-03-30 22:26:07 +02:00
Alejandro Lucero	94c0776b1b	vfio: support hotplug Current device hotplug is just supported by UIO managed devices. This patch adds same functionality with VFIO. It has been validated through tests using IOMMU and also with VFIO and no-iommu mode. Signed-off-by: Alejandro Lucero <alejandro.lucero@netronome.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2017-03-30 18:40:15 +02:00
Nikhil Rao	bb7927fd21	vfio: fix disabling INTx The flags member of irq_set should be ORed with VFIO_IRQ_SET_ACTION_MASK and not VFIO_IRQ_SET_ACTION_UNMASK. The bug was found by code inspection. Fixes: `5c782b3928` ("vfio: interrupts") Cc: stable@dpdk.org Signed-off-by: Nikhil Rao <nikhil.rao@intel.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2017-03-30 16:59:49 +02:00
Anatoly Burakov	468f42cc26	vfio: fix build on old kernel Fixing compile failures for kernels without sPAPR IOMMU support. Fixes: `0fe9830b53` ("eal/ppc: support sPAPR IOMMU for vfio-pci") Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com>	2017-03-30 16:55:57 +02:00
Ferruh Yigit	d4d2380cbb	kni: fix build with kernel 4.11 compile error: .../build/build/lib/librte_eal/linuxapp/kni/kni_net.c:124:6: error: implicit declaration of function ‘signal_pending’ [-Werror=implicit-function-declaration] if (signal_pending(current) \|\| ret_val <= 0) { ^~~~~~~~~~~~~~ Linux 4.11 moves signal function declarations to its own header file: Linux: 174cd4b1e5fb ("sched/headers: Prepare to move signal wakeup & sigpending methods from <linux/sched.h> into <linux/sched/signal.h>") Use new header file "linux/sched/signal.h" to fix the build error. Cc: stable@dpdk.org Reported-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Tested-by: Pankaj Gupta <pagupta@redhat.com>	2017-03-30 16:45:36 +02:00
Olivier Matz	93e32ea349	mk: fix dependencies to optional configs In rte.lib.mk, the list of libraries passed to the link command (LDLIBS) is generated from the DEPDIRS-xxx variables. If a library is not compiled because it is disabled in configuration, it should not appear in DEPDIRS-xxx. - librte_port depends on librte_kni only if it is enabled. - librte_table depends on librte_acl only if it is enabled. Fixes: `feb9f680cd` ("mk: optimize directory dependencies") Reported-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Tested-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-03-30 15:38:43 +02:00
Olivier Matz	b1b700ce7d	ethdev: add descriptor status API Introduce a new API to get the status of a descriptor. For Rx, it is almost similar to rx_descriptor_done API, except it differentiates "used" descriptors (which are hold by the driver and not returned to the hardware). For Tx, it is a new API. The descriptor_done() API, and probably the rx_queue_count() API could be replaced by this new API as soon as it is implemented on all PMDs. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Andrew Rybchenko <arybchenko@solarflare.com>	2017-03-30 15:02:05 +02:00
Bruce Richardson	a6619414e0	ring: make struct and macros type agnostic Modify the enqueue and dequeue macros to support copying any type of object by passing in the exact object type. Rather than using the "ring" structure member of rte_ring, which is of type "array of void *", instead have the macros take the start of the ring a a pointer value, thereby leaving the rte_ring structure as purely a header value. This allows it to be reused by other future ring types which can add on extra fields if they want, or even to have the actual ring elements, of whatever type stored separate from the ring header. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:32:20 +02:00
Bruce Richardson	6a68df7f23	ring: create common function for updating tail index Both producer and consumer use the same logic for updating the tail index so merge into a single function. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:32:20 +02:00
Bruce Richardson	0dfc98c507	ring: separate out head index manipulation We can write a single common function for head manipulation for enq and a common one for deq, allowing us to have a single worker function for enq and deq, rather than two of each. Update all other inline functions to use the new functions. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:32:20 +02:00
Bruce Richardson	3fe963a85a	ring: reduce scope of local variables The local variable i is only used for loop control so define it in the enqueue and dequeue blocks directly, rather than at the function level. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:32:20 +02:00
Bruce Richardson	ecaed092b6	ring: return remaining entry count when dequeuing Add an extra parameter to the ring dequeue burst/bulk functions so that those functions can optionally return the amount of remaining objs in the ring. This information can be used by applications in a number of ways, for instance, with single-consumer queues, it provides a max dequeue size which is guaranteed to work. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:32:20 +02:00
Bruce Richardson	14fbffb0aa	ring: return free space when enqueuing Add an extra parameter to the ring enqueue burst/bulk functions so that those functions can optionally return the amount of free space in the ring. This information can be used by applications in a number of ways, for instance, with single-producer queues, it provides a max enqueue size which is guaranteed to work. It can also be used to implement watermark functionality in apps, replacing the older functionality with a more flexible version, which enables apps to implement multiple watermark thresholds, rather than just one. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:32:04 +02:00
Bruce Richardson	cfa7c9e6fc	ring: make bulk and burst return values consistent The bulk fns for rings returns 0 for all elements enqueued and negative for no space. Change that to make them consistent with the burst functions in returning the number of elements enqueued/dequeued, i.e. 0 or N. This change also allows the return value from enq/deq to be used directly without a branch for error checking. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:25:37 +02:00
Bruce Richardson	77dd306427	ring: remove watermark support Remove the watermark support. A future commit will add support for having enqueue functions return the amount of free space in the ring, which will allow applications to implement their own watermark checks, while also being more useful to the app. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:25:34 +02:00
Bruce Richardson	82cb88375c	ring: remove the yield when waiting for tail update There was a compile time setting to enable a ring to yield when it entered a loop in mp or mc rings waiting for the tail pointer update. Build time settings are not recommended for enabling/disabling features, and since this was off by default, remove it completely. If needed, a runtime enabled equivalent can be used. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:25:29 +02:00
Bruce Richardson	8c82198978	ring: remove debug setting The debug option only provided statistics to the user, most of which could be tracked by the application itself. Remove this as a compile time option, and feature, simplifying the code. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:25:27 +02:00
Bruce Richardson	d1e138e1b0	ring: eliminate duplication of size and mask fields The size and mask fields are duplicated in both the producer and consumer data structures. Move them out of that into the top level structure so they are not duplicated. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:25:23 +02:00
Bruce Richardson	8526571400	ring: create common structure for prod and cons metadata create a common structure to hold the metadata for the producer and the consumer, since both need essentially the same information - the head and tail values, the ring size and mask. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:25:13 +02:00
Bruce Richardson	d9f0d3a1ff	ring: remove split cacheline build setting Users compiling DPDK should not need to know or care about the arrangement of cachelines in the rte_ring structure. Therefore just remove the build option and set the structures to be always split. On platforms with 64B cachelines, for improved performance use 128B rather than 64B alignment since it stops the producer and consumer data being on adjacent cachelines. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-29 22:21:51 +02:00
David Hunt	d55362bd87	distributor: add symbol versioning Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-29 16:46:57 +02:00
David Hunt	c0de0eb82e	distributor: switch over to new API This is the main switch over between the legacy API and the new burst API. We rename all the functions in rte_distributor.c to remove the _v1705, and we add in _v20 in the rte_distributor_v20.c We also rename the rte_distributor_next.h as rte_distributor.h, as this is now the public header. At the same time, we need the autotests and sample app to compile properly, hence those changes are in this patch also. Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-29 16:46:57 +02:00
David Hunt	6690f105dc	distributor: add SIMD flow matching Add an optimised version of the in-flight flow matching algorithm using SIMD instructions. This should give up to 1.5x over the scalar versions performance. Falls back to scalar version if SSE4.2 not available Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-29 16:46:57 +02:00
David Hunt	775003ad2f	distributor: add new burst-capable library This patch includes the code for new burst-capable distributor library. It also includes the rte_distributor_next.h file which will be used as the public header once we add in the symbol versioning for v20 and v1705 APIs, at which stage we will rename it to rte_distributor.h. The new distributor code contains a very similar API to the legacy code, but now sends bursts of up to 8 mbufs to each worker. Flow ID's are reduced to 15 bits for an optimal flow matching algorithm. Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-29 16:46:57 +02:00
David Hunt	66ec3e8bd2	distributor: create private header file We'll be adding internal implementation definitions in here that are common to both burst and legacy APIs. Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-29 16:46:57 +02:00
David Hunt	73f08e03c9	distributor: rename legacy files Move files out of the way so that we can replace with new versions of the distributor library. Files are named in such a way as to match the symbol versioning that we will apply for backward ABI compatibility. Signed-off-by: David Hunt <david.hunt@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-29 16:46:57 +02:00
Bruce Richardson	6f6d2a66f8	eal/bsd: query the cpu count only once Rather than querying the number of CPUs on the system multiple times, and printing out the number each time, just query the value from sysctl once and store it for future reuse. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 23:56:58 +02:00
Olivier Matz	feb9f680cd	mk: optimize directory dependencies Before this patch, the management of dependencies between directories had several issues: - the generation of .depdirs, done at configuration is slow: it can take more than one minute on some slow targets (usually ~10s on a standard PC without -j). - for instance, it is possible to express a dependency like: - app/foo depends on lib/librte_foo - and lib/librte_foo depends on app/bar But this won't work because the directories are traversed with a depth-first algorithm, so we have to choose between doing 'app' before or after 'lib'. - the script depdirs-rule.sh is too complex. - we cannot use "make -d" for debug, because the output of make is used for the generation of .depdirs. This patch moves the DEPDIRS-* variables in the upper Makefile, making the dependencies much easier to calculate. A DEPDIRS variable is still used to process library dependencies in LDLIBS. After this commit, "make config" is almost immediate. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Tested-by: Robin Jarry <robin.jarry@6wind.com> Tested-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-03-27 23:28:43 +02:00
Billy McFall	44a718c457	ethdev: add API to free consumed buffers in Tx ring Add a new API to force free consumed buffers on Tx ring. API will return the number of packets freed (0-n) or error code if feature not supported (-ENOTSUP) or input invalid (-ENODEV). Signed-off-by: Billy McFall <bmcfall@redhat.com> Acked-by: Keith Wiles <keith.wiles@intel.com>	2017-03-27 17:17:33 +02:00
Aaron Conole	ac71108d64	eal: add info about various init error codes The rte_eal_init function will now pass failure reason hints to the application. To help app developers decipher this, add some brief information about what the codes are indicating. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:59:53 +02:00
Aaron Conole	1908008f5d	eal: do not panic on bus probe/scan failure For now, exit the init. It's likely that even aborting the initialization is premature in this case, as it may be possible to proceed even if one bus or another is not available. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:59:06 +02:00
Aaron Conole	e2c0413f2d	eal: do not panic on vdev init failure Even if one vdev should fail, there's no need to prevent further processing. Log the error, and reflect it to the higher levels to decide. Seems like it's possible to continue. At least, the error is reflected properly in the logs. A user could then go and correct or investigate the situation. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:58:28 +02:00
Aaron Conole	10f6c93cea	eal: do not panic on PCI failures Some devices may be inaccessible for a variety of reasons, or the PCI-bus may be unavailable causing the whole thing to fail. Still, better to continue attempts at probes. Since PCI isn't neccessarily required, it may be possible to simply log the error and continue on letting the user check the logs and restart the application when things have failed. This will usually be an issue because of permissions. However, it could also be caused by OOM. In either case, errno will contain the underlying cause. For linux, it is safe to re-init the system here, so allow the application to take corrective action and reinit. For BSD, this is not the case, for other reasons, including hugepage allocation has already happened, and needs to be properly uninitialized. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:58:00 +02:00
Aaron Conole	4fe1d33987	eal: do not panic if plugins fail to init Plugins are useful and important. However, it seems crazy to abort everything just because they don't initialize properly. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:57:13 +02:00
Aaron Conole	c050e5abae	eal: do not panic on interrupt thread init There could be some confusion as to why the call failed - this change will always reflect the value of the error in rte_error. When initializing the interrupt thread, there are a number of possible reasons for failure - some of which are correctable by the application. Do not panic() needlessly, and give the application a change to reflect this information to the user. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:56:59 +02:00
Aaron Conole	330bed86d3	eal: do not panic on timer init failure After code inspection, there is no way for eal_timer_init() to fail. It simply returns 0 in all cases. As such, this test could either go-away or stay here as 'future-proofing'. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:55:49 +02:00
Aaron Conole	7d5c430f69	eal: do not panic on a number of conditions When log initialization fails, it's generally because the fopencookie failed. While this is rare in practice, it could happen, and it is likely because of memory pressure. So, flag the error, and allow the user to retry. Memory init can only fail when access to hugepages (either as primary or secondary process) fails (and that is usually permissions). Since the manner of failure is not reversible, we cannot allow retry. There are some theoretical racy conditions in the system that _could_ cause early tailq init to fail; however, no need to panic the application. While it can't continue using DPDK, it could make better alerts to the user. rte_eal_alarm_init() call uses the linux timerfd framework to create a poll()-able timer using standard posix file operations. This could fail for a few reasons given in the man-pages, but many could be corrected by the user application. No need to panic. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:54:49 +02:00
Aaron Conole	8f113d9818	eal: set errno when exiting for already initialized Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:53:46 +02:00
Aaron Conole	ce3bede01e	eal: do not panic on memzone init failure When memzone initialization fails, report the error to the calling application rather than panic(). Without a good way of detaching / releasing hugepages, at this point the application will have to restart. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:53:06 +02:00
Aaron Conole	a0222a4679	eal: do not panic on argument parsing error It's possible that the application could take a corrective action here, and either prompt the user for different arguments, or at least perform a better logging. Exiting this early prevents any useful information gathering from the application layer. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:52:08 +02:00
Aaron Conole	547a61af71	eal: do not panic on hugepage info init When attempting to scan hugepages, signal to the eal that an error has occurred, rather than performing a panic. If we fail to acquire hugepage information, simply signal an error to the application. This clears the run_once counter, allowing the user or application to take a corrective action and retry. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:50:37 +02:00
Aaron Conole	37e97ad2c5	eal: do not panic when CPU is not supported This adds a new API to check for the eal cpu versions. It's now possible to gracefully exit the application, or for applications which support non-dpdk datapaths working in concert with DPDK datapaths, there no longer is the possibility of exiting for unsupported CPUs. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:50:09 +02:00
Aaron Conole	647644e51f	eal: do not panic on CPU detection There may be no way to gracefully recover, but the application should be notified that a failure happened, rather than completely aborting. This allows the user to proceed with a "slow-path" type solution. After this change, the EAL CPU NUMA node resolution step can no longer emit an rte_panic. This aligns with the code in rte_eal_init, which expects failures to return an error code. Signed-off-by: Aaron Conole <aconole@redhat.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-27 15:47:10 +02:00
Ben Walker	24a5357968	pci: fix device registration on FreeBSD The FreeBSD implementation wasn't registering new devices with the device framework on start up. However, common code attempts to unregister them on shutdown which causes a SEGFAULT. This fix makes the FreeBSD code do the same thing as the Linux code for registration. Fixes: `13a1317d3b` ("pci: create device list and fallback on its members") Cc: stable@dpdk.org Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-03-27 12:07:53 +02:00
Vladyslav Buslov	d89a5bce1d	lpm6: extend next hop field This patch extend next_hop field from 8-bits to 21-bits in LPM library for IPv6. Added versioning symbols to functions and updated library and applications that have a dependency on LPM library. Signed-off-by: Vladyslav Buslov <vladyslav.buslov@harmonicinc.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-15 18:49:41 +01:00
Matt Peters	b61befb48c	igb_uio: support devices with only I/O BAR Allow the BAR setup to succeed if a device has at least 1 BAR region defined. Previously, the device probe would only succeed if at least one memory BAR existed, but there are devices that have only port I/O BARs. For example, on Virtual Box a virtio device has only a single I/O BAR because by default MSI-X is not enabled. While in qemu/kvm the virtio device has MSI-X enabled and therefore has both an I/O and Memory BAR. The following are excerpts from "lspci -nnvvvv -s 00:09.0" on both types of systems. Virtual Box: Region 0: I/O ports at d260 [size=32] Capabilities: [80] #00 [0000] QEMU/KVM: Region 0: I/O ports at c060 [size=32] Region 1: Memory at febd1000 (32-bit, non-prefetchable) [size=4K] Expansion ROM at feb80000 [disabled] [size=256K] Capabilities: [40] MSI-X: Enable+ Count=3 Masked- Vector table: BAR=1 offset=00000000 PBA: BAR=1 offset=00000800 Signed-off-by: Matt Peters <matt.peters@windriver.com> Signed-off-by: Allain Legacy <allain.legacy@windriver.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-03-15 14:02:41 +01:00
Hemant Agrawal	5a11168d9b	mbuf: use pktmbuf helper to create the pool When possible, replace the uses of rte_mempool_create() with the helper provided in librte_mbuf: rte_pktmbuf_pool_create(). This is the preferred way to create a mbuf pool. This also updates the documentation. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-15 13:48:02 +01:00
Thomas Monjalon	31123211bd	remove unmaintained TILE-Gx architecture The TILE-Gx architecture and its driver mpipe are not maintained. The code is removed to avoid confusion. A last update has been done in 17.05 before removal. It can be built with the updated toolchain: http://www.mellanox.com/repository/solutions/tile-scm/ and libgxio: http://www.mellanox.com/repository/solutions/tile-scm/libgxio-1.0.tar.xz Quote from http://dpdk.org/ml/archives/dev/2017-February/057940.html " Mellanox agrees to remove TILE-Gx support from DPDK.org, but will continue to support customers using DPDK. Customer that needs support should contact Mellanox directly. " Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-03-15 11:40:57 +01:00
Olivier Matz	0ef850c4f6	ethdev: move a queue id check to generic layer The check of queue_id is done in all drivers implementing rte_eth_rx_queue_count(). Factorize this check in the generic function. Note that the nfp driver was doing the check differently, which could induce crashes if the queue index was too big. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-09 19:29:51 +01:00
Olivier Matz	44e93f4a34	ethdev: clarify API comments of Rx queue count The API comments are not consistent between each other. The function rte_eth_rx_queue_count() returns the number of used descriptors on a receive queue. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-03-09 19:27:40 +01:00
Gowrishankar Muthukrishnan	0fe9830b53	eal/ppc: support sPAPR IOMMU for vfio-pci Below changes adds pci probing support for vfio-pci devices in power8. Signed-off-by: Gowrishankar Muthukrishnan <gowrishankar.m@linux.vnet.ibm.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: Chao Zhu <chaozhu@linux.vnet.ibm.com>	2017-03-09 18:39:45 +01:00
Ben Walker	cdc242f260	eal/linux: support running as unprivileged user For Linux kernel 4.0 and newer, the ability to obtain physical page frame numbers for unprivileged users from /proc/self/pagemap was removed. Instead, when an IOMMU is present, simply choose our own DMA addresses instead. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com>	2017-03-09 17:08:46 +01:00
Bruce Richardson	03437f2947	ring: add a function to return the ring size Applications and other libraries should not be reading inside the rte_ring structure directly to get the ring size. Instead add a fn to allow it to be queried. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-03-08 16:05:19 +01:00
Jan Blunck	b2fba63690	eal: ensure constness of container_of target This adds a check to ensure that the container_of() macro is not used to cast away (remove) constness. Signed-off-by: Jan Blunck <jblunck@infradead.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-08 14:04:29 +01:00
Jan Blunck	7cfd280578	eal: fix container_of macro for const members This fixes the usage of structure members that are declared const to get a pointer to the embedding parent structure. Signed-off-by: Jan Blunck <jblunck@infradead.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-03-08 13:48:36 +01:00
Chris Metcalf	dd0eedb1cf	tile: fix build Re-enable CONFIG_RTE_LIBRTE_SCHED, since it is needed to build correctly. Fix a few warnings when compiling mpipe_tilegx.c. Remove an empty rte_cpu_feature_table[] array using a bogus type. Properly set RTE_OBJCOPY_{TARGET,ARCH} in mk/arch/tile/rte.vars.mk. Signed-off-by: Chris Metcalf <cmetcalf@mellanox.com>	2017-02-27 16:44:32 +01:00
Chris Metcalf	f80468b680	eal/tile: avoid use of non-upstreamed header It's trivial to directly invoke a read of the special-purpose register that holds the clock cycle counter, so just do that. Signed-off-by: Chris Metcalf <cmetcalf@mellanox.com>	2017-02-27 16:44:23 +01:00
Olivier Matz	93092a5610	mempool: remove deprecated get and put functions As announced in the deprecation notice, remove the functions for single/multi producer/consumer enqueue/dequeue. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-02-21 12:05:46 +01:00
Olivier Matz	f3bc028909	mempool: remove deprecated count functions As announced in the deprecation notice, remove these functions. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-02-21 12:05:46 +01:00
Thomas Monjalon	420195e6af	log: remove old symbols from map When removing log history functions, the map has not been updated. Fixes: `d7e61ad3ae` ("log: remove deprecated history dump") Reported-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-02-21 11:43:45 +01:00
Ferruh Yigit	aa0d7c2d32	kni: remove KNI vhost support Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-02-21 11:43:07 +01:00
Thomas Monjalon	d450914ab8	version: 17.05-rc0 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2017-02-17 12:17:39 +01:00
Thomas Monjalon	b9ebab26d9	version: 17.02.0 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-02-14 22:17:45 +01:00
Pablo de Lara	b09efeb9d5	doc: add thread-safety information about EFD library Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2017-02-14 21:48:36 +01:00
Dmitriy Yakovlev	e7ee2ca1c9	cfgfile: fix uninitialized variable on load error Uninitialized scalar variable. Using uninitialized value cfg->sections[curr_section]->num_entries when calling rte_cfgfile_close. And memory in variables cfg->sections[curr_section], sect->entries[curr_entry] maybe not equal NULL. We must decrement counters curr_section, curr_entry when failed to realloc. Fixes: `eaafbad419` ("cfgfile: library to interpret config files") Signed-off-by: Dmitriy Yakovlev <bombermag@gmail.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-02-14 18:13:48 +01:00
Qi Zhang	2eed820fd4	vfio: fix maximum number of interrupt for MSI-X The max number of interrupt request is possible be changed after rte_intr_callback_register, so in get_max_intr, we need to check if necessary to update the max_intr. Signed-off-by: Qi Zhang <qi.z.zhang@intel.com>	2017-02-13 22:25:04 +01:00
Andrew Rybchenko	549b3587f5	ethdev: fix typo in UDP tunnel API description Fixes: `1cbe755fef` ("ethdev: rename UDP tunnel port functions") Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com>	2017-02-13 22:22:19 +01:00
Thomas Monjalon	47aa9d4e0d	version: 17.02-rc3 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-02-10 17:15:32 +01:00
Slawomir Mrozowicz	b75a76d354	cryptodev: fix crash when querying device by name This patch fixes a segmentation fault in function rte_cryptodev_devices_get(), due to incorrect driver name path. It reworks the function to use correct types and clean up for visibility. Coverity issue: 141067 Fixes: `38227c0e3a` ("cryptodev: retrieve device info") Signed-off-by: Slawomir Mrozowicz <slawomirx.mrozowicz@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-02-10 15:57:29 +01:00
Jingjing Wu	64c1375b83	mbuf: fix bitmask of Tx offload flags Add missed PKT_TX_MACSEC and PKT_TX_IEEE1588_TMST flags to bitmask of all supported packet Tx offload features flags. Fixes: `4fb7e803eb` ("ethdev: add Tx preparation") Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-02-10 12:25:49 +01:00
Yong Wang	511a4c74b8	pci: fix UIO interrupt file descriptor check before close The "dev->intr_handle.fd" is possibly a negative value while it is passed as an argument to function "close". Fix the check to the fd. Fixes: `5a60a7ffc8` ("pci: introduce functions to alloc and free uio resource") Signed-off-by: Yong Wang <wang.yong19@zte.com.cn>	2017-02-10 14:23:27 +01:00
Alan Dewar	3b780b9e9e	sched: fix crash when freeing port Prevent a segmentation fault in rte_sched_port_free by only accessing the port structure after the NULL pointer check has been made. Fixes: `7b3c4f35` ("sched: fix releasing enqueued packets") Cc: stable@dpdk.org Signed-off-by: Alan Dewar <adewar@brocade.com> Signed-off-by: Jan Blunck <jblunck@infradead.org> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-02-09 18:46:52 +01:00
Patrick MacArthur	811b6b2506	vfio: fix file descriptor leak in multi-process When a secondary process wants access to the VFIO container file descriptor, the primary process calls vfio_get_container_fd() which always opens an entirely new file descriptor on /dev/vfio/vfio. However, once the file descriptor has been passed to the subprocess, it is effectively duplicated, meaning that the copy of the file descriptor in the primary process is no longer needed. However, the primary process does not close the duplicate fd, which results in a resource leak. This can be reproduced by starting a primary process with a small RLIMIT_NOFILE limit configured to use VFIO for at least one device, and repeatedly launching secondary processes until the file descriptor limit is exceeded. Fix the resource leak by closing the local vfio container file descriptor after passing it to the secondary process. Fixes: `2f4adfad0a` ("vfio: add multiprocess support") Cc: stable@dpdk.org Signed-off-by: Patrick MacArthur <patrick@patrickmacarthur.net> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2017-02-09 18:39:30 +01:00
Thomas Monjalon	5b243cbab2	version: 17.02-rc2 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-30 23:47:11 +01:00
Emmanuel Roullit	68759bbe73	vhost: remove unneeded variable assignment Found with clang static analysis: lib/librte_vhost/vhost_user.c:996:3: warning: Value stored to 'ret' is never read ret = vhost_user_get_vring_base(dev, &msg.payload.state); ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Signed-off-by: Emmanuel Roullit <emmanuel.roullit@gmail.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-30 13:47:20 +01:00
Emmanuel Roullit	5c1f70daaf	vhost: do not GSO when no header is present Found with clang static analysis: lib/librte_vhost/virtio_net.c:723:17: warning: Access to field 'data_off' results in a dereference of a null pointer (loaded from variable 'tcp_hdr') m->l4_len = (tcp_hdr->data_off & 0xf0) >> 2; ^~~~~~~~~~~~~~~~~ Fixes: `d0cf91303d` ("vhost: add Tx offload capabilities") Cc: stable@dpdk.org Signed-off-by: Emmanuel Roullit <emmanuel.roullit@gmail.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-30 13:46:57 +01:00
Yuanhan Liu	b8b992e93f	vhost: fix long stall of negotiation Setting up the mapping from GPA (guest physical address) to HPA (guest physical address) could be very time consuming when the guest memory is backened with small pages (4K). The bigger the guest memory, the longer it takes. This could lead a very long vhost-user negotiation. Since the mapping is only needed in zero copy mode so far, we could avoid such time consuming settup when zero copy is turned off (which is the default case). It's actually a workaround, a right fix might be to start a new thread, and hide the big latency there. Fixes: `e246896178` ("vhost: get guest/host physical address mappings") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-01-28 14:25:40 +01:00
Yuanhan Liu	cc7301908c	vhost: fix dead loop in enqueue path If a malicious guest forges a dead loop desc chain (let desc->next point to itself) and desc->len is zero, this could lead to a dead loop in copy_mbuf_to_desc(following is a simplified code to show this issue clearly): while (mbuf_is_not_totally_consumed) { if (desc_avail == 0) { desc = &descs[desc->next]; desc_avail = desc->len; } COPY(desc, mbuf, desc_avail); } I have actually fixed a same issue before: commit `a436f53ebf` ("vhost: avoid dead loop chain"); it fixes the dequeue path though, leaving the enqueue path still vulnerable. The fix is the same. Add a var nr_desc to avoid the dead loop. Fixes: `f1a519ad98` ("vhost: fix enqueue/dequeue to handle chained vring descriptors") Cc: stable@dpdk.org Reported-by: Xieming Katty <katty.xieming@huawei.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-01-28 14:25:23 +01:00
Slawomir Mrozowicz	38227c0e3a	cryptodev: retrieve device info This patch adds helper functions for new performance application which provide identifiers and number of crypto device and provide and check capabilities available for defined device and algorithm. The performance application can be used to measure throughput and latency of cryptography operation performed by crypto device. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Signed-off-by: Slawomir Mrozowicz <slawomirx.mrozowicz@intel.com> Signed-off-by: Marcin Kerlin <marcinx.kerlin@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-01-30 17:46:36 +01:00
Fan Zhang	547017d80a	cryptodev: add scheduler PMD name and type This patch adds the cryptodev scheduler PMD name and type identifier to librte_cryptodev. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-01-30 17:23:33 +01:00
Hemant Agrawal	3b84878c4b	cryptodev: decouple from PCI device This makes struct rte_cryptodev independent of struct rte_pci_device by replacing it with a pointer to the generic struct rte_device. This is inline with the recent changes in ethdev Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com> Acked-by: John Griffin <john.griffin@intel.com> Reviewed-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-01-30 17:23:33 +01:00
Declan Doherty	b815872e70	cryptodev: uninline some functions rte_cryptodev_pmd_get_dev, rte_cryptodev_pmd_get_named_dev, rte_cryptodev_pmd_is_valid_dev were incorrectly marked as inline and therefore not useable from crypto PMDs when built as shared libraries as they accessed the global rte_cryptodev_globals device structure. Fixes: `d11b0f30` ("cryptodev: introduce API and framework for crypto devices") Signed-off-by: Declan Doherty <declan.doherty@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com>	2017-01-30 17:23:33 +01:00
Andrew Rybchenko	370621b5a1	pdump: fix coverage warning Fix GCC 4.8.2 20140120 (Red Hat 4.8.2-16) (RHEL 7.0) false warning when build with EXTRA_CFLAGS='--coverage'. Fixes: `278f945402` ("pdump: add new library for packet capture") Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com>	2017-01-30 17:04:46 +01:00
Michał Mirosław	aad0c999b3	acl: fix flow data comments Signed-off-by: Michał Mirosław <michal.miroslaw@atendesoftware.pl> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-30 11:15:11 +01:00
Michał Mirosław	c6c7a8d7e6	acl: allow zero verdict This enables ACL matches to return 0 where the distinction from no-match case is not needed. Signed-off-by: Michał Mirosław <michal.miroslaw@atendesoftware.pl> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-30 11:08:47 +01:00
Olivier Matz	cb8fac62be	efd: fix build by removing dependency to libmath When we compile the dpdk with: CONFIG_RTE_LIBRTE_EFD=y CONFIG_RTE_LIBRTE_NFP_PMD=n CONFIG_RTE_LIBRTE_THUNDERX_NICVF_PMD=n CONFIG_RTE_LIBRTE_SCHED=n CONFIG_RTE_LIBRTE_METER=n The linker gives the following error: lib/librte_efd.a(rte_efd.o): In function `rte_efd_create': lib/librte_efd/rte_efd.c:560: undefined reference to `log2' collect2: error: ld returned 1 exit status This is because the '-lm' is missing in mk/rte.app.mk. An alternative, which is proposed by this patch, is to use the compiler builtin rte_bsf32() to process log2 instead of the libmath log2() that requires to include math.h and link with -lm. Fixes: `56b6ef874f` ("efd: new Elastic Flow Distributor library") Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-01-30 10:58:40 +01:00
Emmanuel Roullit	7eb8895d40	ethdev: remove useless pointer initialization Found with clang static analysis: lib/librte_ether/rte_ethdev.c:2467:22: warning: Value stored to 'dev' during its initialization is never read struct rte_eth_dev *dev = &rte_eth_devices[port_id]; ^~~ ~~~~~~~~~~~~~~~~~~~~~~~~~ Fixes: `88ac4396ad` ("ethdev: add VMDq support") Signed-off-by: Emmanuel Roullit <emmanuel.roullit@gmail.com>	2017-01-30 10:36:18 +01:00
Steve Shin	9bdfc1e596	ethdev: fix MAC address replay This patch fixes a bug in replaying MAC address to the hardware in rte_eth_dev_config_restore() routine. Added default MAC replay as well. Fixes: `4bdefaade6` ("ethdev: VMDQ enhancements") Signed-off-by: Steve Shin <jonshin@cisco.com> Reviewed-by: Igor Ryzhov <iryzhov@nfware.com>	2017-01-30 10:15:57 +01:00
Ilya V. Matveychikov	7c93195224	mbuf: remove redundant assignment when attaching mi->next will be assigned to NULL few lines later, trivial patch Fixes: `ea672a8b16` ("mbuf: remove the rte_pktmbuf structure") Signed-off-by: Ilya V. Matveychikov <matvejchikov@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-30 10:08:54 +01:00
Olivier Matz	e09ff22d53	mempool: fix stack handler dequeue The return value of the stack handler is wrong: it should be 0 on success, not the number of objects dequeued. This could lead to memory leaks depending on how the caller checks the return value (ret < 0 or ret != 0). This was also breaking autotests with debug enabled, because the debug cookies are only updated when the function returns 0, so the cookies were not updated, leading to an abort(). Fixes: 295a530b0844 ("mempool: add stack mempool handler") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-29 23:38:33 +01:00
Emmanuel Roullit	3cdfdf2a33	devargs: reset driver name pointer on parsing failure The pointer set by strdup() needs to be cleared on failure to avoid a potential double-free from the caller. Found with clang static analysis: lib/librte_eal/common/eal_common_devargs.c:123:2: warning: Attempt to free released memory free(buf); ^~~~~~~~~ Fixes: `0fe11ec592` ("eal: add vdev init and uninit") Signed-off-by: Emmanuel Roullit <emmanuel.roullit@gmail.com>	2017-01-29 23:34:07 +01:00
Olivier Matz	ec7df18bb7	eal: fix warning about debug log at startup The log "Debug logs available - lower performance" should now only be displayed when dataplane debug logs are enabled. The issue occurs only if the default log level (CONFIG_RTE_LOG_LEVEL) is set to DEBUG in the configuration, which is not the case by default. Fixes: `5d8f0baf69` ("log: do not drop debug logs at compile time") Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-29 22:58:14 +01:00
Michał Mirosław	d613f57dd0	kni: guard against unterminated name oops If the name is too long, it triggers BUG in alloc_netdev(). Signed-off-by: Michał Mirosław <michal.miroslaw@atendesoftware.pl> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-01-29 22:50:28 +01:00
Michał Mirosław	4d0db6df1c	kni: set interface name source as user-space Signed-off-by: Michał Mirosław <michal.miroslaw@atendesoftware.pl> Acked-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-01-29 22:47:30 +01:00
Ferruh Yigit	b2b0f85182	kni: add build option for ethtool support Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-01-29 22:36:26 +01:00
Yuanhan Liu	61207d014f	ethdev: fix data reset when allocating port Fix an silly error by auto-complete while managing the merge conflicts. It's the eth_dev_data (but not eth_dev) entry should be memset. Fixes: `d948f596fe` ("ethdev: fix port data mismatched in multiple process model") Reported-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-20 19:03:57 +01:00
Pablo de Lara	2ee926f1fd	eal: fix FreeBSD build rte_bus_scan() and rte_bus_probe() have been introduced in eal.c, but it is missing the rte_bus.h header file, for BSD systems. Fixes: `f44abbc12f` ("bus: add scanning") Fixes: `c3cec1d807` ("bus: add probing") Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-01-19 15:29:45 +01:00
Thomas Monjalon	6818a7f480	version: 17.02-rc1 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-19 05:54:41 +01:00
Shreyansh Jain	c3cec1d807	bus: add probing Bus implementations can implement a probe handler to match the devices scanned against the drivers registered. Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-19 04:58:17 +01:00
Shreyansh Jain	f44abbc12f	bus: add scanning Scan for bus discovers the devices available on the bus and adds them to a bus specific device list. Each bus mandatorily implements this method. Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-19 04:58:12 +01:00
Shreyansh Jain	a97725791e	bus: introduce bus abstraction This patch introduces the rte_bus abstraction for EAL. The model is: - One or more devices are connected to a Bus - Drivers are running instances which manage one or more devices - Bus is responsible for identifying devices (and interrupt propogation) - Driver is responsible for initializing the device This patch adds a 'rte_bus' base class which would be extended for specific implementations. It also introduces Bus registration and deregistration functions. Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-19 04:57:18 +01:00
Zbigniew Bodek	c2fec02245	cryptodev: introduce ARM-specific feature flags Add two new feature flags: * RTE_CRYPTODEV_FF_CPU_NEON represents ARM NEON (TM) instructions * RTE_CRYPTODEV_FF_CPU_ARM_CE represents ARM crypto extensions Add them to both cryptodev library, documentation and relevant PMD driver for ARMv8. Signed-off-by: Zbigniew Bodek <zbigniew.bodek@caviumnetworks.com>	2017-01-19 01:00:55 +01:00
Zbigniew Bodek	169ca3db55	crypto/armv8: add PMD optimized for ARMv8 processors This patch introduces crypto poll mode driver using ARMv8 cryptographic extensions. CPU compatibility with this driver is detected in run-time and virtual crypto device will not be created if CPU doesn't provide: AES, SHA1, SHA2 and NEON. This PMD is optimized to provide performance boost for chained crypto operations processing, such as encryption + HMAC generation, decryption + HMAC validation. In particular, cipher only or hash only operations are not provided. The driver currently supports AES-128-CBC in combination with: SHA256 HMAC and SHA1 HMAC and relies on the external armv8_crypto library: https://github.com/caviumnetworks/armv8_crypto Build ARMv8 crypto PMD if compiling for ARM64 and CONFIG_RTE_LIBRTE_PMD_ARMV8_CRYPTO option is enable in the configuration file. ARMV8_CRYPTO_LIB_PATH environment variable will point to the appropriate library directory. Signed-off-by: Zbigniew Bodek <zbigniew.bodek@caviumnetworks.com> Reviewed-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-19 01:00:55 +01:00
Fan Zhang	d803b4439d	cryptodev: add user defined name for vdev This patch adds a user defined name initializing parameter to cryptodev library. Originally, for software cryptodev PMD, the vdev name parameter is treated as the driver identifier, and will create an unique name for each device automatically, which is not necessarily as same as the vdev parameter. This patch allows the user to either create a unique name for his software cryptodev, or by default, let the system creates a unique one. This should help the user managing the created cryptodevs easily. Examples: CLI command fragment 1: --vdev "crypto_aesni_gcm_pmd" The above command will result in creating a AESNI-GCM PMD with name of "crypto_aesni_gcm_X", where postfix X is the number assigned by the system, starting from 0. This fragment can be placed in the same CLI command multiple times, resulting the postfixs incremented by one for each new device. CLI command fragment 2: --vdev "crypto_aesni_gcm_pmd,name=gcm1" The above command will result in creating a AESNI-GCM PMD with name of "gcm1". This fragment can be placed in the same CLI command multiple times, as long as each having a unique name value. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2017-01-18 21:48:56 +01:00
Tomasz Kulasek	2d03ec6abd	crypto: support scatter-gather in software drivers This patch introduces RTE_CRYPTODEV_FF_MBUF_SCATTER_GATHER feature flag informing that selected crypto device supports segmented mbufs natively and doesn't need to be coalesced before crypto operation. While using segmented buffers in crypto devices may have unpredictable results, for PMDs which doesn't support it natively, additional check is made for debug compilation. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2017-01-18 21:48:56 +01:00
Fan Zhang	c5b70818ad	cryptodev: fix loop in device query This patch fixes the dev value update problem in rte_cryptodev_pmd_get_named_dev, orginally, dev won't be updated after the initial step in the loop. Fixes: `d11b0f30df` ("cryptodev: introduce API and framework for crypto devices") Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2017-01-18 21:48:56 +01:00
Declan Doherty	84d7965866	crypto/aesni_mb: support AVX512 Release v0.44 of Intel(R) Multi-Buffer Crypto for IPsec library adds support for AVX512 instructions. This patch enables the new AVX512 accelerated functions from the aesni_mb_pmd crypto poll mode driver. This patch set requires that the aesni_mb_pmd is linked against the version 0.44 or greater of the Multi-Buffer Crypto for IPsec library. Signed-off-by: Declan Doherty <declan.doherty@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2017-01-18 21:48:56 +01:00
Arek Kusztal	d59626753c	cryptodev: add DES CBC cipher algorithm This commit adds DES CBC ciper algorithm to available algorithms Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com> Acked-by: Fiona Trahe <fiona.trahe@intel.com>	2017-01-18 21:45:15 +01:00
Jerin Jacob	53a3ba0c36	cryptodev: fix crash on null dereference crypodev->data->name will be null when rte_cryptodev_get_dev_id() invoked without a valid crypto device instance. Fixes: `d11b0f30df` ("cryptodev: introduce API and framework for crypto devices") Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Arek Kusztal <arkadiuszx.kusztal@intel.com>	2017-01-18 21:45:15 +01:00
Fiona Trahe	50d7f314de	cryptodev: remove unused digest-appended feature The cryptodev API had specified that if the digest address field was left empty on an authentication operation, then the PMD would assume the digest was appended to the source or destination data. This case was not handled at all by most PMDs and incorrectly handled by the QAT PMD. As no bugs were raised, it is assumed to be not needed, so this patch removes it, rather than add handling for the case on all PMDs. The digest can still be appended to the data, but its address must now be provided in the op. Signed-off-by: Fiona Trahe <fiona.trahe@intel.com> Acked-by: John Griffin <john.griffin@intel.com>	2017-01-18 21:45:15 +01:00
Pablo de Lara	86d8989688	efd: add AVX2 vector lookup function Signed-off-by: Byron Marohn <byron.marohn@intel.com> Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Signed-off-by: Saikrishna Edupuganti <saikrishna.edupuganti@intel.com> Acked-by: Christian Maciocco <christian.maciocco@intel.com>	2017-01-18 20:53:45 +01:00
Pablo de Lara	56b6ef874f	efd: new Elastic Flow Distributor library Elastic Flow Distributor (EFD) is a distributor library that uses perfect hashing to determine a target/value for a given incoming flow key. It has the following advantages: - First, because it uses perfect hashing, it does not store the key itself and hence lookup performance is not dependent on the key size. - Second, the target/value can be any arbitrary value hence the system designer and/or operator can better optimize service rates and inter-cluster network traffic locating. - Third, since the storage requirement is much smaller than a hash-based flow table (i.e. better fit for CPU cache), EFD can scale to millions of flow keys. Finally, with current optimized library implementation performance is fully scalable with number of CPU cores. Signed-off-by: Byron Marohn <byron.marohn@intel.com> Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Signed-off-by: Saikrishna Edupuganti <saikrishna.edupuganti@intel.com> Acked-by: Christian Maciocco <christian.maciocco@intel.com>	2017-01-18 20:53:28 +01:00
Jerin Jacob	b15740bd43	eal/arm64: change barrier definitions to macros Change rte_*wb definitions to macros in order to keep consistent with other barrier definitions in the file. Suggested-by: Jianbo Liu <jianbo.liu@linaro.org> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 17:18:26 +01:00
Jerin Jacob	e783b81d44	eal/arm64: override I/O device read/write access Override the generic I/O device memory read/write access and implement it using armv8 instructions for arm64. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 17:18:26 +01:00
Jerin Jacob	a2c4cd8648	eal: let all architectures use generic I/O implementation Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 17:17:28 +01:00
Jerin Jacob	4540fbb2ae	eal: add generic I/O device read/write implementation This patch implements the generic version of rte_read[b/w/l/q]_[relaxed] and rte_write[b/w/l/q]_[relaxed] using rte_io_wmb() and rte_io_rmb() Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 17:12:05 +01:00
Jerin Jacob	69736db1d8	eal: introduce I/O device memory read/write operations This commit introduces 8-bit, 16-bit, 32bit, 64bit I/O device memory read/write operations along with the relaxed versions. The weakly-ordered machine like ARM needs additional I/O barrier for device memory read/write access over PCI bus. By introducing the eal abstraction for I/O device memory read/write access, The drivers can access I/O device memory in architecture agnostic manner. The relaxed version does not have additional I/O memory barrier, useful in accessing the device registers of integrated controllers which implicitly strongly ordered with respect to memory access. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	2cf953cfd8	eal/arm64: define I/O device memory barriers CC: Jianbo Liu <jianbo.liu@linaro.org> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	67ce81bd3d	eal/arm64: define SMP barrier dmb instruction based barrier is used for smp version of memory barrier. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	84733fd0d7	eal/arm64: fix memory barrier definition dsb instruction based barrier is used for non smp version of memory barrier. Fixes: `d708f01b71` ("eal/arm: add atomic operations for ARMv8") Cc: stable@dpdk.org Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Jianbo Liu <jianbo.liu@linaro.org>	2017-01-18 16:57:11 +01:00
Jerin Jacob	b41508b7a4	eal/armv7: define I/O device memory barriers The patch does not provide any functional change for ARMv7. I/O barriers are mapped to existing smp barriers. CC: Jan Viktorin <viktorin@rehivetech.com> CC: Jianbo Liu <jianbo.liu@linaro.org> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	38b636b7cc	eal/arm: separate SMP barrier definition for ARMv7 and ARMv8 Separate the smp barrier definition for arm and arm64 for fine control on smp barrier definition for each architecture. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	da07a35be5	eal/ppc64: define I/O device memory barriers The patch does not provide any functional change for ppc_64. I/O barriers are mapped to existing smp barriers. CC: Chao Zhu <chaozhu@linux.vnet.ibm.com> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	dff90714b1	eal/tile: define I/O device memory barriers The patch does not provide any functional change for tile. I/O barriers are mapped to existing smp barriers. CC: Zhigang Lu <zlu@ezchip.com> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	e802521171	eal/x86: define I/O device memory barriers The patch does not provide any functional change for IA. I/O barriers are mapped to existing smp barriers. CC: Bruce Richardson <bruce.richardson@intel.com> CC: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Jerin Jacob	1ea155733e	eal: introduce I/O device memory barriers This commit introduce rte_io_mb(), rte_io_wmb() and rte_io_rmb(), in order to enable memory barriers between I/O device and CPU. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-18 16:57:11 +01:00
Wei Zhao	99e7003831	net/ixgbe: parse L2 tunnel filter check if the rule is a L2 tunnel rule, and get the L2 tunnel info. Signed-off-by: Wei Zhao <wei.zhao1@intel.com> Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Beilei Xing <beilei.xing@intel.com> Acked-by: Wei Dai <wei.dai@intel.com>	2017-01-17 19:41:43 +01:00
Michał Mirosław	d75547718c	net/i40e: return errno when interrupt setup fails Signed-off-by: Michał Mirosław <michal.miroslaw@atendesoftware.pl> Reviewed-by: Jingjing Wu <jingjing.wu@intel.com>	2017-01-17 19:41:42 +01:00
Bernard Iremonger	5e823a4512	ethdev: remove some VF functions remove the following API's: rte_eth_dev_set_vf_rxmode rte_eth_dev_set_vf_rx rte_eth_dev_set_vf_tx rte_eth_dev_set_vf_vlan_filter rte_eth_dev_set_vf_rate_limit Increment LIBABIVER in Makefile Remove deprecation notice for removing rte_eth_dev_set_vf_* API's. Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com>	2017-01-17 19:40:50 +01:00
Qiming Yang	2191347120	ethdev: add firmware version get This patch adds a new API 'rte_eth_dev_fw_version_get' for fetching firmware version by a given device. Signed-off-by: Qiming Yang <qiming.yang@intel.com> Acked-by: Remy Horton <remy.horton@intel.com>	2017-01-17 22:34:35 +01:00
Zhihong Wang	f5472703c0	eal: optimize aligned memcpy on x86 This patch optimizes rte_memcpy for well aligned cases, where both dst and src addr are aligned to maximum MOV width. It introduces a dedicated function called rte_memcpy_aligned to handle the aligned cases with simplified instruction stream. The existing rte_memcpy is renamed as rte_memcpy_generic. The selection between them 2 is done at the entry of rte_memcpy. The existing rte_memcpy is for generic cases, it handles unaligned copies and make store aligned, it even makes load aligned for micro architectures like Ivy Bridge. However alignment handling comes at a price: It adds extra load/store instructions, which can cause complications sometime. DPDK Vhost memcpy with Mergeable Rx Buffer feature as an example: The copy is aligned, and remote, and there is header write along which is also remote. In this case the memcpy instruction stream should be simplified, to reduce extra load/store, therefore reduce the probability of load/store buffer full caused pipeline stall, to let the actual memcpy instructions be issued and let H/W prefetcher goes to work as early as possible. This patch is tested on Ivy Bridge, Haswell and Skylake, it provides up to 20% gain for Virtio Vhost PVP traffic, with packet size ranging from 64 to 1500 bytes. The test can also be conducted without NIC, by setting loopback traffic between Virtio and Vhost. For example, modify the macro TXONLY_DEF_PACKET_LEN to the requested packet size in testpmd.h, rebuild and start testpmd in both host and guest, then "start" on one side and "start tx_first 32" on the other. Signed-off-by: Zhihong Wang <zhihong.wang@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com>	2017-01-17 16:40:05 +01:00
Yuanhan Liu	d948f596fe	ethdev: fix port data mismatched in multiple process model Assume we have two virtio ports, 00:03.0 and 00:04.0. The first one is managed by the kernel driver, while the later one is managed by DPDK. Now we start the primary process. 00:03.0 will be skipped by DPDK virtio PMD driver (since it's being used by the kernel). 00:04.0 would be successfully initiated by DPDK virtio PMD (if nothing abnormal happens). After that, we would get a port id 0, and all the related info needed by virtio (virtio_hw) is stored at rte_eth_dev_data[0]. Then we start the secondary process. As usual, 00:03.0 will be firstly probed. It firstly tries to get a local eth_dev structure for it (by rte_eth_dev_allocate): port_id = rte_eth_dev_find_free_port(); ... eth_dev = &rte_eth_devices[port_id]; eth_dev->data = &rte_eth_dev_data[port_id]; ... return eth_dev; Since it's a first PCI device, port_id will be 0. eth_dev->data would then point to rte_eth_dev_data[0]. And here things start going wrong, as rte_eth_dev_data[0] actually stores the virtio_hw for 00:04.0. That said, in the secondary process, DPDK will continue to drive PCI device 00.03.0 (despite the fact it's been managed by kernel), with the info from PCI device 00:04.0. Which is wrong. The fix is to attach the port already registered by the primary process. That is, iterate the rte_eth_dev_data[], and get the port id who's PCI ID matches the current PCI device. This would let us maintain same port ID for the same PCI device, keeping the chance of referencing to wrong data minimal. Fixes: `af75078fec` ("first public release") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-17 09:20:18 +01:00
Jan Wickbom	59317cef24	vhost: allow many vhost-user ports Currently select() is used to monitor file descriptors for vhostuser ports. This limits the number of ports possible to create since the fd number is used as index in the fd_set and we have seen fds > 1023. This patch changes select() to poll(). This way we can keep an packed (pollfd) array for the fds, e.g. as many fds as the size of the array. Also see: http://dpdk.org/ml/archives/dev/2016-April/037024.html Reported-by: Patrik Andersson <patrik.r.andersson@ericsson.com> Signed-off-by: Jan Wickbom <jan.wickbom@ericsson.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Maxime Coquelin	73c8f9f69c	vhost: introduce reply ack feature REPLY_ACK features provide a generic way for QEMU to ensure both completion and success of a request. As described in vhost-user spec in QEMU repository, QEMU sets VHOST_USER_NEED_REPLY flag (bit 3) when expecting a reply_ack from the backend. Backend must reply with 0 for success or non-zero otherwise when flag is set. Currently, only VHOST_USER_SET_MEM_TABLE request implements reply_ack, in order to synchronize mapping updates. This patch enables REPLY_ACK feature generally, but only checks error code for VHOST_USER_SET_MEM_TABLE. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yong Wang	5731d6acc7	vhost: fix memory leak In function vhost_new_device(), current code dose not free 'dev' in "i == MAX_VHOST_DEVICE" condition statements. It will lead to a memory leak. Fixes: `45ca9c6f7b` ("vhost: get rid of linked list for devices") Cc: stable@dpdk.org Signed-off-by: Yong Wang <wang.yong19@zte.com.cn> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:17 +01:00
Haifeng Lin	8c33fc10f6	vhost: fix guest/host physical address mapping When reg_size < page_size the function read in rte_mem_virt2phy would not return, because host_user_addr is invalid. Fixes: `e246896178` ("vhost: get guest/host physical address mappings") Cc: stable@dpdk.org Signed-off-by: Haifeng Lin <haifeng.lin@huawei.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:17 +01:00
Tomasz Kulasek	1feda4d8fc	mbuf: add a function to linearize a packet This patch adds function rte_pktmbuf_linearize to let crypto PMD coalesce chained mbuf before crypto operation and extend their capabilities to support segmented mbufs when device cannot handle them natively. Included unit tests for rte_pktmbuf_linearize functionality: 1) Creates banch of segmented mbufs with different size and number of segments. 2) Fills noncontigouos mbuf with sequential values. 3) Uses rte_pktmbuf_linearize to coalesce segmented buffer into one contiguous. 4) Verifies data in linearized buffer. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-15 19:30:00 +01:00
Tiwei Bie	375008544b	ethdev: add MACsec capability flags If these flags are advertised by a PMD, the NIC supports the MACsec offload. The incoming MACsec traffics can be offloaded transparently after the MACsec offload is configured correctly by the application. And the application can set the PKT_TX_MACSEC flag in mbufs to enable the MACsec offload for the packets to be transmitted. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>	2017-01-15 19:16:19 +01:00
Tiwei Bie	8609a7a86c	ethdev: add MACsec event type This commit adds a below event type: - RTE_ETH_EVENT_MACSEC This event will occur when the PN counter in a MACsec connection reaches the exhaustion threshold. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>	2017-01-15 19:16:03 +01:00
Tiwei Bie	223d629f8c	mbuf: add MACsec flag Add a new Tx flag in mbuf, that can be set by applications to enable the MACsec offload for a packet to be transmitted. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-15 19:15:51 +01:00
Bruce Richardson	0615355ffa	kvargs: make pointers in string arrays const Change the parameters of functions from const char valid[] to const char const valid[]. This additional const is needed to allow us to fix some checkpatch warnings, as well as being good programming practice. For the checkpatch warnings, if we have a set of command line args that we want to check defined as: static const char args[] = { "arg1", "arg2", NULL }; kvlist = rte_kvargs_parse(params, args); checkpatch will complain: WARNING:STATIC_CONST_CHAR_ARRAY: static const char array should probably be static const char * const Adding the additional const to the definition of the args will then trigger a compiler error in the absence of this change to the kvargs library, as we lose the const in the call to kvargs_parse. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-13 19:28:26 +01:00
Wenfeng Liu	454a0a7009	mempool: use cache in single producer or consumer mode Currently we will check mempool flags when we put/get objects from mempool. However, this makes cache useless when mempool is SC\|SP, SC\|MP, MC\|SP cases. This patch makes cache available in above cases and improves performance. Signed-off-by: Wenfeng Liu <liuwf@arraynetworks.com.cn> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-13 16:38:09 +01:00
Ben Walker	b0d3e3f73b	pci: use address struct in function arguments Instead of passing domain, bus, devid, func, just pass an rte_pci_addr. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-01-12 15:55:16 +01:00
Ben Walker	22dda618c0	pci: separate detaching ethernet ports from PCI devices Attaching and detaching ethernet ports from an application is not the same thing as physically removing a PCI device, so clarify the flags indicating support. All PCI devices are assumed to be physically removable, so no flag is necessary in the PCI layer. Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2017-01-12 15:48:54 +01:00
Ben Walker	e84ad157b7	pci: unmap resources if probe fails If resources were mapped prior to probe, unmap them if probe fails. This does not handle the case where the kernel driver was forcibly unbound prior to probe. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-01-12 15:47:45 +01:00
Adrien Mazarguil	6de5c0f130	ethdev: define default item masks in flow API Leaving default pattern item mask values up for interpretation by PMDs is an undefined behavior that applications might find difficult to use in the wild. It also needlessly complicates PMD implementation. This commit addresses this by defining consistent default masks for each item type. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:54:52 +01:00
Adrien Mazarguil	f4e7c8bc84	ethdev: clarify RSS action in flow API Contrary to the current description, mbuf RSS hash result storage does not overlap with the returned MARK value (hash.fdir.lo vs. hash.fdir.hi), and both may be combined. Reflect this change by allowing testpmd to display both values simultaneously. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:54:47 +01:00
Adrien Mazarguil	ebaa064e2c	ethdev: clarify MARK and FLAG actions in flow API Both actions share the PKT_RX_FDIR mbuf flag, as a result there is no way to tell them apart. Moreover, the maximum allowed value for the MARK action may not necessarily cover the entire 32-bit space. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:54:22 +01:00
Adrien Mazarguil	a757583e55	ethdev: modify flow API error function Based on initial PMD implementations of the flow API, returning the error structure which may be NULL is useless and always discarded. Returning the error code instead appears to be much more convenient. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:53:07 +01:00
Nelio Laranjeiro	86c743cf91	eal: define generic vector types Add common vector type definitions to all CPU architectures. Signed-off-by: Nelio Laranjeiro <nelio.laranjeiro@6wind.com> Acked-by: Chao Zhu <chaozhu@linux.vnet.ibm.com>	2017-01-04 21:50:19 +01:00
Thomas Monjalon	c6dab2a873	tools: move to usertools Rename tools/ into usertools/ to differentiate from buildtools/ and devtools/ while making clear these scripts are part of DPDK runtime. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Tested-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-01-04 21:17:32 +01:00
Tomasz Kulasek	4fb7e803eb	ethdev: add Tx preparation Added API for `rte_eth_tx_prepare` uint16_t rte_eth_tx_prepare(uint8_t port_id, uint16_t queue_id, struct rte_mbuf tx_pkts, uint16_t nb_pkts) Added fields to the `struct rte_eth_desc_lim`: uint16_t nb_seg_max; /< Max number of segments per whole packet. / uint16_t nb_mtu_seg_max; /< Max number of segments per one MTU / These fields can be used to create valid packets according to the following rules: * For non-TSO packet, a single transmit packet may span up to "nb_mtu_seg_max" buffers. * For TSO packet the total number of data descriptors is "nb_seg_max", and each segment within the TSO may span up to "nb_mtu_seg_max". Added functions: int rte_validate_tx_offload(struct rte_mbuf m) to validate general requirements for tx offload set in mbuf of packet such a flag completness. In current implementation this function is called optionaly when RTE_LIBRTE_ETHDEV_DEBUG is enabled. int rte_net_intel_cksum_prepare(struct rte_mbuf m) to prepare pseudo header checksum for TSO and non-TSO tcp/udp packets before hardware tx checksum offload. - for non-TSO tcp/udp packets full pseudo-header checksum is counted and set. - for TSO the IP payload length is not included. int rte_net_intel_cksum_flags_prepare(struct rte_mbuf *m, uint64_t ol_flags) this function uses same logic as rte_net_intel_cksum_prepare, but allows application to choose which offloads should be taken into account, if full preparation is not required. PERFORMANCE TESTS ----------------- This feature was tested with modified csum engine from test-pmd. The packet checksum preparation was moved from application to Tx preparation step placed before burst. We may expect some overhead costs caused by: 1) using additional callback before burst, 2) rescanning burst, 3) additional condition checking (packet validation), 4) worse optimization (e.g. packet data access, etc.) We tested it using ixgbe Tx preparation implementation with some parts disabled to have comparable information about the impact of different parts of implementation. IMPACT: 1) For unimplemented Tx preparation callback the performance impact is negligible, 2) For packet condition check without checksum modifications (nb_segs, available offloads, etc.) is 14626628/14252168 (~2.62% drop), 3) Full support in ixgbe driver (point 2 + packet checksum initialization) is 14060924/13588094 (~3.48% drop) Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-04 20:40:15 +01:00

... 3 4 5 6 7 ...

3220 Commits