numam-dpdk

Author	SHA1	Message	Date
Huawei Xie	7c845c1fcd	vhost: add makefile vhost lib is turned off by default. vhost lib is based on cuse, which requires fuse development package to be installed. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: fix build dependencies]	2014-10-13 19:16:54 +02:00
Huawei Xie	22c668d494	vhost: comment identified issues 1) FIXME: concurrent calls to vhost set mem table from different guests could cause mem_temp to be overrided. 2) TODO: cmpset cost quite some cpu cyles. Allow app to disable this feature if there is no contention in real workload. 3) FIXME: fix scatter gather mbuf copy to vhost vring chained buffers. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	60ddca7654	vhost: coding style fixes Fix serious coding style issues reported by checkpatch. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	38726fb1b6	vhost: static variable fixes Add "static" for some variable definitions. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	da74110053	vhost: clean includes Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	1c01d52392	vhost: add debug print Define PRINT_PACKET and LOG_DEBUG macros. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	a58f905514	vhost: add private context field priv field could be used to store application specific context. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	3d671af711	vhost: supported features VHOST_SUPPORTED_FEATURES is the feature mask that vhost lib supports. VHOST_FEATURES is the feature mask vhost currently supports after some features are turned on/off. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: split patch]	2014-10-13 19:16:54 +02:00
Huawei Xie	9eed6bfd2e	vhost: allow to enable or disable features Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: split patch]	2014-10-13 19:16:54 +02:00
Huawei Xie	7202b0a824	vhost: get available vring entries Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: split patch]	2014-10-13 19:16:54 +02:00
Huawei Xie	28689ff04d	vhost: rename ops registering function Rename init_virtio_net as rte_vhost_callback_register API. rte_vhost_callback_register register the callbacks called when a vhost device is created and ready to be added to data processing core or is de-actived by guest. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	e5c2ded503	vhost: expose register and start functions Rename register_cuse_device as rte_vhost_driver_register API. Rename start_session_loop as rte_vhost_driver_session_start API. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	f782576959	vhost: get internal ops when registering vhost_net_device_ops is internal implementation in vhost lib. register_cuse_device will be vhost driver register API. There is no need for it to know the internal vhost ops. Instead, that ops is retrieved in register_cuse_device through get_virtio_net_callbacks. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	17b8320a3e	vhost: remove index parameter Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	a4fff3bba8	vhost: enqueue/dequeue burst rte_vhost_enqueue_burst copies host packets to guest. rte_vhost_enqueue_burst will call virtio_dev_rx and virtio_dev_merge_rx respectively depending on whether merge-able feature is negotiated or not in the vhost device. virtio_dev_merge_tx is renamed to rte_vhost_dequeue_burst. rte_vhost_dequeue_burst gets to-be-sent packets from guest. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: merged patches]	2014-10-13 19:16:54 +02:00
Huawei Xie	830bea8bc4	vhost: add queue id parameter queue_id parameter is added to Rx/Tx functions for multiple queue support in future. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:54 +02:00
Huawei Xie	fa325fa413	vhost: calculate mbuf size As a lib, we have no idea the app defined mbuf size. This patch will calculate mbuf size dynamically. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:53 +02:00
Huawei Xie	20f16ce646	vhost: return packets to upper layer This patch makes virtio_dev_merge_tx return the received packets to app layer. Previously virtio_tx_route was called to route these packets and then free them. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:16:53 +02:00
Huawei Xie	7f456f6d61	vhost: move address translation function Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: split from a previous patch]	2014-10-13 19:16:04 +02:00
Huawei Xie	c19dc7db15	vhost: move internal structure The structure virtio_net_config_ll is moved to virtio_net.c. It is related to internal virtio device management, so it should not be exposed to other files. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:13:10 +02:00
Huawei Xie	8a7576c3dc	vhost: remove retry logic It was used to wait some time and retry when there are not enough descriptors. App could implement this policy easily if it needs. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:13:10 +02:00
Huawei Xie	0a739691fc	vhost: remove zero copy memory region generation logic Currently zero copy feature isn't generic as it couples closely with nic. It isn't put in the vhost lib in this version. gpa(guest physical address) to hpa(host physical address) mapping region logic is removed. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:13:10 +02:00
Huawei Xie	68e6490476	vhost: remove switching related logics The following logics will be moved to vhost example: 1. mac learning, which is used to learn the mac address from the first transmitted packet of guest and bind the vhost device to a queue in a pool of VMDQ. 2. VMDQ mac/vlan filter: Each pool the vhost device is bind to is assigned a mac/vlan filter. 3. num_devices is used to specify the maximum vhost devices the nic supports. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-13 19:13:10 +02:00
Huawei Xie	f14c0b4db8	vhost: remove useless code for Rx/Tx Remove all other codes and only keep virtio_dev_rx, copy_from_mbuf_to_vring, virtio_dev_merge_rx, virtio_dev_merge_tx. Previous vhost merge-able feature introduces another version of tx function, virtio_dev_merge_tx. Actually it is not related to merge-able feature but is the fix for memcpy between mbuf and vring descriptors. This lib will create the tx functions based on virtio_dev_merge_tx. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: do not remove code used or moved later]	2014-10-13 19:11:38 +02:00
Huawei Xie	5c7a80aec3	vhost: move from examples to dedicated library Those files will be refactored in subsequent patches to form user space vhost library. Makefile and main.h are removed. main.c is renamed to vhost_rxtx.c and will provide vring enqueue/dequeue API. virtio-net.h is renamed to rte_virtio_net.h which is the API header file. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> [Thomas: remove from examples Makefile and merge file renaming]	2014-10-13 19:10:09 +02:00
Daniel Mrzyglod	ee1a5470fa	tools: fix setup script for Fedora 21 script was expecting /lib/modules/$(uname -r)/kernel/drivers/uio/uio.ko but in fedora 21 there are Compressed kernel modules - xz (LZMA) Signed-off-by: Daniel Mrzyglod <danielx.t.mrzyglod@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2014-10-10 17:50:31 +02:00
Keith Wiles	afe9637e7b	mempool: remove useless variable Remove n_orig variable as it is not required. Signed-off-by: Keith Wiles <keith.wiles@windriver.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-10 17:49:09 +02:00
Ouyang Changchun	03a35595d0	ixgbe/base: disable some gcc warnings This patch disables compilation complain from lower GCC version (less than 4.6). Note: Only supported versions of GCC are 4.x. Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com>	2014-10-10 17:45:18 +02:00
Pablo de Lara	81f7ecd934	examples: use factorized default Rx/Tx configuration For apps that were using default rte_eth_rxconf and rte_eth_txconf structures, these have been removed and now they are obtained by calling rte_eth_dev_info_get, just before setting up RX/TX queues. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2014-10-10 13:01:49 +02:00
Pablo de Lara	27b31ee33f	i40e: set default Rx/Tx configuration Many sample apps use duplicated code to set rte_eth_txconf and rte_eth_rxconf structures. This patch allows the user to get a default optimal RX/TX configuration through rte_eth_dev_info get, and still any parameters may be tweaked as wished, before setting up queues. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Reviewed-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@6wind.com> [Thomas: split patch]	2014-10-10 12:54:19 +02:00
Pablo de Lara	9fad0ce847	ixgbe: set default Rx/Tx configuration Many sample apps use duplicated code to set rte_eth_txconf and rte_eth_rxconf structures. This patch allows the user to get a default optimal RX/TX configuration through rte_eth_dev_info get, and still any parameters may be tweaked as wished, before setting up queues. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Reviewed-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@6wind.com> [Thomas: split patch]	2014-10-10 12:54:19 +02:00
Pablo de Lara	90daa93a1c	igb: set default Rx/Tx configuration Many sample apps use duplicated code to set rte_eth_txconf and rte_eth_rxconf structures. This patch allows the user to get a default optimal RX/TX configuration through rte_eth_dev_info get, and still any parameters may be tweaked as wished, before setting up queues. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Reviewed-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@6wind.com> [Thomas: split patch]	2014-10-10 12:54:13 +02:00
Pablo de Lara	fbde27f19a	ethdev: get default Rx/Tx configuration from dev info Many sample apps use duplicated code to set rte_eth_txconf and rte_eth_rxconf structures. This patch allows the user to get a default optimal RX/TX configuration through rte_eth_dev_info get, and still any parameters may be tweaked as wished, before setting up queues. Besides, if a NULL pointer is passed to rte_eth_rx_queue_setup or rte_eth_tx_queue_setup, these functions get internally the default RX/TX configuration for the user. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Reviewed-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@6wind.com> [Thomas: split patch]	2014-10-10 12:47:05 +02:00
Pablo de Lara	a30268e9a2	ethdev: reset whole dev info structure before filling To guarantee that RX/TX configuration structures are reseted before modifying them, plus the other dev info fields, dev info structure is zeroed beforehand. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2014-10-10 12:47:05 +02:00
Pablo de Lara	dd6303b230	examples/netmap_compat: add default build target Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-09 20:02:35 +02:00
Nicolás Pernas Maradei	0069e277db	app/testpmd: print message if queue start/stop is not supported Print an error message to the user when trying to start/stop a rx/tx queue and this function is not supported by the PMD driver. The patch does not check if the return value is -EINVAL because testpmd is already validating the port and queue id. Signed-off-by: Nicolás Pernas Maradei <nico@emutex.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-09 20:02:35 +02:00
Nicolás Pernas Maradei	a0fce1193c	pcap: fix double stop error librte_pmd_pcap driver was opening the pcap/interfaces only at init time and closing them only when the port was being stopped. This behaviour would cause problems (leading to segfault) if the user closed the port 2 times. The first time the pcap/interfaces would be normally closed but libpcap would throw an error causing a segfault if the closed pcaps/interfaces were closed again. This behaviour is solved by re-openning pcaps/interfaces when the port is started (only if these weren't open already for example at init time). Signed-off-by: Nicolás Pernas Maradei <nico@emutex.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2014-10-09 20:02:35 +02:00
Jim Harris	f7eda85b9c	i40e: fix Tx descriptors reset Fix the descriptor initialization loop, so that it initializes the i40e_tx_desc::cmd_type_offset_bsz for the correct index into the tx_ring array. Previously it would use the index once to initialize the txd local variable, then again when setting cmd_type_offset_bsz. Signed-off-by: Jim Harris <james.r.harris@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2014-10-09 20:02:34 +02:00
Pablo de Lara	7a10de5e27	ixgbe: fix build with bypass enabled Since commit `aae1047905` ("use the right debug macro"), DEBUGOUT was replaced by PMD_DRV_LOG which requires at least 2 arguments. But the level argument was missing. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-09 20:02:34 +02:00
Keith Wiles	0e539d1f9b	mempool: fix build with debug enabled and clang When enabling RTE_LIBRTE_MEMPOOL_DEBUG and compiling with clang compiler an error occurs, because ifdefed code includes push/pop pragmas. Signed-off-by: Keith Wiles <keith.wiles@windriver.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-09 20:02:34 +02:00
David Marchand	b1a3e0f773	eal/bsd: fix core detection Following "options parsing" patchset (commit `d7cb626f` and `489a9d6c`), core detection is not working correctly on bsd. ./x86_64-native-bsdapp-gcc/app/test -c f -n 4 -- -i [...] EAL: lcore 0 unavailable EAL: invalid coremask Align bsd to linux: - commit `f563a372` "eal: fix recording of detected/enabled logical cores" - commit `4f04db8b` "eal: check coremask against detected lcores" Reported-by: Zhan, Zhaochen <zhaochen.zhan@intel.com> Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Tested-by: Zhaochen Zhan <zhaochen.zhan@intel.com>	2014-10-09 17:52:06 +02:00
Bruce Richardson	17c696b4ff	mbuf: comment for ctrl mbuf flag Add in a doxygen comment for the ctrl mbuf flag definition. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-08 14:45:36 +02:00
Bruce Richardson	578cca42da	mbuf: update Rx flag format Update the format of the RX flags to match that of the TX flags. In general the flags are now specified as "1ULL << X", with a few exceptions. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-08 14:45:14 +02:00
Bruce Richardson	dff38e8e0d	mbuf: group Tx flags near end of field This patch takes the existing TX flags defined for the mbuf and shifts each uniquely defined one left so that additional RX flags can be defined without having RX and TX flags mixed together. Under the new scheme, RX flags start at bit 0 and work left, TX flags start at bit 55 and work right, and bits 56-63 are reserved for generic mbuf use, not for offloads. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2014-10-08 14:43:47 +02:00
Bruce Richardson	fa73fe80a7	app/testpmd: change rxfreet default to 32 To improve performance by using bulk alloc or vectored RX routines, we need to set rx free threshold (rxfreet) value to 32, so make this the testpmd default. Thirty-two is the minimum setting needed to enable either the bulk alloc or vector RX routines inside the ixgbe driver, so it's best made the default for that reason. Please see "check_rx_burst_bulk_alloc_preconditions()" in ixgbe_rxtx.c, and RX function assignment logic in "ixgbe_dev_rx_queue_setup()" in the same file. The difference in IO performance for testpmd when called without any optional parameters, and using 10G NICs using the ixgbe driver, can be significant - approx 25% or more. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2014-10-08 14:24:59 +02:00
Bruce Richardson	d803387a7f	ixgbe: add prefetch to improve slow-path tx perf Make a small improvement to slow path TX performance by adding in a prefetch for the second mbuf cache line. Also move assignment of l2/l3 length values only when needed. What I've done with the prefetches is two-fold: 1) changed it from prefetching the mbuf (first cache line) to prefetching the mbuf pool pointer (second cache line) so that when we go to access the pool pointer to free transmitted mbufs we don't get a cache miss. When clearing the ring and freeing mbufs, the pool pointer is the only mbuf field used, so we don't need that first cache line. 2) changed the code to prefetch earlier - in effect to prefetch one mbuf ahead. The original code prefetched the mbuf to be freed as soon as it started processing the mbuf to replace it. Instead now, every time we calculate what the next mbuf position is going to be we prefetch the mbuf in that position (i.e. the mbuf pool pointer we are going to free the mbuf to), even while we are still updating the previous mbuf slot on the ring. This gives the prefetch much more time to resolve and get the data we need in the cache before we need it. In terms of performance difference, a quick sanity test using testpmd on a Xeon (Sandy Bridge uarch) platform showed performance increases between approx 8-18%, depending on the particular RX path used in conjuntion with this TX path code. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2014-10-08 14:24:59 +02:00
Bruce Richardson	a62bfb72b9	mbuf: switch vlan_tci and reserved2 fields Move the vlan_tci field up by two bytes in the mbuf data structure. This has two effects: * Ensures the the ixgbe vector driver places the vlan tag in the correct place in the mbuf. * Allows a second vlan tag field, if one is added in the future, to be placed after the existing vlan field, rather than before. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2014-10-08 14:24:59 +02:00
Bruce Richardson	4cd917b308	mbuf: add userdata pointer field While some applications may store metadata about packets in the packet mbuf headroom, this is not a workable solution for packet metadata which is either: * larger than the headroom (or headroom is needed for adding pkt headers) * needs to be shared or copied among packets To support these use cases in applications, we reserve a general "userdata" pointer field inside the second cache-line of the mbuf. This is better than having the application store the pointer to the external metadata in the packet headroom, as it saves an additional cache-line from being used. Apart from storing metadata, this field also provides a general 8-byte scratch space inside the mbuf for any other application uses that are applicable. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2014-10-08 14:24:59 +02:00
Bruce Richardson	7f78e67701	mbuf: ensure next pointer is set to null on free The receive functions for packets do not modify the next pointer so the next pointer should always be cleared on mbuf free, just in case. The slow-path TX needs to clear it, and the standard mbuf free function also needs to clear it. Fast path TX does not handle chained mbufs so is unaffected Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2014-10-08 14:12:31 +02:00
Helin Zhang	3043ce5011	i40e/base: fix arq_event_info struct Overloading the 'msg_size' field in the 'arq_event_info' struct is a bad idea. It leads to bugs when the structure is used in a loop, since the input value (buffer size) is overwritten by the output value (actual message length). The fix introduces one more field of 'buf_len' for the buffer size, and renames the field of 'msg_size' to 'msg_len' for the real message size. Signed-off-by: Helin Zhang <helin.zhang@intel.com> Reviewed-by: Chen Jing <jing.d.chen@intel.com> Tested-by: HuilongX Xu <huilongx.xu@intel.com>	2014-10-07 18:02:11 +02:00

1 2 3 4 5 ...

1145 Commits