numam-dpdk

Author	SHA1	Message	Date
Olivier Matz	3976f19e13	net/virtio: fix compilation with -Og The compilation with gcc-6.3.0 and EXTRA_CFLAGS=-Og gives the following error: CC virtio_rxtx.o virtio_rxtx.c: In function ‘virtio_rx_offload’: virtio_rxtx.c:680:10: error: ‘csum’ may be used uninitialized in this function [-Werror=maybe-uninitialized] csum = ~csum; ~~~~~^~~~~~~ The function rte_raw_cksum_mbuf() may indeed return an error, and in this case, csum won't be initialized. Fix it by initializing csum to 0. Fixes: 96cb6711939e ("net/virtio: support Rx checksum offload") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-10-12 01:36:57 +01:00
Olivier Matz	0964936308	net/virtio: keep Rx handler whatever the Tx queue config Split use_simple_rxtx into use_simple_rx and use_simple_tx, and ensure that only use_simple_tx is updated when txq flags forces to use the standard Tx handler. This change is also useful for next commit (disable simple Rx path when Rx checksum is requested). Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yliu@fridaylinux.org>	2017-10-10 15:52:27 +02:00
Olivier Matz	4819eae8d9	net/virtio: rationalize setting of Rx/Tx handlers The selection of Rx/Tx handlers is done at several places, group them in one function set_rxtx_funcs(). The update of hw->use_simple_rxtx is also rationalized: - initialized to 1 (prefer simple path) - in dev configure or rx/tx queue setup, if something prevents from using the simple path, change it to 0. - in dev start, set the handlers according to hw->use_simple_rxtx. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yliu@fridaylinux.org>	2017-10-10 15:51:04 +02:00
Olivier Matz	efc83a1e7f	net/virtio: fix queue setup consistency In rx/tx queue setup functions, some code is executed only if use_simple_rxtx == 1. The value of this variable can change depending on the offload flags or sse support. If Rx queue setup is called before Tx queue setup, it can result in an invalid configuration: - dev_configure is called: use_simple_rxtx is initialized to 0 - rx queue setup is called: queues are initialized without simple path support - tx queue setup is called: use_simple_rxtx switch to 1, and simple Rx/Tx handlers are selected Fix this by postponing a part of Rx/Tx queue initialization in dev_start(), as it was the case in the initial implementation. Fixes: 48cec290a3d2 ("net/virtio: move queue configure code to proper place") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yliu@fridaylinux.org>	2017-10-10 15:51:04 +02:00
Tiwei Bie	6a8cbb31ff	net/virtio: refactor coding style in Rx Make the code more readable. No functional change. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>	2017-07-19 11:09:13 +03:00
Zhiyong Yang	b7be4f461a	net/virtio: support to turn on/off traffic flow Current virtio_dev_stop only disables interrupt and marks link down, When it is invoked, tx/rx traffic flows still work. This is a strange behavior. The patch supports the switch of flow. Signed-off-by: Zhiyong Yang <zhiyong.yang@intel.com>	2017-04-19 10:49:06 +02:00
Olivier Matz	ebb7bcabb8	drivers/net: do not touch mbuf next or nb segs on Rx Now that the m->next pointer and m->nb_segs is expected to be set (to NULL and 1 respectively) after a mempool_get(), we can avoid to write them in the Rx functions of drivers. Only some drivers are patched, it's not an exhaustive patch. It gives the idea to do the same in other drivers. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-04-05 11:30:29 +02:00
Zhiyong Yang	f53fe364d6	net/virtio: remove the redundant computing The minor change aims to remove the redundant computing and make it easier to understand the code. Signed-off-by: Zhiyong Yang <zhiyong.yang@intel.com>	2017-04-01 08:58:54 +02:00
Ferruh Yigit	f2462150ec	drivers/net: remove redundant new line from logs Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2017-01-30 22:18:27 +01:00
Yuanhan Liu	16994abee2	net/virtio: optimize header reset on any layout When any layout is used, the header is stored in the head room of mbuf. mbuf is allocated and filled by user, means there is no gurateen the header is all zero for non TSO case. Therefore, we have to do the reset by ourself: memest(hdr, 0, head_size); The memset has two impacts on performance: - memset could not be inlined, which is a bit costly. - more importantly, it touches the mbuf, which could introduce severe cache issues as described by former patch. Similiary, we could do the same trick: reset just when necessary, when the corresponding field is already 0, which is likely true for a simple l2 forward case. It could boost the performance up to 20+% in micro benchmarking. Cc: stable@dpdk.org Cc: Maxime Coquelin <maxime.coquelin@redhat.com> Cc: Michael S. Tsirkin <mst@redhat.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2017-01-30 14:33:12 +01:00
Yuanhan Liu	c9ea670c1d	net/virtio: fix performance regression due to TSO TSO is now enabled, but it's not actually being used by default in a simple L2 forward mode. In such case, we have to zero the virtio net headers, to inform the vhost backend that no offload is being used: hdr->csum_start = 0; hdr->csum_offset = 0; hdr->flags = 0; hdr->gso_type = 0; hdr->gso_size = 0; hdr->hdr_len = 0; Such writes could be very costly; it introduces severe cache issues: The above operations introduce cache write for each packet, which stalls the read operation from the vhost backend. The fact that virtio net header is initiated to zero in PMD driver init stage means that these costly writes are unnecessary and could be avoided: if (hdr->csum_start != 0) hdr->csum_start = 0; And that's what the macro ASSIGN_UNLESS_EQUAL does. With this, the performance drop introduced by TSO enabling is recovered: it could be up to 20% in micro benchmarking. Fixes: 58169a9c8153 ("net/virtio: support Tx checksum offload") Fixes: 696573046e9e ("net/virtio: support TSO") Cc: stable@dpdk.org Cc: Olivier Matz <olivier.matz@6wind.com> Cc: Maxime Coquelin <maxime.coquelin@redhat.com> Cc: Michael S. Tsirkin <mst@redhat.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Reviewed-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-30 14:33:04 +01:00
Jianfeng Tan	b0caba1a13	net/virtio: add Rx descriptor check Under interrupt mode, rx_descriptor_done is used as an indicator for applications to check if some number of packets are ready to be received. This patch enables this by checking used ring's local consumed idx with shared (with backend) idx. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:47 +01:00
Pierre Pfister	9edfedf5e4	net/virtio: use any layout for version 1.0 Current virtio driver advertises VERSION_1 support, but does not handle device's VERSION_1 support when sending packets (it looks for ANY_LAYOUT feature, which is absent). This patch enables 'can_push' in tx path when VERSION_1 is advertised by the device. This significantly improves small packets forwarding rate towards devices advertising VERSION_1 feature. Signed-off-by: Pierre Pfister <ppfister@cisco.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:17 +01:00
Yuanhan Liu	48cec290a3	net/virtio: move queue configure code to proper place The only piece of code of virtio_dev_rxtx_start() is actually doing queue configure/setup work. So, move it to corresponding queue_setup callback. Once that is done, virtio_dev_rxtx_start() becomes an empty function, thus it's being removed. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2016-11-07 15:40:13 +01:00
Yuanhan Liu	f4d1ad1579	net/virtio: initiate vring at init stage virtio_dev_vring_start() is actually doing the vring initiation job. And the vring initiation job should be done at the dev init stage, as stated with great details in former commit. So move it there, and rename it to virtio_init_vring(). Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2016-11-07 15:40:08 +01:00
Yuanhan Liu	69c80d4ef8	net/virtio: allocate queue at init stage Queue allocation should be done once, since the queue related info (such as vring addreess) will only be informed to the vhost-user backend once without virtio device reset. That means, if you allocate queues again after the vhost-user negotiation, the vhost-user backend will not be informed any more. Leading to a state that the vring info mismatches between virtio PMD driver and vhost-backend: the driver switches to the new address has just been allocated, while the vhost-backend still sticks to the old address has been assigned in the init stage. Unfortunately, that is exactly how the virtio driver is coded so far: queue allocation is done at queue_setup stage (when rte_eth_tx/rx_queue_setup is invoked). This is wrong, because queue_setup can be invoked several times. For example, $ start_testpmd.sh ... --txq=1 --rxq=1 ... > port stop 0 > port config all txq 1 # just trigger the queue_setup callback again > port config all rxq 1 > port start 0 The right way to do is allocate the queues in the init stage, so that the vring info could be persistent with the vhost-user backend. Besides that, we should allocate max_queue pairs the device supports, but not nr queue pairs firstly configured, to make following case work. $ start_testpmd.sh ... --txq=1 --rxq=1 ... > port stop 0 > port config all txq 2 > port config all rxq 2 > port start 0 Since the allocation is switched to init stage, the free should also moved from the rx/tx_queue_release to dev close stage. That leading we could do nothing an empty rx/tx_queue_release() implementation. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-11-07 15:40:03 +01:00
Olivier Matz	696573046e	net/virtio: support TSO Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-10-13 20:45:56 +02:00
Olivier Matz	86d59b2146	net/virtio: support LRO Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-10-13 20:45:56 +02:00
Olivier Matz	58169a9c81	net/virtio: support Tx checksum offload Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-10-13 20:45:56 +02:00
Olivier Matz	96cb671193	net/virtio: support Rx checksum offload Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-10-13 20:45:56 +02:00
Jerin Jacob	2d7c37194e	net/virtio: add NEON based Rx handler Added neon based Rx vector implementation. Selection of the new handler based neon availability at runtime. Updated the release notes and MAINTAINERS file. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Jianbo Liu <jianbo.liu@linaro.org>	2016-09-28 02:18:39 +02:00
Jerin Jacob	ed35184a0f	net/virtio: select data handler depending on CPU flag Introduced cpuflag based run-time detection to select the SSE based simple Rx handler Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-09-28 02:18:39 +02:00
Jerin Jacob	17483cb210	net/virtio: cleanup conditional compilation Removed unnecessary compile time dependency on "use_simple_rxtx". Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-09-28 02:18:39 +02:00
Yuanhan Liu	834ac655ba	net/virtio: fix crash on null dereference The rxq/txq for the queue_release callback could be NULL, say when rte_eth_dev_configure() fails that the queue is not setup at all. Do a simple NULL check would fix the crash issue. Fixes: 01ad44fd374f ("net/virtio: split Rx/Tx queue") Reported-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-22 00:30:08 +02:00
Olivier Matz	25f80d1087	net/virtio: fix packet corruption The support of virtio-user changed the way the mbuf dma address is retrieved, using a physical address in case of virtio-pci and a virtual address in case of virtio-user. This change introduced some possible memory corruption in packets, replacing: m->buf_physaddr + RTE_PKTMBUF_HEADROOM by: m->buf_physaddr + m->data_off (through a macro) This patch fixes this issue, restoring the original behavior. By the way, it also rework the macros, adding a "VIRTIO_" prefix and API comments. Fixes: f24f8f9fee8a ("net/virtio: allow virtual address to fill vring descriptors") Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-22 00:27:29 +02:00
Jianfeng Tan	f24f8f9fee	net/virtio: allow virtual address to fill vring descriptors This patch is related to how to calculate relative address for vhost backend. The principle is that: based on one or multiple shared memory regions, vhost maintains a reference system with the frontend start address, backend start address, and length for each segment, so that each frontend address (GPA, Guest Physical Address) can be translated into vhost-recognizable backend address. To make the address translation efficient, we need to maintain as few regions as possible. In the case of VM, GPA is always locally continuous. But for some other case, like virtio-user, GPA continuous is not guaranteed, therefore, we use virtual address here. It basically means: a. when set_base_addr, VA address is used; b. when preparing RX's descriptors, VA address is used; c. when transmitting packets, VA is filled in TX's descriptors; d. in TX and CQ's header, VA is used. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Huawei Xie	01ad44fd37	net/virtio: split Rx/Tx queue We keep a common vq structure, containing only vq related fields, and then split others into RX, TX and control queue respectively. Signed-off-by: Huawei Xie <huawei.xie@intel.com> [Jianfeng Tan: found and fixed 2 bugs] Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 06:10:54 +02:00
Olivier Matz	88c107840d	net/virtio: check mbuf is direct when using any layout The commit dd856dfcb9e74 introduced an optimization that prepends virtio header to mbuf data. It can be used when the tx mbuf is writeable, so we need to check that the mbuf is direct (i.e. it embeds its own data). Fixes: dd856dfcb9e7 ("virtio: use any layout on Tx") Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 06:10:54 +02:00
Olivier Matz	fbfd99551c	mbuf: add raw allocation function Many drivers provide their own implementation of rte_mbuf_raw_alloc(), duplicating the code. Introduce a new public function in rte_mbuf to allocate a raw mbuf (uninitialized). Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2016-05-17 08:31:33 +02:00
Jianfeng Tan	2c0eb46f51	virtio: fix segfault on Tx desc flags setup After the do-while loop, idx could be VQ_RING_DESC_CHAIN_END (32768) when it's the last vring desc buf we can get. Therefore, following expresssion could lead to a segfault error, as it tries to access beyond the desc memory boundary. start_dp[idx].flags &= ~VRING_DESC_F_NEXT; This bug could be reproduced easily with "set fwd txonly" in the guest PMD, where the dequeue on host is slower than the guest Tx, that running out of free desc buf is pretty easy. The fix is straightforward and easy, just remove it, as we have already set desc flags properly inside the do-while loop. Fixes: dd856dfcb9e ("virtio: use any layout on Tx") [Yuanhan Liu: commit log reword] Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 10:53:28 -07:00
Jianfeng Tan	e908312704	virtio: fix newline under debug mode Issue: output of appliations and debug info of DPDK may be mixed up in same line when enabling below debug options of virtio: CONFIG_RTE_LIBRTE_VIRTIO_DEBUG_INIT CONFIG_RTE_LIBRTE_VIRTIO_DEBUG_TX CONFIG_RTE_LIBRTE_VIRTIO_DEBUG_DRIVER This patch adds "\n" in the tail of definitions like PMD_RX_LOG, PMD_TX_LOG, and PMD_DRV_LOG, and removes some "\n" when using these macros. Fixes: c1f86306a026 ("virtio: add new driver") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 10:52:01 -07:00
Rich Lane	610e0a8b62	virtio: use zeroed memory for simple Tx header For simple TX the virtio-net header must be zeroed, but it was using memory that had been initialized with indirect descriptor tables. This resulted in "unsupported gso type" errors from librte_vhost. We can use the same memory for every descriptor to save cachelines in the vswitch. Fixes: 6dc5de3a ("virtio: use indirect ring elements") Signed-off-by: Rich Lane <rich.lane@bigswitch.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2016-04-06 12:27:57 +02:00
Kyle Larose	3eabd79c50	virtio: fix Rx ring descriptor starvation Virtio has an mbuf descriptor ring containing mbufs to be used for receiving traffic. When the host queues traffic to be sent to the guest, it consumes these descriptors. If none exist, it discards the packet. The virtio pmd allocates mbufs to the descriptor ring every time it successfully receives a packet. However, it never does it if it does not receive a valid packet. If the descriptor ring is exhausted, and the mbuf mempool does not have any mbufs free (which can happen for various reasons, such as queueing along the processing pipeline), then the receive call will not allocate any mbufs to the descriptor ring, and when it finishes, the descriptor ring will be empty. The ring being empty means that we will never receive a packet again, which means we will never allocate mbufs to the ring: we are stuck. Ultimately, the problem arises because there is a dependency between receiving packets and making the descriptor ring not be empty, and a dependency between the descriptor ring not being empty, and receiving packets. To fix the problem, this pakes makes virtio always try to allocate mbufs to the descriptor ring, if necessary, when polling for packets. Do this by removing the early exit if no packets were received. Since the packet loop later will do nothing if there are no packets, this is fine. I reproduced the problem by pushing packets through a pipelined systems (such as the client_server sample application) after artificially decreasing the size of the mbuf pool and introducing a delay in a secondary stage. Without the fix, the process stops receiving packets fairly quicky. With the fix, it continues to receive packets. Fixes: c1f86306a026 ("virtio: add new driver") Signed-off-by: Kyle Larose <klarose@sandvine.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-25 19:01:37 +01:00
Stephen Hemminger	17cbf09fe1	virtio: optimize Tx enqueue All the error checks in virtqueue_enqueue_xmit are already done by the caller. Therefore they can be removed to improve performance. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-16 19:05:35 +01:00
Stephen Hemminger	dd856dfcb9	virtio: use any layout on Tx Virtio supports a feature that allows sender to put transmit header prepended to data. It requires that the mbuf be writeable, correct alignment, and the feature has been negotiatied. If all this works out, then it will be the optimum way to transmit a single segment packet. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-16 19:05:25 +01:00
Stephen Hemminger	6dc5de3a6a	virtio: use indirect ring elements The virtio ring in QEMU/KVM is usually limited to 256 entries and the normal way that virtio driver was queuing mbufs required nsegs + 1 ring elements. By using the indirect ring element feature if available, each packet will take only one ring slot even for multi-segment packets. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-16 19:05:25 +01:00
Igor Ryzhov	64a7619ee8	virtio: remove broadcast packets from multicast statistics Signed-off-by: Igor Ryzhov <iryzhov@nfware.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Applied with coding standards fixes: Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2016-03-16 18:52:18 +01:00
Huawei Xie	3b1e3e4e36	virtio: fix descriptors pointing to the same buffer The virtio_net_hdr desc all pointed to the same buffer. It doesn't cause issue because in the simple TX mode we don't use the header. This patch makes the header desc point to different buffer. Fixes: b4ae9c505f2e ("virtio: optimize ring layout") Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-16 18:52:18 +01:00
Ravi Kerur	d6b324c00f	mbuf: get DMA address Macros RTE_MBUF_DATA_DMA_ADDR and RTE_MBUF_DATA_DMA_ADDR_DEFAULT are defined in each PMD driver file. Convert macros to inline functions and move them to common lib/librte_mbuf/rte_mbuf.h file. PMD drivers include rte_mbuf.h file directly/indirectly hence no additioanl header file inclusion is necessary. Signed-off-by: Ravi Kerur <rkerur@gmail.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2016-03-04 16:01:15 +01:00
Santosh Shukla	69d308e1c0	virtio: restrict vector Rx/Tx to x86 SSSE3 Temporary implementation to let virtio operate in non-vec mode for archs which doesn't support _ssse_ cpuflag. todo: 1) Move virtio_recv_pkts_vec() implementation to drivers/virtio/virtio_vec_<arch>.h file. 2) Remove use_simple_rxtx flag, so that virtio/virtio_vec_<arch>.h files to provide vectored/non-vectored rx/tx apis. Fixes: fc3d66212fed ("virtio: add vector Rx") Fixes: c121c8d6d31a ("virtio: add simple Tx") Fixes: 8d8393fb1861 ("virtio: pick simple Rx/Tx") Signed-off-by: Santosh Shukla <sshukla@mvista.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-03 14:00:28 +01:00
Yuanhan Liu	1905e101dc	virtio: retrieve header size from device setting The mergeable virtio net hdr format has been the standard and the only virtio net hdr format since virtio 1.0. Therefore, we can not hardcode hdr_size to "sizeof(struct virtio_net_hdr)" any more at virtio_recv_pkts(), otherwise, there would be a mismatch of hdr size from rte_vhost_enqueue_burst() and virtio_recv_pkts(), leading a packet corruption. Instead, we should retrieve it from hw->vtnet_hdr_size; we will do proper settings at eth_virtio_dev_init() in later patches. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	c47787cfaa	virtio: do not set vring address again at queue startup As we have already set up it at virtio_dev_queue_setup(), and a vq restart will not reset the settings. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Stephen Hemminger	43ffe8aa86	virtio: clean up Tx space checks The space check for transmit ring only needs a single conditional. I.e only need to recheck for space if there was no space in first check. This can help performance and simplifies loop. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2015-12-07 01:03:12 +01:00
Stephen Hemminger	c21835fab8	virtio: fix Rx mbuf initialization The virtio driver was not initializing all the fields in the receive mbuf. This would cause bugs where previous usage of mbuf would leave stale TCI and offload flags. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2015-12-07 01:03:12 +01:00
Harry van Haaren	76d4c652e0	virtio: add extended stats Add xstats() functions and statistic strings to virtio PMD. Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Maryam Tahhan <maryam.tahhan@intel.com>	2015-11-03 00:19:25 +01:00
Huawei Xie	8d8393fb18	virtio: pick simple Rx/Tx simple rx/tx func is chose when merge-able rx is disabled and user specifies single segment and no offload support. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:34:44 +01:00
Huawei Xie	fc3d66212f	virtio: add vector Rx With fixed avail ring, we don't need to get desc idx from avail ring. virtio driver only has to deal with desc ring. This patch uses vector instruction to accelerate processing desc ring. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:33:43 +01:00
Huawei Xie	cab0461234	virtio: fill Rx avail ring with blank mbufs Add software RX ring in virtqueue. Add fake_mbuf in virtqueue for wraparound processing. Fill avail ring with blank mbufs in virtio_dev_vring_start Add virtio_rxtx.h header file for RTE_VIRTIO_PMD_MAX_BURST. Would move all rx/tx related declarations into this header file in future. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:32:19 +01:00
Huawei Xie	b4ae9c505f	virtio: optimize ring layout In DPDK based switching environment, mostly vhost runs on a dedicated core while virtio processing in guest VMs runs on different cores. Take RX for example, with generic implementation, for each guest buffer, a) virtio driver allocates a descriptor from free descriptor list b) modify the entry of avail ring to point to allocated descriptor c) after packet is received, free the descriptor When vhost fetches the avail ring, it need to fetch the modified L1 cache from virtio core, which is a heavy cost in current CPU implementation. This idea of this optimization is: allocate the fixed descriptor for each entry of avail ring, so avail ring will always be the same during the run. This removes L1M cache transfer from virtio core to vhost core for avail ring. (Note we couldn't avoid the cache transfer for descriptors). Besides, descriptor allocation and free operation is eliminated. This also makes vector procesing possible to further accelerate the processing. This is the layout for the avail ring(take 256 ring entries for example), with each entry pointing to the descriptor with the same index. avail idx + \| +----+----+---+-------------+------+ \| 0 \| 1 \| 2 \| ... \| 254 \| 255 \| avail ring +-+--+-+--+-+-+---------+---+--+---+ \| \| \| \| \| \| \| \| \| \| \| \| v v v \| v v +-+--+-+--+-+-+---------+---+--+---+ \| 0 \| 1 \| 2 \| ... \| 254 \| 255 \| desc ring +----+----+---+-------------+------+ \| \| +----+----+---+-------------+------+ \| 0 \| 1 \| 2 \| \| 254 \| 255 \| used ring +----+----+---+-------------+------+ \| + This is the ring layout for TX. As we need one virtio header for each xmit packet, we have 128 slots available. ++ \|\| \|\| +-----+-----+-----+--------------+------+------+------+ \| 0 \| 1 \| ... \| 127 \|\| 128 \| 129 \| ... \| 255 \| avail ring +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| \| \| \|\| \| \| \| v v v \|\| v v v +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| 128 \| 129 \| ... \| 255 \|\| 128 \| 129 \| ... \| 255 \| desc ring for virtio_net_hdr +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| \| \| \|\| \| \| \| v v v \|\| v v v +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| 0 \| 1 \| ... \| 127 \|\| 0 \| 1 \| ... \| 127 \| desc ring for tx dat +-----+-----+-----+--------------+------+------+------+ \|\| \|\| ++ Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:31:42 +01:00
Stephen Hemminger	1e7bd2380f	virtio: fix Coverity unsigned warnings There are some places in virtio driver where uint16_t or int are used where it would be safer to use unsigned. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2015-10-21 16:14:02 +02:00

1 2

54 Commits