numam-dpdk

Author	SHA1	Message	Date
Jan Viktorin	13a1317d3b	pci: create device list and fallback on its members Now that rte_device is available, drivers can start using its members (numa, name) as well as link themselves into another rte_device list. As of now no one is using this list, but can be used for moving over all devices (pdev/vdev/Xdev) and perform bulk actions (like cleanup). Signed-off-by: Jan Viktorin <viktorin@rehivetech.com> [Shreyansh: Reword commit log for extra rte_device list] Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: David Marchand <david.marchand@6wind.com>	2016-10-03 16:34:03 +02:00
Jan Viktorin	2f3193cf0f	pci: inherit common driver in PCI driver Remove the 'name' member from rte_pci_driver and move to generic rte_driver. Most of the PMD drivers were initially using DRIVER_REGISTER_PCI(<name>..) as well as assigning a name to eth_driver.pci_drv.name member. In this patch, only the original DRIVER_REGISTER_PCI(<name>..) name has been populated into the rte_driver.name member - assignments through eth_driver has been removed. Signed-off-by: Jan Viktorin <viktorin@rehivetech.com> [Shreyansh: Rebase and expand changes to newly added files] Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: David Marchand <david.marchand@6wind.com>	2016-10-03 16:33:55 +02:00
Jan Viktorin	2695c6df69	eal: remove unused PMD types - All devices register themselfs by calling a kind of DRIVER_REGISTER_XXX. The PMD_REGISTER_DRIVER is not used anymore. - PMD_VDEV type is also not being used - can be removed from all VDEVs. Signed-off-by: Jan Viktorin <viktorin@rehivetech.com> Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: David Marchand <david.marchand@6wind.com>	2016-10-03 16:33:51 +02:00
Jan Viktorin	fe363dd425	drivers: use vdev registration All PMD_VDEV drivers can now use rte_vdev_driver instead of the rte_driver (which is embedded in the rte_vdev_driver). Signed-off-by: Jan Viktorin <viktorin@rehivetech.com> Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: David Marchand <david.marchand@6wind.com>	2016-10-03 16:33:48 +02:00
David Marchand	6751f6deb7	ethdev: get rid of device type Now that hotplug has been moved to eal, there is no reason to keep the device type in this layer. Signed-off-by: David Marchand <david.marchand@6wind.com> Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2016-10-03 16:33:39 +02:00
David Marchand	c830cb2954	drivers: use PCI registration macro Simplify crypto and ethdev pci drivers init by using newly introduced init macros and helpers. Those drivers then don't need to register as "rte_driver"s anymore. Exceptions: - virtio and mlx* use RTE_INIT directly as they have custom initialization steps. - VDEV devices are not modified - they continue to use PMD_REGISTER_DRIVER. Update documentation for replacing an example referring to PMD_REGISTER_DRIVER. Signed-off-by: David Marchand <david.marchand@6wind.com> Signed-off-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2016-10-03 16:33:23 +02:00
Pablo de Lara	2f45703c17	drivers: make driver names consistent As discussed in the past release, driver names are modified to be more consistent, and the future driver should follow this new convention. Driver names consist of: "driver category"_"driver folder name"_"optional extra name". For example: - Crypto null driver -> "crypto_null" - Network IXGBE VF driver -> "net_ixgbe_vf" Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2016-09-16 11:55:59 +02:00
Jianfeng Tan	e8df94b86f	net/virtio-user: fix inconsistent name The commit `cb6696d220` ("drivers: update registration macro usage") changes the name from virtio-user to virtio_user, because hyphen cannot be used in a C symbol name. However, this commit does not update the strings in docs and source code, which could lead to failure to start this device as per the docs. This patch updates related strings in the docs and source code. Fixes: `cb6696d220` ("drivers: update registration macro usage") Reported-by: Tiwei Bie <tiwei.bie@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-22 11:53:32 +02:00
Yuanhan Liu	834ac655ba	net/virtio: fix crash on null dereference The rxq/txq for the queue_release callback could be NULL, say when rte_eth_dev_configure() fails that the queue is not setup at all. Do a simple NULL check would fix the crash issue. Fixes: `01ad44fd37` ("net/virtio: split Rx/Tx queue") Reported-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-22 00:30:08 +02:00
Olivier Matz	25f80d1087	net/virtio: fix packet corruption The support of virtio-user changed the way the mbuf dma address is retrieved, using a physical address in case of virtio-pci and a virtual address in case of virtio-user. This change introduced some possible memory corruption in packets, replacing: m->buf_physaddr + RTE_PKTMBUF_HEADROOM by: m->buf_physaddr + m->data_off (through a macro) This patch fixes this issue, restoring the original behavior. By the way, it also rework the macros, adding a "VIRTIO_" prefix and API comments. Fixes: `f24f8f9fee` ("net/virtio: allow virtual address to fill vring descriptors") Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-22 00:27:29 +02:00
Maxime Coquelin	9cca159efa	net/virtio-user: fix build with gcc 6 The error is reported using test build script: $ scripts/test-build.sh x86_64-native-linuxapp-gcc ... drivers/net/virtio/virtio_user_ethdev.c:345:2: error: this ‘if’ clause does not guard... [-Werror=misleading-indentation] if (rte_kvargs_count(kvlist, VIRTIO_USER_ARG_PATH) == 1) ^~ Fixes: `404bd6bfe3` ("net/virtio-user: fix return value not checked") Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-15 22:15:21 +02:00
David Marchand	98dd7ad4da	net/virtio: move PCI device ids to the driver Reused defines from the driver. Used RTE_PCI_DEVICE in place of RTE_PCI_DEV_ID_DECL* stuff. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-11 17:41:10 +02:00
Ferruh Yigit	dbd8bdfc04	net/virtio: fix 32-bit build with gcc 6 This is for target i686-native-linuxapp-gcc and gcc6, Compilation error is: In file included from include/rte_mempool.h:77:0, from drivers/net/virtio/virtio_rxtx_simple.c: In function `virtio_xmit_pkts_simple': include/rte_memcpy.h:551:2: error: array subscript is above array bounds rte_mov16((uint8_t )dst + 1 16, (const uint8_t )src + 1 16); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Call stack is as following: virtio_xmit_pkts_simple virtio_xmit_cleanup rte_mempool_put_bulk rte_mempool_generic_put __mempool_generic_put rte_memcpy The array used as source buffer in virtio_xmit_cleanup (free) is a pointer array with 32 elements, in 32bit this makes 128 bytes. in rte_memcpy() implementation, there a code piece as following: if (size > 256) { rte_move128(...); rte_move128(...); <--- [1] .... } The compiler traces the array all through the call stack and knows the size of array is 128 and generates a warning on above [1] which tries to access beyond byte 128. But unfortunately it ignores the "(size > 256)" check. Giving a hint to compiler that variable "size" is related to the size of the source buffer fixes compiler warning. Fixes: `863bfb4744` ("mempool: optimize copy in cache") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-11 07:41:09 +02:00
Jianfeng Tan	3bd60a27e9	net/virtio: fix null pointer dereference There is a logic bug in this code, that could lead to null pointer dereference when cvq is NULL. Fix this problem by changing logic && to logic \|\|. >> CID 127480: Null pointer dereferences (FORWARD_NULL) >> Dereferencing null pointer "cvq". if (!cvq && !cvq->vq) { ... } Coverity issue: 127480 Fixes: `01ad44fd37` ("net/virtio: split Rx/Tx queue") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-05 14:14:40 +02:00
Jianfeng Tan	542849c09c	net/virtio-user: fix string unterminated When use strcpy() to copy string with length exceeding the last parameter of strcpy(), it may lead to the destination string unterminated. We replaced strncpy with snprintf to make sure it's NULL terminated. Coverity issue: 127476 Fixes: `ce2eabdd43` ("net/virtio-user: add virtual device") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-05 13:30:25 +02:00
Jianfeng Tan	14f06474b8	net/virtio-user: fix resource leaks The return value by rte_kvargs_parse is not free(d), which leads to memory leak. Coverity issue: 127482 Fixes: `ce2eabdd43` ("net/virtio-user: add virtual device") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-05 13:30:24 +02:00
Jianfeng Tan	80ceb374e2	net/virtio-user: fix string overflow When parsing /proc/self/maps to get hugepage information, the string was being copied with strcpy(), which could, theoretically but in fact not possiblly, overflow the destination buffer. Anyway, to avoid the false alarm, we replaced strncpy with snprintf for safely copying the strings. Coverity issue: 127484 Fixes: `6a84c37e39` ("net/virtio-user: add vhost-user adapter layer") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-05 13:30:24 +02:00
Jianfeng Tan	404bd6bfe3	net/virtio-user: fix return value not checked When return values of function calls are not checked, Coverity will report errors like: if (rte_kvargs_count(kvlist, VIRTIO_USER_ARG_PATH) == 1) >>> CID 127477: (CHECKED_RETURN) >>> Calling "rte_kvargs_process" without checking return value (as is done elsewhere 25 out of 30 times). rte_kvargs_process(kvlist, VIRTIO_USER_ARG_PATH, &get_string_arg, &path); Coverity issue: 127477, 127478 Fixes: `ce2eabdd43` ("net/virtio-user: add virtual device") Fixes: `6a84c37e39` ("net/virtio-user: add vhost-user adapter layer") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-05 13:30:00 +02:00
Jianfeng Tan	17450351ff	net/virtio-user: fix build on Suse 11 On some older systems, such as SUSE 11, the compiling error shows as: .../dpdk/drivers/net/virtio/virtio_user/virtio_user_dev.c:67:22: error: ‘O_CLOEXEC’ undeclared (first use in this function) The fix is to use EFD_CLOEXEC, which is defined in sys/eventfd.h, instead of O_CLOEXEC which needs _GNU_SOURCE defined on some old systems. Fixes: `37a7eb2ae8` ("net/virtio-user: add device emulation layer") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-07-04 04:08:41 +02:00
Pablo de Lara	44e32a671d	drivers: add virtio and xenvirt parameters infos Virtio and Xenvirt are two virtual device drivers that admit arguments, so DRIVER_REGISTER_PARAM_STRING should be used in them. Fixes: `cb6696d220` ("drivers: update registration macro usage") Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2016-07-10 14:51:09 +02:00
Pablo de Lara	bae696ebd4	drivers: remove static driver names Since now the PMD_REGISTER_DRIVER macro sets the driver names, there is no need to have the rte_driver structure setting it statically, as it will get overridden. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2016-07-10 14:51:09 +02:00
Neil Horman	cb6696d220	drivers: update registration macro usage Modify the PMD_REGISTER_DRIVER macro, adding a name argument to it. The addition of a name argument creates a token that can be used for subsequent macros in the creation of unique symbol names to export additional bits of information for use by the pmdinfogen tool. For example: PMD_REGISTER_DRIVER(ena_driver, ena); registers the ena_driver struct as it always did, and creates a symbol const char this_pmd_name0[] __attribute__((used)) = "ena"; which pmdinfogen can search for and extract. The subsequent macro DRIVER_REGISTER_PCI_TABLE(ena, ena_pci_id_map); creates a symbol const char ena_pci_tbl_export[] __attribute__((used)) = "ena_pci_id_map"; Which allows pmdinfogen to find the pci table of this driver Using this pattern, we can export arbitrary bits of information. pmdinfo uses this information to extract hardware support from an object file and create a json string to make hardware support info discoverable later. Signed-off-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: Remy Horton <remy.horton@intel.com>	2016-07-06 23:21:40 +02:00
Jianfeng Tan	d911c94d25	net/virtio-user: fix build with icc Implicit int to enum conversion is not allowed when icc is used as the compiler. It raises the compiling error like, drivers/net/virtio/virtio_user/vhost_user.c(257): error #188: enumerated type mixed with another type msg.request = req; ^ The fix is simple, change the type of parameter req to enum vhost_user_request. Fixes: `6a84c37e39` ("net/virtio-user: add vhost-user adapter layer") Suggested-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-30 07:46:29 +02:00
Remy Horton	d085232a14	ethdev: remove redundant id field in xstats name lookup For all drivers that currently implement xstats, the id field in the rte_eth_stats_name structure equals the entry's array index. This patch eliminates the redundant id field as a direct index lookup is faster than a search for the matching id field. Suggested-by: Olivier Matz <olivier.matz@6wind.com> Signed-off-by: Remy Horton <remy.horton@intel.com>	2016-07-01 16:09:06 +02:00
Thomas Monjalon	f8e9cbe2aa	mk: fix internal dependencies Some libraries were missing their dependency on eal, mbuf, mempool, ring and kvargs. It is revealed by the linker option "-z defs". Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-06-29 13:33:01 +02:00
Thomas Monjalon	479e160b2e	net/virtio-user: fix 32-bit build The compilation for 32-bit fails when CONFIG_RTE_VIRTIO_USER is enabled: drivers/net/virtio/virtio_user_ethdev.c:84:47: error: format ‘%llu’ expects argument of type ‘long long unsigned int’, but argument 5 has type ‘size_t {aka unsigned int}’ Fixes: `e9efa4d938` ("net/virtio-user: add new virtual PCI driver") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-06-23 22:54:41 +02:00
Huawei Xie	b81026f1e7	net/virtio: fix used index retrieved only once In the following loop: while (vq->vq_used_cons_idx != vq->vq_ring.used->idx) { ... } There is no external function call or any explict memory barrier in the loop, the re-read of used->idx might be optimized and only be retrieved once. Use of voaltile normally should be prohibited, and access_once is Linux kernel's style to handle this issue; Once we have that macro in DPDK, we could change to that style. virtio_recv_mergable_pkts might also have the same issue, so fix it as well. Fixes: `823ad64795` ("virtio: support multiple queues") Fixes: `13ce5e7eb9` ("virtio: mergeable buffers") Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Yuanhan Liu	7e1eb993f2	net/virtio: fix crash on querying xstats Trying to access xstats_names after "if (xstats_names == NULL)" is obviously wrong, which would result to a crash while running "show port xstats 0" in testpmd with virtio PMD. The fix is straightforward; just reverse the check. Fixes: `baf91c395b` ("net/virtio: fetch extended statistics with integer ids") Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	1b69528e5f	net/virtio-user: handle control queue in driver In virtio-user driver, when notify ctrl-queue, invoke API of virtio-user device emulation to handle ctrl-q command. Besides, multi-queue requires ctrl-queue and ctrl-queue will be enabled automatically when multi-queue is specified. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	f9b9d1a557	net/virtio-user: add multiple queues in device emulation The main purpose of this patch is to enable multi-queue. But multi-queue requires ctrl-queue so that driver can send how many queues will be enabled through ctrl-queue messages. So we partially implement ctrl-queue to handle control command with class of VIRTIO_NET_CTRL_MQ and with cmd of VIRTIO_NET_CTRL_MQ_VQ_PAIRS_SET to handle mq support. This patch provides a function, virtio_user_handle_cq(), for driver to handle ctrl-queue messages. Besides, multi-queue requires VIRTIO_NET_F_MQ and VIRTIO_NET_F_CTRL_VQ are enabled when we do feature negotiation. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	0b6df936c8	net/virtio-user: add multiple queues in vhost-user adapter This patch mainly adds method in vhost user adapter to communicate enable/disable queues messages with vhost user backend, aka, VHOST_USER_SET_VRING_ENABLE. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	ce2eabdd43	net/virtio-user: add virtual device Add a new virtual device named virtio-user, which can be used just like eth_ring, eth_null, etc. To reuse the code of original virtio, we do some adjustment in virtio_ethdev.c, such as remove key _static_ of eth_virtio_dev_init() so that it can be reused in virtual device; and we add some check to make sure it will not crash. Configured parameters include: - queues (optional, 1 by default), number of queue pairs, multi-queue not supported for now. - cq (optional, 0 by default), not supported for now. - mac (optional), random value will be given if not specified. - queue_size (optional, 256 by default), size of virtqueues. - path (madatory), path of vhost user. When enable CONFIG_RTE_VIRTIO_USER (enabled by default), the compiled library can be used in both VM and container environment. Examples: path_vhost=<path_to_vhost_user> # use vhost-user as a backend sudo ./examples/l2fwd/build/l2fwd -c 0x100000 -n 4 \ --socket-mem 0,1024 --no-pci --file-prefix=l2fwd \ --vdev=virtio-user0,mac=00:01:02:03:04:05,path=$path_vhost -- -p 0x1 Known issues: - Control queue and multi-queue are not supported yet. - Cannot work with --huge-unlink. - Cannot work with no-huge. - Cannot work when there are more than VHOST_MEMORY_MAX_NREGIONS(8) hugepages. - Root privilege is a must (mainly becase of sorting hugepages according to physical address). - Applications should not use file name like HUGEFILE_FMT ("%smap_%d"). - Cannot work with vhost-net backend. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	e9efa4d938	net/virtio-user: add new virtual PCI driver This patch implements another new instance of struct virtio_pci_ops to drive the virtio-user virtual device. Instead of rd/wr ioport or PCI configuration space, this virtual pci driver will rd/wr the virtual device struct virtio_user_hw, and when necessary, invokes APIs provided by device emulation later to start/stop the device. ---------------------- \| ------------------ \| \| \| virtio driver \| \|----> (virtio_user_ethdev.c) \| ------------------ \| \| \| \| \| ------------------ \| ------> virtio-user PMD \| \| device emulate \| \| \| \| \| \| \| \| vhost adapter \| \| \| ------------------ \| ---------------------- \| \| \| ------------------ \| vhost backend \| ------------------ Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	37a7eb2ae8	net/virtio-user: add device emulation layer Few device emulation layer functions are added for virtio driver to call: - virtio_user_start_device() - virtio_user_stop_device() - virtio_user_dev_init() - virtio_user_dev_uninit() These functions will get called by virtio driver, and they call vhost adapter layer functions to implement the functionality. All stats related to virtual user device as logged in virtio_user_dev structure. ---------------------- \| ------------------ \| \| \| virtio driver \| \| \| ------------------ \| \| \| \| \| ------------------ \| ------> virtio-user PMD \| \| device emulate \|-\|----> (virtio_user_dev.c, virtio_user_dev.h) \| \| \| \| \| \| vhost adapter \| \| \| ------------------ \| ---------------------- \| \| \| ------------------ \| vhost backend \| ------------------ Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	6a84c37e39	net/virtio-user: add vhost-user adapter layer This patch provides vhost adapter layer implementation. Two main help functions are provided to upper layer (device emulation): - vhost_user_setup(), to set up vhost user backend; - vhost_user_sock(), to talk with vhost user backend. ---------------------- \| ------------------ \| \| \| virtio driver \| \| \| ------------------ \| \| \| \| \| ------------------ \| ------> virtio-user PMD \| \| device emulate \| \| \| \| \| \| \| \| vhost adapter \|-\|----> (vhost_user.c) \| ------------------ \| ---------------------- \| \| -------------- --> (vhost-user protocol) \| ------------------ \| vhost backend \| ------------------ Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	f24f8f9fee	net/virtio: allow virtual address to fill vring descriptors This patch is related to how to calculate relative address for vhost backend. The principle is that: based on one or multiple shared memory regions, vhost maintains a reference system with the frontend start address, backend start address, and length for each segment, so that each frontend address (GPA, Guest Physical Address) can be translated into vhost-recognizable backend address. To make the address translation efficient, we need to maintain as few regions as possible. In the case of VM, GPA is always locally continuous. But for some other case, like virtio-user, GPA continuous is not guaranteed, therefore, we use virtual address here. It basically means: a. when set_base_addr, VA address is used; b. when preparing RX's descriptors, VA address is used; c. when transmitting packets, VA is filled in TX's descriptors; d. in TX and CQ's header, VA is used. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Jianfeng Tan	595454c5ac	net/virtio: hide vring address check inside PCI ops This patch moves phys addr check from virtio_dev_queue_setup to pci ops. To make that happen, make sure virtio_ops.setup_queue return the result if we pass through the check. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Huawei Xie	7e40200c56	net/virtio: fix crash when no devargs We skip kernel managed virtio devices, if it isn't whitelisted. Before checking if the virtio device is whitelisted, check if devargs is specified. Fixes: `ac5e1d838d` ("virtio: skip error when probing kernel managed device") Reported-by: Vincent Li <vincent.mc.li@gmail.com> Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 09:47:12 +02:00
Huawei Xie	01ad44fd37	net/virtio: split Rx/Tx queue We keep a common vq structure, containing only vq related fields, and then split others into RX, TX and control queue respectively. Signed-off-by: Huawei Xie <huawei.xie@intel.com> [Jianfeng Tan: found and fixed 2 bugs] Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 06:10:54 +02:00
Olivier Matz	88c107840d	net/virtio: check mbuf is direct when using any layout The commit `dd856dfcb9` introduced an optimization that prepends virtio header to mbuf data. It can be used when the tx mbuf is writeable, so we need to check that the mbuf is direct (i.e. it embeds its own data). Fixes: `dd856dfcb9` ("virtio: use any layout on Tx") Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-06-22 06:10:54 +02:00
Remy Horton	e2aae1c1ce	ethdev: remove name from extended statistic fetch The current extended ethernet statistics fetching involve doing several string operations, which causes performance issues if there are lots of statistics and/or network interfaces. This patch changes the test-pmd and proc_info applications to use the new xstats API, and removes deprecated code associated with the old API. Signed-off-by: Remy Horton <remy.horton@intel.com>	2016-06-16 18:12:00 +02:00
Remy Horton	baf91c395b	net/virtio: fetch extended statistics with integer ids The current extended ethernet statistics fetching involve doing several string operations, which causes performance issues if there are lots of statistics and/or network interfaces. This patch changes the virtio driver to use the new API that seperates name string and value queries. Signed-off-by: Remy Horton <remy.horton@intel.com>	2016-06-16 17:57:29 +02:00
David Marchand	281ccccb1a	virtio: fix PCI accesses for ppc64 in legacy mode Although ppc supports both endianesses, qemu supposes that the cpu is big endian and enforces this for the virtio-net stuff. Fix PCI accesses in legacy mode. Only ppc64le is supported at the moment. Signed-off-by: David Marchand <david.marchand@6wind.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2016-06-15 19:06:53 +02:00
Jan Viktorin	53c3c30c11	pci: allow to override sysfs path The SYSFS_PCI_DEVICES is a constant that makes the PCI testing difficult as it points to an absolute path. We remove using this constant and introducing a function pci_get_sysfs_path that gives the same value. However, the user can pass a SYSFS_PCI_DEVICES env variable to override the path. It is now possible to create a fake sysfs hierarchy for testing. Signed-off-by: Jan Viktorin <viktorin@rehivetech.com>	2016-06-13 21:08:48 +02:00
Olivier Matz	fbfd99551c	mbuf: add raw allocation function Many drivers provide their own implementation of rte_mbuf_raw_alloc(), duplicating the code. Introduce a new public function in rte_mbuf to allocate a raw mbuf (uninitialized). Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2016-05-17 08:31:33 +02:00
Jianfeng Tan	2963d99a8b	virtio: fix memory leak of virtqueue memzones When virtio was proposed in DPDK, there is no API to free memzones. But this has changed since rte_memzone_free() has been implemented by commit `ff909fe21f` ("mem: introduce memzone freeing"). This patch is to make sure memzones in struct virtqueue, like mz and virtio_net_hdr_mz, are freed when queue is released or setup fails. Fixes: `c1f86306a0` ("virtio: add new driver") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 11:22:39 -07:00
Jianfeng Tan	4166bbf631	virtio: simplify queue allocation Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 11:22:33 -07:00
Jianfeng Tan	62a785a68e	virtio: fix overwritten driver flags The "drv_flags" is set with device as the input, which means different device (say, modern vs legacy) could end up with a different value. And the fact that "drv_flags" is shared by all devices means that every time we add a new device, it simply overwrites the value configured from the last device. Therefore, when two virtio devices have different flags, it may lead to wrong result, such as virtio would set irq config when it's not supported. Making the flag per device (using "dev->data->dev_flags") could let us have different value for each device, which would avoid the above issue. Fixes: `da978dfdc4` ("virtio: use port IO to get PCI resource") Reported-by: David Marchand <david.marchand@6wind.com> Suggested-by: David Marchand <david.marchand@6wind.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: David Marchand <david.marchand@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 10:57:10 -07:00
Huawei Xie	e928fd0bb0	virtio: optimize avail ring update Avail ring is updated by the frontend and consumed by the backend. There are frequent core to core cache transfers for the avail ring. This optmization avoids avail ring entry index update if the entry already holds the same value. As DPDK virtio PMD implements FIFO free descriptor list (also for performance reason of CACHE), in which descriptors are allocated from the head and freed to the tail, with this patch in most cases avail ring will remain the same, then it would be valid in both caches of frontend and backend. Suggested-by: Michael S. Tsirkin <mst@redhat.com> Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 10:57:10 -07:00
Huawei Xie	fac0b224c8	virtio: fix mbuf headroom size check check merge-able header as it is supported. previously we don't support merge-able feature, so non merge-able header is checked. Fixes: `13ce5e7eb9` ("virtio: mergeable buffers") Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 10:53:28 -07:00
Jianfeng Tan	2c0eb46f51	virtio: fix segfault on Tx desc flags setup After the do-while loop, idx could be VQ_RING_DESC_CHAIN_END (32768) when it's the last vring desc buf we can get. Therefore, following expresssion could lead to a segfault error, as it tries to access beyond the desc memory boundary. start_dp[idx].flags &= ~VRING_DESC_F_NEXT; This bug could be reproduced easily with "set fwd txonly" in the guest PMD, where the dequeue on host is slower than the guest Tx, that running out of free desc buf is pretty easy. The fix is straightforward and easy, just remove it, as we have already set desc flags properly inside the do-while loop. Fixes: `dd856dfcb9` ("virtio: use any layout on Tx") [Yuanhan Liu: commit log reword] Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 10:53:28 -07:00
Jianfeng Tan	e908312704	virtio: fix newline under debug mode Issue: output of appliations and debug info of DPDK may be mixed up in same line when enabling below debug options of virtio: CONFIG_RTE_LIBRTE_VIRTIO_DEBUG_INIT CONFIG_RTE_LIBRTE_VIRTIO_DEBUG_TX CONFIG_RTE_LIBRTE_VIRTIO_DEBUG_DRIVER This patch adds "\n" in the tail of definitions like PMD_RX_LOG, PMD_TX_LOG, and PMD_DRV_LOG, and removes some "\n" when using these macros. Fixes: `c1f86306a0` ("virtio: add new driver") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-05-10 10:52:01 -07:00
Rich Lane	610e0a8b62	virtio: use zeroed memory for simple Tx header For simple TX the virtio-net header must be zeroed, but it was using memory that had been initialized with indirect descriptor tables. This resulted in "unsupported gso type" errors from librte_vhost. We can use the same memory for every descriptor to save cachelines in the vswitch. Fixes: `6dc5de3a` ("virtio: use indirect ring elements") Signed-off-by: Rich Lane <rich.lane@bigswitch.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2016-04-06 12:27:57 +02:00
Marc Sune	1131900006	ethdev: use constants for link duplex Some duplex values are replaced from 0 to half-duplex when link is down. Some drivers are still using their own constants for duplex modes. Signed-off-by: Marc Sune <marcdevel@gmail.com>	2016-04-01 21:38:34 +02:00
Thomas Monjalon	09419f235e	ethdev: use constants for link state Define and use ETH_LINK_UP and ETH_LINK_DOWN where appropriate. Signed-off-by: Marc Sune <marcdevel@gmail.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-04-01 21:38:34 +02:00
Kyle Larose	3eabd79c50	virtio: fix Rx ring descriptor starvation Virtio has an mbuf descriptor ring containing mbufs to be used for receiving traffic. When the host queues traffic to be sent to the guest, it consumes these descriptors. If none exist, it discards the packet. The virtio pmd allocates mbufs to the descriptor ring every time it successfully receives a packet. However, it never does it if it does not receive a valid packet. If the descriptor ring is exhausted, and the mbuf mempool does not have any mbufs free (which can happen for various reasons, such as queueing along the processing pipeline), then the receive call will not allocate any mbufs to the descriptor ring, and when it finishes, the descriptor ring will be empty. The ring being empty means that we will never receive a packet again, which means we will never allocate mbufs to the ring: we are stuck. Ultimately, the problem arises because there is a dependency between receiving packets and making the descriptor ring not be empty, and a dependency between the descriptor ring not being empty, and receiving packets. To fix the problem, this pakes makes virtio always try to allocate mbufs to the descriptor ring, if necessary, when polling for packets. Do this by removing the early exit if no packets were received. Since the packet loop later will do nothing if there are no packets, this is fine. I reproduced the problem by pushing packets through a pipelined systems (such as the client_server sample application) after artificially decreasing the size of the mbuf pool and introducing a delay in a secondary stage. Without the fix, the process stops receiving packets fairly quicky. With the fix, it continues to receive packets. Fixes: `c1f86306a0` ("virtio: add new driver") Signed-off-by: Kyle Larose <klarose@sandvine.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-25 19:01:37 +01:00
Huawei Xie	0bb159ad74	virtio: remove redundant function names in log Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2016-03-16 19:05:46 +01:00
Stephen Hemminger	17cbf09fe1	virtio: optimize Tx enqueue All the error checks in virtqueue_enqueue_xmit are already done by the caller. Therefore they can be removed to improve performance. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-16 19:05:35 +01:00
Stephen Hemminger	dd856dfcb9	virtio: use any layout on Tx Virtio supports a feature that allows sender to put transmit header prepended to data. It requires that the mbuf be writeable, correct alignment, and the feature has been negotiatied. If all this works out, then it will be the optimum way to transmit a single segment packet. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-16 19:05:25 +01:00
Stephen Hemminger	6dc5de3a6a	virtio: use indirect ring elements The virtio ring in QEMU/KVM is usually limited to 256 entries and the normal way that virtio driver was queuing mbufs required nsegs + 1 ring elements. By using the indirect ring element feature if available, each packet will take only one ring slot even for multi-segment packets. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-03-16 19:05:25 +01:00
Igor Ryzhov	64a7619ee8	virtio: remove broadcast packets from multicast statistics Signed-off-by: Igor Ryzhov <iryzhov@nfware.com> Acked-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Applied with coding standards fixes: Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2016-03-16 18:52:18 +01:00
Huawei Xie	3b1e3e4e36	virtio: fix descriptors pointing to the same buffer The virtio_net_hdr desc all pointed to the same buffer. It doesn't cause issue because in the simple TX mode we don't use the header. This patch makes the header desc point to different buffer. Fixes: `b4ae9c505f` ("virtio: optimize ring layout") Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-16 18:52:18 +01:00
Bernard Iremonger	c680a4a88c	virtio: fix crash in statistics functions This initialisation of nb_rx_queues and nb_tx_queues has been removed from eth_virtio_dev_init. The nb_rx_queues and nb_tx_queues were being initialised in eth_virtio_dev_init before the tx_queues and rx_queues arrays were allocated. The arrays are allocated when the ethdev port is configured and the nb_tx_queues and nb_rx_queues are initialised. If any of the following functions were called before the ethdev port was configured there was a segmentation fault because rx_queues and tx_queues were NULL: rte_eth_stats_get rte_eth_stats_reset rte_eth_xstats_get rte_eth_xstats_reset Fixes: `823ad64795` ("virtio: support multiple queues") Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-16 18:52:18 +01:00
Jianfeng Tan	9a0615af77	virtio: fix restart Fix the issue that virtio device cannot be started after stopped. The field, hw->started, should be changed by virtio_dev_start/stop instead of virtio_dev_close. Fixes: `a85786dc81` ("virtio: fix states handling during initialization") Reported-by: Pavel Fedin <p.fedin@samsung.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Pavel Fedin <p.fedin@samsung.com>	2016-03-16 18:52:18 +01:00
Yuanhan Liu	36ea36efb4	virtio: fix query of legacy features Declare dst as type uint32_t instead of uint64_t, otherwise, we will get a random upper 32 bit feature bits, as the following io port read reads lower 32 bit only. It could lead a feature bits that include VIRTIO_F_VERSION_1 (the 32th bit) for legacy virtio, which is obviously wrong. Fixes: `b8f04520ad` ("virtio: use PCI ioport API") Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com> Reviewed-by: David Marchand <david.marchand@6wind.com>	2016-03-14 23:16:15 +01:00
Huawei Xie	ac5e1d838d	virtio: skip error when probing kernel managed device virtio PMD could use IO port to configure the virtio device without using UIO/VFIO driver in legacy mode. There are two issues with previous implementation: 1) virtio PMD will take over the virtio device(s) blindly even if not intended for DPDK. 2) driver conflict between virtio PMD and virtio-net kernel driver. This patch checks if there is kernel driver other than UIO/VFIO managing the virtio device before using port IO. If legacy_virtio_resource_init fails and kernel driver other than VFIO/UIO is managing the device, return 1 to tell the upper layer we don't take over this device. For all other IO port mapping errors, return -1. Note than if VFIO/UIO fails, now we don't fall back to port IO. Fixes: `da978dfdc4` ("virtio: use port IO to get PCI resource") Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2016-03-10 00:36:51 +01:00
Ravi Kerur	d6b324c00f	mbuf: get DMA address Macros RTE_MBUF_DATA_DMA_ADDR and RTE_MBUF_DATA_DMA_ADDR_DEFAULT are defined in each PMD driver file. Convert macros to inline functions and move them to common lib/librte_mbuf/rte_mbuf.h file. PMD drivers include rte_mbuf.h file directly/indirectly hence no additioanl header file inclusion is necessary. Signed-off-by: Ravi Kerur <rkerur@gmail.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2016-03-04 16:01:15 +01:00
Santosh Shukla	69d308e1c0	virtio: restrict vector Rx/Tx to x86 SSSE3 Temporary implementation to let virtio operate in non-vec mode for archs which doesn't support _ssse_ cpuflag. todo: 1) Move virtio_recv_pkts_vec() implementation to drivers/virtio/virtio_vec_<arch>.h file. 2) Remove use_simple_rxtx flag, so that virtio/virtio_vec_<arch>.h files to provide vectored/non-vectored rx/tx apis. Fixes: `fc3d66212f` ("virtio: add vector Rx") Fixes: `c121c8d6d3` ("virtio: add simple Tx") Fixes: `8d8393fb18` ("virtio: pick simple Rx/Tx") Signed-off-by: Santosh Shukla <sshukla@mvista.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-03 14:00:28 +01:00
David Marchand	b8f04520ad	virtio: use PCI ioport API Move all os / arch specifics to eal. Signed-off-by: David Marchand <david.marchand@6wind.com> Reviewed-by: Santosh Shukla <sshukla@mvista.com> Tested-by: Santosh Shukla <sshukla@mvista.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-16 22:55:44 +01:00
David Marchand	7a66c72d6c	virtio: fix check when mapping PCI resources According to the api, rte_eal_pci_map_device is only successful when returning 0. Fixes: `6ba1f63b5a` ("virtio: support specification 1.0") Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-16 22:55:44 +01:00
David Marchand	25294cd3a6	virtio: fix FreeBSD build Fixes: `c52afa68d7` ("virtio: move left PCI stuff in the right file") Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-16 22:55:44 +01:00
Huawei Xie	693f715da4	remove extra parentheses in return statement fix the error reported by checkpatch: "ERROR: return is not a function, parentheses are not required" remove parentheses in return like: "return (logical expressions)" remove parentheses in return a function like: "return (rte_mempool_lookup(...))" Fixes: `6307b909b8` ("lib: remove extra parenthesis after return") Signed-off-by: Huawei Xie <huawei.xie@intel.com>	2016-02-10 15:47:50 +01:00
Yuanhan Liu	b86af7b1b5	virtio: move ioport macros virtio_pci.c is the only file references macros VIRTIO_READ/WRITE_REG_X. Move them there. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:50 +01:00
Yuanhan Liu	6ba1f63b5a	virtio: support specification 1.0 Modern (v1.0) virtio pci device defines several pci capabilities. Each cap has a configure structure corresponding to it, and the cap.bar and cap.offset fields tell us where to find it. Firstly, we map the pci resources by rte_eal_pci_map_device(). We then could easily locate a cfg structure by: cfg_addr = dev->mem_resources[cap.bar].addr + cap.offset; Therefore, the entrance of enabling modern (v1.0) pci device support is to iterate the pci capability lists, and to locate some configs we care; and they are: - common cfg For generic virtio and virtqueue configuration, such as setting/getting features, enabling a specific queue, and so on. - nofity cfg Combining with `queue_notify_off' from common cfg, we could use it to notify a specific virt queue. - device cfg Where virtio_net_config structure is located. - isr cfg Where to read isr (interrupt status). If any of above cap is not found, we fallback to the legacy virtio handling. If succeed, hw->vtpci_ops is assigned to modern_ops, where all operations are implemented by reading/writing a (or few) specific configuration space from above 4 cfg structures. And that's basically how this patch works. Besides those changes, virtio 1.0 introduces a new status field: FEATURES_OK, which is set after features negotiation is done. Last, set the VIRTIO_F_VERSION_1 feature flag. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:50 +01:00
Yuanhan Liu	1905e101dc	virtio: retrieve header size from device setting The mergeable virtio net hdr format has been the standard and the only virtio net hdr format since virtio 1.0. Therefore, we can not hardcode hdr_size to "sizeof(struct virtio_net_hdr)" any more at virtio_recv_pkts(), otherwise, there would be a mismatch of hdr size from rte_vhost_enqueue_burst() and virtio_recv_pkts(), leading a packet corruption. Instead, we should retrieve it from hw->vtnet_hdr_size; we will do proper settings at eth_virtio_dev_init() in later patches. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	3891f233f7	virtio: switch to 64 bit features Switch to 64 bit features, which virtio 1.0 supports. While legacy virtio only supports 32 bit features, it complains aloud and quit when trying to setting > 32 bit features. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	c52afa68d7	virtio: move left PCI stuff in the right file virtio_pci.c is a more proper place for pci stuff; virtio_ethdev is not. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	d5bbeefca8	virtio: introduce PCI implementation structure Introduce struct virtio_pci_ops, to let legacy virtio (v0.95) and modern virtio (1.0) have different implementation regarding to a specific pci action, such as read host status. With that, this patch reimplements all exported pci functions, in a way like: vtpci_foo_bar(struct virtio_hw *hw) { hw->vtpci_ops->foo_bar(hw); } So that we need pay attention to those pci related functions only while adding virtio 1.0 support. This patch introduced a new vtpci function, vtpci_init(), to do proper virtio pci settings. It's pretty simple so far: just sets hw->vtpci_ops to legacy_ops as we don't support 1.0 yet. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	4c2277ff45	virtio: define offset as size_t type offset arg of vtpci_read/write_dev_config is derived from offsetof(), which is of size_t type, instead of uint64_t. So, define it as size_t type. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	c47787cfaa	virtio: do not set vring address again at queue startup As we have already set up it at virtio_dev_queue_setup(), and a vq restart will not reset the settings. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	6ed346a462	virtio: fix wrong queue index We should provide VIRTIO_PCI_QUEUE_SEL with vq->vq_queue_idx, but not vq->queue_id. vq->queue_id is the queue id from rte_eth_rx/tx_queue_setup(), which always starts from 0 no matter which queue it is. However, for virtio, even number is for RX queue, and odd number is for TX queue. Fixes: `5382b188fb` ("virtio: add queue release") Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2015-12-09 22:02:33 +01:00
Igor Ryzhov	e1cf0d0853	ethdev: fix reset of Rx mbuf allocation failures The rx_mbuf_alloc_failed counter was only cleared by virtio driver. Now it is cleared by common rte_eth_stats_reset function for all drivers at once. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-12-07 04:55:31 +01:00
Bernard Iremonger	d15339b928	virtio: fix link state interrupt call rte_eth_copy_pci_info() after the RTE_PCI_DRV_INTR_LSC has been initialised. Fixes: `eeefe73f0a` ("drivers: copy PCI device info to ethdev data") Reported-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com>	2015-12-07 01:03:12 +01:00
Stephen Hemminger	43ffe8aa86	virtio: clean up Tx space checks The space check for transmit ring only needs a single conditional. I.e only need to recheck for space if there was no space in first check. This can help performance and simplifies loop. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2015-12-07 01:03:12 +01:00
Stephen Hemminger	c21835fab8	virtio: fix Rx mbuf initialization The virtio driver was not initializing all the fields in the receive mbuf. This would cause bugs where previous usage of mbuf would leave stale TCI and offload flags. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2015-12-07 01:03:12 +01:00
Jerin Jacob	4c02e453cc	eal: introduce SMP memory barriers This commit introduce rte_smp_mb(), rte_smp_wmb() and rte_smp_rmb(), in order to enable memory barriers between lcores. The patch does not provide any functional change for IA, the goal is to have infrastructure for weakly ordered machines like ARM to work on DPDK. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-11-18 22:44:01 +01:00
Bernard Iremonger	eeefe73f0a	drivers: copy PCI device info to ethdev data Use new function rte_eth_copy_pci_info. Copy device info for the following pdevs: bnx2x cxgbe e1000 enic fm10k i40e ixgbe mlx4 mlx5 virtio vmxnet3 Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-11-03 18:39:26 +01:00
Ivan Boule	b5b0467ca8	virtio: fix size of MAC address array Make the virtio PMD allocate the array of unicast MAC addresses with the maximum of entries (VIRTIO_MAX_MAC_ADDRS) that it exports. Signed-off-by: Ivan Boule <ivan.boule@6wind.com> Signed-off-by: David Marchand <david.marchand@6wind.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2015-11-03 11:40:58 +01:00
Harry van Haaren	76d4c652e0	virtio: add extended stats Add xstats() functions and statistic strings to virtio PMD. Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Maryam Tahhan <maryam.tahhan@intel.com>	2015-11-03 00:19:25 +01:00
Huawei Xie	8d8393fb18	virtio: pick simple Rx/Tx simple rx/tx func is chose when merge-able rx is disabled and user specifies single segment and no offload support. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:34:44 +01:00
Huawei Xie	c121c8d6d3	virtio: add simple Tx Bulk free of mbufs when clean used ring. Shift operation of idx could be saved if vq_free_cnt means free slots rather than free descriptors. TODO: rearrange vq data structure, pack the stats var together so that we could use one vec instruction to update all of them. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:34:03 +01:00
Huawei Xie	fc3d66212f	virtio: add vector Rx With fixed avail ring, we don't need to get desc idx from avail ring. virtio driver only has to deal with desc ring. This patch uses vector instruction to accelerate processing desc ring. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:33:43 +01:00
Huawei Xie	cab0461234	virtio: fill Rx avail ring with blank mbufs Add software RX ring in virtqueue. Add fake_mbuf in virtqueue for wraparound processing. Fill avail ring with blank mbufs in virtio_dev_vring_start Add virtio_rxtx.h header file for RTE_VIRTIO_PMD_MAX_BURST. Would move all rx/tx related declarations into this header file in future. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:32:19 +01:00
Huawei Xie	b4ae9c505f	virtio: optimize ring layout In DPDK based switching environment, mostly vhost runs on a dedicated core while virtio processing in guest VMs runs on different cores. Take RX for example, with generic implementation, for each guest buffer, a) virtio driver allocates a descriptor from free descriptor list b) modify the entry of avail ring to point to allocated descriptor c) after packet is received, free the descriptor When vhost fetches the avail ring, it need to fetch the modified L1 cache from virtio core, which is a heavy cost in current CPU implementation. This idea of this optimization is: allocate the fixed descriptor for each entry of avail ring, so avail ring will always be the same during the run. This removes L1M cache transfer from virtio core to vhost core for avail ring. (Note we couldn't avoid the cache transfer for descriptors). Besides, descriptor allocation and free operation is eliminated. This also makes vector procesing possible to further accelerate the processing. This is the layout for the avail ring(take 256 ring entries for example), with each entry pointing to the descriptor with the same index. avail idx + \| +----+----+---+-------------+------+ \| 0 \| 1 \| 2 \| ... \| 254 \| 255 \| avail ring +-+--+-+--+-+-+---------+---+--+---+ \| \| \| \| \| \| \| \| \| \| \| \| v v v \| v v +-+--+-+--+-+-+---------+---+--+---+ \| 0 \| 1 \| 2 \| ... \| 254 \| 255 \| desc ring +----+----+---+-------------+------+ \| \| +----+----+---+-------------+------+ \| 0 \| 1 \| 2 \| \| 254 \| 255 \| used ring +----+----+---+-------------+------+ \| + This is the ring layout for TX. As we need one virtio header for each xmit packet, we have 128 slots available. ++ \|\| \|\| +-----+-----+-----+--------------+------+------+------+ \| 0 \| 1 \| ... \| 127 \|\| 128 \| 129 \| ... \| 255 \| avail ring +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| \| \| \|\| \| \| \| v v v \|\| v v v +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| 128 \| 129 \| ... \| 255 \|\| 128 \| 129 \| ... \| 255 \| desc ring for virtio_net_hdr +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| \| \| \|\| \| \| \| v v v \|\| v v v +--+--+--+--+-----+---+------+---+--+---+------+--+---+ \| 0 \| 1 \| ... \| 127 \|\| 0 \| 1 \| ... \| 127 \| desc ring for tx dat +-----+-----+-----+--------------+------+------+------+ \|\| \|\| ++ Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com>	2015-11-02 15:31:42 +01:00
Changchun Ouyang	6d7740e2c1	virtio: fix deadloop after wrong config read The old code adjusts the config bytes we want to read depending on what kind of features we have, but we later cast the entire buf we read with "struct virtio_net_config", which is obviously wrong. The wrong config reading results to a dead loop at virtio_send_command() while starting testpmd. The right way to go is to read related config bytes when corresponding feature is set, which is exactly what this patch does. Fixes: `823ad64795` ("virtio: support multiple queues") Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Flavio Leitner <fbl@sysclose.org> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-10-26 21:23:53 +01:00
Stephen Hemminger	1e7bd2380f	virtio: fix Coverity unsigned warnings There are some places in virtio driver where uint16_t or int are used where it would be safer to use unsigned. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2015-10-21 16:14:02 +02:00
Stephen Hemminger	954ea11540	virtio: do not report link state feature unless available If host does not support virtio link state (like current DPDK vhost) then don't set the flag. This keeps applications from incorrectly assuming that link state is available when it is not. It also avoids useless "guess what works in the config". Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-10-21 16:12:32 +02:00
Bernard Iremonger	ce8e121870	virtio: fix crash when releasing null queue if input parameter vq is NULL, hw = vq->hw, causes a segmentation fault. Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2015-10-20 23:29:37 +02:00
Stephen Hemminger	27b31d130e	virtio: small cleanups Some minor cleanups. * pass constant to virtio_dev_queue_setup * fix message on rx_queue_setup * get rid of extra double spaces Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-07-22 10:55:26 +02:00
Stephen Hemminger	945884f14b	virtio: fix queue size and number of descriptors The virtual queue ring size and the number of slots actually usable are separate parameters. In the most common environment (QEMU) the virtual queue ring size is 256, but some environments the ring maybe much larger. The ring size comes from the host and the driver must use the actual size passed. The number of descriptors can be either zero to use the whole available ring, or some value smaller. This is used to limit the number of mbufs allocated for the receive ring. If more descriptors are requested than available the size is silently truncated. Note: the ring size (from host) must be a power of two, but the number of descriptors used can be any size from 1 to the size of the virtual ring. Fixes: `d78deadae4` ("virtio: fix ring size negotiation") Reported-by: Changchun Ouyang <changchun.ouyang@intel.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-07-22 10:33:50 +02:00
Bernard Iremonger	941d64b5bf	virtio: free queue memory when closing Add function virtio_free_queues() and call from virtio_dev_close() Use virtio_dev_rx_queue_release() and virtio_dev_tx_queue_release() Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2015-07-19 22:24:42 +02:00
Bernard Iremonger	5382b188fb	virtio: add queue release Add functions virtio_dev_queue_release(), virtio_dev_rx_queue_release() and virtio_dev_tx_queue_release(). Use queue_release in virtio_dev_uninit(). Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com>	2015-07-19 22:24:42 +02:00
Bernard Iremonger	2f7fdb9d52	virtio: check virtqueue parameter when detaching If vq is NULL, there is a segmentation fault. Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2015-07-19 22:24:42 +02:00
Bernard Iremonger	abf4c84b25	virtio: support port hotplug This patch depends on the Port Hotplug Framework. It implements the eth_dev_uninit_t() function for virtio pmd. Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2015-07-19 22:24:42 +02:00
Sergio Gonzalez Monroy	2f9d47013e	mem: move librte_malloc to eal/common Move malloc inside eal and create a new section in MAINTAINERS file for Memory Allocation in EAL. Create a dummy malloc library to avoid breaking applications that have librte_malloc in their DT_NEEDED entries. This is the first step towards using malloc to allocate memory directly from memsegs. Thus, memzones would allocate memory through malloc, allowing to free memzones. Signed-off-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com>	2015-07-16 13:44:48 +02:00
Zoltan Kiss	72514b5d55	ethdev: fix check of threshold for Tx freeing The parameter tx_free_thresh is not consistent between the drivers: some use it as rte_eth_tx_burst() requires, some release buffers when the number of free descriptors drop below this value. Let's use it as most fast-path code does, which is the latter, and update comments throughout the code to reflect that. Signed-off-by: Zoltan Kiss <zoltan.kiss@linaro.org> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-07-07 16:31:48 +02:00
Damjan Marion	9e71668b42	virtio: fix crash if CQ is not negotiated Fix NULL dereference if virtio control queue is not negotiated. Signed-off-by: Damjan Marion <damarion@cisco.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2015-06-12 14:50:06 +02:00
Stephen Hemminger	d78deadae4	virtio: fix ring size negotiation Negotiate the virtio ring size. The host may allow for very large rings but application may only want a smaller ring. Conversely, if the number of descriptors requested exceeds the virtio host queue size, then just silently use the smaller host size. This fixes issues with virtio in non-QEMU envirionments. For example Google Compute Engine allows up to 16K elements in ring. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-06-12 14:44:59 +02:00
Stephen Hemminger	4a92b67151	virtio: clarify feature bit handling Change the features from bit mask to bit number. This allows the DPDK driver to use the definitions from Linux (yes the header files already use a license compatiable with DPDK). This makes DPDK driver handle future feature bit changes. Get rid of double negative code in the feature bit intialization. Instead just have a new define with the list of feature bits implemented. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-06-12 14:43:40 +02:00
Stephen Hemminger	2704623620	virtio: do not set mac table unless negotiated Don't attempt to set the MAC address table unless the host allows it in feature negotiation. Also, don't return a value from mac_table_set since all callers ignore the return value. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-06-12 14:40:15 +02:00
Stephen Hemminger	e9e414a41a	virtio: do not enable/disable Rx modes unless supported If negotiation with host says that controlling Rx mode is not supported, then don't try. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-06-12 14:36:38 +02:00
Stephen Hemminger	4ecce8356e	virtio: remove blank lines Putting blank line between function and following conditional just wastes screen space, and makes code less obvious. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-06-12 14:35:10 +02:00
Stephen Hemminger	6c52c126f2	drivers: explicit initialization of pci drivers Upcoming drivers will need to be able to support other bus types. This is a transparent change to how struct eth_driver is initialized. It has not function or ABI layout impact, but makes adding a later bus type (Xen, Hyper-V, ...) much easier. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-06-12 11:10:10 +02:00
Bruce Richardson	6c3169a3dc	virtio: move to drivers/net/ Move virtio PMD to drivers/net directory Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-05-22 16:06:23 +02:00

... 3 4 5 6 7

314 Commits