numam-dpdk

Author	SHA1	Message	Date
Nélio Laranjeiro	fe5fe3820e	net/mlx5: fix leak when starvation occurs The list of segments to free was wrongly manipulated ending by only freeing the first segment instead of freeing all of them. The last one still belongs to the NIC and thus should not be freed. Fixes: `a1bdb71a32` ("net/mlx5: fix crash in Rx") Reported-by: Liming Sun <lsun@mellanox.com> Signed-off-by: Nelio Laranjeiro <nelio.laranjeiro@6wind.com> Acked-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-17 19:24:51 +01:00
Remy Horton	5ebb74a12c	net/i40e: fix spelling Fixes: `da61cd0849` ("i40evf: add extended stats") Fixes: `0eedec25ea` ("i40e: clean log messages") Signed-off-by: Remy Horton <remy.horton@intel.com> Acked-by: Kevin Traynor <ktraynor@redhat.com>	2017-01-17 19:24:51 +01:00
Remy Horton	1ef3b073f7	net/i40e: fix xstats value mapping The offsets used in rte_i40evf_stats_strings for transmission statistics were wrong, returning the total byte count rather than the respective (unicast, multicast, broadcast, drop, & error) packet counts. Fixes: `da61cd0849` ("i40evf: add extended stats") Signed-off-by: Remy Horton <remy.horton@intel.com> Acked-by: Kevin Traynor <ktraynor@redhat.com>	2017-01-17 19:24:51 +01:00
Thomas Monjalon	e754c959fc	net/virtio: fix build without virtio-user When CONFIG_RTE_VIRTIO_USER is disabled (default on FreeBSD), the virtio driver cannot be compiled: librte_pmd_virtio.a(virtio_ethdev.o): In function `eth_virtio_dev_init': (.text+0x1eba): undefined reference to `virtio_user_ops' Reported-by: Andrew Rybchenko <arybchenko@solarflare.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-17 23:25:08 +01:00
Qiming Yang	1e07b4ecb1	examples/ethtool: display firmware version This patch enhances the ethtool example to support to show firmware version, in the same way that the Linux kernel ethtool does. Signed-off-by: Qiming Yang <qiming.yang@intel.com> Acked-by: Remy Horton <remy.horton@intel.com>	2017-01-17 22:34:36 +01:00
Qiming Yang	ed0dfdd0e9	net/i40e: add firmware version get This patch add a new function i40e_fw_version_get. Signed-off-by: Qiming Yang <qiming.yang@intel.com> Acked-by: Remy Horton <remy.horton@intel.com>	2017-01-17 22:34:36 +01:00
Qiming Yang	8b0b565742	net/ixgbe: add firmware version get This patch adds a new function ixgbe_fw_version_get. Signed-off-by: Qiming Yang <qiming.yang@intel.com> Acked-by: Remy Horton <remy.horton@intel.com>	2017-01-17 22:34:36 +01:00
Qiming Yang	b883c0644a	net/e1000: add firmware version get This patch adds a new function eth_igb_fw_version_get. Signed-off-by: Qiming Yang <qiming.yang@intel.com> Acked-by: Remy Horton <remy.horton@intel.com>	2017-01-17 22:34:35 +01:00
Qiming Yang	2191347120	ethdev: add firmware version get This patch adds a new API 'rte_eth_dev_fw_version_get' for fetching firmware version by a given device. Signed-off-by: Qiming Yang <qiming.yang@intel.com> Acked-by: Remy Horton <remy.horton@intel.com>	2017-01-17 22:34:35 +01:00
Olivier Matz	c1e55ed3f7	net/virtio: fix advertised Rx offload capabilities When the virtio PMD is used on top of a vhost that does not support offloads, Rx offload capabilities are still advertised by virtio_dev_info_get(). But if an application tries to start the PMD with Rx offloads enabled (rxmode.hw_ip_checksum = 1), the initialization of the device will fail with -ENOTSUP and the following log: rx ip checksum not available on this host This patch fixes the Rx offload capabilities returned by virtio_dev_info_get() to be consistent with features advertised by the host. Fixes: `96cb671193` ("net/virtio: support Rx checksum offload") Fixes: `86d59b2146` ("net/virtio: support LRO") Cc: stable@dpdk.org Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 12:06:24 +01:00
Tomasz Kulasek	faf08c8b0f	examples/performance-thread: add packet type parsing Last changes in Niantic and Fortville NIC drivers causes that vector Rx path is chosen by default in l3fwd-thread application. This path doesn't support propagation of hw packet type recognition to the packet_type field in mbuf, and packets cannot be classified properly. The approach to solve this problem is similar to the commit: `71a7e2424e` ("examples/l3fwd: fix using packet type blindly"). To use sw packet analyzer, new command line option "--parse-ptype" is introduced. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com>	2017-01-17 18:40:17 +01:00
Jasvinder Singh	d30185b7bf	examples/ip_pipeline: fix parsing of pass-through pipeline This patch fixes the configuration file parsing error when load balancing function is enabled in pass-through pipeline. error log: pipeline> [APP] Initializing PIPELINE1 ... [PIPELINE1] Pass-through Parse error in section "PIPELINE1": entry "lb" has invalid value ("hash") Fixes: `cbe82f6cfb` ("examples/ip_pipeline: add swap action in pass-through") Cc: stable@dpdk.org Signed-off-by: Jasvinder Singh <jasvinder.singh@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-01-17 18:37:41 +01:00
Sankar Chokkalingam	ac6bad59f1	examples/ip_pipeline: fix coremask limitation Issue: coremask used in IP Pipeline is limited to 64 cores. Solution: Modified coremask as an array of uint64_t to support RTE_MAX_LCORE Fixes: `7f64b9c004` ("examples/ip_pipeline: rework config file syntax") Fixes: `eb32fe7c55` ("examples/ip_pipeline: rework initialization parameters") Fixes: `b4aee0fb9c` ("examples/ip_pipeline: reconfigure thread binding dynamically") Fixes: `4e14069328` ("examples/ip_pipeline: measure CPU utilization") Signed-off-by: Sankar Chokkalingam <sankarx.chokkalingam@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-01-17 18:37:41 +01:00
Anand B Jyoti	50c644fc1f	examples/ip_pipeline: check VLAN and MPLS parameters This commit add to CLI command check for the following errors 1. SVLAN and CVLAN IDs greater than 12 bits 2. MPLS ID greater than 20 bits 3. max number of supported MPLS labels to avoid array overflow It prevents running CLI commands with invalid parameters. Signed-off-by: Anand B Jyoti <anand.b.jyoti@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2017-01-17 18:37:41 +01:00
Thomas Monjalon	45e1c8b782	examples/ip_pipeline: remove useless makefile line A dollar sign is missing and it is not needed because of VPATH. Reported-by: Ilya V. Matveychikov <matvejchikov@gmail.com> Suggested-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-17 18:21:11 +01:00
Olivier Matz	88617471b8	examples/l3fwd: rework long options parsing Avoid the use of several strncpy() since getopt is able to map a long option with an id, which can be matched in the same switch/case than short options. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-17 18:10:50 +01:00
Olivier Matz	6876790da1	examples/l2fwd: rework long options parsing Do the same than in l3fwd to avoid strcmp() for long options. For l2fwd, there is no long option that take advantage of this new mechanism as --mac-updating and --no-mac-updating are directly setting a flag without needing an entry in the switch/case. So this patch just prepares the framework in case a new long option is added in the future. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-17 18:10:43 +01:00
Yongseok Koh	f2b9988926	doc: fix links to Linux in contribution guide A referenced document in the Linux Kernel has been moved to a sub-directory. And kernel community has moved to RST/Sphinx. The links are replaced with HTML rendered links. Signed-off-by: Yongseok Koh <yskoh@mellanox.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2017-01-17 17:04:47 +01:00
Pablo de Lara	e0ef2aecde	doc: simplify l3fwd example guide L3 Forwarding sample app user guides have some inconsistencies between the example command line and the configuration table. Also, they were showing too complicated configuration, using two different NUMA nodes for two ports, which will probably lead to performance drop due to use cross-socket channel. This patch simplifies the configuration of these examples, by using a single NUMA node and a single queue per port. Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2017-01-17 17:04:47 +01:00
Yong Wang	580f8e3682	doc: fix a typo in prog guide Signed-off-by: Yong Wang <wang.yong19@zte.com.cn> Acked-by: John McNamara <john.mcnamara@intel.com>	2017-01-17 16:54:59 +01:00
Rami Rosen	81c85700a1	doc: fix a typo in proc_info guide This patch fixes a typo in proc_info guide (tools). Signed-off-by: Rami Rosen <rami.rosen@intel.com>	2017-01-17 16:54:59 +01:00
Rami Rosen	0e2db9d3cd	doc: fix a typo in testpmd guide This patch fixes a trivial typo in testpmd application guide. Signed-off-by: Rami Rosen <rami.rosen@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2017-01-17 16:54:59 +01:00
Wenzhuo Lu	dc0537e6d6	app/testpmd: fix check for invalid ports Some CLIs don't check the input port ID, it may cause segmentation fault (core dumped). Fixes: `425781ff5a` ("app/testpmd: add ixgbe VF management") Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2017-01-17 16:40:05 +01:00
Zhihong Wang	f5472703c0	eal: optimize aligned memcpy on x86 This patch optimizes rte_memcpy for well aligned cases, where both dst and src addr are aligned to maximum MOV width. It introduces a dedicated function called rte_memcpy_aligned to handle the aligned cases with simplified instruction stream. The existing rte_memcpy is renamed as rte_memcpy_generic. The selection between them 2 is done at the entry of rte_memcpy. The existing rte_memcpy is for generic cases, it handles unaligned copies and make store aligned, it even makes load aligned for micro architectures like Ivy Bridge. However alignment handling comes at a price: It adds extra load/store instructions, which can cause complications sometime. DPDK Vhost memcpy with Mergeable Rx Buffer feature as an example: The copy is aligned, and remote, and there is header write along which is also remote. In this case the memcpy instruction stream should be simplified, to reduce extra load/store, therefore reduce the probability of load/store buffer full caused pipeline stall, to let the actual memcpy instructions be issued and let H/W prefetcher goes to work as early as possible. This patch is tested on Ivy Bridge, Haswell and Skylake, it provides up to 20% gain for Virtio Vhost PVP traffic, with packet size ranging from 64 to 1500 bytes. The test can also be conducted without NIC, by setting loopback traffic between Virtio and Vhost. For example, modify the macro TXONLY_DEF_PACKET_LEN to the requested packet size in testpmd.h, rebuild and start testpmd in both host and guest, then "start" on one side and "start tx_first 32" on the other. Signed-off-by: Zhihong Wang <zhihong.wang@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com>	2017-01-17 16:40:05 +01:00
Jianfeng Tan	e2a6f1246e	examples/l3fwd-power: fix stop and close on signal As it gets killed, in SIGINT signal handler, device is not stopped and closed. In virtio's case, vector assignment in the KVM is not deassigned. This patch will invoke dev_stop() and dev_close() in signal handler. Fixes: `d7937e2e3d` ("power: initial import") Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com>	2017-01-17 09:27:16 +01:00
Jianfeng Tan	82bea46616	examples/l3fwd-power: add --parse-ptype option To support those devices that do not provide packet type info when receiving packets, add a new option, --parse-ptype, to analyze packet type in the Rx callback. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com>	2017-01-17 09:27:14 +01:00
Jianfeng Tan	9ebdeefee8	net/virtio: unmap queue/irq when closing When closing virtio devices, close eventfds, free the struct to store queue/irq mapping. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:59 +01:00
Jianfeng Tan	349a447b47	net/virtio: unbind interrupt/eventfd when stopping When virtio devices get stopped, tell the kernel to unbind the mapping between interrupts and eventfds. Note: it behaves differently from other NICs which close eventfds, free struct. In virtio, we do those things when close device in following patch. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:57 +01:00
Jianfeng Tan	26b683b4f7	net/virtio: setup Rx queue interrupts This patch mainly allocates structure to store queue/irq mapping, and configure queue/irq mapping down through PCI ops. It also creates eventfds for each Rx queue and tell the kernel about the eventfd/intr binding. Note: So far, we hard-code 1:1 queue/irq mapping (each rx queue has one exclusive interrupt), like this: vec 0 -> config irq vec 1 -> rxq0 vec 2 -> rxq1 ... which means, the "vectors" option of QEMU should be configured with a value >= N+1 (N is the number of the queue pairs). Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:54 +01:00
Jianfeng Tan	c056be239d	net/virtio: add Rx interrupt enable/disable functions This patch implements interrupt enable/disable functions for each Rx queue. And we rely on flags of avail queue as the hint for virtio device to interrupt virtio driver or not. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:52 +01:00
Jianfeng Tan	c49526acec	net/virtio: add PCI operation for queue/irq binding Add handler in virtio_pci_ops to set queue/irq bind. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:49 +01:00
Jianfeng Tan	b0caba1a13	net/virtio: add Rx descriptor check Under interrupt mode, rx_descriptor_done is used as an indicator for applications to check if some number of packets are ready to be received. This patch enables this by checking used ring's local consumed idx with shared (with backend) idx. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:47 +01:00
Jianfeng Tan	981e61f55f	net/virtio: invoke method directly for setting IRQ config We need to define a prototype for such wrapper, which makes thing too complicated. Remove wrapper and call set_config_irq directly. Suggested-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:45 +01:00
Jianfeng Tan	f229eb41ee	net/virtio: fix rewriting LSC flag The LSC flag is decided according to if VIRTIO_NET_F_STATUS feature is negotiated. Copy the PCI info after the judgement will rewrite the correct result. Fixes: `198ab33677` ("net/virtio: move device initialization in a function") CC: stable@dpdk.org Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Tested-by: Lei Yao <lei.a.yao@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:26:38 +01:00
Jianfeng Tan	be7a4707f7	net/virtio-user: enable multiqueue with kernel vhost With vhost kernel, to enable multiqueue, we need backend device in kernel support multiqueue feature. Specifically, with tap as the backend, as linux/Documentation/networking/tuntap.txt shows, we check if tap supports IFF_MULTI_QUEUE feature. And for vhost kernel, each queue pair has a vhost fd, and with a tap fd binding this vhost fd. All tap fds are set with the same tap interface name. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:24:56 +01:00
Jianfeng Tan	5e97e42025	net/virtio-user: enable offloading When used with vhost kernel backend, we can offload at both directions. - From vhost kernel to virtio_user, the offload is enabled so that DPDK app can trust the flow is checksum-correct; and if DPDK app sends it through another port, the checksum needs to be recalculated or offloaded. It also applies to TSO. - From virtio_user to vhost_kernel, the offload is enabled so that kernel can trust the flow is L4-checksum-correct, no need to verify it; if kernel will consume it, DPDK app should make sure the l3-checksum is correctly set. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:24:56 +01:00
Jianfeng Tan	e3b434818b	net/virtio-user: support kernel vhost This patch add support vhost kernel as the backend for virtio_user. Three main hook functions are added: - vhost_kernel_setup() to open char device, each vq pair needs one vhostfd; - vhost_kernel_ioctl() to communicate control messages with vhost kernel module; - vhost_kernel_enable_queue_pair() to open tap device and set it as the backend of corresonding vhost fd (that is to say, vq pair). Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:24:56 +01:00
Jianfeng Tan	33d24d65fe	net/virtio-user: abstract backend operations Add a struct virtio_user_backend_ops to abstract three kinds of backend operations: - setup, create the unix socket connection; - send_request, sync messages with backend; - enable_qp, enable some queue pair. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:23:27 +01:00
Jianfeng Tan	5526b0cbd5	net/virtio-user: move vhost-user specific code To support vhost kernel as the backend of net_virtio_user in coming patches, we move vhost_user specific structs and macros into vhost_user.c, and only keep common definitions in vhost.h. Besides, remove VHOST_USER_MQ feature check. Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:23:27 +01:00
Jianfeng Tan	c12a26ee20	net/virtio-user: fix not properly reset device virtio_user is not properly reset when users call vtpci_reset(), as it ignores VIRTIO_CONFIG_STATUS_RESET status in virtio_user_set_status(). This might lead to initialization failure as it starts to re-init the device before sending RESET messege to backend. Besides, previous callfds and kickfds are not closed. To fix it, we add support to disable virtqueues when it's set to DRIVER OK status, and re-init fields in struct virtio_user_dev. Fixes: `e9efa4d938` ("net/virtio-user: add new virtual PCI driver") Fixes: `37a7eb2ae8` ("net/virtio-user: add device emulation layer") Cc: stable@dpdk.org Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:23:27 +01:00
Jianfeng Tan	142678d429	net/virtio-user: fix wrongly get/set features Before the commit `86d59b2146` ("net/virtio: support LRO"), features in virtio PMD, is decided and properly set at device initialization and will not be changed. But afterward, features could be changed in virtio_dev_configure(), and will be re-negotiated if it's changed. In virtio-user, device features is obtained at driver probe phase only once, but we did not store it. So the added feature bits in re-negotiation will fail. To fix it, we store it down, and will be used to feature negotiation either at device initialization phase or device configure phase. Fixes: `e9efa4d938` ("net/virtio-user: add new virtual PCI driver") Cc: stable@dpdk.org Signed-off-by: Jianfeng Tan <jianfeng.tan@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:23:27 +01:00
Yuanhan Liu	9470427c88	net/virtio: do not store PCI device pointer at shared memory hw->dev, a pointer to pci_dev, was actually not used, until the refactor of decouping from PCI device. This would somehow break the multiple process again, since "hw" is stored at shared memory, while "pci_dev" is not: the primary and secondary process could have different address for it, while just one value is allowed. Thus we should not store it to "hw", instead, we could retrieve it from the "eth_dev->device" field. Fixes: `ae34410a8a` ("ethdev: move info filling of PCI into drivers") Fixes: `eac901ce29` ("ethdev: decouple from PCI device") Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:23:27 +01:00
Yuanhan Liu	61e3ee1756	net/virtio: access interrupt handler directly Since commit `0e1b45a284` ("ethdev: decouple interrupt handling from PCI device"), intr_handle is stored at eth_dev struct, that we could use it directly. Thus there is no need to get it from hw. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	6d890f8ab5	net/virtio: fix multiple process support The introduce of virtio 1.0 support brings yet another set of ops, badly, it's not handled correctly, that it breaks the multiple process support. The issue is the data/function pointer may vary from different processes, and the old used to do one time set (for primary process only). That said, the function pointer the secondary process saw is actually from the primary process space. Accessing it could likely result to a crash. Kudos to the last patches, we now be able to maintain those info that may vary among different process locally, meaning every process could have its own copy for each of them, with the correct value set. And this is what this patch does: - remap the PCI (IO port for legacy device and memory map for modern device) - set vtpci_ops correctly After that, multiple process would work like a charm. (At least, it passed my fuzzy test) Fixes: `b8f04520ad` ("virtio: use PCI ioport API") Fixes: `d5bbeefca8` ("virtio: introduce PCI implementation structure") Fixes: `6ba1f63b5a` ("virtio: support specification 1.0") Cc: stable@dpdk.org Reported-by: Juho Snellman <jsnell@iki.fi> Reported-by: Yaron Illouz <yaroni@radcom.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	1ca893f11d	net/virtio: store IO port info locally Like vtpci_ops, the rte_pci_ioport has to store in local memory. This is basically for the rte_pci_device field is allocated from process local memory, but not from shared memory. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	553f45932f	net/virtio: store PCI operators pointer locally We used to store the vtpci_ops at virtio_hw structure. The struct, however, is stored in shared memory. That means only one value is allowed. For the multiple process model, however, the address of vtpci_ops should be different among different processes. Take virtio PMD as example, the vtpci_ops is set by the primary process, based on its own process space. If we access that address from the secondary process, that would be an illegal memory access, A crash then might happen. To make the multiple process model work, we need store the vtpci_ops in local memory but not in a shared memory. This is what the patch does: a local virtio_hw_internal array of size RTE_MAX_ETHPORTS is allocated. This new structure is used to store all these kind of info in a non-shared memory. Current, we have: - vtpci_ops - rte_pci_ioport - virtio pci mapped memory, such as common_cfg. The later two will be done in coming patches. Later patches would also set them correctly for secondary process, so that the multiple process model could work. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	d4be35a913	net/virtio: fix wrong Rx/Tx method for secondary process If the primary enables the vector Rx/Tx path, the current code would let the secondary always choose the non vector Rx/Tx path. This results to a Rx/Tx method mismatch between primary and secondary process. Werid errors then may happen, something like: PMD: virtio_xmit_pkts() tx: virtqueue_enqueue error: -14 Fix it by choosing the correct Rx/Tx callbacks for the secondary process. That is, use vector path if it's given. Fixes: `8d8393fb18` ("virtio: pick simple Rx/Tx") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	d948f596fe	ethdev: fix port data mismatched in multiple process model Assume we have two virtio ports, 00:03.0 and 00:04.0. The first one is managed by the kernel driver, while the later one is managed by DPDK. Now we start the primary process. 00:03.0 will be skipped by DPDK virtio PMD driver (since it's being used by the kernel). 00:04.0 would be successfully initiated by DPDK virtio PMD (if nothing abnormal happens). After that, we would get a port id 0, and all the related info needed by virtio (virtio_hw) is stored at rte_eth_dev_data[0]. Then we start the secondary process. As usual, 00:03.0 will be firstly probed. It firstly tries to get a local eth_dev structure for it (by rte_eth_dev_allocate): port_id = rte_eth_dev_find_free_port(); ... eth_dev = &rte_eth_devices[port_id]; eth_dev->data = &rte_eth_dev_data[port_id]; ... return eth_dev; Since it's a first PCI device, port_id will be 0. eth_dev->data would then point to rte_eth_dev_data[0]. And here things start going wrong, as rte_eth_dev_data[0] actually stores the virtio_hw for 00:04.0. That said, in the secondary process, DPDK will continue to drive PCI device 00.03.0 (despite the fact it's been managed by kernel), with the info from PCI device 00:04.0. Which is wrong. The fix is to attach the port already registered by the primary process. That is, iterate the rte_eth_dev_data[], and get the port id who's PCI ID matches the current PCI device. This would let us maintain same port ID for the same PCI device, keeping the chance of referencing to wrong data minimal. Fixes: `af75078fec` ("first public release") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-17 09:20:18 +01:00
Jan Wickbom	59317cef24	vhost: allow many vhost-user ports Currently select() is used to monitor file descriptors for vhostuser ports. This limits the number of ports possible to create since the fd number is used as index in the fd_set and we have seen fds > 1023. This patch changes select() to poll(). This way we can keep an packed (pollfd) array for the fds, e.g. as many fds as the size of the array. Also see: http://dpdk.org/ml/archives/dev/2016-April/037024.html Reported-by: Patrik Andersson <patrik.r.andersson@ericsson.com> Signed-off-by: Jan Wickbom <jan.wickbom@ericsson.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Maxime Coquelin	73c8f9f69c	vhost: introduce reply ack feature REPLY_ACK features provide a generic way for QEMU to ensure both completion and success of a request. As described in vhost-user spec in QEMU repository, QEMU sets VHOST_USER_NEED_REPLY flag (bit 3) when expecting a reply_ack from the backend. Backend must reply with 0 for success or non-zero otherwise when flag is set. Currently, only VHOST_USER_SET_MEM_TABLE request implements reply_ack, in order to synchronize mapping updates. This patch enables REPLY_ACK feature generally, but only checks error code for VHOST_USER_SET_MEM_TABLE. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00

1 2 3 4 5 ...

6303 Commits