numam-dpdk

Author	SHA1	Message	Date
Jasvinder Singh	fa11a8a725	port: fix crash for ring writer nodrop Error log: [APP] Initializing PIPELINE0 ... pipeline> [APP] Initializing PIPELINE1 ... [PIPELINE1] Pass-through [APP] Initializing PIPELINE2 ... [PIPELINE2] Pass-through Segmentation fault (core dumped) Fixes: `5f4cd47309` ("port: add ring writer nodrop") Fixes: `d58f69c541` ("port: add ring multi reader or writer") Signed-off-by: Jasvinder Singh <jasvinder.singh@intel.com>	2016-03-07 11:52:39 +01:00
Jasvinder Singh	04f366906a	port: fix crash for ethdev writer nodrop Error log: [APP] Initializing PIPELINE0 ... pipeline> [APP] Initializing PIPELINE1 ... [PIPELINE1] Pass-through Segmentation fault (core dumped) Fixes: `304c8091e9` ("port: add ethdev writer nodrop") Signed-off-by: Jasvinder Singh <jasvinder.singh@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-03-07 11:52:14 +01:00
Jerin Jacob	716bf82080	eal/arm: check support of armv8.1 atomics armv8.1 adds support for new atomic instructions. From Linux kernel v4.3 onwards, the presence of atomic instruction support can be detected through HWCAP_ATOMICS. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Reviewed-by: Jan Viktorin <viktorin@rehivetech.com>	2016-03-05 19:46:50 +01:00
Thomas Monjalon	f9f7c949ff	config: remove EAL flags for OS environment CONFIG_RTE_LIBRTE_EAL_APP can be replaced by CONFIG_RTE_EXEC_ENV_APP. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Keith Wiles <keith.wiles@intel.com>	2016-03-05 11:09:31 +01:00
Jingjing Wu	1409f127d7	ethdev: fix byte order consistency of flow director Fix a byte order issue in the ethdev library: the structures for setting fdir's mask and flow entry were inconsistent. Mask inputs are now in big endian. Fixes: `2d4c1a9ea2` ("ethdev: add new flow director masks") Fixes: `76c6f89e80` ("ixgbe: support new flow director masks") Reported-by: Yaacov Hazan <yaacovh@mellanox.com> Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Zhe Tao <zhe.tao@intel.com> Acked-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2016-03-04 16:50:58 +01:00
Bruce Richardson	5ecdeba601	lpm: merge tbl24 and tbl8 structures The tbl8 and tbl24 structures were essentially identical except for slightly different names for one or two fields. Merge these two structures into a single structure definition. Two fields have been renamed as part of this change: the "ext_entry" field in the tbl24 has been renamed to "valid_group" to match the tbl8 value to make the merge easier, and the "tbl8_gindex" field has been renamed to "group_idx". The "valid_group" field now serves two purposes: in a tbl8 it indicates if the group, i.e. the tbl8, is valid, and in a tbl24, it indicates if the "group_idx" is valid, i.e. whether the value is a next_hop or a tbl8 index. [The name "group_idx" was used to make this latter link between the fields clearer] Suggested-by: Vladimir Medvedkin <medvedkinv@gmail.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2016-03-04 16:01:15 +01:00
Ravi Kerur	d6b324c00f	mbuf: get DMA address Macros RTE_MBUF_DATA_DMA_ADDR and RTE_MBUF_DATA_DMA_ADDR_DEFAULT are defined in each PMD driver file. Convert macros to inline functions and move them to common lib/librte_mbuf/rte_mbuf.h file. PMD drivers include rte_mbuf.h file directly/indirectly hence no additional header file inclusion is necessary. Signed-off-by: Ravi Kerur <rkerur@gmail.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2016-03-04 16:01:15 +01:00
Marc Sune	ac78cb27a8	cmdline: fix missing include cmdline_parse_*.h headers use struct cmdline_token_hdr / cmdline_parse_token_hdr_t which is defined in cmdline_parse.h, but do not include it, forcing manual inclusion. This commit includes cmdline_parse.h in all cmdline_parse_*.h. Signed-off-by: Marc Sune <marcdevel@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2016-03-04 15:31:16 +01:00
Nelio Laranjeiro	a9963a86b2	ethdev: increase RETA entry size Several NICs can handle 512 entries/queues in their RETA table, an 8 bit field is not large enough for them. Signed-off-by: Nelio Laranjeiro <nelio.laranjeiro@6wind.com> Acked-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2016-03-03 20:39:47 +01:00
Nelio Laranjeiro	fb76dd26a3	cmdline: increase command line buffer Allow long command lines in testpmd (like flow director with IPv6, ...). Signed-off-by: John McNamara <john.mcnamara@intel.com> Signed-off-by: Nelio Laranjeiro <nelio.laranjeiro@6wind.com> Acked-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2016-03-03 20:39:47 +01:00
Ralf Hoffmann	a5f6b5ddca	eal/linux: change hugepage sorting to avoid overlapping memcpy with only one hugepage or already sorted hugepage addresses, the sort function called memcpy with same src and dst pointer. Debugging with valgrind will issue a warning about overlapping area. This patch changes the sort method to qsort to avoid this behavior. The separate sort function is no longer necessary. Suggested-by: Jay Rolette <rolette@infiniteio.com> Signed-off-by: Ralf Hoffmann <ralf.hoffmann@allegro-packets.com> Acked-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com>	2016-03-03 11:36:32 +01:00
Yi Lu	3560681d68	eal/linux: fix build with hpet Fix compile error when enable CONFIG_RTE_LIBEAL_USE_HPET. Error messages: lib/librte_eal/linuxapp/eal/eal_timer.c: In function ‘rte_eal_hpet_init’: lib/librte_eal/linuxapp/eal/eal_timer.c:222:2: error: implicit declaration of function ‘rte_thread_setname’ Fixes: `badb3688ff` ("eal/linux: fix build with glibc < 2.12") Signed-off-by: Yi Lu <luyi68@live.com> Acked-by: David Marchand <david.marchand@6wind.com>	2016-03-03 11:36:32 +01:00
Thomas Monjalon	21e10f983e	eal: fix symbol map version number The version 2.3 has been renamed 16.04. Fixes: `6d7de6d2e3` ("version: switch to year.month numbers") Reported-by: Panu Matilainen <pmatilai@redhat.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-03-03 11:36:32 +01:00
Panu Matilainen	948fd64bef	mk: replace the combined library with a linker script The physically linked-together combined library has been an increasing source of problems, as was predicted when library and symbol versioning was introduced. Replace the complex and fragile construction with a simple linker script which achieves the same without all the problems, remove the related kludges from eg mlx drivers. Since creating the linker script is practically zero cost, remove the config option and just create it always. Based on a patch by Sergio Gonzales Monroy, linker script approach initially suggested by Neil Horman. Suggested-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com> Suggested-by: Neil Horman <nhorman@tuxdriver.com> Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-03-01 14:37:27 +01:00
Didier Pallard	9792848c65	hash: fix CRC32c computation Fix crc32c hash functions to return a valid crc32c value for data lengths not multiple of 4 bytes. ARM code is not tested. Fixes: `af75078fec` ("first public release") Signed-off-by: Didier Pallard <didier.pallard@6wind.com> Acked-by: David Marchand <david.marchand@6wind.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2016-03-01 14:37:26 +01:00
Huawei Xie	9ec201f5d6	mbuf: provide bulk allocation rte_pktmbuf_alloc_bulk allocates a bulk of packet mbufs. There is related thread about this bulk API. http://dpdk.org/dev/patchwork/patch/4718/ Thanks to Konstantin's loop unrolling. Attached the wiki page about duff's device. It explains the performance optimization through loop unwinding, and also the most dramatic use of case label fall-through. https://en.wikipedia.org/wiki/Duff%27s_device In this implementation, while() loop is used because we could not assume count is strictly positive. Using while() loop saves one line of check. Signed-off-by: Gerald Rogers <gerald.rogers@intel.com> Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2016-02-29 17:26:38 +01:00
Yuanhan Liu	bb66588304	vhost: broadcast RARP by injecting in receiving mbuf array Broadcast RARP packet by injecting it to receiving mbuf array at rte_vhost_dequeue_burst(). Commit `33226236a3` ("vhost: handle request to send RARP") iterates all host interfaces and then broadcast it by all of them. It did notify the switches about the new location of the migrated VM, however, the mac learning table in the target host is wrong (at least in my test with OVS): $ ovs-appctl fdb/show ovsbr0 port VLAN MAC Age 1 0 b6:3c:72:71:cd:4d 10 LOCAL 0 b6:3c:72:71:cd:4e 10 LOCAL 0 52:54:00:12:34:68 9 1 0 56:f6:64:2c:bc:c0 1 Where 52:54:00:12:34:68 is the mac of the VM. As you can see from the above, the port learned is "LOCAL", which is the "ovsbr0" port. That is reasonable, since we indeed send the pkt by the "ovsbr0" interface. The wrong mac table lead all the packets to the VM go to the "ovsbr0" in the end, which ends up with all packets being lost, until the guest send a ARP quest (or reply) to refresh the mac learning table. Jianfeng then came up with a solution I have thought of firstly but NAKed by myself, concerning it has potential issues [0]. The solution is as title stated: broadcast the RARP packet by injecting it to the receiving mbuf arrays at rte_vhost_dequeue_burst(). The re-bring of that idea made me think it twice; it looked like a false concern to me then. And I had done a rough verification: it worked as expected. [0]: http://dpdk.org/ml/archives/dev/2016-February/033527.html Another note is that while preparing this version, I found that DPDK has some ARP related structures and macros defined. So, use them instead of the one from standard header files here. Cc: Thibaut Collet <thibaut.collet@6wind.com> Suggested-by: Jianfeng Tan <jianfeng.tan@intel.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-29 16:55:30 +01:00
Stephen Hemminger	726da47b20	log: add missing symbols rte_get_log_type and rte_get_log_level functions has been available for many versions. But they are missing from the shared library map and therefore do not get exported correctly. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Neil Horman <nhorman@tuxdriver.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-29 16:06:11 +01:00
Rich Lane	c2189745c3	cfgfile: support looking up sections by index This is useful when sections have duplicate names. Signed-off-by: Rich Lane <rich.lane@bigswitch.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-02-29 11:28:31 +01:00
Marcin Kerlin	930cd79735	jobstats: add abort function This patch adds new function rte_jobstats_abort. It marks the job as finished, and the time of this work will be added to management time instead of execution time. This function should be used instead of rte_jobstats_finish if a condition occurs; the condition is defined by the application, for example when receiving n>0 packets. Example of usage is added to the example l2fwd-jobstats. At maximum load the do-while loop inside the Idle job will be executed once because one or more jobs are waiting to be executed, so this time should not be included as execution time; this is done by calling rte_jobstats_abort(). Signed-off-by: Marcin Kerlin <marcinx.kerlin@intel.com> Acked-by: Fan Zhang <roy.fan.zhang@intel.com>	2016-02-29 11:22:53 +01:00
Reshma Pattan	d505ba80a1	ethdev: support unidirectional configuration User should be able to configure ethdev with zero rx/tx queues, but both should not be zero. After above change, rte_eth_dev_tx_queue_config, rte_eth_dev_rx_queue_config should allocate memory for rx/tx queues only when number of rx/tx queues are nonzero. Signed-off-by: Reshma Pattan <reshma.pattan@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-02-24 19:15:28 +01:00
Reshma Pattan	dc309365ab	cryptodev: allow full control from secondary process Macro RTE_PROC_PRIMARY_OR_ERR_RET blocking the secondary process from API usage. API access should be given to both secondary and primary. Signed-off-by: Reshma Pattan <reshma.pattan@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-02-24 19:15:26 +01:00
Reshma Pattan	525e478f5e	ethdev: allow full control from secondary process Macros RTE_PROC_PRIMARY_OR_ERR_RET and RTE_PROC_PRIMARY_OR_RET are blocking the secondary process from using the APIs. API access should be given to both secondary and primary. Reported-by: Sean Harte <sean.harte@intel.com> Signed-off-by: Reshma Pattan <reshma.pattan@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-02-24 19:15:22 +01:00
Santosh Shukla	c316ed45bd	vfio: support PCI ioport Include vfio map/rd/wr support for pci ioport. Signed-off-by: Santosh Shukla <sshukla@mvista.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: David Marchand <david.marchand@6wind.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-24 11:44:55 +01:00
Santosh Shukla	c5d8315f97	vfio: ignore mapping for ioport region vfio_pci_mmap() try to map all pci bars. ioport region are not mapped in vfio/kernel so ignore mmaping for ioport. Signed-off-by: Santosh Shukla <sshukla@mvista.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-24 11:44:55 +01:00
Santosh Shukla	0291476ae3	eal/linux: never check iopl for arm iopl() syscall not supported in linux-arm/arm64 so always return 0 value. Suggested-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Santosh Shukla <sshukla@mvista.com> Acked-by: Jan Viktorin <viktorin@rehivetech.com> Acked-by: David Marchand <david.marchand@6wind.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-24 11:44:55 +01:00
Thomas Monjalon	43cb19a526	mbuf_offload: fix header for C++ When built in a C++ application, the include fails for 2 reasons: rte_mbuf_offload.h:128:24: error: invalid conversion from ‘void*’ to ‘rte_pktmbuf_offload_pool_private*’ [-fpermissive] rte_mempool_get_priv(mpool); ^ The cast must be explicit for C++. rte_mbuf_offload.h:304:1: error: expected declaration before ‘}’ token There was a closing brace for __cplusplus but not an opening one. Fixes: `78c8709b5d` ("mbuf_offload: introduce library to attach offloads to mbuf") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-21 11:47:57 +01:00
Thomas Monjalon	fa2f06b70e	hash: fix header for C++ When built in a C++ application, the jhash include fails: rte_jhash.h:123:22: error: invalid conversion from ‘const void*’ to ‘const uint32_t*’ [-fpermissive] const uint32_t *k = key; ^ The cast must be explicit for C++. Fixes: `8718219a87` ("hash: add new jhash functions") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2016-02-21 11:47:57 +01:00
Thomas Monjalon	1a8dbad49a	eal: fix keep alive header for C++ When built in a C++ application, the keepalive include fails: rte_keepalive.h:142:41: error: ‘ALIVE’ was not declared in this scope keepcfg->state_flags[rte_lcore_id()] = ALIVE; ^ C++ requires to use a scope operator to access an enum inside a struct. There was also a namespace issue for the values (no RTE prefix). The solution is to move the struct and related code out of the header file. Fixes: `75583b0d1e` ("eal: add keep alive monitoring") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Remy Horton <remy.horton@intel.com>	2016-02-21 11:46:48 +01:00
Pavel Fedin	2f29ce885a	vhost: check memory map before address translation Malfunctioning virtio clients may not send VHOST_USER_SET_MEM_TABLE for some reason. This causes NULL dereference in qva_to_vva(). Signed-off-by: Pavel Fedin <p.fedin@samsung.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-21 11:17:48 +01:00
Rich Lane	a90ca1a12e	vhost: remove device operations pointers The vhost_net_device_ops indirection is unnecessary because there is only one implementation of the vhost common code. Removing it makes the code more readable. Signed-off-by: Rich Lane <rich.lane@bigswitch.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-19 19:33:31 +01:00
Olivier Matz	86f36ff957	mempool: fix leak when creation fails Since commits `ff909fe21f` and `4e32101f9b`, it is now possible to free memzones and rings. The rte_mempool_create() should be modified to take advantage of this and not leak memory when an allocation fails. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2016-02-19 16:17:45 +01:00
Rich Lane	ca67ed289a	vhost: fix leak of fds and mmaps The common vhost code only supported a single mmap per device. vhost-user worked around this by saving the address/length/fd of each mmap after the end of the rte_virtio_memory struct. This only works if the vhost-user code frees dev->mem, since the common code is unaware of the extra info. The VHOST_USER_RESET_OWNER message is one situation where the common code frees dev->mem and leaks the fds and mappings. This happens every time I shut down a VM. The new code calls back into the implementation (vhost-user or vhost-cuse) to clean up these resources. The vhost-cuse changes are only compile tested. Signed-off-by: Rich Lane <rich.lane@bigswitch.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-19 16:13:32 +01:00
Yuanhan Liu	d22929db97	vhost: remove duplicate header include unistd.h has been included twice; remove one. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-19 16:00:03 +01:00
Yuanhan Liu	d639996a74	vhost: enable log_shmfd protocol feature To claim that we support vhost-user live migration support: SET_LOG_BASE request will be send only when this feature flag is set. Besides this flag, we actually need another feature flag set to make vhost-user live migration work: VHOST_F_LOG_ALL. Which, however, has been enabled long time ago. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Pavel Fedin <p.fedin@samsung.com>	2016-02-19 15:53:38 +01:00
Yuanhan Liu	33226236a3	vhost: handle request to send RARP While in former patch we enabled GUEST_ANNOUNCE feature, so that the guest OS will broadcast a GARP message after migration to notify the switch about the new location of migrated VM, the thing is that GUEST_ANNOUNCE is enabled since kernel v3.5 only. For older kernel, VHOST_USER_SEND_RARP request comes to rescue. The payload of this new request is the mac address of the migrated VM, with that, we could construct a RARP message, and then broadcast it to host interfaces. That's how this patch works: - list all interfaces, with the help of SIOCGIFCONF ioctl command - construct an RARP message and broadcast it Cc: Thibaut Collet <thibaut.collet@6wind.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-19 15:49:02 +01:00
Yuanhan Liu	d293dac8f3	vhost: claim support of guest announce It's actually a feature already enabled in Linux kernel (since v3.5). What we need to do is simply to claim that we support such feature, and nothing else. With that, the guest will send an ARP message after live migration to notify the switches about the new location of migrated VM. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Pavel Fedin <p.fedin@samsung.com>	2016-02-19 15:47:20 +01:00
Yuanhan Liu	699e3577e6	vhost: log vring desc buffer changes Every time we copy a buf to vring desc, we need to log it. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Signed-off-by: Victor Kaplansky <victork@redhat.com> Tested-by: Pavel Fedin <p.fedin@samsung.com>	2016-02-19 15:46:46 +01:00
Yuanhan Liu	b171fad1ff	vhost: log used vring changes Introduce vhost_log_write() helper function to log the dirty pages we touched. Page size is hard-coded to 4096 (VHOST_LOG_PAGE), and each page is represented by 1 bit. Therefore, vhost_log_write() simply finds the right bit for the related page we are going to change, and sets it to 1. dev->log_base denotes the start of the dirty page bitmap. Every time we update virtio used ring, we need to log it. And it's been done by a new vhost_log_write() wrapper, vhost_log_used_vring(). Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Signed-off-by: Victor Kaplansky <victork@redhat.com> Tested-by: Pavel Fedin <p.fedin@samsung.com>	2016-02-19 15:44:13 +01:00
Yuanhan Liu	54f9e32305	vhost: handle dirty pages logging request VHOST_USER_SET_LOG_BASE request is used to tell the backend (dpdk vhost-user) where we should log dirty pages, and how big the log buffer is. This request introduces a new payload: typedef struct VhostUserLog { uint64_t mmap_size; uint64_t mmap_offset; } VhostUserLog; Also, a fd is delivered from QEMU by ancillary data. With those info given, an area of memory is mmaped, assigned to dev->log_base, for logging dirty pages. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Signed-off-by: Victor Kaplansky <victork@redhat.com> Tested-by: Pavel Fedin <p.fedin@samsung.com>	2016-02-19 15:42:54 +01:00
Panu Matilainen	f1fe8388d5	vhost: fix build dependency Commit `d0cf91303d` added dependency on librte_net headers to vhost but did not add this to the Makefile, which makes builds non-deterministic. Curiously it is non-parallel build that is consistently broken by this missing dependency, usually it's the other way around, but trying to build without -j(n) fails with: lib/librte_vhost/vhost_rxtx.c:41:20: fatal error: rte_ip.h: No such file or directory Fixes: `d0cf91303d` ("vhost: add Tx offload capabilities") Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-18 20:25:15 +01:00
Jijiang Liu	859b480d5a	vhost: add guest offload setting Add guest offload setting in vhost lib. Virtio 1.0 spec (5.1.6.4 Processing of Incoming Packets) says: 1. If the VIRTIO_NET_F_GUEST_CSUM feature was negotiated, the VIRTIO_NET_HDR_F_NEEDS_CSUM bit in flags can be set: if so, the packet checksum at offset csum_offset from csum_start and any preceding checksums have been validated. The checksum on the packet is incomplete and csum_start and csum_offset indicate how to calculate it (see Packet Transmission point 1). 2. If the VIRTIO_NET_F_GUEST_TSO4, TSO6 or UFO options were negotiated, then gso_type MAY be something other than VIRTIO_NET_HDR_GSO_NONE, and gso_size field indicates the desired MSS (see Packet Transmission point 2). In order to support these features, the following changes are added, 1. Extend 'VHOST_SUPPORTED_FEATURES' macro to add the offload features negotiation. 2. Enqueue these offloads: convert some fields in mbuf to the fields in virtio_net_hdr. There are more explanations for the implementation. For VM2VM case, there is no need to do checksum, for we think the data should be reliable enough, and setting VIRTIO_NET_HDR_F_NEEDS_CSUM at RX side will let the TCP layer to bypass the checksum validation, so that the RX side could receive the packet in the end. In terms of us-vhost, at vhost RX side, the offload information is inherited from mbuf, which is in turn inherited from TX side. If we can still get those info at RX side, it means the packet is from another VM at same host. So, it's safe to set the VIRTIO_NET_HDR_F_NEEDS_CSUM, to skip checksum validation. Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-17 22:56:44 +01:00
Jijiang Liu	d0cf91303d	vhost: add Tx offload capabilities Add vhost TX offload (CSUM and TSO) support capabilities in vhost lib. In order to support these features, the following changes are added, 1. Extend 'VHOST_SUPPORTED_FEATURES' macro to add the offload features negotiation. 2. Dequeue TX offload: convert the fields in virtio_net_hdr to the related fields in mbuf. Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-17 22:56:44 +01:00
David Marchand	756ce64b1e	eal: introduce PCI ioport API Most of the code is inspired on virtio driver. rte_pci_ioport structure is filled at map time with anything needed for later read / write calls. At the moment, base field is used to store a x86 ioport (uint16_t) and will be reused for other arches. Signed-off-by: David Marchand <david.marchand@6wind.com> Tested-by: Santosh Shukla <sshukla@mvista.com> Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-16 22:55:44 +01:00
Thomas Monjalon	0972d7c22b	eal: remove compiler optimization workaround The compiler optimization was disabled a long time ago without describing what was the exact issue. Maybe it does not apply anymore. As it looks unneeded, let's remove this strange pragma. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	9369dcb7a6	eal/ppc: adapt CPU flags check to the arch The structure feature_entry does not need leaf/subleaf which were copied from x86 CPUID implementation. On x86, a valid flag is detected with the non-zero leaf value. This check is replaced by a check with a dummy "none" register. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	5851aa9171	eal/arm: adapt CPU flags check to the arch The structure feature_entry does not need leaf/subleaf which were copied from x86 CPUID implementation. On x86, a valid flag is detected with the non-zero leaf value. This check is replaced by a check with a dummy "none" register. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Tested-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	ba560ac30c	eal: move CPU flag functions out of headers The patch `c344eab3ee` has moved the hardware definition of CPU flags. Now the functions checking these hardware flags are also moved. The function rte_cpu_get_flag_enabled() is no more inline. The benefits are: - remove rte_cpu_feature_table from the ABI (recently added) - hide hardware details from the API - allow to adapt structures per arch (done in next patch) Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	9f8faed956	eal: get CPU flag name The new function rte_cpu_get_flag_name() is added to the EAL API. It is implemented (duplicated) in each arch because the next patch will remove the public exposure of the feature tables. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 08:28:00 +01:00
Jerin Jacob	ab3af0959d	eal: introduce non-temporal prefetch non-temporal/transient/stream version of rte_prefetch0() The non-temporal prefetch is intended as a prefetch hint that processor will use the prefetched data only once or short period, unlike the rte_prefetch0() function which imply that prefetched data to use repeatedly. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Jan Viktorin <viktorin@rehivetech.com>	2016-02-16 07:19:19 +01:00
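
Note on commit b171fad1ff ("vhost: log used vring changes"): the message describes a dirty page bitmap with a fixed 4096-byte page size and one bit per page, starting at dev->log_base. The following is a minimal sketch of that idea only, not the DPDK code. The names log_page() and log_write() and the LOG_PAGE_SIZE constant are hypothetical, and the bit order within a byte is an assumption. Feature checks and memory ordering, which a real backend would need, are left out.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Page size as stated in the commit message (4096 bytes). */
#define LOG_PAGE_SIZE 4096ULL

/* Hypothetical helper: set the bit for one page in the bitmap.
 * One bit per page; bit order within a byte is an assumption. */
static void log_page(uint8_t *log_base, uint64_t page)
{
    log_base[page / 8] |= (uint8_t)(1u << (page % 8));
}

/* Hypothetical helper: mark every page overlapped by the byte range
 * [addr, addr + len) as dirty. The caller must ensure the bitmap is
 * large enough to cover the last page touched. */
static void log_write(uint8_t *log_base, uint64_t addr, uint64_t len)
{
    uint64_t page, last;

    assert(log_base != NULL);
    if (len == 0)
        return;

    page = addr / LOG_PAGE_SIZE;
    last = (addr + len - 1) / LOG_PAGE_SIZE;
    for (; page <= last; page++)
        log_page(log_base, page);
}
```

With a zeroed bitmap, a 10-byte write at address 4090 straddles pages 0 and 1, so both bits get set. A write that stays inside one page sets a single bit.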


2133 Commits