numam-dpdk

Author	SHA1	Message	Date
Yuanhan Liu	6d890f8ab5	net/virtio: fix multiple process support The introduce of virtio 1.0 support brings yet another set of ops, badly, it's not handled correctly, that it breaks the multiple process support. The issue is the data/function pointer may vary from different processes, and the old used to do one time set (for primary process only). That said, the function pointer the secondary process saw is actually from the primary process space. Accessing it could likely result to a crash. Kudos to the last patches, we now be able to maintain those info that may vary among different process locally, meaning every process could have its own copy for each of them, with the correct value set. And this is what this patch does: - remap the PCI (IO port for legacy device and memory map for modern device) - set vtpci_ops correctly After that, multiple process would work like a charm. (At least, it passed my fuzzy test) Fixes: `b8f04520ad` ("virtio: use PCI ioport API") Fixes: `d5bbeefca8` ("virtio: introduce PCI implementation structure") Fixes: `6ba1f63b5a` ("virtio: support specification 1.0") Cc: stable@dpdk.org Reported-by: Juho Snellman <jsnell@iki.fi> Reported-by: Yaron Illouz <yaroni@radcom.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	1ca893f11d	net/virtio: store IO port info locally Like vtpci_ops, the rte_pci_ioport has to store in local memory. This is basically for the rte_pci_device field is allocated from process local memory, but not from shared memory. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	553f45932f	net/virtio: store PCI operators pointer locally We used to store the vtpci_ops at virtio_hw structure. The struct, however, is stored in shared memory. That means only one value is allowed. For the multiple process model, however, the address of vtpci_ops should be different among different processes. Take virtio PMD as example, the vtpci_ops is set by the primary process, based on its own process space. If we access that address from the secondary process, that would be an illegal memory access, A crash then might happen. To make the multiple process model work, we need store the vtpci_ops in local memory but not in a shared memory. This is what the patch does: a local virtio_hw_internal array of size RTE_MAX_ETHPORTS is allocated. This new structure is used to store all these kind of info in a non-shared memory. Current, we have: - vtpci_ops - rte_pci_ioport - virtio pci mapped memory, such as common_cfg. The later two will be done in coming patches. Later patches would also set them correctly for secondary process, so that the multiple process model could work. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	d4be35a913	net/virtio: fix wrong Rx/Tx method for secondary process If the primary enables the vector Rx/Tx path, the current code would let the secondary always choose the non vector Rx/Tx path. This results to a Rx/Tx method mismatch between primary and secondary process. Werid errors then may happen, something like: PMD: virtio_xmit_pkts() tx: virtqueue_enqueue error: -14 Fix it by choosing the correct Rx/Tx callbacks for the secondary process. That is, use vector path if it's given. Fixes: `8d8393fb18` ("virtio: pick simple Rx/Tx") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yuanhan Liu	d948f596fe	ethdev: fix port data mismatched in multiple process model Assume we have two virtio ports, 00:03.0 and 00:04.0. The first one is managed by the kernel driver, while the later one is managed by DPDK. Now we start the primary process. 00:03.0 will be skipped by DPDK virtio PMD driver (since it's being used by the kernel). 00:04.0 would be successfully initiated by DPDK virtio PMD (if nothing abnormal happens). After that, we would get a port id 0, and all the related info needed by virtio (virtio_hw) is stored at rte_eth_dev_data[0]. Then we start the secondary process. As usual, 00:03.0 will be firstly probed. It firstly tries to get a local eth_dev structure for it (by rte_eth_dev_allocate): port_id = rte_eth_dev_find_free_port(); ... eth_dev = &rte_eth_devices[port_id]; eth_dev->data = &rte_eth_dev_data[port_id]; ... return eth_dev; Since it's a first PCI device, port_id will be 0. eth_dev->data would then point to rte_eth_dev_data[0]. And here things start going wrong, as rte_eth_dev_data[0] actually stores the virtio_hw for 00:04.0. That said, in the secondary process, DPDK will continue to drive PCI device 00.03.0 (despite the fact it's been managed by kernel), with the info from PCI device 00:04.0. Which is wrong. The fix is to attach the port already registered by the primary process. That is, iterate the rte_eth_dev_data[], and get the port id who's PCI ID matches the current PCI device. This would let us maintain same port ID for the same PCI device, keeping the chance of referencing to wrong data minimal. Fixes: `af75078fec` ("first public release") Cc: stable@dpdk.org Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-17 09:20:18 +01:00
Jan Wickbom	59317cef24	vhost: allow many vhost-user ports Currently select() is used to monitor file descriptors for vhostuser ports. This limits the number of ports possible to create since the fd number is used as index in the fd_set and we have seen fds > 1023. This patch changes select() to poll(). This way we can keep an packed (pollfd) array for the fds, e.g. as many fds as the size of the array. Also see: http://dpdk.org/ml/archives/dev/2016-April/037024.html Reported-by: Patrik Andersson <patrik.r.andersson@ericsson.com> Signed-off-by: Jan Wickbom <jan.wickbom@ericsson.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Maxime Coquelin	73c8f9f69c	vhost: introduce reply ack feature REPLY_ACK features provide a generic way for QEMU to ensure both completion and success of a request. As described in vhost-user spec in QEMU repository, QEMU sets VHOST_USER_NEED_REPLY flag (bit 3) when expecting a reply_ack from the backend. Backend must reply with 0 for success or non-zero otherwise when flag is set. Currently, only VHOST_USER_SET_MEM_TABLE request implements reply_ack, in order to synchronize mapping updates. This patch enables REPLY_ACK feature generally, but only checks error code for VHOST_USER_SET_MEM_TABLE. Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yong Wang	b3bee7d87d	examples/vhost: fix lcore initialization when "TAILQ_INIT()" was added to the loop of "for (lcore_id = 0; ...)" statement, the assignment to "lcore_ids" was removed out of the loop. It changed the original initialization of "lcore_ids". Fix it by introducing two braces. Fixes: `45657a5c68` ("examples/vhost: use tailq to link vhost devices") Cc: stable@dpdk.org Signed-off-by: Yong Wang <wang.yong19@zte.com.cn> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Yong Wang	12ee45a362	examples/vhost: fix calculation of mbuf count When calculating 'nr_mbufs_per_core', 'MAX_PKT_BURST' was mutiplied twice. Fix it by removing one of them. Fixes: `bdb19b771e` ("examples/vhost: fix mbuf allocation failure") Cc: stable@dpdk.org Signed-off-by: Yong Wang <wang.yong19@zte.com.cn> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Chas Williams	1ce3c7fe14	net/vhost: emulate device start/stop behavior .dev_start()/.dev_stop() roughly corresponds to the local device's port being ready. This is different from the remote client being connected which is roughly link up or down. Emulate the device start/stop behavior by separately tracking the start/stop state to determine if we should allow packets to be queued to/from the remote client. Signed-off-by: Chas Williams <ciwillia@brocade.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:18 +01:00
Chas Williams	aed0b12930	net/vhost: fix socket file deleted on stop If you create a vhost server device, it doesn't create the actual datagram socket until you call .dev_start(). If you call .dev_stop() is also deletes those sockets. For QEMU clients, this is a problem since QEMU doesn't know how to re-attach to datagram sockets that have gone away. To fix this, register and unregister the datagram sockets during device creation and removal. Fixes: `ee584e9710` ("vhost: add driver on top of the library") Cc: stable@dpdk.org Signed-off-by: Chas Williams <ciwillia@brocade.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:17 +01:00
Yong Wang	5731d6acc7	vhost: fix memory leak In function vhost_new_device(), current code dose not free 'dev' in "i == MAX_VHOST_DEVICE" condition statements. It will lead to a memory leak. Fixes: `45ca9c6f7b` ("vhost: get rid of linked list for devices") Cc: stable@dpdk.org Signed-off-by: Yong Wang <wang.yong19@zte.com.cn> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:17 +01:00
Pierre Pfister	9edfedf5e4	net/virtio: use any layout for version 1.0 Current virtio driver advertises VERSION_1 support, but does not handle device's VERSION_1 support when sending packets (it looks for ANY_LAYOUT feature, which is absent). This patch enables 'can_push' in tx path when VERSION_1 is advertised by the device. This significantly improves small packets forwarding rate towards devices advertising VERSION_1 feature. Signed-off-by: Pierre Pfister <ppfister@cisco.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:17 +01:00
Haifeng Lin	8c33fc10f6	vhost: fix guest/host physical address mapping When reg_size < page_size the function read in rte_mem_virt2phy would not return, because host_user_addr is invalid. Fixes: `e246896178` ("vhost: get guest/host physical address mappings") Cc: stable@dpdk.org Signed-off-by: Haifeng Lin <haifeng.lin@huawei.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2017-01-17 09:20:17 +01:00
Tomasz Kulasek	1feda4d8fc	mbuf: add a function to linearize a packet This patch adds function rte_pktmbuf_linearize to let crypto PMD coalesce chained mbuf before crypto operation and extend their capabilities to support segmented mbufs when device cannot handle them natively. Included unit tests for rte_pktmbuf_linearize functionality: 1) Creates banch of segmented mbufs with different size and number of segments. 2) Fills noncontigouos mbuf with sequential values. 3) Uses rte_pktmbuf_linearize to coalesce segmented buffer into one contiguous. 4) Verifies data in linearized buffer. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-15 19:30:00 +01:00
Tiwei Bie	bb98856f24	app/testpmd: add MACsec commands Below MACsec offload commands are added: - set macsec offload <port_id> on encrypt on\|off replay-protect on\|off - set macsec offload <port_id> off - set macsec sc tx\|rx <port_id> <mac> <pi> - set macsec sa tx\|rx <port_id> <idx> <an> <pn> <key> Also update the testpmd user guide. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com> Acked-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2017-01-15 19:16:48 +01:00
Tiwei Bie	b35d309710	net/ixgbe: add MACsec offload MACsec (or LinkSec, 802.1AE) is a MAC level encryption/authentication scheme defined in IEEE 802.1AE that uses symmetric cryptography. This commit adds the MACsec offload support for ixgbe. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com> Acked-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2017-01-15 19:16:37 +01:00
Tiwei Bie	375008544b	ethdev: add MACsec capability flags If these flags are advertised by a PMD, the NIC supports the MACsec offload. The incoming MACsec traffics can be offloaded transparently after the MACsec offload is configured correctly by the application. And the application can set the PKT_TX_MACSEC flag in mbufs to enable the MACsec offload for the packets to be transmitted. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>	2017-01-15 19:16:19 +01:00
Tiwei Bie	8609a7a86c	ethdev: add MACsec event type This commit adds a below event type: - RTE_ETH_EVENT_MACSEC This event will occur when the PN counter in a MACsec connection reaches the exhaustion threshold. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>	2017-01-15 19:16:03 +01:00
Tiwei Bie	223d629f8c	mbuf: add MACsec flag Add a new Tx flag in mbuf, that can be set by applications to enable the MACsec offload for a packet to be transmitted. Signed-off-by: Tiwei Bie <tiwei.bie@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-15 19:15:51 +01:00
Bruce Richardson	0615355ffa	kvargs: make pointers in string arrays const Change the parameters of functions from const char valid[] to const char const valid[]. This additional const is needed to allow us to fix some checkpatch warnings, as well as being good programming practice. For the checkpatch warnings, if we have a set of command line args that we want to check defined as: static const char args[] = { "arg1", "arg2", NULL }; kvlist = rte_kvargs_parse(params, args); checkpatch will complain: WARNING:STATIC_CONST_CHAR_ARRAY: static const char array should probably be static const char * const Adding the additional const to the definition of the args will then trigger a compiler error in the absence of this change to the kvargs library, as we lose the const in the call to kvargs_parse. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-13 19:28:26 +01:00
Jerin Jacob	d74cbeca75	app/testpmd: fix static build link ordering By introducing explicit -lrte_pmd_ixgbe link request in testpmd Makefile,"-Wl,-lrte_pmd_ixgbe" provided twice, and linker removes the duplication by keeping only first occurrence. This moves "-Wl,-lrte_pmd_ixgbe" out of "-Wl,--whole-archive" flag and makes symbol generation totally different than previous version in case of static build. This patch fixes the static build linking order by introducing -lrte_pmd_ixgbe under the shared library config (CONFIG_RTE_BUILD_SHARED_LIB). Fixes: `425781ff5a` ("app/testpmd: add ixgbe VF management") Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2017-01-13 17:31:16 +01:00
Bruce Richardson	2d150e938b	devtools: skip capitalization check for commit prefixes The prefix in the commit title must be a valid component name and is checked in separate checks. For capitalization, just check the part after the colon. This is already done for most capitalization checks, just make the remainder consistent with this. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2017-01-13 17:03:43 +01:00
Wenfeng Liu	454a0a7009	mempool: use cache in single producer or consumer mode Currently we will check mempool flags when we put/get objects from mempool. However, this makes cache useless when mempool is SC\|SP, SC\|MP, MC\|SP cases. This patch makes cache available in above cases and improves performance. Signed-off-by: Wenfeng Liu <liuwf@arraynetworks.com.cn> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-01-13 16:38:09 +01:00
Ben Walker	b0d3e3f73b	pci: use address struct in function arguments Instead of passing domain, bus, devid, func, just pass an rte_pci_addr. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-01-12 15:55:16 +01:00
Ben Walker	22dda618c0	pci: separate detaching ethernet ports from PCI devices Attaching and detaching ethernet ports from an application is not the same thing as physically removing a PCI device, so clarify the flags indicating support. All PCI devices are assumed to be physically removable, so no flag is necessary in the PCI layer. Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2017-01-12 15:48:54 +01:00
Ben Walker	e84ad157b7	pci: unmap resources if probe fails If resources were mapped prior to probe, unmap them if probe fails. This does not handle the case where the kernel driver was forcibly unbound prior to probe. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>	2017-01-12 15:47:45 +01:00
Adrien Mazarguil	6de5c0f130	ethdev: define default item masks in flow API Leaving default pattern item mask values up for interpretation by PMDs is an undefined behavior that applications might find difficult to use in the wild. It also needlessly complicates PMD implementation. This commit addresses this by defining consistent default masks for each item type. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:54:52 +01:00
Adrien Mazarguil	f4e7c8bc84	ethdev: clarify RSS action in flow API Contrary to the current description, mbuf RSS hash result storage does not overlap with the returned MARK value (hash.fdir.lo vs. hash.fdir.hi), and both may be combined. Reflect this change by allowing testpmd to display both values simultaneously. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:54:47 +01:00
Adrien Mazarguil	ebaa064e2c	ethdev: clarify MARK and FLAG actions in flow API Both actions share the PKT_RX_FDIR mbuf flag, as a result there is no way to tell them apart. Moreover, the maximum allowed value for the MARK action may not necessarily cover the entire 32-bit space. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:54:22 +01:00
Adrien Mazarguil	a757583e55	ethdev: modify flow API error function Based on initial PMD implementations of the flow API, returning the error structure which may be NULL is useless and always discarded. Returning the error code instead appears to be much more convenient. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:53:07 +01:00
Adrien Mazarguil	bd9e3bc20f	app/testpmd: fix array bounds checks This commit addresses several obvious issues reported by Coverity with array bounds checks in functions related to the flow API. Coverity issue: 139596, 139597, 139598, 139599 Fixes: `938a184a18` ("app/testpmd: implement basic support for flow API") Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-11 16:52:20 +01:00
Neil Horman	348e470bc6	pmdinfogen: fix null dereference Coverity reports a forward null dereference from a for loop that works with a variable previously tested for null that had no error handling or condition to prevent it. Pretty obvious fix below. Coverity issue: 139593 Fixes: `98b0fdb0ff` ("pmdinfogen: add buildtools and pmdinfogen utility") Signed-off-by: Neil Horman <nhorman@tuxdriver.com>	2017-01-06 11:40:30 +01:00
Nelio Laranjeiro	86c743cf91	eal: define generic vector types Add common vector type definitions to all CPU architectures. Signed-off-by: Nelio Laranjeiro <nelio.laranjeiro@6wind.com> Acked-by: Chao Zhu <chaozhu@linux.vnet.ibm.com>	2017-01-04 21:50:19 +01:00
Thomas Monjalon	c6dab2a873	tools: move to usertools Rename tools/ into usertools/ to differentiate from buildtools/ and devtools/ while making clear these scripts are part of DPDK runtime. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Tested-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-01-04 21:17:32 +01:00
Thomas Monjalon	9a98f50e89	scripts: move to devtools The remaining scripts in the scripts/ directory are only useful to developers. That's why devtools/ is a better name. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Tested-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-01-04 21:17:32 +01:00
Thomas Monjalon	4ec6960aec	scripts: move to buildtools There is already a directory buildtools for pmdinfogen used by the build system. The scripts used in makefiles are moved here. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Tested-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-01-04 21:17:32 +01:00
John McNamara	d76a592782	doc: add required python versions Add a requirement to support both Python 2 and 3 to the DPDK Python Coding Standards and Getting started Guide. Signed-off-by: John McNamara <john.mcnamara@intel.com>	2017-01-04 21:14:09 +01:00
John McNamara	54ca545dce	make python scripts python2/3 compliant Make all the DPDK Python apps work with Python 2 or 3 to allow them to work with whatever is the system default. Signed-off-by: John McNamara <john.mcnamara@intel.com>	2017-01-04 21:13:44 +01:00
John McNamara	be05eb445c	make python scripts PEP8 compliant Make all DPDK python application compliant with the PEP8 standard to allow for consistency checking of patches and to allow further refactoring. Signed-off-by: John McNamara <john.mcnamara@intel.com>	2017-01-04 21:12:58 +01:00
Ferruh Yigit	1b80b9c854	mk: disable icc warning 188 error #188: enumerated type mixed with another type This is get when an integer assigned to an enum variable. Since this usage is common and causing many ICC compilation errors, and other compilers accept this usage. Disabling the warning. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2017-01-04 20:55:04 +01:00
Tomasz Kulasek	6b520d54eb	app/testpmd: use Tx preparation in checksum engine Since all current drivers supports Tx preparation API, it is used in csum forwarding engine by default for all drivers. Adding additional step to the csum engine costs about 3-4% of performance drop, on my setup with ixgbe driver. It's caused mostly by the need of reaccessing and modification of packet data. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-04 20:40:28 +01:00
Konstantin Ananyev	b3fc5a1ae1	net/ena: add Tx preparation Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-04 20:40:26 +01:00
Konstantin Ananyev	baf3bbae55	net/vmxnet3: add Tx preparation Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Yong Wang <yongwang@vmware.com>	2017-01-04 20:40:25 +01:00
Tomasz Kulasek	9b134aa397	net/fm10k: add Tx preparation Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-04 20:40:24 +01:00
Tomasz Kulasek	3f33e643e5	net/i40e: add Tx preparation Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-04 20:40:24 +01:00
Tomasz Kulasek	7829b8d52b	net/ixgbe: add Tx preparation Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-04 20:40:22 +01:00
Tomasz Kulasek	2b76648872	net/e1000: add Tx preparation Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2017-01-04 20:40:20 +01:00
Tomasz Kulasek	4fb7e803eb	ethdev: add Tx preparation Added API for `rte_eth_tx_prepare` uint16_t rte_eth_tx_prepare(uint8_t port_id, uint16_t queue_id, struct rte_mbuf tx_pkts, uint16_t nb_pkts) Added fields to the `struct rte_eth_desc_lim`: uint16_t nb_seg_max; /< Max number of segments per whole packet. / uint16_t nb_mtu_seg_max; /< Max number of segments per one MTU / These fields can be used to create valid packets according to the following rules: * For non-TSO packet, a single transmit packet may span up to "nb_mtu_seg_max" buffers. * For TSO packet the total number of data descriptors is "nb_seg_max", and each segment within the TSO may span up to "nb_mtu_seg_max". Added functions: int rte_validate_tx_offload(struct rte_mbuf m) to validate general requirements for tx offload set in mbuf of packet such a flag completness. In current implementation this function is called optionaly when RTE_LIBRTE_ETHDEV_DEBUG is enabled. int rte_net_intel_cksum_prepare(struct rte_mbuf m) to prepare pseudo header checksum for TSO and non-TSO tcp/udp packets before hardware tx checksum offload. - for non-TSO tcp/udp packets full pseudo-header checksum is counted and set. - for TSO the IP payload length is not included. int rte_net_intel_cksum_flags_prepare(struct rte_mbuf *m, uint64_t ol_flags) this function uses same logic as rte_net_intel_cksum_prepare, but allows application to choose which offloads should be taken into account, if full preparation is not required. PERFORMANCE TESTS ----------------- This feature was tested with modified csum engine from test-pmd. The packet checksum preparation was moved from application to Tx preparation step placed before burst. We may expect some overhead costs caused by: 1) using additional callback before burst, 2) rescanning burst, 3) additional condition checking (packet validation), 4) worse optimization (e.g. packet data access, etc.) We tested it using ixgbe Tx preparation implementation with some parts disabled to have comparable information about the impact of different parts of implementation. IMPACT: 1) For unimplemented Tx preparation callback the performance impact is negligible, 2) For packet condition check without checksum modifications (nb_segs, available offloads, etc.) is 14626628/14252168 (~2.62% drop), 3) Full support in ixgbe driver (point 2 + packet checksum initialization) is 14060924/13588094 (~3.48% drop) Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2017-01-04 20:40:15 +01:00
Olivier Matz	6d52d1d4af	ethdev: clarify extended statistics documentation Reword the API documentation of ethdev xstats. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Remy Horton <remy.horton@intel.com>	2017-01-04 19:08:21 +01:00

1 2 3 4 5 ...

6260 Commits