Commit Graph

2820 Commits

Zbigniew Bodek
169ca3db55 crypto/armv8: add PMD optimized for ARMv8 processors
This patch introduces a crypto poll mode driver
using the ARMv8 cryptographic extensions.
CPU compatibility with this driver is detected at
run-time; the virtual crypto device will not be
created if the CPU doesn't provide
AES, SHA1, SHA2 and NEON.

This PMD is optimized to provide a performance boost
for the processing of chained crypto operations,
such as encryption + HMAC generation or
decryption + HMAC validation. Consequently,
cipher-only and hash-only operations are
not provided.

The driver currently supports AES-128-CBC
in combination with SHA256 HMAC or SHA1 HMAC,
and relies on the external armv8_crypto library:
https://github.com/caviumnetworks/armv8_crypto
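
For reference, a chained cipher + auth transform for this PMD might be
set up as below (an illustrative sketch; exact field layout per
rte_crypto_sym.h of this release, with key setup omitted):

    #include <rte_crypto.h>

    /* Chain: AES-128-CBC encryption followed by SHA1-HMAC generation. */
    struct rte_crypto_sym_xform auth_xform = {
            .next = NULL,
            .type = RTE_CRYPTO_SYM_XFORM_AUTH,
            .auth = {
                    .op = RTE_CRYPTO_AUTH_OP_GENERATE,
                    .algo = RTE_CRYPTO_AUTH_SHA1_HMAC,
                    /* key and digest length omitted for brevity */
            },
    };
    struct rte_crypto_sym_xform cipher_xform = {
            .next = &auth_xform,
            .type = RTE_CRYPTO_SYM_XFORM_CIPHER,
            .cipher = {
                    .op = RTE_CRYPTO_CIPHER_OP_ENCRYPT,
                    .algo = RTE_CRYPTO_CIPHER_AES_CBC,
                    /* key omitted for brevity */
            },
    };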

The ARMv8 crypto PMD is built when compiling for ARM64
and the CONFIG_RTE_LIBRTE_PMD_ARMV8_CRYPTO option
is enabled in the configuration file.
The ARMV8_CRYPTO_LIB_PATH environment variable must
point to the appropriate library directory.

Signed-off-by: Zbigniew Bodek <zbigniew.bodek@caviumnetworks.com>
Reviewed-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-19 01:00:55 +01:00
Fan Zhang
d803b4439d cryptodev: add user defined name for vdev
This patch adds a user-defined name initialization parameter to the
cryptodev library.

Originally, for software cryptodev PMDs, the vdev name parameter was
treated as the driver identifier, and a unique name was created for each
device automatically, which was not necessarily the same as the vdev
parameter.

This patch allows the user to either create a unique name for his software
cryptodev or, by default, let the system create a unique one. This should
help the user manage the created cryptodevs easily.

Examples:
CLI command fragment 1: --vdev "crypto_aesni_gcm_pmd"
The above command will result in creating an AESNI-GCM PMD with the name
"crypto_aesni_gcm_X", where the postfix X is a number assigned by the
system, starting from 0. This fragment can be placed in the same CLI
command multiple times, with the postfix incremented by one for each new
device.

CLI command fragment 2: --vdev "crypto_aesni_gcm_pmd,name=gcm1"
The above command will result in creating an AESNI-GCM PMD with the name
"gcm1". This fragment can be placed in the same CLI command multiple
times, as long as each instance has a unique name value.

Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com>
Acked-by: Declan Doherty <declan.doherty@intel.com>
2017-01-18 21:48:56 +01:00
Tomasz Kulasek
2d03ec6abd crypto: support scatter-gather in software drivers
This patch introduces the RTE_CRYPTODEV_FF_MBUF_SCATTER_GATHER feature
flag, indicating that the selected crypto device supports segmented mbufs
natively, so they don't need to be coalesced before a crypto operation.

Since using segmented buffers in crypto devices that don't support them
natively may have unpredictable results, an additional check is made for
debug compilation.
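
A consumer might check the flag as below (an illustrative sketch; struct
and field names per rte_cryptodev.h of this release):

    #include <rte_cryptodev.h>

    /* Return non-zero when chained mbufs must be coalesced first. */
    static int
    needs_linearize(uint8_t dev_id)
    {
            struct rte_cryptodev_info info;

            rte_cryptodev_info_get(dev_id, &info);
            return !(info.feature_flags &
                     RTE_CRYPTODEV_FF_MBUF_SCATTER_GATHER);
    }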

Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com>
Acked-by: Declan Doherty <declan.doherty@intel.com>
2017-01-18 21:48:56 +01:00
Fan Zhang
c5b70818ad cryptodev: fix loop in device query
This patch fixes the dev value update problem in
rte_cryptodev_pmd_get_named_dev: originally, dev was not updated
after the initial step in the loop.
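
The corrected loop re-reads the device pointer on every iteration,
roughly as below (a simplified sketch, not the verbatim code):

    /* Advance dev for each index instead of assigning it only once. */
    for (i = 0; i < max_devs; i++) {
            dev = &devs[i];
            if (dev->attached &&
                strcmp(dev->data->name, name) == 0)
                    return dev;
    }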

Fixes: d11b0f30df ("cryptodev: introduce API and framework for crypto devices")

Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com>
Acked-by: Fiona Trahe <fiona.trahe@intel.com>
2017-01-18 21:48:56 +01:00
Declan Doherty
84d7965866 crypto/aesni_mb: support AVX512
Release v0.44 of Intel(R) Multi-Buffer Crypto for IPsec library adds
support for AVX512 instructions. This patch enables the new AVX512
accelerated functions from the aesni_mb_pmd crypto poll mode driver.

This patch set requires that the aesni_mb_pmd is linked against
version 0.44 or greater of the Multi-Buffer Crypto for IPsec library.
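
At initialization the PMD can then pick the widest supported vector mode
(a sketch; the RTE_AESNI_MB_* enum names here are an assumption, see the
PMD sources for the actual ones):

    /* Prefer AVX512 when the CPU advertises it, fall back to AVX2. */
    if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX512F))
            vector_mode = RTE_AESNI_MB_AVX512;   /* assumed enum name */
    else if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX2))
            vector_mode = RTE_AESNI_MB_AVX2;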

Signed-off-by: Declan Doherty <declan.doherty@intel.com>
Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>
2017-01-18 21:48:56 +01:00
Arek Kusztal
d59626753c cryptodev: add DES CBC cipher algorithm
This commit adds the DES CBC cipher algorithm to the available algorithms.

Signed-off-by: Arek Kusztal <arkadiuszx.kusztal@intel.com>
Acked-by: Fiona Trahe <fiona.trahe@intel.com>
2017-01-18 21:45:15 +01:00
Jerin Jacob
53a3ba0c36 cryptodev: fix crash on null dereference
cryptodev->data->name will be NULL when
rte_cryptodev_get_dev_id() is invoked without a valid
crypto device instance.

Fixes: d11b0f30df ("cryptodev: introduce API and framework for crypto devices")

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
Acked-by: Arek Kusztal <arkadiuszx.kusztal@intel.com>
2017-01-18 21:45:15 +01:00
Fiona Trahe
50d7f314de cryptodev: remove unused digest-appended feature
The cryptodev API had specified that if the digest address field was
left empty on an authentication operation, then the PMD would assume
the digest was appended to the source or destination data.
This case was not handled at all by most PMDs and was incorrectly handled
by the QAT PMD.
As no bugs were raised, it is assumed not to be needed, so this patch
removes it rather than adding handling for the case to all PMDs.
The digest can still be appended to the data, but its
address must now be provided in the op.
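
Applications that append the digest must now set its address explicitly,
e.g. (an illustrative fragment; 'm' is the source mbuf and 'data_len' the
authenticated data length, both assumed):

    /* Digest sits right after the packet data; pass its address. */
    op->sym->auth.digest.data =
            rte_pktmbuf_mtod_offset(m, uint8_t *, data_len);
    op->sym->auth.digest.phys_addr =
            rte_pktmbuf_mtophys_offset(m, data_len);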

Signed-off-by: Fiona Trahe <fiona.trahe@intel.com>
Acked-by: John Griffin <john.griffin@intel.com>
2017-01-18 21:45:15 +01:00
Pablo de Lara
86d8989688 efd: add AVX2 vector lookup function
Signed-off-by: Byron Marohn <byron.marohn@intel.com>
Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>
Signed-off-by: Saikrishna Edupuganti <saikrishna.edupuganti@intel.com>
Acked-by: Christian Maciocco <christian.maciocco@intel.com>
2017-01-18 20:53:45 +01:00
Pablo de Lara
56b6ef874f efd: new Elastic Flow Distributor library
Elastic Flow Distributor (EFD) is a distributor library that uses
perfect hashing to determine a target/value for a given incoming flow key.
It has the following advantages:

- First, because it uses perfect hashing, it does not store
  the key itself and hence lookup performance is not dependent
  on the key size.

- Second, the target/value can be any arbitrary value, hence
  the system designer and/or operator can better optimize service rates
  and the placement of inter-cluster network traffic.

- Third, since the storage requirement is much smaller than a hash-based
  flow table (i.e. a better fit for the CPU cache), EFD can scale to
  millions of flow keys.

- Finally, with the current optimized library implementation, performance
  is fully scalable with the number of CPU cores.
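
Basic usage might look as below (an illustrative sketch; exact signatures
per rte_efd.h):

    #include <rte_efd.h>
    #include <rte_lcore.h>

    /* Create a table, bind a flow key to a target, then look it up. */
    static efd_value_t
    efd_demo(void)
    {
            struct rte_efd_table *table = rte_efd_create("flows",
                            1 << 20,          /* max number of flow keys */
                            sizeof(uint32_t), /* key length; keys not stored */
                            1 << rte_socket_id(), rte_socket_id());
            uint32_t key = 42;

            rte_efd_update(table, rte_socket_id(), &key, 3); /* key -> 3 */
            return rte_efd_lookup(table, rte_socket_id(), &key);
    }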

Signed-off-by: Byron Marohn <byron.marohn@intel.com>
Signed-off-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>
Signed-off-by: Saikrishna Edupuganti <saikrishna.edupuganti@intel.com>
Acked-by: Christian Maciocco <christian.maciocco@intel.com>
2017-01-18 20:53:28 +01:00
Jerin Jacob
b15740bd43 eal/arm64: change barrier definitions to macros
Change the rte_*wb definitions to macros in order to
keep them consistent with the other barrier definitions in
the file.

Suggested-by: Jianbo Liu <jianbo.liu@linaro.org>
Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 17:18:26 +01:00
Jerin Jacob
e783b81d44 eal/arm64: override I/O device read/write access
Override the generic I/O device memory read/write access and implement it
using armv8 instructions for arm64.

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 17:18:26 +01:00
Jerin Jacob
a2c4cd8648 eal: let all architectures use generic I/O implementation
Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 17:17:28 +01:00
Jerin Jacob
4540fbb2ae eal: add generic I/O device read/write implementation
This patch implements the generic version of rte_read[b/w/l/q]_[relaxed]
and rte_write[b/w/l/q]_[relaxed] using rte_io_wmb() and rte_io_rmb().
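
The generic pattern pairs a relaxed access with an I/O barrier, e.g. (a
sketch mirroring the read case):

    /* Ordered read = relaxed read followed by a read barrier. */
    static inline uint32_t
    rte_read32(const volatile void *addr)
    {
            uint32_t val;

            val = rte_read32_relaxed(addr);
            rte_io_rmb();
            return val;
    }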

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 17:12:05 +01:00
Jerin Jacob
69736db1d8 eal: introduce I/O device memory read/write operations
This commit introduces 8-bit, 16-bit, 32-bit and 64-bit I/O device
memory read/write operations along with their relaxed versions.

Weakly-ordered machines like ARM need an additional I/O barrier for
device memory read/write access over the PCI bus.
By introducing an EAL abstraction for I/O device memory read/write access,
drivers can access I/O device memory in an architecture-agnostic manner.

The relaxed versions do not have the additional I/O memory barrier; they
are useful for accessing the device registers of integrated controllers,
which are implicitly strongly ordered with respect to memory access.

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
2cf953cfd8 eal/arm64: define I/O device memory barriers
CC: Jianbo Liu <jianbo.liu@linaro.org>
Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
67ce81bd3d eal/arm64: define SMP barrier
A dmb instruction based barrier is used for the SMP version of the memory barrier.

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
84733fd0d7 eal/arm64: fix memory barrier definition
A dsb instruction based barrier is used for the non-SMP
version of the memory barrier.

Fixes: d708f01b71 ("eal/arm: add atomic operations for ARMv8")
Cc: stable@dpdk.org

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
Acked-by: Jianbo Liu <jianbo.liu@linaro.org>
2017-01-18 16:57:11 +01:00
Jerin Jacob
b41508b7a4 eal/armv7: define I/O device memory barriers
The patch does not provide any functional change for ARMv7.
I/O barriers are mapped to existing smp barriers.

CC: Jan Viktorin <viktorin@rehivetech.com>
CC: Jianbo Liu <jianbo.liu@linaro.org>
Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
38b636b7cc eal/arm: separate SMP barrier definition for ARMv7 and ARMv8
Separate the smp barrier definition for arm and arm64 for fine
control on smp barrier definition for each architecture.

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
da07a35be5 eal/ppc64: define I/O device memory barriers
The patch does not provide any functional change for ppc_64.
I/O barriers are mapped to existing smp barriers.

CC: Chao Zhu <chaozhu@linux.vnet.ibm.com>
Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
dff90714b1 eal/tile: define I/O device memory barriers
The patch does not provide any functional change for tile.
I/O barriers are mapped to existing smp barriers.

CC: Zhigang Lu <zlu@ezchip.com>
Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
e802521171 eal/x86: define I/O device memory barriers
The patch does not provide any functional change for IA.
I/O barriers are mapped to existing smp barriers.
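
That is, the mapping amounts to aliases (illustrative; see the x86
rte_atomic.h for the exact definitions):

    /* IA device memory is already strongly ordered w.r.t. CPU access. */
    #define rte_io_mb()  rte_smp_mb()
    #define rte_io_wmb() rte_smp_wmb()
    #define rte_io_rmb() rte_smp_rmb()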

CC: Bruce Richardson <bruce.richardson@intel.com>
CC: Konstantin Ananyev <konstantin.ananyev@intel.com>
Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Jerin Jacob
1ea155733e eal: introduce I/O device memory barriers
This commit introduce rte_io_mb(), rte_io_wmb() and rte_io_rmb(), in
order to enable memory barriers between I/O device and CPU.

Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>
2017-01-18 16:57:11 +01:00
Wei Zhao
99e7003831 net/ixgbe: parse L2 tunnel filter
Check whether the rule is an L2 tunnel rule, and get the L2 tunnel info.

Signed-off-by: Wei Zhao <wei.zhao1@intel.com>
Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>
Acked-by: Beilei Xing <beilei.xing@intel.com>
Acked-by: Wei Dai <wei.dai@intel.com>
2017-01-17 19:41:43 +01:00
Michał Mirosław
d75547718c net/i40e: return errno when interrupt setup fails
Signed-off-by: Michał Mirosław <michal.miroslaw@atendesoftware.pl>
Reviewed-by: Jingjing Wu <jingjing.wu@intel.com>
2017-01-17 19:41:42 +01:00
Bernard Iremonger
5e823a4512 ethdev: remove some VF functions
Remove the following APIs:

rte_eth_dev_set_vf_rxmode
rte_eth_dev_set_vf_rx
rte_eth_dev_set_vf_tx
rte_eth_dev_set_vf_vlan_filter
rte_eth_dev_set_vf_rate_limit

Increment LIBABIVER in the Makefile.
Remove the deprecation notice for removing the rte_eth_dev_set_vf_* APIs.

Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com>
2017-01-17 19:40:50 +01:00
Qiming Yang
2191347120 ethdev: add firmware version get
This patch adds a new API 'rte_eth_dev_fw_version_get' for
fetching the firmware version of a given device.
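
Usage might look as below (a minimal sketch; 'port_id' is assumed, and a
non-zero return signals an error or a too-small buffer):

    char fw_version[64];

    /* Fetch and print the running firmware version of the port. */
    if (rte_eth_dev_fw_version_get(port_id, fw_version,
                                   sizeof(fw_version)) == 0)
            printf("firmware: %s\n", fw_version);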

Signed-off-by: Qiming Yang <qiming.yang@intel.com>
Acked-by: Remy Horton <remy.horton@intel.com>
2017-01-17 22:34:35 +01:00
Zhihong Wang
f5472703c0 eal: optimize aligned memcpy on x86
This patch optimizes rte_memcpy for well-aligned cases, where both
the dst and src addresses are aligned to the maximum MOV width. It
introduces a dedicated function called rte_memcpy_aligned to handle the
aligned cases with a simplified instruction stream. The existing
rte_memcpy is renamed to rte_memcpy_generic. The selection between the
two is done at the entry of rte_memcpy.
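
The dispatch at the entry looks roughly as below (a simplified sketch;
ALIGNMENT_MASK matches the maximum MOV width of the build):

    /* Route perfectly aligned copies to the simplified path. */
    static inline void *
    rte_memcpy(void *dst, const void *src, size_t n)
    {
            if (!(((uintptr_t)dst | (uintptr_t)src) & ALIGNMENT_MASK))
                    return rte_memcpy_aligned(dst, src, n);
            else
                    return rte_memcpy_generic(dst, src, n);
    }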

The existing rte_memcpy is for generic cases: it handles unaligned
copies and makes stores aligned, and it even makes loads aligned for
microarchitectures like Ivy Bridge. However, alignment handling comes at
a price: it adds extra load/store instructions, which can sometimes
cause complications.

Take DPDK vhost memcpy with the mergeable Rx buffer feature as an example:
the copy is aligned and remote, and there is a header write alongside
which is also remote. In this case the memcpy instruction stream
should be simplified, to reduce extra loads/stores and therefore reduce
the probability of pipeline stalls caused by full load/store buffers, so
that the actual memcpy instructions are issued and the H/W prefetcher
goes to work as early as possible.

This patch was tested on Ivy Bridge, Haswell and Skylake; it provides
up to 20% gain for Virtio Vhost PVP traffic, with packet sizes ranging
from 64 to 1500 bytes.

The test can also be conducted without a NIC, by setting up loopback
traffic between Virtio and Vhost. For example, modify the macro
TXONLY_DEF_PACKET_LEN to the requested packet size in testpmd.h,
rebuild and start testpmd in both host and guest, then run "start" on
one side and "start tx_first 32" on the other.

Signed-off-by: Zhihong Wang <zhihong.wang@intel.com>
Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>
Tested-by: Lei Yao <lei.a.yao@intel.com>
2017-01-17 16:40:05 +01:00
Yuanhan Liu
d948f596fe ethdev: fix port data mismatched in multiple process model
Assume we have two virtio ports, 00:03.0 and 00:04.0. The first one is
managed by the kernel driver, while the later one is managed by DPDK.

Now we start the primary process. 00:03.0 will be skipped by the DPDK
virtio PMD (since it's being used by the kernel). 00:04.0 will be
successfully initialized by the DPDK virtio PMD (if nothing abnormal
happens). After that, we get port id 0, and all the related info needed
by virtio (virtio_hw) is stored at rte_eth_dev_data[0].

Then we start the secondary process. As usual, 00:03.0 will be probed
first. It first tries to get a local eth_dev structure for it (by
rte_eth_dev_allocate):

        port_id = rte_eth_dev_find_free_port();
        ...

        eth_dev = &rte_eth_devices[port_id];
        eth_dev->data = &rte_eth_dev_data[port_id];
        ...

        return eth_dev;

Since it's the first PCI device, port_id will be 0. eth_dev->data will
then point to rte_eth_dev_data[0]. And here things start going wrong,
as rte_eth_dev_data[0] actually stores the virtio_hw for 00:04.0.

That said, in the secondary process, DPDK will continue to drive PCI
device 00:03.0 (despite the fact it's being managed by the kernel), with
the info from PCI device 00:04.0. Which is wrong.

The fix is to attach to the port already registered by the primary process.
That is, iterate over rte_eth_dev_data[], and get the port id whose PCI
address matches that of the current PCI device.

This lets us maintain the same port ID for the same PCI device, keeping
the chance of referencing wrong data minimal.
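
In sketch form (simplified; the helper name is illustrative, not
necessarily the one added by this patch):

    /* Secondary process: reuse the port slot registered by the primary;
     * the device name encodes the PCI address. */
    static struct rte_eth_dev *
    eth_dev_attach_secondary(const char *name)
    {
            uint8_t i;

            for (i = 0; i < RTE_MAX_ETHPORTS; i++) {
                    if (strcmp(rte_eth_dev_data[i].name, name) == 0)
                            return &rte_eth_devices[i];
            }
            return NULL;
    }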

Fixes: af75078fec ("first public release")
Cc: stable@dpdk.org

Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>
Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>
2017-01-17 09:20:18 +01:00
Jan Wickbom
59317cef24 vhost: allow many vhost-user ports
Currently select() is used to monitor file descriptors for vhost-user
ports. This limits the number of ports possible to create, since the
fd number is used as an index into the fd_set and we have seen fds > 1023.
This patch changes select() to poll(). This way we can keep a
packed (pollfd) array for the fds, i.e. as many fds as the size of
the array.

Also see:
http://dpdk.org/ml/archives/dev/2016-April/037024.html

Reported-by: Patrik Andersson <patrik.r.andersson@ericsson.com>
Signed-off-by: Jan Wickbom <jan.wickbom@ericsson.com>
Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>
2017-01-17 09:20:18 +01:00
Maxime Coquelin
73c8f9f69c vhost: introduce reply ack feature
The REPLY_ACK feature provides a generic way for QEMU to ensure both
completion and success of a request.

As described in the vhost-user spec in the QEMU repository, QEMU sets
the VHOST_USER_NEED_REPLY flag (bit 3) when expecting a reply-ack from
the backend. The backend must reply with 0 for success, or non-zero
otherwise, when the flag is set.

Currently, only VHOST_USER_SET_MEM_TABLE request implements reply_ack,
in order to synchronize mapping updates.

This patch enables the REPLY_ACK feature generally, but only checks the
error code for VHOST_USER_SET_MEM_TABLE.
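
On the backend side this amounts to something like (a simplified sketch;
the message helper and field names here are assumptions based on the
vhost-user implementation of this era):

    /* Ack the request when QEMU asked for a reply. */
    if (msg.flags & VHOST_USER_NEED_REPLY) {
            msg.payload.u64 = ret ? 1 : 0;  /* 0 means success */
            msg.size = sizeof(msg.payload.u64);
            send_vhost_message(connfd, &msg);  /* assumed helper */
    }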

Signed-off-by: Maxime Coquelin <maxime.coquelin@redhat.com>
Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>
2017-01-17 09:20:18 +01:00
Yong Wang
5731d6acc7 vhost: fix memory leak
In the function vhost_new_device(), the current code does not free 'dev'
in the "i == MAX_VHOST_DEVICE" branch, which leads to a
memory leak.

Fixes: 45ca9c6f7b ("vhost: get rid of linked list for devices")
Cc: stable@dpdk.org

Signed-off-by: Yong Wang <wang.yong19@zte.com.cn>
Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>
2017-01-17 09:20:17 +01:00
Haifeng Lin
8c33fc10f6 vhost: fix guest/host physical address mapping
When reg_size < page_size, the read in rte_mem_virt2phy
would not return valid data, because
host_user_addr is invalid.

Fixes: e246896178 ("vhost: get guest/host physical address mappings")
Cc: stable@dpdk.org

Signed-off-by: Haifeng Lin <haifeng.lin@huawei.com>
Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>
2017-01-17 09:20:17 +01:00
Tomasz Kulasek
1feda4d8fc mbuf: add a function to linearize a packet
This patch adds the function rte_pktmbuf_linearize to let crypto PMDs
coalesce chained mbufs before a crypto operation and extend their
capabilities to support segmented mbufs when the device cannot handle
them natively.

Unit tests for the rte_pktmbuf_linearize functionality are included:

 1) Create a bunch of segmented mbufs with different sizes and numbers
    of segments.
 2) Fill the non-contiguous mbuf with sequential values.
 3) Use rte_pktmbuf_linearize to coalesce the segmented buffer into one
    contiguous buffer.
 4) Verify the data in the linearized buffer.
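
Typical use before submitting to a device without native scatter-gather
support might be (an illustrative sketch):

    /* Coalesce a chained mbuf in place; fails with a negative value
     * when the first segment lacks tailroom for the whole packet. */
    if (m->nb_segs > 1 && rte_pktmbuf_linearize(m) < 0)
            return -ENOMEM;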

Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com>
Acked-by: Olivier Matz <olivier.matz@6wind.com>
2017-01-15 19:30:00 +01:00
Tiwei Bie
375008544b ethdev: add MACsec capability flags
If these flags are advertised by a PMD, the NIC supports the MACsec
offload. Incoming MACsec traffic can be offloaded transparently
after the MACsec offload is configured correctly by the application.
And the application can set the PKT_TX_MACSEC flag in mbufs to enable
the MACsec offload for the packets to be transmitted.

Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>
2017-01-15 19:16:19 +01:00
Tiwei Bie
8609a7a86c ethdev: add MACsec event type
This commit adds the event type below:

- RTE_ETH_EVENT_MACSEC

This event will occur when the PN counter in a MACsec connection
reaches the exhaustion threshold.

Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>
2017-01-15 19:16:03 +01:00
Tiwei Bie
223d629f8c mbuf: add MACsec flag
Add a new Tx flag in the mbuf, which can be set by applications to
enable the MACsec offload for a packet to be transmitted.
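
For example (a minimal sketch):

    /* Request MACsec offload for this packet before transmit. */
    m->ol_flags |= PKT_TX_MACSEC;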

Signed-off-by: Tiwei Bie <tiwei.bie@intel.com>
Acked-by: Olivier Matz <olivier.matz@6wind.com>
2017-01-15 19:15:51 +01:00
Bruce Richardson
0615355ffa kvargs: make pointers in string arrays const
Change the parameters of functions from const char *valid[] to
const char * const valid[]. This additional const is needed to
allow us to fix some checkpatch warnings, as well as being good
programming practice.

For the checkpatch warnings, if we have a set of command line
args that we want to check defined as:
	static const char *args[] = { "arg1", "arg2", NULL };
	kvlist = rte_kvargs_parse(params, args);

checkpatch will complain:
	WARNING:STATIC_CONST_CHAR_ARRAY: static const char *
	array should probably be static const char * const

Adding the additional const to the definition of the args
will then trigger a compiler error in the absence of this
change to the kvargs library, as we lose the const in the
call to kvargs_parse.
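
With this change in place, the const-qualified declaration compiles
cleanly:

	static const char * const args[] = { "arg1", "arg2", NULL };
	kvlist = rte_kvargs_parse(params, args);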

Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>
Acked-by: Olivier Matz <olivier.matz@6wind.com>
2017-01-13 19:28:26 +01:00
Wenfeng Liu
454a0a7009 mempool: use cache in single producer or consumer mode
Currently we check the mempool flags when we put/get objects to/from the
mempool. However, this makes the cache useless in the SC|SP, SC|MP and
MC|SP cases.
This patch makes the cache available in the above cases and improves
performance.

Signed-off-by: Wenfeng Liu <liuwf@arraynetworks.com.cn>
Acked-by: Olivier Matz <olivier.matz@6wind.com>
2017-01-13 16:38:09 +01:00
Ben Walker
b0d3e3f73b pci: use address struct in function arguments
Instead of passing domain, bus, devid, func, just pass
an rte_pci_addr.

Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>
2017-01-12 15:55:16 +01:00
Ben Walker
22dda618c0 pci: separate detaching ethernet ports from PCI devices
Attaching and detaching ethernet ports from an application
is not the same thing as physically removing a PCI device,
so clarify the flags indicating support. All PCI devices
are assumed to be physically removable, so no flag is
necessary in the PCI layer.

Signed-off-by: Ben Walker <benjamin.walker@intel.com>
2017-01-12 15:48:54 +01:00
Ben Walker
e84ad157b7 pci: unmap resources if probe fails
If resources were mapped prior to probe, unmap them
if probe fails.

This does not handle the case where the kernel driver was
forcibly unbound prior to probe.

Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com>
2017-01-12 15:47:45 +01:00
Adrien Mazarguil
6de5c0f130 ethdev: define default item masks in flow API
Leaving default pattern item mask values up for interpretation by PMDs is
undefined behavior that applications might find difficult to use in the
wild. It also needlessly complicates PMD implementations.

This commit addresses this by defining consistent default masks for each
item type.
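
With defined defaults, a NULL item mask now has a well-known meaning (an
illustrative sketch; 'ip_spec' is assumed):

    /* A NULL mask selects the item type's documented default,
     * e.g. rte_flow_item_ipv4_mask for IPV4. */
    struct rte_flow_item pattern[] = {
            { .type = RTE_FLOW_ITEM_TYPE_ETH },
            { .type = RTE_FLOW_ITEM_TYPE_IPV4,
              .spec = &ip_spec, .mask = NULL },
            { .type = RTE_FLOW_ITEM_TYPE_END },
    };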

Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>
2017-01-11 16:54:52 +01:00
Adrien Mazarguil
f4e7c8bc84 ethdev: clarify RSS action in flow API
Contrary to the current description, mbuf RSS hash result storage does not
overlap with the returned MARK value (hash.fdir.lo vs. hash.fdir.hi), and
both may be combined.

Reflect this change by allowing testpmd to display both values
simultaneously.

Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>
2017-01-11 16:54:47 +01:00
Adrien Mazarguil
ebaa064e2c ethdev: clarify MARK and FLAG actions in flow API
Both actions share the PKT_RX_FDIR mbuf flag, as a result there is no way
to tell them apart. Moreover, the maximum allowed value for the MARK action
may not necessarily cover the entire 32-bit space.

Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>
2017-01-11 16:54:22 +01:00
Adrien Mazarguil
a757583e55 ethdev: modify flow API error function
Based on initial PMD implementations of the flow API, returning the error
structure which may be NULL is useless and always discarded.

Returning the error code instead appears to be much more convenient.

Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>
2017-01-11 16:53:07 +01:00
Nelio Laranjeiro
86c743cf91 eal: define generic vector types
Add common vector type definitions to all CPU architectures.

Signed-off-by: Nelio Laranjeiro <nelio.laranjeiro@6wind.com>
Acked-by: Chao Zhu <chaozhu@linux.vnet.ibm.com>
2017-01-04 21:50:19 +01:00
Thomas Monjalon
c6dab2a873 tools: move to usertools
Rename tools/ into usertools/ to differentiate from buildtools/
and devtools/ while making clear these scripts are part of
DPDK runtime.

Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>
Tested-by: Ferruh Yigit <ferruh.yigit@intel.com>
2017-01-04 21:17:32 +01:00
Tomasz Kulasek
4fb7e803eb ethdev: add Tx preparation
Added API for `rte_eth_tx_prepare`

uint16_t rte_eth_tx_prepare(uint8_t port_id, uint16_t queue_id,
	struct rte_mbuf **tx_pkts, uint16_t nb_pkts)

Added fields to the `struct rte_eth_desc_lim`:

	uint16_t nb_seg_max;
		/**< Max number of segments per whole packet. */

	uint16_t nb_mtu_seg_max;
		/**< Max number of segments per one MTU */

These fields can be used to create valid packets according to the
following rules:

 * For non-TSO packet, a single transmit packet may span up to
   "nb_mtu_seg_max" buffers.

 * For TSO packet the total number of data descriptors is "nb_seg_max",
   and each segment within the TSO may span up to "nb_mtu_seg_max".

Added functions:

int
rte_validate_tx_offload(struct rte_mbuf *m)

  to validate general requirements for the Tx offloads set in the mbuf
  of a packet, such as flag completeness. In the current implementation
  this function is called optionally, when RTE_LIBRTE_ETHDEV_DEBUG is
  enabled.

int rte_net_intel_cksum_prepare(struct rte_mbuf *m)

  to prepare the pseudo-header checksum for TSO and non-TSO TCP/UDP
  packets before the hardware Tx checksum offload.
   - for non-TSO TCP/UDP packets the full pseudo-header checksum is
     computed and set;
   - for TSO the IP payload length is not included.

int
rte_net_intel_cksum_flags_prepare(struct rte_mbuf *m, uint64_t ol_flags)

  this function uses the same logic as rte_net_intel_cksum_prepare, but
  allows the application to choose which offloads should be taken into
  account, if full preparation is not required.
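
The expected call flow is a prepare pass followed by the burst (an
illustrative sketch):

    uint16_t nb_prep, nb_tx;

    /* Validate and prepare the burst, then transmit what passed. */
    nb_prep = rte_eth_tx_prepare(port_id, queue_id, pkts, nb_pkts);
    if (nb_prep < nb_pkts)
            printf("tx_prepare failed on packet %u: %s\n",
                   nb_prep, strerror(rte_errno));
    nb_tx = rte_eth_tx_burst(port_id, queue_id, pkts, nb_prep);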

PERFORMANCE TESTS
-----------------

This feature was tested with a modified csum engine from test-pmd.

The packet checksum preparation was moved from the application to the Tx
preparation step placed before the burst.

We may expect some overhead costs caused by:
1) using additional callback before burst,
2) rescanning burst,
3) additional condition checking (packet validation),
4) worse optimization (e.g. packet data access, etc.)

We tested it using ixgbe Tx preparation implementation with some parts
disabled to have comparable information about the impact of different
parts of implementation.

IMPACT:

1) For an unimplemented Tx preparation callback the performance impact
   is negligible,
2) For the packet condition check without checksum modifications
   (nb_segs, available offloads, etc.) it is 14626628/14252168
   (~2.62% drop),
3) Full support in the ixgbe driver (point 2 plus packet checksum
   initialization) is 14060924/13588094 (~3.48% drop).

Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com>
Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>
Acked-by: Olivier Matz <olivier.matz@6wind.com>
Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>
2017-01-04 20:40:15 +01:00