numam-dpdk

Author	SHA1	Message	Date
Benoît Canet	45db8927a8	vhost: add hint on how to add or remove device to a data core Let's make sure people will not forget to set and unset VIRTIO_DEV_RUNNING. Signed-off-by: Benoît Canet <benoit.canet@nodalink.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-03-17 12:39:52 +01:00
Vlad Zolotarov	01fa1d6215	ixgbe: unify Rx setup - Set the callback in a single function that is called from ixgbe_dev_rx_init() for a primary process and from eth_ixgbe_dev_init() for a secondary processes. This is instead of multiple, hard to track places. - Added ixgbe_hw.rx_bulk_alloc_allowed - see ixgbe_hw.rx_vec_allowed description below. - Added ixgbe_hw.rx_vec_allowed: like with Bulk Allocation, Vector Rx is enabled or disabled on a per-port level. All queues have to meet the appropriate preconditions and if any of them doesn't - the feature has to be disabled. Therefore ixgbe_hw.rx_vec_allowed will be updated during each queues configuration (rte_eth_rx_queue_setup()) and then used in rte_eth_dev_start() to configure the appropriate callbacks. The same happens with ixgbe_hw.rx_vec_allowed in a Bulk Allocation context. - Bugs fixed: - Vector scattered packets callback was called regardless the appropriate preconditions: - Vector Rx specific preconditions. - Bulk Allocation preconditions. - Vector Rx was enabled/disabled according to the last queue setting and not based on all queues setting (which may be different for each queue). Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-03-17 00:46:01 +01:00
Vlad Zolotarov	18f81142d6	ixgbe: fix Rx CRC stripping for X540 According to x540 spec chapter 8.2.4.8.9 CRCSTRIP field of RDRXCTL should be configured to the same value as HLREG0.RXCRCSTRP. Clearing the RDRXCTL.RSCFRSTSIZE field for x540 is not required by the spec but seems harmless. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-03-17 00:46:01 +01:00
Vlad Zolotarov	d07fb25860	ixgbe: fix endianness of ring descriptor access Use the rte_le_to_cpu_xx()/rte_cpu_to_le_xx() when reading/setting HW ring descriptor fields. Fixed the above in ixgbe_rx_alloc_bufs() and in ixgbe_recv_scattered_pkts(). Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-03-17 00:46:01 +01:00
Michael Qiu	9fc985ae9d	fm10k: set pointer to NULL after free It could be a potential not safe issue. Signed-off-by: Michael Qiu <michael.qiu@intel.com> Acked-by: Jing Chen <jing.d.chen@intel.com>	2015-03-17 00:46:01 +01:00
Huawei Xie	28a1ccca41	vhost: add build option for vhost-user Turn on CONFIG_RTE_LIBRTE_VHOST to enable vhost. vhost-user is turned on by default. Turn off CONFIG_RTE_LIBRTE_VHOST_USER to enable vhost-cuse implementation. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-17 00:46:01 +01:00
John McNamara	66abc3f310	eal: fix type casting of value to align Fix a warning when the rte_common.h header is included in a compilation using -Wbad-function-cast, such as in Open vSwitch where the following warning is emitted repeatedly: ../rte_common.h: In function 'rte_is_aligned': ../rte_common.h:184:9: warning: cast from function call of type 'uintptr_t' to non-matching type 'void *' [-Wbad-function-cast] This change fixes the issue in rte_common.h by using the RTE_ALIGN_FLOOR macro to get the aligned floor value with generic type casting. Also removed the rte_align_floor_int() function and replaced it with the RTE_PTR_ALIGN_FLOOR() macro. Signed-off-by: John McNamara <john.mcnamara@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-17 00:46:01 +01:00
Thomas Monjalon	9a01c31b94	eal: fix build with icc and gcc < 4.4 x86intrin.h cannot be directly included as it is not always available. rte_common_vect.h handles compiler differences. Suggested-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-03-16 16:59:53 +01:00
David Marchand	f07246766d	eal: fix tailq init for uio and vfio resources Commit `a2348166ea` ("tailq: move to dynamic tailq") introduced a bug in uio/vfio resources list init. These resources list were pointed at through a pointer initialised only once but too early in the eal init (before tailqs init). Fix this by "resolving" this pointer when used (which is well after tailqs init). Fixes: `a2348166ea` ("tailq: move to dynamic tailq") Reported-by: Marvin Liu <yong.liu@intel.com> Reported-by: Tetsuya Mukawa <mukawa@igel.co.jp> Signed-off-by: David Marchand <david.marchand@6wind.com> Tested-by: John McNamara <john.mcnamara@intel.com> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-03-12 08:32:48 +01:00
Jingjing Wu	0ef38281f9	ixgbe: fix supported flow types Ixgbe doesn't support the ipv4-frag and ipv6-frag flow types, remove them. Ixgbe doesn't support configure flex mask on each kind of flow type, this patch uses RTE_ETH_FLOW_UNKNOWN to specify global flex mask setting. This patch also changes the string "raw" to "none" to indicate flex mask setting unrelated with flow type in testpmd for ixgbe. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com>	2015-03-10 16:44:54 +01:00
Bruce Richardson	a4a61f79e5	eal/bsd: enable contigmem blocks >1GB in size The contigmem module was using an "int" type for specifying the size of blocks of memory to be reserved. A 2GB block was therefore overflowing the signed 32-bit value, making 1GB the largest block size that could be reserved as a single unit. The fix is to change the type used for the buffer/block size to an "int64_t" value. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>	2015-03-10 16:19:43 +01:00
Adrien Mazarguil	66e1591687	mlx4: avoid init errors when kernel modules are not loaded Mimic UIO/VFIO drivers behavior by not causing errors when a device cannot be initialized due to missing or mismatching kernel modules. Display helpful messages instead, such as: [...] EAL: PCI device 0000:83:00.0 on NUMA socket 1 EAL: probe driver: 15b3:1007 librte_pmd_mlx4 PMD: librte_pmd_mlx4: PCI information matches, using device "mlx4_0" (VF: false) PMD: librte_pmd_mlx4: cannot use device, are drivers up to date? EAL: PCI device 0000:84:00.0 on NUMA socket 1 EAL: probe driver: 15b3:1007 librte_pmd_mlx4 PMD: librte_pmd_mlx4: PCI information matches, using device "mlx4_1" (VF: false) PMD: librte_pmd_mlx4: cannot use device, are drivers up to date? EAL: No probed ethernet devices [...] Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2015-03-10 16:08:15 +01:00
Stephen Hemminger	1257d1734b	ixgbe: rename igb prefix To avoid any possible confusion or breakage, rename all the structures of ixgbe driver to use ixgbe_ rather than igb_ because igb is a different driver. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-10 15:24:06 +01:00
Stephen Hemminger	4bb4414040	ixgbe: prefix global function All global functions in a driver should use the same prefix to avoid any future name collisions. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Reviewed-by: Bruce Richardson <bruce.richardson@intel.com> [Bruce: one instance was missing] Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-10 15:23:21 +01:00
Stephen Hemminger	b74381cf91	ixgbe: make bulk alloc static Only used in this file, make it static. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-10 15:16:23 +01:00
Stephen Hemminger	19731cfdf3	ixgbe: make register maps const These are const data structures, just put them in txt segment rather than having compiler emit code to set them up on the stack. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-10 15:12:15 +01:00
Stephen Hemminger	3fd612a907	ixgbe: make txq_ops const All virtual function tables should be const so they are put in text segment rather than data. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-10 15:04:54 +01:00
David Marchand	da0113c539	eal: remove useless errno There is no remaining reference to E_RTE_NO_TAILQ. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 12:17:31 +01:00
David Marchand	95b6a46fa6	tailq: remove static slots No static entry remaining, the rte_tailq api is for "internal use" only, get rid of the static slots. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 12:15:14 +01:00
David Marchand	a2348166ea	tailq: move to dynamic tailq Use dynamic tailq rather than static entries. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 12:06:08 +01:00
David Marchand	873a61c752	tailq: introduce dynamic register system This register system makes it possible to reserve a tailq for the dpdk libraries. The "dynamic" tailqs are right after the "static" tailqs in shared mem. Primary process is responsible for writing the tailq names, so that secondary processes can find them. This is a temp commit, "static" tailqs are removed after conversion of all users in next commits. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 11:58:02 +01:00
David Marchand	598a9cc804	tailq: remove unused macros A lot of places just protect against concurrent access and I can not see the gain of having those macros. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 11:55:07 +01:00
David Marchand	9b7e0dbb6c	tailq: get rid of broken reserve api The "reserve" macros and functions do not check if the requested entry is free. They do nothing more than the lookup function (which itself "creates" entries ...). The rte_tailq api is marked as "internal use" in documentation and these macros are only used in test application, so just get rid of them. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 11:51:12 +01:00
David Marchand	f6b4f6c9c1	tailq: use a single cast macro No need to cast everywhere, define a common macro for this, plus it can be used in future commits. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 11:49:26 +01:00
David Marchand	ff708facfc	tailq: remove unneeded inclusions Only keep inclusion where really needed. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 11:47:46 +01:00
David Marchand	b8320b9824	pci: use lookup tailq api There is no reason why we should use the "reserve" tailq api, since the pci entry is already statically reserved. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-10 11:40:21 +01:00
David Marchand	47cb11c9bf	eal: remove remaining reference to pm Hopefully, this is the last reference to pm. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-03-09 18:14:29 +01:00
Thomas Monjalon	1f693b8f37	ethdev: remove useless parameter in init functions The pointer to struct eth_driver is not used and is already set in struct rte_eth_dev. It's a small cleanup in PMD API which probably needs more attention to make clear what is a driver, a PCI driver, an ethernet driver, etc. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-03-09 18:07:44 +01:00
Michael Qiu	12fa4a0078	hash: fix unsupported crc instruction in i686 platform Error: unsupported instruction `crc32' The root cause is that i686 platform does not support 'crc32q' Need make it only available in x86_64 platform. Signed-off-by: Michael Qiu <michael.qiu@intel.com> Acked-by: Yerden Zhumabekov <yerden.zhumabekov@sts.kz>	2015-03-09 18:07:43 +01:00
Michael Qiu	2eca94d521	eal/x86: fix redeclaration of registers include/rte_cpuflags.h:154:2: error: redeclaration of enumerator ‘REG_EAX’ In i686, from REG_EAX to REG_EDX are all defined in /usr/include/sys/ucontext.h Rename to RTE_REG_EAX to avoid this issue. Signed-off-by: Michael Qiu <michael.qiu@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-03-09 18:07:43 +01:00
Konstantin Ananyev	2a0911ec8b	eal: fix C++11 compilation When compiling C++11-code or above (--std=c++11), the build fails with lots of rte_eth_ctrl.h:517:3: note: in expansion of macro RTE_ALIGN (RTE_ALIGN(RTE_ETH_FLOW_MAX, UINT32_BIT)/UINT32_BIT) ^ When reading the GCC info pages, I get the feeling that __typeof__ is a better choice, and that indeed works when including the headers in C++ files (--std=c++11). Signed-off-by: Simon Kagstrom <simon.kagstrom@netinsight.net> Signed-off-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-03-09 17:38:41 +01:00
Michal Jastrzebski	64b01ee0b2	app/testpmd: check vlan filter configuration This patch modifies testpmd behavior when setting: rx_vlan add all vf_port (enabling all vlanids to be passed thru rx filter on VF). Rx_vlan_all_filter_set() function, checks if the next vlanid can be enabled by the driver. Number of vlanids is limited by the NIC and thus the NIC do not allow to enable more vlanids than it can allocate in VFTA table. Signed-off-by: Michal Jastrzebski <michalx.k.jastrzebski@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-03-09 12:47:26 +01:00
Takuya Asada	cff90bb7ec	virtio: add default Tx configuration When I tried to launch test-pmd on KVM guest of Fedora21, I got following error: Configuring Port 0 (socket 0) Fail to configure port 0 tx queues EAL: Error - exiting with code: 1 Cause: Start ports failed I found that the error caused here, and actual error message was "TX checksum offload not supported". This patch adds default_txconf on virtio pmd, to avoid the error. Signed-off-by: Takuya Asada <syuu@cloudius-systems.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-03-09 12:46:46 +01:00
Huawei Xie	4d16fff496	vhost: check file descriptor before closing This avoids closing -1 in our case. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-03-09 12:46:46 +01:00
Huawei Xie	64ab971791	vhost: fix file descriptors naming Previous vhost implementation wrongly name kickfd as callfd and callfd as kickfd. It is functional correct, but causes confusion. Exchange kickfd and callfd to avoid confusion. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-03-09 12:46:46 +01:00
Huawei Xie	aee87dd706	vhost: use loop instead of goto This patch reorder the code a bit to use loop instead of goto. Besides, remove abudant check 'fd != -1'. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-03-09 12:46:46 +01:00
Huawei Xie	31ff0c6a45	vhost: combine select with sleep combine sleep into select when there is no file descriptors to be monitored. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-03-09 12:46:46 +01:00
Ouyang Changchun	db566f5930	ixgbe: fix VF Tx for X550 X550 should use the correct macro to set the VFTDT and VFRDT register address. This patch fixes the VF TX issue for Sageville. Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Cunming Liang <cunming.liang@intel.com>	2015-03-09 12:46:46 +01:00
Michael Qiu	7eb689794a	eal/x86: fix integer cast in memcpy ./i686-native-linuxapp-gcc/include/rte_memcpy.h:592:23: error: cast from pointer to integer of different size [-Werror=pointer-to-int-cast] dstofss = 16 - (int)((long long)(void )dst & 0x0F) + 16; Type 'long long' is 64-bit in i686 platform while 'void ' is 32-bit. Signed-off-by: Michael Qiu <michael.qiu@intel.com> Signed-off-by: Zhihong Wang <zhihong.wang@intel.com>	2015-03-09 12:46:46 +01:00
Zhihong Wang	76746eb13f	eal/x86: fix strict aliasing rules Fixed strict-aliasing rules breaking errors for some GCC version. Signed-off-by: Zhihong Wang <zhihong.wang@intel.com> Acked-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Michael Qiu <michael.qiu@intel.com>	2015-03-09 12:46:46 +01:00
Huawei Xie	391b5f425c	vhost: fix crash by removing device when requested This patch fixes the segfault issue in the case vhost receives new VHOST_SET_MEM_TABLE message without VHOST_VRING_GET_VRING_BASE (which we uses as the stop message). Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Tommy Long <thomas.long@intel.com>	2015-03-05 22:08:27 +01:00
Panu Matilainen	5e62bdde85	ethdev: add missing symbol export for port release Fixes: `36ec8585b2` ("ethdev: release port") Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-03-05 21:58:07 +01:00
Keith Wiles	d8b8517893	ethdev: fix hotplug check for Rx and Tx callbacks Some checks with rte_eth_dev_is_valid_port() were missed when merging hotplug and callbacks features. Fixes: `c282abd2a6` ("ethdev: remove assumption that port will not be detached") Signed-off-by: Keith Wiles <keith.wiles@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2015-03-05 21:30:47 +01:00
Cunming Liang	352078e8e1	ixgbe: check rxd number to avoid mbuf leak The mbuf leak happens when the assigned number of rx descriptor is not power of 2 in vector mode. As it's presumed on vpmd rx (for rx_tail wrap), adding condition check to prevent it. The root cause reference code in _recv_raw_pkts_vec as below. "rxq->rx_tail = (uint16_t)(rxq->rx_tail & (rxq->nb_rx_desc - 1));". Reported-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-03-05 20:06:06 +01:00
Stephen Hemminger	490a1f0a25	enic: remove useless cast Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-03-04 21:50:42 +01:00
Stephen Hemminger	d886477be7	eal/linux: remove useless memset The path variable is set via snprintf, and does not need to memset before that. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-03-04 21:50:42 +01:00
Stephen Hemminger	944127d1ed	eal/bsd: remove useless assignments If variable is set in the next line, it doesn't need to be initialized. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2015-03-04 21:50:42 +01:00
Pawel Wodkowski	03b2f7716b	cmdline: fix parameter type Fix warning reported during static analysis about size_t to int cast when passing parameters to parse_set_list(). This patch fix code formating errors that give checkpatch.pl errors after generating patch. Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-03-04 11:19:37 +01:00
Pawel Wodkowski	460a165075	ring: fix memory leak Free kvlist on function exit to avoid memory leak during devinit. Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-03-04 11:19:37 +01:00
Pawel Wodkowski	c34af7424e	kvargs: fix freeing behaviour for null By convention free() functions should ignore NULL parameter. This patch add this behaviour for rte_kvargs_free(). Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-03-04 11:19:37 +01:00
Pawel Wodkowski	5663c25dcc	timer: fix callback declaration inconsistency This patch remove inconsistency between declaration of type rte_timer_cb_t, field f in struct rte_timer and function __rte_timer_reset(). Although compiler treat both of them the same, the static analysis tool like complain about that. Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-03-04 11:19:37 +01:00
Thomas Monjalon	4b1b380213	bond: remove debug function to fix link with shared lib The function print_client_stats was used in the example without being clearly exported in the map file. So it breaks linking with shared library when debug is enabled. It's better to remove this function as it probably could be implemented with statistics API. Fixes: `cc7e8ae84f` ("add example application for link bonding mode 6") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-03-04 11:19:25 +01:00
Thomas Monjalon	a91d686358	mlx4: mute auto config in quiet mode If verbose is off, auto-config-h.sh script should be quiet. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2015-03-04 11:19:21 +01:00
Thomas Monjalon	9d740b97f2	mlx4: fix build with mempool debug enabled The mempool header forces error on -Wcast-qual and makes verbs.h failing. Let's include verbs before as a system header. Fixes: `7fae69eeff` ("mlx4: new poll mode driver") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2015-03-04 11:19:07 +01:00
Thomas Monjalon	80edfef4a8	virtio: fix build with debug enabled With CONFIG_RTE_LIBRTE_VIRTIO_DEBUG_INIT=y: error: ‘devname’ undeclared (first use in this function) Fixes: `da978dfdc4` ("virtio: use port IO to get PCI resource") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-04 11:18:53 +01:00
Thomas Monjalon	459132bc2c	virtio: fix build with mempool debug enabled The mempool header forces error on -Wcast-qual: error: cast discards ‘const’ qualifier from pointer target type Let's fix it by removing const qualifier of pci driver from commit `5e9f6d1340` ("pci: reference driver structure for each device") It's needed because the driver flags are changed depending on using uio or not. Actually these driver flags should be directly attached to each device. Fixes: `da978dfdc4` ("virtio: use port IO to get PCI resource") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-03-04 11:18:36 +01:00
Thomas Monjalon	ae6ad6b0c4	fm10k: fix build with debug enabled error: implicit declaration of function ‘RTE_LOG’ Fixes: `a6061d9e70` ("fm10k: register PF driver") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-03-04 11:18:36 +01:00
Thomas Monjalon	22e34d1211	mempool: fix build with debug enabled error: format ‘%p’ expects argument of type ‘void ’, but argument 5 has type ‘const struct rte_mempool ’ [-Werror=format=] mp type is (const struct rte_mempool *) and must be casted into a simpler type to be printed. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-03-04 11:18:27 +01:00
Thomas Monjalon	8cf61ec829	eal/linux: fix build Compilation fails in some distributions because of missing unistd.h needed for pread/pwrite (seen with Suse): lib/librte_eal/linuxapp/eal/eal_pci_uio.c:62:2: error: implicit declaration of function ‘pread’ Fixes: `4a499c6495` ("eal/linux: enable uio_pci_generic support") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-03-03 23:21:33 +01:00
Pawel Wodkowski	a001589ec1	devargs: fix null dereferencing on failure On failure devargs->args should not be accessed if devargs is NULL. Fixes: `c07691ae10` ("devargs: remove limit on parameters length") Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-03-02 19:40:54 +01:00
Neil Horman	c3615e4a80	eal: clean up export of socket id variable Theres no need to export this variable. Its set and queried from an API call that doesn't exist in the hot path. Instead just export the rte_socket_id symbol and make the variable private to protect it from type changes. We should do this with the other exported variables too, but I think its too late in the release cycle to do that. tested using distributor_autotest (which uses rte_socket_id), successfully. Only tested on linux, as I don't currently have a bsd system spun up, but the changes are symmetric, and should be fine Signed-off-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Cunming Liang <cunming.liang@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-03-02 19:40:20 +01:00
Jijiang Liu	ea5c23df30	i40e: advertise TSO capability Advertise the DEV_TX_OFFLOAD_TCP_TSO flag in the PMD features. It means that the i40e PMD supports the offload of TSO. Test report: http://www.dpdk.org/ml/archives/dev/2015-March/014467.html Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Signed-off-by: Miroslaw Walukiewicz <miroslaw.walukiewicz@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com> Tested-by: Min Cao <min.cao@intel.com>	2015-03-02 07:36:30 +01:00
Jijiang Liu	6fa72b444c	i40e: enable TSO support This patch enables i40e TSO feature for both non-tunneling packet and tunneling packet. Test report: http://www.dpdk.org/ml/archives/dev/2015-March/014467.html Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Signed-off-by: Miroslaw Walukiewicz <miroslaw.walukiewicz@intel.com> Signed-off-by: Grzegorz Galkowski <grzegorz.galkowski@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com> Tested-by: Min Cao <min.cao@intel.com>	2015-03-02 07:36:13 +01:00
Jijiang Liu	1f02d682d3	i40e: move Tx offloads parameters to separate structure The structure size is u64 so it could be used with single cpu operation. Test report: http://www.dpdk.org/ml/archives/dev/2015-March/014467.html Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Signed-off-by: Miroslaw Walukiewicz <miroslaw.walukiewicz@intel.com> Signed-off-by: Grzegorz Galkowski <grzegorz.galkowski@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com> Tested-by: Min Cao <min.cao@intel.com>	2015-03-02 07:35:26 +01:00
Tetsuya Mukawa	e34550c8b9	null: fix build with gcc-4.7 This patch fixes following errors with gcc-4.7. lib/librte_pmd_null/rte_eth_null.c:302:28: error: array subscript is above array bounds Reported-by: John McNamara <john.mcnamara@intel.com> Reported-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: John McNamara <john.mcnamara@intel.com>	2015-02-28 00:22:00 +01:00
Tetsuya Mukawa	2929553854	null: fix build with icc This patch fixes following errors with icc. rte_eth_null.c(47): error #83: type qualifier specified more than once Reported-by: John McNamara <john.mcnamara@intel.com> Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: John McNamara <john.mcnamara@intel.com>	2015-02-28 00:19:46 +01:00
Tetsuya Mukawa	911920812f	ethdev: fix build with icc This patch fixes following errors with icc. error #188: enumerated type mixed with another type return -1; Fixes: `92d94d3744` ("ethdev: attach or detach port") Reported-by: John McNamara <john.mcnamara@intel.com> Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: John McNamara <john.mcnamara@intel.com>	2015-02-28 00:14:51 +01:00
Changchun Ouyang	a8e3219a92	virtio: fix build on freebsd This patch fixes the compilation issue on freebsd: lib/librte_pmd_virtio/virtio_ethdev.c: In function 'virtio_resource_init': lib/librte_pmd_virtio/virtio_ethdev.c:1071:56: error: unused parameter 'pci_dev' [-Werror=unused-parameter] Fixes: `da978dfdc4` ("virtio: use port IO to get PCI resource") Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Cunming Liang <cunming.liang@intel.com>	2015-02-28 00:12:37 +01:00
Thomas Monjalon	00c685634b	mlx4: fix build There was a missing change due by hotplug integration. Fixes: `9f1653e7b7` ("ethdev: add device type") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-26 18:02:24 +01:00
Thomas Monjalon	ae438314b1	mlx4: remove old version compatibility No need to check DPDK version. It just has to work on HEAD. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-26 18:02:17 +01:00
Thomas Monjalon	7609e66093	ethdev: fix build without hotplug After setting CONFIG_RTE_LIBRTE_EAL_HOTPLUG=n, GCC stop compiling: rte_ethdev.c:430:1: error: ‘rte_eth_dev_get_device_type’ defined but not used rte_ethdev.c:438:1: error: ‘rte_eth_dev_save’ defined but not used rte_ethdev.c:450:1: error: ‘rte_eth_dev_get_changed_port’ defined but not used rte_ethdev.c:464:1: error: ‘rte_eth_dev_get_addr_by_port’ defined but not used rte_ethdev.c:481:1: error: ‘rte_eth_dev_get_name_by_port’ defined but not used rte_ethdev.c:503:1: error: ‘rte_eth_dev_is_detachable’ defined but not used The hotplug option allows to build in environment (BSD) not yet supported by this new feature. It should be removed when BSD will be supported. Waiting this day, let's fix build with hotplug disabled. Reported-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-26 11:55:10 +01:00
Thomas Monjalon	b67578ccdf	version: 2.0.0-rc1 Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-26 00:41:57 +01:00
Tetsuya Mukawa	78d9a89884	null: support port hotplug This patch adds port hotplug support to Null PMD. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2015-02-26 00:32:31 +01:00
Tetsuya Mukawa	c743e50c47	null: new poll mode driver Null PMD is a driver of the virtual device particularly designed to measure performance of DPDK PMDs. When an application call rx, Null PMD just allocates mbufs and returns those. Also tx, the PMD just frees mbufs. The PMD has following options. - size: specify packe size allocated by RX. Default packet size is 64. - copy: specify 1 or 0 to enable or disable copy while RX and TX. Default value is 0(disabled). This option is used for emulating more realistic data transfer. Copy size is equal to packet size. To use the PMD, enable CONFIG_RTE_BUILD_SHARED_LIB in config file. Then compile the PMD as shared library. The library can be linked using '-d' option when an application invokes. Here is an example. $ sudo ./testpmd -c f -n 4 -d librte_pmd_null.so \ --vdev 'eth_null0' --vdev 'eth_null1' -- -i --no-flush-rx If testpmd is compiled with CONFIG_RTE_BUILD_SHARED_LIB, it may need to specify more libraries using '-d' option. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2015-02-26 00:31:45 +01:00
Tetsuya Mukawa	c956caa6ea	pcap: support port hotplug This patch adds finalization code to free resources allocated by the PMD. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2015-02-26 00:19:16 +01:00
Tetsuya Mukawa	92d94d3744	ethdev: attach or detach port These functions are used for attaching or detaching a port. When rte_eth_dev_attach() is called, the function tries to realize the device name as pci address. If this is done successfully, rte_eth_dev_attach() will attach physical device port. If not, attaches virtual devive port. When rte_eth_dev_detach() is called, the function gets the device type of this port to know whether the port is come from physical or virtual. And then specific detaching function will be called. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-26 00:08:25 +01:00
Tetsuya Mukawa	9f1653e7b7	ethdev: add device type This new parameter is needed to keep device type like PCI or virtual. Port detaching processes are different between PCI device and virtual device. RTE_ETH_DEV_PCI indicates device type is PCI. RTE_ETH_DEV_VIRTUAL indicates device is virtual. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-26 00:08:25 +01:00
Tetsuya Mukawa	6c6829475b	ethdev: close device The patch adds function pointer to rte_pci_driver and eth_driver structure. These function pointers are used when ports are detached. Also, the patch adds rte_eth_dev_uninit(). So far, it's not called by anywhere, but it will be called when port hotplug function is implemented. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-26 00:08:25 +01:00
Tetsuya Mukawa	36ec8585b2	ethdev: release port This patch adds rte_eth_dev_release_port(). The function is used for changing an attached status of the device that has specified name. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-26 00:08:25 +01:00
Tetsuya Mukawa	c282abd2a6	ethdev: remove assumption that port will not be detached To remove assumption, do like followings. This patch adds "RTE_PCI_DRV_DETACHABLE" to drv_flags of rte_pci_driver structure. The flags indicate the driver can detach devices at runtime. Also, remove assumption that port will not be detached. To remove the assumption. - Add 'attached' member to rte_eth_dev structure. This member is used for indicating the port is attached, or not. DEV_ATTACHED indicates a port is attached. DEV_DETACHED indicates a port is detached. - Add rte_eth_dev_allocate_new_port(). This function is used for allocating new port. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-26 00:08:25 +01:00
Tetsuya Mukawa	0fe11ec592	eal: add vdev init and uninit The patch adds following functions. - rte_eal_vdev_init(); - rte_eal_vdev_uninit(); - rte_eal_parse_devargs_str(). These functions are used for driver initialization and finalization. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-26 00:08:25 +01:00
Tetsuya Mukawa	dbe6b4b61b	pci: probe or close device - Add pci_close_all_drivers() The function tries to find a driver for the specified device, and then close the driver. - Add rte_eal_pci_probe_one() and rte_eal_pci_close_one() The functions are used for probe and close a device. First the function tries to find a device that has the specified PCI address. Then, probe or close the device. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-26 00:08:25 +01:00
Tetsuya Mukawa	b8be05722f	pci: unmap igb_uio resources The patch adds functions for unmapping igb_uio resources. The patch is only for Linux and igb_uio environment. VFIO and BSD are not supported. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-26 00:03:07 +01:00
Tetsuya Mukawa	c0ce0577e8	pci: consolidate address comparisons This patch replaces pci_addr_comparison() and memcmp() of pci addresses by rte_eal_compare_pci_addr(). To compare PCI addresses, rte_eal_compare_pci_addr() doesn't use memcmp(). This is because sizeof(struct rte_pci_addr) returns 6, but actually this structure is like below. struct rte_pci_addr { uint16_t domain; /*< Device domain / uint8_t bus; /*< Device bus / uint8_t devid; /*< Device ID / uint8_t function; /*< Device function. / }; If the structure is dynamically allocated in a function without bzero, last 1 byte may have value. As a result, memcmp may not work. To avoid such a case, rte_eal_compare_pci_addr() compare following values. dev_addr = (addr->domain << 24) \| (addr->bus << 16) \| (addr->devid << 8) \| addr->function; Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-25 23:38:00 +01:00
Michael Qiu	cc4fe5c45d	pci: select memory mapping from driver type With the driver type flag in struct rte_pci_dev, we do not need to always map uio devices with vfio related function when vfio enabled. Signed-off-by: Michael Qiu <michael.qiu@intel.com> Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2015-02-25 23:38:00 +01:00
Michael Qiu	d9a8cd9595	pci: add kernel driver type Currently, dpdk has no ability to know which type of driver( vfio-pci/igb_uio/uio_pci_generic) the device used. It only can check whether vfio is enabled or not statically. It really useful to have the flag, because different type need to handle differently in runtime. For example, pci memory map, pot hotplug, and so on. This patch add a flag field for pci device to solve above issue. Signed-off-by: Michael Qiu <michael.qiu@intel.com> Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2015-02-25 23:38:00 +01:00
Panu Matilainen	4dc41c7be6	ixgbe: fix build with gcc 5 gcc 5 supports a new logical-not-parentheses warning which ixgbe_common.c triggers, causing build failure with -Werror. Since this source must not be modified, silence the warning instead. Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-25 16:24:15 +01:00
Adrien Mazarguil	7fae69eeff	mlx4: new poll mode driver This PMD manages all variants of Mellanox ConnectX-3 (EN 40, EN 10, Pro EN 40) as well as their virtual functions in SR-IOV context through IB Verbs (libibverbs) and the dedicated user-space driver (libmlx4). It is disabled by default due to dependencies on these libraries and only supports Linux userland at the moment partly because /sys (sysfs) support is required. Also claim responsibility in the MAINTAINERS file. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com> Signed-off-by: Olga Shern <olgas@mellanox.com>	2015-02-25 16:07:57 +01:00
Zhihong Wang	9144d6bcde	eal/x86: optimize memcpy for SSE and AVX Main code changes: 1. Differentiate architectural features based on CPU flags a. Implement separated move functions for SSE/AVX/AVX2 to make full utilization of cache bandwidth b. Implement separated copy flow specifically optimized for target architecture 2. Rewrite the memcpy function "rte_memcpy" a. Add store aligning b. Add load aligning based on architectural features c. Put block copy loop into inline move functions for better control of instruction order d. Eliminate unnecessary MOVs 3. Rewrite the inline move functions a. Add move functions for unaligned load cases b. Change instruction order in copy loops for better pipeline utilization c. Use intrinsics instead of assembly code 4. Remove slow glibc call for constant copies Test report: http://dpdk.org/ml/archives/dev/2015-January/011848.html Signed-off-by: Zhihong Wang <zhihong.wang@intel.com> Tested-by: Jingguo Fu <jingguox.fu@intel.com> Reviewed-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-25 11:50:53 +01:00
Robert Sanford	7085f6c738	timer: fix reset return value - API rte_timer_reset() should return -1 when the timer is in the RUNNING or CONFIG state. Instead, it ignores the return value of internal function __rte_timer_reset() and always returns 0. We change rte_timer_reset() to return the value returned by __rte_timer_reset(). - Enhance timer stress test 2 to report how many timer reset collisions occur, i.e., how many times rte_timer_reset() fails due to a timer being in the CONFIG state. Signed-off-by: Robert Sanford <rsanford2@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-02-25 10:43:27 +01:00
Robert Sanford	bf2fef39b5	timer: pause in reset sync In rte_timer_reset_sync(), insert rte_pause() into loop that waits for rte_timer_reset() to succeed. Signed-off-by: Robert Sanford <rsanford2@gmail.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-02-25 10:39:42 +01:00
Cunming Liang	9bacd17535	eal: fix missing symbol in version map As per_lcore__socket_id and rte_sys_gettid are missing in version map, it causes compiling error when CONFIG_RTE_BUILD_SHARED_LIB is enabled. Fixes: `ef76436c68` ("eal: get unique thread id") Fixes: `9e29251b2a` ("eal: thread affinity API") Reported-by: Tetsuya Mukawa <mukawa@igel.co.jp> Signed-off-by: Cunming Liang <cunming.liang@intel.com> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: John McNamara <john.mcnamara@intel.com>	2015-02-25 09:59:41 +01:00
Bruce Richardson	54991196e8	eal/linux: remove unnecessary check for primary instance In pci_uio_map_resource we check that we are in a primary process before calling pci_uio_set_bus_master. However, there is already an earlier check which means that we are always in a primary instance at this point in the code, so the check can be removed. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-02-24 22:34:06 +01:00
Bruce Richardson	0fbbb63fbf	eal/linux: populate uio maps from pci resources array Rather than scanning the resource file in sysfs a second time, we can pull the information on physical addresses of BARs from the pci resource information already present in the dev structure. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-02-24 22:31:25 +01:00
Bruce Richardson	9e67561acd	eal/linux: mmap uio resources using resourceX files Instead of distinguishing the BAR mappings via offset within a single file, originally /dev/uioX, switch to mapping each individual bar via the appropriately numbered resourceX file. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-02-24 22:29:43 +01:00
Pawel Wodkowski	597b0f74e2	jobstats: new library This library provide API to measure time spend in particular parts of code and to calculate optimal polling time. To calculate a those statistics application code need to be divided into parts (called jobs) that do something. It is up to application to decide what is considered a job. Series of jobs must be surrounded with the rte_jobstats_context_start() and rte_jobstats_context_finish() calls. After that, jobs might be started. Each job must be surrounded with rte_jobstats_start() and rte_jobstats_finish() calls. After job finishes its execution, period in which it should be called again is adjusted. It might be used to minimize time wasted on unnecessary polls/calls. Adjustment is based on data provided by job itself (ex: number of packets it processed). After all jobs in serie are executed fallowing statistics are updated and might be used by application. Statistics can be reset. Some of provided statistic data: - total/min/max execution - time spent in executing jobs. - total/min/max management - time spent outside execution area. This value might be used to measure overhead of scheduling jobs. This time also contains overhead of rte_jobstats library itself. - number of loops that executed at least one job - executed jobs - time when statistics were reset. Each job provide total/min/max execution time and execution count statistics. Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-24 22:12:35 +01:00
David Marchand	23df14d1ba	devargs: restore empty devargs Following commit `c07691ae10`, an implicit change has been done in the devargs API. This triggers problem in virtual pmds that did not check for parameters validity as it was implicitely valid. Fix this by restoring the empty argument as "" and add a note in the api. Restore associated tests. Fixes: `c07691ae10` ("devargs: remove limit on parameters length") Reported-by: Tetsuya Mukawa <mukawa@igel.co.jp> Signed-off-by: David Marchand <david.marchand@6wind.com> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 20:23:11 +01:00
Cunming Liang	4e01799aea	ring: add optional yield to avoid spin forever Add a sched_yield() syscall if the thread spins for too long, waiting other thread to finish its operations on the ring. That gives pre-empted thread a chance to proceed and finish with ring enqueue/dequeue operation. The purpose is to reduce contention on the ring. By ring_perf_test, it doesn't shows additional perf penalty. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:23:07 +01:00
Cunming Liang	b4bee5f66a	ring: support non-EAL thread ring debug stat won't take care non-EAL thread. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:23:02 +01:00
Cunming Liang	30e6399892	mempool: support non-EAL thread For non-EAL thread, bypass per lcore cache, directly use ring pool. It allows using rte_mempool in either EAL thread or any user pthread. As in non-EAL thread, it directly rely on rte_ring and it's none preemptive. It doesn't suggest to run multi-pthread/cpu which compete the rte_mempool. It will get bad performance and has critical risk if scheduling policy is RT. Haven't found significant performance decrease by mempool_perf_test. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:57 +01:00
Cunming Liang	6295e793aa	timer: support non-EAL thread Allow to setup timers only for EAL (lcore) threads (__lcore_id < MAX_LCORE_ID). E.g. – dynamically created thread will be able to reset/stop timer for lcore thread, but it will be not allowed to setup timer for itself or another non-lcore thread. rte_timer_manage() for non-lcore thread would simply do nothing and return straightway. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:52 +01:00
Cunming Liang	ca2e2dab07	spinlock: support non-EAL thread In non-EAL thread, lcore_id always be LCORE_ID_ANY. It can't be used as unique id for recursive spinlock. Then use rte_gettid() to replace it. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:48 +01:00
Cunming Liang	fd4a5ce87d	log: support non-EAL thread For those non-EAL thread, _lcore_id is invalid and probably larger than RTE_MAX_LCORE. The patch adds the check and allows only EAL thread using EAL per thread log level and log type. Others shares the global log level. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:43 +01:00
Cunming Liang	3e10f2368a	eal: initialize lcore and socket id Set _lcore_id and _socket_id to (-1) by default. For those non EAL thread, _lcore_id shall always be LCORE_ID_ANY. The libraries using _lcore_id as index need to take care. _socket_id always be SOCKET_ID_ANY until the thread changes the affinity by rte_thread_set_affinity(). Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:39 +01:00
Cunming Liang	b94580d688	malloc: avoid unknown socket id Add check for rte_socket_id(), avoid get unexpected return like (-1). By using rte_malloc_socket(), socket id is assigned by socket_arg. If socket_arg set to SOCKET_ID_ANY, it expects to use the socket id to which the current cores belongs. As the thread may affinity on a cpuset, the cores in the cpuset may belongs to different NUMA nodes. The value of _socket_id probably be SOCKET_ID_ANY(-1), the case is not expected in origin malloc_get_numa_socket(). Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:34 +01:00
Cunming Liang	8baacdd30e	eal: apply thread affinity by assigned cpuset EAL threads use assigned cpuset to set core affinity during startup. It keeps 1:1 mapping, if no '--lcores' option is used. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:29 +01:00
Cunming Liang	9e29251b2a	eal: thread affinity API 1. add two TLS _socket_id and _cpuset 2. add one internal API, eal_cpu_socket_id/eal_thread_dump_affinity 3. add two public API, rte_thread_set/get_affinity 4. update EAL version map for EAL public API The API works for both EAL thread and non EAL thread. When calling rte_thread_set_affinity, the _socket_id and _cpuset of calling thread will be updated if the thread successfully set the cpu affinity. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:24 +01:00
Cunming Liang	ef76436c68	eal: get unique thread id The rte_gettid() wraps the linux and freebsd syscall gettid(). It provides a persistent unique thread id for the calling thread. It will save the unique id in TLS on the first time. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:19 +01:00
Cunming Liang	f8e0f0163a	eal: get socket id from cpu id It defines eal_cpu_socket_id() which exposing the origin private cpu_socket_id(). The function is only used inside EAL. It returns socket_id of the specified cpu_id. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:15 +01:00
Cunming Liang	a9b1c67a2c	eal: fix strnlen return value with icc The problem is that strnlen() here may return invalid value with 32bit icc. (actually it returns it’s second parameter,e.g: sysconf(_SC_ARG_MAX)). It starts to manifest hwen max_len parameter is > 2M and using icc –m32 –O2 (or above). Suggested-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:22:08 +01:00
Cunming Liang	53e54bf817	eal: new option --lcores for cpu assignment It supports one new eal long option '--lcores' for EAL thread cpuset assignment. The format pattern: --lcores='<lcores[@cpus]>[<,lcores[@cpus]>...]' lcores, cpus could be a single digit/range or a group. '(' and ')' are necessary if it's a group. If not supply '@cpus', the value of cpus uses the same as lcores. e.g. '1,2@(5-7),(3-5)@(0,2),(0,6),7-8' means starting 9 EAL thread as below lcore 0 runs on cpuset 0x41 (cpu 0,6) lcore 1 runs on cpuset 0x2 (cpu 1) lcore 2 runs on cpuset 0xe0 (cpu 5,6,7) lcore 3,4,5 runs on cpuset 0x5 (cpu 0,2) lcore 6 runs on cpuset 0x41 (cpu 0,6) lcore 7 runs on cpuset 0x80 (cpu 7) lcore 8 runs on cpuset 0x100 (cpu 8) Test report: http://dpdk.org/ml/archives/dev/2015-February/013383.html Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Tested-by: Qun Wan <qun.wan@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:21:59 +01:00
Cunming Liang	798a71d703	eal: add cpuset into lcore config The patch adds 'cpuset' into per-lcore configure 'lcore_config[]', as the lcore no longer always 1:1 pinning with physical cpu. The lcore now stands for a EAL thread rather than a logical cpu. It doesn't change the default behavior of 1:1 mapping, but allows to affinity the EAL thread to multiple cpus. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:21:54 +01:00
Cunming Liang	0e2e511b38	enic: fix bsd namespace conflict Some macros already been defined by freebsd 'sys/param.h'. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:21:50 +01:00
Cunming Liang	ca31321cde	eal/bsd: fix namespace conflict Fix namespace with EAL prefix. Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:21:14 +01:00
Cunming Liang	d55b8f3a49	eal/bsd: standardize init sequence between linux and bsd Signed-off-by: Cunming Liang <cunming.liang@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-24 20:20:52 +01:00
Panu Matilainen	6eb85c0e44	mk: fix build with Debian/Ubuntu-specific gcc version Commit `71f0ab1849` broke compilation on some versions of Debian and Ubuntu where gcc has been modified to only emit MAJOR.MINOR part of the version from 'gcc -dumpversion'. Drop the micro-version from gcc version comparisons to work around this, it wasn't being used for anything anyway. Fixes: `71f0ab1849` ("mk: rework gcc version detection to permit versions newer than 4.x") Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-02-24 12:11:16 +01:00
Thomas Monjalon	a906cf28fd	eal: add help option Help is printed with -h or --help. Help is also printed for an unknown option. This was broken since the rework of options. Fixes: `489a9d6c9f` ("merge bsd and linux common options parsing") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-02-24 12:08:01 +01:00
Thomas Monjalon	97bf974ca2	eal: sort and align options lists Options listing in usage help was a mess. The main usage line is fixed and shorter. The options in usage output are logically sorted (cpu/mem/dev/proc), aligned and lightly reworded. The options in declarations are alphabetically sorted. Code in swith statement is not moved. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: David Marchand <david.marchand@6wind.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-02-24 11:57:33 +01:00
Stephen Hemminger	4dc01c1dd7	enic: change probe log message level Drivers should be silent on boot. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: David Marchand <david.marchand@6wind.com>	2015-02-24 03:57:32 +01:00
Stephen Hemminger	518b590803	enic: replace use of printf with log Device driver should log via DPDK log, not to printf which is sends to /dev/null in a daemon application. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: David Marchand <david.marchand@6wind.com> [Thomas: include rte_log.h] Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-24 03:56:44 +01:00
Panu Matilainen	71f0ab1849	mk: rework gcc version detection to permit versions newer than 4.x Separately comparing major and minor versions becomes seriously clumsy when with major version changes, convert the entire version string into a numeric value (ie 4.6.0 becomes 460 and 5.0.0 becomes 500) and use that for comparisons, eliminate unnecessary negations while at it. This makes the comparisons simpler, more obvious and makes gcc 5.0 naturally recognized at least as capable as newest 4.x. This three-digit scheme would run into trouble if gcc ever went to two-digit version segments, but that hasn't happened in the last 10+ years so it seems like a safe assumption. Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-24 03:47:29 +01:00
Keith Wiles	9903387f32	ixgbe: remove unused function causing error with clang Signed-off-by: Keith Wiles <keith.wiles@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-24 03:18:55 +01:00
Jeff Shaw	44a7fe6e1b	fm10k: fix clang warning flags This commit fixes the following error which was reported when compiling with clang by removing the option. error: unknown warning option '-Wno-unused-but-set-variable' Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Acked-by: Keith Wiles <keith.wiles@intel.com>	2015-02-24 03:13:32 +01:00
Jeff Shaw	e8f85d7c7d	fm10k: fix build with unused debug function This commit fixes the following error which was reported when compiling with clang by moving the function inside an RTE_LIBRTE_FM10K_DEBUG_RX ifdef block. error: unused function 'dump_rxd' Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Acked-by: Keith Wiles <keith.wiles@intel.com>	2015-02-24 03:09:57 +01:00
Sergio Gonzalez Monroy	e1545b393a	mbuf: fix a couple of doxygen comments Fix a couple of doxygen comments in mbuf structure: - seqn had no doxygen syntax. - usr was not generating proper link to function. Signed-off-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-24 03:00:31 +01:00
Stefan Puiu	4db87f9739	lib: fix C++11 compilation In C++11 concatenated string literals need to have a space in between. Found with clang++-3.4, IIRC g++-4.8 also complains about this. Sample error message: error: invalid suffix on literal; C++11 requires a space between literal and identifier [-Wreserved-user-defined-literal] Signed-off-by: Stefan Puiu <stefan.puiu@gmail.com> Reviewed-by: John McNamara <john.mcnamara@intel.com>	2015-02-24 02:46:52 +01:00
Hemant Agrawal	3e12a98fe3	kni: optimize Rx burst The current implementation of rte_kni_rx_burst polls the fifo for buffers. Irrespective of success or failure, it allocates the mbuf and try to put them into the alloc_q if the buffers are not added to alloc_q, it frees them. This waste lots of cpu cycles in allocating and freeing the buffers if alloc_q is full. The logic has been changed to: 1. Initially allocand add buffer(burstsize) to alloc_q 2. Add buffers to alloc_q only when you are pulling out the buffers. Signed-off-by: Hemant Agrawal <hemant@freescale.com> Reviewed-by: Jay Rolette <rolette@infiniteio.com>	2015-02-24 02:26:24 +01:00
Igor Ryzhov	e128e53879	lpm: fix overflow issue LPM table overflow may occur if table is full and added rule has the biggest depth that already have some rules. Signed-off-by: Igor Ryzhov <iryzhov@nfware.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-02-24 02:08:19 +01:00
Ildar Mustafin	dc783e74cf	pipeline: fix port meta for non-default entries Signed-off-by: Ildar Mustafin <imustafin@bk.ru> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2015-02-24 02:01:13 +01:00
Huawei Xie	dbfa62d63f	vhost: support dynamically registering server * support calling rte_vhost_driver_register after rte_vhost_driver_session_start * add mutext to protect fdset from concurrent access * add busy flag in fdentry. this flag is set before cb and cleared after cb is finished. mutex lock scenario in vhost: * event_dispatch(in rte_vhost_driver_session_start) runs in a separate thread, infinitely processing vhost messages through cb(callback). * event_dispatch acquires the lock, get the cb and its context, mark the busy flag, and releases the mutex. * vserver_new_vq_conn cb calls fdset_add, which acquires the mutex and add new fd into fdset. * vserver_message_handler cb frees data context, marks remove flag to request to delete connfd(connection fd) from fdset. * after cb returns, event_dispatch 1. clears busy flag. 2. if there is remove request, call fdset_del, which acquires mutex, checks busy flag, and removes connfd from fdset. * rte_vhost_driver_unregister(not implemented) runs in another thread, acquires the mutex, calls fdset_del to remove fd(listenerfd) from fdset. Then it could free data context. The above steps ensures fd data context isn't freed when cb is using. VM(s) should have been shutdown before rte_vhost_driver_unregister. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:17 +01:00
Huawei Xie	54292e9520	vhost: support ifname for vhost-user for vhost-cuse, ifname is the name of the tap device for vhost-user, ifname is the name of the unix domain socket path Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Przemyslaw Czesnowicz <przemyslaw.czesnowicz@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:16 +01:00
Huawei Xie	8f972312b8	vhost: support vhost-user In rte_vhost_driver_register(), vhost unix domain socket listener fd is created and added to polled(based on select) fdset. In rte_vhost_driver_session_start(), fds in the fdset are checked for processing. If there is new connection from qemu, connection fd accepted is added to polled fdset. The listener and connection fds in the fdset are then both checked. When there is message on the connection fd, its callback vserver_message_handler is called to process vhost-user messages. To support identifying which virtio is from which guest VM, we could call rte_vhost_driver_register with different socket path. Virtio devices from same VM will connect to VM specific socket. The socket path information is stored in the virtio_net structure. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Przemyslaw Czesnowicz <przemyslaw.czesnowicz@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:15 +01:00
Huawei Xie	fbf7e07ca1	vhost: add select based event driven processing for more generic event driven processing, refer to: http://libevent.org/ Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:14 +01:00
Huawei Xie	9464a44160	vhost: implement cuse memory table remove set_memory_table ops vhost-cuse or vhost-user will both implement their own set_memory_region handler. In current vhost-cuse implementation, guest numa memory isn't supported. Assume that guest memory is backed by only one file. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Przemyslaw Czesnowicz <przemyslaw.czesnowicz@intel.com>	2015-02-24 01:38:14 +01:00
Huawei Xie	c89d3e5afd	vhost: make host memory mapping more generic This functions accepts a virtual address and pid(qemu), and maps it into current process(vhost)'s address space. The memory behind the virtual address should be backed by a file, and virtual address should be the starting address. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:13 +01:00
Huawei Xie	6ca9df2812	vhost: copy host memory mapping to a new cuse file Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:12 +01:00
Huawei Xie	c2f60667bf	vhost: move fd copying into cuse subdirectory File descriptor is copied from qemu process into vhost process. vhost-user doesn't need eventfd kernel module to copy fds between processes. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Przemyslaw Czesnowicz <przemyslaw.czesnowicz@intel.com>	2015-02-24 01:38:11 +01:00
Huawei Xie	34f4c46dc4	vhost: rename header file Rename vhost-net-cdev.h to vhost-net.h. This file defines common operations provided by virtio-net(.c). Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:10 +01:00
Huawei Xie	6ee36ead58	vhost: move cuse related handling in a subdirectory Create vhost_cuse directory and move vhost-net-cdev.c into vhost_cuse. vhost-cuse driver will be divided into two parts: cuse driver specific message handling(in cuse directory) and common message handling(in virtio-net.c). vhost ioctl message is pre-processed in cuse and then sent to virtio-net if is not terminated. virtio-net.c provides common message handling for both vhost-cuse and vhost-user. Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:09 +01:00
Huawei Xie	04d696037a	vhost: enable virtio control channel Rx mode VIRTIO_NET_F_CTRL_RX is dependant on VIRTIO_NET_F_CTRL_VQ. Observed that virtio-net driver in guest would crash with only CTRL_RX enabled. In virtnet_send_command: /* Caller should know better */ BUG_ON(!virtio_has_feature(vi->vdev, VIRTIO_NET_F_CTRL_VQ) \|\| (out + in > VIRTNET_SEND_COMMAND_SG_MAX)); Signed-off-by: Huawei Xie <huawei.xie@intel.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp>	2015-02-24 01:38:07 +01:00
Bruce Richardson	4dc294158c	ethdev: support optional Rx and Tx callbacks Add optional support for inline processing of packets inside the RX or TX call. For an RX callback, what happens is that we get a set of packets from the NIC and then pass them to a callback function, if configured, to allow additional processing to be done on them, e.g. filling in more mbuf fields, before passing back to the application. On TX, the packets are similarly post-processed before being handed to the NIC for transmission. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: John McNamara <john.mcnamara@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-24 00:38:27 +01:00
Bruce Richardson	1a9a0b9b02	ethdev: rename interrupt callbacks field The 'callbacks' member of the rte_eth_dev structure has been renamed to 'link_intr_cbs' to make it clear that it refers to callbacks from NIC interrupts. This allows us to add other types of callbacks to the structure without ambiguity. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: John McNamara <john.mcnamara@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-24 00:38:14 +01:00
Jingjing Wu	2ccabd8cd1	i40e: enable internal switch of PF This patch enables PF's internal switch by setting ALLOWLOOPBACK flag when VEB is created. With this patch, traffic from PF can be switched on the VEB. Test report: http://www.dpdk.org/ml/archives/dev/2015-February/013237.html Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com> Tested-by: Min Cao <min.cao@intel.com>	2015-02-15 07:39:43 +01:00
Jingjing Wu	ac9491a96d	i40e: fix vsi configuration In i40e_vsi_config_tc_queue_mapping, should add a flag to indicate another valid setting by OR operation, but not set this flag to valid_sections, otherwise it will overwrite the flags set before. Test report: http://www.dpdk.org/ml/archives/dev/2015-February/013237.html Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com> Tested-by: Min Cao <min.cao@intel.com>	2015-02-15 07:39:12 +01:00
Helin Zhang	c9223a2bf5	i40e: workaround for XL710 performance On XL710, performance number is far from the expectation on recent firmware versions, if promiscuous mode is disabled, or promiscuous mode is enabled and port MAC address is equal to the packet destination MAC address. The fix for this issue may not be integrated in the following firmware version. So the workaround in software driver is needed. For XL710, it needs to modify the initial values of 3 internal only registers, which are the same as X710. Note that the values for X710 and XL710 registers could be different, and the workaround can be removed when it is fixed in firmware in the future. Test report: http://www.dpdk.org/ml/archives/dev/2015-February/012749.html Signed-off-by: Helin Zhang <helin.zhang@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com>	2015-02-15 07:32:44 +01:00
Dan Aloni	90a1633b23	eal/linux: allow to map BARs with MSI-X tables While VFIO doesn't allow us to map complete BARs with MSI-X tables, it does allow us to map around them in PAGE_SIZE granularity. There might be adapters that provide their registers in the same BAR but on a different page. For example, Intel's NVME adapter, though not a network adapter, provides only one MMIO BAR that contains the MSI-X table. Signed-off-by: Dan Aloni <dan@kernelim.com> Acked-by: Anatoly Burakov <anatoly.burakov@intel.com>	2015-02-23 21:57:31 +01:00
Sergio Gonzalez Monroy	4769bc5a27	mbuf: remove build option to disable refcnt This patch removes all references to RTE_MBUF_REFCNT, setting the refcnt field in the mbuf struct permanently. Signed-off-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-23 19:31:24 +01:00
Sergio Gonzalez Monroy	e8b9ef877e	mbuf: introduce indirect attached flag Currently for mbufs with refcnt, we cannot free mbufs with external memory buffers (ie. vhost zero copy), as they are recognized as indirect attached mbufs and therefore we free the direct mbuf it points to, resulting in an error in the case of external memory buffers. We solve the issue by introducing the IND_ATTACHED_MBUF flag, which indicates that the mbuf is an indirect attached mbuf pointing to another mbuf. When we free an mbuf, we only free the direct mbuf if the flag is set. Freeing an mbuf with external buffer is the same as freeing a non attached mbuf. The flag is set during attach and clear on detach. So in the case of vhost zero copy where we have mbufs with external buffers, by default we just free the mbuf and it is up to the user to deal with the external buffer. This patch would allow the removal of the RTE_MBUF_REFCNT config option, setting refcnt for all mbufs permanently. The patch also modifies the vhost example as it was using the RTE_MBUF_INDIRECT macro to detect if it was an mbuf with external buffer. Signed-off-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-23 19:27:06 +01:00
Marc Sune	16eaf25232	kni: add build option to disable preempting This patch introduces CONFIG_RTE_KNI_PREEMPT_DEFAULT flag. When set to 'no', KNI kernel thread(s) do not call schedule_timeout_interruptible(), which improves overall KNI performance at the expense of CPU cycles (polling). Default values is 'yes', maintaining the same behaviour as of now. Signed-off-by: Marc Sune <marc.sune@bisdn.de> Acked-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-23 18:49:19 +01:00
Yerden Zhumabekov	614289298d	hash: slice CRC data into 8-byte pieces Calculating hash for data of variable length is more efficient when that data is sliced into 8-byte pieces. The rest part of data is hashed using CRC32 functions with either 8 and 4 byte operands. Signed-off-by: Yerden Zhumabekov <e_zhumabekov@sts.kz> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-02-23 18:30:05 +01:00
Yerden Zhumabekov	8bae1da2af	hash: fallback to software CRC32 implementation Initially, SSE4.2 support is detected via the constructor function. Added rte_hash_crc_set_alg() function to detect and set CRC32 implementation if necessary. SSE4.2 is allowed by default. rte_hash_crc_*byte() functions reworked so they choose available CRC32 implementation in the runtime. Signed-off-by: Yerden Zhumabekov <e_zhumabekov@sts.kz> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-02-23 18:24:05 +01:00
Yerden Zhumabekov	d2b989045f	hash: add CRC function for 8 bytes SSE4.2 provides CRC32 intrinsic with 8-byte operand. Signed-off-by: Yerden Zhumabekov <e_zhumabekov@sts.kz> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-02-23 18:23:13 +01:00
Yerden Zhumabekov	e068294180	hash: replace built-in functions implementing SSE4.2 Give up using built-in intrinsics and use our own assembly implementation. Remove #include entry as well. Signed-off-by: Yerden Zhumabekov <e_zhumabekov@sts.kz> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-02-23 18:15:58 +01:00
Yerden Zhumabekov	00bf774bab	hash: add assembly implementation of CRC32 intrinsics Added: - crc32c_sse42_u32() emits 'crc32l' asm instruction; - crc32c_sse42_u64() emits 'crc32q' asm instruction; - crc32c_sse42_u64_mimic(), wrapper in case of run on 32-bit platform. Signed-off-by: Yerden Zhumabekov <e_zhumabekov@sts.kz> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-02-23 18:14:41 +01:00
Yerden Zhumabekov	d983cf4169	hash: add software CRC32 implementation Add lookup tables for CRC32 algorithm, crc32c_1word() and crc32c_2words() functions returning hash of 32-bit and 64-bit operand. Signed-off-by: Yerden Zhumabekov <e_zhumabekov@sts.kz> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2015-02-23 18:13:09 +01:00
Jijiang Liu	352378ede8	i40e: support NVGRE in Rx tunnel filtering The filter types supported are listed below for NVGRE packet: 1. Inner MAC and Inner VLAN ID. 2. Inner MAC address, inner VLAN ID and tenant ID. 3. Inner MAC and tenant ID. 4. Inner MAC address. 5. Outer MAC address, tenant ID and inner MAC address. Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Signed-off-by: Declan Doherty <declan.doherty@intel.com>	2015-02-23 16:29:58 +01:00
Jijiang Liu	09ba608848	ether: add transparent ethernet bridging type Add an Ethernet type definition for Transparent Ethernet Bridging. Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Signed-off-by: Declan Doherty <declan.doherty@intel.com>	2015-02-23 16:25:55 +01:00
Helin Zhang	b12964f621	ethdev: unification of RSS offload types RSS offload types were defined separately for 1/10G and 40G NICs, and have no relationship with flow types. The modifications are to unify all RSS offload types for all PMDs. Unified RSS offload types have new and common names which can be used for any PMD or applications, and decouple from specific hardwares. Signed-off-by: Helin Zhang <helin.zhang@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> [Thomas: merge with fm10k] Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-22 23:56:20 +01:00
Helin Zhang	7fa96d696f	ethdev: unification of flow types Flow types was defined actually for i40e hardware specifically, and wasn't able to be used for defining RSS offload types of all PMDs. It removed the enum flow types, and uses macros instead with new names. The new macros can be used for defining RSS offload types later. Also modifications are made in i40e and testpmd accordingly. Signed-off-by: Helin Zhang <helin.zhang@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> [Thomas: merge with new flow director API] Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-22 23:56:20 +01:00
Helin Zhang	ea4c89aeee	ethdev: fix size of flow type mask array It wrongly calculates the size of the flow type mask array. The fix is to align the flow type maximum index ID with the number of element bit width, and then divide the number of element bit width. Signed-off-by: Helin Zhang <helin.zhang@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 23:56:20 +01:00
Helin Zhang	38eef41d6c	ethdev: minor comment changes Added code style fixes. Signed-off-by: Helin Zhang <helin.zhang@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 23:56:20 +01:00
Helin Zhang	138c9c4023	i40e: remove some useless line breaks Added code style fixes. Signed-off-by: Helin Zhang <helin.zhang@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 23:56:20 +01:00
Thomas Monjalon	12cf4d6e0a	ethdev: remove old ethertype filter ABI The old ethertype filter API was removed in commit `75db20648`, but was still in (newly integrated) version map for ABI. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-22 04:06:16 +01:00
Jingjing Wu	083b5148e8	ethdev: remove old ntuple filter API Following structures are removed: - rte_2tuple_filter - rte_5tuple_filter Following APIs are removed: - rte_eth_dev_add_2tuple_filter - rte_eth_dev_remove_2tuple_filter - rte_eth_dev_get_2tuple_filter - rte_eth_dev_add_5tuple_filter - rte_eth_dev_remove_5tuple_filter - rte_eth_dev_get_5tuple_filter It also move macros TCP_*_FLAG to rte_eth_ctrl.h, and removes the macro TCP_UGR_FLAG which is duplicated with TCP_URG_FLAG. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> [Thomas: remove also from version map]	2015-02-22 04:05:36 +01:00
Jingjing Wu	4c54a7e7bd	ixgbe: migrate ntuple filter to new API This patch defines new functions dealing with ntuple filters which is corresponding to 5tuple in HW. It removes old functions which deal with 5tuple filters. Ntuple filter is dealt with through entrance ixgbe_dev_filter_ctrl. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-22 04:04:53 +01:00
Jingjing Wu	fdaa76971a	igb: migrate ntuple filter to new API This patch defines new functions dealing with ntuple filters which is corresponding to 2tuple filter for 82580 and i350 in HW, and to 5tuple filter for 82576 in HW. It removes old functions which deal with 2tuple and 5tuple filters in igb driver. Ntuple filter is dealt with through entrance eth_igb_filter_ctrl. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-22 04:04:53 +01:00
Jingjing Wu	5e4944a1d6	ethdev: new ntuple filter API This patch defines ntuple filter type RTE_ETH_FILTER_NTUPLE and its structure rte_eth_ntuple_filter. It also corrects the typo TCP_UGR_FLAG to TCP_URG_FLAG Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-22 04:04:53 +01:00
Jingjing Wu	222928b3dd	ethdev: remove old syn filter API Structure rte_syn_filter is removed. Following APIs are removed: - rte_eth_dev_add_syn_filter - rte_eth_dev_remove_syn_filter - rte_eth_dev_get_syn_filter Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> [Thomas: remove also from version map]	2015-02-22 03:09:05 +01:00
Jingjing Wu	afad6c82c6	ixgbe: migrate syn filter to new API This patch defines new functions dealing with syn filter. It removes old functions which deal with syn filter. Syn filter is dealt with through entrance ixgbe_dev_filter_ctrl. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 03:09:04 +01:00
Jingjing Wu	dbd7cfcfe1	igb: migrate syn filter to new API This patch defines new functions dealing with syn filter. It removes old functions of syn filter in igb driver. Syn filter is dealt with through entrance eth_igb_filter_ctrl. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 03:09:04 +01:00
Jingjing Wu	fd28727da2	ethdev: new syn filter API This patch defines syn filter type RTE_ETH_FILTER_SYN and its structure rte_eth_syn_filter. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 02:53:33 +01:00
Jingjing Wu	942524d605	ethdev: remove old flex filter API Structure rte_flex_filter is removed. Following APIs are removed: - rte_eth_dev_add_flex_filter - rte_eth_dev_remove_flex_filter - rte_eth_dev_get_flex_filter Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> [Thomas: remove also from version map]	2015-02-22 02:53:29 +01:00
Jingjing Wu	231d43909a	igb: migrate flex filter to new API This patch defines new functions dealing with flex filter. It removes old functions of flex filter in igb driver. Syn filter is dealt with through entrance eth_igb_filter_ctrl. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 02:53:24 +01:00
Jingjing Wu	2d72306de1	ethdev: new flex filter API This patch defines flex filter type RTE_ETH_FILTER_FLEXIBLE and its structure rte_eth_flex_filter. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com>	2015-02-22 02:48:31 +01:00
Jingjing Wu	81f38e676e	ixgbe: support flow director flush This patch implement RTE_ETH_FILTER_FLUSH operation to delete all flow director filters in ixgbe driver. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 01:34:53 +01:00
Jingjing Wu	fc84329941	ixgbe: migrate flow director info and statistic to new API This patch changes the get info operation to be implemented through filter_ctrl API and RTE_ETH_FILTER_INFO/RTE_ETH_FILTER_STATS ops. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 01:31:51 +01:00
Jingjing Wu	76c6f89e80	ixgbe: support new flow director masks This patch implement the mask configuration of flow director filter, by using the mask defined in rte_fdir_conf instead of callback function fdir_set_masks. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 01:29:05 +01:00
Jingjing Wu	2d4c1a9ea2	ethdev: add new flow director masks This patch defines structure rte_eth_fdir_masks. It extends rte_fdir_conf and rte_eth_fdir_info to contain mask's configuration. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 01:24:32 +01:00
Jingjing Wu	299191e0c9	ethdev: remove flexbytes offset from flow director This patch removes the flexbytes_offset from rte_fdir_conf, because the flexible payload setting is done by flex_conf instead of flexbytes_offset. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 01:14:31 +01:00
Jingjing Wu	d54a988826	ixgbe: support flexpayload configuration of flow director This patch implement the flexpayload configuration of flow director filter. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 01:14:21 +01:00
Jingjing Wu	1145eec608	ethdev: extend flow type and flexible payload type for flow director This patch adds RTE_ETH_FLOW_TYPE_RAW and RTE_ETH_RAW_PAYLOAD to support the flexible payload is started from the beginning of the packet. Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 00:59:11 +01:00
Jingjing Wu	f9072f8b90	ixgbe: migrate flow director filtering to new API This patch changes the add/delete/update operations to be implemented through filter_ctrl API and RTE_ETH_FILTER_ADD/RTE_ETH_FILTER_DELETE/RTE_ETH_FILTER_UPDATE ops. It also removes the callback functions: - ixgbe_eth_dev_ops.fdir_add_signature_filter - ixgbe_eth_dev_ops.fdir_update_signature_filter - ixgbe_eth_dev_ops.fdir_remove_signature_filter - ixgbe_eth_dev_ops.fdir_add_perfect_filter - ixgbe_eth_dev_ops.fdir_update_perfect_filter - ixgbe_eth_dev_ops.fdir_remove_perfect_filter Signed-off-by: Jingjing Wu <jingjing.wu@intel.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-22 00:58:03 +01:00
Danny Zhou	c112df6875	eal/linux: toggle interrupt for uio_pci_generic enable/disable interrupt by manipulating a control bit of command register on NIC's PCIe configuration space. Signed-off-by: Danny Zhou <danny.zhou@intel.com> Tested-by: Qun Wan <qun.wan@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-20 23:34:40 +01:00
Danny Zhou	4a499c6495	eal/linux: enable uio_pci_generic support Change the EAL PCI code so that it can work with both the uio_pci_generic in-tree driver, as well as the igb_uio DPDK-specific driver. This involves changes to 1) Modify method of retrieving BAR resource mapping information 2) Mapping using resource files in /sys rather than /dev/uio* 2) Setup bus master bit in NIC's PCIe configuration space for uio_pci_generic. Signed-off-by: Danny Zhou <danny.zhou@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-20 23:34:31 +01:00
Daniel Mrzyglod	200866003a	bond: rename mode 5 This patch modify mode older name from BONDING_MODE_ADAPTIVE_TRANSMIT_LOAD_BALANCING to BONDING_MODE_TLB This patch also changes order of TEST_ASSERT macro in test_tlb_verify_slave_link_status_change_failover. Signed-off-by: Daniel Mrzyglod <danielx.t.mrzyglod@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-20 23:07:02 +01:00
Michal Jastrzebski	457ecf2953	bond: add debug info for mode 6 This patch add some debug information when using link bonding mode 6. It prints basic information about ARP packets on RX and TX (MAC, ip, packet number, arp packet type). If CONFIG_RTE_LIBRTE_BOND_DEBUG_ALB == y. If CONFIG_RTE_LIBRTE_BOND_DEBUG_ALB_L1 is enabled instead of previous one, use show command to see IPv4 balancing from clients. Signed-off-by: Michal Jastrzebski <michalx.k.jastrzebski@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-20 23:07:01 +01:00
Maciej Gajdzica	06fe78b98c	bond: add mode 6 This mode includes adaptive TLB and receive load balancing (RLB). In RLB the bonding driver intercepts ARP replies send by local system and overwrites its source MAC address, so that different peers send data to the server on different slave interfaces. When local system sends ARP request, it saves IP information from it. When ARP reply from that peer is received, its MAC is stored, one of slave MACs assigned and ARP reply send to that peer. Signed-off-by: Maciej Gajdzica <maciejx.t.gajdzica@intel.com> Signed-off-by: Michal Jastrzebski <michalx.k.jastrzebski@intel.com> Signed-off-by: Daniel Mrzyglod <danielx.t.mrzyglod@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-20 22:17:52 +01:00
Maciej Gajdzica	31db4d38de	net: change arp header struct declaration Changed MAC address type from uint8_t[6] to struct ether_addr and IP address type from uint8_t[4] to uint32_t to make it consistent with other DPDK code using MAC and IP addresses. It allows us to use is_same_ether_addr and ether_addr_copy functions on MAC addresses in ARP header. Also removed union from arp_hdr struct to make calls to arp_data items shorter. Updated test-pmd to match new arp_hdr version. Signed-off-by: Maciej Gajdzica <maciejx.t.gajdzica@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com> [Thomas: doxygenize comments]	2015-02-20 22:10:52 +01:00
Ouyang Changchun	e033c005c2	virtio: remove Rx hotspots from early return Remove those hotspots which is unnecessary when early returning occurs. Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:19:21 +01:00
Ouyang Changchun	da978dfdc4	virtio: use port IO to get PCI resource Make virtio not require UIO for some security reasons, this is to match 6WIND's virtio-net-pmd. Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:19:20 +01:00
Stephen Hemminger	32c118fd00	virtio: free mbuf's with threshold This makes virtio driver work like ixgbe. Transmit buffers are held until a transmit threshold is reached. The previous behavior was to hold mbuf's until the ring entry was reused which caused more memory usage than needed. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:19:19 +01:00
Stephen Hemminger	5186fb1f37	virtio: set MAC address Need to have do special things to set default mac address. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:19:16 +01:00
Stephen Hemminger	1b306359e5	virtio: suport multiple MAC addresses Virtio support multiple MAC addresses. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:19:13 +01:00
Stephen Hemminger	9f4f2846ef	virtio: support vlan filtering Virtio supports vlan filtering. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:19:10 +01:00
Stephen Hemminger	a85786dc81	virtio: fix states handling during initialization Change order of initialization to match Linux kernel. Don't blow away control queue by doing reset when stopped. Calling dev_stop then dev_start would not work. Dev_stop was calling virtio reset and that would clear all queues and clear all feature negotiation. Resolved by only doing reset on device removal. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:19:05 +01:00
Stephen Hemminger	fb9d170496	virtio: move allocation before initialization If allocation fails, don't want to leave virtio device stuck in middle of initialization sequence. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:57 +01:00
Ouyang Changchun	894bb9cb0e	virtio: use soft vlan strip with mergeable Rx packets To keep the consistent logic with normal Rx path, the mergeable Rx path also needs software vlan strip/decap if it is enabled. Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:56 +01:00
Stephen Hemminger	1d5ced9176	virtio: use soft vlan encap/decap Implement VLAN stripping in software. This allows application to be device independent. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:52 +01:00
Stephen Hemminger	c974021a59	ether: add soft vlan encap/decap It is helpful to allow device drivers that don't support hardware VLAN stripping to emulate this in software. This allows application to be device independent. Avoid discarding shared mbufs. Make a copy in rte_vlan_insert() of any packet to be tagged that has a reference count > 1. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:47 +01:00
Stephen Hemminger	9ffd50826f	virtio: remove redundant alignment field Since vq_alignment is constant (always 4K), it does not need to be part of the vring struct. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:44 +01:00
Stephen Hemminger	c7bca33e42	virtio: remove unnecessary adapter structure Cleanup virtio code by eliminating unnecessary nesting of virtio hardware structure inside adapter structure. Also allows removing unneeded macro, making code clearer. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:42 +01:00
Stephen Hemminger	8d6d7e5cb3	virtio: support link state interrupt Virtio has link state interrupt which can be used. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:39 +01:00
Stephen Hemminger	069739780b	virtio: allow starting with link down Starting driver with link down should be ok, it is with every other driver. So just allow it. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:35 +01:00
Ouyang Changchun	8c09c20fb4	virtio: fix update of vring descriptor index Updating the vring descriptor index should be done before notifying host; Remove 2 duplicated store memory barriers in both Rx and Tx path because there is store memory barrier in vq_update_avail_idx function; Notify the host only if packets actually transmitted(nb_tx > 0). Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:34 +01:00
Stephen Hemminger	faf79b7854	virtio: use weaker barriers The DPDK driver only has to deal with the case of running on PCI and with SMP. In this case, the code can use the weaker barriers instead of using hard (fence) barriers. This will help performance. The rationale is explained in Linux kernel virtio_ring.h. To make it clearer that this is a virtio thing and not some generic barrier, prefix the barrier calls with virtio_. Add missing (and needed) barrier between updating ring data structure and notifying host. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:26 +01:00
Stephen Hemminger	dec08c28c0	virtio: check packet headroom at compile time Better to check at compile time than fail at runtime. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:22 +01:00
Stephen Hemminger	a8c0f835a7	virtio: make a function local Make vtpci_get_status a local function as it is used in one file. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:18:09 +01:00
Stephen Hemminger	41d7fcf5e3	virtio: rearrange resource initialization For clarity make the setup of PCI resources for Linux into a function rather than block of code #ifdef'd in middle of dev_init. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-02-20 19:17:45 +01:00
Panu Matilainen	3c21004855	i40e: fix build with gcc 5 Eliminate ambiguity in the condition which trips up a "logical not is only applied to the left..." warning from gcc 5, causing build failure with -Werror. Besides non-ambiguous, the condition is far more obvious this way. Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2015-02-20 15:10:09 +01:00
Thomas Monjalon	485e82c3f1	ixgbe: forbid building vpmd without Rx bulk alloc CONFIG_RTE_LIBRTE_IXGBE_RX_ALLOW_BULK_ALLOC is a prerequisite of CONFIG_RTE_IXGBE_INC_VECTOR. Reported-by: Alexander Belyakov <abelyako@gmail.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-20 12:04:05 +01:00
Bruce Richardson	51d4554896	ixgbe: fix vector receive of chained mbuf When the vector pmd was receiving a mix of packets of various sizes, some of which were split across multiple mbufs, there was an issue with reassembly of the jumbo frames. This was due to a skipped increment when using "continue" in a while loop. Changing the loop to a "for" loop fixes this problem, by ensuring the increment is always performed. Reported-by: Prashant Upadhyaya <praupadhyaya@gmail.com> Reported-by: Martin Weiser <martin.weiser@allegro-packets.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Tested-by: Martin Weiser <martin.weiser@allegro-packets.com>	2015-02-20 11:58:07 +01:00
Xuelin Shi	3658eb95fe	ixgbe: fix big endian access ixgbe is little endian, but cpu maybe not. add necessary conversions. rte_cpu_to_le_32(...) for PCI write rte_le_to_cpu_32(...) for PCI read. Signed-off-by: Xuelin Shi <xuelin.shi@freescale.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-20 11:52:29 +01:00
Xuelin Shi	89626cb570	e1000: fix big endian access e1000 is little endian, but cpu maybe not. add necessary conversions. rte_cpu_to_le_32(...) for PCI write rte_le_to_cpu_32(...) for PCI read. Signed-off-by: Xuelin Shi <xuelin.shi@freescale.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-20 11:44:52 +01:00
Sergio Gonzalez Monroy	a8c50a37b3	eal/linux: fix fscanf format mismatch Variables are unsigned int but format scans for signed int. Signed-off-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-20 11:08:29 +01:00
Sergio Gonzalez Monroy	90f1adc3bd	eal: fix dynamic link with virtio Building shared libraries and using virtio PMD results in undefined reference to 'rte_eal_iopl_init'. Add missing function to eal version map. Signed-off-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-20 11:04:52 +01:00
Olivier Matz	ac09cd9732	cmdline: fix check in port list parsing The argument ressize contains the size of the result buffer which should be large enough to store the parsed result of a token. In this case, it should be larger or equal to sizeof(cmdline_portlist_t) (4 bytes), not PORTLIST_TOKEN_SIZE which is the max size of the token string. This is not a critical, it fixes cases where the total length of the parsed instruction is greater than the maximum. Signed-off-by: Olivier Matz <olivier.matz@6wind.com>	2015-02-20 10:50:16 +01:00
Michal Jastrzebski	960d1081c0	bond: change warning Remove function name from warning. Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-18 19:07:34 +01:00
Tomasz Kulasek	6107d0f738	bond: fix symbol export rte_eth_bond_8023ad_setup and rte_eth_bond_8023ad_conf_get entries need to be exported to be used by test application for shared libraries compilation. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-18 18:58:55 +01:00
Tomasz Kulasek	f111e60427	ring: fix MAC per device Initialization procedure fix to allow per device MAC configuration. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-18 18:58:55 +01:00
Tomasz Kulasek	bc6e95d597	ring: add MAC address add/remove Added MAC addr add/remove dummy callbacks. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-18 18:58:55 +01:00
Tomasz Kulasek	7ab04035e6	ring: add link up/down Link up and down implementation. Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-18 18:58:55 +01:00
Declan Doherty	0e0a4018a7	bond: fix memory leak on kvargs processing failure identified by klockwork scan Signed-off-by: Declan Doherty <declan.doherty@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2015-02-18 18:12:58 +01:00
Jia Yu	a720aadd44	ethdev: return status of stats read rte_eth_stats_get is to get physical device statistics. Without return status, caller does not know whether function fails or not (i.e. invalid port_id, driver has no stats_get callback). Caller cannot differiente normal 0 stats or error case. This fix adds a return status to the function. Signed-off-by: Jia Yu <jyu@vmware.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-18 17:50:15 +01:00
Jia Yu	99610782ed	ethdev: update link status after start Since LSR interrupt is disabled by pmd drivers, link status in rte_eth_device is always down. Bond slave_configure() enables LSR interrupt on devices to get notification if link status changes. However, the LSC interrupt at device start time is still lost. In this fix, call link_update to read link status from hardware register at device start time. Signed-off-by: Jia Yu <jyu@vmware.com> Acked-by: Helin Zhang <helin.zhang@intel.com>	2015-02-18 17:50:15 +01:00
Sergio Gonzalez Monroy	b70b56032b	reorder: new library This library provides reordering capability for out of order mbufs based on a sequence number in the mbuf structure. Signed-off-by: Reshma Pattan <reshma.pattan@intel.com> Signed-off-by: Richardson Bruce <bruce.richardson@intel.com> Signed-off-by: Sergio Gonzalez Monroy <sergio.gonzalez.monroy@intel.com> Acked-by: Neil Horman <nhorman@tuxdriver.com> Acked-by: Declan Doherty <declan.doherty@intel.com>	2015-02-18 16:52:05 +01:00
David Marchand	c07691ae10	devargs: remove limit on parameters length As far as I know, there is no reason why we should have a limit on the length of parameters that can be given for a device. Remove this limit by using dynamic allocations. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-18 12:16:38 +01:00
David Marchand	0215a4c61f	devargs: indent and cleanup Prepare for next commit. Fix some indent issues, refactor error code. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2015-02-18 12:16:20 +01:00
Jeff Shaw	4c287332c3	fm10k: add PF and VF interrupt handling 1. Add functions to enable PF/VF interrupt. 2. Add function to process error message passed from interrupt. 2. Add 2 interrupt handling functions, one for PF and one for VF. 2. Enable interrupt after completing initialization of NIC. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	c0eb314107	fm10k: add VF support fm10k pmd driver will support both PF and VF device with single copy of code. The reason is NIC maps registers with same function in PF and VF to same PCI I/O address. Then, PF/VF drivers use same address to access registers belonging to it, HW will translate the request to correct units. For some functionalities that are unique to PF, driver will check current driver type and behave correctly. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	fbf3e0f911	fm10k: add vlan filter Add fm10k_vlan_filter_set to set vlan. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	c82dd0a7bf	fm10k: add scatter receive 1. Add fm10k_recv_scattered_pkts function to receive jumbo frame and multi-segment packets. 2. Configure correct receive function in rx_init and dev_init. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	57033cdf8f	fm10k: add PF RSS 1. Configure RSS in fm10k_dev_rx_init function. 2. Add fm10k_rss_hash_update and fm10k_rss_hash_conf_get to get and inquery RSS configuration. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	4b61d3bfa9	fm10k: add receive and tranmit 1. Add fm10k_recv_pkts and fm10k_xmit_pkts functions. 2. Link app function pointer to actual fm10k recv/xmit functions. 3. Change Makefile to compile new file fm10k_rxtx.c Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	9ae6068c86	fm10k: add dev start/stop 1. Add function to initialize RX queues. 2. Add function to initialize TX queues. 3. Add fm10k_dev_start, fm10k_dev_stop and fm10k_dev_close functions. 4. Add function to close mailbox service. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	387bf3006c	fm10k: add Rx/Tx single queue start/stop 1. Add 4 functions fm10k_dev_rx_queue_start, fm10k_dev_rx_queue_stop, fm10k_dev_tx_queue_start, and fm10k_dev_tx_queue_stop. 2. verify Rx packet buffer alignment is valid. Hardware requires specific alignment for Rx packet buffers. At least one of the following two conditions must be satisfied. 1) Address is 512B aligned 2) Address is 8B aligned and buffer does not cross 4K boundary. Alignment is checked by the driver when the Rx queue is reset. It is assumed that if an entire descriptor ring can be filled with buffers containing valid alignment, then all buffers in that mempool have valid address alignment. It is the responsibility of the user to ensure all buffers have valid alignment, as it is the user who creates the mempool. It is assumed the buffer needs only to store a maximum size Ethernet frame. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	98068e0e04	fm10k: add Tx queue setup/release Add fm10k_tx_queue_setup and fm10k_tx_queue_release functions. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	6cfe8969c9	fm10k: add Rx queue setup/release Add fm10k_rx_queue_setup and fm10k_rx_queue_release functions. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	7e72cbfa32	fm10k: add reta query/update 1. Add fm10k_reta_update and fm10k_reta_query functions. 2. Add fm10k_link_update and fm10k_dev_infos_get functions. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	a6061d9e70	fm10k: register PF driver 1. Add init function to scan and initialize fm10k PF device. 2. Add implementation to register fm10k pmd PF driver. 3. Add 3 functions fm10k_dev_configure, fm10k_stats_get and fm10k_stats_get. 4. Add fm10k.h to define macros and basic data structure. 5. Add fm10k_logs.h to control log message output. 6. Change config/common_bsdapp and config/common_linuxapp, add macros to control fm10k pmd driver compile for linux and bsd. 7. Add Makefile. 8. Change lib/Makefile to add fm10k driver into compile list. 9. Change mk/rte.app.mk to add fm10k lib into link. 10. Add ABI version of librte_pmd_fm10k Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com> Signed-off-by: Michael Qiu <michael.qiu@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	fe92cff407	eal: add fm10k device id Add fm10k device ID list into rte_pci_dev_ids.h. Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Jeff Shaw	7223d200c2	fm10k: add base driver Base driver is developed and maintained by Intel ND team, includes basic functional service to Intel Ethernet Switch FM10000 Series of silicons. Any suggestion on bug fix and improvement within this directory is welcome, but need this team to change and update. Signed-off-by: Chen Jing D(Mark) <jing.d.chen@intel.com>	2015-02-17 15:25:30 +01:00
Olivier Matz	3e11f931b8	i40e: add debug logs for Tx context descriptors This could be useful to have this values for debug purposes. Suggested-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com>	2015-02-16 19:21:18 +01:00
Olivier Matz	0c0c43705c	i40e: fix offloading of outer checksum for IPIP tunnel When offloading the checksums of ipip tunnels, m->l2_len is set to 0 as there is no tunnel or inner l2 header. Since this is a valid value remove the test. By the way, also remove the same test with l3_len because at this point, it is expected that the software provides proper values in the mbuf. It should avoid a test in dataplane processing and therefore slightly increase performance. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com>	2015-02-16 19:21:18 +01:00
Jijiang Liu	2af384931c	i40e: advertise outer IPv4 checksum capability Advertise the DEV_TX_OFFLOAD_OUTER_IPV4_CKSUM flag in the PMD features. It means that the i40e PMD supports the offload of outer IP checksum when transmitting tunneling packet. Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-02-16 19:21:18 +01:00
Jijiang Liu	bb58239767	ethdev: add outer IP checksum capability flag If the flag is advertised by a PMD, the NIC supports the outer IP checksum TX offload of tunneling packets, therefore an application can set the PKT_TX_OUTER_IP_CKSUM flag in mbufs when transmitting on this port. Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2015-02-16 19:21:18 +01:00
Olivier Matz	c9433176b3	mbuf: remove UDP tunnel flag Since previous commit, the flag PKT_TX_UDP_TUNNEL_PKT is not used by any PMD, remove it from mbuf API and from csumonly (testpmd). In csumonly, the PKT_TX_OUTER_IP_CKSUM flag is already set for vxlan checksum, providing enough information to the underlying driver. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com>	2015-02-16 19:21:17 +01:00
Olivier Matz	59cfa76a22	i40e: remove the use of UDP tunnel flag The definition of the flag PKT_TX_UDP_TUNNEL_PKT in rte_mbuf.h was: TX packet is an UDP tunneled packet. It must be specified when using outer checksum offload (PKT_TX_OUTER_IP_CKSUM) This flag was used to tell the NIC that the offload type is UDP (I40E_TXD_CTX_UDP_TUNNELING flag). In the datasheet, it says it's required to specify the tunnel type in the register. However, some tests (see [1]) showed that it also works without this flag. Moreover, it is not explained how the hardware use this information. From a network perspective, this information is useless for calculating the outer IP checksum as it does not depend on the payload. Having this flag in the API would force the application to specify the tunnel type for something that looks only useful for this PMD. It will limit the number of possible tunnel types (we would need a flag for each tunnel type) and therefore prevent to support outer IP checksum for proprietary tunnels. Finally, if a hardware advertises "I support outer IP checksum", it must be supported for any payload types. This has been validated by [2], knowing that the ipip test case was fixed after this test report [3]. [1] http://dpdk.org/ml/archives/dev/2015-January/011380.html [2] http://dpdk.org/ml/archives/dev/2015-January/011475.html [3] http://dpdk.org/ml/archives/dev/2015-January/011610.html Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com>	2015-02-16 19:21:17 +01:00
Olivier Matz	e3f0151fc6	i40e: enable Tx checksum only for offloaded packets From i40e datasheet: The IP header type and its offload. In case of tunneling, the IIPT relates to the inner IP header. See also EIPT field for the outer (External) IP header offload. 00 - non IP packet or packet type is not defined by software 01 - IPv6 packet 10 - IPv4 packet with no IP checksum offload 11 - IPv4 packet with IP checksum offload Therefore it is not needed to fill the IIPT field if no offload is requested (we can keep the value to 00). For instance, the linux driver code does not set it when (skb->ip_summed != CHECKSUM_PARTIAL). We can do the same in the dpdk driver. The function i40e_txd_enable_checksum() that fills the offload registers can only be called for packets requiring an offload. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com>	2015-02-16 19:21:17 +01:00
Olivier Matz	609dd68ef1	mbuf: enhance the API documentation of offload flags Based on http://dpdk.org/ml/archives/dev/2015-January/011127.html Also adapt the csum forward engine code to the API. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com>	2015-02-16 19:21:17 +01:00
Olivier Matz	fac4c750b2	mbuf: remove flag alias for IP checksum The alias PKT_TX_IPV4_CSUM is only used in one place of i40e driver. Remove it and only keep the legacy flag PKT_TX_IP_CSUM. Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Acked-by: Jijiang Liu <jijiang.liu@intel.com>	2015-02-16 19:21:17 +01:00

... 3 4 5 6 7 ...

1624 Commits