numam-dpdk

Author	SHA1	Message	Date
Hyong Youb Kim	5af7af4d6b	net/enic: enable limited passthru flow action Some apps like VPP use PASSTHRU+MARK flow rules to offload packet matching to the NIC. Just like MARK+RSS used by OVS-DPDK and others, PASSTHRU+MARK is used to "mark and then receive normally". Recent VIC adapters support such flow rules, so enable PASSTHRU for this limited use case. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com>	2019-03-08 17:52:22 +01:00
Hyong Youb Kim	e9434f6f60	net/enic: enable limited RSS flow action Some apps like OVS-DPDK use MARK+RSS flow rules in order to offload packet matching to the NIC. The RSS action in such flow rules simply indicates "receive packet normally", not trying to override the port wide RSS. The action is included in the flow rules simply to terminate them, as MARK is not a fate-deciding action. And, the RSS action has a most basic config: default hash, level, types, null key, and identity queue mapping. Recent VIC adapters can support these "mark and receive" flow rules. So, enable support for RSS action for this limited use case. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2019-03-08 17:52:22 +01:00
Hyong Youb Kim	4d8e9aa483	net/enic: check for unsupported flow item types Currently a pattern with an unsupported item type causes segfault, because the flow handler is using the type as an array index without checking bounds. Add an explicit check for unsupported item types and avoid out-of-bound accesses. Fixes: `6ced137607` ("net/enic: flow API for NICs with advanced filters enabled") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2019-03-08 17:52:22 +01:00
Hyong Youb Kim	e7347a8aff	net/enic: allow flow mark ID 0 The driver currently accepts mark ID 0 but does not report it in matching packet's mbuf. For example, the following testpmd command succeeds. But, the mbuf of a matching IPv4 UDP packet does not have PKT_RX_FDIR_ID set. flow create 0 ingress pattern ... actions mark id 0 / queue index 0 / end The problem has to do with mapping mark IDs (32-bit) to NIC filter IDs. Filter ID is currently 16-bit, so values greater than 0xffff are rejected. The firmware reserves filter ID 0 for filters that do not mark (e.g. steer w/o mark). And, the driver reserves 0xffff for the flag action. This leaves 1...0xfffe for app use. It is possible to simply reject mark ID 0 as unsupported. But, 0 is commonly used (e.g. OVS-DPDK and VPP). So, when adding a filter, set filter ID = mark ID + 1 to support mark ID 0. The receive handler subtracts 1 from filter ID to get back the original mark ID. Fixes: `dfbd6a9cb5` ("net/enic: extend flow director support for 1300 series") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2019-03-08 17:52:22 +01:00
Hyong Youb Kim	3a1c3cd01b	net/enic: fix SCTP match for flow API The driver needs to explicitly set the protocol number (132) in the IP header pattern, as the current firmware filter API lacks "match SCTP packet" flag. Otherwise, the resulting NIC filter may lead to false positives (i.e. NIC reporting non-SCTP packets as SCTP packets). The flow director handler does the same (enic_clsf.c). Fixes: `6ced137607` ("net/enic: flow API for NICs with advanced filters enabled") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2019-03-08 17:52:22 +01:00
Hyong Youb Kim	4182ee7f02	net/enic: fix flow director SCTP matching The firmware filter API does not have flags indicating "match SCTP packet". Instead, the driver needs to explicitly add an IP match and set the protocol number (132 for SCTP) in the IP header. The existing code (copy_fltr_v2) has two bugs. 1. It sets the protocol number (132) in the match value, but not the mask. The mask remains 0, so the match becomes a wildcard match. The NIC ends up matching all protocol numbers (i.e. thinks non-SCTP packets are SCTP). 2. It modifies the input argument (rte_eth_fdir_input). The driver tracks filters using rte_hash_{add,del}_key(input). So, addding (RTE_ETH_FILTER_ADD) and deleting (RTE_ETH_FILTER_DELETE) must use the same input argument for the same filter. But, overwriting the protocol number while adding the filter breaks this assumption, and causes delete operation to fail. So, set the mask as well as protocol value. Do not modify the input argument, and use const in function signatures to make the intention clear. Also move a couple function declarations to enic_clsf.c from enic.h as they are strictly local. Fixes: `dfbd6a9cb5` ("net/enic: extend flow director support for 1300 series") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2019-03-08 17:52:22 +01:00
Hyong Youb Kim	0d0dcd64b8	net/enic: remove unused functions Remove unused functions. Specifically, vnic_set_rss_key() is obsolete. enic_{add,del}_vlan() has never been supported in the firmware. And, remove vnic_rss.c altogether as it becomes empty. These were discovered by cppcheck. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2019-03-08 17:52:22 +01:00
Bruce Richardson	d23e141ffa	build: set RTE_ARCH_64 based on pointer size Rather than relying on the target machine architecture, use the size of a pointer from the compiler to determine if we are 64-bits or not. This allows correct behaviour when you pass -m32 as a compile option. It also allows us to use this value repeatedly throughout the repo rather than continually testing for the sizeof(void*). Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Tested-by: Luca Boccassi <bluca@debian.org> Acked-by: Luca Boccassi <bluca@debian.org>	2019-02-26 18:34:28 +01:00
Hyong Youb Kim	9bd48e2d30	net/enic: remove redundant log level check Fixes: `8d49699534` ("net/enic: support multicast filtering") Cc: stable@dpdk.org Suggested-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com>	2019-01-14 17:44:30 +01:00
Hyong Youb Kim	a1c40a3a3b	net/enic: remove useless include libgen.h is not used, so do not include it. Fixes: `fefed3d1e6` ("enic: new driver") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com>	2019-01-14 17:44:30 +01:00
Hyong Youb Kim	8d49699534	net/enic: support multicast filtering The VIC hardware has 64 MAC filters per vNIC, which can be either unicast or multicast. Use one half for unicast and the other half for multicast, as the VIC kernel drivers for Linux and Windows do. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-12-13 17:52:54 +00:00
Hyong Youb Kim	293430677e	net/enic: add handler to return firmware version Cisco VIC adapters run firmware. Add the fw_version_get handler to help diagnostics. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-12-13 17:52:04 +00:00
Hyong Youb Kim	7f34bb5202	net/enic: release port upon close Set RTE_ETH_DEV_CLOSE_REMOVE upon probe so rte_eth_dev_close() can later free port resources including mac_addrs. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-12-13 17:50:26 +00:00
Hyong Youb Kim	7ac790d63b	net/enic: fix size check in Tx prepare handler The current code wrongly assumes that packets are non-TSO and ends up rejecting large TSO packets. Check non-TSO and TSO max packet sizes separately. Fixes: `5a12c38740` ("net/enic: check maximum packet size in Tx prepare handler") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-11-14 00:35:53 +01:00
Hyong Youb Kim	eeef60b0eb	net/enic: use macro for attribute weak Fixes: `8a6ff33d6d` ("net/enic: add AVX2 based vectorized Rx handler") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-11-05 15:01:25 +01:00
Ferruh Yigit	9757358342	fix global variable issues Various fixes related to the global variable usage. Fixes: `43e610bb85` ("compress/octeontx: introduce octeontx zip PMD") Fixes: `c378f084d6` ("compress/octeontx: add device setup ops") Fixes: `b43ebc65aa` ("compress/octeontx: create private xform") Fixes: `b1ce8ebd97` ("eventdev: add PMD callbacks for eth Rx adapter") Fixes: `3810ae4357` ("eventdev: add interrupt driven queues to Rx adapter") Fixes: `fefed3d1e6` ("enic: new driver") Cc: stable@dpdk.org Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: Nikhil Rao <nikhil.rao@intel.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2018-10-29 02:34:27 +01:00
Hyong Youb Kim	59e657aa0b	net/enic: add missing Tx offload flags The following commit has added a number of existing offload flags such as PKT_TX_IPV4 and PKT_TX_IPV6 to PKT_TX_OFFLOAD_MASK defined in rte_mbuf.h. That change breaks the enic driver's Tx prepare handler. commit ef28cfa73822 ("mbuf: fix Tx offload mask") The enic driver keeps the supported offload flags in a local variable (tx_offload_mask), which is strictly a subset of PKT_TX_OFFLOAD_MASK. This variable is then used to compute the unsupported flags (tx_offload_notsup_mask), and the Tx prepare handler (tx_pkt_prepare) uses it to reject packets with unsupported offload flags. As is, tx_offload_notsup_mask ends up containing flags like PKT_TX_IPV4 that are actually supported by the driver, which then breaks any application that uses checksum offloads and calls the Tx prepare handler. So add the flags to tx_offload_mask that the driver supports but were missing in PKT_TX_OFFLOAD_MASK. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-26 22:14:05 +02:00
Hyong Youb Kim	432ed10d81	net/enic: fix supported packet types The handler for dev_supported_ptypes_get currently returns null when the vectorized Rx handler is used. It is also missing tunnel packet types. Add the missing packet types to the supported list, and return the right list for the vectorized Rx handler. Fixes: `8a6ff33d6d` ("net/enic: add AVX2 based vectorized Rx handler") Fixes: `93fb21fdbe` ("net/enic: enable overlay offload for VXLAN and GENEVE") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-26 22:14:05 +02:00
John Daley	1b4ce87dc5	net/enic: fix counter action - track whether counter DMAs are active or not so they can be stopped if needed before DMA memory is freed - fix counter DMA shut-down by changing vnic_dev_counter_dma_cfg() to take the number of counters to DMA instead of high counter index and use num counters = 0 to shut off DMAs - remove unnecessary checks that DMA counter memory is valid and that counter DMAs are in use - change the minimum DMA period to match what 1400 series adapter is capable of - fix comments and change a couple variable names to make more sense Fixes: `86df6c4e2f` ("net/enic: support flow counter action") Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-10-18 10:24:39 +02:00
Hyong Youb Kim	8a6ff33d6d	net/enic: add AVX2 based vectorized Rx handler Add the vectorized version of the no-scatter Rx handler. It aims to process 8 descriptors per loop using AVX2 SIMD instructions. This handler is in its own file enic_rxtx_vec_avx2.c, and makefile and meson.build are modified to compile it when the compiler supports AVX2. Under ideal conditions, the vectorized handler reduces cycles/packet by more than 30%, when compared against the no-scatter Rx handler. Most implementation ideas come from i40e's AVX2 based handler, so credit goes to its authors. At this point, the new handler is meant for field trials, and is not selected by default. So add a new devarg enable-avx2-rx to allow the user to request the use of the new handler. When enable-avx2-rx=1, the driver will consider using the new handler. Also update the guide doc and introduce the vectorized handler. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:49 +02:00
Hyong Youb Kim	cd4e7b3250	net/enic: move common Rx functions to a new header file Move a number of Rx functions to the header file so that the avx2 based Rx handler can use them. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:49 +02:00
John Daley	86df6c4e2f	net/enic: support flow counter action Support counter action for 1400 series adapters. The adapter API for allocating and freeing counters is independent of the adapter match/action API. If the filter action is requested, a counter is first allocated and then assigned to the filter, and when the filter is deleted, the counter must also be deleted. Counters are DMAd to pre-allocated consistent memory periodically, controlled by the define VNIC_FLOW_COUNTER_UPDATE_MSECS. The default is 100 milliseconds. Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-10-11 18:53:48 +02:00
John Daley	85b0ccec38	net/enic: fix flow API memory leak rte_flow structures were not being freed when destroyed or flushed. Fixes: `6ced137607` ("net/enic: flow API for NICs with advanced filters enabled") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Signed-off-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:48 +02:00
Hyong Youb Kim	308b514b8e	net/enic: explicitly disable overlay offload Reopening vNIC does not automatically disable overlay offload. If it is previously enabled, it remains enabled even when the user restarts DPDK and requests overlay offload to be disabled via devarg disable-overlay=1. So explicitly disable overlay offload when requested. Fixes: `93fb21fdbe` ("net/enic: enable overlay offload for VXLAN and GENEVE") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:48 +02:00
Hyong Youb Kim	70401fd778	net/enic: add VLAN and csum offloads to simple Tx handler Currently the simple Tx handler supports no offloads, which makes it usable only for a small number of benchmarks. Add vlan and checksum offloads to the handler, as cycles/packet increases only by about 3 cycles, and applications commonly use those offloads. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:48 +02:00
Hyong Youb Kim	828cf603a1	net/enic: do not use deprecated Tx VLAN packet flag Replace PKT_TX_VLAN_PKT (deprecated) with PKT_TX_VLAN. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:48 +02:00
Hyong Youb Kim	fe5383d133	net/enic: set Rx VLAN offload flag for non-stripped packets The NIC indicates VLAN TCI to the driver even when VLAN stripping is disabled. The driver sets mbuf's vlan_tci but not PKT_RX_VLAN. Set PKT_RX_VLAN to indicate that vlan_tci is valid. Fixes: `c6f4555074` ("net/enic: add ethernet VLAN packet type") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:48 +02:00
Hyong Youb Kim	c0aae00d7d	net/enic: enable IOVA mode Cisco VIC models support RTE_IOVA_VA, so enable it. This change allows the driver to work properly when --no-huge is used, in combination with vfio and iommu. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:48 +02:00
Hyong Youb Kim	329380b3a1	net/enic: do not use non-standard integer types Bugzilla ID: 39 Fixes: `9913fbb91d` ("enic/base: common code") Fixes: `322b355f21` ("net/enic/base: bring NIC interface functions up to date") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-10-11 18:53:48 +02:00
Ferruh Yigit	323e7b667f	ethdev: make default behavior CRC strip on Rx Removed DEV_RX_OFFLOAD_CRC_STRIP offload flag. Without any specific Rx offload flag, default behavior by PMDs is to strip CRC. PMDs that support keeping CRC should advertise DEV_RX_OFFLOAD_KEEP_CRC Rx offload capability. Applications that require keeping CRC should check PMD capability first and if it is supported can enable this feature by setting DEV_RX_OFFLOAD_KEEP_CRC in Rx offload flag in rte_eth_dev_configure() Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Tomasz Duszynski <tdu@semihalf.com> Acked-by: Shahaf Shuler <shahafs@mellanox.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Jan Remes <remes@netcope.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-09-14 20:08:41 +02:00
Hyong Youb Kim	0caa07034a	net/enic: reset VXLAN port during initialization The NIC persists the vxlan port number across vNIC init/de-init (e.g. restart testpmd). So, explicitly reset the setting to the default value (4789) as part of the initialization. Fixes: `8a4efd1741` ("net/enic: add handlers to add/delete vxlan port number") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-08-02 10:34:04 +02:00
Hyong Youb Kim	d16623dd39	net/enic: revert mbuf fast free offload This reverts the patch that enabled mbuf fast free. There are two main reasons. First, enic_fast_free_wq_bufs is broken. When DEV_TX_OFFLOAD_MBUF_FAST_FREE is enabled, the driver calls this function to free transmitted mbufs. This function currently does not reset next and nb_segs. This is simply wrong as the fast-free flag does not imply anything about next and nb_segs. We could fix enic_fast_free_wq_bufs by making it to call rte_pktmbuf_prefree_seg to reset the required fields. But, it negates most of cycle saving. Second, there are customer applications that blindly enable all Tx offloads supported by the device. Some of these applications do not satisfy the requirements of mbuf fast free (i.e. a single pool per queue and refcnt = 1), and end up crashing or behaving badly. Fixes: `bcaa54c1a1` ("net/enic: support mbuf fast free offload") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-08-02 10:26:02 +02:00
Hyong Youb Kim	920aae3f5c	net/enic: pick the right Rx handler after changing MTU enic_set_mtu always reverts to the default Rx handler after changing MTU. Try to use the simpler, non-scatter handler in this case as well. Fixes: `35e2cb6a17` ("net/enic: add simple Rx handler") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-23 23:55:26 +02:00
Thomas Monjalon	f8e9989606	remove useless constructor headers A constructor is usually declared with RTE_INIT* macros. As it is a static function, no need to declare before its definition. The macro is used directly in the function definition. Signed-off-by: Thomas Monjalon <thomas@monjalon.net>	2018-07-12 00:00:35 +02:00
John Daley	2c06cebeb9	net/enic: cap Rx packet processing to end of desc ring In the default Rx handler stop processing packets at the end of the completion ring so that wrapping doesn't have to be checked in the inner while loop. Also, check the color bit in the completion without using a conditional. Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-07-03 01:54:27 +02:00
John Daley	35e2cb6a17	net/enic: add simple Rx handler Add an optimized Rx handler for non-scattered Rx. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Signed-off-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:26 +02:00
Hyong Youb Kim	5a12c38740	net/enic: check maximum packet size in Tx prepare handler The default tx handler checks the maximum packet size. Check it in the prepare handler too. WQ stops working if the app/driver tries to send oversized packets, so these checks are unavoidable. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:25 +02:00
Hyong Youb Kim	ed933c35ca	net/enic: add the simple version of Tx handler Add a much-simplified handler that works when all offloads are disabled, except mbuf fast free. When compared against the default handler, under ideal conditions, cycles per packet drop by 60+%. The driver tries to use the simple handler first. The idea of using specialized/simplified handlers is from the Intel and Mellanox drivers. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:23 +02:00
Hyong Youb Kim	c55614d102	net/enic: reduce Tx completion updates Request one completion update per roughly 32 buffers. It saves DMA resources on the NIC, PCIe utilization, and cache miss rates. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:22 +02:00
Hyong Youb Kim	bcaa54c1a1	net/enic: support mbuf fast free offload Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:20 +02:00
Hyong Youb Kim	d355a942b1	net/enic: use mbuf pointer array for inflight Tx packets WQ is currently using vnic_wq_buf to store mbuf pointers for Tx packets. But, it contains an unused mempool pointer and mbuf is unnecessarily cast to void pointer. Remove vnic_wq_buf entirely and use an mbuf pointer array instead. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:19 +02:00
Hyong Youb Kim	8a4efd1741	net/enic: add handlers to add/delete vxlan port number The NIC has one configurable VXLAN port, which is set to the default 4789 upon vNIC reset. Adding a non-default port replaces this single VXLAN port. Deleting the previously added non-default port restores the VXLAN port to the hardware default. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:17 +02:00
Hyong Youb Kim	e39c2756e2	net/enic: add devarg to specify ingress VLAN rewrite mode Add a new devarg "ig-vlan-rewrite" to allow the user to set non-default rewrite mode. The UCS VIC may add/remove/modify the VLAN header of an ingress packet depending on the ingress VLAN rewrite mode. By default, the driver sets the pass-through mode, which tells the NIC "do not touch VLAN header and preserve it as is". This mode is usually sufficient, but can complicate deployments for certain environments. For example, OVS-DPDK in UCS blade environments may want to use "untag default VLAN mode", which removes the VLAN header from an ingress packet if it matches vNIC's default VLAN. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:15 +02:00
Hyong Youb Kim	9466a38d38	net/enic: report ring limits and preferred default values Report min/max ring sizes, alignments, and so on, and rely on the common checks implemented in the rte_ethdev layer. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:54:12 +02:00
Hyong Youb Kim	1c7c3ad1a0	net/enic: initialize RQ fetch index before enabling RQ The fetch index must be initialized only when RQ is disabled. Otherwise, it may lead to stale entries in IG descriptor cache on the VIC. Fixes: `a74629cfa3` ("net/enic: enable RQ first and then post Rx buffers") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:53:57 +02:00
Hyong Youb Kim	2a7e3d5465	net/enic: do not overwrite admin Tx queue limit Currently, enic_alloc_wq (via rte_eth_tx_queue_setup) may overwrite the admin limit with a lower value. This is wrong as seen in the following sequence. 1. UCS admin-set Tx queue limit (config.wq_desc_count) = 4096 2. Set up tx queue with 512 descriptors The admin limit (config.wq_desc_count) becomes 512. 3. Stop ports and now set up Tx queue with 1024 descriptors. This fails because 1024 is greater than the admin limit (512). Do not modify the admin limit, and when queried, report the current number of descriptors instead of the admin limit. The rx queue setup (enic_alloc_rq) does not this problem. Fixes: `fefed3d1e6` ("enic: new driver") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:53:44 +02:00
Hyong Youb Kim	5bc989e6db	net/enic: update the UDP RSS detection mechanism The UDP RSS interface has changed in the release firmware for 100G VIC adapters. The capability bit is now in NIC_CFG. Also the driver is supposed to use CMD_NIC_CFG_CHK and check if RSS config is successful. No more changes are expected with respect to UDP RSS API. Fixes: `94c3518958` ("net/enic: update UDP RSS controls") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:53:26 +02:00
Hyong Youb Kim	15666e5309	net/enic: fix receive packet types Fix missing or incorrect packet types discovered by DTS. - Non-IP inner packets Set the tunnel flag. - Inner Ethernet packets All supported tunnel packets have Ethernet as inner packets. So, set INNER_L2_ETHER for all tunnel types. - IPv4 fragments carrying TCP/UDP The NIC indicates TCP/UDP based on the protocol in IP header. For fragments, ignore that bit and always set L4_FRAG. - IPv6 fragments The NIC does regconize fragments (IPv6 packets with fragment extension headers). Set packet types for these. Fixes: `93fb21fdbe` ("net/enic: enable overlay offload for VXLAN and GENEVE") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-07-03 01:49:51 +02:00
Hyong Youb Kim	2b9919feab	net/enic: fix missing offload capabilities Add the following missing flags to the advertised offloads. - DEV_RX_OFFLOAD_CRC_STRIP CRC is always stripped. - DEV_RX_OFFLOAD_JUMBO_FRAME Jumbo support is always enabled on the NIC. - DEV_RX_OFFLOAD_SCATTER Scatter Rx is currently supported. - DEV_TX_OFFLOAD_MULTI_SEGS Multiple-segment transmit has always been supported. Fixes: `93fb21fdbe` ("net/enic: enable overlay offload for VXLAN and GENEVE") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-05-14 22:32:23 +01:00
Hyong Youb Kim	9713ece25c	net/enic: fix flow drop action Drop is a fate-deciding action, so mark it as FATE. It was missing in a previous commit. Fixes: `cc17feb904` ("ethdev: alter behavior of flow API actions") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-05-14 22:31:52 +01:00
Hyong Youb Kim	94c3518958	net/enic: update UDP RSS controls Current adapters which support UDP RSS piggyback on TCP RSS. Change the controls to be forward compatible with future adapters, which will have independent control of UDP and TCP. Fixes: `9bd04182bb` ("net/enic: support UDP RSS on 1400 series adapters") Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: Aaron Conole <aconole@redhat.com>	2018-05-14 22:31:50 +01:00
Hyong Youb Kim	3656c30e48	net/enic: fix RSS hash type advertisement The NIC can hash these RSS packet types, but they are not advertised via flow_type_rss_offloads. So add them. - Part of the IPv4 hash: ETH_RSS_FRAG_IPV4 ETH_RSS_NONFRAG_IPV4_OTHER - Part of the IPv6 hash: ETH_RSS_FRAG_IPV6 ETH_RSS_NONFRAG_IPV6_OTHER - Part of the UDP hash: ETH_RSS_IPV6_UDP_EX Also, do not use NIC_CFG_RSS_HASH_TYPE_IPV6_EX and NIC_CFG_RSS_HASH_TYPE_TCP_IPV6_EX, as they are not needed to enable RSS over IPv6 with extension headers. Fixes: `c2fec27b5c` ("net/enic: allow to change RSS settings") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com> Reviewed-by: Aaron Conole <aconole@redhat.com>	2018-05-14 22:31:50 +01:00
John Daley	579cc855af	net/enic: set rte errno to positive value Related to `d9fff8a31`, where rte_errno should always have positive errno values. Technically this is an ABI change since it fixes an error code introduced in 18.02, but is minor and inconsequential. Fixes: `1e81dbb532` ("net/enic: add Tx prepare handler") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com> Reviewed-by: Aaron Conole <aconole@redhat.com>	2018-05-14 22:31:50 +01:00
Hyong Youb Kim	95faa2a9df	net/enic: fix the MTU handler to rely on max packet length The RQ setup functions (enic_alloc_rq and enic_alloc_rx_queue_mbufs) have changed to rely on max_rx_pkt_len to determine the use of scatter and buffer size. But, the MTU handler only updates ethdev's MTU value. So make it update max_rx_pkt_len as well. Other PMDs also update both mtu and max_rx_pkt_len in their MTU handlers. Also the condition for taking a short cut (scatter is disabled) in the MTU handler is wrong. Even when scatter is disabled, a change in max_rx_pkt_len may affect the buffer size posted to the NIC. So remove that condition. Finally, fix a comment and a warning message condition. Fixes: `422ba91716` ("net/enic: heed the requested max Rx packet size") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com> Reviewed-by: Aaron Conole <aconole@redhat.com>	2018-05-14 22:31:50 +01:00
Hyong Youb Kim	a74629cfa3	net/enic: enable RQ first and then post Rx buffers Future VIC adapters may require that the driver enable RQ before posting new buffers to the NIC. So split enic_alloc_rx_queue_mbufs() into two functions, one that allocates buffers and fills RQ and the other that posts them (i.e. PIO write to a doorbell). And, call the post function only after enabling RQ. Currently released models are not affected by this change, as they work fine whether the driver posts buffers before or after enabling RQ. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com> Reviewed-by: Aaron Conole <aconole@redhat.com>	2018-05-14 22:31:50 +01:00
Adrien Mazarguil	76e9a55b5b	ethdev: add transfer attribute to flow API This new attribute enables applications to create flow rules that do not simply match traffic whose origin is specified in the pattern (e.g. some non-default physical port or VF), but actively affect it by applying the flow rule at the lowest possible level in the underlying device. It breaks ABI compatibility for the following public functions: - rte_flow_copy() - rte_flow_create() - rte_flow_validate() Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2018-04-27 18:00:54 +01:00
Adrien Mazarguil	e58638c324	ethdev: fix TPID handling in flow API TPID handling in rte_flow VLAN and E_TAG pattern item definitions is not consistent with the normal stacking order of pattern items, which is confusing to applications. Problem is that when followed by one of these layers, the EtherType field of the preceding layer keeps its "inner" definition, and the "outer" TPID is provided by the subsequent layer, the reverse of how a packet looks like on the wire: Wire: [ ETH TPID = A \| VLAN EtherType = B \| B DATA ] rte_flow: [ ETH EtherType = B \| VLAN TPID = A \| B DATA ] Worse, when QinQ is involved, the stacking order of VLAN layers is unspecified. It is unclear whether it should be reversed (innermost to outermost) as well given TPID applies to the previous layer: Wire: [ ETH TPID = A \| VLAN TPID = B \| VLAN EtherType = C \| C DATA ] rte_flow 1: [ ETH EtherType = C \| VLAN TPID = B \| VLAN TPID = A \| C DATA ] rte_flow 2: [ ETH EtherType = C \| VLAN TPID = A \| VLAN TPID = B \| C DATA ] While specifying EtherType/TPID is hopefully rarely necessary, the stacking order in case of QinQ and the lack of documentation remain an issue. This patch replaces TPID in the VLAN pattern item with an inner EtherType/TPID as is usually done everywhere else (e.g. struct vlan_hdr), clarifies documentation and updates all relevant code. It breaks ABI compatibility for the following public functions: - rte_flow_copy() - rte_flow_create() - rte_flow_query() - rte_flow_validate() Summary of changes for PMDs that implement ETH, VLAN or E_TAG pattern items: - bnxt: EtherType matching is supported with and without VLAN, but TPID matching is not and triggers an error. - e1000: EtherType matching is only supported with the ETHERTYPE filter, which does not support VLAN matching, therefore no impact. - enic: same as bnxt. - i40e: same as bnxt with existing FDIR limitations on allowed EtherType values. The remaining filter types (VXLAN, NVGRE, QINQ) do not support EtherType matching. - ixgbe: same as e1000, with additional minor change to rely on the new E-Tag macro definition. - mlx4: EtherType/TPID matching is not supported, no impact. - mlx5: same as bnxt. - mvpp2: same as bnxt. - sfc: same as bnxt. - tap: same as bnxt. Fixes: `b1a4b4cbc0` ("ethdev: introduce generic flow API") Fixes: `99e7003831` ("net/ixgbe: parse L2 tunnel filter") Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com> Acked-by: Andrew Rybchenko <arybchenko@solarflare.com>	2018-04-27 18:00:54 +01:00
Adrien Mazarguil	cc17feb904	ethdev: alter behavior of flow API actions This patch makes the following changes to flow rule actions: - List order now matters, they are redefined as performed first to last instead of "all simultaneously". - Repeated actions are now supported (e.g. specifying QUEUE multiple times now duplicates traffic among them). Previously only the last action of any given kind was taken into account. - No more distinction between terminating/non-terminating/meta actions. Flow rules themselves are now defined as always terminating unless a PASSTHRU action is specified. These changes alter the behavior of flow rules in corner cases in order to prepare the flow API for actions that modify traffic contents or properties (e.g. encapsulation, compression) and for which order matter when combined. Previously one would have to do so through multiple flow rules by combining PASSTRHU with priority levels, however this proved overly complex to implement at the PMD level, hence this simpler approach. This breaks ABI compatibility for the following public functions: - rte_flow_create() - rte_flow_validate() PMDs with rte_flow support are modified accordingly: - bnxt: no change, implementation already forbids multiple actions and does not support PASSTHRU. - e1000: no change, same as bnxt. - enic: modified to forbid redundant actions, no support for default drop. - failsafe: no change needed. - i40e: no change, implementation already forbids multiple actions. - ixgbe: same as i40e. - mlx4: modified to forbid multiple fate-deciding actions and drop when unspecified. - mlx5: same as mlx4, with other redundant actions also forbidden. - sfc: same as mlx4. - tap: implementation already complies with the new behavior except for the default pass-through modified as a default drop. Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com> Reviewed-by: Andrew Rybchenko <arybchenko@solarflare.com>	2018-04-27 18:00:53 +01:00
John Daley	05e8568269	net/enic: fix uninitialized variable A local variable was used without initialization and triggered a coverity issue. Is is fixed here, but there is no ill effect of not initializing the variable in this case. 'rxq_interrupt_offset' is irrelevant if 'rxq_interrupt_enable' is not set (the condition caught by coverity). Coverity issue: 268314 Fixes: `fc2c8c0668` ("net/enic: use Tx completion index instead of messages") Cc: stable@dpdk.org Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-04-27 15:54:56 +01:00
Hyong Youb Kim	93fb21fdbe	net/enic: enable overlay offload for VXLAN and GENEVE Recent NIC models support overlay offload. The overlay offload feature enables the following on the NIC. - Rx/Tx checksum offloads for both inner and outer packets. - Rx inner packet type classification. - TSO. - Inner RSS. TX descriptors do not require any changes, except the header length for TSO. The NIC parses outer/inner packets and performs offloads on them as necessary. The header length for tunneled TSO includes both inner and outer headers. The NIC actually parses and performs the above for NVGRE as well. DPDK currently has no offload flags for NVGRE, and the hardware has no controls to individually enable tunnel types either. So do nothing for now. The driver enables overlay offload by default. Add a devargs 'disable-overlay=<0\|1>' to allow the app to disable it. Also update the enic guide doc. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-04-27 15:54:55 +01:00
David Marchand	740f5bf176	net/enic: add primary MAC address handler Modified enic_del_mac_address() to get a return value from the vnic layer. Reused the .mac_addr_add and .mac_addr_del callbacks code to implement primary mac address handler. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-04-27 15:54:55 +01:00
Ferruh Yigit	cd8c7c7ce2	ethdev: replace bus specific struct with generic dev Public struct rte_eth_dev_info has a "struct rte_pci_device" field in it although it is common for all ethdev in all buses. Replacing pci specific struct with generic device struct and updating places that are using pci device in a way to get this information from generic device. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: David Marchand <david.marchand@6wind.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2018-04-14 00:41:44 +02:00
Hyong Youb Kim	036c545da1	net/enic: support drop flow action 1330 and 1400 series adapters support the drop action. Check for its availability and set the necessary flag when creating NIC filters. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-04-14 00:41:44 +02:00
John Daley	33a2d65949	net/enic: fix crash on MTU update with non-setup queues The enic code called from rte_eth_dev_set_mtu() was assuming that the Rx queues are already set up via a call to rte_eth_tx_queue_setup(). OVS calls rte_eth_dev_set_mtu() before rte_eth_rx_queue_setup() and a null pointer was dereferenced. Fixes: `c3e09182bc` ("net/enic: support scatter Rx in MTU update") Cc: stable@dpdk.org Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-04-14 00:41:44 +02:00
John Daley	9bd04182bb	net/enic: support UDP RSS on 1400 series adapters Recent models support IPv4/IPv6 UDP RSS. There is no control bit to enable UDP RSS alone. Instead, the NIC enables/disables TCP and UDP RSS together. Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-04-14 00:41:44 +02:00
Hyong Youb Kim	fe26a3bb33	net/enic: do not flush descriptor cache when opening vNIC The firmware on new hardware models flushes the global descriptor cache by default. Use CMD_OPENF_IG_DESCCACHE to avoid cache flushing. This flag has no effect on older models. Suggested-by: Govindarajulu Varadarajan <gvaradar@cisco.com> Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-04-14 00:41:44 +02:00
Anatoly Burakov	46e4fb122f	net/enic: use contiguous allocation for DMA memory All hardware drivers should allocate IOVA-contiguous memzones for their hardware resources. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Acked-by: John Daley <johndale@cisco.com> Tested-by: Santosh Shukla <santosh.shukla@caviumnetworks.com> Tested-by: Hemant Agrawal <hemant.agrawal@nxp.com> Tested-by: Gowrishankar Muthukrishnan <gowrishankar.m@linux.vnet.ibm.com>	2018-04-11 19:45:39 +02:00
Stephen Hemminger	5042dde07d	net/enic: use link status helper functions Use new rte_eth_linkstatus_get/set helper functions to handle link status update. This driver was not doing atomic update of link status information. And the return value was different than others. The hardware also does not do autonegotiation (at least on Linux). Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	e14dca8dc1	net/enic: support meson Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	7895d17cf4	net/enic: avoid strict aliasing warnings Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	0f872d3129	net/enic: support Rx queue interrupts Enable rx queue interrupts if the app requests them, and vNIC has enough interrupt resources. Use interrupt vector 0 for link status and errors. Use vector 1 for rx queue 0, vector 2 for rx queue 1, and so on. So, with n rx queues, vNIC needs to have at n + 1 interrupts. For VIC, enabling and disabling rx queue interrupts are simply mask/unmask operations. VIC's credit based interrupt moderation is not used, as the app wants to explicitly control when to enable/disable interrupts. This version requires MSI-X (vfio-pci). Sharing one interrupt for link status and rx queues is possible, but is rather complex and has no user demands. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	8d782f3f89	net/enic: allocate stats DMA buffer upfront during probe The driver provides a DMA buffer to the firmware when it requests port stats. The NIC then fills that buffer with latest stats. Currently, the driver allocates the DMA buffer the first time it requests stats and saves it for later use. This can lead to crashes when primary/secondary processes are involved. For example, the following sequence crashes the secondary process. 1. Start a primary app that does not call rte_eth_stats_get() 2. dpdk-procinfo -- --stats dpdk-procinfo crashes while trying to allocate the stats DMA buffer because the alloc function pointer (vdev.alloc_consistent) is valid only in the primary process, not in the secondary process. Overwriting the alloc function pointer in the secondary process is not an option, as it will simply make the pointer invalid in the primary process. Instead, allocate the DMA buffer during probe so that only the primary process does both allocate and free. This allows the secondary process to dump stats as well. Fixes: `9913fbb91d` ("enic/base: common code") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	92ca7ea444	net/enic: add Rx/Tx queue configuration getters Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	f9416bbafd	net/enic: remove VLAN filter handler VIC does not support VLAN filtering at the moment. The firmware does accept the filter add/del commands and returns success. But, they are no-ops. To avoid confusion, remove the filter set handler so the app sees an error instead of silent failure. Also during the device configure time, enicpmd_vlan_offload_set would not print a warning message about unsupported VLAN filtering, because the caller specifies only ETH_VLAN_STRIP_MASK. This is wrong, as we should attempt to apply all requested offloads at the configure time. So, pass all VLAN offload masks, which triggers a warning message about VLAN filtering, if requested. Finally, enicpmd_vlan_offload_set should check both mask and rxmode.offloads, not just mask. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	422ba91716	net/enic: heed the requested max Rx packet size Currently, enic completely ignores the requested max Rx packet size (rxmode.max_rx_pkt_len). The desired behavior is that the NIC hardware drops packets larger than the requested size, even though they are still smaller than MTU. Cisco VIC does not have such a feature. But, we can accomplish a similar (not same) effect by reducing the size of posted receive buffers. Packets larger than the posted size get truncated, and the receive handler drops them. This is also how the kernel enic driver enforces the Rx side MTU. This workaround works only when scatter mode is not used. When scatter is used, there is currently no way to support rxmode.max_rx_pkt_len, as the NIC always receives packets up to MTU. For posterity, add a copious amount of comments regarding the hardware's drop/receive behavior with respect to max/current MTU. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-03-30 14:08:43 +02:00
Hyong Youb Kim	c2fec27b5c	net/enic: allow to change RSS settings Currently, when more than 1 receive queues are configured, the driver always enables RSS with the driver's own default hash type, key, and RETA. The user is unable to change any of the RSS settings. Address this by implementing the ethdev RSS API as follows. Correctly report the RETA size, key size, and supported hash types through rte_eth_dev_info. During dev_configure(), initialize RSS according to the device's mq_mode and rss_conf. Start with the default RETA, and use the default key unless a custom key is provided. Add the RETA and rss_conf query/set handlers to let the user change RSS settings after the initial configuration. The hardware is able to change hash type, key, and RETA individually. So, the handlers change only the affected settings. Refactor/rename several functions in order to make their intentions clear. For example, remove all traces of RSS from enicpmd_vlan_offload_set() as it is confusing. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-03-30 14:08:43 +02:00
John Daley	d98f9d5c86	net/enic: remove extern in function declarations Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-03-30 14:08:43 +02:00
Harry van Haaren	415103fb33	net/enic: align dynamic log names with standard This commit aligns the names for dynamic logging with the newly defined logging format. Signed-off-by: Harry van Haaren <harry.van.haaren@intel.com> Acked-by: Hyong Youb Kim <hyonkim@cisco.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2018-01-31 09:28:06 +01:00
Hyong Youb Kim	d26ddeaf11	net/enic: set L4 checksum flags for IPv6 packets enic_cq_rx_to_pkt_flags() currently sets checksum good/bad flags only for IPv4. The hardware actually validates the TCP/UDP checksum of IPv6 packets too. Set PKT_RX_L4_CKSUM_{GOOD,BAD} accordingly. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-29 10:04:28 +01:00
Hyong Youb Kim	1e81dbb532	net/enic: add Tx prepare handler Like most NICs, this hardware (Cisco VIC) also requires partial checksum in the packet for checksum offload and TSO. So, add the tx_pkt_prepare handler like other PMDs do. Technically, VIC has an offload mode that does not require partial checksum for non-TSO packets. But, it has no such mode for TSO packets, making tx_pkt_prepare unavoidable. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-29 10:04:28 +01:00
Hyong Youb Kim	6c45c33058	net/enic: fix crash due to static max number of queues ENIC_CQ_MAX, ENIC_WQ_MAX and others are arbitrary values that prevent the app from using more queues when they are available on hardware. Remove them and dynamically allocate vnic_cq and such arrays to accommodate all available hardware queues. As a side effect of removing ENIC_CQ_MAX, this commit fixes a segfault that would happen when the app requests more than 16 CQs, because enic_set_vnic_res() does not consider ENIC_CQ_MAX. For example, the following command causes a crash. testpmd -- --rxq=16 --txq=16 Fixes: `ce93d3c36d` ("net/enic: fix resource check failures when bonding devices") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-29 10:04:28 +01:00
Ferruh Yigit	ffc905f3b8	ethdev: separate driver APIs Create a rte_ethdev_driver.h file and move PMD specific APIs here. Drivers updated to include this new header file. There is no update in header content and since ethdev.h included by ethdev_driver.h, nothing changed from driver point of view, only logically grouping of APIs. From applications point of view they can't access to driver specific APIs anymore and they shouldn't. More PMD specific data structures still remain in ethdev.h because of inline functions in header use them. Those will be handled separately. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Acked-by: Shreyansh Jain <shreyansh.jain@nxp.com> Acked-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2018-01-22 01:26:49 +01:00
John Daley	721c662594	net/enic: remove a conditional from the Tx path The VLAN insert flag and VLAN tag used in the VIC write descriptor can be set unconditionally. Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-01-16 18:47:49 +01:00
John Daley	4bb0a6feb3	net/enic: use TSO flags Depend on the tx_offload flags in the mbuf to determine the length of the headers instead of looking into the packet itself. Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	2e99ea80f8	net/enic: use BSD-3-Clause enic is currently using BSD-2-Clause, whereas the DPDK approved license is BSD-3-Clause. So replace license text with BSD-3-Clause. Remove LICENSE as it is redundant. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	36efba2f93	net/enic: use dynamic log types "pmd.enic.init" replaces CONFIG_RTE_LIBRTE_ENIC_DEBUG "pmd.enic.flow" replaces CONFIG_RTE_LIBRTE_ENIC_DEBUG_FLOW Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	b5df2f7ac5	net/enic: refill only the address of the RQ descriptor Once the RQ descriptors are initialized (enic_alloc_rx_queue_mbufs), their length_type does not change during normal RX operations. rx_pkt_burst only needs to reset their address field for newly allocated mbufs. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	e5f5b9269f	net/enic: remove a couple unnecessary statements No need to zero ol_flags as it is overwritten at the end of the function. No need to check for EOP as the caller (enic_recv_pkts) has already checked it. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	c23cd156f0	net/enic: remove remaining header-split code Header splitting has been disabled at least since the following commit. Remove the remaining code to avoid confusion. commit `947d860c82` ("enic: improve Rx performance") Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	5dbff3af25	net/enic: fix L4 Rx ptype comparison For non-UDP/TCP packets, enic may wrongly set PKT_RX_L4_CKSUM_BAD in ol_flags. The comparison that checks if a packet is UDP or TCP assumes that RTE_PTYPE_L4 values are bit flags, but they are not. For example, the following evaluates to true because NONFRAG is 0x600 and UDP is 0x200, and causes the current code to think the packet is UDP. !!(RTE_PTYPE_L4_NONFRAG & RTE_PTYPE_L4_UDP) So, fix this by comparing the packet type against UDP and TCP individually. Fixes: `453d15059b` ("net/enic: use new Rx checksum flags") Cc: stable@dpdk.org Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	023f19d669	net/enic: do not set checksum unknown offload flag PKT_RX_IP_CKSUM_UNKNOWN and PKT_RX_L4_CKSUM_UNKNOWN are zeros, so no need to set them. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Hyong Youb Kim	a062bafa62	net/enic: use the new ethdev offloads API The following commits deprecate the use of the offload bit fields (e.g. header_split) in rte_eth_rxmode and txq_flags in rte_eth_txconf. commit `ce17eddefc` ("ethdev: introduce Rx queue offloads API") commit `cba7f53b71` ("ethdev: introduce Tx queue offloads API") For enic, the required changes are mechanical. Use the new 'offloads' field in rxmode instead of the bit fields. And, no changes required with respect to txq_flags, as enic does not use it at all. Per-queue RX offload capabilities are not set, as all offloads are per-port at the moment. Signed-off-by: Hyong Youb Kim <hyonkim@cisco.com> Reviewed-by: John Daley <johndale@cisco.com>	2018-01-16 18:47:49 +01:00
Thomas Monjalon	cebe3d7b3d	ethdev: remove useless parameter in callback process The pointer to the user parameter of the callback registration is automatically pass to the callback function. There is no point to allow changing this user parameter by a caller. That's why this parameter is always set to NULL by PMDs and set only in ethdev layer before calling the callback function. The history is that the user parameter was initially used by the callback implementation to pass some information between the application and the driver: `c1ceaf3ad0` ("ethdev: add an argument to internal callback function") Then a new parameter has been added to leave the user parameter to its standard usage of context given at registration: `d6af1a13d7` ("ethdev: add return values to callback process API") The NULL parameter in the internal callback processing function is now removed. It makes clear that the callback parameter is user managed and opaque from a DPDK point of view. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2018-01-16 18:47:49 +01:00
John Daley	cafba10bc1	net/enic: fix TSO for packets greater than 9208 bytes A check was previously added to drop Tx packets greater than what the Nic is capable of sending since such packets can freeze the send queue. The check did not account for TSO packets however, so TSO was limited to 9208 bytes. Check packet length only for non-TSO packets. Also insure that TSO packet segment size plus the headers do not exceed what the Nic is capable of since this also can freeze the send queue. Use the PKT_TX_TCP_SEG ol_flag instead of m->tso_segsz which is the preferred way to check for TSO. Fixes: `ed6e564c21` ("net/enic: fix memory leak with oversized Tx packets") Cc: stable@dpdk.org Signed-off-by: John Daley <johndale@cisco.com>	2017-11-02 19:32:04 +01:00
Santosh Shukla	455da54539	mbuf: rename physical address to IOVA Rename buf_physaddr to buf_iova. Keep the deprecated name in an anonymous union to avoid breaking the API. Signed-off-by: Santosh Shukla <santosh.shukla@caviumnetworks.com> Reviewed-by: Anatoly Burakov <anatoly.burakov@intel.com> Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2017-11-06 22:44:26 +01:00
Thomas Monjalon	f17ca7870f	memzone: rename address from physical to IOVA The struct rte_memzone field .phys_addr is renamed to .iova. The deprecated name is kept in an anonymous union to avoid breaking the API. Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Santosh Shukla <santosh.shukla@caviumnetworks.com>	2017-11-06 22:25:44 +01:00
Gaetan Rivet	c752998b5e	pci: introduce library and driver The PCI lib defines the types and methods allowing to use PCI elements. The PCI bus implements a bus driver for PCI devices by constructing rte_bus elements using the PCI lib. Move the relevant code out of the EAL to its expected place. Libraries, drivers, unit tests and applications are updated to use the new rte_bus_pci.h header when necessary. Signed-off-by: Gaetan Rivet <gaetan.rivet@6wind.com>	2017-10-26 23:17:31 +02:00
David Harton	289ba0c0f5	ethdev: allow returning error on VLAN offload ops Some devices may not support or fail setting VLAN offload configuration based on dynamic circumstances so the vlan_offload_set_t vector is modified to return an int so the caller can determine success or not. rte_eth_dev_set_vlan_offload is updated to return the value provided by the vector when called along with restoring the original offload configs on failure. Existing vlan_offload_set_t vectors are modified to return an int. Majority of cases return 0 but a few that actually can fail now return their failure codes. Finally, a vlan_offload_set_t vector is added to virtio to facilitate dynamically turning VLAN strip on or off. Signed-off-by: David Harton <dharton@cisco.com> Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-10-26 02:33:01 +02:00
Olivier Matz	380a7aab1a	mbuf: rename deprecated VLAN flags PKT_RX_VLAN_PKT and PKT_RX_QINQ_PKT are deprecated for a while. As explained in [1], these flags were kept to let the applications and PMDs move to the new flag. There is also a need to support Rx vlan offload without vlan strip (at least for the ixgbe driver). This patch renames the old flags for this feature, knowing that some PMDs were using PKT_RX_VLAN_PKT and PKT_RX_QINQ_PKT to indicate that the vlan tci has been saved in the mbuf structure. It is likely that some PMDs do not set the proper flags when doing vlan offload, and it would be worth making a pass on all of them. Link: [1] http://dpdk.org/ml/archives/dev/2017-June/067712.html Signed-off-by: Olivier Matz <olivier.matz@6wind.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2017-10-26 02:33:01 +02:00
John Daley	ea5f15b1c4	net/enic: fix packet loss after MTU change If multiple Rx queues and Rx Scatter are used and the MTU is modified so that the number of mbufs per packet changes, packet loss is possible. The enic completion queue index was miscalculated leaving the upper half of the queues uninitialized after an MTU change, possibly leading to completions on those queues not getting processed. Fixes: `c3e09182bc` ("net/enic: support scatter Rx in MTU update") Cc: stable@dpdk.org Signed-off-by: John Daley <johndale@cisco.com>	2017-10-26 02:33:01 +02:00

1 2 3 4 5 ...

291 Commits