numam-dpdk

Author	SHA1	Message	Date
John Daley	947d860c82	enic: improve Rx performance This is a wholesale replacement of the Enic PMD receive path in order to improve performance and code clarity. The changes are: - Simplify and reduce code path length of receive function. - Put most of the fast-path receive functions in one file. - Reduce the number of posted_index updates (pay attention to rx_free_thresh) - Remove the unneeded container structure around the RQ mbuf ring - Prefetch next Mbuf and descriptors while processing the current one - Use a lookup table for converting CQ flags to mbuf flags. Signed-off-by: John Daley <johndale@cisco.com>	2016-03-16 16:57:12 +01:00
Yoann Desmouceaux	39ab35a36a	enic: fix DMA address of outgoing packets The enic PMD driver send function uses a constant offset instead of relying on the data_off in the mbuf to find the start of the packet. Fixes: `fefed3d1e6` ("enic: new driver") Signed-off-by: Yoann Desmouceaux <ydesmouc@cisco.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2016-03-16 16:55:54 +01:00
Rahul Lakkireddy	13b0f50006	cxgbe: fix PCI info copy to ports under same PF Chelsio NIC ports share a single PF. Move rte_eth_copy_pci_info() to copy the pci device information to the remaining ports as well. Also update license year to 2016. Fixes: `eeefe73f0a` ("drivers: copy PCI device info to ethdev data") Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2016-03-16 16:55:01 +01:00
Rahul Lakkireddy	1c1789ccb3	cxgbe: fix memory leak after initialization failure Add missing code to free adapter when the device initialization fails. Fixes: `8318984927` ("cxgbe: add pmd skeleton") Reported-by: Seth Arnold <seth.arnold@canonical.com> Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2016-03-16 16:54:13 +01:00
Rahul Lakkireddy	5a9e303a38	cxgbe: fix setting wrong MTU max_rx_pkt_len already includes ETHER_HDR_LEN and ETHER_CRC_LEN for the mtu. But, the firmware also adds ETHER_HDR_LEN and ETHER_CRC_LEN to the mtu specified. Fix by subtracting these values from the mtu before passing it to firmware. Fixes: `4b2eff452d` ("cxgbe: enable jumbo frames") Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2016-03-16 16:52:42 +01:00
Rahul Lakkireddy	8dca8cc5c6	cxgbe: fix allocated size for RSS table The size of each entry in the port's rss table is actually 2 bytes and not 1 byte. A segfault occurs when accessing part of port 0's rss table because it gets overwritten by subsequent port 1's part of the rss table. Fix by setting the size of each entry appropriately. Fixes: `92c8a63223` ("cxgbe: add device configuration and Rx support") Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2016-03-16 16:51:19 +01:00
Charles (Chas) Williams	9ec8127c3d	bnx2x: determine queue sizes sooner The VF needs to determine the queues sizes before .dev_infos_get so that it can hint to the upper layer the proper sizes. Move bnx2x_vf_get_resources() to .eth_dev_init and probe with the guesses from bnx2x_init_rte(). Signed-off-by: Chas Williams <3chas3@gmail.com> Acked-by: Rasesh Mody <rasesh.mody@qlogic.com>	2016-03-16 16:51:12 +01:00
Charles (Chas) Williams	31ec57cb6b	bnx2x: fix resource allocattion error handling bnx2x_loop_obtain_resources() returns a struct containing the status and the error message. If bnx2x_do_req4pf() fails, it shouldn't return both of these fields set to 0 indicating failure and no error. Further, bnx2x_do_req4pf() needs to be able fail and return NO_RESOURCES so that bnx2x_loop_obtain_resources() can negotiate reduced resource requirments. This requires additional checking around bnx2x_do_req4pf(). Fixes: `540a211084` ("bnx2x: driver core") Signed-off-by: Chas Williams <3chas3@gmail.com> Acked-by: Rasesh Mody <rasesh.mody@qlogic.com>	2016-03-16 16:50:23 +01:00
Stephen Hemminger	52be0607bf	bnx2x: remove unused variable The mbuf_alloc_size is leftover from BSD or some other code base. It is set but never used in DPDK driver. After that the related defines can also be eliminated. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Harish Patil <harish.patil@qlogic.com>	2016-03-16 16:49:15 +01:00
Liming Sun	d364ef9f5e	mpipe: fix crash when testpmd is quit under load Fixes: the hung/crash issue when quitting testpmd under high traffic rate. The following issue were found and fixed. 1. edesc->size is not initialized properly in mpipe_do_xmit() and could cause buffer leak or corruption when HW buffer return is used. 2. Check the 'idesc.be' error bit in mpipe_recv_flush() to make sure buffer is valid before releasing it. This is to avoid issues when running out of buffers. 3. priv->rx_buffers counter is not accurate when HW buffer return is used. Remove this counter to simplify the code. Signed-off-by: Liming Sun <lsun@ezchip.com> Acked-by: Zhigang Lu <zlu@ezchip.com>	2016-03-16 16:48:06 +01:00
Liming Sun	6069d815bc	mpipe: fix link initialization ordering Mpipe link structure is initialized in function mpipe_link_init(). Currently it's only called from the eth_dev_ops.dev_start, which caused crashes when link mgmt APIs (like promiscuous_enable) was called before eth_dev_ops.dev_start(). This submit fixed it by calling mpipe_link_init() in rte_pmd_mpipe_devinit(). Fixes: `a8dd50513d` ("mpipe: add TILE-Gx mPIPE poll mode driver") Signed-off-by: Liming Sun <lsun@ezchip.com> Acked-by: Zhigang Lu <zlu@ezchip.com>	2016-03-16 16:44:51 +01:00
Liming Sun	ca7e25e632	mpipe: optimize buffer return mechanism This submit has changes to optimize the mpipe buffer return. When a packet is received, instead of allocating and refilling the buffer stack right away, it tracks the number of pending buffers, and use HW buffer return as an optimization when the pending number is below certain threshold, thus save two MMIO writes and improves performance especially for bidirectional traffic case. Signed-off-by: Liming Sun <lsun@ezchip.com> Acked-by: Zhigang Lu <zlu@ezchip.com>	2016-03-16 16:44:41 +01:00
Liming Sun	23f58cd012	mk: support native build on TILE-Gx The CROSS variable has empty default value (for native) and must be set when using a cross-toolchain. Signed-off-by: Liming Sun <lsun@ezchip.com> Acked-by: Zhigang Lu <zlu@ezchip.com>	2016-03-16 15:24:38 +01:00
Thomas Monjalon	39fee04fc0	doc: fix IPsec entry in the release notes It was inserted in the "Resolved Issues" section. Move the entry with the new features. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-03-15 19:51:07 +01:00
Tetsuya Mukawa	fb871d0a4d	vhost: fix default value of kickfd and callfd Currently, default values of kickfd and callfd are -1. If the values are -1, current code guesses kickfd and callfd haven't been initialized yet. Then vhost library will guess the virtqueue isn't ready for processing. But callfd and kickfd will be set as -1 when "--enable-kvm" isn't specified in QEMU command line. It means we cannot treat -1 as uninitialized state. The patch defines -1 and -2 as VIRTIO_INVALID_EVENTFD and VIRTIO_UNINITIALIZED_EVENTFD, and uses VIRTIO_UNINITIALIZED_EVENTFD for the default values of kickfd and callfd. Signed-off-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-15 00:20:29 +01:00
Yuanhan Liu	a436f53ebf	vhost: avoid dead loop chain If a malicious guest forges a dead loop chain, it could lead to a dead loop of copying the desc buf to mbuf, which results to all mbuf being exhausted. Add a var nr_desc to avoid such case. Suggested-by: Huawei Xie <huawei.xie@intel.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-15 00:07:32 +01:00
Yuanhan Liu	c687b0b635	vhost: check for ring descriptors overflow A malicious guest may easily forge some illegal vring desc buf. To make our vhost robust, we need make sure desc->next will not go beyond the vq->desc[] array. Suggested-by: Rich Lane <rich.lane@bigswitch.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-15 00:05:59 +01:00
Yuanhan Liu	623bc47054	vhost: do sanity check for ring descriptor length We need make sure that desc->len is bigger than the size of virtio net header, otherwise, unexpected behaviour might happen due to "desc_avail" would become a huge number with for following code: desc_avail = desc->len - vq->vhost_hlen; For dequeue code path, it will try to allocate enough mbuf to hold such size of desc buf, which ends up with consuming all mbufs, leading to no free mbuf is available. Therefore, you might see an error message: Failed to allocate memory for mbuf. Also, for both dequeue/enqueue code path, while it copies data from/to desc buf, the big "desc_avail" would result to access memory not belong the desc buf, which could lead to some potential memory access errors. A malicious guest could easily forge such malformed vring desc buf. Every time we restart an interrupted DPDK application inside guest would also trigger this issue, as all huge pages are reset to 0 during DPDK re-init, leading to desc->len being 0. Therefore, this patch does a sanity check for desc->len, to make vhost robust. Reported-by: Rich Lane <rich.lane@bigswitch.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-15 00:03:46 +01:00
Yuanhan Liu	c252bcf9ec	vhost: remove wrong unlikely prediction in Rx VIRTIO_NET_F_MRG_RXBUF is a default feature supported by vhost. Adding unlikely for VIRTIO_NET_F_MRG_RXBUF detection doesn't make sense to me at all. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-14 23:59:47 +01:00
Yuanhan Liu	a98240621d	vhost: remove rte_memcpy from header copy First of all, rte_memcpy() is mostly useful for copying big packets by leveraging hardware advanced instructions like AVX. But for virtio net hdr, which is 12 bytes at most, invoking rte_memcpy() will not introduce any performance boost. And, to my suprise, rte_memcpy() is VERY huge. Since rte_memcpy() is inlined, it increases the binary code size linearly every time we call it at a different place. Replacing the two rte_memcpy() with directly copy saves nearly 12K bytes of code size! Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-14 23:58:11 +01:00
Yuanhan Liu	932a00b85a	vhost: refactor mergeable Rx Current virtio_dev_merge_rx() implementation just looks like the old rte_vhost_dequeue_burst(), full of twisted logic, that you can see same code block in quite many different places. However, the logic of virtio_dev_merge_rx() is quite similar to virtio_dev_rx(). The big difference is that the mergeable one could allocate more than one available entries to hold the data. Fetching all available entries to vec_buf at once makes the difference a bit bigger then. The refactored code looks like below: while (mbuf_has_not_drained_totally \|\| mbuf_has_next) { if (this_desc_has_no_room) { this_desc = fetch_next_from_vec_buf(); if (it is the last of a desc chain) update_used_ring(); } if (this_mbuf_has_drained_totally) mbuf = fetch_next_mbuf(); COPY(this_desc, this_mbuf); } This patch reduces quite many lines of code, therefore, make it much more readable. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-14 23:56:41 +01:00
Yuanhan Liu	282a94ba99	vhost: refactor Rx This is a simple refactor, as there isn't any twisted logic in old code. Here I just broke the code and introduced two helper functions, reserve_avail_buf() and copy_mbuf_to_desc() to make the code more readable. Also, it saves nearly 1K bytes of binary code size. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-14 23:55:06 +01:00
Yuanhan Liu	bc7f87a2c1	vhost: refactor dequeueing The current rte_vhost_dequeue_burst() implementation is a bit messy and logic twisted. And you could see repeat code here and there. However, rte_vhost_dequeue_burst() acutally does a simple job: copy the packet data from vring desc to mbuf. What's tricky here is: - desc buff could be chained (by desc->next field), so that you need fetch next one if current is wholly drained. - One mbuf could not be big enough to hold all desc buff, hence you need to chain the mbuf as well, by the mbuf->next field. The simplified code looks like following: while (this_desc_is_not_drained_totally \|\| has_next_desc) { if (this_desc_has_drained_totally) { this_desc = next_desc(); } if (mbuf_has_no_room) { mbuf = allocate_a_new_mbuf(); } COPY(mbuf, desc); } Note that the old patch does a special handling for skipping virtio header. However, that could be simply done by adjusting desc_avail and desc_offset var: desc_avail = desc->len - vq->vhost_hlen; desc_offset = vq->vhost_hlen; This refactor makes the code much more readable (IMO), yet it reduces binary code size. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-03-14 23:49:36 +01:00
Yuanhan Liu	36ea36efb4	virtio: fix query of legacy features Declare dst as type uint32_t instead of uint64_t, otherwise, we will get a random upper 32 bit feature bits, as the following io port read reads lower 32 bit only. It could lead a feature bits that include VIRTIO_F_VERSION_1 (the 32th bit) for legacy virtio, which is obviously wrong. Fixes: `b8f04520ad` ("virtio: use PCI ioport API") Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Jianfeng Tan <jianfeng.tan@intel.com> Reviewed-by: David Marchand <david.marchand@6wind.com>	2016-03-14 23:16:15 +01:00
Keith Wiles	6b5a857fb0	eal: decrease log level of some debug messages When log level is set to 7 (INFO) these messages are still displayed and should be set to DEBUG. Signed-off-by: Keith Wiles <keith.wiles@intel.com>	2016-03-13 23:44:35 +01:00
Stephen Hemminger	03d00293ca	sched: eliminate floating point in calculating byte clock The old code was doing a floating point divide for each rte_dequeue() which is very expensive. Change to using fixed point scaled inverse multiply. To maintain equivalent precision, scaled math is used. The application ABI is the same. This improved performance from 5Gbit/sec to 10 Gbit/sec when configured for 10 Gbit/sec rate. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2016-03-13 23:31:59 +01:00
Stephen Hemminger	ffe3ec811e	sched: introduce reciprocal divide This adds (with permission of the original author) reciprocal divide based on algorithm in Linux. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Hannes Frederic Sowa <hannes@stressinduktion.org>	2016-03-13 23:31:59 +01:00
Stephen Hemminger	4d51afb5cd	sched: keep track of RED drops Add new statistic to keep track of drops due to RED. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2016-03-13 23:28:00 +01:00
Panu Matilainen	4758404a30	mk: fix eal shared library dependencies Add DT_NEEDED entries for librte_eal external dependencies. Details between the platforms differ somewhat, and for static builds they need to be handled from mk/exec-env still. Signed-off-by: Panu Matilainen <pmatilai@redhat.com>	2016-03-13 20:27:38 +01:00
Panu Matilainen	0d822b8047	mk: fix vhost shared library dependencies Add DT_NEEDED entries for external library dependencies which are the most critical ones for sane operation. Clean up vhost_cuse CFLAGS/LDFLAGS confusion while at it. Signed-off-by: Panu Matilainen <pmatilai@redhat.com>	2016-03-13 20:27:26 +01:00
Panu Matilainen	e86a699cf6	mk: fix shared library dependencies on libm and librt There are two places that need -lm (test app and librte_sched) and exactly one that needs -lrt (librte_sched). Add the relevant DT_NEEDED entries to both, and eliminate the bogus discrepancy between Linux and BSD EXECENV_LDLIBS wrt these libs. Signed-off-by: Panu Matilainen <pmatilai@redhat.com>	2016-03-13 20:27:07 +01:00
Reshma Pattan	5a8fb55c48	app/testpmd: support unidirectional configuration Added testpmd support to validate zero nb_rxq/nb_txq changes of ethdev (`d505ba8`). Signed-off-by: Reshma Pattan <reshma.pattan@intel.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2016-03-13 16:22:55 +01:00
Panu Matilainen	e04f6a64f1	examples/ip_pipeline: use unsigned constants for left shift operations Tell the compiler to use unsigned constants for left shift ops, otherwise building with gcc >= 6.0 fails due to multiple warnings like: warning: left shift of negative value [-Wshift-negative-value] Signed-off-by: Panu Matilainen <pmatilai@redhat.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-03-13 16:22:55 +01:00
Jasvinder Singh	69a2f4dea7	examples/ip_pipeline: add load balancing to pass-through The pass-through pipeline implementation is extended with load balancing function. This function allows uniform distribution of the packets among its output ports. For packets distribution, any application level logic can be applied. For instance, in this implementation, hash value computed over specific header fields of the incoming packets has been used to spread traffic uniformly among the output ports. The following pass-through configuration can be used for implementing load balancing function over ipv4 traffic; [PIPELINE0] type = PASS-THROUGH core = 0 pktq_in = RXQ0.0 RXQ1.0 RXQ2.0 RXQ3.0 pktq_out = TXQ0.0 TXQ1.0 TXQ2.0 TXQ3.0 dma_src_offset = 278; mbuf (128) + headroom (128) + 1st ethertype offset (14) + ttl offset within ip header = 278 (ipv4) dma_dst_offset = 128; mbuf (128) dma_size = 16 dma_src_mask = 00FF0000FFFFFFFFFFFFFFFFFFFFFFFF dma_hash_offset = 144; (dma_dst_offset+dma_size) lb = hash Signed-off-by: Jasvinder Singh <jasvinder.singh@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-03-13 16:08:55 +01:00
Fan Zhang	fe5d046213	examples/ip_pipeline: add pcap file dump This patch add packet dumping feature to ip_pipeline. Output port type SINK now supports dumping packets to PCAP file before releasing mbuf back to mempool. This feature can be applied by specifying parameters in configuration file as shown below: [PIPELINE1] type = PASS-THROUGH core = 1 pktq_in = SOURCE0 SOURCE1 pktq_out = SINK0 SINK1 pcap_file_wr = /path/to/eth1.pcap /path/to/eth2.pcap pcap_n_pkt_wr = 80 0 The configuration section "pcap_file_wr" contains full path and name of the PCAP file which the packets will be dumped to. If multiple SINKs exists, each shall have its own PCAP file path listed in this section, separated by spaces. Multiple SINK ports shall NOT share same PCAP file to be dumped. The configuration section "pcap_n_pkt_wr" contains integer value(s) and indicates the maximum number of packets to be dumped to the PCAP file. If this value is "0", the "infinite" dumping mode will be used. If this value is N (N > 0), the dumping will be finished when the number of packets dumped to the file reaches N. To enable PCAP dumping support to IP pipeline, the compiler option CONFIG_RTE_PORT_PCAP must be set to 'y'. It is possible to disable this feature by removing "pcap_file_wr" and "pcap_n_pkt_wr" lines from the configuration file. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-03-13 16:04:16 +01:00
Fan Zhang	eb5f4119b2	port: add pcap file dump Originally, sink ports in librte_port releases received mbufs back to mempool. This patch adds optional packet dumping to PCAP feature in sink port: the packets will be dumped to user defined PCAP file for storage or debugging. The user may also choose the sink port's activity: either it continuously dump the packets to the file, or stops at certain dumping This feature shares same CONFIG_RTE_PORT_PCAP compiler option as source port PCAP file support feature. Users can enable or disable this feature by setting CONFIG_RTE_PORT_PCAP compiler option "y" or "n". Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-03-13 16:04:11 +01:00
Fan Zhang	0e1e7d53ae	examples/ip_pipeline: add pcap file source This patch add PCAP file support to ip_pipeline. Input port type SOURCE now supports loading specific PCAP file and sends the packets in it to pipeline instance. The packets are then released by SINK output port. This feature can be applied by specifying parameters in configuration file as shown below; [PIPELINE1] type = PASS-THROUGH core = 1 pktq_in = SOURCE0 SOURCE1 pktq_out = SINK0 SINK1 pcap_file_rd = /path/to/eth1.PCAP /path/to/eth2.PCAP pcap_bytes_rd_per_pkt = 0 64 The configuration section "pcap_file_rd" contains full path and name of the PCAP file to be loaded. If multiple SOURCEs exists, each shall have its own PCAP file path listed in this section, separated by spaces. Multiple SOURCE ports may share same PCAP file to be copied. The configuration section "pcap_bytes_rd_per_pkt" contains integer value and indicates the maximum number of bytes to be copied from each packet in the PCAP file. If this value is "0", all packets in the file will be copied fully; if the packet size is smaller than the assigned value, the entire packet is copied. Same as "pcap_file_rd", every SOURCE shall have its own maximum copy byte number. To enable PCAP support to IP pipeline, the compiler option CONFIG_RTE_PORT_PCAP must be set to 'y'. It is possible to disable PCAP support by removing "pcap_file_rd" and "pcap_bytes_rd_per_pkt" lines from the configuration file. Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-03-13 16:04:06 +01:00
Fan Zhang	d4b42133d8	port: add pcap file source Originally, source ports in librte_port is an input port used as packet generator. Similar to Linux kernel /dev/zero character device, it generates null packets. This patch adds optional PCAP file support to source port: instead of sending NULL packets, the source port generates packets copied from a PCAP file. To increase the performance, the packets in the file are loaded to memory initially, and copied to mbufs in circular manner. Users can enable or disable this feature by setting CONFIG_RTE_PORT_PCAP compiler option "y" or "n". Signed-off-by: Fan Zhang <roy.fan.zhang@intel.com> Acked-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2016-03-13 16:04:02 +01:00
Xutao Sun	765ecf2521	i40e: add tunnel filter for IP in GRE Signed-off-by: Xutao Sun <xutao.sun@intel.com> Signed-off-by: Jijiang Liu <jijiang.liu@intel.com>	2016-03-13 15:27:20 +01:00
Xutao Sun	7b1312891b	ethdev: add IP in GRE tunnel Signed-off-by: Xutao Sun <xutao.sun@intel.com> Signed-off-by: Jijiang Liu <jijiang.liu@intel.com>	2016-03-13 15:27:20 +01:00
Xutao Sun	dd76f93c2d	ethdev: rework tunnel filtering structure Change the fields of outer_mac and inner_mac in struct rte_eth_tunnel_filter_conf from pointer to struct in order to keep the code's readability. Signed-off-by: Xutao Sun <xutao.sun@intel.com> Signed-off-by: Jijiang Liu <jijiang.liu@intel.com> Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-03-13 15:26:55 +01:00
Wenzhuo Lu	d87c3058bb	ixgbe: offload VxLAN and NVGRE Tx checksum on X550 The patch add VxLAN & NVGRE TX checksum off-load. When the flag of outer IP header checksum offload is set, we'll set the context descriptor to enable this checksum off-load. Also update release notes for VxLAN & NVGRE checksum off-load support. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-03-13 11:52:58 +01:00
Wenzhuo Lu	d909af8f72	ixgbe: offload VxLAN and NVGRE Rx checksum on X550 X550 will do VxLAN & NVGRE RX checksum off-load automatically. This patch exposes the result of the checksum off-load. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-03-13 11:52:52 +01:00
Wenzhuo Lu	01c631b724	ixgbe: configure UDP tunnel port Add UDP tunnel port add/del support on ixgbe. Now only support VxLAN port configuration. Although according to the specification the VxLAN port has a default value 4789, it can be changed. We support VxLAN port configuration to meet the change. Note, the default value of VxLAN port in ixgbe NICs is 0. So please set it when using VxLAN off-load. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-03-13 11:48:26 +01:00
Wenzhuo Lu	1cbe755fef	ethdev: rename UDP tunnel port functions The names of function for tunnel port configuration are not accurate. They're tunnel_add/del, better change them to tunnel_port_add/del. The old functions are directly replaced because the API and ABI compatibility of ethdev are already broken in 16.04. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-03-13 11:44:33 +01:00
Wenzhuo Lu	0eec728ec9	app/testpmd: add commands for E-tag operation Add the CLIs to support the E-tag operation. 1, Offloading of E-tag insertion and stripping. 2, Forwarding the E-tag packets to pools based on the GRP and E-CID_base. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Shaopeng He <shaopeng.he@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> Tested-by: Yong Liu <yong.liu@intel.com>	2016-03-11 23:25:58 +01:00
Wenzhuo Lu	05f1b9c82e	app/testpmd: add commands for L2 tunnel config Add CLIs to config ether type of l2 tunnel, and to enable/disable a type of l2 tunnel. Now only e-tag tunnel is supported. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Shaopeng He <shaopeng.he@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> Tested-by: Yong Liu <yong.liu@intel.com>	2016-03-11 23:25:58 +01:00
Wenzhuo Lu	22e77d4501	ixgbe: support L2 tunnel operations Add support of l2 tunnel configuration and operations. 1, Support modifying ether type of a type of l2 tunnel. 2, Support enabling and disabling the support of a type of l2 tunnel. 3, Support enabling/disabling l2 tunnel tag insertion/stripping. 4, Support enabling/disabling l2 tunnel packets forwarding. 5, Support adding/deleting forwarding rules for l2 tunnel packets. Only support E-tag now. Also update the release note. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Shaopeng He <shaopeng.he@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> Tested-by: Yong Liu <yong.liu@intel.com>	2016-03-11 23:25:58 +01:00
Wenzhuo Lu	c49b2bad6a	ethdev: support L2 tunnel operations Add functions to support l2 tunnel configuration and operations. 1, L2 tunnel ether type modification. It means modifying the ether type of a specific type of tunnel. So the packet with this ether type will be parsed as this type of tunnel. 2, Enabling/disabling l2 tunnel support. It means enabling/disabling the ability of parsing the specific type of tunnel. This ability should be enabled before we enable filtering, forwarding, offloading for this specific type of tunnel. 3, Insertion and stripping for l2 tunnel tag. 4, Forwarding the packets to a pool based on l2 tunnel tag. Only support e-tag tunnel now. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Shaopeng He <shaopeng.he@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> Tested-by: Yong Liu <yong.liu@intel.com>	2016-03-11 23:25:57 +01:00
Wenzhuo Lu	43a942393d	ixgbe: select pool by MAC when using double VLAN On X550, as required by datasheet, E-tag packets are not expected when double VLAN are used. So modify the register PFVTCTL after enabling double VLAN to select pool by MAC but not MAC or E-tag. An introduction of E-tag: It's defined in IEEE802.1br. Please reference this website, http://www.ieee802.org/1/pages/802.1br.html. A brief description. E-tag means external tag, and it's a kind of l2 tunnel. It means a tag will be inserted in the l2 header. Like below, \|31 24\|23 16\|15 8\|7 0\| 0\| Destination MAC address \| 4\| Dest MAC address(cont.) \| Src MAC address \| 8\| Source MAC address(cont.) \| 12\| E-tag Etherenet type (0x893f) \| E-tag header \| 16\| E-tag header(cont.) \| 20\| VLAN Ethertype(optional) \| VLAN header(optional) \| 24\| Original type \| ...... \| ...\| ...... \| The E-tag format is like below, \|0 15\|16 18\|19 \|20 31\| \| Ethertype - 0x893f \| E-PCP \|DEI\| Ingress E-CID_base \| \|32 33\|34 35\|36 47\|48 55 \|56 63\| \| RSV \| GRP \|E-CID_base\|Ingress_E-CID_ext\| E-CID_ext \| The Ingess_E-CID_ext and E-CID_ext are always zero for endpoints and are effectively reserved. The more details of E-tag is in IEEE 802.1BR. 802.1BR is used to replace 802.1Qbh. 802.1BR is a standard for Bridge Port Extension. It specifies the operation of Bridge Port Extenders, including management, protocols, and algorithms. Bridge Port Extenders operate in support of the MAC Service by Extended Bridges. The E-tag is added to l2 header to identify the VM channel and the virtual port. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com> Acked-by: Shaopeng He <shaopeng.he@intel.com> Acked-by: Jingjing Wu <jingjing.wu@intel.com> Tested-by: Yong Liu <yong.liu@intel.com>	2016-03-11 22:56:00 +01:00

1 2 3 4 5 ...

4108 Commits