numam-dpdk

Author	SHA1	Message	Date
Chengwen Feng	59dc46043c	net/hns3: add reporting tunnel GRE packet type This patch supports reporting TUNNEL GRE packet type when rxd advanced layout enabled. Fixes: `fb5e906940` ("net/hns3: support Rx descriptor advanced layout") Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-14 19:45:27 +02:00
Chengwen Feng	7d6df32cf7	net/hns3: fix missing outer L4 UDP flag for VXLAN This patch adds RTE_PTYPE_L4_UDP flag when parsed tunnel vxlan packet. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-14 19:45:27 +02:00
Chengwen Feng	e40ad6fca4	net/hns3: fix verification of NEON support This patch adds verification of whether NEON supported. Fixes: `a3d4f4d291` ("net/hns3: support NEON Rx") Fixes: `e31f123db0` ("net/hns3: support NEON Tx") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-14 19:45:27 +02:00
Chengchang Tang	18da3c854b	net/hns3: fix queue state when concurrent with reset At the end of the reset, the state of queues need to be restored according to the states saved in the driver. If the start and stop operations of the queues are concurrent at this time, it may cause the final status to be uncertain. This patch requires queues to acquire the hw lock before starting and stopping. If the device is being restored due to reset at this time, it will block until the reset is completed. Fixes: `fa29fe45a7` ("net/hns3: support queue start and stop") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-13 11:13:41 +02:00
Chengchang Tang	fde636caf4	net/hns3: fix timing in resetting queues During the task queue pairs reset, the getimeofday is used to obtain the timestamp to determine whether the command execution times out. But gettimeofday is not monotonous, it can be modified by system administrators, so the timing may not be accurate or even cause the loop to wait consistently. And actually, in this scenario, it is not necessary to obtain the timestamp. This patch removes the operation of obtaining the timestamp from the task queue pairs reset function. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-13 11:13:41 +02:00
Chengwen Feng	1f303606e8	net/hns3: fix some packet types Currently, the packet type calculated by vlan/ovlan/l3id/l4id/ol3id/ol4id fields have the following problems: 1) Identify error when exist VLAN strip which will lead to the data buffer has non VLAN header but mbuf's ptype have L2_ETHER_VLAN flag. 2) Some packet identifies error, eg: hardware report it's RARP or unknown packet, but ptype will marked with L2_ETHER . So driver will calculate packet type only by l3id/l4id/ol3id/ol4id fields. Fixes: `0e98d5e6d9` ("net/hns3: fix packet type report in Rx") Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-13 11:13:41 +02:00
Chengwen Feng	7e2e162ed0	net/hns3: simplify selecting Rx/Tx function Currently, there are four control variables (rx_simple_allowed, rx_vec_allowed, tx_simple_allowed and tx_vec_allowed) which are used to impact the selection of Rx/Tx burst function. The purpose of the design is to provide a way to control the selection of Rx/Tx burst function by modifying it's values, but these variables have no entry to modify unless make intrusive modifications. Now we already support runtime config to select Rx/Tx function, these variables could be removed. Fixes: `a124f9e959` ("net/hns3: add runtime config to select IO burst function") Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-13 11:13:41 +02:00
Chengwen Feng	7feb2aee0e	net/hns3: log selected datapath This patch adds debug info for Rx/Tx burst function which was choosing. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-13 11:11:08 +02:00
Hongbo Zheng	3f3fac61bd	net/hns3: fix code style Add one space before the left brace to solve the static warning. Fixes: `63e05f19b8` ("net/hns3: support Rx descriptor status query") Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-08 18:57:09 +02:00
Min Hu (Connor)	53e6f86cf5	net/hns3: fix copyright date This patch updates copyright date for hns3 PMD files. Fixes: `565829db8b` ("net/hns3: add build and doc infrastructure") Fixes: `952ebacce4` ("net/hns3: support SVE Rx") Fixes: `e31f123db0` ("net/hns3: support NEON Tx") Fixes: `c09c7847d8` ("net/hns3: support traffic management") Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-08 17:55:35 +02:00
Min Hu (Connor)	fa485faca2	net/hns3: update HiSilicon copyright syntax According to the suggestion of our legal department, to standardize the copyright license of our code to avoid potential copyright risks, we make a unified modification to the "Hisilicon", which was nonstandard, in the main modules we maintain. We change it to "HiSilicon", which is consistent with the terms used on the following official website: https://www.hisilicon.com/en/terms-of-use. Fixes: `565829db8b` ("net/hns3: add build and doc infrastructure") Fixes: `952ebacce4` ("net/hns3: support SVE Rx") Fixes: `e31f123db0` ("net/hns3: support NEON Tx") Fixes: `c09c7847d8` ("net/hns3: support traffic management") Cc: stable@dpdk.org Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-06 18:28:13 +02:00
Min Hu (Connor)	38b539d96e	net/hns3: support IEEE 1588 PTP Add hns3 support for new ethdev APIs to enable and read IEEE1588/ 802.1AS PTP timestamps. Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-04-01 18:39:55 +02:00
Chengchang Tang	6911e7c22c	net/hns3: fix long task queue pairs reset time Currently, the queue reset process needs to be performed one by one, which is inefficient. However, the queues reset in the same function is almost at the same stage. To optimize the queue reset process, a new function has been added to the firmware command HNS3_OPC_CFG_RST_TRIGGER to reset all queues in the same function at a time. And the related queue reset MBX message is adjusted in the same way too. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-03-30 12:30:46 +02:00
Chengchang Tang	8f01e2f847	net/hns3: fix Tx checksum for UDP packets with special port For Kunpeng920 network engine, UDP packets with destination port 6081, 4789 or 4790 will be identified as tunnel packets. If the UDP CKSUM offload is set in the mbuf, and the TX tunnel mask is not set, the CKSUM of these packets will be wrong. In this case, the upper layer user may not identify the packet as a tunnel packet, and processes it as non-tunnel packet, and expect to offload the outer UDP CKSUM, so they may not fill the outer L2/L3 length to mbuf. However, the HW identifies these packet as tunnel packets and therefore offload the inner UDP CKSUM. As a result, the inner and outer UDP CKSUM are incorrect. And for non-tunnel UDP packets with preceding special destination port will also exist similar checksum error. For the new generation Kunpeng930 network engine, the above errata have been fixed. Therefore, the concept of udp_cksum_mode is introduced. There are two udp_cksum_mode for hns3 PMD, HNS3_SPECIAL_PORT_HW_CKSUM_MODE means HW could solve the above problem. And in HNS3_SPECIAL_PORT_SW_CKSUM_MODE, hns3 PMD will check packets in the Tx prepare and perform the UDP CKSUM for such packets to avoid a checksum error. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-03-30 12:30:46 +02:00
Chengchang Tang	a1d0caa92c	net/hns3: fix processing Tx offload flags Currently, if the PKT_TX_TCP_SEG and PKT_TX_TCP_CKSUM offload flags set in the same time, hns3 PMD can not process the descriptors correctly. This patch fixes it by adding the processing of this situation. Fixes: `fb6eb9009f` ("net/hns3: fix Tx checksum with fixed header length") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-03-30 12:30:46 +02:00
Hongbo Zheng	63e05f19b8	net/hns3: support Rx descriptor status query Add support for query Rx descriptor status in hns3 driver. Check the descriptor specified and provide the status information of the corresponding descriptor. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-03-23 13:04:33 +01:00
Hongbo Zheng	656a6d9cc0	net/hns3: support Tx descriptor status query Add support for query Tx descriptor status in hns3 driver. Check the descriptor specified and provide the status information of the corresponding descriptor. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-03-23 13:04:33 +01:00
Chengchang Tang	d0ab89e633	net/hns3: support outer UDP checksum Kunpeng930 support outer UDP cksum, this patch add support for it. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-03-23 13:04:32 +01:00
Chengwen Feng	a124f9e959	net/hns3: add runtime config to select IO burst function Currently, the driver support multiple IO burst function and auto selection of the most appropriate function based on offload configuration. Most applications such as l2fwd/l3fwd don't provide the means to change offload configuration, so it will use the auto selection's io burst function. This patch support runtime config to select io burst function, which add two config: rx_func_hint and tx_func_hint, both could assign vec/sve/simple/common. The driver will use the following rules to select io burst func: a. if hint equal vec and meet the vec Rx/Tx usage condition then use the neon function. b. if hint equal sve and meet the sve Rx/Tx usage condition then use the sve function. c. if hint equal simple and meet the simple Rx/Tx usage condition then use the simple function. d. if hint equal common then use the common function. e. if hint not set then: e.1. if meet the vec Rx/Tx usage condition then use the neon function. e.2. if meet the simple Rx/Tx usage condition then use the simple function. e.3. else use the common function. Note: the sve Rx/Tx usage condition based on the vec Rx/Tx usage condition and runtime environment (which must support SVE). In the previous versions, driver will preferred use the sve function when meet the sve Rx/Tx usage condition, but in this case driver could get better performance if use the neon function. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2021-03-23 13:04:32 +01:00
Chengwen Feng	fb5e906940	net/hns3: support Rx descriptor advanced layout Currently, the driver get packet type by parse the L3_ID/L4_ID/OL3_ID/OL4_ID from Rx descriptor and then lookup multiple tables, it's time consuming. Now Kunpeng930 support advanced RXD layout, which: 1. Combine OL3_ID/OL4_ID to 8bit PTYPE filed, so the driver get packet type by lookup only one table. Note: L3_ID/L4_ID become reserved fields. 2. The 1588 timestamp located at Rx descriptor instead of query from firmware. 3. The L3E/L4E/OL3E/OL4E will be zero when L3L4P is zero, so driver could optimize the good checksum calculations (when L3E/L4E is zero then mark PKT_RX_IP_CKSUM_GOOD/PKT_RX_L4_CKSUM_GOOD). Considering compatibility, the firmware will report capability of RXD advanced layout, the driver will identify and enable it by default. This patch only provides basic function: identify and enable the RXD advanced layout, and lookup ptype table if supported. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2021-03-04 15:07:14 +01:00
Min Hu (Connor)	fdcd6a3e02	net/hns3: add bytes stats In current HNS3 PMD, Rx/Tx bytes from packet stats are not implemented. This patch implemented Rx/Tx bytes using soft counters. Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2021-03-04 15:07:13 +01:00
Chengwen Feng	dfecc3201f	net/hns3: implement Tx mbuf free on demand This patch add support tx_done_cleanup ops, which could support for the API rte_eth_tx_done_cleanup to free consumed mbufs on Tx ring. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2021-03-04 15:07:13 +01:00
Chengchang Tang	2b6b09817d	net/hns3: fix interrupt resources in Rx interrupt mode For Kunpeng930, the NIC engine support 1280 tqps being taken over by a PF. In this case, a maximum of 1281 interrupt resources are also supported in this PF. To support the maximum number of queues, several patches are made. But the interrupt related modification are missing. So, in RX interrupt mode, a large number of queues will be aggregated into one interrupt due to insufficient interrupts. It will lead to waste of interrupt resources and reduces usability. To utilize all these interrupt resources, related IMP command has been extended. And, the I/O address of the extended interrupt resources are different from the existing ones. So, a function used for calculating the address offset has been added. Fixes: `76d794566d` ("net/hns3: maximize queue number") Fixes: `27911a6e62` ("net/hns3: add Rx interrupts compatibility") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>	2021-01-29 18:16:12 +01:00
Huisong Li	86c551d1d8	net/hns3: move queue stats to xstats One of the hot discussions in community recently was moving queue stats to xstats. In this solution, a temporary 'RTE_ETH_DEV_AUTOFILL_QUEUE_XSTATS' device flag is created to implement the smooth switch. And the first half of this work has been completed in the ethdev framework. Now driver needs to remove the flag from the driver initialization process and does the rest of work. For better readability and reasonability, per-queue stats also should be cleared when rte_eth_stats is cleared. Otherwise, the sum of one item in per-queue stats may be greater than corresponding item in rte_eth_stats. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2021-01-29 18:16:11 +01:00
Huisong Li	9b77f1fe30	net/hns3: encapsulate DFX stats in datapath pkt_len_errors and l2_errors in Rx datapath indicate that driver needs to discard received packets. And driver does not discard packets for l3/l4/ol3/ol4_csum_errors in Rx datapath and others stats in Tx datapath. Therefore, it is necessary for improving code readability and maintainability to encapsulate error stats and dfx stats. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2021-01-29 18:16:11 +01:00
Bruce Richardson	df96fd0d73	ethdev: make driver-only headers private The rte_ethdev_driver.h, rte_ethdev_vdev.h and rte_ethdev_pci.h files are for drivers only and should be a private to DPDK and not installed. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com> Acked-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Steven Webster <steven.webster@windriver.com>	2021-01-29 20:59:09 +01:00
Lijun Ou	1f089a3a80	net/hns3: use C11 atomics builtins for resetting Use C11 atomic builtins with explicit ordering instead of rte_atomic ops with the resetting member of hns3_reset_data structure. Signed-off-by: Lijun Ou <oulijun@huawei.com>	2021-01-19 03:30:32 +01:00
Ruifeng Wang	21c4f1c7b2	net/hns3: fix build with SVE Building with SVE extension enabled stopped with error: error: ACLE function ‘svwhilelt_b64_s32’ requires ISA extension ‘sve’ 18 \| #define PG64_256BIT svwhilelt_b64(0, 4) This is caused by unintentional cflags reset. Fixed the issue by not touching cflags, and using flags defined by compiler. Fixes: `952ebacce4` ("net/hns3: support SVE Rx") Cc: stable@dpdk.org Signed-off-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com>	2021-01-14 16:42:25 +01:00
Chengchang Tang	a1e7e04bac	net/hns3: fix HW exception for unbalanced Rx/Tx queues For kupeng 930, there are 3 registers to control the enable status of a TQP(i.e. task queue pair, include a txq and a rxq). One of them controls whether the TQP is enabled, and the other two controls whether the rxq and txq are enabled. The registers used to control the enabled status of the rxq and txq are enabled by default. Therefore, after the TQP is enabled, the rxq and txq are enabled by default. Currently, when the number of rxq is not equal to the number of txq, the unused rxqs or txqs are not disabled by driver, so these unused queues will be enabled in this situation. And the related HW rings have not been initialized which could lead to a hardware exception. This patch fix it by disable these unused queues during enable the TQPs. Fixes: `fa29fe45a7` ("net/hns3: support queue start and stop") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-20 21:10:05 +01:00
Lijun Ou	ee1607167b	net/hns3: remove some blank lines According to the rule of the static check tools that arrange blank lines properly to keep the code compact, here remove some unnecessary blank line to fix the above rule warning. Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-13 19:43:26 +01:00
Chengchang Tang	80ec1bbd5b	net/hns3: fix queue state after reset FLR operation will reset the queue enabling state and the driver needs to restore the state after reset. If the driver does not restore the state, it will result in unpredictable behavior with reset when user start or stop queue by calling the relevant function if. This patch fix it by add a queue enabling state restore function to the reset handler. Fixes: `fa29fe45a7` ("net/hns3: support queue start and stop") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-13 19:43:26 +01:00
Hongbo Zheng	2427c27e03	net/hns3: use correct logging format specifiers In current driver print log function, some print format symbols does not match with the actual variable types. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-13 19:43:25 +01:00
Lijun Ou	445b0c8eba	net/hns3: cleanup includes Some header files have included by others. Also, some header files have a header file self-contained error will trigger building warning. As a result, it is unnecessary and move it into the correct location. Beside, here also remove some unused lines. Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:07 +01:00
Hongbo Zheng	44df0175dd	net/hns3: check quantity limiter support before using it If hardware does not support QL (quantity limiter), the int_ql_max is 0, software should confirm ql_value is less than int_ql_max before write QL register. This patch add check of int_ql_max value from firmware and delete the unused variable coalesce_mode. Fixes: `27911a6e62` ("net/hns3: add Rx interrupts compatibility") Cc: stable@dpdk.org Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:07 +01:00
Chengchang Tang	938eb23693	net/hns3: support VXLAN-GPE TSO and checksum Kupeng920 support tso and checksum offload for VXLAN_GPE with the next protocol id 3(i.e., Ethernet). Kupeng930 support TSO and checksum offload for VXLAN_GPE with the next protocol id 1,2,3(i.e., IPv4, IPv6 and Ethernet). This patch add support for this tunnel type. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:07 +01:00
Chengchang Tang	fb6eb9009f	net/hns3: fix Tx checksum with fixed header length Currently, the header length of all the layers are fixed, It would lead to a csum error when the header length changed. This patch fixes above problem by using the header length in mbuf instead of the fixed header length to perform the TX cksum offload. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:07 +01:00
Chengchang Tang	f9edada651	net/hns3: fix Tx checksum outer header prepare Currently, there are two mistakes in Tx checksum outer header prepare. 1) Check whether the packet outer header is IPV4 based on PKT_TX_IPV4 which is incorrect. 2) For HIP08, the outer UDP cksum could not be offloaded. And driver should ensure the outer udp cksum filed set to 0. In current code, PKT_TX_UDP_CKSUM is used to determine whether the outer layer of the packet is a UDP header. Actually, for tunnel TSO, the flag will never be set. For the first mistake, it is fixed by replacing PKT_TX_IPV4 with PKT_TX_OUTER_IPV4. And the protocol number in L3 header is used to check whether the outer L4 header is UDP. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Fixes: `6dca716c9e` ("net/hns3: support TSO") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:07 +01:00
Chengchang Tang	821496d214	net/hns3: fix clearing HW ring after queue stop Currently, the rx HW ring is not cleared after queue stop. When there are packets remaining in the HW rings and the queues have been stopped, if upper layer user calls the rx_burst function at this time, an illegal memory access will occur due to the sw rings has been released. This patch fix this by reset the sw ring after disable the queue. Fixes: `fa29fe45a7` ("net/hns3: support queue start and stop") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:06 +01:00
Huisong Li	708ecc07d2	net/hns3: fix data type to store queue number Currently, u8 type variable is used to control to release fake queues in hns3_fake_rx/tx_queue_config function. Although there is no case in which more than 256 fake queues are created in hns3 network engine, it is unreasonable to compare u8 variable with u16 variable. Fixes: `a951c1ed3a` ("net/hns3: support different numbers of Rx and Tx queues") Cc: stable@dpdk.org Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:06 +01:00
Chengchang Tang	0e98d5e6d9	net/hns3: fix packet type report in Rx Currently, hns3 supports recognizing a lot of ptypes, but most tunnel packet types are not reported to the API rte_eth_dev_get_supported_ptypes. And there are some errors in L2 and L3 packet recognition. The ARP and LLDP are classified to L3 field in RX descriptor. So, the ptype of LLDP and ARP packets will be set twice. And ptypes are assigned by bitwise OR, which will eventually cause the ptype result to be incorrect. Besides, when a packet with only L2 header, its ptype will not report by hns3 PMD. This is because the L2/L3 ptype table is not initialized properly. In this case, the table query result is 0 by default. As a result, it fixes missing supported ptypes and the mistake in L2/L3 packet recognition and the unreported L2 packet ptype by reporting its L2 type when the L3 type unrecognized.. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:06 +01:00
Lijun Ou	2be91035b3	net/hns3: get number of used descriptors of Rx queue Implement the available and used rxd number count function. In Kunpeng series, the NIC hardware supports to read the bd numbers which wait processed from the hardware FBD (Full Buffer Descriptor), and the driver maintains the bd number to be written back hardware. Compare the number of FBDs with the number of BDs to be written back to the hardware. The number of used descriptors of a rx queue is computed as follows: The fbd numbers of reading from FBD register plus the bd numbers to be written back to hardware maintained by the driver. Signed-off-by: Lijun Ou <oulijun@huawei.com>	2020-11-03 23:35:06 +01:00
Chengwen Feng	f0c243a6cb	net/hns3: support SVE Tx This patch adds SVE vector instructions to optimize Tx burst process. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com>	2020-10-16 19:48:19 +02:00
Wei Hu (Xavier)	952ebacce4	net/hns3: support SVE Rx This patch adds SVE vector instructions to optimize Rx burst process. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>	2020-10-16 19:48:19 +02:00
Chengchang Tang	fa29fe45a7	net/hns3: support queue start and stop The new generation hns3 network engine supports independent enabling and disabling of a single Tx/Rx queue. So, it can support the queue start and stop feature. In addition, when different numbers of Tx and Rx queues need to be enabled in some applications, hns3 pmd does not need to create fake queues to enable these scenarios. This patch Add queue start and stop feature for the new generation hns3 networking engine. Cancel the creation of fake queue on the new generation network engine. And the previously improperly named queue related function was renamed to improve readability. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Wei Hu (Xavier)	76d794566d	net/hns3: maximize queue number The maximum number of queues for hns3 PF and VF driver is 64 based on hns3 network engine with revision_id equals 0x21. Based on hns3 network engine with revision_id equals 0x30, the hns3 PF PMD driver can support up to 1280 queues, and hns3 VF PMD driver can support up to 128 queues. The following points need to be modified to support maximizing queue number and maintain better compatibility: 1) Maximizing the number of queues for hns3 PF and VF PMD driver In current version, VF is not supported when PF is driven by hns3 PMD driver. If maximum queue numbers allocated to PF PMD driver is less than total tqps_num allocated to this port, all remaining number of queues are mapped to VF function, which is unreasonable. So we fix that all remaining number of queues are mapped to PF function. Using RTE_LIBRTE_HNS3_MAX_TQP_NUM_PER_PF which comes from configuration file to limit the queue number allocated to PF device based on hns3 network engine with revision_id greater than 0x30. And PF device still keep the maximum 64 queues based on hns3 network engine with revision_id equals 0x21. Remove restriction of the macro HNS3_MAX_TQP_NUM_PER_FUNC on the maximum number of queues in hns3 VF PMD driver and use the value allocated by hns3 PF kernel netdev driver. 2) According to the queue number allocated to PF device, a variable array for Rx and Tx queue is dynamically allocated to record the statistics of Rx and Tx queues during the .dev_init ops implementation function. 3) Add an extended field in hns3_pf_res_cmd to support the case that numbers of queue are greater than 1024. 4) Use new base address of Rx or Tx queue if QUEUE_ID of Rx or Tx queue is greater than 1024. 5) Remove queue id mask and use all bits of actual queue_id as the queue_id to configure hardware. 6) Currently, 0~9 bits of qset_id in hns3_nq_to_qs_link_cmd used to record actual qset id and 10 bit as VLD bit are configured to hardware. So we also need to use 11~15 bits when actual qset_id is greater than 1024. 7) The number of queue sets based on different network engine are different. We use it to calculate group number and configure to hardware in the backpressure configuration. 8) Adding check operations for number of Rx and Tx queue user configured when mapping queue to tc Rx queue numbers under a single TC must be less than rss_size_max supported by a single TC. Rx and Tx queue numbers are allocated to every TC by average. So Rx and Tx queue numbers must be an integer multiple of 2, or redundant queues are not available. 9) We can specify which packets enter the queue with a specific queue number, when creating flow table rules by rte_flow API. Currently, driver uses 0~9 bits to record the queue_id. So it is necessary to extend one bit field to record queue_id and configure to hardware, if the queue_id is greater than 1024. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Wei Hu (Xavier)	dd1e461182	net/hns3: add TSO pseudo header calculation compatibility In kunpeng 920, when process pkts which need TSO, the network driver need to erase the L4 len value of the TCP TSO pseudo header and recalculate the pseudo header checksum. kunpeng930 support not need to erase the L4 len value of the TCP TSO pseudo header. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>	2020-09-30 19:19:10 +02:00
Hongbo Zheng	da17b003f3	net/hns3: add max number of segments compatibility Kunpeng 920 supports the maximum nb_segs of non-tso packet is 8 in Tx direction, kunpeng 930 expands this limit value to 18, this patch sets the corresponding value by querying the maximum number of non-tso nb_segs supported by the device during initialization. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>	2020-09-30 19:19:10 +02:00
Chengchang Tang	e788224747	net/hns3: add default case to switch in Rx VLAN processing This patch solves the static check warning as follow: "The switch statement must have a 'default' branch". Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-30 19:19:10 +02:00
Wei Hu (Xavier)	992b24a1ce	net/hns3: add VLAN configuration compatibility Because of hardware limitation based on the old version of hns3 network engine, there are some restrictions: a) HNS3 PMD driver needs select different processing mode for VLAN based on whether PVID is set which means our driver need sense the PVID states. b) For packets transmitting process, only two layer of VLAN tag is supported. If the total number of VLAN tags in mbuf and VLAN offload by hardware (VLAN insert by descriptor) exceeds two, the VLAN in mbuf will be overwritten by VLAN in the descriptor. c) If port based VLAN is set, only one VLAN header is allowed in mbuf or it will be discard by hardware. In order to solve these restriction, two change is implemented on the new versions of network engine. 1) add a new VLAN tagged insertion mode, named tag shift mode; 2) add a new VLAN strip control bit, named strip hide enable; The tag shift mode means that VLAN tag will shift automatically when the inserted place has a tag. For PMD driver, the VLAN tag1 and tag2 configurations in Tx side do not need to be considered because the hardware completes it. However, the related configuration will still be retained to be compatible with the old version of network engine. The VLAN strip hide means that hardware will strip the VLAN tag and hide VLAN in descriptor (VLAN ID exposed as zero and related STRIP_TAGP is off). These changes make it no longer necessary for the hns3 PMD driver to be aware of the PVID status and have the ability to send mult-layer (more than two) VLANs packets. Therefore, hns3 PMD driver introduces the concept of VLAN mode and adds a new VLAN mode named HNS3_PVID_MODE to indicate that PVID-related IO process can be implemented by the hardware. And VF driver does not need to be modified because the related mailbox messages will not be sent by PF kernel mode netdev driver under new network engine and all the related hardware configuration is on the PF side. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-30 19:19:10 +02:00
Chengchang Tang	e692c74691	net/hns3: add Rx buffer size to Rx queue info Report hns3 PMD configured Rx buffer size in Rx queue information query. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Reviewed-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-21 18:05:38 +02:00
Chengchang Tang	0134a5c7b4	net/hns3: fix crash when Tx multiple buffer packets Currently, there is a possibility that segment faults occur when sending packets whose payloads are stored in multiple buffers based on hns3 network engine. The related core dump information as follows: Program terminated with signal 11, Segmentation fault. 0 hns3_reassemble_tx_pkts 2512 temp = temp->next; Missing separate debuginfos, use: (gdb) bt 0 hns3_reassemble_tx_pkts 1 0x0000000000969c60 in hns3_check_non_tso_pkt 2 0x000000000096adbc in hns3_xmit_pkts 3 0x000000000050d4d0 in rte_eth_tx_burst 4 0x000000000050fca4 in pkt_burst_transmit 5 0x00000000004ca6b8 in run_pkt_fwd_on_lcore 6 0x00000000004ca7fc in start_pkt_forward_on_core 7 0x00000000006975a4 in eal_thread_loop 8 0x0000ffffa6f7fc48 in start_thread 9 0x0000ffffa6ed1600 in thread_start The root cause is that hns3 PMD driver invokes the rte_pktmbuf_free_seg API function to release the same rte_mbuf multiple times. The rte_mbuf pointer is not set to NULL in the internal function hns3_rx_queue_release_mbufs which is invoked during queue setup, stop and close. As a result the rte_mbuf in Rx queues will be repeatedly released when the user application setup queues or stop/start the dev for multiple times. Probably for performance reasons, DPDK mempool lib does not check for the repeated rte_mbuf releases. The Address of released rte_mbuf are directly stored into the per lcore cache of the mempool. This makes the rte_mbufs obtained from mempool by calling rte_mempool_get_bulk API function repetitively. ultimately, it causes to access to a NULL pointer in PMD driver. This patch fixes this problem by setting released mbuf pointer to NULL in the internal function named hns3_rx_queue_release_mbuf. And the other internal function named hns3_reassemble_tx_pkts is optimized to avoid a similar problem. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2020-09-21 18:05:38 +02:00
Wei Hu (Xavier)	a3d4f4d291	net/hns3: support NEON Rx This patch adds NEON vector instructions to optimize Rx burst process. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com>	2020-09-21 18:05:38 +02:00
Wei Hu (Xavier)	e31f123db0	net/hns3: support NEON Tx This patch adds NEON vector instructions to optimize Tx burst process. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2020-09-21 18:05:38 +02:00
Wei Hu (Xavier)	7ef933908f	net/hns3: add simple Tx path This patch adds simple Tx process function. When multiple segment packets are not needed, Which means that DEV_TX_OFFLOAD_MBUF_FAST_FREE offload is not set, we can simple Tx process. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2020-09-21 18:05:38 +02:00
Wei Hu (Xavier)	521ab3e933	net/hns3: add simple Rx path This patch adds simple Rx process function and support chose Rx function by real Rx offloads capability. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com>	2020-09-21 18:05:38 +02:00
Wei Hu (Xavier)	323df8941b	net/hns3: reduce address calculation in Rx This patch adds the internal function named hns3_write_reg_opt to avoid performance loss from address calculation during register access in the '.rx_pkt_burst' ops implementation function named hns3_recv_pkts. In addition, because hardware always access register in little-endian mode based on hns3 network engine, so driver should also call rte_cpu_to_le_32 to convert data in little-endian mode before writing register and call rte_le_to_cpu_32 to convert data after reading from register. Here the driver encapsulates the data conversion operation in the register read/write operation function as below: hns3_write_reg hns3_write_reg_opt hns3_read_reg Therefore, when calling these functions, conversion is not required again. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-21 18:05:38 +02:00
Wei Hu (Xavier)	ceabee45be	net/hns3: report Rx free threshold This patch reports .rx_free_thresh value in the .dev_infos_get ops implementation function named hns3_dev_infos_get and hns3vf_dev_infos_get. In addition, the name of the member variable of struct hns3_rx_queue is modified and comments are added to improve code readability. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-21 18:05:38 +02:00
Wei Hu (Xavier)	395b5e08ef	net/hns3: add Tx short frame padding compatibility There are difference about padding ultra-short frame in Tx procession for different versions of hardware network engine. If packet length is less than minimum packet length supported by hardware in Tx direction, driver need to pad it to avoid error. The minimum packet length in Tx direction is 33 based on kunpeng 920, and 9 based on kunpeng 930. Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>	2020-09-18 18:55:07 +02:00
Wei Hu (Xavier)	27911a6e62	net/hns3: add Rx interrupts compatibility There are difference about queue's interrupt configurations for different versions of hardware network engine, such as queue's interrupt mapping mode, coalesce configuration, etc. The following uses the configuration differences of the interrupt mapping mode as an example. 1) For some versions of hardware network engine, such as kunpeng 920, because of the hardware constraint, we need implement unmmapping relationship configurations by binding all queues to the last interrupt vector and reserving the last interrupt vector. This results in a decrease of the maximum queues when upper applications call the rte_eth_dev_configure API function to enable Rx interrupt. 2) And for another versions, such as kunpeng 930, hns3 PMD driver can map/unmmap all interrupt vectors with queues when Rx interrupt is enabled. This patch resolves configuration differences about Rx interrupts based on kunpeng 920 and kunpeng 930. Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-18 18:55:07 +02:00
Huisong Li	091a0f95b5	net/hns3: support getting queue information This patch adds support for querying Rx/Tx queue information. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-18 18:55:06 +02:00
Wei Hu (Xavier)	a02f1461c7	net/hns3: report Rx drop packets enable configuration Currently, if there are not available Rx buffer descriptors in receiving direction based on hns3 network engine, incoming packets will always be dropped by hardware. This patch reports the '.rx_drop_en' information to DPDK framework in the '.dev_infos_get', '.rxq_info_get' and '.rx_queue_setup' ops implementation function. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-09-18 18:55:06 +02:00
Min Hu (Connor)	8973d7c4ca	net/hns3: support keeping CRC CRC is the end of frame, which occupies 4 bytes. Keeping CRC is a feature of MAC, which will not strip CRC field when receiving frames. The feature can be enabled using DEV_RX_OFFLOAD_KEEP_CRC offload by upper level application. And the feature is only supported for hns3 PF PMD driver, not supported for hns3 VF PMD driver Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-07-17 18:21:21 +02:00
Wei Hu (Xavier)	fc9b57ff57	net/hns3: fix inserted VLAN tag position in Tx Based on hns3 network engine, in order to configure hardware VLAN insert offload in Tx direction, PMD driver reads the VLAN tags from the vlan_tci_outer and vlan_tci of the structure rte_mbuf, fills them into the Tx Buffer Descriptor and sets the related offload flag for every packet. Currently, there are two VLAN related problems in the 'tx_pkt_burst' ops implementation function: 1) When setting the related offload flag, PMD driver inserts the VLAN tag into the position that close to L3 header. So, when upper application sends a packet with a VLAN tag in the data buffer, the VLAN offloaded by hardware will be added to the wrong position. It is supposed to add the VLAN tag from the rte_mbuf to the position close to the MAC header in the packet when using VLAN insertion. And when PF PVID is enabled by calling the API function named rte_eth_dev_set_vlan_pvid or VF PVID is enabled by hns3 PF kernel ether driver, the VLAN tag from the structure rte_mbuf to enable the VLAN insertion should be filled into the position that close to L3 header to avoid to be overwritten by the PVID which will always be inserted in the position that close to the MAC address. 2) When sending multiple segment packets, VLAN information is required to be filled into the first Tx Buffer descriptor. However, currently hns3 PMD driver incorrectly placed it in the last Tx Buffer Descriptor. This results in VLAN insert offload failure when sending multiple segment packets. This patch fixed them by filling the VLAN information into the position of the Tx Buffer Descriptor. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com>	2020-07-07 23:38:28 +02:00
Chengchang Tang	a001f09d11	net/hns3: cleanup duplicated code on processing TSO in Tx This patch fixes up paylen calculation twice when processing TSO request in the '.tx_pkt_burst' ops implementation function to avoid performance loss. Fixes: `6dca716c9e` ("net/hns3: support TSO") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com>	2020-07-07 23:38:28 +02:00
Wei Hu (Xavier)	6c44219f99	net/hns3: fix reassembling multiple segment packets in Tx Because of the hardware constraints, hns3 network engine doesn't support sending packets with more than eight fragments. And hns3 pmd driver tries to reassemble these kind of packets to meet hardware requirements. Currently, there are two problems: 1) when the input buffer_len * 8 < pkt_len, the packets are impossible to be reassembled into 8 Buffer Descriptors. In this case, the packets will be passed to hardware, which eventually causes a hardware reset. 2) The meta data in origin packets which are required to fill into the descriptor haven't been copied into the reassembled pkts. This patch adds a check for 1) to ensure such packets will be dropped by driver and copies useful meta data from the origin packets to the reassembled packets. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2020-07-07 23:38:26 +02:00
Wei Hu (Xavier)	dfac40d93e	net/hns3: fix Rx buffer size Currently, rx_buf_size of hns3 PMD driver is fixed on, and it's value depends on the firmware which will decrease the flexibility of PMD. The receive side mbufs was allocated from the mempool given by upper application calling rte_eth_rx_queue_setup API function. So the memory chunk used for net device DMA is depend on the data room size of the objects in this mempool. Hns3 PMD driver should set the rx_buf_len smaller than the data room size of mempool and our hardware only support the following four specifications: 512, 1024, 2148 and 4096. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-07-07 23:38:26 +02:00
Chengchang Tang	b4e4d7ac9f	net/hns3: support setting VF PVID by PF driver This patch adds support setting VF PVID by hns3 PF kernel ethdev driver on the host by "ip link set <eth num> vf <vf id> vlan <vlan tag>" command. Because of the hardware constraints, the striped VLAN tag will always in Rx descriptors which should has been dropped when PVID is enabled and the PVID will overwrite the outer VLAN tag in Tx descriptor. So, hns3 PMD driver need to change the processing of VLAN tags in the process of Tx and Rx according to whether PVID is enabled. 1) If the hns3 PF kernel ethdev driver sets the PVID for VF device before the initialization of the related VF device, hns3 VF PMD driver should get the PVID state from PF driver through mailbox and update the related state in txq and rxq maintained by hns3 VF driver to change the process of Tx and Rx. 2) If the hns3 PF kernel ethdev driver sets the PVID for VF device after initialization of the related VF device, the PF driver will notify VF driver to update the PVID state. The VF driver will update the PVID configuration state immediately to ensure that the VLAN process in Tx and Rx is correct. But in the window period of this state transition, packets loss or packets with wrong VLAN may occur. 3) Due to hardware limitations, we only support two-layer VLAN hardware offload in Tx direction based on hns3 network engine, so when PVID is enabled, QinQ insert is no longer supported. And when PVID is enabled, in the following two cases: i) packets with more than two VLAN tags. ii) packets with one VLAN tag while the hardware VLAN insert is enabled. The packets will be regarded as abnormal packets and discarded by hardware in Tx direction. For debugging purposes, a validation check for these types of packets is added to the '.tx_pkt_prepare' ops implementation function named hns3_prep_pkts to inform users that these packets will be discarded. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com>	2020-07-07 23:38:26 +02:00
Chengchang Tang	8c7449779c	net/hns3: decrease non-nearby memory access in Rx Currently, hns3 PMD driver needs know the PVID configuration state and do different processing in the 'rx_pkt_burst' ops implementation function. This patch adds a member to struct hns3_rx_queue/hns3_tx_queue of the driver to indicate the PVID configuration status, so it isn't need to access other data structure in the 'rx_pkt_burst' ops implementation, to avoid performance loss because of reducing cache miss. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-07-07 23:38:26 +02:00
Wei Hu (Xavier)	1f295c40da	net/hns3: support LRO This patch adds support of LRO offload for hns3 PMD driver. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-07-07 23:38:26 +02:00
Hongbo Zheng	b68259f775	net/hns3: check TSO segment size during Tx Base on hns3 network engine, when the rte_eth_tx_burst API is called by Upper Level Process, if PKT_TX_TCP_SEG flag is set and tso_segsz is 0 in the input parameter structure rte_mbuf, hns3 PMD driver will process this packet as an non-TSO packet, otherwise hardware will enter an abnormal state. Fixes: `6dca716c9e` ("net/hns3: support TSO") Cc: stable@dpdk.org Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-06-05 11:32:08 +02:00
Wei Hu (Xavier)	e28bc14765	net/hns3: fix VLAN tags reported in Rx Currently, based on hns3 network engine, driver always reports the incoming packet's VLAN tags to the structure rte_mbuf those are the output parameter pointers in '.rx_pkt_burst' ops implementation function, and never reports PKT_RX_VLAN_STRIPPED flag to the structure rte_mbuf even if Upper Level Process configured hardware strip by calling rte_eth_dev_configure or rte_eth_dev_set_vlan_offload API function. It makes the ULP unable to know the stripping of VLAN. It is supposed to present the stripped flags to the mbuf ol_flags, and report the right VLAN tag. And as hardware constraints, the stripped VLAN tag will always in the Rx descriptor. Even if setting a PVID based on the function, the PVID will be reported to the Rx descriptor. So the driver need to determine which VLAN tag should be reported to output the structure rte_mbuf in '.rx_pkt_burst' ops implementation function named hns3_recv_pkts. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Fixes: `411d23b9ea` ("net/hns3: support VLAN") Cc: stable@dpdk.org Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-06-05 11:32:08 +02:00
Wei Hu (Xavier)	16c374402f	net/hns3: fix Tx less than 60 bytes Currently, when running testpmd application based on hns3 network engine with csum fwd mode by "set fwd csum" command in the prompt line, sending 42 consecutive bytes of ARP packets to network port with packets generator. But in fact hardware can't send the ARP packets and the related logs as below: "Preparing packet burst to failed: Invalid argument" The hardware doesn't support transmit packets less than 60 bytes, and in the '.tx_pkt_burst' ops implementation function named hns3_xmit_pkts appending operation has been added for less than 60 bytes packets. So the interception needs to be removed in the '.tx_pkt_prepare' ops implementation function named hns3_prep_pkts. Fixes: `de620754a1` ("net/hns3: fix sending packets less than 60 bytes") Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Hao Chen <chenhao164@huawei.com> Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>	2020-05-28 17:57:07 +02:00
Wei Hu (Xavier)	c4b7d6761d	net/hns3: get Tx abnormal errors in xstats When upper level application calls the rte_eth_tx_burst API function to send multiple packets at a time with burst mode based on hns3 network engine, there are some abnormal conditions that cause the driver to fail to operate the hardware to send packets correctly. This patch adds some statistic counts for the abnormal errors of Tx data path to the extend device statistics. The upper level application can get them by calling the rte_eth_xstats_get API function. Note: When using burst mode to call the rte_eth_tx_burst API function to send multiple packets at a time. When the first abnormal error is detected, add one to the relevant error statistics item, and then exit the loop of sending multiple packets of the function. That is to say, even if there are multiple packets in which abnormal errors may be detected in the burst, the relevant error statistics in the driver will only be increased by one. The detail description of the Tx abnormal errors statistic items as below: - TX_OVER_LENGTH_PKT_CNT Total number of greater than HNS3_MAX_FRAME_LEN the driver supported. - TX_EXCEED_LIMITED_BD_PKT_CNT Total number of exceeding the hardware limited bd which process a packet needed bd numbers. - TX_EXCEED_LIMITED_BD_PKT_REASSEMBLE_FAIL_CNT Total number of exceeding the hardware limited bd fail which process a packet needed bd numbers and reassemble fail. - TX_UNSUPPORTED_TUNNEL_PKT_CNT Total number of unsupported tunnel packet. The unsupported tunnel type: vxlan_gpe, gtp, ipip and MPLSINUDP, MPLSINUDP is a packet with MPLS-in-UDP RFC 7510 header. - TX_QUEUE_FULL_CNT Total count which the available bd numbers in current bd queue is less than the bd numbers with the pkt process needed. - TX_SHORT_PKT_PAD_FAIL_CNT Total count which the packet length is less than minimum packet size HNS3_MIN_PKT_SIZE and fail to be appended with 0. Signed-off-by: Lijun Ou <oulijun@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Hao Chen <chenhao164@huawei.com>	2020-05-05 15:54:26 +02:00
Chengwen Feng	c4ae39b2cf	net/hns3: fix Rx interrupt after reset Currently, Rx interrupt cannot work normally after reset (such as FLR, global reset and IMP reset), when running l3fwd-power application based on hns3 network engine. The root cause is that the hardware configuration about Rx interrupt does not recover after reset. This patch fixes it with the following modification. 1. The internal static function named hns3(vf)_init_ring_with_vector is moved from hns3_init_pf to hns3(vf)_init_hardware because hns3(vf)_init_hardware is called both in the initialization and the RESET_STAGE_DEV_INIT stage of the reset process. 2. The internal static function named hns3(vf)_restore_rx_interrupt is added in hns3(vf)_restore_conf, it is used to recover hardware configuration about interrupt vectors of rx queues in the RESET_STAGE_DEV_INIT stage of the reset process. 3. The internal static function named hns3_dev_all_rx_queue_intr_enable and hns3_enable_all_queues are added in hns3(vf)_dev_start(which called in the initialization, so after calling the rte_eth_dev_start API successfully, the driver is ready to work. 4. The function named hns3_dev_all_rx_queue_intr_enable and hns3_enable_all_queues are also added in hns3(vf)_start_service(which called in the RESET_STAGE_DEV_INIT stage of the reset process), so after start_service, the driver is ready to work. Note: 1. Because FLR will clear queue's interrupt enable bit hardware configuration, so we add calling hns3_dev_all_rx_queue_intr_enable to enable interrupt before enabling queues. 2. After finished the initialization, we can enable queues to work by calling the internal function named hns3_enable_all_queues. Fixes: `02a7b55657` ("net/hns3: support Rx interrupt") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com>	2020-04-21 13:57:07 +02:00
Chengwen Feng	af531efa4b	net/hns3: fix packets offload features flags in Rx Currently there is a certain probability of the unexpected ol_flag of the Rx packets's rte_mbuf when receiving packets. The root cause as below: 1. The member variable named ol_flag of the structure named rte_mbuf is not properly initialized to zero in the '.rx_pkt_burst' ops implementation function named hns3_recv_pkts. 2. When multi-segment rte_mbufs are needed for long packet in Rx operation, the driver should assign value to the ol_flag of the first segment, not to the ol_flag of the last segment. This patch fixes it with the following modification in the '.rx_pkt_burst' ops implementation function named hns3_recv_pkts. 1. Where the first write operation in the '.rx_pkt_burst' ops implementation function, assign PKT_RX_RSS_HASH to ol_flags directly using '=' operation instead of '\|=' operation. 2. In the static function named hns3_rx_set_cksum_flag, the last rte_mbuf's ol_flags should be assigned when processing multi-segment. We fix it by passing first_seg variable to the function instead of rxm(the last segment's address). Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Fixes: `ad7cf94823` ("net/hns3: fix offload flag for RSS hash") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-04-21 13:57:04 +02:00
Wei Hu (Xavier)	ef2e785c36	net/hns3: fix Tx interrupt when enabling Rx interrupt Currently, when receiving and transmitting packets based on hns3 network engine there are probably unexpected and redundant Tx interrupts if Rx interrupt is enabled. The root cause as below: Tx and Rx queues with the same number share the interrupt vector in hns3 network engine, and in this case there are the residual hardware mapping relationship configuration between queue and interrupt vector configured in hns3 kernel ethdev driver. We should clear the all hardware mapping relationship configurations in the initialization. Because of the hardware constraints, we have to implement clearing the relationship by binding all queues to the last interrupt vector and reserving the last interrupt vector, this method results in a decrease of the maximum queues when upper applications call the rte_eth_dev_configure API function to enable Rx interrupt. Fixes: `02a7b55657` ("net/hns3: support Rx interrupt") Cc: stable@dpdk.org Signed-off-by: Hao Chen <chenhao164@huawei.com> Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Lijun Ou <oulijun@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-03-18 10:21:42 +01:00
Hongbo Zheng	6dca716c9e	net/hns3: support TSO This patch adds TCP segment offload support for hns3 PMD driver. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-03-18 10:21:42 +01:00
Chengwen Feng	8162238b7d	net/hns3: replace memory barrier with data dependency order This patch optimizes the Rx performance by using data dependency ordering to instead of memory barrier which is rte_cio_rmb in the '.rx_pkt_burst' ops implementation function named hns3_recv_pkts. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-01-17 19:59:19 +01:00
Wei Hu (Xavier)	ffd0ec015b	net/hns3: add free threshold in Rx This patch optimizes the Rx performance by adding the rx_free_thresh related process in the '.rx_pkt_burst' ops implementation function named hns3_recv_pkts. The related change as follows: 1. Adding the rx_free_thresh related process to reduce the number of writing the HNS3_RING_RX_HEAD_REG register. 2. Adjusting the internal macro named DEFAULT_RX_FREE_THRESH to 32 and adjusting HNS3_MIN_RING_DESC to 64 to make the effect of the thresh more obvious. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-01-17 19:46:26 +01:00
Wei Hu (Xavier)	5cf7a75b2c	net/hns3: remove one IO barrier in Rx When receiving a packet, hns3 hardware network engine firstly writes the packet content to the memory pointed by the 'addr' field of the Rx Buffer Descriptor, secondly fills the result of parsing the packet include the valid field into the Rx Buffer Descriptor in one write operation, and thirdly writes the number of the Buffer Descriptor not processed by the driver to the HNS3_RING_RX_FBDNUM_REG register. This patch optimizes the Rx performance by removing one rte_io_rmb call in the '.rx_pkt_burst' ops implementation function named hns3_recv_pkts. The change as follows: 1. Driver no longer read HNS3_RING_RX_FBDNUM_REG register, so remove one rte_io_rmb call, and directly read the valid flag of Rx Buffer Descriptor to check whether the BD is ready. 2. Delete the non_vld_descs field from the statistic information of the hns3 driver because now it has become a common case that the valid flag of Rx Buffer Descriptor read by the driver is invalid. Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-01-17 19:46:26 +01:00
Yisen Zhuang	eb570862a2	net/hns3: reduce judgements of free Tx ring space This patch reduces the number of the judgement of the free Tx ring space in the 'tx_pkt_burst' ops implementation function to avoid performance loss. According to hardware constraints, we need to reserve a Tx Buffer Descriptor in the TX ring in hns3 network engine. Signed-off-by: Yisen Zhuang <yisen.zhuang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-01-17 19:46:26 +01:00
Wei Hu (Xavier)	a951c1ed3a	net/hns3: support different numbers of Rx and Tx queues Hardware does not support individually enable/disable/reset the Tx or Rx queue in hns3 network engine, driver must enable/disable/reset Tx and Rx queues at the same time. Currently, hns3 PMD driver does not support the scenarios as below: 1) When calling the following function, the input parameter nb_rx_q and nb_tx_q are not equal. rte_eth_dev_configure(uint16_t port_id, uint16_t nb_rx_q, uint16_t nb_tx_q, const struct rte_eth_conf *dev_conf); 2) When calling the following functions to setup queues, the cumulatively setup Rx queues are not the same as the setup Tx queues. rte_eth_rx_queue_setup(uint16_t port_id, uint16_t rx_queue_id,,,); rte_eth_tx_queue_setup(uint16_t port_id, uint16_t tx_queue_id,,,); However, these are common usage scenarios in some applications, such as, l3fwd, ip_ressmbly and OVS-DPDK, etc. This patch adds support for this usage of these functions by setup fake Tx or Rx queues to adjust numbers of Tx/Rx queues. But these fake queues are imperceptible, and can not be used by upper applications. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-01-17 19:46:26 +01:00
Wei Hu (Xavier)	27f9707785	net/hns3: remove unnecessary assignments in Tx This patch removes the unnecessary assignment in the '.tx_pkt_burst' ops implementation function to avoid performance loss. Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Yisen Zhuang <yisen.zhuang@huawei.com>	2020-01-17 19:46:01 +01:00
Huisong Li	89c04d8117	net/hns3: remove custom macro for minimum length This patch replaces custom macro named HNS3_MIN_FRAME_LEN for ethernet minimum frame length with the macro named RTE_ETHER_MIN_LEN that defined in DPDK framework. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-01-17 19:46:01 +01:00
Hao Chen	02a7b55657	net/hns3: support Rx interrupt This patch adds supports of receive packets through interrupt mode for hns3 PF/VF driver. The following ops functions should be implemented defined in the struct eth_dev_ops: rx_queue_intr_enable rx_queue_intr_disable Signed-off-by: Hao Chen <chenhao164@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-01-17 19:46:01 +01:00
Wei Hu (Xavier)	8f64f2846d	net/hns3: fix checking enough Tx BDs In .tx_pkt_burst ops implementation function of hns3 PMD driver, there is one check whether there are enough BDs in the TX queue. If not, driver will stop sending the packets. Currently in the 'for' process loop, the next_to_use member of TX queue is not updated in time after processing BDs of one packet, which results in the invalid action of checking whether there are enough BDs and failure in sending packets. This patch fixes it by moving the assignment statment of the next_to_use member of TX queue to the place after porcessing TX BDs in the 'for' loop. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2019-11-26 18:05:15 +01:00
Wei Hu (Xavier)	de620754a1	net/hns3: fix sending packets less than 60 bytes Ethernet minimum packet length is 64 bytes. If upper application sends packets with less than 60 bytes in length(no CRC), driver adds padding processing to avoid failure. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Cc: stable@dpdk.org Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2019-11-26 18:05:15 +01:00
Wei Hu (Xavier)	ad7cf94823	net/hns3: fix offload flag for RSS hash This patch adds PKT_RX_RSS_HASH flag to rx packet's ol_flags to repair the bug that hns3 pmd driver doesn't set PKT_RX_RSS_HASH flag. In hns3 network engine RSS is always enabled. Fixes: `bba6366983` ("net/hns3: support Rx/Tx and related operations") Signed-off-by: Hao Chen <chenhao164@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2019-10-25 19:23:23 +02:00
Hao Chen	1e627b8d99	net/hns3: fix statistics This patch fixes the statistics problems for sending and receiving message as belows: 1.In receiving direction, for FCS error messages, drivers should not record them in rte_eth_stats.ipackets statistics. 2.In sending direction, for messages of illegal length, too long or equals 0, drivers should not notify the network card hardware to send them, should not continue to send the remaining message in burst, and record them in rte_eth_stats.opackets statistics. Fixes: `8839c5e202` ("net/hns3: support device stats") Signed-off-by: Hao Chen <chenhao164@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2019-10-25 19:23:23 +02:00
Flavia Musatescu	512d873ff1	net: add new header file for VXLAN The VXLAN related definitions and structures are moved from rte_ether.h to a new header file: rte_xvlan.h. Also introducing a new define macro for VXLAN default port id: RTE_VXLAN_DEFAULT_PORT Signed-off-by: Flavia Musatescu <flavia.musatescu@intel.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com> Tested-by: Raslan Darawsheh <rasland@mellanox.com>	2019-10-25 19:00:22 +02:00
Wei Hu (Xavier)	2790c64647	net/hns3: support device reset This patch adds reset related process for hns3 PMD driver. The following three scenarios will trigger the reset process, and the driver settings will be restored after the reset is successful: 1. Receive a reset interrupt 2. PF receives a hardware error interrupt 3. VF is notified by PF to reset Signed-off-by: Chunsong Feng <fengchunsong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Signed-off-by: Hao Chen <chenhao164@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2019-10-07 15:00:57 +02:00
Wei Hu (Xavier)	bba6366983	net/hns3: support Rx/Tx and related operations This patch adds queue related operation, package sending and receiving function codes. Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Chunsong Feng <fengchunsong@huawei.com> Signed-off-by: Min Wang (Jushui) <wangmin3@huawei.com> Signed-off-by: Min Hu (Connor) <humin29@huawei.com> Signed-off-by: Hao Chen <chenhao164@huawei.com> Signed-off-by: Huisong Li <lihuisong@huawei.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2019-10-07 15:00:57 +02:00

1 2 3

142 Commits