numam-dpdk

Author	SHA1	Message	Date
Suanming Mou	5522da6b20	net/mlx5: add option to allocate memory from system Currently, for MLX5 PMD, once millions of flows created, the memory consumption of the flows are also very huge. For the system with limited memory, it means the system need to reserve most of the memory as huge page memory to serve the flows in advance. And other normal applications will have no chance to use this reserved memory any more. While most of the time, the system will not have lots of flows, the reserved huge page memory becomes a bit waste of memory at most of the time. By the new sys_mem_en devarg, once set it to be true, it allows the PMD allocate the memory from system by default with the new add mlx5 memory management functions. Only once the MLX5_MEM_RTE flag is set, the memory will be allocate from rte, otherwise, it allocates memory from system. So in this case, the system with limited memory no need to reserve most of the memory for hugepage. Only some needed memory for datapath objects will be enough to allocated with explicitly flag. Other memory will be allocated from system. For system with enough memory, no need to care about the devarg, the memory will always be from rte hugepage. One restriction is that for DPDK application with multiple PCI devices, if the sys_mem_en devargs are different between the devices, the sys_mem_en only gets the value from the first device devargs, and print out a message to warn that. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Suanming Mou	d38e3d5266	common/mlx5: add memory management functions Add the internal mlx5 memory management functions: mlx5_malloc_mem_select(); mlx5_memory_stat_dump(); mlx5_rellaocate(); mlx5_malloc(); mlx5_free(); User will be allowed to manage memory from system or from rte memory with the unified functions. In this case, for the system with limited memory which can not reserve lots of rte hugepage memory in advanced, will allocate the memory from system for some of not so important control path objects based on the sys_mem_en configuration. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Bing Zhao	6db1bcf038	doc: update release notes and mlx5 guide for eCPRI Update the release notes of mlx5 PMD part by adding the support of eCPRI. Update the firmware configuration in the mlx5 NIC guide to support the usage of eCPRI. Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-07-21 15:44:36 +02:00
Bing Zhao	d7c49561d3	net/mlx5: add eCPRI flex parser capacity check If the NIC or the FW does not support the dynamic flex parser, it will return error when trying to create the parser for eCRPI. Then it is hard to know the detail error reason of the failure. Before creating the parser node and the following usage of the parser, the capacity bit saved in the HCA_CAP could be used to confirm if the dynamic flex parser is supported. If no, an error will be returned directly with ENOTSUP to prevent the following steps to be executed. Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-07-21 15:44:36 +02:00
Bing Zhao	1c5064044f	net/mlx5: create and destroy eCPRI flex parser eCPRI protocol has unified format layout for the variants, over ETH layer (including .1Q) and UDP layer. The common header of the message has 4 bytes fixed length, and the message payload layers are different based on the type field. Now only type #0, #2 and #5 will be supported, and 2 bytes are needed. When creating the flex parser, the header will be extended to 8 bytes and 2 DW samples are needed. The 1st DW starts from offset 0 and will be used for the type field of the common header. The 2nd DW starts from offset 4 and will be used for the physical channel ID, real-time control ID or measurement ID fields. The parser will be created once a flow with eCPRI item is observed for the first time. After creating, it will remain in the system and HW until the device is stopped. Right now, there is no need to destroy the eCPRI flex parser after the last flow with eCPRI item is destroyed. This is to get rid of the alternate states of creating and destroying eCPRI flex parser with a single eCPRI flow. Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-07-21 15:44:36 +02:00
Bing Zhao	38119ebe01	common/mlx5: add DevX command for flex parsers In order to use dynamic flex parser to parse protocols that is not supported natively, two steps are needed. Firstly, creating the parse graph node. There are three parts of the flex parser: node, arc and sample. Node is the whole structure of a flex parser, when creating, the length of the protocol should be specified. Then the input arc(s) is(are) mandatory, it will tell the HW when to use this parser to parse the packet. For a single parser node, up to 8 input arcs could be supported and it gives SW ability to support this protocol over multiple layers. The output arc is optional and also up to 8 arcs could be supported. If the protocol is the last header of the stack, then output arc should be NULL. Or else it should be specified. The protocol type in the arc is used to indicate the parser pointing to or from this flex parser node. For output arc, the next header type field offset and size should be set in the node structure, then the HW could get the proper type of the next header and decide which parser to point to. Note: the parsers have two types now, native parser and flex parser. The arc between two flex parsers are not supported in this stage. Secondly, querying the sample IDs. If the protocol header parsed with flex parser needs to used in flow rule offloading, the DW samples are needed when creating the parse graph node. The offset of bytes starting from the header needs to be set. After creating the node successfully, a general object handle will be returned. This object could be queried with Devx command to get the sample IDs. When creating a flow, sample IDs could be used to sample a DW from the parsed header - 4 continuous bytes starting from the offset. The flow entry could specify some mask to use part of this DW for matching. Up to 8 samples could be supported for a single parse graph node. The offset should not exceed the header length. The HW resources have some limitation, low layer driver error should be checked once there is a failure of creating parse graph node. Signed-off-by: Netanel Gonen <netanelg@mellanox.com> Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-07-21 15:44:36 +02:00
Bing Zhao	711aedf187	common/mlx5: add flex parser DevX structures The structures and other definitions will be used for the dynamic flex parser creation via Devx command interface. These structures will be used as some some intermediate variables and input parameters for the parser creation API. It is better to keep all members consistent with the PRM definition even though some of them will not be used. Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-07-21 15:44:36 +02:00
Bing Zhao	daa38a8924	net/mlx5: add flow translation of eCPRI header In the translation stage, the eCPRI item should be translated into the format that lower layer driver could use. All the fields that need to match must be in network byte order after translation, as well as the mask. Since the header in the item belongs to the network layers stack, and the input parameter of the header is considered to be in big-endian format already. Base on the definition in the PRM, the DW samples will be used for matching in the FTE/STE. Now, the type field and only the PC ID, RTC ID, and DLY MSR ID of the payload will be supported. The masks should be 00 ff 00 00 ff ff(00) 00 00 in the network order. Two DWs are needed to support such matching. The mask fields could be zeros to support some wildcard rules. But it makes no sense to support the rule matching only on the payload but without matching type field. The DW samples should be stored after the flex parser creation for eCPRI. There is no need to query the sample IDs each time when creating a flow rule with eCPRI item. It will not introduce insertion rate degradation significantly. Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-07-21 15:44:36 +02:00
Bing Zhao	c7eca23657	net/mlx5: add flow validation of eCPRI header When creating a flow with eCPRI header item, the validation of it is mandatory. The detailed limitations are listed below: 1. Over Ether / VLAN, ethertype must be 0xAEFE. 2. No tunnel support is described in the specification now. 3. L3 layer is only supported when L4 is UDP, see #4. 4. Over TCP is not supported from the specification, and over UDP is not supported right now. 5. Concatenation indicator matching is not supported now. 6. No need to check the revision. 7. Only type field in the common header is mandatory, and one byte should be matched integrally. 8. Fields in the message payload header are optional. 9. Only messages with type #0, #2 and #5 are supported now. Some limitations are only from software right now, because there is no need to support all the message types and variants of protocol stack listed in the specification. Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	a2854c4de1	net/mlx5: convert Rx timestamps in real-time format The ConnectX-6DX supports the timestamps in various formats, the new realtime format is introduced - the upper 32-bit word of timestamp contains the UTC seconds and the lower 32-bit word contains the nanoseconds. This patch detects what format is configured in the NIC and performs the conversion accordingly. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	bb7ef9a962	common/mlx5: add register access DevX routine The DevX routine to read/write NIC registers via DevX API is added. This is the preparation step to check timestamp modes and units and gather the extended statistics. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	3b025c0ca4	net/mlx5: provide send scheduling error statistics The mlx5 PMD exposes the following new introduced extended statistics counter to report the errors of packet send scheduling on timestamps: - txpp_err_miss_int - rearm queue interrupt was not handled was not handled in time and service routine might miss the completions - txpp_err_rearm_queue - reports errors in rearm queue - txpp_err_clock_queue - reports errors in clock queue - txpp_err_ts_past - timestamps in the packet being sent were found in the past, timestamps were ignored - txpp_err_ts_future - timestamps in the packet being sent were found in the too distant future (beyond HW/clock queue capabilities to schedule, typically it is about 16M of tx_pp devarg periods) - txpp_jitter - estimated jitter in device clocks between 8K completions of Clock Queue. - txpp_wander - estimated wander in device clocks between 16M completions of Clock Queue. - txpp_sync_lost - error flag, the Clock Queue completions synchronization is lost, accurate packet scheduling can not be handled, timestamps are being ignored, the restart of all ports using scheduling must be performed. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	b94d93ca73	net/mlx5: support reading device clock If send schedule feature is engaged there is the Clock Queue created, that reports reliable the current device clock counter value. The device clock counter can be read directly from the Clock Queue CQE. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	2f827f5ea6	net/mlx5: support scheduling on send routine template This patch adds send scheduling on timestamps into tx_burst routine template. The feature is controlled by static configuration flag, the actual routines supporting the new feature are generated over this updated template. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	0febfcce36	net/mlx5: prepare Tx to support scheduling The new static control flag is introduced to control routine generating from template, enabling the scheduling on timestamps. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	085ff447f0	net/mlx5: convert timestamp to completion index The application provides timestamps in Tx mbuf as clocks, the hardware performs scheduling on Clock Queue completion index match. This patch introduces the timestamp-to-completion-index inline routine. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	3172c471b8	net/mlx5: prepare Tx queue structures to support timestamp The fields to support send scheduling on dynamic timestamp field are introduced and initialized on device start. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	77522be0a5	net/mlx5: introduce clock queue service routine Service routine is invoked periodically on Rearm Queue completion interrupts, typically once per some milliseconds (1-16) to track clock jitter and wander in robust fashion. It performs the following: - fetches the completed CQEs for Rearm Queue - restarts Rearm Queue on errors - pushes new requests to Rearm Queue to make it continuously running and pushing cross-channel requests to Clock Queue - reads and caches the Clock Queue CQE to be used in datapath - gathers statistics to estimate clock jitter and wander - gathers Clock Queue errors statistics Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	aef1e20ebe	net/mlx5: allocate packet pacing context This patch allocates the Packet Pacing context from the kernel, configures one according to requested pace send scheduling granularity and assigns to Clock Queue. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	3a87b964ed	net/mlx5: create Tx queues with DevX To provide the packet send schedule on mbuf timestamp the Tx queue must be attached to the same UAR as Clock Queue is. UAR is special hardware related resource mapped to the host memory and provides doorbell registers, the assigning UAR to the queue being created is provided via DevX API only. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	551c94c83e	net/mlx5: create rearm queue for packet pacing The dedicated Rearm Queue is needed to fire the work requests to the Clock Queue in realtime. The Clock Queue should never stop, otherwise the clock synchronization might be broken and packet send scheduling would fail. The Rearm Queue uses cross channel SEND_EN/WAIT operations to provides the requests to the Clock Queue in robust way. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	d133f4cdb7	net/mlx5: create clock queue for packet pacing This patch creates the special completion queue providing reference completions to schedule packet send from other transmitting queues. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	fc4d4f732b	net/mlx5: introduce shared UAR resource This is preparation step before moving the Tx queue creation to the DevX approach. Some features require the shared UAR for Tx queues and scheduling completion queues, the patch manages the shared UAR. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	24feb04596	net/mlx5: fix UAR lock sharing for multiport devices The master and representors might be created over the multiport Infiniband devices and the UAR resource allocated for sibling ports might belong to the same underlying Infiniband device. Hardware requires the write access to the UAR must be performed as atomic 64-bit write, on 32-bit systems this is two sequential writes, protected by lock. Due to possibility to share the same UAR between sibling devices the locks must be moved to shared context. Fixes: `f048f3d479` ("net/mlx5: switch to the shared IB device context") Cc: stable@dpdk.org Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	8f848f32fc	net/mlx5: introduce send scheduling devargs This patch introduces the new devargs: tx_pp - enables accurate packet send scheduling on mbuf timestamps in the PMD. On the device start if "rte_dynflag_timestamp" dynamic flag is registered and this devarg non-zero value is specified, the driver initializes all necessary internal infrastructure to provide packet scheduling. The parameter value specifies scheduling granularity in nanoseconds. tx_skew - the parameter adjusts the send packet scheduling on timestamps and represents the average delay between beginning of the transmitting descriptor processing by the hardware and appearance of actual packet data on the wire. The value should be provided in nanoseconds and is valid only if tx_pp parameter is specified. The default value is zero. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Viacheslav Ovsiienko	79a7e409a2	common/mlx5: prepare support of packet pacing This patch prepares the common part of the mlx5 PMDs to support packet send scheduling on mbuf timestamps: - the DevX routine to query the packet pacing HCA capabilities - packet pacing Send Queue attributes support - the hardware related definitions Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Thomas Monjalon	b458bd4cf7	common/mlx5: fix link with ibverbs glue dlopen option In case the ibverbs glue is a separate library to dlopen, the PMD library must allocate a glue structure to be filled by dlopen. The glue management was in mlx5_common.c and moved to mlx5_common_os.c, but the variable allocation was not removed from the original file. The consequence was a link failure, if ibverbs dlopen option is enabled, because of the redefinition of the variable (with GCC 10): multiple definition of 'mlx5_glue' The original definition is removed to keep only the one moved in the Linux sub-directory. Fixes: `79aa430721` ("common/mlx5: split common file under Linux directory") Cc: stable@dpdk.org Signed-off-by: Thomas Monjalon <thomas@monjalon.net> Acked-by: Matan Azrad <matan@mellanox.com>	2020-07-21 15:44:36 +02:00
Jeff Guo	4237be2b9e	net/e1000: fix crash on Tx done clean up As tx mbuf is not set for some advanced descriptors, if there is no mbuf checking before rte_pktmbuf_free_seg() function be called on the process of tx done clean up, that will cause a segfault. So add a NULL pointer check to fix it. Bugzilla ID: 501 Fixes: `8d907d2b79` ("net/igb: free consumed Tx buffers on demand") Cc: stable@dpdk.org Signed-off-by: Jeff Guo <jia.guo@intel.com> Reviewed-by: Wei Zhao <wei.zhao1@intel.com>	2020-07-21 13:54:54 +02:00
Jeff Guo	34c3e885be	net/iavf: fix GTPU L4 hash When the configure pattern involve GTPU inner l3 and l4, even the configure input set only l3 but not l4, the different l4 protocol header should also be configured for the different l4 protocol. Fixes: `215a247b5f` ("net/iavf: refactor hash flow") Fixes: `642f201950` ("net/iavf: support RSS for IPv4 IPv6 mix of GTP") Signed-off-by: Jeff Guo <jia.guo@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2020-07-21 13:54:54 +02:00
Jeff Guo	e4d3cac43c	net/ice: fix GTPU L4 hash When the configure pattern involve GTPU inner l3 and l4, even the configure input set only l3 but not l4, the different l4 protocol header should also be configured for the different l4 protocol. Fixes: `0b952714e9` ("net/ice: refactor PF hash flow") Fixes: `de32fa2ba2` ("net/ice: support RSS for IPv6 prefix") Signed-off-by: Jeff Guo <jia.guo@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2020-07-21 13:54:54 +02:00
Jeff Guo	ef1d314d90	net/ice: fix symmetric hash configuration Some protocol don't support symmetric hash, need to handle these cases. When set an invalid symmetric hash rule, just return failed. Fixes: `4eafe71ee9` ("net/ice: fix RSS type") Signed-off-by: Jeff Guo <jia.guo@intel.com> Acked-by: Qi Zhang <qi.z.zhang@intel.com>	2020-07-21 13:54:54 +02:00
Zhihong Peng	21f4a1fb30	net/e1000: report VLAN extend capability The rte_eth_dev_set_vlan_offload function will check vlan rx offload capability, the i350/i210/i211 nics have vlan extend feature but DEV_RX_OFFLOAD_VLAN_EXTEND is not set into the capability, that will cause setting fail. So need to add this capability in igb_get_rx_port_offloads_capa function. Fixes: `ef990fb56e` ("net/e1000: convert to new Rx offloads API") Cc: stable@dpdk.org Signed-off-by: Zhihong Peng <zhihongx.peng@intel.com> Reviewed-by: Wei Zhao <wei.zhao1@intel.com>	2020-07-21 13:54:54 +02:00
Zhihong Peng	664c253e7f	net/i40e: report VLAN filter capability The rte_eth_dev_set_vlan_offload function will check vlan rx offload capability, the i40e vf has vlan filter feature but DEV_RX_OFFLOAD_VLAN_FILTER is not set into the capability, that will cause setting fail. So need to add this capability in i40e_vf_representor_dev_infos_get function. Fixes: `e0cb96204b` ("net/i40e: add support for representor ports") Cc: stable@dpdk.org Signed-off-by: Zhihong Peng <zhihongx.peng@intel.com> Acked-by: Jeff Guo <jia.guo@intel.com>	2020-07-21 13:54:54 +02:00
Hemant Agrawal	2cf9264f2d	net/dpaa: support queue info routines This patch add support for rxq_info_get and txq_info_get Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-07-21 13:54:54 +02:00
Hemant Agrawal	de1d70f0b8	net/dpaa2: support queue info routines This patch add support for rxq_info_get and txq_info_get Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-07-21 13:54:54 +02:00
Hemant Agrawal	b58690951b	net/dpaa2: support using Tx queue descriptor size Add support to use Tx queue desc size to configure congestion notification on TX queue Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-07-21 13:54:54 +02:00
Hemant Agrawal	988a7c38cc	net/dpaa2: report error on queue deferred start This patch add support to reports errors on configuring deferred start for rx or tx queues. Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-07-21 13:54:54 +02:00
Hemant Agrawal	e335cce430	net/dpaa: report error on using deferred start This patch add support to report on error for rx and tx deferred start config Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-07-21 13:54:54 +02:00
Apeksha Gupta	ddbc2b6658	net/dpaa2: add Tx/Rx burst mode info Retrieve burst mode information according to the selected Tx/Rx mode and offloads. Signed-off-by: Apeksha Gupta <apeksha.gupta@nxp.com>	2020-07-21 13:54:54 +02:00
Apeksha Gupta	2e6f565700	net/dpaa: add Tx/Rx burst mode info Retrieve burst mode information according to the selected Rx/Tx mode and offloads. Signed-off-by: Apeksha Gupta <apeksha.gupta@nxp.com>	2020-07-21 13:54:54 +02:00
Hemant Agrawal	724f79dff0	net/dpaa2: support per-port Rx mbuf timestamp DEV_RX_OFFLOAD_TIMESTAMP is per port, so the internal implementation shall enable it on per port basis only. Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-07-21 13:54:54 +02:00
Hemant Agrawal	422aa32c4c	net/dpaa2: remove Rx timestamp enable PMD API This experimental API is no longer required as the same purpose can be solved with standard DEV_RX_OFFLOAD_TIMESTAMP Signed-off-by: Hemant Agrawal <hemant.agrawal@nxp.com>	2020-07-21 13:54:54 +02:00
Andrew Rybchenko	520059a41a	net: check fragmented headers in non-debug as well Pseudo-header checksum calculation requires contiguous headers. There is no any formal requirements on data location and mbuf structure which could be used by the application. Since commit `dfc6b2fd8d` ("mbuf: remove Intel offload checks from generic API") fragmented headers checks are done inside rte_net_intel_cksum_flags_prepare() in RTE_LIBRTE_ETHDEV_DEBUG build because it is moved from rte_validate_tx_offload() which is called under debug only. Make corresponding check to be done in non-debug build as well to avoid bad accesses, incorrect checksum calculation and to return appropriate error from Tx prepare. Make no-offloads check more precise and do it in non-debug build as well to avoid contiguous headers check and Tx prepare failure if it is not actually required. Signed-off-by: Andrew Rybchenko <arybchenko@solarflare.com> Acked-by: Olivier Matz <olivier.matz@6wind.com>	2020-07-21 13:54:54 +02:00
Weifeng Li	0bf68c660e	net/bonding: change state machine to defaulted A dpdk bonding 802.3ad network as follows: +----------+ +-----------+ \|dpdk lacp \|bond1.1 <------> bond2.1\|switch lacp\| \| \|bond1.2 <------> bond2.2\| \| +----------+ +-----------+ If a fiber optic go wrong about single pass during normal running like this: bond1.2 -----> bond2.2 ok bond1.2 <--x-- bond2.2 error: bond1.2 receive no LACPDU Some packets from switch to dpdk will choose bond2.2 and lost. DPDK lacp state machine will transits to the expired state if no LACPDU is received before the current_while_timer expires. But if no LACPDU is received before the current_while_timer expires again, DPDK lacp state machine has no change. Bond2.2 can not change to inactive depend on the received LACPDU. According to IEEE 802.3ad, if no lacpdu is received before the current_while_timer expires again, the state machine should transits from expired to defaulted. Bond2.2 will change to inactive depend on the LACPDU with defaulted state. This patch adds a state machine change from expired to defaulted when no lacpdu is received before the current_while_timer expires again according to IEEE 802.3ad: If no LACPDU is received before the current_while timer expires again, the state machine transits to the DEFAULTED state. The record Default function overwrites the current operational parameters for the Partner with administratively configured values. This allows configuration of aggregations and individual links when no protocol partner is present, while still permitting an active partner to override default settings. The update_Default_Selected function sets the Selected variable FALSE if the Link Aggregation Group has changed. Since all operational parameters are now set to locally administered values there can be no disagreement as to the Link Aggregation Group, so the Matched variable is set TRUE. The relevant description is in the chapter 43.4.12 of the link below: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=850426 Signed-off-by: Weifeng Li <liweifeng96@126.com> Acked-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-07-21 13:54:54 +02:00
Dongyang Pan	2db4cf9d01	net/bonding: delete redundant code The function valid_bonded_port_id() has already contains function rte_eth_dev_is_valid_port(), so delete redundant check. Fixes: `588ae95e79` ("net/bonding: fix port ID check") Cc: stable@dpdk.org Signed-off-by: Dongyang Pan <197020236@qq.com> Reviewed-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-07-21 13:54:54 +02:00
Kishore Padmanabha	fe82f3e027	net/bnxt: support exact match templates Added support for exact match templates Signed-off-by: Kishore Padmanabha <kishore.padmanabha@broadcom.com> Signed-off-by: Somnath Kotur <somnath.kotur@broadcom.com>	2020-07-21 13:54:54 +02:00
Kishore Padmanabha	cef3749d50	net/bnxt: update TruFlow resource allocation numbers The truflow session open allocation parameters are updated to support NAT records, L2 context regions, engress encap features. Signed-off-by: Kishore Padmanabha <kishore.padmanabha@broadcom.com> Signed-off-by: Somnath Kotur <somnath.kotur@broadcom.com>	2020-07-21 13:54:54 +02:00
Kishore Padmanabha	6eb4ccff38	net/bnxt: modify default egress rule for VF representor The default egress rule should include buffer descriptor action record only if the VF representor is enabled. Signed-off-by: Kishore Padmanabha <kishore.padmanabha@broadcom.com> Signed-off-by: Somnath Kotur <somnath.kotur@broadcom.com> Reviewed-by: Mike Baucom <michael.baucom@broadcom.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2020-07-21 13:54:54 +02:00
Kishore Padmanabha	292611672b	net/bnxt: fix null pointer dereference Avoid dereferencing a null pointer. Fixes: `313ac35ac7` ("net/bnxt: support ULP session manager init") Cc: stable@dpdk.org Signed-off-by: Kishore Padmanabha <kishore.padmanabha@broadcom.com> Signed-off-by: Somnath Kotur <somnath.kotur@broadcom.com> Reviewed-by: Mike Baucom <michael.baucom@broadcom.com>	2020-07-21 13:54:54 +02:00
Randy Schacher	a5ac40fbb3	net/bnxt: use SPDX license tag Update cfa_resource_types.h to use SPDX license header. Reported-by: Stephen Hemminger <stephen@networkplumber.org> Signed-off-by: Randy Schacher <stuart.schacher@broadcom.com> Signed-off-by: Somnath Kotur <somnath.kotur@broadcom.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2020-07-21 13:54:54 +02:00

1 2 3 4 5 ...

23587 Commits