numam-dpdk

Author	SHA1	Message	Date
Bing Zhao	0ba70e439e	net/mlx5: fix actions validation on root table The maximal supported header modifications number of a single modify context on the root table cannot be queried from firmware directly. It is a fixed value of 16 in the latest releases. In the validation stage, PMD driver should ensure that no more than 16 header modify actions exist in a single context. In some old firmware releases, the supported value is 8. PMD driver should try its best to create the flow. Firmware will return error and refuse to create the flow if the actions number exceeds the maximal value. Fixes: 72a944dba163 ("net/mlx5: fix header modify action validation") Cc: stable@dpdk.org Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-05-05 15:54:26 +02:00
Suanming Mou	691b3d3ebb	net/mlx5: fix indexed pool bitmap initialization Currently, the indexed memory pool bitmap start address is not aligned to cacheline size explicitly. The bitmap initialization requires the address should be cacheline aligned. In that case, the initialization maybe failed if the address is not cacheline aligned. Add RTE_CACHE_LINE_ROUNDUP() to the trunk size calculation to make sure the bitmap offset address will start with cacheline aligned. Fixes: a3cf59f56c47 ("net/mlx5: add indexed memory pool") Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Tested-by: Lijian Zhang <lijian.zhang@arm.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com>	2020-05-05 15:54:26 +02:00
Alexander Kozyrev	9a6ea33af9	net/mlx5: fix packet length assert in MPRQ The assert that checks if there is a enough room for the whole packet minus headroom data is written incorrectly. The check should be negated in order to work properly. Fixes: bd0d5930bf56 ("net/mlx5: enable MPRQ multi-stride operations") Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-05-05 15:54:26 +02:00
Alexander Kozyrev	0c55591588	net/mlx5: fix assert in dynamic metadata handling The assert in dynamic flow metadata handling is wrong after the fix for the performance degradation. The assert meant to check the metadata mask but was updated with the metadata offset instead. Fix this assert and restore proper metadata mask checking. Fixes: 6c55b622a956 ("net/mlx5: set dynamic flow metadata in Rx queues") Cc: stable@dpdk.org Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-05-05 15:54:26 +02:00
Suanming Mou	dd76f43612	net/mlx5: save meter index instead of meter id Currently, while creating the flow with meter, meter id is saved to the rte flow. While destroying the flow, the meter object will be found by the meter id, so the meter object will be released accordingly. But as the meter id is configured by user, while the meter id is set to 0, it doesn't make any sense to flow destroy since 0 means flow doesn't have meter. The meter object with id 0 will be leaked. As meter object is allocated from indexed memory, and the index starts from 1, save the internal generated index instead of user defined meter id will never meet the issue as above. This patch saves meter index instead of meter id in rte flow. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-05-05 15:54:26 +02:00
Asaf Penso	9b080425e3	net/mlx5: fix assert in doorbell lookup The asserts makes sure that 'i' doesn't exceed the expected value. This to prevent an out of bound access to dbr_bitmap. The current location of the assert protects the assignment of dbr_bitmap, but not the access to it. Moved the assert to the correct place, to protect both cases. Also, used an existing define for the assert. Fixes: 21cae8580fd0 ("net/mlx5: allocate door-bells via DevX") Cc: stable@dpdk.org Signed-off-by: Asaf Penso <asafp@mellanox.com> Reviewed-by: Dekel Peled <dekelp@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-05-05 15:54:26 +02:00
Bing Zhao	f21a98196a	net/mlx5: fix empty flow error structure The output flow error parameter is used to indicate the detailed reason of the failure when calling a rte_flow_* interface. Even though sometimes the application will not check it or use it, the PMD must fill it in the failure branch before returning. Or else, some dirty value in the stack, heap will be accessed as a pointer and then cause a crash. In this case, when a port is stopped, it is not allowed to insert a flow from application. The detailed error information should be filled. If the application needs to check the detailed error reason, it will get the information but not result in any crash. Fixes: 40b9e7f65fe1 ("net/mlx5: check device status before creating flow") Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Ori Kam <orika@mellanox.com>	2020-05-05 15:54:26 +02:00
Bing Zhao	351b54f5cf	net/mlx5: fix Rx queue flags on destroying flow After inserting an offload flow, the software flag information will be updated based on the flow. When receiving a packet on this queue, the hardware packet type bits and the software flag will be used together to get the inner packet and tunnel header type (if any) from the global packet type table. When destroying a flow, the corresponding Rx queue flag needs to be updated. All flags should be cleared when closing a device because all control flows and application flows are invalid anymore. Such behavior is missed when implementing the non-cached mode. Fixes: 8db7e3b69822 ("net/mlx5: change operations for non-cached flows") Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-05-05 15:54:26 +02:00
Ori Kam	f5bf02df31	eal/ppc: fix bool type after altivec include The AltiVec header file breaks boolean type. [1] [2] Currently the workaround was located only in mlx5 device. Adding the trace module caused this issue to appear again, due to order of includes, it keeps overriding the local fix. This patch solves this issue by resetting the bool type, immediately after it is being changed. [1] https://mails.dpdk.org/archives/dev/2018-August/110281.html [2] In file included from dpdk/ppc_64-power8-linux-gcc/include/rte_mempool_trace_fp.h:18:0, from dpdk/ppc_64-power8-linux-gcc/include/rte_mempool.h:54, from dpdk/drivers/common/mlx5/mlx5_common_mr.c:7: dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point.h: In function '__rte_trace_point_fp_is_enabled': dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point.h:226:2: error: incompatible types when returning type 'int' but '__vector __bool int' was expected return false; ^ In file included from dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point.h:281:0, from dpdk/ppc_64-power8-linux-gcc/include/rte_mempool_trace_fp.h:18, from dpdk/ppc_64-power8-linux-gcc/include/rte_mempool.h:54, from dpdk/drivers/common/mlx5/mlx5_common_mr.c:7: dpdk/ppc_64-power8-linux-gcc/include/rte_mempool_trace_fp.h: In function 'rte_mempool_trace_ops_dequeue_bulk': dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point_provider.h:104:6: error: wrong type argument to unary exclamation mark if (!__rte_trace_point_fp_is_enabled()) \ ^ dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point.h:49:2: note: in expansion of macro '__rte_trace_point_emit_header_fp' __rte_trace_point_emit_header_##_mode(&__##_tp); \ ^ dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point.h:99:2: note: in expansion of macro '__RTE_TRACE_POINT' __RTE_TRACE_POINT(fp, tp, args, __VA_ARGS__) ^ dpdk/ppc_64-power8-linux-gcc/include/rte_mempool_trace_fp.h:20:1: note: in expansion of macro 'RTE_TRACE_POINT_FP' RTE_TRACE_POINT_FP( ^ dpdk/ppc_64-power8-linux-gcc/include/rte_mempool_trace_fp.h: In function 'rte_mempool_trace_ops_dequeue_contig_blocks': dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point_provider.h:104:6: error: wrong type argument to unary exclamation mark if (!__rte_trace_point_fp_is_enabled()) \ ^ dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point.h:49:2: note: in expansion of macro '__rte_trace_point_emit_header_fp' __rte_trace_point_emit_header_##_mode(&__##_tp); \ ^ dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point.h:99:2: note: in expansion of macro '__RTE_TRACE_POINT' __RTE_TRACE_POINT(fp, tp, args, __VA_ARGS__) ^ dpdk/ppc_64-power8-linux-gcc/include/rte_mempool_trace_fp.h:29:1: note: in expansion of macro 'RTE_TRACE_POINT_FP' RTE_TRACE_POINT_FP( ^ dpdk/ppc_64-power8-linux-gcc/include/rte_mempool_trace_fp.h: In function 'rte_mempool_trace_ops_enqueue_bulk': dpdk/ppc_64-power8-linux-gcc/include/rte_trace_point_provider.h:104:6: error: wrong type argument to unary exclamation mark if (!__rte_trace_point_fp_is_enabled()) \ Fixes: 725f5dd0bfb5 ("net/mlx5: fix build on PPC64") Signed-off-by: Ori Kam <orika@mellanox.com> Signed-off-by: David Christensen <drc@linux.vnet.ibm.com> Tested-by: David Christensen <drc@linux.vnet.ibm.com> Tested-by: Raslan Darawsheh <rasland@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-05-06 11:45:13 +02:00
Luca Boccassi	611faa5f46	fix various typos found by Lintian Cc: stable@dpdk.org Signed-off-by: Luca Boccassi <bluca@debian.org>	2020-04-25 19:53:47 +02:00
Alexander Kozyrev	a24431dffb	net/mlx5: improve logging of MPRQ selection MPRQ is silently turned off in case there is not enough Rx queues configured. Improve the logging to show a warning in this case to notify a user about the Rx burst function selected. Fixes: 7d6bf6b866b8 ("net/mlx5: add Multi-Packet Rx support") Cc: stable@dpdk.org Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 22:28:06 +02:00
Alexander Kozyrev	6c55b622a9	net/mlx5: set dynamic flow metadata in Rx queues Using a global mbuf dynamic field for metadata incurs some performance penalty on a datapath. Store this information in the Rx queue descriptor for a better cache locality. Fixes: a18ac6113331 ("net/mlx5: add metadata support to Rx datapath") Cc: stable@dpdk.org Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 22:28:06 +02:00
Bing Zhao	72a944dba1	net/mlx5: fix header modify action validation The header modify actions number supported now has some limitation, and it is decided by both driver and hardware. If the configuration is different or the table to insert the flow is different, the result might be different if the flow contains header modify actions. Currently, the actual action number could only be calculated in the later stage called translate, from user specified value to the driver format. And the action numbers checking is missed in the flow validation. So PMD will return incorrect result to indicate the flow actions are valid by rte_flow_validate but then it will fail when calling rte_flow_create. Adding some simple checking in the validation will help to get rid of this incorrect checking. Most of the actions will only consume 1 SW action field except the MAC address and IPv6 address. And from SW POV, the maximal action fields for these will be consumed even if only part of such field will be modified because that there is no mask in the flow actions and the mask will always be all ONEs. The metering or extra metadata supports will cost one more action. Fixes: 9597330c6844 ("net/mlx5: update modify header action translator") Cc: stable@dpdk.org Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 22:28:06 +02:00
Tonghao Zhang	0d7d180a0d	net/mlx5: fix crash when releasing meter table The meters of ports share the same meter table on the port. When releasing meters, don't check value returned using assert. Because other meters may reference to it. Fixes: 46a5e6bc6a85 ("net/mlx5: prepare meter flow tables") Fixes: 9dbaf7eef6e1 ("net/mlx5: fix meter suffix table leak") Cc: stable@dpdk.org Signed-off-by: Tonghao Zhang <xiangxia.m.yue@gmail.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 22:28:06 +02:00
Wentao Cui	78466e086a	net/mlx5: optimize memory for flow meter This commit focus on flow meter data structures optimization: mlx5_flow_meter. Optimize memory consumption of flow meter data structure. Reorganize flow meter data structure,delete unnecessary data fields. Signed-off-by: Wentao Cui <wentaoc@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 16:18:13 +02:00
Suanming Mou	0136df99a9	net/mlx5: reorganize flow API structure Currently, the rte flow structure is not fully aligned and has some bits wasted. The members can be optimized and reorganized to save memory. 1. The drv_type uses only limited bits, change the type to 2 bits what it needs. 2. Align the hairpin_flow_id, drv_type, fdir, copy_applied to 32 bits. As hairpin never uses the full 32 bits. 3. __rte_packed helps tight up the structure memory layout. The optimization totally helps save 14 bytes for the structure. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	ab612adc1e	net/mlx5: allocate flow API from indexed pool This commit allocates rte flow from indexed memory pool. Allocate rte flow memory from indexed memory pool helps save more than MALLOC_ELEM_OVERHEAD bytes memory from rte_malloc(). Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	e745f90007	net/mlx5: optimize flow RSS struct When destroy the flow with RSS, flow can invoke the queues information from hrxq index table object, since the queue number and list are both saved to the index table object. No need to save the duplicated data in rte flow. Save the RSS description information to the intermediate private data when create the flow with RSS action helps to save the memory for rte flow. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Wentao Cui	c2ddde7950	net/mlx5: optimize flow director filter memory This commit is for mlx5 fdir flow memory optimization. Currently for the fdir member in rte_flow structure. It saves the fdir memory pointer directly. As fdir is fading away, use one bit help to indicate the function in the flow and add the content to an extra list save the memory for the other widely usage cases. Signed-off-by: Wentao Cui <wentaoc@mellanox.com> Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	90e6053a19	net/mlx5: convert mark copy resource to indexed Allocate mark copy resource from indexed pool helps rte flow saves the 4 bytes index instead of 8 bytes pointer. For mark copy resource itself, it helps save MALLOC_ELEM_OVERHEAD bytes from rte_malloc(). Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	8638e2b076	net/mlx5: allocate meter from indexed pool This patch allocate the meter object memory from indexed memory pool which will help to save the MALLOC_ELEM_OVERHEAD memory taken by rte_malloc(). Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	8eb5485dc0	net/mlx5: optimize flow meter handle type While flow attaches the meter handle, the meter id can be the unique tag for the flow to get the meter handle. It's no need for flow to save the pointer of the meter handle. Save the meter id instead of pointer helps reduce the size for rte flow structure. As the supported maximum meter rule is 4K, uint16_t type is selected for the meter id. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	77749adab9	net/mlx5: reorganize flow handle struct Currently, the mlx5_flow_handle struct is not fully aligned and has some bits wasted. The members can be optimized and reorganized to save memory. 1. As metadata and meter is sharing the same flow match id, now the flow id is limited to 24 bits due to the 8 MSBs are used as for the meter color. Align the flow id to other bit members to 32 bits to save the mlx5 flow handle memory. 2. The vlan_vf in struct mlx5_flow_handle_dv was already moved to struct mlx5_flow_handle. Remove the legacy vlan_vf in struct mlx5_flow_handle_dv. 3. Reorganize the vlan_vf in mlx5_flow_handle with member SILIST_ENTRY next to make it align with 8 bytes. 4. Reorganize the header modify in mlx5_flow_handle_dv to ILIST_ENTRY next to make it align to with bytes. 5. Introduce __rte_pack attribute to make the struct tightly organized. It will totally save 20 bytes memory for mlx5_flow_handle struct. For the resource objects which are converted to indexed, align the names with the prefix of rix_. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	488d13abdc	net/mlx5: optimize action flags in flow handle As only limited bits is used in act_flags for flow destroy, it's a bit expensive to save the whole 64 bits. Move the act_flags out of flow handle and save the needed bits for flow destroy to save some bytes for the flow handle data struct. The fate action type and mark bits are reserved as they will be used in flow destroy. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	6fc183924b	net/mlx5: reorganize fate actions as union Currently, one flow only has one fate action, the fate actions members in the flow struct can be reorganized as union to save the memory for flow struct. This commit reorganizes the fate actions as union, the act_flags helps to identify the fate action type when flow destroys. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	b88341ca35	net/mlx5: convert flow dev handle to indexed This commit converts flow dev handle to indexed. Change the mlx5 flow handle from pointer to uint32_t saves memory for flow. With million flow, it saves several MBytes memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	772dc0eb83	net/mlx5: convert hrxq to indexed This commit converts hrxq to indexed. Using the uint32_t index instead of pointer saves 4 bytes memory for the flow handle. For millions flows, it will save several MBytes of memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	7ac99475ce	net/mlx5: convert jump resource to indexed This commit convert jump resource to indexed. The table data struct is allocated from indexed memory. As it is add in the hash list, the pointer is still used for hash list search. The index is added to the table struct, and the pointer in flow handle is decrease to uint32_t type. For flow without jump flows, it saves 4 bytes memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	f3faf9ea11	net/mlx5: convert port id action to indexed This commit converts port id action to indexed. Using the uint32_t index instead of pointer saves 4 bytes memory for the flow handle. For millions flows, it will save several MBytes of memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	5f1142692a	net/mlx5: convert tag resource to indexed This commit convert tag resource to indexed. As tag resources are add in the hash list, to avoid introduce performance issue and keep the hash list, only the tag resource memory is allocated from indexed memory. The resources is still added to the hash list. Add four bytes index in the tag resource struct and change the tag resources in the flow handle from pointer to uint32_t seems be no benefit for tag resource, but it saves memory for flows without tag action. And also for sub flows share one tag action resource. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	8acf8ac9b7	net/mlx5: convert push VLAN resource to indexed This commit converts the push VLAN resource to indexed. Using the uint32_t index instead of pointer saves 4 bytes memory for the flow handle. For millions flows, it will save several MBytes of memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	014d1cbe51	net/mlx5: convert encap/decap resource to indexed This commit converts the flow encap/decap resource to indexed. Using the uint32_t index instead of pointer saves 4 bytes memory for the flow handle. For millions flows, it will save several MBytes of memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	1fd4bb67eb	net/mlx5: add trunk release for indexed pool While entries are fully freed in trunk, it means the trunk is free now. User may prefer the free trunk memory can be reclaimed. Add the trunk release memory option for indexed pool in this case. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	62d7d519b1	net/mlx5: add trunk dynamic grow for indexed pool This commit add trunk dynamic grow for the indexed pool. In case for pools which are not sure the entry number needed, pools can be configured in increase progressively mode. It means the trunk size will be increased dynamically one after one, then reach a stable value. It saves memory to avoid allocate a very big trunk at beginning. User should set both the grow_shift and grow_trunk to help the trunk grow works. Keep one or both grow_shift and grow_trunk as 0 makes the trunk work as fixed size. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Suanming Mou	a3cf59f56c	net/mlx5: add indexed memory pool Currently, the memory allocated by rte_malloc() also introduced more than 64 bytes overhead. It means when allocate 64 bytes memory, the real cost in memory maybe double. And the libc malloc() overhead is 16 bytes, If users try allocating millions of small memory blocks, the overhead costing maybe huge. And save the memory pointer will also be quite expensive. Indexed memory pool is introduced to save the memory for allocating huge amount of small memory blocks. The indexed memory uses trunk and bitmap to manage the memory entries. While the pool is empty, the trunk slot contains memory entry array will be allocated firstly. The bitmap in the trunk records the entry allocation. The offset of trunk slot in the pool and the offset of memory entry in the trunk slot compose the index for the memory entry. So, by the index, it will be very easy to address the memory of the entry. User saves the 32 bits index for the memory resource instead of the 64 bits pointer. User should create different pools for allocating different size of small memory block. It means one pool provides one fixed size of small memory blocked allocating. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:09 +02:00
Xiaoyu Min	889cf609e5	net/mlx5: fix validation of push VLAN without full mask Due the limitation of HW, when PMD create push VLAN action it needs to know what exactly the value of VID/PCP. PMD try to figure out them via: - of_set_vlan_vid/pcp actions - VLAN item in pattern If none of above is provided, default value - zero is used. However user will write rule like [1] which match on a range of VID and without of_set_vlan_vid action and expect the VID will inherit from original packet. This is not supported by HW currently. PMD will set VID to default value - zero because it cannot figure out the exact value of VID from VLAN item. This is sort of misleading for some users. In order to avoid this, PMD will spit out error for rule like [1] to force user to provide explicit VID/PCP for new pushed VLAN headers. [1]: testpmd> flow create 2 ingress transfer group 0 priority 3 pattern eth / vlan vid spec 2859 vid prefix 4 / ipv4 / end actcions of_push_vlan ethertype 0x88A8 / of_set_vlan_pcp vlan_pcp 6 / port_id id 0 / end Fixes: 9aee7a8418d4 ("net/mlx5: support push flow action on VLAN header") Cc: stable@dpdk.org Signed-off-by: Xiaoyu Min <jackmin@mellanox.com> Reviewed-by: Dekel Peled <dekelp@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:08 +02:00
Xiaoyu Min	7162c02d7b	net/mlx5: fix push VLAN action to use item info Currently when PMD create push VLAN action it need to provide VID to HW and PMD get VID value from item VLAN in pattern if there is no of_set_vlan_vid action following. When user create rule like [1], which has of_set_vlan_vid action before of_push_vlan, the intention is to modify VID on existing VLAN header and push a new VLAN header with VID _inherit_ from the previous of_set_vlan_vid. Currently the above is not covered by PMD, PMD always fetch the VLAN information from item for of_push_vlan action. Fix it by only fetch VLAN information from item when there is no previous of_set_vlan_vid action. [1]: testpmd> flow create 2 ingress transfer group 1 priority 3 pattern eth / vlan vid is 2731 / ipv4 / end actions of_set_vlan_vid vlan_vid 3209 / of_push_vlan ethertype 0x88A8 / port_id id 1 / end Fixes: b8c0372bc5ac ("net/mlx5: fix set VLAN ID/PCP in new header") Cc: stable@dpdk.org Signed-off-by: Xiaoyu Min <jackmin@mellanox.com> Reviewed-by: Dekel Peled <dekelp@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:08 +02:00
Vu Pham	b8dc6b0e29	common/mlx5: refactor memory management Refactor common memory btree and cache management to common driver. Replace some input parameters of MR APIs to more common data structure like PD, port_id, share_cache,... so that multiple PMD drivers can use those MR APIs. Modify mlx5 net pmd driver to use MR management APIs from common driver. Signed-off-by: Vu Pham <vuhuong@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:08 +02:00
Vu Pham	a4de9586ac	common/mlx5: refactor IPC handling from net driver Refactor common multi-process handling codes from net PMD to common driver. Using tuple mp_id{name, port_id} as standard input parameter for all multi-process IPC APIs instead of using rte_eth_dev. Modify net PMD to use multi-process APIs from common driver. Signed-off-by: Vu Pham <vuhuong@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:08 +02:00
Suanming Mou	fe2c412ca9	net/mlx5: fix jump table leak Currently, when translate jump action, the table reference will be increased all the time. But when release the jump action, the table resource reference will only be decreased when jump action is released. It means for jump action which was referenced more than one time, the increased table reference only decrease one time when jump action is released. Add table release when the jump action was not new created. Fixes: 684b9a1b1f5c ("net/mlx5: support jump action") Cc: stable@dpdk.org Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:08 +02:00
Suanming Mou	9dbaf7eef6	net/mlx5: fix meter suffix table leak Currently, the meter suffix table is created and saved in the mlx5 shared struct. It causes the suffix table will never be released even without any meter rules. Move the suffix table to meter domain struct to help the suffix table be released when all the meter rules are destroyed. Fixes: 46a5e6bc6a85 ("net/mlx5: prepare meter flow tables") Cc: stable@dpdk.org Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:08 +02:00
Alexander Kozyrev	775fc97b01	net/mlx5: add multi-segment packets in MPRQ mode The multi-stride operations now allow to reduce a stride size while supporting Jumbo frames. That means that it is possible to have mbufs configured with a size smaller than the whole packet received. It is not an issue during normal MPRQ operations since we attach external buffers instead of copying the data into the mbuf itself. But it is not the case in "emergency mode" when we have to copy every packet because of no more external mbufs are available. Assemble a multi-segment packet to overcome this issue in case scatter mode is enabled, drop a packet if not. Cc: stable@dpdk.org Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:08 +02:00
Alexander Kozyrev	bd0d5930bf	net/mlx5: enable MPRQ multi-stride operations MPRQ feature should be updated to allow a packet to be received into multiple strides in order to support the MTU exceeding 8KB. Special care is needed to prevent the headroom corruption in the multi-stride mode since the headroom space is borrowed by the PMD from the tail of the preceding stride. Copy the whole packet into a separate mbuf in this case or just the overlapping data if the Rx scattering is supported by an application. Cc: stable@dpdk.org Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:08 +02:00
Alexander Kozyrev	ecb160456a	net/mlx5: add device parameter for MPRQ stride size Define a device parameter to configure log 2 of a stride size for MPRQ - mprq_log_stride_size. User is able to specify a stride size in a range allowed by an underlying hardware. The default stride size is defined as 2048 bytes to encompass most commonly used packet sizes in the Internet (MTU 1518 and less) and will be used in case a maximum configured packet size cannot fit into the largest possible stride size. Otherwise a stride size is set to a large enough value to encompass a whole packet. Cc: stable@dpdk.org Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:08 +02:00
Mohsin Shaikh	00437823cb	net/mlx5: use open/read/close for ib stats query fgets(3)/fread(3)/fscanf(3) etc. use mmap(2)/munmap(2) which leads to TLB shutdown interrupts to all DPDK app cores including RX cores. This can cause packet drops. Use read(2)/write(2) instead. Bugzilla ID: 440 Cc: stable@dpdk.org Signed-off-by: Mohsin Shaikh <mohsinshaikh@niometrics.com> Reviewed-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:07 +02:00
Bing Zhao	3ac3d8234b	net/mlx5: fix index when creating flow When creating a flow, usually the creating routine is called in serial. No parallel execution is supported right now. The same function will be called only once for a single flow creation. But there is a special case that the creating routine will be called nested. If the xmeta feature is enabled and there is FLAG / MARK in the actions list, some metadata reg copy flow needs to be created before the original flow is applied to the hardware. In the flow non-cached mode, resources only for flow creation will not be saved anymore. The memory space is pre-allocated and reused for each flow. A global index for each device is used to indicate the memory address of the resources. If the function is called in a nested mode, then the index will be reset and make everything get corrupted. To solve this, a nested index is introduced to save the position for the original flow creation. Currently, only one level nested call of the flow creating routine is supported. Fixes: e7bfa3596a0a ("net/mlx5: separate the flow handle resource") Signed-off-by: Bing Zhao <bingz@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-04-21 13:57:07 +02:00
Suanming Mou	261bb99a21	net/mlx5: reorganize fallback counter management Currently, the fallback counter is also allocated from the pool, the specify fallback function code becomes a bit duplicate. Reorganize the fallback counter code to make it reuse from the normal counter code. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:07 +02:00
Suanming Mou	826b8a8732	net/mlx5: split flow counter struct Currently, the counter struct saves both the members used by batch counters and none batch counters. The members which are only used by none batch counters cost 16 bytes extra memory for batch counters. As normally there will be limited none batch counters, mix the none batch counter and batch counter members becomes quite expensive for batch counter. If 1 million batch counters are created, it means 16 MB memory which will not be used by the batch counters are allocated. Split the mlx5_flow_counter struct for batch and none batch counters helps save the memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:07 +02:00
Suanming Mou	956d5c74d7	net/mlx5: optimize flow counter handle type Currently, DV and verbs counters are both changed to indexed. It means while creating the flow with counter, flow can save the indexed value to address the counter. Save the 4 bytes indexed value in the rte_flow instead of 8 bytes pointer helps to save memory with millions of flows. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:07 +02:00
Suanming Mou	4001d7ad26	net/mlx5: change Direct Verbs counter to indexed This part of the counter optimize change the DV counter to indexed as what have already done in verbs. In this case, all the mlx5 flow counter can be addressed by index. The counter index is composed of pool index and the counter offset in the pool counter array. The batch and none batch counter dcs ID offset 0x800000 is used to avoid the mix up for the index. As batch counter dcs ID starts from 0x800000 and none batch counter dcs starts from 0, the 0x800000 offset is added to the batch counter index to indicate the index of batch counter. The counter pointer in rte_flow struct will be aligned to index instead of pointer. It will save 4 bytes memory for every rte_flow. With millions of rte_flow, it will save MBytes memory. Signed-off-by: Suanming Mou <suanmingm@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-04-21 13:57:07 +02:00

1 2 3 4 5 ...

1393 Commits