numam-dpdk

Author	SHA1	Message	Date
Eugenio Pérez	cdf1dc5e6a	vhost: flush shadow Tx if no more packets The current implementation of vhost_net in packed vring tries to fill the shadow vector before send any actual changes to the guest. While this can be beneficial for the throughput, it conflicts with some bufferfloats methods like the linux kernel napi, that stops transmitting packets if there are too much bytes/buffers in the driver. To solve it, we flush the shadow packets at the end of virtio_dev_tx_packed if we have starved the vring, i.e. the next buffer is not available for the device. Since this last check can be expensive because of the atomic, we only check it if we have not obtained the expected "count" packets. If it happens to obtain "count" packets and there is no more available packets the caller needs to keep call virtio_dev_tx_packed again. Fixes: `31d6c6a5b8` ("vhost: optimize packed ring dequeue") Cc: stable@dpdk.org Signed-off-by: Eugenio Pérez <eperezma@redhat.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 11:47:18 +01:00
Vitaliy Mysak	bedf87c521	vhost: do not treat empty socket message as error According to recvmsg() specification, 0 is a valid return code when client is disconnecting. Therefore, it should not be reported as error, unless there are other dependencies that require message to not be empty. But there are none, since the next immediate caller of recvmsg() reports "vhost peer closed" info (not error) when message is empty. This patch changes return code check for recvmsg() so that misleading error message is not printed when the code is 0. Fixes: `8f972312b8` ("vhost: support vhost-user") Cc: stable@dpdk.org Signed-off-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-by: Tiwei Bie <tiwei.bie@intel.com>	2020-02-05 11:47:18 +01:00
Zhike Wang	499fd8e5b8	vhost: fix crash on port deletion The vhost_user_read_cb() and rte_vhost_driver_unregister() can be called at the same time by 2 threads. Eg thread1 calls vhost_user_read_cb() and removes the vsocket from conn_list, then thread2 calls rte_vhost_driver_unregister() and frees the vsocket since it is NOT in the conn_list. So thread1 will access invalid memory when trying to reconnect. The fix is to move the "removing of vsocket from conn_list" to end of the vhost_user_read_cb(), then avoid the race condition. The core trace is: Program terminated with signal 11, Segmentation fault. Fixes: `af14759181` ("vhost: introduce API to start a specific driver") Cc: stable@dpdk.org Signed-off-by: Zhike Wang <wangzhike@jd.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 11:47:18 +01:00
Tiwei Bie	9277125731	net/virtio-user: do not reset virtqueues for split ring Add missing braces to avoid resetting virtqueues unconditionally during reconnection. Fixes: `6ebbf4109f` ("net/virtio-user: fix packed ring server mode") Cc: stable@dpdk.org Signed-off-by: Tiwei Bie <tiwei.bie@intel.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 11:47:18 +01:00
Dekel Peled	ff44839929	net/mlx5: fix dirty array of actions Previous patch changed the format of struct mlx5_flow_dv_modify_hdr_resource, to use a flexible array for modification actions. In __flow_dv_translate() a union was defined with item of this struct, and an array of maximal possible size. Array elements are filled in several functions. In function flow_dv_convert_action_set_reg(), array element is filled partially, while the other fields of this array element are left uninitialized. This may cause failure of flow_dv_modify_hdr_resource_register() when calling driver function with the 'dirty' array. This patch updates flow_dv_convert_action_set_reg(), setting the selected array element fields while clearing the other fields. Other functions that fill the same array elements are also updated for clarity and proofing future use. Fixes: `024e95759c` ("net/mlx5: fix modify actions support limitation") Cc: stable@dpdk.org Signed-off-by: Dekel Peled <dekelp@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 11:15:53 +01:00
Michael Baum	4f8e6befe7	net/mlx5: fix memory regions release deadlock The mpx5 PMD maintains the list of devices for those the memory operation callback routines must be invoked to keep the device MRs (MR is the entity backing the hardware DMA transactions) consistent with the mapped memory. Each device context in the list is protected with dedicated lock on per device basis, which might be taken inside the callback routine. When device is closing the PMD frees all MRs by calling mlx5_mr_release(), that might call rte_free() under the taken device lock. If this rte_free call triggers the entire memory segment freeing it, in its turn, invokes the callback routine and attempt to take the lock inside this one causes the deadlock. The patch proposes the remove the device from the callback list first and then call mlx5_mr_release() and free the remaining device MRs explicitly. Fixes: `0e3d0525b2` ("net/mlx5: fix memory event callback list") Cc: stable@dpdk.org Signed-off-by: Michael Baum <michaelba@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-02-05 11:15:53 +01:00
Raslan Darawsheh	f9dd753942	net/failsafe: fix reported hash key size in device info Hash key size is missing from reported device info. This fills the hash key size in device info. Fixes: `4586be3743` ("net/failsafe: fix reported device info") Cc: stable@dpdk.org Signed-off-by: Raslan Darawsheh <rasland@mellanox.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-02-05 10:21:22 +01:00
Wei Hu (Xavier)	8a43329728	app/testpmd: fix uninitialized members when setting PFC Only a part of members in the local structure variable named pfc_conf are initialized in the function named cmd_priority_flow_ctrl_set_parsed when typing "set pfc_ctrl..." command, and others are random values. However, those uninitialized members may cause failure. This patch adds clearing zero operation before calling the API named rte_eth_dev_priority_flow_ctrl_set API with pfc_conf as the input parameter. Fixes: `9b53e542e9` ("app/testpmd: add priority flow control") Cc: stable@dpdk.org Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Xuan Li <lixuan47@hisilicon.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-02-05 09:51:21 +01:00
Wei Hu (Xavier)	3b687ec6e1	app/testpmd: fix initial value when setting PFC Currently, the initial values of the local structure variable named rx_tx_onoff_2_lfc_mode and rx_tx_onoff_2_pfc_mode are different in the similar part of these two following functions: cmd_link_flow_ctrl_set_parsed cmd_priority_flow_ctrl_set_parsed 1) The code snippset in cmd_link_flow_ctrl_set_parsed function: static enum rte_eth_fc_mode rx_tx_onoff_2_lfc_mode[2][2] = { {RTE_FC_NONE, RTE_FC_TX_PAUSE}, {RTE_FC_RX_PAUSE, RTE_FC_FULL} }; if (!cmd \|\| cmd == &cmd_link_flow_control_set_rx) rx_fc_en = (!strcmp(res->rx_lfc_mode, "on")) ? 1 : 0; if (!cmd \|\| cmd == &cmd_link_flow_control_set_tx) tx_fc_en = (!strcmp(res->tx_lfc_mode, "on")) ? 1 : 0; fc_conf.mode = rx_tx_onoff_2_lfc_mode[rx_fc_en][tx_fc_en]; <...> ret = rte_eth_dev_flow_ctrl_set(res->port_id, &fc_conf); <...> 2) The code snippset in cmd_priority_flow_ctrl_set_parsed function: static enum rte_eth_fc_mode rx_tx_onoff_2_pfc_mode[2][2] = { {RTE_FC_NONE, RTE_FC_RX_PAUSE}, {RTE_FC_TX_PAUSE, RTE_FC_FULL} }; rx_fc_enable = (!strncmp(res->rx_pfc_mode, "on",2)) ? 1 : 0; tx_fc_enable = (!strncmp(res->tx_pfc_mode, "on",2)) ? 1 : 0; pfc_conf.fc.mode = rx_tx_onoff_2_pfc_mode[rx_fc_enable][tx_fc_enable]; <...> ret = rte_eth_dev_priority_flow_ctrl_set(res->port_id, &pfc_conf); <...> The initial value of rx_tx_onoff_2_pfc_mode is wrong, it should be the same as rx_tx_onoff_2_lfc_mode. Fixes: `9b53e542e9` ("app/testpmd: add priority flow control") Cc: stable@dpdk.org Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com> Signed-off-by: Xuan Li <lixuan47@hisilicon.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-02-05 09:51:21 +01:00
Ori Kam	bd164530ec	app/testpmd: fix copy of dynamic flag name When working with testpmd and setting the dynflag name, we copy the name given by the cmd to the dynflag name. The issue is that the size of the dynflag name is smaller then the string used by testpmd. This commit solves this issue by checking that the length of the requested flag name is not too long. Coverity issue: 353610 Fixes: `b57b66a97e` ("app/testpmd: support mbuf dynamic flag") Signed-off-by: Ori Kam <orika@mellanox.com> Acked-by: Bernard Iremonger <bernard.iremonger@intel.com>	2020-02-05 09:51:21 +01:00
Július Milan	19d4c1aee8	net/memif: add link info This information is useful or needed for user applications as t-rex. Signed-off-by: Július Milan <jmilan.dev@gmail.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-02-05 09:51:21 +01:00
Ori Kam	56d5c1eedf	app/testpmd: fix uninitialized members of MPLS Some of the members of the MPLS struct are not initialized. This commit init the uninitialized members. Coverity issue: 325735 Fixes: `3e77031be8` ("app/testpmd: add MPLSoGRE encapsulation") Cc: stable@dpdk.org Signed-off-by: Ori Kam <orika@mellanox.com> Reviewed-by: Ferruh Yigit <ferruh.yigit@intel.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	75dd0ae917	vdpa/mlx5: disable RoCE In order to support virtio queue creation by the FW, RoCE mode should be disabled in the device. Do it by netlink which is like the devlink tool commands: 1. devlink dev param set pci/[pci] name enable_roce value false cmode driverinit 2. devlink dev reload pci/[pci] Or by sysfs which is like: echo 0 > /sys/bus/pci/devices/[pci]/roce_enable The IB device is matched again after ROCE disabling. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	31b9c29c86	vdpa/mlx5: support close and config operations Support dev_conf and dev_conf operations. These operations allow vdpa traffic. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	9d39e57f21	vdpa/mlx5: support live migration Add support for live migration feature by the HW: Create a single Mkey that maps the memory address space of the VHOST live migration log file. Modify VIRTIO_NET_Q object and provide vhost_log_page, dirty_bitmap_mkey, dirty_bitmap_size, dirty_bitmap_addr and dirty_bitmap_dump_enable. Modify VIRTIO_NET_Q object and move state to SUSPEND. Query VIRTIO_NET_Q and get hw_available_idx and hw_used_idx. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	62c813706e	vdpa/mlx5: map doorbell The HW supports only 4 bytes doorbell writing detection. The virtio device set only 2 bytes when it rings the doorbell. Map the virtio doorbell detected by the virtio queue kickfd to the HW VAR space when it expects to get the virtio emulation doorbell. Use the EAL interrupt mechanism to get notification when a new event appears in kickfd by the guest and write 4 bytes to the HW doorbell space in the notification callback. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	af72fdb546	vdpa/mlx5: support queue state operation Add support for set_vring_state operation. Using DevX API the virtq state can be changed as described in PRM: enable - move to ready state. disable - move to suspend state. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	a5a1d98ddc	vdpa/mlx5: add basic steering configurations Add a steering object to be managed by a new file mlx5_vdpa_steer.c. Allow promiscuous flow to scatter the device Rx packets to the virtio queues using RSS action. In order to allow correct RSS in L3 and L4, split the flow to 7 flows as required by the device. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	2aa8444b00	vdpa/mlx5: support stateless offloads Add support for the next features in virtq configuration: VIRTIO_F_RING_PACKED, VIRTIO_NET_F_HOST_TSO4, VIRTIO_NET_F_HOST_TSO6, VIRTIO_NET_F_CSUM, VIRTIO_NET_F_GUEST_CSUM, VIRTIO_F_VERSION_1, These features support depends in the DevX capabilities reported by the device. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	bff7350110	vdpa/mlx5: prepare virtio queues The HW virtq object represents an emulated context for a VIRTIO_NET virtqueue which was created and managed by a VIRTIO_NET driver as defined in VIRTIO Specification. Add support to prepare and release all the basic HW resources needed the user virtqs emulation according to the rte_vhost configurations. This patch prepares the basic configurations needed by DevX commands to create a virtq. Add new file mlx5_vdpa_virtq.c to manage virtq operations. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	8395927cdf	vdpa/mlx5: prepare HW queues As an arrangement to the vitrio queues creation, a 2 QPs and CQ may be created for the virtio queue. The design is to trigger an event for the guest and for the vdpa driver when a new CQE is posted by the HW after the packet transition. This patch add the basic operations to create and destroy the above HW objects and to trigger the CQE events when a new CQE is posted. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	cc07a42da2	vdpa/mlx5: prepare memory regions In order to map the guest physical addresses used by the virtio device guest side to the host physical addresses used by the HW as the host side, memory regions are created. By this way, for example, the HW can translate the addresses of the packets posted by the guest and to take the packets from the correct place. The design is to work with single MR which will be configured to the virtio queues in the HW, hence a lot of direct MRs are grouped to single indirect MR. Create functions to prepare and release MRs with all the related resources that are required for it. Create a new file mlx5_vdpa_mem.c to manage all the MR related code in the driver. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	f7aaf477d6	vdpa/mlx5: support features get operations Add support for get_features and get_protocol_features operations. Part of the features are reported by the DevX capabilities. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	d830dc1642	vdpa/mlx5: support queues number operation Support get_queue_num operation to get the maximum number of queues supported by the device. This number comes from the DevX capabilities. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Matan Azrad	95276abaaf	vdpa/mlx5: introduce Mellanox vDPA driver Add a new driver to support vDPA operations by Mellanox devices. The first Mellanox devices which support vDPA operations are ConnectX-6 Dx and Bluefield1 HCA for their PF ports and VF ports. This driver is depending on rdma-core like the mlx5 PMD, also it is going to use mlx5 DevX to create HW objects directly by the FW. Hence, the common/mlx5 library is linked to the mlx5_vdpa driver. This driver will not be compiled by default due to the above dependencies. Register a new log type for this driver. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Reviewed-by: Maxime Coquelin <maxime.coquelin@redhat.com>	2020-02-05 09:51:21 +01:00
Alexander Kozyrev	26f1bae837	net/mlx5: add Rx/Tx burst mode info Get a burst mode information for Rx/Tx queues in mlx5. Provide callback functions to show this information in a "show rxq info" and "show txq info" output. Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:21 +01:00
Sunil Kumar Kori	9614459b1e	net/octeontx: fix user supplied MAC address index Earlier after a successful mac_addr_add operation, index was returned by underlying layer which was unused but same as provided by DPDK API. So API is enhanced to use application provided index location to add MAC address entry. Fixes: `e4373bf1b3` ("net/octeontx: add unicast MAC filter") Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Harman Kalra <hkalra@marvell.com>	2020-02-05 09:51:21 +01:00
Sunil Kumar Kori	9e399b88ce	net/octeontx: fix memory leak of MAC address table MAC address table is allocated during octeontx device create and same is used to maintain list of MAC address associated to port. This table is not getting freed niether in case of error nor during graceful shutdown of port. Patch fixes memory required memory for both the cases as mentioned. Fixes: `f18b146c49` ("net/octeontx: create ethdev ports") Cc: stable@dpdk.org Signed-off-by: Sunil Kumar Kori <skori@marvell.com> Acked-by: Harman Kalra <hkalra@marvell.com>	2020-02-05 09:51:21 +01:00
Kiran Kumar K	184a323573	net/octeontx2: fix Tx flow control for HIGIG Tx flow controlled is disabled in the Ax silicon version due to an errata. This errata is not applicable for HIGIG Tx flow control, therefore not enabling in HIGIG case. Fixes: `602009ee2d` ("net/octeontx2: support HIGIG2") Cc: stable@dpdk.org Signed-off-by: Kiran Kumar K <kirankumark@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2020-02-05 09:51:21 +01:00
Lunyuan Cui	40163f9e17	net/i40e: fix multi-queue Rx interrupt for VF The interrupt vector which bind to queues should not be larger than the max available vector. It will cause port start failed. This patch changed the judgement condition of the limited vector id. It can effectively avoid vector id out of range. Fixes: `6a6cf5f88b` ("net/i40e: enable multi-queue Rx interrupt for VF") Signed-off-by: Lunyuan Cui <lunyuanx.cui@intel.com> Reviewed-by: Qiming Yang <qiming.yang@intel.com>	2020-02-05 09:51:21 +01:00
Qi Zhang	89a532ba87	net/ice: fix GTP-U rule conflict The patch distinguishes fdir rules for GTPU with or without extend header, so flow to match below patterns can be created correctly. 1. eth / ipv4 / udp / gtpu teid is 10 / ... 2. eth / ipv4 / udp / gtpu teid is 10 / gtp_psc / ... 3. eth / ipv4 / udp / gtpu / gtp_psc qfi is 10 / ... 4. eth / ipv4 / udp / gtpu teid is 10 / gtp_psc is 10 / ... Fixes: `efc16c6214` ("net/ice: support flow director GTPU tunnel") Cc: stable@dpdk.org Signed-off-by: Qi Zhang <qi.z.zhang@intel.com> Acked-by: Xiaolong Ye <xiaolong.ye@intel.com>	2020-02-05 09:51:21 +01:00
Yunjian Wang	eb00a7f123	net/e1000: use macro for PCI log format Use PCI_PRI_FMT instead of "%04d:%02d:%02d:%d" print format. Signed-off-by: Yunjian Wang <wangyunjian@huawei.com> Acked-by: Xiaolong Ye <xiaolong.ye@intel.com>	2020-02-05 09:51:21 +01:00
Raslan Darawsheh	90456726eb	net/mlx5: fix VXLAN-GPE item translation Currently, when using VXLAN-GPE or VXLAN item in the flow both are being treated the same with flags 0x8 in VXLAN header. Which mean the matching of the item VXLAN-GPE will match any VXLAN packet. This fixes the translation of VXLAN GPE item into PMD flow item. Which will by default set the flags to VXLAN-GPE to be 0xc. Fixes: `3d69434113` ("net/mlx5: add Direct Verbs validation function") Cc: stable@dpdk.org Signed-off-by: Raslan Darawsheh <rasland@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-02-05 09:51:21 +01:00
Krzysztof Kanas	b47e7a11f5	doc: add secondary VF details in thunderx guide thunderx-nic uses secondary VF's to provide more queues to DPDK. Current instructions explain the concept but don't show an easy way to find which PCI id is primary and which is secondary VF's. This patch extending the documentation of secondary VF w.r.t the enumeration details. Signed-off-by: Krzysztof Kanas <kkanas@marvell.com> Acked-by: Jerin Jacob <jerinj@marvell.com>	2020-02-05 09:51:21 +01:00
Jerin Jacob	b4bf22d173	net/octeontx2: change default RSS hash calculation Before C0 HW revision, The RSS adder was computed based the following static formula. rss_adder<7:0> = flow_tag<7:0> ^ flow_tag<15:8> ^ flow_tag<23:16> ^ flow_tag<31:24> The above scheme has the following drawbacks: 1) It is not in line with other standard NIC behavior. 2) There can be an SW use case where SW can compute the hash upfront using Toeplitz function and predict the queue selection to optimize some packet lookup function. The nonstandard way of doing XOR makes the consumer to not predict the queue selection. C0 HW revision onward, The HW can configure the rss_adder<7:0> as flow_tag<7:0> to align with standard NICs. This patch adds an option to select legacy RSS adder mode using tag_as_xor=1 devargs option while keeping the standard NIC behavior as default. Signed-off-by: Jerin Jacob <jerinj@marvell.com>	2020-02-05 09:51:21 +01:00
Alfredo Cardigliano	6645b283eb	net/ionic: fix packet type mask Fix the IONIC_RXQ_COMP_PKT_TYPE_MASK define. Coverity issue: 353608 Fixes: `01489e5d79` ("net/ionic: add hardware structures definitions") Signed-off-by: Alfredo Cardigliano <cardigliano@ntop.org>	2020-02-05 09:51:21 +01:00
Somnath Kotur	8522f7b12c	net/bnxt: remove spurious warning in Rx handler HW seems to populate the cfa code in the Rx descriptor even if an explicit flow rule is not configured via application as there might be a default rule configured in HW even for promisc mode. Fixes: `94eb699bc8` ("net/bnxt: support flow mark action") Signed-off-by: Somnath Kotur <somnath.kotur@broadcom.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2020-02-05 09:51:21 +01:00
Alexander Kozyrev	8e46d4e18f	common/mlx5: improve assert control Use the MLX5_ASSERT macros instead of the standard assert clause. Depends on the RTE_LIBRTE_MLX5_DEBUG configuration option to define it. If RTE_LIBRTE_MLX5_DEBUG is enabled MLX5_ASSERT is equal to RTE_VERIFY to bypass the global CONFIG_RTE_ENABLE_ASSERT option. If RTE_LIBRTE_MLX5_DEBUG is disabled, the global CONFIG_RTE_ENABLE_ASSERT can still make this assert active by calling RTE_VERIFY inside RTE_ASSERT. Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:21 +01:00
Alexander Kozyrev	0afacb04f5	common/mlx5: remove NDEBUG Use the RTE_LIBRTE_MLX5_DEBUG configuration flag to get rid of dependency on the NDEBUG definition. This is a preparation step to switch from standard assert clauses to DPDK RTE_ASSERT ones in MLX5 driver. Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:21 +01:00
Alexander Kozyrev	8e08df22f0	net/mlx4: improve assert control Use the MLX4_ASSERT macros instead of the standard assert clause. Depends on the RTE_LIBRTE_MLX4_DEBUG configuration option to define it. If RTE_LIBRTE_MLX4_DEBUG is enabled MLX4_ASSERT is equal to RTE_VERIFY to bypass the global CONFIG_RTE_ENABLE_ASSERT option. If RTE_LIBRTE_MLX4_DEBUG is disabled, the global CONFIG_RTE_ENABLE_ASSERT can still make this assert active by calling RTE_VERIFY inside RTE_ASSERT. Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:21 +01:00
Alexander Kozyrev	e99fdaa7a3	net/mlx4: remove NDEBUG flag Use the RTE_LIBRTE_MLX4_DEBUG compilation flag to get rid of dependency on the NDEBUG definition. This is a preparation step to switch from standard assert clauses to DPDK RTE_ASSERT ones in MLX4 driver. Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:20 +01:00
Alexander Kozyrev	b80924d677	mk: remove promotion of icc warnings as errors Remove -Werror-all flag in ICC configuration file to stop treating ICC warnings as errors in DPDK due to many false positives. We are using GCC and Clang as a benchmark for warnings anyway for simplification. Suggested-by: Thomas Monjalon <thomas@monjalon.net> Signed-off-by: Alexander Kozyrev <akozyrev@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2020-02-05 09:51:20 +01:00
Viacheslav Ovsiienko	cacb44a099	net/mlx5: add no-inline Tx flag This patch adds support for dynamic flag that hints transmit datapath do not copy data to the descriptors. This flag is useful when data are located in the memory of another (not NIC) physical device and copying to the host memory is undesirable. This hint flag is per mbuf for multi-segment packets. This hint flag might be partially ignored if: - hardware requires minimal data header to be inline into descriptor, it depends on the hardware type and its configuration. In this case PMD copies the minimal required number of bytes to the descriptor, ignoring the no inline hint flag, the rest of data is not copied. - VLAN tag insertion offload is requested and hardware does not support this options. In this case the VLAN tag is inserted by software means and at least 18B are copied to descriptor. Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-02-05 09:51:20 +01:00
Ori Kam	efa79e68c8	net/mlx5: support fine grain dynamic flag The inline feature is designed to save PCI bandwidth by copying some of the data to the wqe. This feature if enabled works for all packets. In some cases when using external memory, the PCI bandwidth is not relevant since the memory can be accessed by other means. This commit introduce the ability to control the inline with mbuf granularity. In order to use this feature the application should register the field name, and restart the port. Signed-off-by: Ori Kam <orika@mellanox.com> Signed-off-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com> Acked-by: Matan Azrad <matan@mellanox.com>	2020-02-05 09:51:20 +01:00
Matan Azrad	fbde43310f	net/mlx5: make FDB default rule optional There are RDMA-CORE versions which are not supported multi-table for some Mellanox mlx5 devices. Hence, the optimization added in commit [1] which forwards all the FDB traffic to table 1 cannot be configured. Make the above optimization optional: Do not fail when either table 1 cannot be created or the jump rule (all =>jump to table 1) is not configured successfully. In this case, all the flows will be configured to table 0. [1] commit `b67b4ecbde` ("net/mlx5: skip table zero to improve insertion rate") Cc: stable@dpdk.org Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:20 +01:00
Matan Azrad	fa69eaef5f	common/mlx5: support ROCE disable through Netlink Add new 4 Netlink commands to support enable/disable ROCE: 1. mlx5_nl_devlink_family_id_get to get the Devlink family ID of Netlink general command. 2. mlx5_nl_enable_roce_get to get the ROCE current status. 3. mlx5_nl_driver_reload - to reload the device kernel driver. 4. mlx5_nl_enable_roce_set - to set the ROCE status. When the user changes the ROCE status, the IB device may disappear and appear again, so DPDK driver should wait for it and to restart itself. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:20 +01:00
Matan Azrad	654810b568	common/mlx5: share Netlink commands Move Netlink mechanism and its dependencies from net/mlx5 to common/mlx5 in order to be ready to use by other mlx5 drivers. The dependencies are BITFIELD defines, the ppc64 compilation workaround for bool type and the function mlx5_translate_port_name. Update build mechanism accordingly. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:20 +01:00
Matan Azrad	f22442cb5d	net/mlx5: reduce Netlink commands dependencies As an arrangment for Netlink command moving to the common library, reduce the net/mlx5 dependencies. Replace ethdev class command parameters. Improve Netlink sequence number mechanism to be controlled by the mlx5 Netlink mechanism. Move mlx5_nl_check_switch_info to mlx5_nl.c since it is the only one which uses it. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:20 +01:00
Matan Azrad	c12671e3b7	net/mlx5: separate Netlink command interface The Netlink commands interfaces is included in the mlx5.h file with a lot of other PMD interfaces. As an arrangement to make the Netlink commands shared with different PMDs, this patch moves the Netlink interface to a new file called mlx5_nl.h. Move non Netlink pure vlan commands from mlx5_nl.c to the mlx5_vlan.c. Rename Netlink commands and structures to use prefix mlx5_nl. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:20 +01:00
Matan Azrad	d768f324d6	net/mlx5: select driver by class device argument There might be a case that one Mellanox device can be probed by multiple mlx5 drivers. One case is that any mlx5 vDPA device can be probed by both net/mlx5 and vdpa/mlx5. Add a new mlx5 common API to get the requested driver by devargs: class=[net/vdpa]. Skip net/mlx5 PMD probing while the device is selected to be probed by the vDPA driver. Signed-off-by: Matan Azrad <matan@mellanox.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@mellanox.com>	2020-02-05 09:51:20 +01:00

... 6 7 8 9 10 ...

21407 Commits