numam-dpdk

Author	SHA1	Message	Date
Dekel Peled	7f6a3168ed	ethdev: fix RSS flow expansion in case of mismatch Function rte_flow_expand_rss() is used to expand a flow rule with partial pattern into several rules, to ensure all relevant packets are matched. It uses utility function rte_flow_expand_rss_item_complete(), to check if the last valid item in the flow rule pattern needs to be completed. For example the pattern "eth / ipv4 proto is 17 / end" will be completed with a "udp" item. This function returns "void" item in two cases: 1) The last item has empty spec, for example "eth / ipv4 / end". 2) The last item has spec that can't be expanded for RSS. For example the pattern "eth / ipv4 proto is 47 / end" ends with IPv4 item that has next protocol GRE. In both cases the flow rule may be expanded, but in the second case such expansion may create rules with invalid pattern. For example "eth / ipv4 proto is 47 / udp / end". In such a case the flow rule should not be expanded. This patch updates function rte_flow_expand_rss_item_complete(). Return value RTE_FLOW_ITEM_TYPE_END is used to indicate the flow rule should not be expanded. In such a case, rte_flow_expand_rss() will return with the original flow rule only, without any expansion. Fixes: fc2dd8dd492f ("ethdev: fix expand RSS flows") Cc: stable@dpdk.org Signed-off-by: Dekel Peled <dekelp@nvidia.com> Acked-by: Xiaoyu Min <jackmin@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com> Acked-by: Ori Kam <orika@nvidia.com>	2020-10-08 19:58:11 +02:00
Ferruh Yigit	7ae5c75f37	ethdev: check if queues are allocated before getting info A crash is detected when the '--txpkts=#' parameter is provided to testpmd, because queue information is requested before the queues have been allocated. Add a check to the queue info APIs ('rte_eth_rx_queue_info_get()' & 'rte_eth_tx_queue_info_get()') to protect against similar cases. Fixes: ba2fb4f022fc ("ethdev: check if queue setup when getting queue info") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com> Reviewed-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2020-10-08 19:58:11 +02:00
Rasesh Mody	effb1d0b95	net/qede: fix getting link details This patch fixes getting the current link details. Without this change, the link details can be inaccurate if the proper lock is not acquired. Fixes: 739a5b2f2b49 ("net/qede/base: use passed ptt handler") Cc: stable@dpdk.org Reported-by: Ferruh Yigit <ferruh.yigit@intel.com> Signed-off-by: Rasesh Mody <rmody@marvell.com> Signed-off-by: Igor Russkikh <irusskikh@marvell.com>	2020-10-08 19:58:11 +02:00
Alexander Kozyrev	d2d5760552	net/mlx5: fix Rx queue count calculation There are a few discrepancies in the Rx queue count calculation. The wrong index is used to calculate the number of used descriptors in an Rx queue in case of the compressed CQE processing. The global CQ index is used while we really need an internal index in a single compressed session to get the right number of elements processed. The total number of CQs should be used instead of the number of mbufs to find out about the maximum number of Rx descriptors. These numbers are not equal for the Multi-Packet Rx queue. Allow the Rx queue count calculation for all possible Rx bursts since CQ handling is the same for regular, vectorized, and multi-packet Rx queues. Fixes: 26f04883441a ("net/mlx5: support Rx queue count API") Cc: stable@dpdk.org Signed-off-by: Alexander Kozyrev <akozyrev@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2020-10-08 19:58:11 +02:00
Suanming Mou	3e8f3e51fd	net/mlx5: fix meter table definitions Because the metering and metadata features were developed at the same time, the metering and metadata table definitions conflict. This causes the meter suffix flow to jump to the same metadata table, which results in a flow dead loop. Adjust the metering table definitions to fix that issue. Fixes: 46a5e6bc6a85 ("net/mlx5: prepare meter flow tables") Cc: stable@dpdk.org Signed-off-by: Suanming Mou <suanmingm@nvidia.com> Acked-by: Viacheslav Ovsiienko <viacheslavo@nvidia.com>	2020-10-08 19:58:11 +02:00
Dekel Peled	38f9369d24	net/mlx5: fix DevX CQ attributes values Previous patch wrongly used rdma-core defined values, when preparing attributes for creating DevX CQ object. This patch adds the correct value definitions and uses them instead. Fixes: 08d1838f645a ("net/mlx5: implement CQ for Rx using DevX API") Cc: stable@dpdk.org Signed-off-by: Dekel Peled <dekelp@nvidia.com> Acked-by: Matan Azrad <matan@nvidia.com>	2020-10-08 19:58:11 +02:00
Ajit Khaparde	8b96a65ce5	net/bnxt: update HWRM structures Update the HWRM API to the newer 1.10.1.70 version. A few fields have been renamed because of this: rx_err_pkt -> rx_discard_pkts; rx_drop_pkts -> rx_error_pkts; tx_err_pkts -> tx_discard_pkts; tx_drop_pkts -> tx_error_pkts; link_signal_mode -> active_fec_signal_mode; tx_bd_long_hi.mss -> tx_bd_long_hi.kid_or_ts_high_mss; tx_bd_long_hi.hdr_size -> tx_bd_long_hi.kid_or_ts_low_hdr_size. Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2020-10-08 19:58:11 +02:00
Ajit Khaparde	7ed45b1a7c	net/bnxt: support RSS hash selection Add support to select RSS hash based on innermost or outermost headers. If an application is started without any specific settings the default mode configured by FW or HW shall be used. Signed-off-by: Ajit Khaparde <ajit.khaparde@broadcom.com>	2020-10-08 19:58:11 +02:00
Stephen Hemminger	bfa63c4d7b	ethdev: use mbuf bulk free API The mbuf library now has routine to free multiple buffers. Loop is no longer needed. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Reviewed-by: Andrew Rybchenko <arybchenko@solarflare.com>	2020-10-08 19:58:11 +02:00
Hongbo Zheng	e855bffa30	net/hns3: remove redundant return value assignment When an error occurs in the reset process, -EIO is returned. The assignment of ret here is redundant, so delete it. Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Hongbo Zheng	243651cb6c	net/hns3: check PCI config space reads This patch adds a return value check when calling the rte_pci_read_config function. Fixes: cea37e513329 ("net/hns3: fix FLR reset") Cc: stable@dpdk.org Signed-off-by: Hongbo Zheng <zhenghongbo3@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Chengchang Tang	fa29fe45a7	net/hns3: support queue start and stop The new generation hns3 network engine supports independent enabling and disabling of a single Tx/Rx queue. So, it can support the queue start and stop feature. In addition, when different numbers of Tx and Rx queues need to be enabled in some applications, the hns3 PMD does not need to create fake queues to enable these scenarios. This patch adds the queue start and stop feature for the new generation hns3 network engine and cancels the creation of fake queues on the new generation network engine. The previously improperly named queue-related functions are also renamed to improve readability. Signed-off-by: Chengchang Tang <tangchengchang@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Huisong Li	040bb0f725	net/hns3: set max scheduling rate based on actual board Currently, max scheduling rates configuration of pg, pri and port are set to 100000Mbps, which is the maximum bandwidth of hns3 network engine with revision_id equals 0x21. However, max scheduling rate configuration should be set to hardware based on the actual hardware board environment. The max_tm_rate in struct hns3_hw, meaning the rate, is obtained from firmware. So we should use the variable to configure the max scheduling rate. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Huisong Li	5d78d42b31	net/hns3: offload calculating shaping to firmware In order to allow more flexible selection of the shaping algorithm based on different versions of the hns3 network engine, move the algorithm for calculating shaping parameters to firmware. If bit HNS3_TM_RATE_VLD_B of the flag field of struct hns3_pri_shapping_cmd, hns3_pg_shapping_cmd or hns3_port_shapping_cmd is set to 1, the firmware of a network engine whose device revision_id is greater than or equal to 0x30 will recalculate the shaping parameters according to the xxx_rate field of struct hns3_xxx_shapping_cmd and the opcode of the scheduling level, and configure them to hardware. But the driver still needs to calculate shaping parameters and configure firmware, so as to be compatible with the network engine with revision_id equal to 0x21. The rate and the flag will be ignored on the network engine with revision_id equal to 0x21. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Wei Hu (Xavier)	f257760920	net/hns3: fix flow error type The API of rte_flow_error_set is used to pass detail error information to caller, this patch sets suitable type when calling rte_flow_error_set API. Fixes: fcba820d9b9e ("net/hns3: support flow director") Fixes: c37ca66f2b27 ("net/hns3: support RSS") Cc: stable@dpdk.org Signed-off-by: Chengwen Feng <fengchengwen@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Wei Hu (Xavier)	f8f8df765f	net/hns3: fix error type when validating RSS flow action Because the macro named RTE_FLOW_ERROR_TYPE_ACTION_CONF indicates an action configuration and the macro named RTE_FLOW_ERROR_TYPE_ACTION indicates a specific action, the driver needs to return the RTE_FLOW_ERROR_TYPE_ACTION_CONF type and notify the user when an RSS configuration is invalid within the actions list in the internal function named hns3_parse_rss_filter, called by the '.validate' ops implementation function named hns3_flow_validate. Besides, this patch removes some unnecessary judgment lines in hns3_parse_rss_filter. Fixes: c37ca66f2b27 ("net/hns3: support RSS") Cc: stable@dpdk.org Signed-off-by: Lijun Ou <oulijun@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Wei Hu (Xavier)	76d794566d	net/hns3: maximize queue number The maximum number of queues for the hns3 PF and VF drivers is 64 based on the hns3 network engine with revision_id equal to 0x21. Based on the hns3 network engine with revision_id equal to 0x30, the hns3 PF PMD driver can support up to 1280 queues, and the hns3 VF PMD driver can support up to 128 queues. The following points need to be modified to support maximizing the queue number and maintain better compatibility: 1) Maximizing the number of queues for the hns3 PF and VF PMD drivers. In the current version, VF is not supported when PF is driven by the hns3 PMD driver. If the maximum queue number allocated to the PF PMD driver is less than the total tqps_num allocated to this port, all remaining queues are mapped to the VF function, which is unreasonable. So we fix this so that all remaining queues are mapped to the PF function. Use RTE_LIBRTE_HNS3_MAX_TQP_NUM_PER_PF, which comes from the configuration file, to limit the queue number allocated to the PF device based on the hns3 network engine with revision_id greater than 0x30. The PF device still keeps the maximum of 64 queues based on the hns3 network engine with revision_id equal to 0x21. Remove the restriction of the macro HNS3_MAX_TQP_NUM_PER_FUNC on the maximum number of queues in the hns3 VF PMD driver and use the value allocated by the hns3 PF kernel netdev driver. 2) According to the queue number allocated to the PF device, a variable array for Rx and Tx queues is dynamically allocated to record the statistics of Rx and Tx queues during the .dev_init ops implementation function. 3) Add an extended field in hns3_pf_res_cmd to support the case where the number of queues is greater than 1024. 4) Use the new base address of the Rx or Tx queue if the QUEUE_ID of the Rx or Tx queue is greater than 1024. 5) Remove the queue id mask and use all bits of the actual queue_id as the queue_id to configure hardware. 6) Currently, bits 0~9 of qset_id in hns3_nq_to_qs_link_cmd, used to record the actual qset id, and bit 10 as the VLD bit are configured to hardware. So we also need to use bits 11~15 when the actual qset_id is greater than 1024. 7) The number of queue sets differs between network engines. We use it to calculate the group number and configure it to hardware in the backpressure configuration. 8) Add check operations for the number of Rx and Tx queues the user configured when mapping queues to TC. Rx queue numbers under a single TC must be less than the rss_size_max supported by a single TC. Rx and Tx queue numbers are allocated to every TC by average. So Rx and Tx queue numbers must be an integer multiple of 2, or redundant queues are not available. 9) We can specify which packets enter the queue with a specific queue number when creating flow table rules by the rte_flow API. Currently, the driver uses bits 0~9 to record the queue_id. So it is necessary to extend the field by one bit to record the queue_id and configure it to hardware if the queue_id is greater than 1024. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
Huisong Li	9a7d3af22c	net/hns3: expand number of queues for one TC up to 512 The maximum number of queues for one TC that the hns3 PF PMD driver supports is 64 based on the hns3 network engine with revision_id equal to 0x21, while it is expanded up to 512 on the hns3 network engine with revision_id equal to 0x30. So the following points need to be modified to maintain better compatibility. 1) Use an extended rss_size_max field as the maximum queue number of one TC that the PF driver supports. 2) The data type of the RSS redirection table needs to be changed from uint8_t to uint16_t. 3) rss_tc_mode modification: the bit width of tc_offset, meaning the Rx queue index, has to expand from 10 bits to 11 bits. The tc_size, meaning the exponent with base 2 of queues supported on a TC, needs to expand from 3 bits to 4 bits. 4) RSS indirection table modification: currently, a field 7 bits wide is used to record the queue index for the RSS indirection table. It means that the PF needs to expand the queue index field to 9 bits. As the RSS indirection table config command reserved 4 bytes to configure the RSS queue index, an extended field can be added. So an entry of the RSS indirection table queue index has two fields to set: rss_result_l and rss_result_h, where rss_result_l records the lower 8 bits and rss_result_h records the higher 1 bit. In addition, modifications 2~4 are also compatible with the hns3 VF PMD driver. Signed-off-by: Huisong Li <lihuisong@huawei.com> Signed-off-by: Wei Hu (Xavier) <xavier.huwei@huawei.com>	2020-10-08 19:58:10 +02:00
John Daley	bb66d562ae	net/enic: share flow actions with same signature Flow actions are a limited resource on the Cisco VIC, but they can be shared between flows if they are exactly the same. Use a hash table and a reference count in the PMD to enable sharing actions with the same signature between flows. Signed-off-by: John Daley <johndale@cisco.com> Reviewed-by: Hyong Youb Kim <hyonkim@cisco.com>	2020-10-08 19:58:10 +02:00
Honnappa Nagarahalli	8e6fa199d1	maintainers: update for MCS lock Updating MAINTAINERS file for MCS lock. Signed-off-by: Honnappa Nagarahalli <honnappa.nagarahalli@arm.com> Acked-by: Phil Yang <phil.yang@arm.com>	2020-10-09 11:01:43 +02:00
David Marchand	0e995cbcfc	eal: fix experimental block for 20.11 In EAL, we try to sort the experimental symbols per the release they were introduced in. Fixes: 8929de043eb4 ("service: retrieve lcore active state") Signed-off-by: David Marchand <david.marchand@redhat.com> Acked-by: Thomas Monjalon <thomas@monjalon.net>	2020-10-08 15:20:51 +02:00
Cristian Dumitrescu	64eaee23ab	examples/pipeline: fix files for table update Coverity issue: 362744, 362745, 362882 Fixes: 5074e1d551 ("examples/pipeline: add configuration commands") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-08 15:09:28 +02:00
Cristian Dumitrescu	f63ba2005e	pipeline: fix instruction config free Coverity issue: 362901 Fixes: a1711f948d ("pipeline: add SWX Rx and extract instructions") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-08 15:09:28 +02:00
Cristian Dumitrescu	941717ffe1	pipeline: fix unused variable Coverity issue: 362855 Fixes: 75634474ca ("pipeline: add SWX instruction verifier") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-08 15:09:28 +02:00
Cristian Dumitrescu	0ebe8c38a3	pipeline: fix memory free Coverity issue: 362796, 362804, 362819, 362836, 362858, 362865, 362869 Fixes: 3ca60ceed7 ("pipeline: add SWX pipeline specification file") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-08 15:09:25 +02:00
Cristian Dumitrescu	c0c9dcef88	pipeline: fix argument check Coverity issue: 362789 Fixes: 3ca60ceed7 ("pipeline: add SWX pipeline specification file") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-08 15:04:55 +02:00
Cristian Dumitrescu	faa4536684	pipeline: fix resource leak Coverity issue: 362812 Fixes: b32c0a2c5e ("pipeline: add SWX table update high level API") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-08 15:01:07 +02:00
Cristian Dumitrescu	bacfdd908d	pipeline: fix memory leak Coverity issue: 362741 Fixes: b32c0a2c5e ("pipeline: add SWX table update high level API") Signed-off-by: Cristian Dumitrescu <cristian.dumitrescu@intel.com>	2020-10-08 15:00:42 +02:00
Kevin Laatz	64d0a9097d	examples/ioat: fix stats print Currently some of the status string at the top of the stats output is being cut off. To fix this, the status string array size has been increased. In addition to this, the "\n" has been moved to the printf, rather than having it in the last string, in case of future formatting issues due to truncation. Bugzilla ID: 536 Fixes: 632bcd9b5d4f ("examples/ioat: print statistics") Cc: stable@dpdk.org Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2020-10-08 14:38:02 +02:00
Kevin Laatz	2ae23f5647	raw/ioat: add fill operation Add fill operation enqueue support for IOAT and IDXD. The fill enqueue is similar to the copy enqueue, but takes a 'pattern' rather than a source address to transfer to the destination address. This patch also includes an additional test case for the new operation type. Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	3a377b10c2	raw/ioat: clean up use of common test function Now that all devices can pass the same set of unit tests, eliminate the temporary idxd_rawdev_test function and move the prototype for ioat_rawdev_test to the proper internal header file, to be used by all device instances. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	60927cc650	raw/ioat: add xstats tracking for idxd device Add update of the relevant stats for the data path functions and point the overall device struct xstats function pointers to the existing ioat functions. At this point, all necessary hooks for supporting the existing unit tests are in place so call them for each device. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	a32e194474	raw/ioat: move xstats functions to common file The xstats functions can be used by all ioat devices so move them from the ioat_rawdev.c file to ioat_common.c, and add the function prototypes to the internal header file. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	8636b9a18e	raw/ioat: create separate statistics structure Rather than having the xstats as fields inside the main driver structure, create a separate structure type for them. As part of the change, when updating the stats functions referring to the stats by the old path, we can simplify them to use the id to directly index into the stats structure, making the code shorter and simpler. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	2e35907532	raw/ioat: add info query for idxd device Add the info get function for DSA devices, returning just the ring size info about the device, same as is returned for existing IOAT/CBDMA devices. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	78ecbc66ec	raw/ioat: add data path for idxd device Add support for doing copies using DSA hardware. This is implemented by just switching on the device type field at the start of the inline functions. Since there is no hardware which will have both device types present this branch will always be predictable after the first call, meaning it has little to no perf penalty. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	2f22aeb197	raw/ioat: start and stop idxd device Add the start and stop functions for DSA hardware devices using the vfio/uio kernel drivers. For vdevs using the idxd kernel driver, the device must be started using sysfs before the device node appears for vdev use - making start/stop functions in the driver unnecessary. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	69c4162643	raw/ioat: configure idxd devices Add configure function for idxd devices, taking the same parameters as the existing configure function for ioat. The ring_size parameter is used to compute the maximum number of bursts to be supported by the driver, given that the hardware works on individual bursts of descriptors at a time. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	389d519785	raw/ioat: add datapath data structures for idxd devices Add in the relevant data structures for the data path for DSA devices. Also include a device dump function to output the status of each device. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Kevin Laatz	425fe89287	raw/ioat: probe idxd vdev For each vdev (DSA work queue) instance, create a rawdev instance. Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	ff06fa2cf3	raw/ioat: probe idxd PCI When a matching device is found via PCI probe create a rawdev instance for each queue on the hardware. Use empty self-test function for these devices so that the overall rawdev_autotest does not report failures. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	01863b9d23	raw/ioat: include example configuration script Devices managed by the idxd kernel driver must be configured for DPDK use before they can be used by the ioat driver. This example script serves both as a quick way to get the driver set up with a simple configuration, and as the basis for users to modify it and create their own configuration scripts. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Kevin Laatz	777edf43ae	raw/ioat: introduce vdev probe for DSA/idxd device The Intel DSA devices can be exposed to userspace via kernel driver, so can be used without having to bind them to vfio/uio. Therefore we add support for using those kernel-configured devices as vdevs, taking as parameter the individual HW work queue to be used by the vdev. Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	d09d396fad	raw/ioat: add skeleton for VFIO/UIO based DSA device Add in the basic probe/remove skeleton code for DSA devices which are bound directly to vfio or uio driver. The kernel module for supporting these uses the "idxd" name, so that name is used as function and file prefix to avoid conflict with existing "ioat" prefixed functions. Since we are adding new files to the driver and there will be common definitions shared between the various files, we create a new internal header file ioat_private.h to hold common macros and function prototypes. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Kevin Laatz	43f9b521a7	usertools: support binding Intel DSA device Intel Data Streaming Accelerator (Intel DSA) is a high-performance data copy and transformation accelerator which will be integrated in future Intel processors [1]. Add DSA device support to dpdk-devbind.py script. [1] https://01.org/blogs/2019/introducing-intel-data-streaming-accelerator Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	cae8a1b19e	raw/ioat: make HW register spec private Only a few definitions from the hardware spec are actually used in the driver runtime, so we can copy over those few and make the rest of the spec a private header in the driver. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	f55d185540	raw/ioat: add separate API for fence call Rather than having the fence signalled via a flag on a descriptor - which requires reading the docs to find out whether the flag needs to go on the last descriptor before, or the first descriptor after the fence - we can instead add a separate fence API call. This becomes unambiguous to use, since the fence call explicitly comes between two other enqueue calls. It also allows more freedom of implementation in the driver code. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	979e29ddbb	raw/ioat: rename functions to be operation-agnostic Since the hardware supported by the ioat driver is capable of operations other than just copies, we can rename the doorbell and completion-return functions to not have "copies" in their names. These functions are not copy-specific, and so would apply for other operations which may be added later to the driver. Also add a suitable warning using deprecation attribute for any code using the old functions names. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Bruce Richardson	507bf656bf	raw/ioat: split header file for readability Rather than having a single long complicated header file for general use we can split things so that there is one header with all the publicly needed information - data structs and function prototypes - while the rest of the internal details are put separately. This makes it easier to read, understand and use the APIs. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
Cheng Jiang	95b686a665	raw/ioat: add flag to control copying handle parameters Add a flag which controls whether rte_ioat_enqueue_copy and rte_ioat_completed_copies function should process handle parameters. Not doing so can improve the performance when handle parameters are not necessary. Signed-off-by: Cheng Jiang <cheng1.jiang@intel.com> Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Reviewed-by: Kevin Laatz <kevin.laatz@intel.com> Acked-by: Radu Nicolau <radu.nicolau@intel.com>	2020-10-08 14:33:20 +02:00
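
Commit 9a7d3af22c in the list above describes storing a 9-bit RSS queue index in two fields, rss_result_l (lower 8 bits) and rss_result_h (higher 1 bit). A minimal sketch of that split, assuming a simple two-byte entry: the struct layout, the macros and the helper names rss_entry_pack()/rss_entry_unpack() are hypothetical illustrations, not the hns3 driver's actual definitions.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical entry layout for illustration; not the driver's real struct. */
struct rss_ind_entry {
	uint8_t rss_result_l; /* lower 8 bits of the queue index */
	uint8_t rss_result_h; /* higher 1 bit of the queue index */
};

#define RSS_QUEUE_ID_BITS 9u
#define RSS_QUEUE_ID_MAX  ((1u << RSS_QUEUE_ID_BITS) - 1u) /* 511 */

/* Split a queue index (0..511) into the low-byte and high-bit fields. */
static struct rss_ind_entry
rss_entry_pack(uint16_t queue_id)
{
	struct rss_ind_entry e;

	assert(queue_id <= RSS_QUEUE_ID_MAX);
	e.rss_result_l = (uint8_t)(queue_id & 0xffu);
	e.rss_result_h = (uint8_t)((queue_id >> 8) & 0x1u);
	return e;
}

/* Rebuild the queue index from the two fields. */
static uint16_t
rss_entry_unpack(struct rss_ind_entry e)
{
	return (uint16_t)(((e.rss_result_h & 0x1u) << 8) | e.rss_result_l);
}
```

With this layout, a queue index of 300 would be stored as rss_result_l = 44 and rss_result_h = 1, and indexes up to 511 round-trip unchanged, which matches the 512 queues per TC mentioned in the commit message.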

24832 Commits