freebsd-skq

Author	SHA1	Message	Date
Slava Shwartsman	63cc6d1bc2	mlx5: Convert some spaces into tabs and use device_printf() instead of printf(). Submitted by: hselasky@ Approved by: hselasky (mentor) MFC after: 1 week Sponsored by: Mellanox Technologies	2018-12-05 13:42:36 +00:00
Slava Shwartsman	abb28d287b	mlx5: Add SRQ fixes from Linux Combine multiple fixes from Linux to SRQ. Linux commits: c73b791 IB/mlx5: Assign SRQ type earlier 0fd27a8 IB/mlx5: Fix out-of-bound access c2b37f7 IB/mlx5: Fix integer overflows in mlx5_ib_create_srq d63c467 RDMA/mlx5: Fix memory leak in mlx5_ib_create_srq() error path Approved by: hselasky (mentor) MFC after: 1 week Sponsored by: Mellanox Technologies	2018-12-05 13:42:06 +00:00
Slava Shwartsman	3b21d18587	mlx5: Fix for potential memory leaks. Make sure allocated data gets freed in error cases. Submitted by: hselasky@ Approved by: hselasky (mentor) MFC after: 1 week Sponsored by: Mellanox Technologies	2018-12-05 13:41:37 +00:00
Slava Shwartsman	07b624ed71	mlx5: Discard unused return values. Submitted by: hselasky@ Approved by: hselasky (mentor) MFC after: 1 week Sponsored by: Mellanox Technologies	2018-12-05 13:41:06 +00:00
Slava Shwartsman	843a89d37e	mlx5: Raise fatal IB event when sys error occurs All other mlx5_events report the port number as 1 based, which is how FW reports it in the port event EQE. Reporting 0 for this event causes mlx5_ib to not raise a fatal event notification to registered clients due to a seemingly invalid port. All switch cases in mlx5_ib_event that go through the port check are supposed to set the port now, so just do it once at variable declaration. Linux commit: aba462134634b502d720e15b23154f21cfa277e5 Approved by: hselasky (mentor) MFC after: 1 week Sponsored by: Mellanox Technologies	2018-12-05 13:40:36 +00:00
Slava Shwartsman	2bf40c3608	mlx5: Fix integer overflow while resizing CQ The user can provide very large cqe_size which will cause to integer overflow. Linux commit: 28e9091e3119933c38933cb8fc48d5618eb784c8 Approved by: hselasky (mentor) MFC after: 1 week Sponsored by: Mellanox Technologies	2018-12-05 13:40:05 +00:00
Slava Shwartsman	0c79f82cf0	mlx5: Notify user that the ConnectX-6 shutdown its port due to power limitation If power exceed the slot limit, or slot limit is unknown the ConnectX-6 firmware will shutdown its port. Inform the user via debug message. MFC after: 3 days Approved by: hselasky (mentor), kib (mentor) Sponsored by: Mellanox Technologies	2018-10-22 10:38:38 +00:00
Hans Petter Selasky	6ed134c41b	Make the MSIX module parameter limit per device, in mlx5en(4). MFC after: 3 days Approved by: re (marius) Sponsored by: Mellanox Technologies	2018-09-06 12:41:09 +00:00
Hans Petter Selasky	16ae32f927	Add support for receive side scaling stride, RSSS, in mlx5en(4). The receive side scaling stride parameter is a value which define the interval between active receive side queues. The traffic for the inactive queues is redirected to the nearest active queue by use of modulus. The default value of this parameter is one, which means all receive side queues are used. The point of this feature is to redirect more traffic to fewer receive side queues in order to take more advantage of sorted large receive offload, sorted LRO. The sorted LRO works better when more packets are accumulated per service interval. MFC after: 3 days Approved by: re (marius) Sponsored by: Mellanox Technologies	2018-09-06 12:28:06 +00:00
Hans Petter Selasky	0be5034007	Don't stall transmit queue on drops in mlx5en(4). When a transmitted packet is dropped don't stall the transmit queue. MFC after: 3 days Approved by: re (marius) Sponsored by: Mellanox Technologies	2018-09-06 12:19:36 +00:00
Hans Petter Selasky	2d32b0a304	Maximum number of mbuf frags is off-by-one for worst case scenario in mlx5en(4). Inspecting the PRM no more than 0x3F data segments, DS, of size 16 bytes is allowed. Worst case scenario summary of DS usage: Header is fixed: 2 DS Maximum inlining: 98 => (98 - 2) / 16 = 6 DS Remainder: 0x3F - 2 - 6 = 55 DS (mbuf frags) Previously a value of 56 DS was used and this would work in the normal case because not all inline data area was used up. MFC after: 3 days Approved by: re (marius) Sponsored by: Mellanox Technologies	2018-09-06 11:06:07 +00:00
Hans Petter Selasky	7b9b93a8dd	Update version information for the mlx5 and mlx5en(4) modules. While at it bump some copyright dates. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-18 10:12:53 +00:00
Hans Petter Selasky	0539900214	Do not inline transmit headers and use HW VLAN tagging if supported by mlx5en(4). Query the minimal inline mode supported by the card. When creating a send queue, cache the queried mode and optimize the transmit if no inlining is required. In this case, we can avoid touching the headers cache line and avoid dirtying several more lines by copying headers into the send WQEs. Also, if no inline headers are used, hardware assists in the VLAN tag framing. Submitted by: kib@, slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-18 10:03:30 +00:00
Hans Petter Selasky	90c8e44125	Use a mbuf header instead of a mbuf cluster for debugging interrupts in mlx5en(4). MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:53:37 +00:00
Hans Petter Selasky	a6b2d28d05	Add module parameter to limit number of MSIX EQ vectors in mlx5en(4). For setups having a large amount of PCI devices, it makes sense to limit the number of MSIX vectors per PCI device, in order to avoid running out of IRQ vectors. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:47:56 +00:00
Hans Petter Selasky	aa9f073c9b	Add missing newline. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:43:43 +00:00
Hans Petter Selasky	2f17f76aa4	Handle jumbo frames without requiring big clusters in mlx5en(4). The scatter list is formed by the chunks of MCLBYTES each, and larger than default packets are returned to the stack as the mbuf chain. Submitted by: kib@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:42:05 +00:00
Hans Petter Selasky	f8c3349737	Enable both receive and transmit pauseframes by default in mlx5en(4). MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:21:02 +00:00
Hans Petter Selasky	f2b4782c81	Add context numbers for HW elements in mlx5en(4). To access the data, set sysctl dev.mce.N.conf.debug_stats to 1. This enables the sysctl node dev.mce.N.hw_ctx_debug. Its content is the mapping of each channel' number to used receive queue and associated completion queue, set of the transmit queues numbers and corresponding completion queues. Trimmed example output: channel 30 rq 188 cq 1085 channel 30 tc 0 sq 187 cq 1084 channel 31 rq 191 cq 1087 channel 31 tc 0 sq 190 cq 1086 MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:18:01 +00:00
Hans Petter Selasky	a880c1ff6a	Do not hint about 'trust both' mode when the mlx5en(4) hardware does not support it. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:11:30 +00:00
Hans Petter Selasky	f0474ab919	Correctly write atomic variable in mlx5en(4). MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 11:08:40 +00:00
Hans Petter Selasky	52ff436841	Remove redundant call to mlx5_vsc_find_cap() in mlx5core. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 10:27:46 +00:00
Hans Petter Selasky	6d54b22db7	Make sure the state variable is set atomically instead of using a mutex in mlx5core. Device detach and setting error state may deadlock over the interface mutex like this: a) Detach code in mlx5en waits until error state is set while the interface mutex is locked. b) The set error handler needs to lock the interface mutex before it can set the error state. The solution is to use atomics to set the error state. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 10:20:01 +00:00
Hans Petter Selasky	b575d8c850	Refactor access to CR-space into using VSC APIs in mlx5core. Remove no longer used files and APIs. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 10:16:32 +00:00
Hans Petter Selasky	9fc929d2e2	Remove redundant newline character in mlx5core. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 10:11:00 +00:00
Hans Petter Selasky	18450a3b10	Update version information for the mlx5ib module. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 10:07:40 +00:00
Hans Petter Selasky	62bfa774ae	Don't pass unsupported events to ibcore from mlx5ib. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 09:59:55 +00:00
Hans Petter Selasky	14a1b9bd3a	Use static device naming instead of dynamic one in mlx5ib. When resetting mlx5core instances it can happen that the order of attach and detach for mlx5ib instances is changed. Take the unit number for mlx5_%d from the parent PCI device, similarly to what is done in mlx5en(4), so that there is a direct relationship between mce<N> and mlx5_<N>. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 09:58:11 +00:00
Hans Petter Selasky	ed0cee0bf4	Implement support for Differentiated Service Code Point, DSCP, in mlx5en(4). The DSCP feature is controlled using a set of sysctl(8) fields under the qos sysctl directory entry for mlx5en(4). For Routable RoCE QPs, the DSCP should be set in the QP's address path. The DSCP's value is derived from the traffic class. Linux commit: ed88451e1f2d400fd6a743d0a481631cf9f97550 MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 09:56:40 +00:00
Hans Petter Selasky	f4546fa376	Add support for prio-tagged traffic for RDMA in ibcore. When receiving a PCP change all GID entries are reloaded. This ensures the relevant GID entries use prio tagging, by setting VLAN present and VLAN ID to zero. The priority for prio tagged traffic is set using the regular rdma_set_service_type() function. Fake the real network device to have a VLAN ID of zero when prio tagging is enabled. This is logic is hidden inside the rdma_vlan_dev_vlan_id() function which must always be used to retrieve the VLAN ID throughout all of ibcore and the infiniband network drivers. The VLAN presence information then propagates through all of ibcore and so incoming connections will have the VLAN bit set. The incoming VLAN ID is then checked against the return value of rdma_vlan_dev_vlan_id(). MFC after: 1 week Sponsored by: Mellanox Technologies	2018-07-17 09:11:53 +00:00
Hans Petter Selasky	38535d6cab	Add support for hardware rate limiting to mlx5en(4). The hardware rate limiting feature is enabled by the RATELIMIT kernel option. Please refer to ifconfig(8) and the txrtlmt option and the SO_MAX_PACING_RATE set socket option for more information. This feature is compatible with hardware transmit send offload, TSO. A set of sysctl(8) knobs under dev.mce.<N>.rate_limit are provided to setup the ratelimit table and also to fine tune various rate limit related parameters. Sponsored by: Mellanox Technologies	2018-05-29 14:04:57 +00:00
Matt Macy	4f6c66cc9c	UDP: further performance improvements on tx Cumulative throughput while running 64 netperf -H $DUT -t UDP_STREAM -- -m 1 on a 2x8x2 SKL went from 1.1Mpps to 2.5Mpps Single stream throughput increases from 910kpps to 1.18Mpps Baseline: https://people.freebsd.org/~mmacy/2018.05.11/udpsender2.svg - Protect read access to global ifnet list with epoch https://people.freebsd.org/~mmacy/2018.05.11/udpsender3.svg - Protect short lived ifaddr references with epoch https://people.freebsd.org/~mmacy/2018.05.11/udpsender4.svg - Convert if_afdata read lock path to epoch https://people.freebsd.org/~mmacy/2018.05.11/udpsender5.svg A fix for the inpcbhash contention is pending sufficient time on a canary at LLNW. Reviewed by: gallatin Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D15409	2018-05-23 21:02:14 +00:00
Matt Macy	d7c5a620e2	ifnet: Replace if_addr_lock rwlock with epoch + mutex Run on LLNW canaries and tested by pho@ gallatin: Using a 14-core, 28-HTT single socket E5-2697 v3 with a 40GbE MLX5 based ConnectX 4-LX NIC, I see an almost 12% improvement in received packet rate, and a larger improvement in bytes delivered all the way to userspace. When the host receiving 64 streams of netperf -H $DUT -t UDP_STREAM -- -m 1, I see, using nstat -I mce0 1 before the patch: InMpps OMpps InGbs OGbs err TCP Est %CPU syscalls csw irq GBfree 4.98 0.00 4.42 0.00 4235592 33 83.80 4720653 2149771 1235 247.32 4.73 0.00 4.20 0.00 4025260 33 82.99 4724900 2139833 1204 247.32 4.72 0.00 4.20 0.00 4035252 33 82.14 4719162 2132023 1264 247.32 4.71 0.00 4.21 0.00 4073206 33 83.68 4744973 2123317 1347 247.32 4.72 0.00 4.21 0.00 4061118 33 80.82 4713615 2188091 1490 247.32 4.72 0.00 4.21 0.00 4051675 33 85.29 4727399 2109011 1205 247.32 4.73 0.00 4.21 0.00 4039056 33 84.65 4724735 2102603 1053 247.32 After the patch InMpps OMpps InGbs OGbs err TCP Est %CPU syscalls csw irq GBfree 5.43 0.00 4.20 0.00 3313143 33 84.96 5434214 1900162 2656 245.51 5.43 0.00 4.20 0.00 3308527 33 85.24 5439695 1809382 2521 245.51 5.42 0.00 4.19 0.00 3316778 33 87.54 5416028 1805835 2256 245.51 5.42 0.00 4.19 0.00 3317673 33 90.44 5426044 1763056 2332 245.51 5.42 0.00 4.19 0.00 3314839 33 88.11 5435732 1792218 2499 245.52 5.44 0.00 4.19 0.00 3293228 33 91.84 5426301 1668597 2121 245.52 Similarly, netperf reports 230Mb/s before the patch, and 270Mb/s after the patch Reviewed by: gallatin Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D15366	2018-05-18 20:13:34 +00:00
Konstantin Belousov	952e75c763	mlx5en: Always allow VLAN id 0. According to the 802.1Q-2014 9.6 VLAN Tag Control Information, VID value 0 means that there is no VLAN tag assigned to the packet, and only PCP and DEI values from the tag are meaningful. Current flow table programming filter out such packets. When programming VLAN filter for flow table, unconditionally add rule which accept packets with VLAN id 0. The packets are already handled correctly by the network stack. Reviewed by: hselasky, slavash Sponsored by: Mellanox Technologies MFC after: 1 week	2018-05-02 20:22:03 +00:00
Hans Petter Selasky	28cfdee769	Bump driver version number in mlx5en(4). MFC after: 1 week Sponsored by: Mellanox Technologies	2018-04-04 10:45:06 +00:00
Hans Petter Selasky	d77004ab47	Remove unused structure field in mlx5core. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:58:58 +00:00
Hans Petter Selasky	76ee71dcd3	Bump mlx5core driver version. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:55:31 +00:00
Hans Petter Selasky	4d5fdbe9b8	Fix for use after free in mlx5core. Make sure the command completion handler is not called when the device is in internal error state. This can easily trigger use after free situations. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:50:45 +00:00
Hans Petter Selasky	ca2345a05d	Make sure Giant is locked when allocating bus resources in mlx5core. During health care IRQ resources will be reallocated. Newbus requires that Giant is locked before accessing these resources. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:49:35 +00:00
Hans Petter Selasky	92d23c82cd	Collect firmware dump when mlx5core is in device error state. Firmware dump collecting should be triggered in case firmware syndrome with request for reset bit is set. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:48:25 +00:00
Hans Petter Selasky	d28b6b55ba	Reorganize health recovery in mlx5core. - Move the semaphore locking and unlocking to the same function. - Flags are no longer needed if the reset and crdump will be done in the same function. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:45:48 +00:00
Hans Petter Selasky	3c1274bd64	Prepare for FW dump in error state in mlx5core. - Move firmware dump prep and cleanup to init_one() and remove_one() so that the init and cleanup will happen only upon driver reload. - Add some prints to indicate firmware dump. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:43:15 +00:00
Hans Petter Selasky	0a752b05a8	Properly check if crspace is supported in mlx5core. The old code checked for MLX5_CR_SPACE_DOMAIN which is irrelevant here. However, if dev->vsec_addr would be 0, an access to wrong offset would happen. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:39:27 +00:00
Hans Petter Selasky	4950c6ec72	Add missing newline character in print in mlx5core. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:35:31 +00:00
Brooks Davis	541d96aaaf	Use an accessor function to access ifr_data. This fixes 32-bit compat (no ioctl command defintions are required as struct ifreq is the same size). This is believed to be sufficent to fully support ifconfig on 32-bit systems. Reviewed by: kib Obtained from: CheriBSD MFC after: 1 week Relnotes: yes Sponsored by: DARPA, AFRL Differential Revision: https://reviews.freebsd.org/D14900	2018-03-30 18:50:13 +00:00
Hans Petter Selasky	9a3d0cf097	Remove redundant prototype to fix compilation with GCC. Reported by: jeff@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-25 08:55:53 +00:00
Hans Petter Selasky	c0cea51b46	Don't wait for completions when a mlx5en(4) device is in internal error state. If the device is in internal error state the hardware will not generate completions. Just move on to destroy the resources. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:38:12 +00:00
Hans Petter Selasky	9cd6fc88be	Fix incorrect page count when mlx5core is in internal error. Change page cleanup flow when in internal error to properly decrement the page counts when reclaiming pages. That prevents timing out waiting for extra pages that were actually cleaned up previously. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:35:59 +00:00
Hans Petter Selasky	94790180f3	Don't save PCI state when PCI error is detected in mlx5core. When a PCI error is detected the PCI state could be corrupt, don't save it in that flow. Save the state after initialization. After restoring the PCI state during slot reset save it again, restoring the state destroys the previously saved state info. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:34:35 +00:00
Hans Petter Selasky	f20b553d75	Add mutual exclusion mechanism for software reset of firmware in mlx5core. Since the FW can be shared between PCI functions it is common that more than one health poll will detected a failure, this can lead to multiple resets. The solution is to use a FW locking mechanism using semaphore space to provide a way to synchronize between functions. The FW semaphore is acquired via config cycle access. First the VSEC gateway must be acquired, then the semaphore can be locked by writing a value to it and confirmed it's locked by reading the same value back. The process in the same to free the semaphore, except the value written should be zero. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:32:03 +00:00

1 2 3 4

193 Commits