freebsd-skq

Author	SHA1	Message	Date
Hans Petter Selasky	28cfdee769	Bump driver version number in mlx5en(4). MFC after: 1 week Sponsored by: Mellanox Technologies	2018-04-04 10:45:06 +00:00
Hans Petter Selasky	d77004ab47	Remove unused structure field in mlx5core. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:58:58 +00:00
Hans Petter Selasky	76ee71dcd3	Bump mlx5core driver version. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:55:31 +00:00
Hans Petter Selasky	4d5fdbe9b8	Fix for use after free in mlx5core. Make sure the command completion handler is not called when the device is in internal error state. This can easily trigger use after free situations. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:50:45 +00:00
Hans Petter Selasky	ca2345a05d	Make sure Giant is locked when allocating bus resources in mlx5core. During health care IRQ resources will be reallocated. Newbus requires that Giant is locked before accessing these resources. MFC after: 3 days Sponsored by: Mellanox Technologies	2018-03-30 19:49:35 +00:00
Hans Petter Selasky	92d23c82cd	Collect firmware dump when mlx5core is in device error state. Firmware dump collecting should be triggered in case firmware syndrome with request for reset bit is set. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:48:25 +00:00
Hans Petter Selasky	d28b6b55ba	Reorganize health recovery in mlx5core. - Move the semaphore locking and unlocking to the same function. - Flags are no longer needed if the reset and crdump will be done in the same function. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:45:48 +00:00
Hans Petter Selasky	3c1274bd64	Prepare for FW dump in error state in mlx5core. - Move firmware dump prep and cleanup to init_one() and remove_one() so that the init and cleanup will happen only upon driver reload. - Add some prints to indicate firmware dump. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:43:15 +00:00
Hans Petter Selasky	0a752b05a8	Properly check if crspace is supported in mlx5core. The old code checked for MLX5_CR_SPACE_DOMAIN which is irrelevant here. However, if dev->vsec_addr would be 0, an access to wrong offset would happen. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:39:27 +00:00
Hans Petter Selasky	4950c6ec72	Add missing newline character in print in mlx5core. MFC after: 3 days Submitted by: slavash@ Sponsored by: Mellanox Technologies	2018-03-30 19:35:31 +00:00
Brooks Davis	541d96aaaf	Use an accessor function to access ifr_data. This fixes 32-bit compat (no ioctl command definitions are required as struct ifreq is the same size). This is believed to be sufficient to fully support ifconfig on 32-bit systems. Reviewed by: kib Obtained from: CheriBSD MFC after: 1 week Relnotes: yes Sponsored by: DARPA, AFRL Differential Revision: https://reviews.freebsd.org/D14900	2018-03-30 18:50:13 +00:00
Hans Petter Selasky	9a3d0cf097	Remove redundant prototype to fix compilation with GCC. Reported by: jeff@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-25 08:55:53 +00:00
Hans Petter Selasky	c0cea51b46	Don't wait for completions when a mlx5en(4) device is in internal error state. If the device is in internal error state the hardware will not generate completions. Just move on to destroy the resources. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:38:12 +00:00
Hans Petter Selasky	9cd6fc88be	Fix incorrect page count when mlx5core is in internal error. Change page cleanup flow when in internal error to properly decrement the page counts when reclaiming pages. That prevents timing out waiting for extra pages that were actually cleaned up previously. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:35:59 +00:00
Hans Petter Selasky	94790180f3	Don't save PCI state when PCI error is detected in mlx5core. When a PCI error is detected the PCI state could be corrupt, don't save it in that flow. Save the state after initialization. After restoring the PCI state during slot reset save it again, restoring the state destroys the previously saved state info. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:34:35 +00:00
Hans Petter Selasky	f20b553d75	Add mutual exclusion mechanism for software reset of firmware in mlx5core. Since the FW can be shared between PCI functions it is common that more than one health poll will detect a failure, this can lead to multiple resets. The solution is to use a FW locking mechanism using semaphore space to provide a way to synchronize between functions. The FW semaphore is acquired via config cycle access. First the VSEC gateway must be acquired, then the semaphore can be locked by writing a value to it and confirming it is locked by reading the same value back. The process is the same to free the semaphore, except the value written should be zero. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:32:03 +00:00
Hans Petter Selasky	fe242ba7c1	Issue a software reset on firmware assert in mlx5core. If a FW assert is considered fatal, indicated by a new bit in the health buffer, reset the FW. After the reset, follow the normal recovery flow. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:24:09 +00:00
Hans Petter Selasky	1900b6f887	Handle software reset of firmware in error flow in mlx5core. Some mlx5 adapter firmware allows the driver to reset the firmware in the event of an error. When a software reset is issued on any physical function all PFs enter reset state. This is a recoverable condition. The existing recovery flow was designed to allow the recovery of a VF after a PF driver reload. This patch expands the scope of that flow to recover PFs or VFs after a SW reset has been issued. When a software reset is issued the following occurs: 1. The NIC interface mode is set to SW_RESET (7) while the reset is in progress. 2. Once the reset completes the NIC interface mode is set to NIC disabled (1). After the reset has been issued (added in a subsequent patch) the health poll for other functions will detect that the NIC interface state has been set to disabled. This will cause it to enter the existing recovery flow. If the PCI is still working (meaning it doesn't return 0xff on all reads) it means recovery can proceed immediately instead of waiting 60 seconds. The error detection has also been refactored to avoid incorrect or misleading log messages. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:20:42 +00:00
Hans Petter Selasky	1fb6089c3b	Hide verbose proclamation of error when forced in mlx5core. When mlx5_enter_error_state() operation is forced by shutdown, the messages surrounding setting the error state are not informational and confuse users. Submitted by: kib@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:11:06 +00:00
Hans Petter Selasky	519774ea5a	Cancel delayed recovery work when unloading the mlx5core driver. linux commit 2a0165a034ac024b60cca49c61e46f4afa2e4d98 Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:09:09 +00:00
Hans Petter Selasky	c09025693b	Add support for fast unload in shutdown flow in mlx5core. This patch accumulates the following Linux commits: - 8812c24d28f4972c4f2b9998bf30b1f2a1b62adf net/mlx5: Add fast unload support in shutdown flow - 59211bd3b6329c3e5f4a90ac3d7f87ffa7867073 net/mlx5: Split the load/unload flow into hardware and software flows - 4525abeaae54560254a1bb8970b3d4c225d32ef4 net/mlx5: Expose command polling interface Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 18:02:20 +00:00
Hans Petter Selasky	4bb7662b09	Improve support for health recovery in mlx5core. This patch accumulates the following Linux commits: - 04c0c1ab38e95105d950db5b84e727637e149ce7 net/mlx5: PCI error recovery health care simulation - 0179720d6be2096b8d0a4d143254ff9e77747daa net/mlx5: Introduce trigger_health_work function - 3fece5d676939f42f434c63dfe1bd42d7d94e6f0 net/mlx5: Continue health polling until it is explicitly stopped Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 17:33:14 +00:00
Hans Petter Selasky	177781564f	Create designated workqueue for each mlx5en(4) device instance. The mlx5e_destroy_ifp() function may be called from the system workqueue and in this case trying to flush all works will cause a deadlock. Instead of using the system workqueue, create a designated workqueue for each mlx5en(4) device instance. Submitted by: slavash@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-23 16:59:51 +00:00
Conrad Meyer	918622dbc0	mlx5(4): Remove redundant declaration of mlx5_enter_error_state Broken in r330644. Sponsored by: Dell EMC Isilon	2018-03-10 00:59:48 +00:00
Konstantin Belousov	2cec152827	Make mlx5 compilable on ILP32 arches. Sponsored by: Mellanox Technologies MFC after: 1 week	2018-03-08 22:03:43 +00:00
Hans Petter Selasky	50e3ba10e8	Set correct SL in completion for RoCE in mlx5ib(4). There is a difference when parsing a completion entry between Ethernet and IB ports. When link layer is Ethernet the bits describe the type of L3 header in the packet. In the case when link layer is Ethernet and VLAN header is present the value of SL is equal to the 3 UP bits in the VLAN header. If VLAN header is not present then the SL is undefined and consumer of the completion should check if IB_WC_WITH_VLAN is set. While at it, this patch also fills the vlan_id field in the completion if present. linux commit 12f8fedef2ec94c783f929126b20440a01512c14 MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 16:27:31 +00:00
Hans Petter Selasky	0285276c92	Add call to setup firmware data dump structure during device load in mlx5core. Do not consider the inability to create a firmware dump fatal, but inform about the situation and allow the driver to attach. The device might not implement the needed VSC, or we might not know the layout of the registers map. In either case, only firmware dump functionality is limited, the network operations should be fine. Submitted by: kib@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 16:19:01 +00:00
Hans Petter Selasky	9cb83c4689	Avoid more LFENCE/SFENCE on x86 in mlx5en(4), by using the FreeBSD native fences. Submitted by: kib@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 15:58:30 +00:00
Hans Petter Selasky	7d69d339ad	Fix mlx5en(4) driver to properly call m_defrag(). When the mlx5en(4) driver was converted to using BUSDMA(9) the call to m_defrag() was moved after the part of the TX routine that strips the header from the mbuf chain. Before it called m_defrag it first trimmed off the now-empty mbufs from the start of the chain. This has the side effect of also removing the head of the chain that has M_PKTHDR set. m_defrag() will not defrag a chain that does not have M_PKTHDR set, thus it was effectively never defragging the mbuf chains. As it turns out, trimming the mbufs in this fashion is unnecessary since the call to bus_dmamap_load_mbuf_sg doesn't map empty mbufs anyway, so remove it. Differential Revision: https://reviews.freebsd.org/D12050 Submitted by: mjoras@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 15:53:04 +00:00
Hans Petter Selasky	1eb09ad434	Use vport rather than physical-port MTU in mlx5en(4). Set and report vport MTU rather than physical MTU. The driver will set both vport and physical port mtu and will rely on the query of vport mtu. SRIOV VFs have to report their MTU to their vport manager (PF), and this will allow them to work with any MTU they need without failing the request. Also for some cases where the PF is not a port owner, PF can work with MTU less than the physical port mtu if set physical port mtu didn't take effect. Based on Linux upstream commit: cd255efff9baadd654d6160e52d17ae7c568c9d3 Submitted by: Meny Yossefi <menyy@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 15:47:17 +00:00
Hans Petter Selasky	7c22ae80d2	Use the device unit number for naming the ifnet interface in mlx5en(4). Currently the ifnet interface is named mceX, where X is a monotonically incremented value. If the device is reset due to a fatal error, then the interface name will change. Using the device unit number will keep the naming consistent across the reset logic. Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 15:43:41 +00:00
Hans Petter Selasky	1a872967ad	Remove duplicate prototypes. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 15:37:09 +00:00
Hans Petter Selasky	e808190a59	Add kernel and userspace code to dump the firmware state of supported ConnectX-4/5 devices in mlx5core. The dump is obtained by reading a predefined register map from the non-destructive crspace, accessible by the vendor-specific PCIe capability (VSC). The dump is stored in preallocated kernel memory and managed by the mlx5tool(8), which communicates with the driver using a character device node. The utility allows to store the dump in format <address> <value> into a file, to reset the dump content, and to manually initiate the dump. A call to mlx5_fwdump() should be added at the places where a dump must be fetched automatically. The most likely place is right before a firmware reset request. Submitted by: kib@ MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 15:21:56 +00:00
Hans Petter Selasky	4b95c6659a	Add vendor specific capability interface support in mlx5core. Add the ability to access the vendor specific space gateway in order to support reading and writing data into the different configuration domains. Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 11:59:47 +00:00
Hans Petter Selasky	fba294620c	Use device_printf() instead of printf() when printing warnings and errors to dmesg(8) in mlx5core. Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 11:58:27 +00:00
Hans Petter Selasky	10b0804509	Add support for per priority flow control, PFC, to mlx5en(4). Add support for PFC and implement reading the per priority statistics using the sysctl(8) interface. PFC is used together with VLAN priority and can be enabled and disabled on a per priority basis. Global pause frames and PFC are incompatible features and surrounding logic has been added to warn the user about misconfiguration. Update relevant mlx5core APIs for PFC configuration. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 11:40:39 +00:00
Hans Petter Selasky	118063fb70	Add support for explicit congestion notification, ECN, to mlx5ib(4). ECN configuration and statistics is available through a set of sysctl(8) nodes under sys.class.infiniband.mlx5_X.cong . The ECN configuration nodes can also be used as loader tunables. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 11:23:14 +00:00
Hans Petter Selasky	788333d9a6	Use the autogenerated interface file for all commands in mlx5core. This patch accumulates the following Linux commits: - 90b3e38d048f09b22fb50bcd460cea65fd00b2d7 mlx5_core: Modify CQ moderation parameters - 09a7d9eca1a6cf5eb4f9abfdf8914db9dbd96f08 mlx5_core: QP/XRCD commands via mlx5 ifc - 1a412fb1caa2c1b77719ccb5ed8b0c3c2bc65da7 mlx5_core: Modify QP commands via mlx5 ifc - ec22eb53106be1472ba6573dc900943f52f8fd1e mlx5_core: MKey/PSV commands via mlx5 ifc - 73b626c182dff06867ceba996a819e8372c9b2ce mlx5_core: EQ commands via mlx5 ifc - 20ed51c643b6296789a48adc3bc2cc875a1612cf mlx5_core: Access register and MAD IFC commands via mlx5 ifc - a533ed5e179cd15512d40282617909d3482a771c mlx5_core: Pages management commands via mlx5 ifc - b8a4ddb2e8f44f872fb93bbda2d541b27079fd2b mlx5_core: Add MLX5_ARRAY_SET64 to fix BUILD_BUG_ON - af1ba291c5e498973cc325c501dd8da80b234571 mlx5_core: Refactor internal SRQ API - b06e7de8a9d8d1d540ec122bbdf2face2a211634 mlx5_core: Refactor device capability function - c4f287c4a6ac489c18afc4acc4353141a8c53070 mlx5_core: Unify and improve command interface Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 10:43:42 +00:00
Hans Petter Selasky	ca55159467	Fix race between PCI error handlers and health work in mlx5core. linux commit 05ac2c0b7438ea08c5d54b48797acf9b22cb2f6f Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 09:58:41 +00:00
Hans Petter Selasky	7053deeb3d	Avoid calling sleeping function from the health poll thread in mlx5core. linux commit c1d4d2e92ad670168a17a57dfa182a5a5baa72d4 Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 09:51:33 +00:00
Hans Petter Selasky	a2485fe5a6	Updates for PCI and health monitor recovery in mlx5core. This patch accumulates the following Linux commits: mlx5_health.c - 78ccb25861d76a8fc5c678d762180e6918834200 mlx5_core: Fix wrong name in struct - 171bb2c560f45c0427ca3776a4c8f4e26e559400 mlx5_core: Update health syndromes - 0144a95e2ad53a40c62148f44fb0c1f9d2a0d1e9 mlx5_core: Use accessor functions to read from device memory - ac6ea6e81a80172612e0c9ef93720f371b198918 mlx5_core: Use private health thread for each device - fd76ee4da55abb21babfc69310d321b9cb9a32e0 mlx5_core: Fix internal error detection conditions - 2241007b3d783cbdbaa78c30bdb1994278b6f9b9 mlx5: Clear health sick bit when starting health poll - 712bfef60912d91033cb25739f7444d5b8d8c59f mlx5: Fix version printout in case of health issue - 89d44f0a6c732db23b219be708e2fe1e03ee4842 mlx5_core: Add pci error handlers to mlx5_core driver mlx5_cmd.c - be87544de8df2b1eb34bcb5e32691287d96f9ec4 mlx5_core: Fix async commands return code - a31208b1e11df334d443ec8cace7636150bb8ce2 mlx5_core: New init and exit flow for mlx5_core - 020446e01eebc9dbe7eda038e570ab9c7ab13586 mlx5_core: Prepare cmd interface to system errors handling - 89d44f0a6c732db23b219be708e2fe1e03ee4842 mlx5_core: Add pci error handlers to mlx5_core driver - 0d834442cc247c7b3f3bd6019512ae03e96dd99a mlx5: Fix teardown errors that happen in pci error handler mlx5_main.c - 5fc7197d3a256d9c5de3134870304b24892a4908 mlx5: Add pci shutdown callback Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-08 09:47:09 +00:00
Hans Petter Selasky	2e9c3a4f99	Implement priority to traffic class mapping in mlx5core. Add support for mapping priority to traffic class via sysctl. Submitted by: Slava Shwartsman <slavash@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 15:23:07 +00:00
Hans Petter Selasky	cfc9c386eb	Implement rate limit per traffic class in mlx5core. Add support for rate limiting traffic class via sysctl. Submitted by: Slava Shwartsman <slavash@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 15:17:36 +00:00
Hans Petter Selasky	d91421515a	Implement missing query for current port rate in mlx5ib(4). - Factor out port speed definitions into new port.h header file, similarly as done in Linux upstream. - Correct two existing port speed definitions in mlx5en according to Linux upstream. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 15:03:11 +00:00
Hans Petter Selasky	ecb4fcc48e	Add log message for unsupported QSFPs in mlx5core. Submitted by: Matthew Finlay <matt@mellanox.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 14:51:50 +00:00
Hans Petter Selasky	2289559114	Make sure default VNET is set when adding a new interface in mlx5core. Adding an interface might be done outside the device_attach() routine and will then cause a panic, due to the VNET not being set. MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 14:49:27 +00:00
Hans Petter Selasky	11546d068d	Add timeout handle to commands with callback in mlx5core. The current implementation does not handle timeout in case of command with callback request, and this can lead to deadlock if the command doesn't get firmware response. Add delayed callback timeout work before posting the command to firmware. In case of real firmware command completion we will cancel the delayed work. In case of firmware command timeout the callback timeout handler will be called and it will simulate firmware completion with timeout error. linux commit 65ee67084589c1783a74b4a4a5db38d7264ec8b5 MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 14:41:29 +00:00
Hans Petter Selasky	2327a753b9	Fix potential deadlock in command mode change in mlx5core. Call command completion handler in case of timeout when working in interrupts mode. Avoid flushing the commands workqueue after acquiring the semaphores to prevent a potential deadlock. linux commit 9cba4ebcf374c3772f6eb61f2d065294b2451b49 MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 14:35:28 +00:00
Hans Petter Selasky	ee1b5c5811	Use a macro in mlx5_command_str() instead of copying OP name. linux commit 42ca502e179d0654ef441333a9d0f35c948734f3 MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 14:29:30 +00:00
Hans Petter Selasky	2f2d3c0cf3	Disable unsupported disassociate ucontext functionality in mlx5ib(4). MFC after: 1 week Sponsored by: Mellanox Technologies	2018-03-07 14:03:31 +00:00

159 Commits