freebsd-dev

Author	SHA1	Message	Date
Warner Losh	214909d669	Revert "cam: fix up world compilation after previous" This reverts commit `1d35493e46`. It was the wrong fix. `757fc6666b` has the proper fix to include stdbool for userland. Sponsored by: Netflix	2023-04-15 18:25:55 -06:00
Warner Losh	757fc6666b	cam: Include stdbool.h for userland Sponsored by: Netflix	2023-04-15 18:25:22 -06:00
Mateusz Guzik	1d35493e46	cam: fix up world compilation after previous Sponsored by: Rubicon Communications, LLC ("Netgate")	2023-04-15 23:11:27 +00:00
Warner Losh	fd02926a68	cam: Properly mask out the status bits to get completion code ccb_h.status has two parts: the actual status and some addition bits to indicate additional information. It must be masked before comparing against completion codes. Add new inline function cam_ccb_success to simplify this to test whether or not the request succeeded. Most of the code already does this, but a few places don't (the rest likely should be converted to use cam_ccb_status and/or cam_ccb_success, but that's for another day). This caused at least one bug in recognizing devices behind a SATA port multiplexer, though some of these checks were fine with the special knowledge of the code paths involved. PR: 270459 Sponsored by: Netflix MFC After: 1 week (and maybe a EN requst) Reviewed by: ken, mav Differential Revision: https://reviews.freebsd.org/D39572	2023-04-15 16:32:41 -06:00
Zhenlei Huang	69cb72b872	cam iosched: Use the existing CTLFLAG_RDTUN and CTLFLAG_RWTUN flag definitions Use them when possible, instead of separated flags. No functional change intended. Reviewed by: hselasky, erj MFC after: 3 days Differential Revision: https://reviews.freebsd.org/D39466	2023-04-12 12:20:38 +08:00
Yuri Pankov	6aa5b10d0c	nvme: fix resv commands with nda device - passing I/O commands through nda requires nsid field to be set (it was unused when going through nvme_ns_ioctl()) - ccb's status can be OR'ed with the flags, use CAM_STATUS_MASK Reviewed by: imp (cam) Differential Revision: https://reviews.freebsd.org/D37696	2023-03-27 14:53:24 +02:00
Alexander Motin	7467a69536	CTL: Allow userland supply tags via ioctl frontend. Before this ioctl frontend always replaced tags with sequential ones. It was done for ctladm, that can not keep track of global tag list. But in case of virtio-scsi in bhyve we can pass provided tags as-is. It should be on virtio-scsi initiator to provide us valid tags. It should allow proper task management, error reporting, etc. In case of several virtio-scsi devices, they should use different CTL ports or initiator IDs to avoid conflicts, but this is expected by design. PR: 267539	2022-12-03 12:05:05 -05:00
Alexander Motin	0acc026dda	CTL: Increase maximum SCSI tag size from 32 to 64 bits. SAM-5 specification states maximum size of command identifier (tag), defined by specific transports, should not be larger than 64 bits. While most of supported transports use 32 bits or less, it was reported that virtio-scsi uses 64 bits. Truncation to 32 bits in bhyve code caused false tag conflict errors reported and possibly other issues. This changes CTL ABI and HA protocol, so CTL_HA_VERSION is bumped. While we make HA protocol incompatible, increase default maximum number of ports in CTL from 256 to 1024, matching number of LUNs. There are many reports from people who need many iSCSI targets with only one LUN each. Increased memory consumption should be less of a problem these days. PR: 267539	2022-12-03 10:23:29 -05:00
Warner Losh	891c69864e	cam: Use FreeBSD standard copyright For CAM, move to the FreeBSD standard copyright rather than the 'put it at the front' variation. This variaiton has been flagged as potentially problematic in other contexts. Since this variation wasn't a conscious decision on our part, use the standard license from src/COPYRIGHT. Also, remove the -FreeBSD suffix in SPDX-License-Identifier. It's obsolete at SPDX and even the original text didn't match it. MFC After: 3 days Sponsored by: Netflix	2022-10-07 23:37:46 -06:00
Mark Johnston	0cd631ee06	cam: Provide compatibility for CAMGETPASSTHRU for periph drivers The CAM version bump 0x19 -> 0x1a changed the CAMGETPASSTHRU definition, so applications using the old ioctl are broken. However, that version change did not affect anything relating to the ioctl implementation for periphs. Fixes: `8f9be1eed1` ("cam(4): Improve XPT_DEV_MATCH") PR: 264709 Tested by: andreas.mahling@googlemail.com Reviewed by: imp MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D36389	2022-09-29 13:14:57 -04:00
Alexander Motin	0586be48a9	CTL: Validate IOCTL parameters. It was possible to cause kernel panic by passing too large args_len or non-NULL result_nvl. Though since the /dev/cam/ctl device is accessible only by root and used only by limited number of tools it was not a big problem. PR: 266115 PR: 266136 Reported by: Robert Morris <rtm@lcs.mit.edu> MFC after: 1 week	2022-09-06 21:58:27 -04:00
Alexander Motin	90bcc81bc3	Delay GEOM disk_create() until CAM periph probe completes. Before this patch CAM periph drivers called both disk_alloc() and disk_create() same time on periph creation. But then prevented disks from opening until the periph probe completion with cam_periph_hold(). As result, especially if disk misbehaves during the probe, GEOM event thread, triggered to taste the disk, got blocked on open attempt, potentially for a long time, unable to process other events. This patch moves disk_create() call from periph creation to the end of the probe. To allow disk_create() calls from non-sleepable CAM contexts some of its duties requiring memory allocations are moved either back to disk_alloc() or forward to g_disk_create(), so now disk_alloc() and disk_add_alias() are the only disk methods that require sleeping. If disk fails during the probe disk_create() may just be skipped, going directly to disk_destroy(). Other method calls during that time are just ignored. Since GEOM may now see the disks after CAM bus scan is already completed, introduce per-periph boot hold functions. Enclosure driver already had such mechanism, so just generalize it. Reviewed by: imp MFC after: 1 month Sponsored by: iXsystems, Inc. Differential Revision: https://reviews.freebsd.org/D35784	2022-07-14 16:17:36 -04:00
Alan Somers	5f438dd3ac	ses: don't panic if disk elements have really weird descriptors SES allows element descriptors to contain characters like spaces and quotes that devfs does not allow to appear in device aliases. Since SES element descriptors are outside of the kernel's control, we should gracefully handle a failure to create a device physical path alias. PR: 264513 Reported by: Yuri <yuri@aetern.org> Reviewed by: imp, mav Sponsored by: Axcient MFC after: 2 weeks	2022-06-23 11:19:20 -06:00
Alexander Motin	3b0e3e8d2a	CTL: Fix double command completions on HA failover. I've found couple cases when CTL_FLAG_SENT_2OTHER_SC flags were not cleared on commands return from active node or the send failure. It created races when ctl_failover_lun() call before ctl_process_done() could cause second ctl_done() and ctl_process_done() calls, causing all sorts of problems. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2022-06-14 00:24:39 -04:00
Dmitry Chagin	31d1b816fe	sysent: Get rid of bogus sys/sysent.h include. Where appropriate hide sysent.h under proper condition. MFC after: 2 weeks	2022-05-28 20:52:17 +03:00
Mitchell Horne	489ba22236	kerneldump: remove physical argument from d_dumper The physical address argument is essentially ignored by every dumper method. In addition, the dump routines don't actually pass a real address; every call to dump_append() passes a value of zero for physical. Reviewed by: markj MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D35173	2022-05-13 10:42:48 -03:00
Alexander Motin	356155fe02	Fix `bd82711aff` serial number trailing spaces removal. For devices returning 16 byte serial numbers with 8 leading spaces this falsely removed everything due to looking at wrong offset.	2022-05-09 10:30:04 -04:00
Warner Losh	a85fea31c5	iosched: remove stray debug This printf was designed to catch misqueued bio requests. Prior to supporting read_bias == 0, we couldn't get anything but reads and writes in this queue. However, for read_bias == 0 we queue everything except BIO_DELETE to this queue, so remove the printf. We don't need to update any statistics. Sponsored by: Netflix	2022-05-04 20:28:00 -06:00
Warner Losh	1907e1c07c	ada: Move comment Move the comment about releasing ccb before periph to adaprobedone() where it belongs. Sponsored by: Netflix	2022-05-04 16:54:38 -06:00
Warner Losh	6f78dae849	cam: Remove redunant static __inline forward decls Sponsored by: Netflix	2022-05-02 09:30:07 -06:00
Warner Losh	1599fc904d	iosched: Move bio_next() inside of the CAM_IOSCHED_DYNAMIC ifdef bio_next() is only used by the dynamic scheduler, so move it under that ifdef. Sponsored by: Netflix	2022-05-01 16:54:15 -06:00
Warner Losh	d095d6a34c	cam_xpt: Prefer bool to int where it's a boolean In the places where we set an integer to 0 or 1 and then use it like a boolean, replace int with bool and 0/1 with false/true. Left alone places where this is a function argument or return value. No functional changes intended. Sponsored by: Netflix	2022-05-01 12:09:42 -06:00
Warner Losh	d592c0db8b	cam: add hw.cam.iosched.read_bias Allow a global setting for the read_bias for the dynamic io scheduler. This allows global policy to be set, in addition to the existing per-drive policy. kern.cam.iosched.read_bias is a new tunable. Sponsored by: Netflix Reviewed by: chs Differential Revision: https://reviews.freebsd.org/D34365	2022-05-01 11:27:34 -06:00
Warner Losh	b65803ba57	cam iosched: default to no read bias in dynamic ioscheduling When we're doing dynamic I/O scheduling, don't default to a read bias of 100. Default it to 0 so turning on dynamic scheduling only does scheduling tweaks that are requested. The other limiters are off by default, and need no further adjustment. Sponsored by: Netflix	2022-05-01 11:27:34 -06:00
Warner Losh	cc1572ddeb	cam iosched: Remove write bias when read bias = 0 Change the meaning of read bias == 0 in the dynamic I/O scheduler. Prior to this change, a read bias of 0 would mean prefer writes. Now, when read bias is 0, we queue all requests to the same queue removing the bias. When it's non-zero, we still separate the queues we use so we can bias reads vs writes for workloads that are read centric. These changes restore the typical bias you get from disksort or ordered insertion at the end of the list. Sponsored by: Netflix	2022-05-01 11:27:34 -06:00
Warner Losh	6c8ab086fe	ada: Retry commands with retries left on CAM_SEL_TIMEOUT The AHCI and ATA SIMs will return CAM_SEL_TIMEOUT when an underlying device has stopped responding. This is usually seen after a timeouted out command and can be a transient event. Rather than fail the peripheral immediately after seeing this, queue a retry. For transient events, this allows drives to continue to provide data, though with some added latency, just like we do when we have some other kind of retriable error. If the error isn't transient (the drive is truly gone), then we'll discover that eventually and fail the transaction and invalidate the drive like we do today. This helps us avoid a panic at the end of camperiphfree when CAM_PERIPH_NEW_DEV_FOUND is set. However, the deferred callback should be queued to xpt_async_td instead of being made inline there. This issue will be solved in a different patch that does that. PR 263703. This also helps us avoid another bug where we can drop all references to the device (causing us to go through camperiphfree and destroy the path) while we have an I/O pending in the ata_da state machine (usually in state ADA_STATE_RAHEAD with ATA_SETFEATURES ATA_SF_ENAB_RCACHE command). It's not clear why the reference that we take out to do the reprobe isn't effective at blocking this. By retrying this condition, though we avoid this bug (at least more often, I don't have a good reproduction test case, I just see this panic a few times a month at work on systems that have transient disk errors on ahci connected SATA SSDs). PR 263704. It's too soon to know how much this helps us avoid this bug. Sponsored by: Netflix Differential Revision: https://reviews.freebsd.org/D34977	2022-05-01 11:08:56 -06:00
Warner Losh	9fb40baf60	cam_periph: Return ENXIO when peripheral is invalidated When the peripheral is invalidated, no further I/O is possible. Signal this up the stack with ENXIO now that upper layers of the stack differentiate sometimes. In order for there to be further I/O, and new open is required for any block device that a future periph might instantiate for devices at this location that might return or otherwise become available. The I/O scheduler flushes its I/O with the ENXIO error for pending I/O that didn't make it to the device, so this makes the two paths match. MFC After: 3 days Sponsored by: Netflix Reviewed by: chs, mav Differential Revision: https://reviews.freebsd.org/D35093	2022-04-28 16:30:00 -06:00
Alexander Motin	404f001161	CAM: Keep periph_links when restoring CCB in camperiphdone(). While recovery command executed, some other commands from the periph may complete, that may affect periph_links of this CCB. So restoring original CCB we must keep current periph_links as more up to date. I've found this triggering assertions with debug kernel and suspect some memory corruptions otherwise when spun down disk receives two or sometimes more concurrent requests. MFC after: 1 week Sponsored by: iXsystems, Inc.	2022-04-27 21:39:50 -04:00
Warner Losh	e4b1ae2147	ndaasync: sync to SCSI's daasyncs cam_periph_async() calls Sponsored by: Netflix Reviewed by: mav Differential Revision: https://reviews.freebsd.org/D35059	2022-04-26 11:01:39 -06:00
Warner Losh	ae1955cd67	adaasync: Harmonize with daasync We should call cam_periph_async() always, like SCSI does. This routine is supposed to be more of a catch-all. cam_periph_async() only does actions for AC_LOST_DEVICE. It ignores all other events (today), but this may not always be true. So this is a nop change. Drop in a 'break' so we don't fall through unnecessarily. Sponsored by: Netflix Reviewed by: mav Differential Revision: https://reviews.freebsd.org/D35057	2022-04-26 11:01:39 -06:00
Warner Losh	ccaec73d0b	ada: Eliminate dead code We never use the cgd that we get from the XPT_GDEV_TYPE call. Prior to `9a6844d55f` we used it to determine if READ AHEAD or WRITE CACHING was supported. However, all that information was moved into adasetflags so we no longer need to this since it's cached in the softc and updated with the IDENTIFY data changes automatically. Sponsored by: Netflix Reviewed by: mav Differential Revision: https://reviews.freebsd.org/D35039	2022-04-25 12:55:04 -06:00
Warner Losh	9d899bbcb7	cam: Small reorg of ata xpt async code Use a switch rather than a nested if to simplify the async event processing code. No functional changes intended. Sponsored by: Netflix Reviewed by: mav Differential Revision: https://reviews.freebsd.org/D35038	2022-04-25 12:55:04 -06:00
Warner Losh	b43cfe7171	ada/da: Borrow comment from nda about cleanup Remove a XXX comment and replace it with a more accurate comment about what happens to I/O queued to the hardware. Sponsored by: Netflix	2022-04-24 15:11:56 -06:00
Warner Losh	48ae3f4f64	ata/nvme: Add comment Steal the comment from daonninvalidate about the call to disk_gone(). Sponsored by: Netflix	2022-04-24 15:11:52 -06:00
Warner Losh	c08ceddbf7	nda: Fix comment Fix a comment that was left over from the orignial implementation. Explain how pending transactions in hardware are completed/aborted in the SIM prior to ndacleanup being called. Sponsored by: Netflix	2022-04-23 20:31:46 -06:00
Alexander Motin	38f8addaab	CAM: Replicate `e0ceec676d` from da to ada and nda. MFC after: 1 week	2022-04-23 20:15:17 -04:00
John Baldwin	7b02c1e8c6	iscsi: Fetch limits based on a socket rather than assuming global limits. cxgbei needs the ability to return different limits based on the connection (e.g. if the connection is over a T5 adapter or a T6 adapter as well as factoring in the MTU). This change plumbs through the changes in the ioctls without changing any of the backends. The limits callback passed to icl_register now accepts a second socket argument which holds the integer file descriptor. To support ABI compatiblity for old binaries, the callback should return "global" values if the socket fd is zero. The CTL_ISCSI_LIMITS argument used with CTL_ISCSI by ctld(8) now accepts the socket fd in a field that was previously part of a reserved spare field. Old binaries zero this request which results in passing a socket fd of 0 to the limits callback. The ISCSIDREQUEST ioctl no longer returns limits. Instead, iscsid(8) invokes a new ISCSIDLIMITS ioctl after establishing the connection via connect(2). For ABI compat, if the old ISCSIDREQUEST is invoked, the global limits are still fetched (with a socket fd of 0) and returned. Reviewed by: mav Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D34928	2022-04-18 12:53:28 -07:00
Gordon Bergling	456b2bb97e	cam(4): Remove a double word in a source code comment - s/this this/this/ MFC after: 3 days	2022-04-09 10:13:59 +02:00
Gordon Bergling	49dace1d46	cam: Fix typos in source code comments - s/paniced/panicked/ MFC after: 3 days	2022-04-02 10:13:35 +02:00
Mateusz Guzik	bb92cd7bcd	vfs: NDFREE(&nd, NDF_ONLY_PNBUF) -> NDFREE_PNBUF(&nd)	2022-03-24 10:20:51 +00:00
Alexander Motin	f00ced06da	CTL: Rework `05c3e8e871` using %zu format. MFC after: 2 days	2022-02-25 11:53:53 -05:00
Alexander Motin	05c3e8e871	Fix 32-bit build after `530d274c15`. MFC after: 3 days	2022-02-24 18:11:36 -05:00
Alexander Motin	530d274c15	CTL: Add length validation for incoming HA messages. This should fix uninitialized memory reads when working with broken HA peer, like one fixed in `1a8d8a3a90`. Instead print error message and kill the HA link. MFC after: 3 days Sponsored by: iXsystems, Inc.	2022-02-24 16:24:43 -05:00
Warner Losh	e3d92d4cb8	cam iosched: Update comment for when we schedule writes. Sponsored by: Netflix	2022-02-23 16:21:27 -07:00
John Baldwin	bd6e8729d6	ctl ramdisk: Free compare buffer after a compare I/O request. For a compare request, the ramdisk backend allocates a temporary buffer to hold the I/O data and then compares it against the LUN's pages in ctl_backend_ramdisk_cmp after the data has been filled. However, the tempory buffer was leaked when after the comparison was complete. Fix this by freeing the buffer after the comparison. Reviewed by: mav Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D34316	2022-02-18 15:20:14 -08:00
Kenneth D. Merry	3090d5045a	Fix non-printable characters in NVMe model and serial numbers. The NVMe 1.4 spec simply says that Model and Serial numbers are ASCII strings. Unlike SCSI, it doesn't prohibit non-printable characters or say that the strings should be padded with spaces. Since 2014, we have had cam_strvis_sbuf(), which gives additional options for handling non-ASCII characters. That behavior hasn't been available for non-sbuf consumers, so users of cam_strvis() were left with having octal ASCII codes inserted. So, to avoid having garbage or octal chracters in the strings, use cam_strvis_sbuf() to create a new function, cam_strvis_flag(), and re-implement cam_strvis() using cam_strvis_flag(). Now, for the NVMe drives, we can use cam_strvis_flag with the CAM_STRVIS_FLAG_NONASCII_SPC flag. This transforms non-printable characters into spaces. sys/cam/cam.c: Add a new function, cam_strvis_flag(), that creates an sbuf on the stack with the user's destination buffer, and calls cam_strvis_sbuf() with the given flag argument. Re-implement cam_strvis() to call cam_strvis_flag with the CAM_STRVIS_FLAG_NONASCII_ESC argument. This should be the equivalent of the old cam_strvis() function, except for the overhead of creating the sbuf and calling sbuf_putc/printf. sys/cam/cam.h: Declaration for cam_strvis_flag. sys/cam/nvme/nvme_all.c: In nvme_print_ident, use the NONASCII_SPC flag with cam_strvis_flag(). sys/cam/nvme/nvme_da.c: In ndaregister(), use cam_strvis_flag() with the NONASCII_SPC flag for the disk description and serial number we report to GEOM. sys/cam/nvme/nvme_xpt.c: In nvme_probe_done(), use cam_strvis_flag with the NONASCII_SPC flag when storing the drive serial number in the CAM EDT. MFC after: 1 week Sponsored by: Spectra Logic Differential Revision: https://reviews.freebsd.org/D33973	2022-02-09 17:09:25 -05:00
John Baldwin	a3d71fffa7	cfiscsi_done: Free the dummy PDU earlier. The dummy PDU needs to be freed before marking task abortion complete as otherwise cfiscsi_session_terminate_tasks can return and destroy the session in another thread before the PDU is freed. Fixes: `2e8d1a5525` iscsi: Allocate a dummy PDU for the internal nexus reset task. Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D34176	2022-02-07 12:55:08 -08:00
Gordon Bergling	f32dd4d58a	cam(4): Fix a few typos in source code comments - s/trafer/transfer/ - s/failes/fails/ Obtained from: NetBSD MFC after: 3 days	2022-02-06 13:45:47 +01:00
Alexander Motin	1a8d8a3a90	CTL: Fix mode page trucation on HA synchronization. Due to variable size of struct ctl_ha_msg_mode ctl_isc_announce_mode() sent only first 4 bytes of modified mode page to the other HA side, that caused its corruption there, noticeable only after failover. I've found alike bug also in ctl_isc_announce_lun(), but there it was sending slightly more than needed, that is a smaller problem. MFC after: 1 week Sponsored by: iXsystems, Inc.	2022-02-03 11:10:12 -05:00
John Baldwin	2e8d1a5525	iscsi: Allocate a dummy PDU for the internal nexus reset task. When an iSCSI target session is terminated, an internal nexus reset task is posted to abort existing tasks belonging to the session. Previously, the ctl_io for this internal nexus reset stored a pointer to the session in the slot that normally holds a pointer to the PDU from the initiator that triggered the I/O request. The completion handler then assumed that any nexus reset I/O was due to an internal request and fetched the session pointer (instead of the PDU pointer) from the ctl_io. However, it is possible to trigger a nexus reset via an on-the-wire task management PDU. If such a PDU were sent to the target, then the completion handler would incorrectly treat this request as an internal request and treat the pointer to the received PDU as a pointer to the session instead. To fix, allocate a dummy PDU for the internal reset task and use an invalid opcode to differentiate internal nexus resets from resets requested by the initiator. PR: 260449 Reported by: Robert Morris <rtm@lcs.mit.edu> Reviewed by: mav Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D34055	2022-01-28 13:07:04 -08:00

1 2 3 4 5 ...

2606 Commits