freebsd-dev

Author	SHA1	Message	Date
Alexander Motin	321f819ba5	Refactor command ordering/blocking mechanism in CTL. Replace long per-LUN queue of blocked commands, scanned on each command completion and sometimes even twice, causing up to O(n^^2) processing cost, by much shorter per-command blocked queues, scanned only when respective command completes, and check only commands before the previous blocker, reducing cost to O(n). While there, unblock aborted commands to make them "complete" ASAP to be removed from the OOA queue and so not waste time ordering other commands against them. Aborted commands that were not sent to execution yet should have no visible side effects, so this is safe and easy optimization now, comparing to commands already in processing, which are a still pain. Together those two optimizations should fix quite pathological case, when due to backend slowness CTL accumulated many thousands of blocked requests, partially aborted by initiator and so supposedly not even existing, but still wasting CTL CPU time. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2019-02-27 21:29:21 +00:00
Alexander Motin	e806165bee	Remove disabled CTL_LEGACY_STATS support. It was not only disabled for quite a while, but also appeared to be broken at r325517, when maximum number of ports was made configurable. MFC after: 1 week	2019-02-23 04:24:44 +00:00
Pedro F. Giffuni	bec9534d1d	sys/cam: further adoption of SPDX licensing ID tags. Mainly focus on files that use BSD 2-Clause license, however the tool I was using misidentified many licenses so this was mostly a manual - error prone - task. The Software Package Data Exchange (SPDX) group provides a specification to make it easier for automated tools to detect and summarize well known opensource licenses. We are gradually adopting the specification, noting that the tags are considered only advisory and do not, in any way, superceed or replace the license texts.	2017-11-27 15:12:43 +00:00
Emmanuel Vadot	530fdf6771	ctl: Make max_luns and max_ports tunable variables instead of hardcoded defines. Reviewed by: trasz (earlier version), bapt (earlier version), bcr (manpages) MFC after: 2 Weeks Sponsored by: Gandi.net Differential Revision: https://reviews.freebsd.org/D12836	2017-11-07 16:59:52 +00:00
Alexander Motin	9ff948d05f	Polish handling of different reset flavours. The biggest change is that ctl_remove_initiator() now generates I_T NEXUS LOSS event, cleaning part of LUs state related to the initiator. MFC after: 2 weeks	2017-02-27 14:59:00 +00:00
Alexander Motin	6563855634	Reenable CTL_WITH_CA, optimizing it for lower memory usage. This code was disabled due to its high memory usage. But now we need this functionality for cfumass(4) frontend, since USB MS BBB transport does not support autosense. MFC after: 2 weeks	2017-02-25 14:20:30 +00:00
Edward Tomasz Napierala	5ab1cf33f7	Bring the ctl headers a bit closer to style(9). No functional changes.	2017-01-25 22:52:57 +00:00
Alexander Motin	0c629e2884	Add initial support for CTL module unloading. It is only a first step and not perfect, but better then nothing. The main blocker is CAM target frontend, that can not be unloaded, since CAM does not have mechanism to unregister periph driver now. MFC after: 2 weeks	2017-01-21 19:38:26 +00:00
Alexander Motin	bb8f9017b3	Rewrite CTL statistics in more simple and scalable way. Instead of collecting statistics for each combination of ports and logical units, that consumed ~45KB per LU with present number of ports, collect separate statistics for every port and every logical unit separately, that consume only 176 bytes per each single LU/port. This reduces struct ctl_lun size down to just 6KB. Also new IOCTL API/ABI does not hardcode number of LUs/ports, and should allow handling of very large quantities. MFC after: 2 weeks (probably keeping old API enabled for some time)	2017-01-09 18:18:15 +00:00
Alexander Motin	0e2d6474ca	Allocate memory for prevent flags only for removable LUs. This array takes 64KB of RAM now, that was more then half of struct ctl_lun size. If at some point we support more ports, this may need another tune. MFC after: 2 weeks	2017-01-09 16:21:06 +00:00
Alexander Motin	4fc0d1d757	Improve length handling when writing sense data. - Allow maximal sense size limitation via Control Extension mode page. - When sense size limited, include descriptors atomically: whole or none. - Set new SDAT_OVFL bit if some descriptors don't fit the limit. - Report real written sense length instead of static maximal 252 bytes. MFC after: 2 weeks	2016-12-24 17:42:34 +00:00
Alexander Motin	33e7ba9cf9	Add support for SITUA bit in Logical Block Provisioning mode page. VMware tries to enable this bit to avoid multiple threshold notifications in case of multiple initiators connected to the same LUN. Unfortunately their code sends MODE SELECT(6) request with parameter length hardcoded for the page without any thresholds. Since we have four threshold and our page is bigger, this attempt fails, that is correct in my understanding. So all we can do about this now is to report proper error code and hope VMware fix their code one day. MFC after: 2 weeks	2016-12-21 15:17:47 +00:00
Alexander Motin	1188787581	Add set of macros to simplify code access to mode pages fields. MFC after: 2 weeks	2016-12-19 13:25:53 +00:00
Alexander Motin	d9ba4eefdc	Improve support for informational exceptions. While CTL still has no real events to report in this way (like SMART), it is possible to trigger false event by manually setting TEST bit in Informational Exceptions Control mode page, that can be useful for initiator testing. This code supports all flavours of IE reporting: UNIT ATTENTION, RECOVERED ERROR and NO SENSE sense keys, REQUEST SENSE command and Informational Exceptions log page. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2016-12-19 10:25:47 +00:00
Alexander Motin	6bd364b523	Modify target port groups logic in CTL. - Introduce "ha_shared" port option, which being set to "on" moves the port into separate port group, shared between HA nodes. This allows to better handle cases when iSCSI portals are bound to CARP address that can dynamically move between nodes. Some initiators (at least VMware) don't detect that after iSCSI reconnect they've attached to different SCSI port from different port group, that totally breakes ALUA status parsing. In theory, I believe, it should be enough to have different iSCSI portal group tags on different nodes to make initiators detect this condition, but it seems like VMware ignores those values, and even full LUN retaste forced by UA does not help. - Make CTL report up to three port groups: 1 -- non-HA mode or ports with "ha_shared" option set, 2 -- HA node 1, 3 -- HA node 2. - Report Transitioning state for all port groups when HA interlink is connected, but neither of nodes is primary for the LUN. MFC after: 2 weeks	2015-11-11 13:18:38 +00:00
Alexander Motin	f53270c858	Unify PR variable names to reduce confusion.	2015-10-01 12:15:36 +00:00
Alexander Motin	66b6967686	Really implement PREVENT ALLOW MEDIUM REMOVAL command.	2015-09-29 15:12:40 +00:00
Alexander Motin	d6e7f6e741	Add CD/DVD Capabilities and Mechanical Status Page. This page is obsolete since MMC-4, but still used by some software.	2015-09-29 09:09:37 +00:00
Alexander Motin	648dfc1a29	Umplement media load/eject support for removable devices. In case of block backend eject really closes the backing store, while load tries to open it back. Failed store open is reported as no media.	2015-09-28 20:54:18 +00:00
Alexander Motin	91be33dc78	Add to CTL initial support for CDROMs and removable devices. Relnotes: yes	2015-09-27 13:47:28 +00:00
Alexander Motin	2e33ae99cf	Allow LOG SENSE command on non-disk devices.	2015-09-26 13:51:29 +00:00
Alexander Motin	6bff2b5bff	Move ioctl frontend defines where they belong.	2015-09-26 11:56:28 +00:00
Alexander Motin	d3ab449cfa	Remove few more unused variables.	2015-09-26 11:39:54 +00:00
Alexander Motin	9c887a4f86	Remove some duplicate, legacy, dead and questionable code.	2015-09-26 11:28:45 +00:00
Alexander Motin	c53993057b	Add support for Control extension mode page.	2015-09-22 14:55:46 +00:00
Alexander Motin	efbf6139a4	Split two command flags with different meaning. This is only a cosmetical change.	2015-09-19 19:11:59 +00:00
Alexander Motin	6213882769	When reporting TPT UA, report which of thresholds was reached.	2015-09-17 17:00:36 +00:00
Alexander Motin	7ac58230ea	Reimplement CTL High Availability. CTL HA functionality was originally implemented by Copan many years ago, but large part of the sources was never published. This change includes clean room implementation of the missing code and fixes for many bugs. This code supports dual-node HA with ALUA in four modes: - Active/Unavailable without interlink between nodes; - Active/Standby with second node handling only basic LUN discovery and reservation, synchronizing with the first node through the interlink; - Active/Active with both nodes processing commands and accessing the backing storage, synchronizing with the first node through the interlink; - Active/Active with second node working as proxy, transfering all commands to the first node for execution through the interlink. Unlike original Copan's implementation, depending on specific hardware, this code uses simple custom TCP-based protocol for interlink. It has no authentication, so it should never be enabled on public interfaces. The code may still need some polishing, but generally it is functional. Relnotes: yes Sponsored by: iXsystems, Inc.	2015-09-10 12:40:31 +00:00
Alexander Motin	0bcd4ab6ba	Move setting of media parameters inside open routines. This is preparation for possibility to open/close media several times per LUN life cycle. While there, rename variables to reduce confusion. As additional bonus this allows to open read-only media, such as ZFS snapshots.	2015-09-06 09:54:56 +00:00
Alexander Motin	67ceb24bca	Move "ioctl" CAM frontend into separate file. It has nothing to share with too huge ctl.c other then device descriptor, but even that may be counted as design error that may be fixed later. At some point we may even want to have several ioctl ports.	2015-08-15 15:42:21 +00:00
Alexander Motin	2f444d157b	Drop "internal" CTL frontend. Its idea was to be a simple initiator and execute several commands from kernel level, but FreeBSD never had consumer for that functionality, while its implementation polluted many unrelated places..	2015-08-15 13:34:38 +00:00
Alexander Motin	f2a20b166a	Relax serialization of SYNCHRONIZE CACHE commands. Before this change SYNCHRONIZE CACHE commands were executed exclusively, as if they had ORDERED tag. But looking through SCSI specs I've found no any reason to be so strict. For reads this ordering seems pointless. For writes it looks less obvious, so I left ordering against preceeding write commands, while following ones are no longer required to wait. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2015-08-05 21:58:32 +00:00
Alexander Motin	7834ea8891	Bring per-port LUN enable/disable code up to date: - remove last remnants of never implemented multiple targets support; - implement missing support for LUN mapping in this area. Due to existing locking constraints LUN mapping code is practically unlocked at this point. Hopefully it is not racy enough to live until somebody get idea how to call sleeping fronend methods under lock also taken by the same frontend in non-sleepable context. :(	2015-06-20 12:43:54 +00:00
Alexander Motin	2d8b28765c	Introduce separate lock for tokens to reduce ctl_lock scope.	2015-06-20 11:20:25 +00:00
Alexander Motin	2c8cab2a4e	Add support for General Statistics and Performance log page. CTL already collects most of statistics reported there, so why not. MFC after: 2 weeks	2015-02-11 16:10:31 +00:00
Alexander Motin	920c6cbadc	CTL LUN mapping rewrite. Replace iSCSI-specific LUN mapping mechanism with new one, working for any ports. By default all ports are created without LUN mapping, exposing all CTL LUNs as before. But, if needed, LUN mapping can be manually set on per-port basis via ctladm. For its iSCSI ports ctld does it via ioctl(2). The next step will be to teach ctld to work with FibreChannel ports also. Respecting additional flexibility of the new mechanism, ctl.conf now allows alternative syntax for LUN definition. LUNs can now be defined in global context, and then referenced from targets by unique name, as needed. It allows same LUN to be exposed several times via multiple targets. While there, increase limit for LUNs per target in ctld from 256 to 1024. Some initiators do not support LUNs above 255, but that is not our problem. Discussed with: trasz MFC after: 2 weeks Relnotes: yes Sponsored by: iXsystems, Inc.	2015-02-01 21:50:28 +00:00
Alexander Motin	bfbfc4a3cb	Count consecutive read requests as blocking in CTL for files and ZVOLs. Technically read requests can be executed in any order or simultaneously since they are not changing any data. But ZFS prefetcher goes crasy when it receives consecutive requests from different threads. Since prefetcher works on level of separate blocks, instead of two consecutive 128K requests it may receive 32 8K requests in mixed order. This patch is more workaround then a real fix, and it does not fix all of prefetcher problems, but it improves sequential read speed by 3-4x times in some configurations. On the other side it may hurt performance if some backing store has no prefetch, that is why it is disabled by default for raw devices. MFC after: 2 weeks	2014-12-06 20:39:25 +00:00
Alexander Motin	ef8daf3fed	Add GET LBA STATUS command support to CTL. It is implemented for LUNs backed by ZVOLs in "dev" mode and files. GEOM has no such API, so for LUNs backed by raw devices all LBAs will be reported as mapped/unknown. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2014-12-04 11:34:19 +00:00
Alexander Motin	9e52565344	Do not pre-allocate UNIT ATTENTIONs storage for every possible initiator. Abusing ability of major UAs cover minor ones we may not account UAs for inactive ports. Allocate UAs storage for port and start accounting only after some initiator from that port fetched its first POWER ON OCCURRED. This reduces per-LUN CTL memory usage from >1MB to less then 100K. MFC after: 1 month	2014-12-03 15:16:18 +00:00
Alexander Motin	411598df7a	Do not pre-allocate reservation keys memory for every possible initiator. In configurations with many ports, like iSCSI, each LUN is typically accessed only by limited subset of ports. Allocating that memory on demand allows to reduce CTL memory usage from 5.3MB/LUN to 1.3MB/LUN. MFC after: 1 month	2014-12-03 09:05:53 +00:00
Alexander Motin	77a06f9db0	Convert persis_offset from global variable to softc field.	2014-12-02 12:38:22 +00:00
Alexander Motin	1251a76b12	Replace home-grown CTL IO allocator with UMA. Old allocator created significant lock congestion protecting its lists of preallocated I/Os, while UMA provides much better SMP scalability. The downside of UMA is lack of reliable preallocation, that could guarantee successful allocation in non-sleepable environments. But careful code review shown, that only CAM target frontend really has that requirement. Fix that making that frontend preallocate and statically bind CTL I/O for every ATIO/INOT it preallocates any way. That allows to avoid allocations in hot I/O path. Other frontends either may sleep in allocation context or can properly handle allocation errors. On 40-core server with 6 ZVOL-backed LUNs and 7 iSCSI client connections this change increases peak performance from ~700K to >1M IOPS! Yay! :) MFC after: 1 month Sponsored by: iXsystems, Inc.	2014-11-24 11:37:27 +00:00
Alexander Motin	23b30f5600	Partially reconstruct Active/Standby clusting. In this mode one head is in Active state, supporting all commands, while another is in Standby state, supporting only minimal LUN discovery subset. It is still incomplete since Standby state requires reservation support, which is impossible to do right without having interlink between heads. But it allows to run some basic experiments.	2014-11-21 06:27:37 +00:00
Alexander Motin	3275003e03	Synchronize medium rotation rate in legacy Rigid Disk Drive Geometry mode page with modern Block Device Characteristics VPD page. MFC after: 1 week	2014-11-07 00:10:07 +00:00
Alexander Motin	c3e7ba3e6d	Add to CTL support for logical block provisioning threshold notifications. For ZVOL-backed LUNs this allows to inform initiators if storage's used or available spaces get above/below the configured thresholds. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2014-11-06 00:48:36 +00:00
Alexander Motin	115dc0c762	Reduce code duplication around Write Exclusive persistent reservation. While there, allow some more commands to pass persistent reservation. MFC after: 1 week	2014-10-27 09:26:24 +00:00
Alexander Motin	b491e75b4b	Allocate buffer for READ BUFFER/WRITE BUFFER commands on demand. These commands are rare, but consume additional 256KB RAM per LUN. MFC after: 1 week	2014-10-26 23:25:42 +00:00
Alexander Motin	218d5d4be2	Implement more functional CTL debug logging. Setting bits in kern.cam.ctl.debug allows to log errors, commands and some commands data respectively. MFC after: 1 week	2014-10-16 08:42:17 +00:00
Alexander Motin	9a0190c9a1	Remove couple Copan's vendor-specific mode pages. Those pages are highly system-/hardware-specific, the code is incomplete, and so they hardly can be useful for anybody else.	2014-10-14 11:28:25 +00:00
Alexander Motin	523f047ea2	Some groundwork for later Informational Exceptions support. This includes support for: - Read-Write Error Recovery mode page; - Informational Exceptions Control mode page; - Logical Block Provisioning mode page; - LOG SENSE command. No real Informational Exceptions features yet. This is only a placeholder. Sponsored by: iXsystems, Inc.	2014-10-14 10:14:14 +00:00

1 2

79 Commits