freebsd-nq

Author	SHA1	Message	Date
Alexander Motin	117f1bc17f	Fix wrong LUN reference in XCOPY block-to-block operation. This could cause data corruption due to accessing wrong LUN in case of retries on write errors. Failed writes were retried to read LUN. MFC after: 3 days	2015-01-24 15:40:52 +00:00
Alexander Motin	379694e3d3	Don't count requests with status sent as overlapping. While those requests are still in target OOA queue, for initiator they are already completed, so tags can be reused. MFC after: 1 week	2015-01-22 14:49:36 +00:00
Will Andrews	636870ff01	Restore the CAM XPT peripheral generation counter, and export it via sysctl. Define it as an atomic uint32_t. These increments happen infrequently enough for the atomic overhead to be a problem, and since they're now independent atomics, they won't contend with xpt_lock_buses(). This counter is useful as a means of cheaply identifying whether any changes have been made to the CAM peripheral list. Userland programs have no guarantee that the counter won't change on them while being returned or while processing the information, so they must be written accordingly. Discussed with: ken, mav (in general) MFC after: 1 week Sponsored by: Spectra Logic	2015-01-20 21:15:33 +00:00
Alexander Motin	438e86ce0c	Remove extra mtx_unlock(). Submitted by: Dmitry Luhtionov <dmitryluhtionov@gmail.com> MFC after: 1 week	2015-01-19 15:52:32 +00:00
Hans Petter Selasky	79592d52d5	Minor refactoring of code block. MFC after: 1 day	2015-01-19 07:29:07 +00:00
Alexander Motin	1b922b707e	Don't count status as sent until CTIO completes successfully. If we aggregated status sending with data move and got error, allow status to be updated and resent again separately. Without this command may stuck without status sent at all. MFC after: 2 weeks	2015-01-16 12:35:55 +00:00
Warner Losh	0ac665747d	Explain a bit of tricky code dealing with trims and how it prevents starvation. These side effects aren't obvious without extremely careful study, and are important to do just so.	2015-01-13 00:20:35 +00:00
Kenneth D. Merry	a1736be349	Improve camcontrol(8) handling of drive defect data. This includes a new summary mode (-s) for camcontrol defects that quickly tells the user the most important thing: how many defects are in the requested list. The actual location of the defects is less important. Modern drives frequently have more than the 8191 defects that can be reported by the READ DEFECT DATA (10) command. If they don't have that many grown defects, they certainly have more than 8191 defects in the primary (i.e. factory) defect list. The READ DEFECT DATA (12) command allows for longer parameter lists, as well as indexing into the list of defects, and so allows reporting many more defects. This has been tested with HGST drives and Seagate drives, but does not fully work with Seagate drives. Once I have a Seagate spec I may be able to determine whether it is possible to make it work with Seagate drives. scsi_da.h: Add a definition for the new long block defect format. Add bit and mask definitions for the new extended physical sector and bytes from index defect formats. Add a prototype for the new scsi_read_defects() CDB building function. scsi_da.c: Add a new scsi_read_defects() CDB building function. camcontrol(8) was previously composing CDBs manually. This is long overdue. camcontrol.c: Revamp the camcontrol defects subcommand. We now go through multiple stages in trying to get defect data off the drive while avoiding various drive firmware quirks. We start off by requesting the defect header with the 10 byte command. If we're in summary mode (-s) and the drive reports fewer defects than can be represented in the 10 byte header, we're done. Otherwise, we know that we need to issue the 12 byte command if the drive reports the maximum number of defects. If we're in summary mode, we're done if we get a good response back when asking for the 12 byte header. If the user has asked for the full list, then we use the address descriptor index field in the 12 byte CDB to step through the list in 64K chunks. 64K is small enough to work with most any ancient or modern SCSI controller. Add support for printing the new long block defect format, as well as the extended physical sector and bytes from index formats. I don't have any drives that support the new formats. Add a hexadecimal output format that can be turned on with -X. Add a quiet mode (-q) that can be turned on with the summary mode (-s) to just print out a number. Revamp the error detection and recovery code for the defects command to work with HGST drives. Call the new scsi_read_defects() CDB building function instead of rolling the CDB ourselves. Pay attention to the residual from the defect list request when printing it out, so we don't run off the end of the list. Use the new scsi_nv library routines to convert from strings to numbers and back. camcontrol.8: Document the new defect formats (longblock, extbfi, extphys) and command line options (-q, -s, -S and -X) for the defects subcommand. Explain a little more about what drives generally do and don't support. Sponsored by: Spectra Logic MFC after: 1 week	2015-01-08 16:58:40 +00:00
Kenneth D. Merry	9fb7b3949c	Fix a bug in the CAM SCSI probe code that caused changes in inquiry data to go undetected. The probe code does an MD5 checksum of the inquiry data (and page 0x80 serial number if available) before doing a reprobe of an existing device, and then compares a checksum after the probe to see whether the device has changed. This check was broken in January, 2000 by change 56146 when the extended inquiry probe code was added. In the extended inquiry probe case, it was calculating the checksum a second time. The second time it included the updated inquiry data from the short inquiry probe (first 36 bytes). So it wouldn't catch cases where the vendor, product, revision, etc. changed. This change will have the effect that when a device's inquiry data is updated and a rescan is issued, it will disappear and then reappear. This is the appropriate action, because if the inquiry data or serial number changes, it is either a different device or the device configuration may have changed significantly. (e.g. with updated firmware.) scsi_xpt.c: Don't calculate the initial MD5 checksum on standard inquiry data and the page 0x80 serial number if we have already calculated it. MFC after: 1 week Sponsored by: Spectra Logic	2015-01-08 16:27:56 +00:00
Hans Petter Selasky	68f71fc180	Allow a block size of zero to mean 512 bytes, which is the most common block size for USB disks. This fixes support for "Action Cam SJ4000". Reviewed by: mav @ MFC after: 1 week	2015-01-08 15:10:25 +00:00
Alexander Motin	7a19455d22	Hide block device VPD pages for non-block devices. MFC after: 2 weeks	2014-12-23 15:18:28 +00:00
Alexander Motin	1f69310e70	Fix potential division by zero after r275920. Reported by: Coverity Scan CID: 1260387 MFC after: 3 days	2014-12-23 10:18:42 +00:00
Alexander Motin	83c25ce791	Report initiator id in portlist XML in more formalized way. MFC after: 3 days	2014-12-20 16:39:56 +00:00
Alexander Motin	e7038eb747	Replace ctl_min() macro with MIN(). MFC after: 1 week	2014-12-20 13:33:31 +00:00
Alexander Motin	0664680536	Constify some static data. MFC after: 2 weeks	2014-12-19 20:51:54 +00:00
Alexander Motin	9602f43616	Reduce number of places where global control_softc is used. At some point we may want to have several CTL instances, and that is not really impossible. MFC after: 2 weeks	2014-12-19 20:35:06 +00:00
Alexander Motin	cb8727e23a	Pass real optimal transfer size supported by backend. For files and ZVOLs that is 1MB now, not 128K. MFC after: 1 week	2014-12-18 22:32:22 +00:00
Alexander Motin	34961f407d	Add configuration options to override physical and UNMAP blocks geometry. While in most cases CTL should correctly fetch those values from backing storages, there are some initiators (like MS SQL), that may not like large physical block sizes, even if they are true. For such cases allow override fetched values with supported ones (like 4K). MFC after: 1 week	2014-12-17 17:30:54 +00:00
Alexander Motin	2124e3b07f	Make sequence numbers checks more strict. While we don't support MCS, hole in received sequence numbers may mean only PDU loss. While we don't support lost PDU recovery, terminate the connection to avoid stuck commands. While there, improve handling of sequence numbers wrap after 2^32 PDUs. MFC after: 2 weeks	2014-12-17 15:13:21 +00:00
Alexander Motin	cb7430346d	Do not count RCTD bit set as an error. We can not really implement it, but specification tells that it "shall" work, so it can be safely ignored. MFC after: 1 week	2014-12-16 21:51:21 +00:00
Alexander Motin	bfbfc4a3cb	Count consecutive read requests as blocking in CTL for files and ZVOLs. Technically read requests can be executed in any order or simultaneously since they are not changing any data. But ZFS prefetcher goes crasy when it receives consecutive requests from different threads. Since prefetcher works on level of separate blocks, instead of two consecutive 128K requests it may receive 32 8K requests in mixed order. This patch is more workaround then a real fix, and it does not fix all of prefetcher problems, but it improves sequential read speed by 3-4x times in some configurations. On the other side it may hurt performance if some backing store has no prefetch, that is why it is disabled by default for raw devices. MFC after: 2 weeks	2014-12-06 20:39:25 +00:00
Alexander Motin	85700d4d7d	In addition to r275481 allow threshold notifications work without UNMAP. While without UNMAP support there is not much initiator can do about it, the administrator still better be notified about the storage overflow. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2014-12-05 13:30:45 +00:00
Alexander Motin	53c146de18	Add to CTL support for threshold notifications for file-backed LUNs. Previously it was supported only for ZVOL-backed LUNs, but now should work for file-backed LUNs too. Used value in this case is a space occupied by the backing file, while available value is an available space on file system. Pool thresholds are still not implemented in this case. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2014-12-04 18:37:42 +00:00
Alexander Motin	5a770b5496	Swap resource count scopes for used/available space. Used count should be reported as per-LUN, while available should not. MFC after: 1 week	2014-12-04 17:36:29 +00:00
Alexander Motin	ef8daf3fed	Add GET LBA STATUS command support to CTL. It is implemented for LUNs backed by ZVOLs in "dev" mode and files. GEOM has no such API, so for LUNs backed by raw devices all LBAs will be reported as mapped/unknown. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2014-12-04 11:34:19 +00:00
Alexander Motin	ffe9621cc3	Increase CTL ports limit from 128 to 256 and LUNs limit from 256 to 1024. After recent optimizations this change is no longer blocked by CTL memory consumption. Those limits are still not free, but much cheaper now. MFC after: 1 week Relnotes: yes Sponsored by: iXsystems, Inc.	2014-12-03 16:04:01 +00:00
Alexander Motin	f9477570ec	Unify function names after r275458. MFC after: 1 month	2014-12-03 15:19:38 +00:00
Alexander Motin	9e52565344	Do not pre-allocate UNIT ATTENTIONs storage for every possible initiator. Abusing ability of major UAs cover minor ones we may not account UAs for inactive ports. Allocate UAs storage for port and start accounting only after some initiator from that port fetched its first POWER ON OCCURRED. This reduces per-LUN CTL memory usage from >1MB to less then 100K. MFC after: 1 month	2014-12-03 15:16:18 +00:00
Alexander Motin	c9fe195c24	Remove some unused code.	2014-12-03 10:39:47 +00:00
Alexander Motin	411598df7a	Do not pre-allocate reservation keys memory for every possible initiator. In configurations with many ports, like iSCSI, each LUN is typically accessed only by limited subset of ports. Allocating that memory on demand allows to reduce CTL memory usage from 5.3MB/LUN to 1.3MB/LUN. MFC after: 1 month	2014-12-03 09:05:53 +00:00
Alexander Motin	2a72b5936d	Plug memory leaks on UNMAP and XCOPY with invalid parameters. MFC after: 1 week	2014-12-03 08:25:41 +00:00
Alexander Motin	77a06f9db0	Convert persis_offset from global variable to softc field.	2014-12-02 12:38:22 +00:00
Alexander Motin	40103f1ec4	Reduce code duplication by creating ctl_set_res_ua() helper.	2014-12-02 12:31:28 +00:00
Alexander Motin	1e8607769f	Removed unused variable and unify some names.	2014-12-02 12:05:44 +00:00
Alexander Motin	fa91cabfbd	When passing LUN IDs through treat ASCII values as fixed-length, not interpreating NULLs as EOLs, but converting them to spaces. SPC-4 does not tell that T10-based IDs should be NULL-terminated/padded. And while it tells that it should include only ASCII chars (0x20-0x7F), there are some USB sticks (SanDisk Ultra Fit), that have NULLs inside the value. Treating NULLs as EOLs there made those LUN IDs non-unique. MFC after: 1 week	2014-12-01 15:21:54 +00:00
Alexander Motin	7511bd04e4	Move ctlfe_onoffline() out of lock to let it sleep when needed. Do some more other polishing while there. MFC after: 2 weeks	2014-12-01 13:55:45 +00:00
Alexander Motin	f7241cceb0	Coalesce last data move and command status for read commands. Make CTL core and block backend set success status before initiating last data move for read commands. Make CAM target and iSCSI frontends detect such condition and send command status together with data. New I/O flag allows to skip duplicate status sending on later fe_done() call. For Fibre Channel this change saves one of three interrupts per read command, increasing performance from 126K to 160K IOPS. For iSCSI this change saves one of three PDUs per read command, increasing performance from 1M to 1.2M IOPS. MFC after: 1 month Sponsored by: iXsystems, Inc.	2014-11-25 17:53:35 +00:00
Alexander Motin	993a751eb3	Decouple datamove/done logic from CTL status set.	2014-11-25 12:22:29 +00:00
Alexander Motin	4a2863452f	Use ctl_set_success() instead of direct inlining. MFC after: 1 week	2014-11-25 06:11:05 +00:00
Alexander Motin	1251a76b12	Replace home-grown CTL IO allocator with UMA. Old allocator created significant lock congestion protecting its lists of preallocated I/Os, while UMA provides much better SMP scalability. The downside of UMA is lack of reliable preallocation, that could guarantee successful allocation in non-sleepable environments. But careful code review shown, that only CAM target frontend really has that requirement. Fix that making that frontend preallocate and statically bind CTL I/O for every ATIO/INOT it preallocates any way. That allows to avoid allocations in hot I/O path. Other frontends either may sleep in allocation context or can properly handle allocation errors. On 40-core server with 6 ZVOL-backed LUNs and 7 iSCSI client connections this change increases peak performance from ~700K to >1M IOPS! Yay! :) MFC after: 1 month Sponsored by: iXsystems, Inc.	2014-11-24 11:37:27 +00:00
Alexander Motin	7c5db11fe0	Read cs_outstanding_ctl_pdus before incrementing it for NEXUS RESET task. This removes extra log noise on idle connection termination. MFC after: 1 week	2014-11-24 00:59:51 +00:00
Alexander Motin	94ab1f3693	Make iSCSI frontend less chatty while waiting for tasks termination. MFC after: 1 week	2014-11-22 04:40:24 +00:00
Steven Hartland	85c9dd9d89	Prevent overflow issues in timeout processing Previously, any timeout value for which (timeout * hz) will overflow the signed integer, will give weird results, since callout(9) routines will convert negative values of ticks to '1'. For unsigned integer overflow we will get sufficiently smaller timeout values than expected. Switch from callout_reset, which requires conversion to int based ticks to callout_reset_sbt to avoid this. Also correct isci to correctly resolve ccb timeout. This was based on the original work done by Eygene Ryabinkin <rea@freebsd.org> back in 5 Aug 2011 which used a macro to help avoid the overlow. Differential Revision: https://reviews.freebsd.org/D1157 Reviewed by: mav, davide MFC after: 1 month Sponsored by: Multiplay	2014-11-21 21:01:24 +00:00
Alexander Motin	6f58afed59	Make cfiscsi_offline() synchronous, waiting for connections termination before return. This should make ctld restart more clean and predictable. MFC after: 2 weeks	2014-11-21 18:05:02 +00:00
Alexander Motin	9a3e44f9c5	Close race between cfiscsi_offline() and new connection arrival. Incoming connection should be either rejected or accepted and terminated.	2014-11-21 15:38:31 +00:00
Alexander Motin	c6ceed4f7a	Remove bunch of unused lun variables. MFC after: 1 week	2014-11-21 11:21:39 +00:00
Alexander Motin	00dfef9e94	Reduce race between LUN destruction and request arrival. MFC after: 1 week	2014-11-21 11:20:24 +00:00
Alexander Motin	c82bf2659b	Log errors for absent LUNs too. MFC after: 1 week	2014-11-21 08:24:37 +00:00
Alexander Motin	23b30f5600	Partially reconstruct Active/Standby clusting. In this mode one head is in Active state, supporting all commands, while another is in Standby state, supporting only minimal LUN discovery subset. It is still incomplete since Standby state requires reservation support, which is impossible to do right without having interlink between heads. But it allows to run some basic experiments.	2014-11-21 06:27:37 +00:00
John Baldwin	a92cf726f8	Lock the scsi_low code and the drivers which use it along with other related cleanups: - Require each driver to initalize a mutex in the scsi_low_softc that is shared with the scsi_low code. This mutex is used for CAM SIMs, timers, and interrupt handlers. - Replace the osdep function switch with direct calls to the relevant CAM functions and direct manipulation of timers via callout(9). - Collapse the CAM-specific scsi_low_osdep_interface substructure directly into scsi_low_softc. - Use bus_() instead of bus_space_(). - Return BUS_PROBE_DEFAULT from probe routines instead of 0. - No need to zero softcs. - Pass 0ul and ~0ul instead of 0 and ~0 to bus_alloc_resource(). - Spell "dettach" as "detach". - Remove unused 'dvname' variables. - De-spl(). Tested by: no one	2014-11-20 20:50:05 +00:00

1 2 3 4 5 ...

1586 Commits