freebsd-skq

Author	SHA1	Message	Date
smh	b27d66677a	Format CDB output as 2 digit hex correcting the length Approved by: pjd (mentor) MFC after: 1 week	2013-01-31 14:07:24 +00:00
jh	cd3fe29726	Sanitize the element descriptor string before using it as a device name. Reported and tested by: Vitalij Satanivskij Reviewed by: gibbs, mav	2013-01-24 17:28:39 +00:00
kan	404c93b661	Do not pretend to have autosense data when no such data is available. Make umass return an error code if SCSI sense retrieval request has failed. Make sure scsi_error_action honors SF_NO_RETRY and SF_NO_RECOVERY in all cases, even if it cannot parse sense bytes. Reviewed by: hselasky (umass), scottl (cam)	2013-01-19 03:19:39 +00:00
mav	74a37192c1	- Add missig xpt_schedule() call for cases when requested immediate CCB priority is lower then payload/TUR one. - Reduce TUR priority and avoid sending them if there are any other outstanding commands, alike to DA driver.	2013-01-11 19:11:56 +00:00
mav	7abdc5deda	Do not schedule periph for payload/TUR requests if reprobe is in progress to avoid sending extra READ CAPACITY requests by dastart(). Schedule periph again on reprobe completion, or otherwise it may stuck indefinitely long. This should fix USB explore thread hanging on device unplug, waiting for periph destruction. Reported by: hselasky	2013-01-11 16:10:11 +00:00
smh	f63fde9f6f	Changed scsi_da device requests to use the sysctl tunable value for retry_count and da_default_timeout where their current hardcoded values matched the current default value for said tunables. PR: kern/169976 Reviewed by: pjd (mentor) Approved by: mav	2013-01-10 12:25:00 +00:00
smh	648c153a8f	Updates delete_method sysctl changes to always maintain disk d_flags DISKFLAG_CANDELETE. While this change makes this layer consistent other layers such as UFS and ZFS BIO_DELETE support may not notice any change made manually via these device sysctls until the device is reopened via a mount. Also corrected var order in dadeletemethodsysctl PR: kern/169801 Reviewed by: pjd (mentor) Approved by: mav MFC after: 2 weeks	2013-01-10 11:57:46 +00:00
smh	7d5ba844df	Removes essentially unused variables from scsi_da probe setups PR: kern/169835 Reviewed by: pjd (mentor) Approved by: mav MFC after: 2 weeks	2013-01-10 11:28:12 +00:00
ken	dc876f51b7	Make CTL work a little better with loading and unloading drivers. Previously CTL would leave individual LUNs enabled in the target driver, whether or not the port as a whole was enabled. It would also leave the wildcard LUN enabled indefinitely. This change means that CTL will enable and disable any active LUNs, as well as the wildcard LUN, when enabling and disabling a port. Also, fix a bug that could crop up due to an uninitialized CCB type. ctl.c: Before calling ctl_frontend_online(), run through the LUN list and enable all active LUNs. After calling ctl_frontend_offline(), run through the LUN list and disble all active LUNs. scsi_ctl.c: Before bringing a port online, allocate the wildcard peripheral for that bus. And after taking a port offline, invalidate the wildcard peripheral for that bus. Make sure that we hold the SIM lock around all calls to xpt_action() and other transport layer interfaces that require it. Use CAM_SIM_{LOCK\|UNLOCK} consistently to acquire and release the SIM lock. Update a number of outdated comments. Some of these should have been fixed long ago. Actually do LUN disbables now. The newer drivers in the tree work correctly for this as far as I know. Initialize the CCB type to CTLFE_CCB_DEFAULT to avoid a panic due to uninitialized memory. Submitted by: Chuck Tuffli (partially) MFC after: 1 week	2013-01-09 17:02:08 +00:00
mav	d3671c1677	Make SES driver to not fall out on some errors in Additional Status page. This allows CAM devices still get their physical paths even if status of later elements it corrupted. Sponsored by: iXsystems, Inc.	2012-12-20 19:51:32 +00:00
mav	4a14a78087	Fix bug in r242720, that caused additional status page to not be used if descriptor page is supported.	2012-12-19 09:55:13 +00:00
ken	287e4f6a6b	Fix a couple of CTL locking issues and clean up some duplicated code. ctl_frontend_cam_sim.c: Coalesce cfcs_online() and cfcs_offline() into a single function since these were identical except for one line. Make sure we hold the SIM lock around path creation, and calling xpt_rescan(). scsi_ctl.c: In ctlfe_onoffline(), make sure we hold the SIM lock around path creation and free calls, as well as xpt_action(). In ctlfe_lun_enable(), hold the SIM lock around path and peripheral operations that require it. Sponsored by: Spectra Logic Corporation MFC after: 1 week	2012-12-09 19:53:21 +00:00
ken	30f51af395	Make sure we hold the SIM lock when calling xpt_free_path(). Sponsored by: Spectra Logic Corporation MFC after: 1 week	2012-12-08 04:55:06 +00:00
ken	f524484130	Fix the CTL OOA queue dumping code so that it does not hold a mutex while doing a copyout. That can cause a panic, because copyout can trigger VM faults, and we can't handle VM faults while holding a mutex. The solution here is to malloc a separate buffer to hold the OOA queue entries, so that we don't risk a VM fault while filling up the buffer and we don't have to drop the lock. The other solution would be to wire the user's memory while filling their buffer with copyout, but that would have been a little more complex. Also fix a debugging parenthesis issue in ctl_abort_task() pointed out by Chuck Tuffli. Sponsored by: Spectra Logic Corporation MFC after: 1 week	2012-12-08 04:16:07 +00:00
ken	ff1edb089b	Fix a device departure bug for the the pass(4), enc(4), sg(4) and ch(4) drivers. The bug occurrs when a userland process has the driver instance open and the underlying device goes away. We get the devfs callback that the device node has been destroyed, but not all of the closes necessary to fully decrement the reference count on the CAM peripheral. The reason is that once devfs calls back and says the device has been destroyed, it is moved off to deadfs, and devfs guarantees that there will be no more open or close calls. So the solution is to keep track of how many outstanding open calls there are on the device, and just release that many references when we get the callback from devfs. scsi_pass.c, scsi_enc.c, scsi_enc_internal.h: Add an open count to the softc in these drivers. Increment it on open and decrement it on close. When we get a devfs callback to say that the device node has gone away, decrement the peripheral reference count by the number of still outstanding opens. Make sure we don't access the peripheral with cam_periph_unlock() after what might be the final call to cam_periph_release_locked(). The peripheral might have been freed, and we will be dereferencing freed memory. scsi_ch.c, scsi_sg.c: For the ch(4) and sg(4) drivers, add the same changes described above, and in addition, fix another bug that was previously fixed in the pass(4) and enc(4) drivers. These drivers were calling destroy_dev() from their cleanup routine, but that could cause a deadlock because the cleanup routine could be indirectly called from the driver's close routine. This would cause a deadlock, because the device node is being held open by the active close call, and can't be destroyed. Sponsored by: Spectra Logic Corporation MFC after: 1 week	2012-12-08 04:03:04 +00:00
ken	b8327e28a3	Fix a panic during CAM EDT traversal. The problem was a race condition between the EDT traversal used by things like 'camcontrol devlist', and CAM peripheral driver removal. The EDT traversal code holds the CAM topology lock, and wants to show devices that have been invalidated. It acquires a reference to the peripheral to make sure the peripheral it is examining doesn't go away. However, because the peripheral removal code in camperiphfree() drops the CAM topology lock to call the peripheral's destructor routine, we can run into a situation where the EDT traversal increments the peripheral reference count after free process is already in progress. At that point, the reference count is ignored, because it was 0 when we started the process. Fix this race by setting a flag, CAM_PERIPH_FREE, that I previously added and checked in xptperiphtraverse() and xptpdperiphtravsere(), but failed to use. If the EDT traversal code sees that flag, it will know that the peripheral free process has already started, and that it should not access that peripheral. Also, fix an inconsistency in the locking between xptpdperiphtraverse() and xptperiphtraverse(). They now both hold the CAM topology lock while calling the peripheral traversal function. cam_xpt.c: Change xptperiphtraverse() to hold the CAM topology lock across calls to the traversal function. Take out the comment in xptpdperiphtraverse() that referenced the locking inconsistency. cam_periph.c: Set the CAM_PERIPH_FREE flag when we are in the process of freeing a peripheral driver. Sponsored by: Spectra Logic Corporation MFC after: 1 week	2012-12-07 23:48:54 +00:00
mav	a094deb5ee	Fix problem with the Samsung 840 PRO series SSD detection. The device reports support for SATA Asynchronous Notification in its IDENTIFY data, but returns error on attempt to enable that feature. Make SATA XPT of CAM only report these errors, but not fail the device. MFC after: 1 week	2012-11-26 20:07:10 +00:00
mav	70551b60a0	Use information about suported diagnostic pages to avoid reading optional Element Descriptor page if it is not supported. This removes one error message from verbose logs during boot on systems with some enclosures. Sponsored by: iXsystems, Inc.	2012-11-07 23:12:53 +00:00
trasz	7d2cc01a24	Use M_ZERO instead of explicit memsets and bzeros.	2012-10-30 12:52:41 +00:00
trasz	76f8fadfa8	Fix locking problem in disk_resize(); previously it would run without topology lock, resulting in assertion when running with DIAGNOSTIC. Reviewed by: mav (earlier version)	2012-10-29 17:52:43 +00:00
mav	dd686c1964	Remove priority enforcement from xpt_ation(). It is not good and even not safe in some cases to reduce CCB priority after it was scheduled with high priority. This fixes reproducible deadlock when command sent through the pass interface while ATA XPT recovers from command timeout. Instead of that enforce priority at passioctl(). libcam provides no obvious interface to specify CCB priority and so much (all?) code specifies zero (highest) priority. This change limits pass CCBs priority to NORMAL run level, allowing XPT to complete bus and device recovery after reset before running any payload.	2012-10-27 10:14:12 +00:00
mav	1cb4a9777f	Remove several uses of numeric priorities from immediate CCB setups.	2012-10-27 09:40:29 +00:00
mav	0d3ac10d4d	Remove one more numeric priority constant.	2012-10-27 08:52:33 +00:00
mav	a8f86acc09	Remove two more 'periph == NULL' checks missed in r241404. This condition can never be true as functions are called from single place and the checks just pollute the code and confuse Clang Static Analyzer.	2012-10-23 16:03:00 +00:00
kib	560aa751e0	Remove the support for using non-mpsafe filesystem modules. In particular, do not lock Giant conditionally when calling into the filesystem module, remove the VFS_LOCK_GIANT() and related macros. Stop handling buffers belonging to non-mpsafe filesystems. The VFS_VERSION is bumped to indicate the interface change which does not result in the interface signatures changes. Conducted and reviewed by: attilio Tested by: pho	2012-10-22 17:50:54 +00:00
eadler	3f7a414911	remove duplicate semicolons where possible. Approved by: cperciva MFC after: 1 week	2012-10-22 03:00:37 +00:00
eadler	5d26a5d3b5	Adds 4K quirks for the some SSD's which all perform better when 4K aligned and only except 4K deletes (TRIM). PR: kern/169974 Submitted by: Steven Hartland <steven.hartland@multiplay.co.uk> Tested by: ak Reviewed by: mav Approved by: cperciva (implicit) MFC after: 1 week	2012-10-20 15:30:14 +00:00
eadler	4ac8b06a12	Add support for samsung HM250JI PR: usb/121474 Submitted by: Ben Stuyts <ben@altesco.nl> Approved by: cperciva (implicit) MFC after: 3 days	2012-10-16 17:49:14 +00:00
eadler	abb735a61b	Add support for the USB DISK Pro PMAP. This patch has sit for 6 years in the PR database. PR: usb/96381 Submitted by: jhs Reviewed by: mav Approved by: cperciva (implicit) MFC after: 3 days	2012-10-15 12:33:53 +00:00
mav	fdb8b5f8ba	Add explicit check for not set time inside cam_periph_freeze_after_event(). System time is set later on boot process then initial bus scan by CAM. Until that moment microtime() is equal to microuptime(), and if system boots quickly, the value can be close to zero. That causes settle time waiting even for buses that don't use reset during probe. On my test system this reduces boot time by 1 second if USB enabled, or by 4 seconds if USB disabled. CAM waited for ctl2cam0 bus "settle".	2012-10-14 08:50:05 +00:00
mav	b2c37a9f0e	Don't exclude XPT SIM from locking in xpt_create_path_unlocked(). We don't want xpt periph, device, target or bus disappeared because of incorrect reference counting.	2012-10-13 18:24:52 +00:00
mav	e21d57c65c	Use xpt_create_path_unlocked() for initial debug path compilation because we are not holding respective SIM lock.	2012-10-13 18:11:50 +00:00
mav	fd39176a32	Extend SIM lock coverage during the bus registration process.	2012-10-13 17:55:06 +00:00
mav	6d988f09cf	Fix XPT_DEBUG paths operations locking: - Extend the lock to cover xpt_path_release() for the new path. - While xpt_action() is called while holding right SIM lock for the new bus, the old path release may require different SIM lock. So we have to temporary drop the new lock and get the old one.	2012-10-13 11:23:16 +00:00
mav	fd248e3cc6	XPT_DEV_MATCH is probably the only xpt_action() method that is called without holding SIM lock. It really doesn't need that lock, but adding it removes that specific exception, allowing to assert locking there later. Submitted by: ken@ (earlier version)	2012-10-13 10:18:36 +00:00
mav	69de44a310	Get SIM lock in several places while calling CAM functions. This fixes several use-after-free panics on systems with SAS enclosures. Submitted by: ken@, mav@	2012-10-12 18:21:31 +00:00
mav	c5f476d2d7	Protect xpt_getattr() calls with the SIM lock and assert that. Submitted by: ken@ (earlier version)	2012-10-12 17:18:24 +00:00
mav	3b492402e1	Use separate malloc buckets for CAM devices, CCBs and paths. This will make it easier to track down the source of any use after free problems. Submitted by: ken@	2012-10-11 20:14:11 +00:00
mav	def5df11b3	Don't duplicate path/ccb allocation code, use existing functions.	2012-10-11 19:57:11 +00:00
mav	e867826c02	Increase device CCB queue array size by CAM_RL_VALUES - 1 (4) elements. It is required to store extra recovery requests in case of bus resets. On ATA/SATA this fixes assertion panics on HEAD with INVARIANTS enabled or possible memory corruptions otherwise if timeout/reset happens when device CCB queue is already full. Reported by: gibbs@ MFC after: 1 week	2012-10-11 15:21:07 +00:00
mav	81626f7fab	There are SCSI conditions that are not an errors. In those cases cderror() returns zero while request status is not CAM_REQ_CMP. That could cause partial device attach or other unexpected results. Found by: Clang Static Analyzer	2012-10-10 19:32:40 +00:00
mav	9e15eadd18	Really handle xpt_compile_path() error in xpt_bus_register() instead of print error message and probably crash just after it on NULL dereference. Found by: Clang Static Analyzer	2012-10-10 18:34:15 +00:00
mav	35f2c3dc00	Remove 'periph == NULL' check from bunch of periph drivers. This condition can never be true as functions are called from single place and the checks just pollute the code and confuse Clang Static Analyzer.	2012-10-10 18:10:11 +00:00
avg	562e7d7aa0	ata_da: set disk::d_ident from serial number MFC after: 10 days	2012-10-06 21:42:07 +00:00
mav	06f4d9dad6	Change queue overflow checks from DIAGNOSTIC+panic() to KASSERT() to make them enabled on HEAD by default. It is probably better to do single compare then hunt for unexpected memory corruption.	2012-09-28 12:13:34 +00:00
trasz	cec93f2f0f	Remove useless NULL checks after M_WAITOK allocations.	2012-09-27 10:51:38 +00:00
trasz	625aa019e1	Fix panic in CTL caused by trying to free invalid pointers passed by the userland process via the IOCTL interface. Reviewed by: ken@	2012-09-26 07:09:15 +00:00
tijl	be53f707af	Fix a panic when trying to play invalid audio tracks.	2012-09-19 18:42:31 +00:00
eadler	2dc3520f1e	s/ is is / is /g s/ a a / a /g Approved by: cperciva MFC after: 3 days	2012-09-14 22:00:03 +00:00
jhb	e8b429e1c0	Remove some more NetBSD compat shims and other unused bits from these drivers: - Remove scsi_low_pisa.*, they were unused. - Remove <compat/netbsd/physio_proc.h> and calls to the stubs in that header. They were empty nops. - Retire sl_xname and use device_get_nameunit() and device_printf() with the underlying device_t instead. - Remove unused {ct,ncv,nsp,stg}print() functions. - Remove empty SOFT_INTR_REQUIRED() macro and the unused sl_irq member.	2012-09-10 18:49:49 +00:00
jhb	56003f21a4	Remove NetBSD compat shims for drivers originally shared with NetBSD/pc98. NetBSD/pc98 was never merged into the main NetBSD tree and is no longer developed. Adding locking to these drivers would have made the compat shims hard to impossible to maintain, so remove the shims to ease future changes. These changes were verified by md5. Some additional shims can be removed that do affect the compiled results that I will probably do in another round. Approved by: nyan (tentatively)	2012-09-06 18:53:33 +00:00
jimharris	c5c7ea28c7	Fix scsi_da's BIO_DELETE->SCSI_UNMAP translation to use correct local variable when determining various sizes related to SCSI UNMAP block descriptor lists. Sponsored by: Intel Reviewed by: mav MFC after: 3 days	2012-08-24 17:08:02 +00:00
mjacob	0b3ddf87a1	1. Remove SEN support. I doubt there are any working examples of this hardware still running (close to twenty years now). 2. Quiesece and use ENC_VLOG instead of ENC_LOG for most complaints. That is, they're visible with bootverbose, but otherwise quiesced and not repeatedly spamming messages with constant reminders that hardware in this space is rarely fully compliant. MFC after: 1 month	2012-08-12 17:01:07 +00:00
mjacob	1ac79d17a9	Add missing VERIFY_10 definition. MFC after: 1 month	2012-08-12 16:58:38 +00:00
bz	a605e9ce57	Remove opt_enc.h from files committed with r235911. enc(4) is the 'encapsulating interface' used with IPsec and has nothing to do with storage 'enclosure' services. MFC after: 3 days Noticed while: debugging why enc(4) is no longer automatically created	2012-07-30 03:00:58 +00:00
mav	24017b5387	Implement media change notification for DA and CD removable media devices. It includes three parts: 1) Modifications to CAM to detect media media changes and report them to disk(9) layer. For modern SATA (and potentially UAS) devices it utilizes Asynchronous Notification mechanism to receive events from hardware. Active polling with TEST UNIT READY commands with 3 seconds period is used for incapable hardware. After that both CD and DA drivers work the same way, detecting two conditions: "NOT READY: Medium not present" after medium was detected previously, and "UNIT ATTENTION: Not ready to ready change, medium may have changed". First one reported to disk(9) as media removal, second as media insert/change. To reliably receive second event new AC_UNIT_ATTENTION async added to make UAs broadcasted to all periphs by generic error handling code in cam_periph_error(). 2) Modifications to GEOM core to handle media remove and change events. Media removal handled by spoiling all consumers attached to the provider. Media change event also schedules provider retaste after spoiling to probe new media. New flag G_CF_ORPHAN was added to consumers to reflect that consumer is in process of destruction. It allows retaste to create new geom instance of the same class, while previous one is still dying. 3) Modifications to some GEOM classes: DEV -- to report media change events to devd; VFS -- to handle spoiling same as orphan to prevent accessing replaced media. PART class already handles spoiling alike to orphan. Reviewed by: silence on geom@ and scsi@ Tested by: avg Sponsored by: iXsystems, Inc. / PC-BSD MFC after: 2 months	2012-07-29 11:51:48 +00:00
mjacob	4253fe261f	Handle a case where we had an SRR that pushed back the data pointer. This is a temp fix that resubmits the command, adjusted, so that the backend can fetch the data again. Sponsored by: Spectralogic MFC after: 1 month	2012-07-28 20:08:14 +00:00
mav	ef01c33903	Fix off by one error in ses_enc_desc_last_byte().	2012-07-24 13:32:49 +00:00
mav	ce83f9282b	Do not call ses_softc_cleanup() in case of configuration read failure. Just free inclomplete daemon cache instead to let it retry next time. Premature ses_softc_cleanup() caused NULL dereference when freed softc was accessed later.	2012-07-24 13:08:43 +00:00
mav	b6e3f0ff49	Fix some typos in r238595. Reported by: brueffer	2012-07-18 12:41:09 +00:00
mav	89b28f9c69	Add bunch of new ASC/ASCQ values from T10 site.	2012-07-18 12:23:45 +00:00
mjacob	5d2a55de50	The call to disk_resize causes a panic if DIAGNOSTIC is set. Coping with that while the finest minds of our generation figure out why.	2012-07-14 02:59:11 +00:00
brueffer	4526c9ddda	Add and utilize defines for the ATA device register. PR: 169764 Submitted by: Steven Hartland <steven.hartland@multiplay.co.uk> Reviewed by: mav MFC after: 2 weeks	2012-07-12 10:09:34 +00:00
brueffer	a32f7c6c14	Apply similar same change as in r238379: Renamed the kern.cam.ada.ada_send_ordered sysctl and tunable to kern.cam.ada.send_ordered, more in line with the other da sysctls/tunables. Suggested by: kib	2012-07-11 23:22:09 +00:00
brueffer	6975aedf10	Renamed the kern.cam.da.da_send_ordered sysctl and tunable to kern.cam.da.send_ordered, more in line with the other da sysctls/tunables. PR: 169765 Submitted by: Steven Hartland <steven.hartland@multiplay.co.uk> Reviewed by: mav	2012-07-11 23:00:26 +00:00
brueffer	b4a8dc85d6	- fix description of SECURITY_FREEZE_LOCK (1) - add missing underscores to SECURITY_DISABLE_PASSWORD PR: 169763 (1) Submitted by: Steven Hartland <steven.hartland@multiplay.co.uk> Reviewed by: mav MFC after: 1 week	2012-07-11 08:24:30 +00:00
trasz	770bbae61e	Make the da(4) driver notify GEOM about LUN size change. Reviewed by: mav Sponsored by: FreeBSD Foundation	2012-07-07 22:19:51 +00:00
eadler	30437f51c7	Remove variables which are initialized but never used thereafter reported by gcc46 warning Reviewed by: scottl Approved by: cperciva MFC after: 1 week	2012-07-07 17:17:43 +00:00
ken	064fbb4fb6	Add a loader tunable, kern.cam.ctl.disable, that will disable loading CTL. This may be useful in very low memory installations. MFC after: 3 days	2012-06-28 19:39:30 +00:00
imp	4c7b908928	Add a sysctl to set the cdrom timeout. Data recovery operations from a CD or DVD drive with a damaged disc often benefit from a shorter timeout. Also, when retries are set to 0, an application is expecting errors and recovering them so do not print the error into the log. The number of expected errors can literally be in the hundreds of thousands which significantly slows data recovery. Reviewed by: ken@ (but quite some time ago).	2012-06-28 07:01:48 +00:00
ken	c7cc3c126d	Fix a typo in a panic() call. PR: kern/169497 Submitted by: Steven Hartland <steven.hartland@multiplay.co.uk> MFC after: 3 days	2012-06-28 03:36:13 +00:00
ken	877f7e22dc	Fix an issue that caused the kernel to panic inside CTL when trying to attach to target capable HBAs that implement the old immediate notify (XPT_IMMED_NOTIFY) and notify acknowledge (XPT_NOTIFY_ACK) CCBs. The new API has been in place since SVN change 196008 in 2009. The solution is two-fold: fix CTL to handle the responses from the HBAs, and convert the HBA drivers in question to use the new API. These drivers have not been tested with CTL, so how well they will interoperate with CTL is unknown. scsi_target.c: Update the userland target example code to use the new immediate notify API. scsi_ctl.c: Detect when an immediate notify CCB is returned with CAM_REQ_INVALID or CAM_PROVIDE_FAIL status, and just free it. Fix a duplicate assignment. aic79xx.c, aic79xx_osm.c: Update the aic79xx driver to use the new API. Target mode is not enabled on for this driver, so the changes will have no practical effect. aic7xxx.c, aic7xxx_osm.c: Update the aic7xxx driver to use the new API. sbp_targ.c: Update the firewire target code to work with the new API. mpt_cam.c: Update the mpt(4) driver to work with the new API. Target mode is only enabled for Fibre Channel mpt(4) devices. MFC after: 3 days	2012-06-26 14:51:35 +00:00
ken	be54b17782	Fix a bug which causes a panic in daopen(). The panic is caused by a da(4) instance going away while GEOM is still probing it. In this case, the GEOM disk class instance has been created by disk_create(), and the taste of the disk is queued in the GEOM event queue. While that event is queued, the da(4) instance goes away. When the open call comes into the da(4) driver, it dereferences the freed (but non-NULL) peripheral pointer provided by GEOM, which results in a panic. The solution is to add a callback to the GEOM disk code that is called when all of its resources are cleaned up. This is implemented inside GEOM by adding an optional callback that is called when all consumers have detached from a provider, and the provider is about to be deleted. scsi_cd.c, scsi_da.c: In the register routine for the cd(4) and da(4) routines, acquire a reference to the CAM peripheral instance just before we call disk_create(). Use the new GEOM disk d_gone() callback to register a callback (dadiskgonecb()/cddiskgonecb()) that decrements the peripheral reference count once GEOM has finished cleaning up its resources. In the cd(4) driver, clean up open and close behavior slightly. GEOM makes sure we only get one open() and one close call, so there is no need to set an open flag and decrement the reference count if we are not the first open. In the cd(4) driver, use cam_periph_release_locked() in a couple of error scenarios to avoid extra mutex calls. geom.h: Add a new, optional, providergone callback that is called when a provider is about to be deleted. geom_disk.h: Add a new d_gone() callback to the GEOM disk interface. Bump the DISK_VERSION to version 2. This probably should have been done after a couple of previous changes, especially the addition of the d_getattr() callback. geom_disk.c: Add a providergone callback for the disk class, g_disk_providergone(), that calls the user's d_gone() callback if it exists. Bump the DISK_VERSION to 2. geom_subr.c: In g_destroy_provider(), call the providergone callback if it has been provided. In g_new_geomf(), propagate the class's providergone callback to the new geom instance. blkfront.c: Callers of disk_create() are supposed to pass in DISK_VERSION, not an explicit disk API version number. Update the blkfront driver to do that. disk.9: Update the disk(9) man page to include information on the new d_gone() callback, as well as the previously added d_getattr() callback, d_descr field, and HBA PCI ID fields. MFC after: 5 days	2012-06-24 04:29:03 +00:00
mav	6a0dd123c9	Add scsi_extract_sense_ccb() -- wrapper around scsi_extract_sense_len(). It allows to remove number of duplicate checks from several places.	2012-06-23 12:32:53 +00:00
ken	e2f0baed00	Change 'camcontrol defects' to first probe a drive to find out how much defect information it has before grabbing the full defect list. This works around a bug with some Hitachi drives that generate data overrun errors when they are asked for more defect data than they have. The change is done in a spec-compliant way, so it should have no negative impact on drives that don't have this issue. This is based on work originally done at Sandvine. scsi_da.h: Add a define for the maximum amount of data that can be contained in a defect list. camcontrol.c: Update the readdefects() function to issue an initial command to determine the length of the defect list, and then use that length in the request for the full defect list. camcontrol.8: Add a note that some drives will report 0 defects available if you don't request either the PLIST or GLIST. Submitted by: Mark Johnston <markjdb@gmail.com> (original version) MFC after: 3 days	2012-06-22 18:57:06 +00:00
mav	1e34717b22	Don't print SCSI Queue Full and CAM_REQUEUE_REQ statuses as errors if they were handled and retried. They are part of normal operation for SCSI TCQ. MFC after: 3 days	2012-06-22 16:20:13 +00:00
mav	b8ac542c35	Make cam_periph_hold() behavior consistent: drop taken reference and return ENXIO if periph was invalidated while we were waiting for it. MFC after: 1 week	2012-06-21 15:14:51 +00:00
mav	a632fb1531	In camisr() clear CAM_SIM_ON_DONEQ flag after camisr_runqueue() purged SIM done queue. Clearing it before caused extra SIM queueing in some cases. It was invisible during normal operation, but during USB device unplug and respective SIM destruction it could keep pointer on SIM without having counted reference and as result crash the system by use afer free. Reported by: hselasky MFC after: 1 week	2012-06-21 14:35:46 +00:00
mav	f62c46ca1b	Remove unused error variables in cdclose() and daclose().	2012-06-20 18:35:36 +00:00
mav	056fd51a22	Check status of cam_periph_hold() inside cdclose(). If cd device was invalidated while open, cam_periph_hold() will return error and won't get the reference. Following reference release will crash the system. Sponsored by: iXsystems, Inc. MFC after: 3 days	2012-06-20 18:25:51 +00:00
ken	188f5a2133	Fix several reference counting and object lifetime issues between the pass(4) and enc(4) drivers and devfs. The pass(4) driver uses the destroy_dev_sched() routine to schedule its device node for destruction in a separate thread context. It does this because the passcleanup() routine can get called indirectly from the passclose() routine, and that would cause a deadlock if the close routine tried to destroy its own device node. In any case, once a particular passthrough driver number, e.g. pass3, is destroyed, CAM considers that unit number (3 in this case) available for reuse. The problem is that devfs may not be done cleaning up the previous instance of pass3, and will panic if isn't done cleaning up the previous instance. The solution is to get a callback from devfs when the device node is removed, and make sure we hold a reference to the peripheral until that happens. Testing exposed some other cases where we have reference counting issues, and those were also fixed in the pass(4) driver. cam_periph.c: In camperiphfree(), reorder some of the operations. The peripheral destructor needs to be called before the peripheral is removed from the peripheral is removed from the list. This is because once we remove the peripheral from the list, and drop the topology lock, the peripheral number may be reused. But if the destructor hasn't been called yet, there may still be resources hanging around (like devfs nodes) that haven't been fully cleaned up. cam_xpt.c: Add an argument to xpt_remove_periph() to indicate whether the topology lock is already held. scsi_enc.c: Acquire an extra reference to the peripheral during registration, and release it once we get a callback from devfs indicating that the device node is gone. Call destroy_dev_sched_cb() in enc_oninvalidate() instead of calling destroy_dev() in the cleanup routine. scsi_pass.c: Add reference counting to handle peripheral and devfs object lifetime issues. Add a reference to the peripheral and the devfs node in the peripheral registration. Don't attempt to add a physical path alias if the peripheral has been marked invalid. Release the devfs reference once the initial physical path alias taskqueue run has completed. Schedule devfs node destruction in the passoninvalidate(), and release our peripheral reference in a new routine, passdevgonecb() once the devfs node is gone. This allows the peripheral to fully go away, and the peripheral destructor, passcleanup(), will get called. MFC after: 3 days Sponsored by: Spectra Logic	2012-06-20 17:08:00 +00:00
mav	387c8d1228	Remove never used CD/DA_FLAG_TAGGED_QUEUING flags. MFC after: 1 week	2012-06-18 12:45:35 +00:00
mav	478d881b57	One more major cam_periph_error() rewrite to improve error handling and reporting. It includes: - removing of error messages controlled by bootverbose, replacing them with more universal and informative debugging on CAM_DEBUG_INFO level, that is now built into the kernel by default; - more close following to the arguments submitted by caller, such as SF_PRINT_ALWAYS, SF_QUIET_IR and SF_NO_PRINT; consumer knows better which errors are usual/expected at this point and which are really informative; - adding two new flags SF_NO_RECOVERY and SF_NO_RETRY to allow caller specify how much assistance it needs at this point; previously consumers controlled that by not calling cam_periph_error() at all, but that made behavior inconsistent and debugging complicated; - tuning debug messages and taken actions order to make debugging output more readable and cause-effect relationships visible; - making camperiphdone() (common device recovery completion handler) to also use cam_periph_error() in most cases, instead of own dumb code; - removing manual sense fetching code from cam_periph_error(); I was told by number of people that it is SIM obligation to fetch sense data, so this code is useless and only significantly complicates recovery logic; - making ada, da and pass driver to use cam_periph_error() with new limited recovery options to handle error recovery and debugging in common way; as one of results, CAM_REQUEUE_REQ and other retrying statuses are now working fine with pass driver, that caused many problems before. - reverting r186891 by raj@ to avoid burning few seconds in tight DELAY() loops on device probe, while device simply loads media; I think that problem may already be fixed in other way, and even if it is not, solution must be different. Sponsored by: iXsystems, Inc. MFC after: 2 weeks	2012-06-09 13:07:44 +00:00
mav	7a24c2b54f	Add CAM_DEBUG_INFO debug messages for periph created/invalidated/destroyed and for asyncs sent.	2012-06-07 10:53:42 +00:00
mav	56a5c844a2	To make CAM debugging easier, compile in some debug flags (CAM_DEBUG_INFO, CAM_DEBUG_CDB, CAM_DEBUG_PERIPH and CAM_DEBUG_PROBE) by default. List of these flags can be modified with CAM_DEBUG_COMPILE kernel option. CAMDEBUG kernel option still enables all possible debug, if not overriden. Additional 50KB of kernel size is a good price for the ability to debug problems without rebuilding the kernel. In case where size is important, debugging can be compiled out by setting CAM_DEBUG_COMPILE option to 0.	2012-06-07 10:05:51 +00:00
mav	385ccd1fec	Remove declaration of scsi_interpret_sense(), removed 11 years ago.	2012-06-06 17:28:46 +00:00
ken	29d8fd1bc1	Fix a memory leak in the kernel case in scsi_command_string(). Submitted by: Kashyap Desai <Kashyap.Desai@lsi.com> MFC after: 3 days	2012-06-06 17:04:56 +00:00
mav	ee6412e858	ATA/SATA controllers have no idea about protocol of the connected device until transport will do some probe actions (at least soft reset). Make ATA/SATA SIMs to not report bogus and confusing PROTO_ATA protocol. Make ATA/SATA transport to fill that gap by reporting protocol to SIM with XPT_SET_TRAN_SETTINGS and patching XPT_GET_TRAN_SETTINGS results if needed.	2012-06-06 06:52:51 +00:00
mav	a2502ee134	Use default error handler when flushing disk caches.	2012-06-05 20:39:12 +00:00
mav	28f741b193	Tune and add some more CAM_DEBUG() points for the probe sequences.	2012-06-05 11:48:32 +00:00
mav	4544de7ee7	Replace #ifdef CAMDEBUG + if + panic() with single KASSERT().	2012-06-05 10:23:41 +00:00
mav	c0ce8800a0	Do not reinvent a wheel and let default error handler do its job.	2012-06-05 10:08:22 +00:00
mav	bc7a4bd8f7	Tune and add some missing CAM_DEBUG() points for better consistency.	2012-06-05 09:45:42 +00:00
mav	e5ff9bbfd5	Remove some dead code that I doubt will ever be implemented.	2012-06-04 09:47:19 +00:00
mav	8461de2f4c	Rewrite enabling NCQ for SATA devices in a way more alike to SCSI TCQ. This allows to control it with `camcontrol negotiate adaX -T (en\|dis)able` on the fly, same as for SCSI devices. Sponsored by: iXsystems, Inc.	2012-06-02 08:29:07 +00:00
mjacob	f2720055c7	Print FC PortID as a hex number. This makes it easy to figure out domain, etc.. Zero ATIO and INOTify allocations. It makes for much less guesswork when looking at the structure and seeing 'deadc0de' present. Reviewed by: kdm MFC after: 2 weeks Sponsored by: Spectralogic	2012-06-01 23:24:56 +00:00
mav	8e77e366f6	Use AC_GETDEV_CHANGED async to notify ada driver about DMA and NCQ status change. Now that allows switching between PIO and DMA modes on the fly.	2012-06-01 09:32:37 +00:00
eadler	3b17c63ef4	Add support for newer garmin devices PR: kern/163932 Submitted by: Bartosz Fabianowski <freebsd@chillt.de> Approved by: cperciva MFC after: 1 week	2012-05-30 03:50:58 +00:00
mav	1805c47af3	Allow to change number of openings (used tags) for ATA/SATA devices via `camcontrol tags ... -N ...`. There is no need to tune it in usual cases, but some users want to have it for debugging purposes. MFC after: 2 weeks	2012-05-29 14:28:46 +00:00
mav	ff66e936dd	Plug request and references leak caused by race between invalidated ond probe periph destruction and new incoming probe request. This at least caused problems with SATA Port Multipliers hot-plug. MFC after: 2 weeks	2012-05-29 11:22:40 +00:00

1 2 3 4 5 ...

1175 Commits