freebsd-dev

Author	SHA1	Message	Date
Matt Jacob	e277218592	Frequency default should be '25' for 25MHz, not 25000000. Through the PITA of endiannness, clock has to be MHz freq << 8. Don't trust NVRAM on SBus cards. Set a default initiator ID sensibly. SBus/ISP now working, what with the change to sbus.c earlier today.	2002-07-25 20:49:30 +00:00
Matt Jacob	43722a425a	Don't test against default_iid being zero as a test for whether we set something- iid 0 is valid.	2002-07-25 20:47:40 +00:00
Matt Jacob	72429e49f5	Make sure that if are in fact using 'full SMP', make the interrupt flags include INTR_MPSAFE. Put the flags in a common place so that both isp_sbus && isp_pci DTRT. In isp_mbxdma setup, drop any locks prior to calling things like bus_dmatag_create. This gets rid of these obnoxious WITNESS messages about 'sleeping with locks held' blah blah blah blah blah.	2002-07-25 16:02:09 +00:00
Matt Jacob	4eb494274f	Put MODULE_VERSION back here so that ispfw is happy.	2002-07-25 16:00:24 +00:00
Matt Jacob	dbf6d71cba	Remove a couple of debugging lines.	2002-07-11 03:27:30 +00:00
Matt Jacob	73030e03ce	'Support' for ISP SBus cards. This code does not imply that SBus cards work yet. They hang for me. But I can't netboot the latest snapshot on my ultra1e, and things hang at bus_setup_intr time. Since I'm offline for a while, I thought I'd toss this in in case somebody else who has a bit better luck wants to fart around with it. Please try and wait until I get back to check things in.	2002-07-11 03:25:04 +00:00
Matt Jacob	f1df0f59e9	Add 2002 to copyright. Oops; I forgot for previous delta... If we're and FC or ULTRA2 or better card, we can have a 1024 element request queue instead of 256. MFC after: 1 week	2002-07-08 17:48:39 +00:00
Matt Jacob	fdeb9f2f66	Add get/set param ioctl support. Remove sim queue freezes for resource shortages. I've had too many strange race conditions where I freeze on a resource shortage but never get unfrozen. Consolidate the remaining sim queue freeze condition (for loopdown) into an inline with debug messages that allows us to track problems at ISP_LOGDEBUG0 level easier. Change a bunch of debug messages about loop down/up conditions to ISP_LOGDEBUG0 level. Remove dead isp_relsim code. Change some internal flag stuff for efficiency. Complain vociferously if we try and use our FC scratch area while it's busy being used already (I mean, if we don't have solaris' ability to sleep as an interrupt thread which would allow us to just use a p/v semaphore, at least say when you've just borked yourself). Add infrastructure to allow overrides of hard loopid && initiator id from boot variables. Fix the usual quota of silly bugs: + 'ktmature' needs to be per-instance. Argh. + When entering isp_watchdog, set intsok to zero, preserving old value to restore later. It's not nice to try and sleep from splsoftclock. + Fix tick overflow buglet in checking timeout value. MFC after: 1 week	2002-07-08 17:42:47 +00:00
Matt Jacob	f00939f92f	Add get/set param ioctls. MFC after: 1 week	2002-07-08 17:34:56 +00:00
Matt Jacob	ed753e824b	Add override so that we can force set our hard loopdid. MFC after: 1 week	2002-07-08 17:34:32 +00:00
Matt Jacob	af2d254da9	Remove the 'bogus registrant' hack for fabric searches. It really turns out that there's something of a hole in our new fabric name server stuff. We ask the name server for entities that have registered as a specific type. That type is FC-SCSI. If the entity hasn't performed a REGISTER FC4 TYPES, the fabric nameserver won't return it. This brings this driver to a bit of a fork in the road as to what the right thing to do is. For servicing the needs of accessing FC-SCSI devices, this method is fine, and to be preferred. It is extremely unlikely we're interested in fabric devices that don't register correctly. If I ever get around to adding an FC-IP stack, then asking for devices that have registers as FC-IP types is also the right thing to do. So- asking the fabric nameserver for a specific type is fine, as long as you are only interested in specific types. If, on the other hand, you want to create (as for management tool support) a picture of everything on the fabric, this is not so fine. There are a large class of FC-SCSI initiators who don't correctly register, so we never will see them. Is this a problem? Yes, but only a little one. If we want to do such management tool support, we should probably run a different fabric nameserver query algorithm. Better yet, we should talk to the management nameserver in Brocade switches instead of the standard FC-GS-2 fabric nameserver (which can be unwieldy). Other changes: if we've overrrides marked, don't set some default values from reading NVRAM. This allows us to override things like EXEC throttle without having to ignore NVRAM entirely. MFC after: 1 week	2002-07-08 17:33:37 +00:00
Matt Jacob	52154faa5f	If the HBA is already 'touched', still set maxluns. Othewise for CAM_QUIRK_HILUN devices we loop thru 32bits of lun. Oops. Switch to using USEC_DELAY rather than USEC_SLEEP at isp_reset time. Try to paper around a defect in clients that don't correctly registers themeselves with the fabric nameserver. Minor updates for Mirapoint support- they still use code that is not HANDLE_LOOPSTATE_IN_OUTER_LAYERS, and, surprise surprise, this old stuff had some bugs in it. Clean up some target mode stuff. MFC after: 1 week	2002-06-16 05:18:22 +00:00
Matt Jacob	570c7a3f78	Add support for ISP_FC_GETHINFO, which returns current connection topology, speed, loopid, WWPN/WWNN, etc. Beef up target mode. Add isp_handle_platform_notify_scsi and isp_handle_platform_notify_fc platform handlers to handle immediate notifies (isp_handle_platform_notify_scsi is still stubbed out). In implementation of isp_handle_platform_notify_fc, for IN_ABORT_TASK, peel off a pending XPT_IMMED_NOTIFY and call xpt_done on it and hope that somebody upstream is listening. Make sure on final CTIO2s that we set residual correctly. These are absolutely crucial. Make sure we set relative offset for each CTIO2 based upon bytes we've already xferred. This is what the private adjunct datat to the original ATIO is. Note state of command so we can figure out where to find it if we get an ABORT from the firmware. Make sure we always set CAM_TAG_ACTION_VALID for ATIO2s. Make sure we keep track of the original lun. If se sent status (or we're otherwise done with the command), don't forget to free the adjunct structure.	2002-06-16 05:08:02 +00:00
Matt Jacob	759981f464	Extend private adjunct to ATIO to have both tag lun, and extended state (so we can, when things get lost, find out who currently is processing on behalf of this open exchange. Invariably, when things are lost and wedged, it's CAM). Keep an atio resource counter locally. MFC after: 1 week	2002-06-16 05:02:25 +00:00
Matt Jacob	c49c3023c7	Force commit (last CVS comment was wrong). Go back to not fully evaluating loop/fabric state if our role is ISP_ROLE_NONE.	2002-06-16 05:00:20 +00:00
Matt Jacob	81ac553609	Add ISP_FC_GETHINFO ioctl. MFC after: 1 week	2002-06-16 04:59:30 +00:00
Matt Jacob	fc08717104	Set all 23XX cards as 'touched' (we have trouble, unpredictably, about running ABOUT FIRMWARE with some that were started by BIOS downloads). Redo CTIO2 dma mapping- use continuation segments instead of multiple CTIO2s. Thanks to Veritas for sponsoring this work (in a different context). MFC after: 1 week	2002-06-16 04:58:00 +00:00
Matt Jacob	e63442b6c1	Change isp_target_async to a function returning an integer. Roll most immediate notifies into something the platform has to handle.	2002-06-16 04:56:07 +00:00
Matt Jacob	9dba6a4ecb	Set default command count to 0xfe. This tells the f/w essentially to not do flow control based upon resource counts for the firmware. Increase default immediate notify count to 16. Change isp_target_async to a function returning an integer.	2002-06-16 04:54:46 +00:00
Matt Jacob	0322f8f8f7	Add MBOX_DRIVER_HEARTBEAT/MBOX_FW_HEARTBEAT/FC4_FC_SVC defines. MFC after: 1 week	2002-06-16 04:53:26 +00:00
Matt Jacob	0499ae008f	Roll minor version. Add ISPASYNC_FW_RESTARTED async event. Add DEFAULT_FRAMESIZE && DEFAULT_EXEC_THROTTLE references. MFC after: 1 week	2002-06-16 04:52:53 +00:00
Matt Jacob	f77e6d9569	If we get a DATA UNDERRUN error from QLogic FC cards, but the RQCS_RU bit is not set in the scsi completion status, or if the residual is clearly nonsense, then this was a command that suffered the loss of one or more FC frames in the middle of the exchange. Set HBA_BOTCH and hope it will get retried. It's the only thing we can do. MFC after: 1 day	2002-05-01 21:58:36 +00:00
Mike Barcroft	a30d4b3270	Move the new byte order function prototypes from <sys/param.h> to <sys/endian.h>. This puts us in line with NetBSD and OpenBSD.	2002-04-26 22:48:23 +00:00
Matt Jacob	4a999c65de	Scale back # of luns supported for SCC to 16384- oops- top 3 bits are a lun address modifier of sorts. Only an HP XP-512 seems to have cared. Fix a few misplaced pointers for the new fabric goop, which has been demonstrated to work on newer Brocades and McData switches now. Put in commented out code which would run GFF_ID if the QLogic f/w allowed it. Don't whine about not being able to find a handle for a command if it was a command aborted (by us).	2002-04-16 19:55:35 +00:00
Matt Jacob	f35c4ba63a	Send 32 bytes out for fc4_types... Interestingly enough the Solaris/Sparc version worked fine, but Linux/Sparc && FreeBSD/Sparc choked. MFC after: 1 week	2002-04-05 01:40:05 +00:00
Matt Jacob	029f13c671	Fix bus dma segment count to be based off of MAXPHYS, not BUS_SPACE_MAXSIZE. Grumble. I've seen better documented architectures out of Redmond. Redo fabric evaluation to not use GET ALL NEXT (GA_NXT). Switches seem to be trying to wriggle out of supporting this well. Instead, use GID_FT to get a list of Port IDs and then use GPN_ID/GNN_ID to find the port and node wwn. This should make working on fabrics a bit cleaner and more stable. This also caused some cleanup of SNS subcommand canonicalization so that we can actually check for FS_ACC and FS_RJT, and if we get an FS_RJT, print out the reason and explanation codes. We'll keep the old GA_NXT method around if people want to uncomment a controlling definition in ispvar.h. This also had us clean up ISPASYNC_FABRICDEV to use a local lportdb argument and to have the caller explicitly say that a device is at the end of the fabric list. MFC after: 1 week	2002-04-04 23:46:01 +00:00
John Baldwin	6008862bc2	Change callers of mtx_init() to pass in an appropriate lock type name. In most cases NULL is passed, but in some cases such as network driver locks (which use the MTX_NETWORK_LOCK macro) and UMA zone locks, a name is used. Tested on: i386, alpha, sparc64	2002-04-04 21:03:38 +00:00
Matt Jacob	1923f73990	Redo stuff for sparc64- primarily fix bus dma implementation. The endian stuff was right, but the busdma stuff was massively not right. Didn't really test on ia64 or i386- don't have the former h/w and my FreeBSD-current disk is unwell right now. Hope that this is okay. MFC after: 1 week	2002-04-02 23:36:14 +00:00
Matt Jacob	371777b161	Limit fabric search to a default 256 entries. This will all go away soon because it's just getting harder and harder to find switches that correctly implement the GET ALL NEXT subcommands for the SNS protocol. Latch up result out pointer and set a busy flag when we're looking at the response queue. This allows for a cleaner way to make sure we don't get multiple CPUs trying to read the same response queue entries. Change how isp_handle_other_response returns values (clarity). Make PORT UNAVAILABLE the same as PORT LOGOUT (force a LIP). Do some formatting changes. MFC after: 0 days	2002-03-21 21:10:16 +00:00
Alfred Perlstein	e51a25f850	Remove __P.	2002-03-20 02:08:01 +00:00
Matt Jacob	70e9673917	Disable RIO (reduced interrupt operation) for 2200 boards- it seemed like it worked- but I ran into a case with a 2204 where commands were being lost right and left. Best be safe. For target mode, or things called if we call isp_handle_other response- note that we might have dropped locks by changing the output pointer so we bail from the loop. It's the responsibility of the entity dropping the lock to make sure that we let the f/w know we've read thus far into the response queue (else we begin processing the same entries again- blech!). MFC after: 1 day	2002-03-07 17:32:45 +00:00
Matt Jacob	f553351ed2	Reorder some of the ioctls and add a few new ones. MFC after: 1 day	2002-02-21 23:30:05 +00:00
Matt Jacob	014e78d18c	Fix a problem where a local loop disk logs out- and we get a PORT LOGGED OUT status. We are, apparently, required to force the f/w to log back in if we want to try and talk to that disk again. This means either issuing a LOGIN LOCAL LOOP PORT mailbox command, or by issuing a LIP. I've elected to issue a LIP because this has a better chance of waking up the disk which clearly just crashed and burned. These should not occur at all. If they do, they should be darned rare. MFC after: 1 week	2002-02-21 01:56:08 +00:00
Matt Jacob	d134aa0b20	More for f/w crash dumps (bug fixing and adding ioctl entry points and hints to enable for specific units) MFC after: 1 week	2002-02-18 00:00:34 +00:00
Matt Jacob	b894188248	Support for f/w crash dumps (2200 && 23XX). If you want QLogic to look at a potential f/w problem for FC cards, you really have to provide them info in the format they expect. This involves dumping a lot of hardware registers (> 300 16 bit registers) and a lot of SRAM (> 128KB minimum). Thus all of this code is #ifdef protected which will become an option so that the memory allocation of where to dump the crash image is pretty expensive. It's worth it if you have a reproducible problem because they have some tools that can tell them, given the f/w version, the precise state of everything. MFC after: 1 week	2002-02-17 06:38:22 +00:00
Matt Jacob	3f02619fb8	Hints for WWN are now WWNN and/or WWPN. MFC after: 1 week	2002-02-17 06:34:21 +00:00
Matt Jacob	01ff579d86	Add in support firmware crash dumps. Change CFG options to split WWN into WWNN and WWPN. MFC after: 1 week	2002-02-17 06:32:58 +00:00
Matt Jacob	75c1e828c0	+ A variety of 23XX changes: disable MWI on 2300 based on function code, set an 'isp_port' for the 2312- it's a separate instance, but the NVRAM is shared, and the second port's NVRAM is at offset 256. + Enable RIO operation for LVD SCSI cards. This makes a big difference as even under reasonable load we get batched completions of about 30 commands at a time on, say, an ISP1080. + Do 'continuation' mailbox commands- this allows us to specify a work area within the softc and 'continue' repeated mailbox commands. This is more or less on an ad hoc basis and is currently only used for firmware loading (which f/w now loads substantially faster becuase the calling thread is only woken when all the f/w words are loaded- not for each one of the 40000 f/w words that gets loaded). + If we're about to return from isp_intr with a 'bogus interrupt' indication, and we're not a 23XX card, check to see whether the semaphore register is currently 2 (not 1 as it should be) and whether there's an async completion sitting in outgoing mailbox0. This seems to capture cases of lost fast posting and RIO interrupts that the 12160 && 1080 have been known to pump out under extreme load (extreme, as in > 250 active commands). + FC_SCRATCH_ACQUIRE/FC_SCRATCH_RELEASE macros. + Endian correct swizzle/unswizzle of an ATIO2 that has a WWPN in it. MFC after: 1 week	2002-02-04 21:04:25 +00:00
Matt Jacob	975284da32	Add missing move of relative offset for CTIO2 updates.	2002-01-11 23:48:25 +00:00
Matt Jacob	2903b27203	Implement REDUCED INTERRUPT OPERATION usage form FC cards- this allows the firmware to delay completion of commands so that it can attempt to batch a bunch of completions at once- either returning 16 bit handles in mailbox registers, or in a resposne queue entry that has a whole wad of 16 bit handles. Distinguish between 2300 and 2312 chipsets- if only because the revisions on the chips have different meanings. Add more instrumentation plus ISP_GET_STATS and ISP_CLR_STATS ioctls. Run up the maximum number of response queue entities we'll look at per interrupt. If we haven't set HBA role yet, always return success from isp_fc_runstate. MFC after: 2 weeks	2002-01-03 20:43:22 +00:00
Matt Jacob	c748b5e634	Explicitly decode GetAllNext SNS Response back as a GetAllNext response. Otherwise, we won't unswizzle it correctly. This was found on linux/PPC. This mandated creating another inline: isp_get_gan_response.	2001-12-11 21:58:04 +00:00
Matt Jacob	4fd13c1ba2	Major restructuring for swizzling to the request queue and unswizzling from the response queue. Instead of the ad hoc ISP_SWIZZLE_REQUEST, we now have a complete set of inline functions in isp_inline.h. Each platform is responsible for providing just one of a set of ISP_IOX_{GET,PUT}{8,16,32} macros. The reason this needs to be done is that we need to have a single set of functions that will work correctly on multiple architectures for both little and big endian machines. It also needs to work correctly in the case that we have the request or response queues in memory that has to be treated specially (e.g., have ddi_dma_sync called on it for Solaris after we update it or before we read from it). It also has to handle the SBus cards (for platforms that have them) which, while on a Big Endian machine, do not require most of the request/response queue entry fields to be swizzled or unswizzled. One thing that falls out of this is that we no longer build requests in the request queue itself. Instead, we build the request locally (e.g., on the stack) and then as part of the swizzling operation, copy it to the request queue entry we've allocated. I thought long and hard about whether this was too expensive a change to make as it in a lot of cases requires an extra copy. On balance, the flexbility is worth it. With any luck, the entry that we build locally stays in a processor writeback cache (after all, it's only 64 bytes) so that the cost of actually flushing it to the memory area that is the shared queue with the PCI device is not all that expensive. We may examine this again and try to get clever in the future to try and avoid copies. Another change that falls out of this is that MEMORYBARRIER should be taken a lot more seriously. The macro ISP_ADD_REQUEST does a MEMORYBARRIER on the entry being added. But there had been many other places this had been missing. It's now very important that it be done. Additional changes: Fix a longstanding buglet of sorts. When we get an entry via isp_getrqentry, the iptr value that gets returned is the value we intend to eventually plug into the ISP registers as the entry one past the last one we've written- not the current entry we're updating. All along we've been calling sync functions on the wrong index value. Argh. The 'fix' here is to rename all 'iptr' variables as 'nxti' to remember that this is the 'next' pointer- not the current pointer. Devote a single bit to mboxbsy- and set aside bits for output mbox registers that we need to pick up- we can have at least one command which does not have any defined output registers (MBOX_EXECUTE_FIRMWARE). MFC after: 2 weeks	2001-12-11 00:18:45 +00:00
Matt Jacob	fc16d270b7	Tra-La, another QLogic f/w funny- this time with the 2300. If we get a completion status of RQCS_QUEUE_FULL, it means that the internal queues are full. Other QLogic boards set the QFULL SCSI status. But nooooooooooo, not the 2300. MFC after: 1 day	2001-10-23 23:05:20 +00:00
Matt Jacob	8b8e73049d	Protect against deranged fabric nameservers that spit out 10000 identical port numbers. MFC after: 1 day	2001-10-18 17:26:52 +00:00
Matt Jacob	1c3749836f	Add some somewhat vague documentation for this driver and a list of Hardware that might, in fact, work.	2001-10-07 18:26:47 +00:00
Matt Jacob	71793c0dc4	Some patches from Doug for ia64 support- the principle one being the appropriate cache flush that provides MEMORY_BARRIER in between handoffs between host && RISC processor for the shared memory request/response queues. Submitted by: dfr@nlsystems.com	2001-10-07 18:18:50 +00:00
Matt Jacob	cd37f56f5a	Misunderstanding documentation caused me to try and set 1Gbps/2Gps/Auto connection speed for the 2300 in the wrong offset in the ICB. Oops. Respect some QLogic errat wrt PCI errors on certain shared host/RISC registers.	2001-10-06 20:41:18 +00:00
Matt Jacob	3bd4033010	Whups- remember to zero the isr pointer arg.	2001-10-06 19:34:43 +00:00
Matt Jacob	db4fa023f8	Respect QLogic's errata- read BIU_ISR even on the 2300 to see if there's an interrupt (avoids PCI parity errors which can occur on the 2312 if you access some registers from the host at the same time the RISC on the 2312 is C accessing them). MFC after: 1 day	2001-10-06 19:19:24 +00:00
Matt Jacob	53036e9289	Begin to implement target mode that for Fibre Channel has a private per-command component that we don't try and pass thru CAM. CAM just is too risky and too much of a pain- structures get copied, but not all info of interest can be considered safely transported thru all consumers (including user space) from the incoming ATIO to the outgoing CTIO- it's just much safer to have a buddy structure, identified by the command's tag which does make it thru safely. Pay attention to link speed and report 200MB/s xfer speed for a 23XX card in 2GPs mode. MFC after: 1 week	2001-10-01 03:48:42 +00:00
Matt Jacob	c507669af4	Implement a call to get the actual link data rate (if 23XX) so we can set whether it's a 2Gps or 1Gps link. MFC after: 1 week	2001-10-01 03:45:54 +00:00
Matt Jacob	83548830a7	When calling isp_reset, set the request/response in/out pointers all at once so there isn't a window with the ones for the 23XX cards being wrong. When being verbose, print out some more FC NVRAM values (like framesize). MFC after: 1 week	2001-09-29 19:37:49 +00:00
Julian Elischer	b40ce4165d	KSE Milestone 2 Note ALL MODULES MUST BE RECOMPILED make the kernel aware that there are smaller units of scheduling than the process. (but only allow one thread per process at this time). This is functionally equivalent to teh previousl -current except that there is a thread associated with each process. Sorry john! (your next MFC will be a doosie!) Reviewed by: peter@freebsd.org, dillon@freebsd.org X-MFC after: ha ha ha ha	2001-09-12 08:38:13 +00:00
Matt Jacob	64edff948b	I don't know what I was thinking- if I have two separate busses on on SIM (as is true for the 1280 and the 12160), then I have to have separate flags && status for both busses. Whap. Implement condition variables for coordination with some target mode events. It's nice to use these and not panic in obscure little places in the kernel like 'propagate_priority' just because we went to sleep holding a mutex, or some other absurd thing. Remove some bogus ISP_UNLOCK calls. Whap. No longer require that somebody do a lun enable on the wildcard device to enable target mode. They are, in fact, orthogonal. A wildcard open is a statement that somebody upstream is willing to accept commands which are otherwise unrouteable. Now, for QLogic regular SCSI target mode, this won't matter for a damn because we'll never see ATIOs for luns we haven't enabled (are listening for, if you will). But for SCCLUN fibre channel SCSI, we get all kinds of ATIOs. We can either reflect them back here with minimal info (which is isp_target.c:isp_endcmd() is for), or the wildcard device (nominally targbh) can handle them. Do further checking against firmware attributes to see whether we can, in fact, support target mode in Fibre Channel. For now, require SCCLUN f/w to supoprt FC target mode. This is an awful lot of change, but target mode still isn't quite right. MFC after: 4 weeks	2001-09-04 21:53:12 +00:00
Matt Jacob	23ac1fce7b	Note for ATIOs returned because of BDRs or Bus Resets for which bus this applies to. Do more bus # foo things. Acknowledge Immediate Notifies right away prior to throwing events upstream (where they're currently being ignored, groan) Capture ASYNC_LIP_F8 as with ASYNC_LIP_OCCURRED. Don't percolate them upstream as if they were BUS RESETS- they're not.	2001-09-04 21:48:02 +00:00
Matt Jacob	b96934e89a	If we're on an interrupt stack, mark things so that we don't try and cv_wait for mailbox commands to complete if we start them from here. Fix residuals for target mode such that we only check the residual and set it in the CTIO if this is the last CTIO (when we're sending status). MFC after: 4 weeks	2001-09-04 21:45:57 +00:00
Matt Jacob	f6a3bcf86c	I don't know what I was thinking- if I have two separate busses on on SIM (as is true for the 1280 and the 12160), then I have to have separate flags && status for both busses. Whap. Implement condition variables for coordination with some target mode events. It's nice to use these and not panic in obscure little places in the kernel like 'propagate_priority' just because we went to sleep holding a mutex, or some other absurd thing. MFC after: 4 weeks	2001-09-04 21:33:06 +00:00
Matt Jacob	2332ac8c61	Fix SET_IID_VAL/SET_BUS_VAL macros to usable. MFC after: 4 weeks	2001-09-04 19:42:13 +00:00
Matt Jacob	d82b6503a9	Because we now store SCCLUN capabilities in firmware attributes, get rid of the silly test of isp_maxluns > 16 and use the attibutes directly. MFC after: 4 weeks	2001-09-03 03:12:10 +00:00
Matt Jacob	181640a81c	Clarify issues about whether we have SCCLUN (65535 luns) or non-SCCLUN (16 luns) firmware for the Fibre Channel cards. We used to assume that if we didn't download firmware, we couldn't know what the firmware capability with respect to SCCLUNs is- and it's important because the lun field changes in the request queue entry based upon which firmware it is. At any rate, we do get back firmware attributes in mailbox register 6 when we do ABOUT FIRMWARE for all 2200/2300 cards- and for 2100 cards with at least 1.17.0 firmware. So- we now assume non-SCCLUN behaviour for 2100 cards with firmware < 1.17.0- and we check the firmware attributes for other cards (loaded firmware or not). This also allows us to get rid of the crappy test of isp_maxluns > 16- we simply can check firmware attributes for SCCLUN behaviour. This required an 'oops' fix to the outgoing mailbox count field for ABOUT FIRMWARE for FC cards. Also- while here, hardwire firmware revisions for loaded code for SBus cards. Apparently the 1.35 or 1.37 f/w we've been loading into isp1000 just doesn't report firmware revisions out to mailbox regs 1, 2 and 3 like everyone else. Grumble. Not that this fix hardly matters for FreeBSD. MFC after: 4 weeks	2001-09-03 03:09:48 +00:00
Matt Jacob	f8597b62e5	Add some more firmware revision macros. Add firmware attributes field to fcparam structure. MFC after: 4 weeks	2001-09-03 03:03:32 +00:00
Matt Jacob	126ec86486	Add 2 Gigabit Fibre Channel support (2300 && 2312 cards). This required some reworking (and consequent cleanup) of the interrupt service code. Also begin to start a cleanup of target mode support that will (eventually) not require more inforamtion routed with the ATIO to come back with the CTIO other than tag. MFC after: 4 weeks	2001-08-31 21:39:04 +00:00
Matt Jacob	ed4bea259e	Clean up some ways in which we set defaults for SCSI cards that do not have valid NVRAM. In particular, we were leaving a retry count set (to retry selection timeouts) when thats not really what we want. Do some constant string additions so that LOGDEBUG0 info is useful across all cards. MFC after: 2 weeks	2001-08-20 17:28:32 +00:00
Matt Jacob	dec1985672	Add MBOX_GET_PCI_PARAMS alias. MFC after: 2 weeks	2001-08-20 17:26:45 +00:00
Matt Jacob	169ad8cfef	oops- typo in a previous commit	2001-08-16 17:39:45 +00:00
Matt Jacob	561e7bb942	Fix a spelling error in a comment.	2001-08-16 17:31:27 +00:00
Matt Jacob	dda035d1fc	Add more MBOX and ASYNC event defines. MFC after: 2 weeks	2001-08-16 17:26:03 +00:00
Matt Jacob	be534d5f1a	Thanks to PHK for spotting: ISPASYNC_UNHANDLED_RESPONSE not handle in isp_async.	2001-08-16 17:25:41 +00:00
Matt Jacob	50719f7521	Enable LIP F8, LIP Reset async events. Be more chatty about SNS failures. Fix typo for skipped phase mesage. Correct MBOX_GET_PORT_QUEUE_PARAMS options in table. MFC after: 2 weeks	2001-08-16 17:25:08 +00:00
Matt Jacob	d51456f800	Oops- don't set 'goal' twice when you mean to set 'nvrm' as well. This breaks bogus NVRAM boards. MFC after: 1 day	2001-08-02 00:34:56 +00:00
Matt Jacob	9ce9bdaf8a	Redo how we manage SCSI device settings- have a 3rd flags (nvram) that records either what's in NVRAM or what the safe defaults would be if we lack NVRAM. Then we rename cur_XXXX to actv_XXXX (these are the currently active settings) and the dev_XXX settings to goal_XXXX (these are the settings which we want cur_XXXX to converge to). This probably isn't entirely final as yet- but it's a lot closer to now being what it should be, including allowing camcontrol to actually set specific settings.	2001-07-30 01:00:21 +00:00
Matt Jacob	d9c272f3ea	Redo how we manage SCSI device settings- have a 3rd flags (nvram) that records either what's in NVRAM or what the safe defaults would be if we lack NVRAM. Then we rename cur_XXXX to actv_XXXX (these are the currently active settings) and the dev_XXX settings to goal_XXXX (these are the settings which we want cur_XXXX to converge to). Roll core minor.	2001-07-30 00:59:32 +00:00
Matt Jacob	df225582bf	Redo how we manage SCSI device settings- have a 3rd flags (nvram) that records either what's in NVRAM or what the safe defaults would be if we lack NVRAM. Then we rename cur_XXXX to actv_XXXX (these are the currently active settings) and the dev_XXX settings to goal_XXXX (these are the settings which we want cur_XXXX to converge to).	2001-07-30 00:59:06 +00:00
Matt Jacob	f44257c29a	Remove ISP_SMPLOCK stuff- we're just using locking now. Correctly reintroduce loop_seen_once semantics- that is, if we've never seen good link, start bouncing commands with CAM_SEL_TIMEOUT. But we have to be careful to have let ourselves try (in isp_kthread) to check for loop up at least once. PR: 28992 MFC after: 1 week	2001-07-25 04:23:52 +00:00
Matt Jacob	3910362ab8	Roll minor version. Remove ISP_SMPLOCK nonsense. We're using full locking, and that's final. MFC after: 1 week	2001-07-25 04:21:53 +00:00
Matt Jacob	761d6b7150	Hmm. Let's try this on for size... We originally had it such that if the connection topology was FL-loop (public loop), we never looked at any local loop addresses. The reason for not doing that was fear or concern that we'd see the same local loop disks reflected from the name server and we'd attach them twice. However, when I recently hooked up a JBOD and a system to an ANCOR SA-8 switch, the disks did not show up on the fabric. So at least the ANCOR is screening those disks from appearing on the fabric. Now, it's possible this is a 'feature' of the ANCOR. When I get a chance, I'll check the Brocade (it's hard to do this on a low budget). In any case, if they do also show up on the fabric, we should simply elect to not log into them because we already have an entry for the local loop. There is relatively unexercised code just for this case. MFC after: 2 weeks	2001-07-11 02:34:21 +00:00
Matt Jacob	8e6a12fcad	Oops- missed a CAMLOCK_2_ISP case.	2001-07-05 19:34:06 +00:00
Matt Jacob	45c9a36af5	Things have become cinched down more tightly about assertions for Giant. This uncovered some missing spots where I trade off between isp's lock and Giant as I enter CAM.	2001-07-05 17:14:57 +00:00
Matt Jacob	ab163f5fee	Add CAM_NEW_TRAN_CODE support. Use correct CAMLOCK_2_ISPLOCK macros. For fibre channel, start going for the gusto and using AC_FOUND_DEVICE and AC_LOST_DEVICE calls to xpt_async when devices appear and disappear as the loop or fabric changes. ISPASYNC_FW_CRASH is the async event code where the platform layer deals with a firmware crash.	2001-07-04 18:54:29 +00:00
Matt Jacob	9927912756	Macroize request/response in/out queue pointer access.	2001-07-04 18:52:23 +00:00
Matt Jacob	559a1ad2c1	Some possibly helpful casts.	2001-07-04 18:51:58 +00:00
Matt Jacob	f0f4c8ae4b	Add a microcomment about how you'd use ispds64_t or ispdlist_t for CTIO3/CTIO4 entries.	2001-07-04 18:51:06 +00:00
Matt Jacob	1ee34f05dd	Add a bunch of additional defines for completion codes. Define some of the RIO (reduced interrupt operation) stuff. Add 64 bit data list (DSD type 1) and arbitrary data list (DSD type 2) data structure defines. Add macros that parameterize usage of the Request/Response in/out queue pointers. When we finish 2300 support, different registers will be accessed for the 2300.	2001-07-04 18:49:00 +00:00
Matt Jacob	b91862efef	Firmware crashes handled in platform specific code (isp_async call). Fix longstanding silly buglet that left a hole in the debug log defines.	2001-07-04 18:46:50 +00:00
Matt Jacob	9b9288ec4a	More 2300 support prep- the Request/Response in/out pointers are part of the PCI block for the 2300- not software convention usage of the mailbox registers- so we macrosize in/out pointer usage. Only report that a LIP destroyed commands if it actually destroyed commands. Get the chan/tgt/lun order correct. Fix a longstanding stupid bug that caused us to try and issue a command with a tag on Channel B because we were checking the tagged capability for the target against Channel A. A firmware crash is now vectored out to platform specific code as an async event. Some minor formatting tweaks.	2001-07-04 18:42:41 +00:00
Peter Wemm	22941bd78f	Fix warnings: 554: passing arg 4 of `resource_string_value' from incompatible pointer type 576: passing arg 4 of `resource_string_value' from incompatible pointer type 593: passing arg 4 of `resource_string_value' from incompatible pointer type	2001-06-15 00:13:18 +00:00
Matt Jacob	cb62bc53d1	We've had problems with data corruption occuring on commands that complete (with no apparent error) after we receive a LIP. This has been observed mostly on Local Loop topologies. To be safe, let's just mark all active commands as dead if we get a LIP and we're on a private or public loop. MFC after: 4 weeks	2001-06-14 17:13:24 +00:00
Matt Jacob	6a23026c6e	Fix botch for state levels. Role minor release. Start adding code for a 'force logout' path. MFC after: 4 weeks	2001-06-05 17:11:06 +00:00
Matt Jacob	5d57194434	Spring MegaChange #1 . ---- Make a device for each ISP- really usable only with devfs and add an ioctl entry point (this can be used to (re)set debug levels, reset the HBA, rescan the fabric, issue lips, etc). ---- Add in a kernel thread for Fibre Channel cards. The purpose of this thread is to be woken up to clean up after Fibre Channel events block things. Basically, any FC event that casts doubt on the location or identify of FC devices blocks the queues. When, and if, we get the PORT DATABASE CHANGED or NAME SERVER DATABASE CHANGED async event, we activate the kthread which will then, in full thread context, re-evaluate the local loop and/or the fabric. When it's satisfied that things are stable, it can then release the blocked queues and let commands flow again. The prior mechanism was a lazy evaluation. That is, the next command to come down the pipe after change events would pay the full price for re-evaluation. And if this was done off of a softcall, it really could hang up the system. These changes brings the FreeBSD port more in line with the Solaris, Linux and NetBSD ports. It also, more importantly, gets us being more proactive about topology changes which could then be reflected upwards to CAM so that the periph driver can be informed sooner rather than later when things arrive or depart. --- Add in the (correct) usage of locking macros- we now have lock transition macros which allow us to transition from holding the CAM lock (Giant) and grabbing the softc lock and vice versa. Switch over to having this HBA do real locking. Some folks claim this won't be a win. They're right. But you have to start somewhere, and this will begin to teach us how to DTRT for HBAs, etc. -- Start putting in prototype 2300 support. Add back in LIP and Loop Reset as async events that each platform will handle. Add in another int_bogus instrumentation point. Do some more substantial target mode cleanups. MFC after: 8 weeks	2001-05-28 21:20:43 +00:00
Matt Jacob	a1bc34c6b8	Redo a lot of the target mode infrastructure to be cognizant of Dual Bus cards like the 1280 && the 12160. Cleanup isp_target_putback_atio. Make sure bus and correct tag ids and firmware handles get propagated as needed.	2001-04-04 21:58:29 +00:00
Matt Jacob	1209134a70	Roll platform minor. Change target mode state definitions to be aware of 'channel' (for the dualbus 1280/12160 cards).	2001-04-04 21:56:15 +00:00
Matt Jacob	e9a2738ad1	Complete some Ansification. Check to make sure, in tdma_mk, that we won't overflow the request queue. The reason we want to do this is that we now push out completed CTIOs as we complete them- this gets the QLogic working on them quicker. So we need to know whether we can put the entire burrito out before we start. We now support conjoint status with data for the last CTIO for both Fibre Channel and SCSI. Leave the old code in place in case we need to go back (minor 3 line ifdef). Ultra-ultra important- don't set rq->req_seg_count for non-data target mode requests in isp_pci_dmasetup. D'oh- this is actually the tag value area for a CTIO. What was I thinking? Boy howdy does both aic7xxx and sym get awfully unhappy when on reconnect you give them a constant '1' for a tag value.	2001-04-04 21:53:59 +00:00
Matt Jacob	b25bcef87a	Perform some more Ansification. Remove and then replace the isp_putback_atio function- we did it a bit cleaner. We only use this if a CTIO completes with !CT_OK state. We now have managed to get away without having to poke around and trying to find the original ATIO- the csio we're using has the tag_id and lun values with it which is mostly what we need when we do the putback. Make sure we correctly propagate AT_TQAE->CT_TQAE for tags. Make sure we call ISP_DMAFREE only if we had DATA to move.	2001-04-04 21:49:51 +00:00
Matt Jacob	b5da7b232e	Amazing. The bits to enable tagged queing in target mode, grok that a tag is active for an ATIO, and say that you want to reconnect with a tag value in a CTIO have never been exercised until now. This lossage derived from Solaris code where this stuff originally came from that is about 7 years old. Amazing. We now bundle the incoming tag (legal values are 0..256) as the low 16 bits of the ccb_accept_tio's at_tagid while we put the firmware handle for this ATIO in the top 16 bits- define some macros to make this cleaner. Complete some Ansification.	2001-04-04 21:46:48 +00:00
Matt Jacob	8055acd82d	Add some target mode definitions and firmware (FC only) attribute definitions.	2001-04-04 21:44:10 +00:00
Matt Jacob	56f7a63a7a	Ansification of source.	2001-04-04 21:43:43 +00:00
Matt Jacob	534bd9fecb	After loading f/w, for FC cards print out Firmware Attributes. Redo establishment of default SCSI parameters whether or not we've been compiled for target mode. Unfortunately, the Qlogic f/w is confused so that if we set all targets to be 'safe' (i.e., narrow/async), it will also then report narrow, async if we're contacted in target mode from that target (acting in initiator role). D'oh! Fix ISPCTL_TOGGLE_TMODE to correctly enable the right channel for dual channel cards. Add some more opcodes. Fix a stupid NULL pointer bug.	2001-04-04 21:42:59 +00:00
John Baldwin	f34fa851e0	Catch up to header include changes: - <sys/mutex.h> now requires <sys/systm.h> - <sys/mutex.h> and <sys/sx.h> now require <sys/lock.h>	2001-03-28 09:17:56 +00:00
Matt Jacob	b72b15696c	For parallel SCSI, let us now do status with the final CTIO. For the 1080, I was hanging after sending a xfer CTIO and a status CTIO for a non-discon INQUIRY- the xfer CTIO was returned as completed OK, but the status CTIO was dropped on the floor. All the fields looked good. I don't know why it got dropped. But allowing status to go back with data xfer seemed to work. I also noticed that with a non-disconnecting command that the firmware handle in the ATIO is zero- this leads me to believe that the f/w really can only handle one CTIO at a time in the discon case, and it had no idea what to do with the second (status) CTIO.	2001-03-21 00:49:37 +00:00
Matt Jacob	290dc24b4d	Check CT2_SENDSTATUS/CT_SENDSTATUS against cto->ct_flags, not CAM_SEND_STATUS. Set a timeout of 2 seconds per CTIO. Make sure that the 'real' tag value is being checked against- not the one that also carries the firmware handle.	2001-03-21 00:46:44 +00:00
Matt Jacob	b0bd9b71f2	Clean up usage- ct_reserved is really ct_syshandle now.	2001-03-14 04:14:58 +00:00
Matt Jacob	f5a4462713	First cut of target mode swizzling for non-little endian machines. It's probably wrong but it's a start.	2001-03-14 04:14:22 +00:00
Matt Jacob	75f7d77928	Mote that how the pad bytes can be divided in half and used by either the target mode code or outer layers. Increase cd_tagval to be 32 bits since it will have to now carry 16 bits of parallel SCSI ATIO handle as well as a normal tag (if any).	2001-03-14 04:13:30 +00:00
Matt Jacob	e2ec5cf0f9	In order to save ourselves grief with the SUNPRO compiler under Solaris (which, for reasons unknown to me, chokes on u_int16_t as a typedef of unsigned short if used in a transitional (mixed K&R and ANSI) way), we'll go the extra mile and fully ANSIfy things.	2001-03-14 04:11:56 +00:00
Matt Jacob	d8d5f2adc5	more 32 to 16 bit handle conversions	2001-03-04 18:42:51 +00:00
Matt Jacob	24d52eb7b5	More 32 to 16 bit handle stuff. Roll core minor version.	2001-03-04 18:42:23 +00:00
Matt Jacob	3bfa867765	Remove a superfluous newline in a string (isp_prt adds this). Fix a missed conversion of 32 to 16 bit handles.	2001-03-04 18:41:23 +00:00
Matt Jacob	5f5aafe1fc	Switch to using 16 bit handles instead of 32 bit handles. This is a pretty invasive change, but there are three good reasons to do this: 1. We'll never have > 16 bits of handle. 2. We can (eventually) enable the RIO (Reduced Interrupt Operation) bits which return multiple completing 16 bit handles in mailbox registers. 3. The !)$)$~)@$~)$ Qlogic target mode for parallel SCSI spec changed such that at_reserved (which was 32 bits) was split into two pieces- and one of which was a 16 bit handle id that functions like the at_rxid for Fibre Channel (a tag for the f/w to correlate CTIOs with a particular command). Since we had to muck with that and this changed the whole handler architecture, we might as well... Propagate new at_handle on through int ct_fwhandle. Follow implications of changing to 16 bit handles. These above changes at least get Qlogic 1040 cards working in target mode again. 1080/12160 cards don't work yet. In isp.c: Prepare for doing all loop management in outer layers.	2001-03-02 06:28:55 +00:00
Matt Jacob	715ec7e9a7	Fix isp_print_qentry to print all four lines- it's been broken for months.	2001-03-02 04:48:41 +00:00
Mark Murray	ed34d0ade2	Turn on interrupt-entropy harvesting for all/any mass storage devices I could find. I have no doubt missed a couple. Interrupt entropy harvesting is still conditional on the kern.random.sys.harvest_interrupt sysctl.	2001-03-01 17:09:09 +00:00
Matt Jacob	6e5c5328c4	Eliminate the use of the getenv_int stuff we'd been using (with a bitmap for selecting unit). Instead, use the resource hints mechanism. One unfortunate situation here is that there is no resource_quad_value function- which is what I needed for WWN boot time replacement. Worse- you can't store the hint as just plain hint.isp.0.nodewwn="0x50000000aaaa0001" because this gets interpreted as an int- incorrectly because it can't be converted to an int. I can't even get this as a string. To work around this particular case for nodewwn && portwwn setting, this rather grotesque form will be used: hint.isp.0.nodewwn="w50000000aaaa0001" hint.isp.0.portwwn="w50000000aaaa0002" At the same time, if we have no hinted WWN, set the default WWN (which, btw, gets overridden if the card has valid NVRAM, which is usual) to 0x400000007F000009ull (which translates to NAA == IPv4, 127.0.0.9). Eliminate more printf's and replace them either with device_printf or isp_prt calls.	2001-03-01 02:21:36 +00:00
Matt Jacob	c9a6d60b09	Go to a default port and default node wwn model. Eliminate isp_name and isp_unit and just store the device_t, fer gosh sakes.... Include sys/bus.h for use by isp_pci.c.	2001-03-01 02:15:58 +00:00
Matt Jacob	3c75bb14be	Finally eliminate as many of the printf calls as possible (still leaving ones where we have a CAM path) and replacing them with calls to isp_prt., Eliminate isp_unit references- we no longer have an isp_unit- we now have an isp_dev that device_get_unit can work with.	2001-03-01 02:14:54 +00:00
Matt Jacob	b0a3ba7e28	Fix at2_entry_t to reflect what the firmware actually writes (instead of just deriving from SCSI at_entry_t). In this case, there is no 'suggested sense' for FC cards.	2001-02-27 00:14:39 +00:00
Matt Jacob	4102f2f6ef	Fix a longstanding bug- we had the sense of what bit 14 for the ICB firmware options meant- I had taken it to mean that if you set it, Node Name would be ignored and derived from Port Name. Actually, it meant the opposite. As a consequence- change ICBOPT_USE_PORTNAME to the define ICBOPT_BOTH_WWNS- makes more sense. Fix wrong input bitmap for MBOX_DUMP_RAM command. Call ISP_DUMPREGS if we get a f/w crash. Add ISPCTL_RUN_MBOXCMD control command (so outer layers can run a mailbox command directly) and add a ISPASYNC_UNHANDLED_RESPONSE hook so outer layers can understand response queue entries we might not know about.	2001-02-23 05:35:50 +00:00
Matt Jacob	410b556714	Eliminate ISP2100_FABRIC- we always allow for fabric now. Add an isp_iid_set/isp_iid for fibre channel- this is because we now fake a port database entry for ourselves. Add the additional loop states between LOOP_PDB_RCVD and LOOP_READY. Change and comment on a wad of Fibre Channel isp_control functions. Change and comment on some of the ISPASYNC Fibre Channel events.	2001-02-11 03:56:48 +00:00
Matt Jacob	b21d3f4ef8	Add structure defining FC-AL position maps. The only tool that I know of that really uses this is luxadm(8) under Solaris.	2001-02-11 03:53:58 +00:00
Matt Jacob	b9b599fe4c	Shuffle around how we do isp_disable management- make sure we return 0 so the unit number doesn't get reused. Make sure that if we've compiled for ISP_TARGET_MODE we set the default role to be ISP_ROLE_INITIATOR\|ISP_ROLE_TARGET. Do some misc other cleanups.	2001-02-11 03:53:23 +00:00
Matt Jacob	6b528b1a1e	Add isp_fc_runstate function- this function's purpose is to, in stages, and depending on role, make sure link is up, scan the fabric (if we're connected to a fabric), scan the local loop (if appropriate), merge the results into the local port database then, check once again to make sure we have f/w at FW_READY state and the the loopstate is LOOP_READY.	2001-02-11 03:52:04 +00:00
Matt Jacob	250bc0aa8b	Roll minor version. Remove ISP2100_FABRIC define (unneeded now). Comment out usage of ISP_SMPLOCK- I have my doubts that this works sanely as yet because CAM itself still needs Giant. I was dropping my lock and grabbing Giant when doing the upcall for completion, but this is all seems ridiculous until CAM is fixed.	2001-02-11 03:48:54 +00:00
Matt Jacob	d6e5500f27	Do some cleanup based upon adapter role- mainly not enabling interrupts if we're ISP_ROLE_NONE. Change ISPASYNC_LOGGED_INOUT to ISPASYNC_PROMENADE. Make sure we note if something is a fabric device. Target mode: Finally fix (to a first approximation) SCSI Target Mode again- we needed to correctly check against CAM_TARGET_WILDCARD and CAM_LUN_WILDCARD so that targbh won't confuse us. Comment out the drainqueue stuff for now. Use isp_fc_runstate instead if isp_control/ISPCTL_FCLINK_TEST.	2001-02-11 03:47:39 +00:00
Matt Jacob	b2b4adaa33	Minor stuff: Remove ISP2100_FABRIC defines- we always handle fabric now. Insert isp_getmap helper function (for getting Loop Position map). Make sure we (for our own benefit) mark req_state_flags with RQSF_GOT_SENSE for Fibre Channel if we got sense data- the !$)!$)~$)$ Qlogic f/w doesn't do so. Add ISPCTL_SCAN_FABRIC, ISPCTL_SCAN_LOOP, ISPCTL_SEND_LIP, and ISPCTL_GET_POSMAP isp_control functions. Correctly send async notifications upstream for changes in the name server, changes in the port database, and f/w crashes. Correctly set topology when we get a ASYNC_PTPMODE event. Major stuff: Quite massively redo how we handle Loop events- we've now added several intermediate states between LOOP_PDB_RCVD and LOOP_READY. This allows us a lot finer control about how we scan fabric, whether we go further than scanning fabric, how we look at the local loop, and whether we merge entries at the level or not. This is the next to last step for moving managing loop state out of the core module entirely (whereupon loop && fabric events will simply freeze the command queue and a thread will run to figure out what's changed and it will re-enable the queu). This fine amount of control also gets us closer to having an external policy engine decide which fabric devices we really want to log into.	2001-02-11 03:44:43 +00:00
Bosko Milekic	9ed346bab0	Change and clean the mutex lock interface. mtx_enter(lock, type) becomes: mtx_lock(lock) for sleep locks (MTX_DEF-initialized locks) mtx_lock_spin(lock) for spin locks (MTX_SPIN-initialized) similarily, for releasing a lock, we now have: mtx_unlock(lock) for MTX_DEF and mtx_unlock_spin(lock) for MTX_SPIN. We change the caller interface for the two different types of locks because the semantics are entirely different for each case, and this makes it explicitly clear and, at the same time, it rids us of the extra `type' argument. The enter->lock and exit->unlock change has been made with the idea that we're "locking data" and not "entering locked code" in mind. Further, remove all additional "flags" previously passed to the lock acquire/release routines with the exception of two: MTX_QUIET and MTX_NOSWITCH The functionality of these flags is preserved and they can be passed to the lock/unlock routines by calling the corresponding wrappers: mtx_{lock, unlock}_flags(lock, flag(s)) and mtx_{lock, unlock}_spin_flags(lock, flag(s)) for MTX_DEF and MTX_SPIN locks, respectively. Re-inline some lock acq/rel code; in the sleep lock case, we only inline the _obtain_lock()s in order to ensure that the inlined code fits into a cache line. In the spin lock case, we inline recursion and actually only perform a function call if we need to spin. This change has been made with the idea that we generally tend to avoid spin locks and that also the spin locks that we do have and are heavily used (i.e. sched_lock) do recurse, and therefore in an effort to reduce function call overhead for some architectures (such as alpha), we inline recursion for this case. Create a new malloc type for the witness code and retire from using the M_DEV type. The new type is called M_WITNESS and is only declared if WITNESS is enabled. Begin cleaning up some machdep/mutex.h code - specifically updated the "optimized" inlined code in alpha/mutex.h and wrote MTX_LOCK_SPIN and MTX_UNLOCK_SPIN asm macros for the i386/mutex.h as we presently need those. Finally, caught up to the interface changes in all sys code. Contributors: jake, jhb, jasone (in no particular order)	2001-02-09 06:11:45 +00:00
Jeroen Ruigrok van der Werven	f09deb6962	Fix typo: wierd -> weird. There is no such thing as wierd in the english language.	2001-02-06 09:25:10 +00:00
Matt Jacob	d69a5f7d9c	Guard against overflow of the calculated timeout value.	2001-01-16 07:15:36 +00:00
Matt Jacob	7afb656f3f	Add was_fabric_dev/fabric_dev tags to our local FC database structure (so we can see rapidly whether something was a fabric device but is now gone). Add a tag which says what role this adapter should take. It can take on the value of None, Target, Initiator or Both. None is useful for warm failover purposes. Remove the ISP_CFG_NOINIT silliness since a role of "None" does this. Add a isp_lastmbxcmd tag to store the opcode for the last mailbox command used.	2001-01-15 18:40:37 +00:00
Matt Jacob	144ff11903	Put in offset definitions for FPM and FBM registers, plus just enough bits defined so we can reset them.	2001-01-15 18:37:14 +00:00
Matt Jacob	df1590c05d	Set default adapter role.	2001-01-15 18:36:39 +00:00
Matt Jacob	fe4a3254ce	Use the isp_lastmbxcmd tag to report timed out mailbox commands. Arrrggghhhh! Very likely fix 22650 by remembering to, ahem, set CAM_AUTOSNS_VALID when one has sense data.	2001-01-15 18:36:09 +00:00
Matt Jacob	70d2cccebd	Do more cleanup of the usage of 0..125 for F-port topologies.	2001-01-15 18:34:49 +00:00
Matt Jacob	6677e7f89e	When resetting the Qlogic 2X00 units, reset the FPM (Fibre Protocol Module) and FBM (Fibre Buffer Modules). Also remember to clear the semaphore registers. Tell the RISC processor to not halt on FPM parity errors. Throw out the ISP_CFG_NOINIT silliness and instead go to the use of adapter 'roles' to see whether one completes initialization or not (mostly for Fibre Channel). The ultimate intent, btw, of all of this is to have a warm standby adapter for failover reasons. Because we do roles now, setting of Target Capable Class 3 service parameters in the ICB for the 2x00 cards reflects from role. Also, in isp_start, if we're not supporting an initiator role, we bounce outgoing commands with a Selection Timeout error. Also clean out the TOGGLE_TMODE goop for FC- there is no toggling of target mode like there is for parallel SCSI cards. Do more cleanup with respect to using target ids 0..125 in F-port topologies. Also keep track of things which were fabric devices so that when you rescan the fabric you can notify the outer layers when fabric devices go away. Only force a LOGOUT for fabric devices if they're still logged in (i.e., you cat their Port Database entry. Clean up the Get All Next scanning. Finally, use a new tag in the softc to store the opcode for the last mailbox command used so we can report which opcode timed out.	2001-01-15 18:33:08 +00:00
Matt Jacob	3fc3cadde6	ISPASYNC_PDB_CHANGED -> ISPASYNC_LOGGED_INOUT.	2001-01-09 02:49:02 +00:00
Matt Jacob	0ecded8a13	Add some SNS "Register FC4 type" subcommand defines. Add some defines that are pertinetnt for state flags on Qlogic 2X00 status completion entries.	2001-01-09 02:48:44 +00:00
Matt Jacob	27d1caa3cd	Up tsleep && poll time for mailbox commands from 2 to 10 seconds. Print out the mailbox command opcode if the command times out.	2001-01-09 02:47:56 +00:00
Matt Jacob	4b9d588e2c	Follow the ISPASYNC_PDB_CHANGED -> ISPASYNC_LOGGED_INOUT change. Also, ISPASYNC_NOTIFY_CHANGE now is for both local loop && fabric changes.	2001-01-09 02:47:15 +00:00
Matt Jacob	0433833d0d	Add a isp_register_fc4_type function so that we work with McData switches that require us to register our FC4 types of interest. Allow ourselves, in F-port topologies, to start logging in fabric devices in the target 0..125 range. Change ISPASYNC_PDB_CHANGED (misnamed) to ISPASYNC_LOGGED_INOUT. Fix (SMACK) again some default WWN stuff. This is really hard to get right across all the range of platforms.	2001-01-09 02:46:23 +00:00
Matt Jacob	3486bfe084	add missing length argument	2001-01-09 02:12:42 +00:00
Matt Jacob	91f1caa2d8	Fix problems with incomplete conversions from printf to isp_prt.	2000-12-31 20:50:56 +00:00
Matt Jacob	56c6d0d775	Change the modification of what could be a const string. Apparently the construct: char *foo; ... foo = "XXX"; ... foo[1] = 'Y'; is wrong. IT blew up on NetBSD-sparc64 because that platform write-protects constant strings.	2000-12-30 20:09:26 +00:00
Matt Jacob	e9423e211e	Add in Bill Sommerfeld's -Wformat stuff. Add a ISP_CFG_NOINIT option to keep from completing initialization when isp_init is called.	2000-12-29 19:17:18 +00:00
Matt Jacob	8ead30564e	Add in Bill Sommerfelds -Wformat changes. Set up default node && port WWNs correctly (Again!) - this time for the case that we're not going to fully init the adapter if isp_init is called (with ISP_CFG_NOINIT set in options). The pupose for this is to bring the adapter up to almost ready to go, get info out of NVRAM, but to not start it up- leaving it until later to actually start things up if wanted (and possibly with different roles selected).	2000-12-29 19:12:44 +00:00
Matt Jacob	f09b192280	Set up to do a local interrupt fielding before calling common code- allows us to grab lock as we should.	2000-12-29 19:10:16 +00:00
Matt Jacob	c40e096ed6	Make sure we do locking if we call isp_intr. Make sure we enter Giant for now if we call into cam for completion.	2000-12-29 19:06:32 +00:00
Matt Jacob	7325560821	add a couple off offset defines for ATIO2s	2000-12-28 23:27:54 +00:00
David Malone	7cc0979fd6	Convert more malloc+bzero to malloc+M_ZERO. Submitted by: josh@zipperup.org Submitted by: Robert Drehmel <robd@gmx.net>	2000-12-08 21:51:06 +00:00
Matt Jacob	4081cc88c9	Only call ISP_UNLOCK/ISP_LOCK if isp->isp_osinfo.intsok in USEC_SLEEP. Add a test against isp->isp_osinfo.islocked prior to trying to see whether --isp->isp_osinfo.islocked is zero to cause us to unlock (non-SMPLOCK case).	2000-12-05 07:41:53 +00:00
Matt Jacob	bfbab17021	Replace some more printfs with isp_prt's. Use isp_prt/ISP_LOGDEBUG0 for rate setting/getting printouts.	2000-12-05 07:39:54 +00:00
Matt Jacob	f7dddf8a54	Remove more printfs and use either isp_prt or device_printf. Remember to set ISP_LOGINFO if bootverbose is set.	2000-12-05 07:38:41 +00:00
David Malone	ea8b5a9ae9	More M_ZERO patches. Submitted by: josh@zipperup.org Submitted by: Robert Drehmel <robd@gmx.net> Approved by: mjacob	2000-12-03 20:46:54 +00:00
Matt Jacob	e5f2f488c5	Add USEC_SLEEP macro support. Change the location at which we define ISP_LOCK/ISP_UNLOCK macros.	2000-12-02 18:33:29 +00:00
Matt Jacob	81babfd043	Make the Not RESPONSE in RESPONSE QUEUE message have a bit more info (specifically, how many entries we've looked at so far). Maintain interrupt instrumentation. Use USEC_SLEEP instead of USEC_DELAY in a number of places (this allows us to drop locks and sleep instead of spin). Track changes to configuration options for topology preference. Fix botched order of printout for Channel, Target, Lun.	2000-12-02 18:08:35 +00:00
Matt Jacob	67afe757a2	Add interrupt instrumentation. Change ISP_CFG_NPORT config option to a set of options that allows specific loop, loop-only, nport, nport-only topology settings. Define a required macro for all platforms (USEC_SLEEP).	2000-12-02 18:06:03 +00:00
Matt Jacob	aa0898c0e0	I'm dropping the MAINTAINER request and see what happens. If it becomes too hard for me to keep in sync with other platforms, FreeBSD will go it's own way.	2000-10-31 05:55:54 +00:00
Matt Jacob	650789cb1b	Get rid of ridiculous ISP_PVS macro. Instead, just set an ISP_SMPLOCK define based on the previous 5.4 major/minor release define of PVS- because this allows us to turn it off easier.	2000-10-25 04:42:46 +00:00
Matt Jacob	3395b0568a	Whoops! Forgot to commit this when I committed the other (turnin on locks) change. Sorry about that.	2000-10-25 04:40:49 +00:00
John Baldwin	35e0e5b311	Catch up to moving headers: - machine/ipl.h -> sys/ipl.h - machine/mutex.h -> sys/mutex.h	2000-10-20 07:58:15 +00:00
Matt Jacob	e92fbe47e2	Roll minor revision- for once we'll use this because.... if revision >= 5.4, compile time will build in mutex locks, otherwise the old locking (splcam/splx with a recursion counter) will be compiled in. We still depend on config_intr_hook to tell us when it's okay to call msleep instead of polling. It'd be real nice if we could do this early enough to not hang up a machine struggling with a bad Fibre Channel loop, but that's still to come.	2000-10-17 18:18:14 +00:00
Matt Jacob	39f3fc6f69	remove "SERVICING_INTERRUPT" nonsense	2000-10-17 18:15:30 +00:00
Poul-Henning Kamp	db7e3af111	Remove unneeded #include <machine/clock.h>	2000-10-15 14:19:01 +00:00
Matt Jacob	e5d4e19714	Make changes required by change in how default and usable node and port WWNS are made and used.	2000-10-12 23:59:40 +00:00
Matt Jacob	c914d4237d	Redo how default Node and Port WWNs are determined (again!). This is so we don't stomp on the differences between ports for a Qlogic 2202.	2000-10-12 23:49:09 +00:00
Matt Jacob	9cf43b9716	Change some default macro usages/definitions/requirements.	2000-10-12 23:47:03 +00:00
Matt Jacob	aa57fd6fa5	some copyright cleanups	2000-09-21 20:16:04 +00:00
Matt Jacob	c0cfc79790	Inintialize the queue index stuff from what the f/w sends back- just in case it's insane enough to not do what you tell it to. Print out (LOGINFO level) initiator ID.	2000-09-21 17:06:45 +00:00
Matt Jacob	e11a1ee870	Per msmith's request, don't attach to Qlogic 12160 id'd cards that have a certain SubVendorID.	2000-09-07 20:27:40 +00:00
Doug Rabson	21c3015a24	* Completely rewrite the alpha busspace to hide the implementation from the drivers. * Remove legacy inx/outx support from chipset and replace with macros which call busspace. * Rework pci config accesses to route through the pcib device instead of calling a MD function directly. With these changes it is possible to cleanly support machines which have more than one independantly numbered PCI busses. As a bonus, the new busspace implementation should be measurably faster than the old one.	2000-08-28 21:48:13 +00:00
Matt Jacob	84267b9edd	remove clause 3 licence	2000-08-27 23:39:23 +00:00
Matt Jacob	b6b6ad2f23	various fixes	2000-08-27 23:38:44 +00:00
Matt Jacob	3ea883b46d	Add a comment as to where stdarg.h applies.	2000-08-03 03:05:50 +00:00
John Baldwin	7d615c1d8b	Use <machine/stdarg.h> instead of <stdarg.h> so that this will compile. While I'm at it, move the #include line up to the top of the file.	2000-08-03 02:47:06 +00:00
Matt Jacob	c7d5594134	Add in macros && masks so that mailbox command errors can be selectively printed/supressed in isp_mboxcmd.	2000-08-01 06:55:08 +00:00
Matt Jacob	d0d5832ac7	Major whacking for core version 2.0. A major motivator for 2.0 and these changes is that there's now a Solaris port of this driver, so some things in the core version had to change (not much, but some). In order, from the top.....: A lot of error strings are gathered in one place at the head of the file. This caused me to rewrite them to look consistent (with respect to things like 'Port 0x%' and 'Target %d' and 'Loop ID 0x%x'. The major mailbox function, isp_mboxcmd, now takes a third argument, which is a mask that selectively says whether mailbox command failures will be logged. This will substantially reduce a lot of spurious noise from the driver. At the first run through isp_reset we used to try and get the current running firmware's revision by issuing a mailbox command. This would invariably fail on alpha's with anything but a Qlogic 1040 since SRM doesn't start the f/w on these cards. Instead, we now see whether we're sitting ROM state before trying to get a running BIOS loaded f/w version. All CFGPRINTF/PRINTF/IDPRINTF macros have been replaced with calls to isp_prt. There are seperate print levels that can be independently set (see ispvar.h), which include debugging, etc. All SYS_DELAY macros are now USEC_DELAY macros. RQUEST_QUEUE_LEN and RESULT_QUEUE_LEN now take ispsoftc as a parameter- the Fibre Channel cards and the Ultra2/Ultra3 cards can have 16 bit request queue entry indices, so we can make a 1024 entry index for them instead of the 256 entries we've had until now. A major change it to fix isp_fclink_test to actually only wait the delay of time specified in the microsecond argument being passed. The problem has always been that a call to isp_mboxcmd to get he current firmware state takes an unknown (sometimes long) amount of time- this is if the firmware is busy doing PLOGIs while we ask it what's up. So, up until now, the usdelay argument has been a joke. The net effect has been that if you boot without being plugged into a good loop or into a switch, you hang. Massively annonying, and hard to fix because the actual time delta was impossible to know from just guessing. Now, using the new GET_NANOTIME macros, a precise and measured amount of USEC_DELAY calls are done so that only the specified usecdelay is allowed to pass. This means that if the initial startup of the firmware if followed by a call from isp_freebsd.c:isp_attach to isp_control(isp, ISP_FCLINK_TEST, &tdelay) where tdelay is 2 * 1000000, no more than two seconds will actually elapse before we leave concluding that the cable is unhooked. Jeez. About time.... Change the ispscsicmd entry point to isp_start, and the XS_CMD_DONE macro to a call to the platform supplied isp_done (sane naming). Limit our size of request queue completions we'll look at at interrupt time. Since we've increased the size of the Request Queue (and the size of the Response Queue proportionally), let's not create an interrupt stack overflow by having to keep a max completion list (forw links are not an option because this is common code with some platforms that don't have link space in their XS_T structures). A limit of 32 is not unreasonable- I doubt there'd be even this many request queue completions at a time- remember, most boards now use fast posting for normal command completion instead of filling out response queue entries. In the isp_mboxcmd cleanup, also create an array of command names so that "ABOUT FIRMWARE" can be printed instead of "CMD #8". Remove the isp_lostcmd function- it's been deprecated for a while. Remove isp_dumpregs- the ISP_DUMPREGS goes to the specific bus register dump fucntion. Various other cleanups.	2000-08-01 06:51:05 +00:00
Matt Jacob	b09b009594	Core version 2.0 rewrite. In this file we replace isp_tdebug with isp_prt calls. We now use an argument to the ISPCTL_FCLINK_TEST call. We change all IDPRINTF macros to isp_prt calls. We add the isp_prt function here.	2000-08-01 06:31:44 +00:00
Matt Jacob	18ccaecd45	Core version 2.0 cleanup/rewrite. Things get rearranged and changed quite a bit so that all of the ports have a similar set of required macros/definitions (and in similar places in the isp_<platform>.h file). Some new macros/functions added- Mailbox Acquire/Relase macros, NANOTIME macros, SNPRINTf and STRNCAT. MemoryBarrier beomes MEMORYBARRIER with much stronger types.	2000-08-01 06:29:55 +00:00
Matt Jacob	16dd34376c	Remove isp_prtstst (now in case statement in isp.c). Remove isp2100_fw_statename as an INLINE (now a function in isp.c). Remove isp2100_pdb_statename (unused). Redo all ISP_SCSI_XFER_T as XS_T types. Change all RQUEST_QUEUE_LEN/RESULT_QUEUE_LEN macros to take a parameter. Add isp_print_bytes function.	2000-08-01 06:26:04 +00:00
Matt Jacob	10549c059a	Remove isp_tdebug. Change all PRINTF macros to the now common isp_prt logging function.	2000-08-01 06:24:01 +00:00
Matt Jacob	69fbe07a2e	Fix typo. Remove isp_tdebug (we'll use ISP_LOGTDEBUG2 in isp->isp_dblev as a selector now). Change DFLT_CMD_CNT to a fixed amount for now.	2000-08-01 06:23:24 +00:00
Matt Jacob	a6db0ba6d3	Add in lengths of SBus or PCI registers.	2000-08-01 06:21:21 +00:00
Matt Jacob	53cff3bb65	Rewrite for version 2.0. Some structural changes, but also a substantial amount of commenting about what each platform specific definitions are supposed to be.	2000-08-01 06:10:21 +00:00
Matt Jacob	d02373f1a0	Part of major rewrite for core version 2.0- clarification of mdvec structure, removal of printf/CFGPRINTF in place of isp_prt calls. Parameterization of RQUEST_QUEUE_LEN/RESULT_QUEUE_LEN.	2000-08-01 05:16:49 +00:00
Matt Jacob	482cf5c2e7	Add in some new IN_XXX and CT_XXXX flags in preparation for the rototilling that !$)~@!$_@_(~@$_(~@$~@$* Qlogic F/W changes will need.	2000-07-18 07:06:47 +00:00
Matt Jacob	d37162ca7a	If debugging set, zero out an incoming response entry when we're done reading it (makes checking things easier). Before calling isp_notify_ack make sure we're at RUNSTATE- elsewise we can be responding to LIPs or SCSI bus resets before we've finished some of the wiring.	2000-07-18 07:05:37 +00:00
Matt Jacob	910fb4f6ee	The SERVICING_INTERRUPT isn't quite safe yet.	2000-07-18 07:04:07 +00:00
Matt Jacob	f48ce1882f	Add a isp_target_putback_atio- we aren't using CCINCR at this time, so we need a function that tells the Qlogic f/w that a target mode command is done, so increase the resource count for that lun. Add in a timeout function to kick the putback again if we fail to do it the first time (we may not have the request queue space for ATIO push). Split the function isp_handle_platform_ctio into two parts so that the timeout function for the ATIO push or isp_handle_platform_ctio can inform CAM that the requested CTIO(s) are now done. Clean up (cough) residual handling. What we need for Fibre Channel is to preserve the at_datalen field from the original incoming ATIO so we can calculate a 'true' residual. Unfortunately, we're not guaranteed to get that back from CAM. We'll try to find it hiding in the periph_priv field (layering violation)- but if an ATIO was passed in from user land- forget it. This means that we'll probably get residuals wrong for Fibre Channel commands we're completing with an error. It's too late to 4.1 release to fix this- too bad. Luckily the only device we'd really care about this occurring on is a tape device and they're still so rare as FC attached devices that this can be considered an untested combination anyway. Remove all CCINCR usage (resource autoreplenish). When we've proved to ourself that things are working properly, we can add it back in. Make sure we propage 'suggested' sense data from the incoming ATIO into the created system ATIO- and set sense_len appropriately. Correctly propagate tag values. Fall back to the model of generating (well, the functions in isp_pci.c do the work) multiple CTIOs based upon what we get from XPT. Instead of being able to pair Qlogic generated ATIOs with CAM ATIOs, and then to pair CAM CTIOs with Qlogic CTIOs, we have to take the CTIO passed to us from XPT, and if it implies that we have to generate extra Qlogic CTIOs, so be it. This means that we have to wait until the last CTIO in a sequence we generated completes before calling xpt_done. Executive summary- target mode actually now pretty much works well enough to tell folks about.	2000-07-18 06:58:28 +00:00
Matt Jacob	c77d11d0cc	Raise debug level for some messages. Fix botched inversion about MBOX_COMMAND_ERROR vs. MBOX_COMMAND_PARAM_ERROR.	2000-07-18 06:46:48 +00:00
Matt Jacob	05fbcbb000	Keep interrupts blocked for all of isp_pci_attach. Redo DMA routines for target mode for cleanliness and accuracy.	2000-07-18 06:40:22 +00:00
Matt Jacob	1fcf5deb4a	Oops! If we're deciding a command is now really dead, make darned sure that it really is by issuing a ISPCTL_ABORT_CMD just on the off chance the f/w will start it up again and, ha ha, start using the DMA resources we gave it but are now taking away.	2000-07-05 06:44:17 +00:00
Matt Jacob	3e97a5b432	Clean up ISPCTL_ABORT_CMD function to not be too chatty if it succeeds, or even if it fails with INVALID_PARM (which just means that the handle doesn't refer to an active commane).	2000-07-05 06:41:36 +00:00
Matt Jacob	c464389f4b	Remove obsolete isp_dogactive tag.	2000-07-04 01:06:42 +00:00
Matt Jacob	8bdda719ae	Fix completely stupid and idiotiuc sprintfs in isp_inline.h with with the STRNCAT function.	2000-07-04 01:06:23 +00:00
Matt Jacob	f6e75de230	Add in config_hook for catching when interrupts are safe- this allows us to not the ints are ok and also to (re)ENABLE isp interrupts. Remove all splcam()/splx() invocates and replace them with ISP_LOCK/ISP_UNLOCK macros.	2000-07-04 01:05:43 +00:00
Matt Jacob	df9d46b6d9	Add in isp_lock/isp_unlock inlines. Add in an islocked/intsok flag to isp_osinfo substructure (all in prep for SMP). Define MBOX_WAIT_COMPLETE and MBOX_NOTIFY_COMPLETE macros so that we can now (temp) use tsleep to wait for mailbox completion. Requires us to guess whether we're servicing an interrupt or not- will use intr_nesting_level. Add local strncat function.	2000-07-04 01:04:35 +00:00
Matt Jacob	1d460ef8d5	Change delay loop in new isp_mboxcmd to the use of the new MBOX_WAIT_COMPLETE macro. Change notification of completion of a mailbox command in isp_intr to MBOX_NOTIFY_COMPLETE macro.	2000-07-04 01:02:38 +00:00
Matt Jacob	469b6b9efb	Change startup locking. Use new isp_handle_index function for indexing off of handles to get dma maps.	2000-07-04 01:01:15 +00:00
Matt Jacob	28445eef28	Fix usage of DELAY (SYS_DELAY is the platform independent local define). Fix stupidity wrt checking whether we've gone to LOOP_PDB_RCVD loopstate- it's okay to be greater than this state. D'oh! Protect calls to isp_pdb_sync and isp_fclink_state with IS_FC macros. Completely redo mailbox command routine (in preparation to make this possibly wait rather than poll for completion). Make a major attempt to solve the 'lost interrupt' problem 1. Problem The Qlogic cards would appear to 'lose' interrupts, i.e., a legitimate regular SCSI command placed on the request queue would never complete and the watchdog routine in the driver would eventually wakeup and catch it. This would typically only happen on Alphas, although a couple folks with 700MHz Intel platforms have also seen this. For a long time I thought it was a foulup with f/w negotiations of SYNC and/or WIDE as it always seemed to happen right after the platform it was running on had done a SET TARGET PARAMETERS mailbox command to (re)enable sync && wide (after initially forcing ASYNC/NARROW at startup). However, occasionally, the same thing would also occur for the Fibre Channel cards as well (which, ahem, have no SET TARGET PARAMETERS for transfer mode). After finally putting in a better set of watchdog routines for the platforms for this driver, it seemed to be the case that the command in question (usually a READ CAPACITY) just had up and died- the watchdog routine would catch it after ~10 seconds. For some platforms (NetBSD/OpenBSD)- an ABORT COMMAND mailbox command was sent (which would always fail- indicating that the f/w denied knowledge of this command, i.e., the f/w thought it was a done command). In any case, retrying the command worked. But this whole problem needed to be really fixed. 2. A False Step That Went in The Right Direction The mailbox code was completely rewritten to no longer try and grab the mailbox semaphore register and to try and 'by hand' complete async fast posting completions. It was also rewritten to now have separate in && out bitpatterns for registers to load to start and retrieve to complete. This means that isp_intr now handles mailbox completions. This substantially simplifies the mailbox handling code, and carries things 90% toward getting this to be a non-polled routine for this driver. This did not solve the problem, though. 3. Register Debouncing I saw some comments in some errata sheets and some notes in a Qlogic produced Linux driver (for the Qlogic 2100) that seemed to indicate that debouncing of reads of the mailbox registers might be needed, so I added this. This did not affect the problem. In fact, it made the problem worse for non-2100 cards. 5. Interrupt masking/unmasking The driver used to do a substantial amount of masking/unmasking of the interrupt control register. This was done to make sure that the core common code could just assume it would never get pre-empted. This apparently substantially contributed to the lost interrupt problem. The rewrite of the ICR (Interrupt Control Register), which is a separate register from the ISR (Interrupt Status Register) should not have caused any change to interrupt assertions pending. The manual does not state that it will, and the register layout seems to imply that the ICR is just an active route gate. We only enable PCI Interrupts and RISC Interrupts- this should mean that when the f/w asserts a RISC interrupt and (and the ICR allows RISC Interrupts) and we have PCI Interrupts enabled, we should get a PCI interrupt. Apparently this is a latch- not a signal route. Removing this got rid of most but not all, lost interrupts. 5. Watchdog Smartening I made sure that the watchdog routine would catch cases where the Qlogic's ISR showed an interrupt assertion. The watchdog routine now calls the interrupt service routine if it sees this. Some additional internal state flags were added so that the watchdog routine could then know whether the command it was in the middle of burying (because we had time it out) was in fact completed by the interrupt service routine. 6. Occasional Constipation Of Commands.. In running some very strenous high IOPs tests (generating about 11000 interrupts/second across one Qlogic 1040, one Qlogic 1080 and one Qlogic 2200 on an Alpha PC164), I found that I would get occasional but regular 'watchdog timeouts' on both the 1080 and the 2100 cards. This is under FreeBSD, and the watchdog timeout routine just marks the command in error and retries it. Invariably, right after this 'watchdog timeout' error, I'd get a command completion for the command that I had thought timed out. That is, I'd get a command completion, but the handle returned by the firmware mapped to no current command. The frequency of this problem is low under such a load- it would usually take an 30 minutes per 'lost' interrupt. I doubled the timeout for commands to see if it just was an edge case of waiting too short a period. This has no effect. I gathered and printed out microtimes for the watchdog completed command and the completion that couldn't find a command- it was always the case that the order of occurrence was "timeout, completion" separated by a time on the order of 100 to 150 ms. This caused me to consider 'firmware constipation' as to be a possible culprit. That is, resubmission of a command to the device that had suffered a watchdog timeout seemed to cause the presumed dead command to show back up. I added code in the watchdog routine that, when first entered for the command, marks the command with a flag, reissues a local timeout call for one second later, but also then issues a MARKER Request Queue entry to the Qlogic f/w. A MARKER entry is used typically after a Bus Reset to cause the f/w to get synchronized with respect to either a Bus, a Nexus or a Target. Since I've added this code, I always now see the occasional watchdog timeout, but the command that was about to be terminated always now seems to be completed after the MARKER entry is issued (and before the timeout extension fires, which would come back and really terminate the command).	2000-06-27 19:44:31 +00:00
Matt Jacob	b85389e117	Add in the enabling of interrupts (to isp_attach). Clean up a busted comment. Check against firmware state- not loop state when enabling target mode. Other changes have to do with no longer enabling/disabling interrupts at will. Rearchitect command watchdog timeouts- First of all, set the timeout period for a command that has a timeout (in isp_action) to the period of time requested plus two seconds. We don't want the Qlogic firmware and the host system to race each other to report a dead command (the watchdog is there to catch dead and/or broken firmware). Next, make sure that the command being watched isn't done yet. If it's not done yet, check for INT_PENDING and call isp_intr- if that said it serviced an interrupt, check to see whether the command is now done (this is what the "IN WATCHDOG" private flag is for- if isp_intr completes the command, it won't call xpt_done on it because isp_watchdog is still looking at the command). If no interrupt was pending, or the command wasn't completed, check to see if we've set the private 'grace period' flag. If so, the command really is dead, so report it as dead and complete it with a CAM_CMD_TIMEOUT value. If the grace period flag wasn't set, set it and issue a SYNCHRONIZE_ALL Marker Request Queue entry and re-set the timeout for one second from now (see Revision 1.45 isp.c notes for more on this) to give the firmware a final chance to complete this command.	2000-06-27 19:31:02 +00:00
Matt Jacob	cc28790740	Clean up private storage so that we can use the spriv_field0 to store a bitmask of whether we've set a value into ccb->ccb_h.status, whether we're in the watchdog routine for this command now, whether we've set a grace period for this command and whether this command is actually done. See comments of rev 1.45 of isp.c for more complete information.	2000-06-27 19:22:13 +00:00
Matt Jacob	e2adf86e4e	Add 8 bits of volatile mailbox busy mask- this will be the bitmask of output mailbox values we want to get back out of the chip once a mailbox command is done. Add storage for the maximum number of output mailbox registers to the softc. Roll minor version number.	2000-06-27 19:17:39 +00:00
Matt Jacob	40e88de6c3	Add mailbox bitmask macros (numbers of available mailbox registers based upon Qlogic chip type). Define maximum mailboxes. Add INT_PENDING_MASK macro. Change mailbox offset macro name.	2000-06-27 19:15:43 +00:00
Matt Jacob	986973a448	Add an isp_handle_index function- this is prepatory to loading more into the handle (i.e., generation number), so we will now need a function that will take a handle and return a flat index [ 0 .. maxhandles-1 ] for auxillary routines that need an index to get at buddy store values (like dma maps or xflist pointers).	2000-06-27 19:14:14 +00:00

... 2 3 4 5 6 ...

571 Commits