freebsd-nq

Author	SHA1	Message	Date
Adrian Chadd	0891354cd2	Migrate the multicast queue assembly code to not use the axq_link pointer and instead use the HAL method to set the link pointer. Tested: * AR9280, hostap mode, CABQ frames being queued and transmitted	2013-03-26 04:47:40 +00:00
Adrian Chadd	1f6b3ed63c	Add new regulatory domain. Obtained from: Qualcomm Atheros	2013-03-24 04:42:56 +00:00
Adrian Chadd	56a859789f	Move the TXQ lock earlier in this routine - so to correctly protect the link pointer check.	2013-03-24 04:09:54 +00:00
Adrian Chadd	0acf45ed86	Fix the locking changes due to the TXQ change drive-by. Tested: * AR9580, STA mode	2013-03-24 04:09:29 +00:00
Adrian Chadd	b837332d0a	Overhaul the TXQ locking (again!) as part of some beacon/cabq timing related issues. Moving the TX locking under one lock made things easier to progress on but it had one important side-effect - it increased the latency when handling CABQ setup when sending beacons. This commit introduces a bunch of new changes and a few unrelated changs that are just easier to lump in here. The aim is to have the CABQ locking separate from other locking. The CABQ transmit path in the beacon process thus doesn't have to grab the general TX lock, reducing lock contention/latency and making it more likely that we'll make the beacon TX timing. The second half of this commit is the CABQ related setup changes needed for sane looking EDMA CABQ support. Right now the EDMA TX code naively assumes that only one frame (MPDU or A-MPDU) is being pushed into each FIFO slot. For the CABQ this isn't true - a whole list of frames is being pushed in - and thus CABQ handling breaks very quickly. The aim here is to setup the CABQ list and then push _that list_ to the hardware for transmission. I can then extend the EDMA TX code to stamp that list as being "one" FIFO entry (likely by tagging the last buffer in that list as "FIFO END") so the EDMA TX completion code correctly tracks things. Major: * Migrate the per-TXQ add/removal locking back to per-TXQ, rather than a single lock. * Leave the software queue side of things under the ATH_TX_LOCK lock, (continuing) to serialise things as they are. * Add a new function which is called whenever there's a beacon miss, to print out some debugging. This is primarily designed to help me figure out if the beacon miss events are due to a noisy environment, issues with the PHY/MAC, or other. * Move the CABQ setup/enable to occur _after_ all the VAPs have been looked at. This means that for multiple VAPS in bursted mode, the CABQ gets primed once all VAPs are checked, rather than being primed on the first VAP and then having frames appended after this. Minor: * Add a (disabled) twiddle to let me enable/disable cabq traffic. It's primarily there to let me easily debug what's going on with beacon and CABQ setup/traffic; there's some DMA engine hangs which I'm finally trying to trace down. * Clear bf_next when flushing frames; it should quieten some warnings that show up when a node goes away. Tested: * AR9280, STA/hostap, up to 4 vaps (staggered) * AR5416, STA/hostap, up to 4 vaps (staggered) TODO: * (Lots) more AR9380 and later testing, as I may have missed something here. * Leverage this to fix CABQ hanling for AR9380 and later chips. * Force bursted beaconing on the chips that default to staggered beacons and ensure the CABQ stuff is all sane (eg, the MORE bits that aren't being correctly set when chaining descriptors.)	2013-03-24 00:03:12 +00:00
Adrian Chadd	49ddabc4bd	CABQ calculation changes to try and fix some weird corner cases leading to stuck beacons. * Set the cabq readytime (ie, how long to burst for) to 50% of the total beacon interval time * fix the cabq adjustment calculation based on how the beacon offset is calculated (the SWBA/DBA time offset.) This is all still a bit magic voodoo but it does seem to have further quietened issues with missed/stuck beacons under my local testing. In any case, it better matches what the reference HAL implements. Obtained from: Qualcomm Atheros	2013-03-23 23:51:11 +00:00
Adrian Chadd	9cda8c8082	Fix the EDMA CABQ handling - for now, the CABQ takes a descriptor chain like the legacy chips expect.	2013-03-20 05:44:03 +00:00
Adrian Chadd	f0db652cf6	Break out the RX completion path into "FIFO check / refill" and "complete RX frames." The 128 entry RX FIFO is really easy to fill up and miss refilling when it's done in the ath taskq - as that gets blocked up doing RX completion, TX completion and other random things. So the 128 entry RX FIFO now gets emptied and refilled in the ath_intr() task (and it grabs / releases locks, so now ath_intr() can't just be a FAST handler yet!) but the locks aren't held for very long. The completion part is done in the ath taskqueue context. Details: * Create a new completed frame list - sc->sc_rx_rxlist; * Split the EDMA RX process queue into two halves - one that processes the RX FIFO and refills it with new frames; another that completes the completed frame list; * When tearing down the driver, flush whatever is in the deferred queue as well as what's in the FIFO; * Create two new RX methods - one that processes all RX queues, one that processes the given RX queue. When MSI is implemented, we get told which RX queue the interrupt came in on so we can specifically schedule that. (And I can do that with the non-MSI path too; I'll figure that out later.) * Convert the legacy code over to use these new RX methods; * Replace all the instances of the RX taskqueue enqueue with a call to a relevant RX method to enqueue one or all RX queues. Tested: * AR9380, STA * AR9580, STA * AR5413, STA	2013-03-19 19:32:28 +00:00
Adrian Chadd	74ea88c379	Add more TODO items.	2013-03-19 17:55:36 +00:00
Adrian Chadd	378a752f59	Now that the tx map field is correctly populated for both edma and legacy chips, just use that.	2013-03-19 17:54:37 +00:00
Adrian Chadd	1ab002f461	Print out the current fifo queue depth correctly - not just the max queue depth. Silly hat to me.	2013-03-18 02:29:57 +00:00
Adrian Chadd	eefc93a947	Dump out information about the RX descriptor free list and FIFO information.	2013-03-18 01:12:36 +00:00
Adrian Chadd	d50e882ab9	Log some more information when the RX buffer allocation failed.	2013-03-18 01:11:52 +00:00
Adrian Chadd	cd4f1ba89f	Why'd I keep this here? remove it entirely now.	2013-03-15 20:22:20 +00:00
Adrian Chadd	302868d914	Fix two bugs: * when pulling frames off of the TID queue, the ATH_TID_REMOVE() macro decrements the axq_depth field. So don't do it twice. * in ath_tx_comp_cleanup_aggr(), bf wasn't being reset to bf_first before walking the buffer list to complete buffers; so those buffers will leak.	2013-03-15 20:00:08 +00:00
Adrian Chadd	8454d32107	Remove a now incorrect comment. This comment dates back to my initial stab at TX aggregation completion, where I didn't even bother trying to do software retries.	2013-03-15 04:43:27 +00:00
Adrian Chadd	5f2f0e616b	Add locking around the new holdingbf code. Since this is being done during buffer free, it's a crap shoot whether the TX path lock is held or not. I tried putting the ath_freebuf() code inside the TX lock and I got all kinds of locking issues - it turns out that the buffer free path sometimes is called with the lock held and sometimes isn't. So I'll go and fix that soon. Hence for now the holdingbf buffers are protected by the TXBUF lock.	2013-03-15 02:52:37 +00:00
Adrian Chadd	629ce2188a	Implement "holding buffers" per TX queue rather than globally. When working on TDMA, Sam Leffler found that the MAC DMA hardware would re-read the last TX descriptor when getting ready to transmit the next one. Thus the whole ATH_BUF_BUSY came into existance - the descriptor must be left alone (very specifically the link pointer must be maintained) until the hardware has moved onto the next frame. He saw this in TDMA because the MAC would be frequently stopping during active transmit (ie, when it wasn't its turn to transmit.) Fast-forward to today. It turns out that this is a problem not with a single MAC DMA instance, but with each QCU (from 0->9). They each maintain separate descriptor pointers and will re-read the last descriptor when starting to transmit the next. So when your AP is busy transmitting from multiple TX queues, you'll (more) frequently see one QCU stopped, waiting for a higher-priority QCU to finsh transmitting, before it'll go ahead and continue. If you mess up the descriptor (ie by freeing it) then you're short of luck. Thanks to rpaulo for sticking with me whilst I diagnosed this issue that he was quite reliably triggering in his environment. This is a reimplementation; it doesn't have anything in common with the ath9k or the Qualcomm Atheros reference driver. Now - it in theory doesn't apply on the EDMA chips, as long as you push one complete frame into the FIFO at a time. But the MAC can DMA from a list of frames pushed into the hardware queue (ie, you concat 'n' frames together with link pointers, and then push the head pointer into the TXQ FIFO.) Since that's likely how I'm going to implement CABQ handling in hostap mode, it's likely that I will end up teaching the EDMA TX completion code about busy buffers, just to be "sure" this doesn't creep up. Tested - iperf ap->sta and sta->ap (with both sides running this code): * AR5416 STA * AR9160/AR9220 hostap To validate that it doesn't break the EDMA (FIFO) chips: * AR9380, AR9485, AR9462 STA Using iperf with the -S <tos byte decimal value> to set the TCP client side DSCP bits, mapping to different TIDs and thus different TX queues. TODO: * Make this work on the EDMA chips, if we end up pushing lists of frames to the hardware (eg how we eventually will handle cabq in hostap/ibss mode.)	2013-03-14 06:20:02 +00:00
Adrian Chadd	0639c54a67	Use the correct antenna configuration variable here. "diversity" just controls whether it's on or off. Found by: clang	2013-03-12 03:03:24 +00:00
Adrian Chadd	0e168bb8e3	Add a few new fields to the RX vendor radiotap header: * a flags field that lets me know what's going on; * the hardware ratecode, unmolested by conversion to a bitrate; * the HAL rs_flags field, useful for debugging; * specifically mark aggregate sub-frames. This stuff sorely needs tidying up - it's missing some important stuff (eg numdelims) and it would be nice to put the flags at the beginning rather than at the end. Tested: * AR9380, STA mode, 2x2 HT40, monitoring RSSI and EVM values	2013-03-11 06:54:58 +00:00
Adrian Chadd	6b3ba411d3	Bump the EVM array size up to fit the AR9380 EVM entries.	2013-03-11 06:01:00 +00:00
Adrian Chadd	1896b0880a	Add three-stream EVM values.	2013-03-11 04:19:10 +00:00
Adrian Chadd	ba8d066231	Add another register definition bit - whether to populate EVM or PLCP data in the RX status descriptor. Obtained from: Qualcomm Atheros	2013-03-10 09:43:01 +00:00
Adrian Chadd	b3420862a7	Disable the hw TID != buffer TID check. I can 100% reliably trigger this on TID 1 traffic by using iperf -S 32 <client fields> to create traffic that maps to TID 1. The reference driver doesn't do this check.	2013-03-09 08:50:17 +00:00
Adrian Chadd	9d2a962bf3	Print out the queue flags during a TX DMA shutdown.	2013-03-09 06:11:58 +00:00
Adrian Chadd	bdb9fa5c87	add a method to set/clear the VMF field in the TX descriptor. Obtained from: Qualcomm Atheros	2013-03-04 07:40:49 +00:00
Adrian Chadd	87c176d272	Add missing flags.	2013-02-28 23:39:38 +00:00
Adrian Chadd	7a27f0a338	Oops - fix an incorrect test.	2013-02-28 23:39:22 +00:00
Adrian Chadd	5612990623	Don't enable the HT flags for legacy rates. I stumbled across this whilst trying to debug another weird hang reported on the freebsd-wireless list. Whilst here, add in the STBC check to ath_rateseries_setup(). Whilst here, fix the short preamble flag to be set only for legacy rates. Whilst here, comment that we should be using the full set of decisions made by ath_rateseries_setup() rather than recalculating them!	2013-02-28 23:31:23 +00:00
Adrian Chadd	ee563d630b	I give up - just throw the EWMA update into the normal update_stats() routine. There were still corner cases where the EWMA update stats are being called on a rix which didn't have an intermediary stats update; thus no packets were counted against it. Sigh. This should fix the crashes I've been seeing on recent -HEAD.	2013-02-27 04:33:06 +00:00
Adrian Chadd	1a3a560767	Enable STBC for the given rate series if it's negotiated: * If both ends have negotiated (at least) one stream; * Only if it's a single stream rate (MCS0-7); * Only if there's more than one TX chain enabled. Tested: * AR9280 STA mode -> Atheros AP; tested both MCS2 (STBC) and MCS12 (no STBC.) Verified using athalq to inspect the TX descriptors. TODO: * Test AR5416 - no STBC should be enabled; * Test AR9280 with one TX chain enabled - no STBC should be enabled.	2013-02-27 00:49:32 +00:00
Adrian Chadd	6606ba811c	Add in the STBC TX/RX capability support into the HAL and driver. The HAL already included the STBC fields; it just needed to be exposed to the driver and net80211 stack. This should allow single-stream STBC TX and RX to be negotiated; however the driver and rate control code currently don't do anything with it.	2013-02-27 00:25:44 +00:00
Adrian Chadd	38fda92679	Update the EWMA statistics for each intermediary rate as well as the final rate. This fixes two things: * The intermediary rates now also have their EWMA values changed; * The existing code was using the wrong value for longtries - so the EWMA stats were only adjusted for the first rate and not subsequent rates in a MRR setup. TODO: * Merge the EWMA updates into update_stats() now..	2013-02-26 10:24:49 +00:00
Adrian Chadd	6322256b83	Part #2 of the TX chainmask changes: * Remove ar5416UpdateChainmasks(); * Remove the TX chainmask override code from the ar5416 TX descriptor setup routines; * Write a driver method to calculate the current chainmask based on the operating mode and update the driver state; * Call the HAL chainmask method before calling ath_hal_reset(); * Use the currently configured chainmask in the TX descriptors rather than the hardware TX chainmasks. Tested: * AR5416, STA/AP mode - legacy and 11n modes	2013-02-25 22:45:02 +00:00
Adrian Chadd	d2a72d673f	Begin adding support to explicitly set the current chainmask. Right now the only way to set the chainmask is to set the hardware configured chainmask through capabilities. This is fine for forcing the chainmask to be something other than what the hardware is capable of (eg to reduce TX/RX to one connected antenna) but it does change what the HAL hardware chainmask configuration is. For operational mode changes, it (may?) make sense to separately control the TX/RX chainmask. Right now it's done as part of ar5416_reset.c - ar5416UpdateChainMasks() calculates which TX/RX chainmasks to enable based on the operating mode. (1 for legacy and whatever is supported for 11n operation.) But doing this in the HAL is suboptimal - the driver needs to know the currently configured chainmask in order to correctly enable things for each TX descriptor. This is currently done by overriding the chainmask config in the ar5416 TX routines but this has to disappear - the AR9300 HAL support requires the driver to dynamically set the TX chainmask based on the TX power and TX rate in order to meet mini-PCIe slot power requirements. So: * Introduce a new HAL method to set the operational chainmask variables; * Introduce null methods for the previous generation chipsets; * Add new driver state to record the current chainmask separate from the hardware configured chainmask. Part #2 of this will involve disabling ar5416UpdateChainMasks() and moving it into the driver; as well as properly programming the TX chainmask based on the currently configured HAL chainmask. Tested: * AR5416, STA mode - both legacy (11a/11bg) and 11n rates - verified that AR_SELFGEN_MASK (the chainmask used for self-generated frames like ACKs and RTSes) is correct, as well as the TX descriptor contents is correct.	2013-02-25 22:42:43 +00:00
Adrian Chadd	ffdc8f48dd	Add a workaround for AR5416, AR9130 and AR9160 chipsets - work around an incorrectly calculated RTS duration value when transmitting aggregates. These earlier 802.11n NICs incorrectly used the ACK duration time when calculating what to put in the RTS of an aggregate frame. Instead it should have used the block-ack time. The result is that other stations may not reserve enough time and start transmitting _over_ the top of the in-progress blockack field. Tsk. This workaround is to popuate the burst duration field with the delta between the ACK duration the hardware is using and the required duration for the block-ack. The result is that the RTS field should now contain the correct duration for the subsequent block-ack. This doesn't apply for AR9280 and later NICs. Obtained from: Qualcomm Atheros	2013-02-22 07:07:11 +00:00
Adrian Chadd	ce597531f2	Disable debugging entries about BAW issues. I haven't seen any issues to do with BAW tracking in the last 9 months or so.	2013-02-21 21:47:35 +00:00
Adrian Chadd	de2d9111ec	Be slightly more paranoid with the TX DMA buffer maximum threshold. Specifically - never jack the TX FIFO threshold up to the absolute maximum; always leave enough space for two DMA transactions to appear. This is a paranoia from the Linux ath9k driver. It can't hurt. Obtained from: Linux ath9k	2013-02-21 08:42:40 +00:00
Adrian Chadd	a54ecf784a	Add an option to allow the minimum number of delimiters to be tweaked. This is primarily for debugging purposes. Tested: * AR5416, STA mode	2013-02-21 06:38:49 +00:00
Adrian Chadd	4a502c332a	Add a new option to limit the maximum size of aggregates. The default is to limit them to what the hardware is capable of. Add sysctl twiddles for both the non-RTS and RTS protected aggregate generation. Whilst here, add some comments about stuff that I've discovered during my exploration of the TX aggregate / delimiter setup path from the reference driver.	2013-02-21 06:18:40 +00:00
Adrian Chadd	054eace83f	Remove this unneeded printf(), sorry!	2013-02-21 02:52:13 +00:00
Adrian Chadd	d7cc11edce	Configure larger TX FIFO default and maximum level values. This has reduced the number of TX delimiter and data underruns when doing large UDP transfers (>100mbit). This stops any HAL_INT_TXURN interrupts from occuring, which is a good sign! Obtained from: Qualcomm Atheros	2013-02-20 12:14:49 +00:00
Adrian Chadd	71d6fe723e	If any of the TX queues have underrun reporting enabled, enable HAL_INT_TXURN in the interrupt mask register. This should now allow for TXURN interrupts to be posted.	2013-02-20 11:24:11 +00:00
Adrian Chadd	f274e91f67	A couple of quick tidyups: * Delete this debugging print - I used it when debugging the initial TX descriptor chaining code. It now works, so let's toss it. It just confuses people if they enable TX descriptor debugging as they get two slightly different versions of the same descriptor. * Indenting.	2013-02-20 11:22:44 +00:00
Adrian Chadd	69930f8794	Enable TX FIFO underrun interrupts. This allows the TX FIFO threshold adjustment code to now run. Tested: * AR5416, STA TODO: * Much more thorough testing on the other chips, AR5210 -> AR9287	2013-02-20 11:20:51 +00:00
Adrian Chadd	bab336db27	oops, tab!	2013-02-20 11:17:29 +00:00
Adrian Chadd	a26f33276f	Post interrupts in the ath alq trace.	2013-02-20 11:17:03 +00:00
Adrian Chadd	158cb431db	CFG_ERR, DATA_UNDERRUN and DELIM_UNDERRUN are all flags, rather than part of ts_status. Thus: * make sure we decode them from ts_flags, rather than ts_status; * make sure we decode them regardless of whether there's an error or not. This correctly exposes descriptor configuration errors, TX delimiter underruns and TX data underruns.	2013-02-20 11:14:55 +00:00
Adrian Chadd	feb043c69c	Fix an incorrect sizeof() PR: kern/176238 Submitted by: Christoph Mallon <christoph.mallon@gmx.de>	2013-02-18 18:39:15 +00:00
Adrian Chadd	d97c06b3a4	Add a new ATH KTR debug method to log the interrupt status.	2013-02-18 04:10:38 +00:00

1 2 3 4 5 ...

1537 Commits