freebsd-dev

Author	SHA1	Message	Date
Oleksandr Tymoshenko	3279df7e87	[am335x] Use upstream tda19988 framer node from upstream DTS Remove custom DTS duplicate of tda19988 node and use upstream-provided one introduced by r295436. This duplication created two tdaX devices which confused fb driver into using only 640x480 area while setting display to native resolution. Reported by: Michael Smith MFC after: 3 days	2017-01-23 19:20:55 +00:00
Alexander Motin	b96d9778eb	Fix overrun handling issue in r312291. MFC after: 1 week	2017-01-23 17:44:33 +00:00
Bjoern A. Zeeb	09d8a96e1c	Remove a static function declaration for a function not implemented. Makes head code compile on 10.3 and cleanup is never wrong. MFC after: 3 days	2017-01-23 16:40:20 +00:00
Adrian Chadd	5e2c0d2d47	[ath] modify cabq and per-node packet usage limits. * limit cabq to 64 - in practice if this stays at ath_txbuf then all buffers can be tied up by a very busy broadcast domain (eg ARP storm, way too much MDNS/NETBIOS). It's been like this in the freebsd-wifi-build AP project for the longest time. * Now that I figured out the hilarity inherent in aggregate forming and AR9380 EDMA work, change the per-node to 64 frames by default. I'll do some more work to shorten the queue latency introduced when doing data so TCP isn't so terrible, but it's now no longer /always/ tens of milliseconds of extra latency when doing active iperf tests. Notes: The reason for the extra latency is partly tx/rx taskqueue handling and scheduling, and partly due to a lack of airtime/QoS awareness of per-node traffic. Ideally we'd have different limits/priorities on the QoS/TID levels per node so say, voice/video data got a better share of buffer allocations over best effort/bulk data, but we currently don't implement that. It's not /hard/ to do, I just need to do it. Tested: * AR9380 (STA), AR9580 (hostap) - both with the relevant changes. TCP is now at around 180mbit with rate control and RTS protection enabled. UDP stays at 355mbit at MCS23, no HT protection.	2017-01-23 04:47:38 +00:00
Adrian Chadd	57af292d36	[ath] fix thresholds for deciding to queue to the software queue and populate hardware frames This is two fixes, which establishes what I /think/ is pretty close to the theoretical PHY maximum speed on the AR9380 devices. * When doing A-MPDU on a TID, don't queue to the hardware directly if the hardware queue is busy. This gives us time to get more packets queued up (and the hardware is busy, so there's no point in queuing more to the hardware right now) to potentially form an A-MPDU. This fixes up the throughput issue I was seeing where a couple hundred single frames were being sent a second interspersed between A-MPDU frames. It just happened that the software queue had exactly one frame in it at that point. Queuing it until the hardware finishes transmitting isn't exactly costly. * When determining whether to dequeue from a software node/TID queue into the hardware queue, fix up the checks to work right for EDMA chips (ar9380 and later.) Before it was not dispatching anything until the FIFO was empty. Now we allow it to dispatch another aggregate up to the hardware aggregate limit, like I intended with the earlier work. This allows a 5GHz HT40, short-GI, "htprotmode off" test at MCS23 to achieve 357 Mbit/sec in a one-way UDP test. The stars have to be aligned /just right/ so there are no retries but it can happen. Just don't expect it to work in an OTA test if your 2yo is running around the room - MCS23 is very very sensitive to channel conditions. Tested: * AR9380 STA (test) -> AR9580 hostap TODO: * More thorough testing on pre-AR9380 chips (AR5416, AR9160, AR9280) * (Finally) teach ath_rate_sample about throughput/latency rather than air time, so I can get good transmit rates with a 2yo running around.	2017-01-23 04:30:08 +00:00
Adrian Chadd	6eb9f206a6	[ath] [ar9300] ensure the software scheduler is called to form more aggregates for EDMA chips When investigating performance on UDP TX on the AR9380 I found that the following sequence was occuring: * INTR * EINPROGRESS - nothing yet * INTR * TXSTATUS - process a TX completion for an aggregate * INTR, INTR * TXSTATUS - process a TX completion for an aggregate * TXD, TXD ... populate frames from the hardware queue and submit What should be happening is a completed TXSTATUS fires off more packets that are queued on active TIDs. What /was/ happening was after that first TXSTATUS the TX queue hardware queue was still empty, so it didn't push anything into the FIFO. Only after the second TXSTATUS did any progress get made. This is one of two commits - it ensures that the software TX queue scheduler is called /after/ TX completion, otherwise no frames from the software staging queues will be processed into the hardware queues. The second commit will fix it so it populates aggregate frames correctly when the above occurs - right now ath_txq_sched() is called, but it doesn't populate anything because its pre-check conditions are wrong. Whilst here, add/tweak debugging. Tested: * AR9380 STA (testing device) -> AR9580 hostap	2017-01-23 04:20:37 +00:00
Justin Hibbits	8d2f50dba6	Avoid using non-zero argument for __builtin_frame_address(). Building kernel with devel/powerpc64-gcc (6.2.0) yields the following error: /usr/src/sys/powerpc/powerpc/db_trace.c:299:20: error: calling '__builtin_frame_address' with a nonzero argument is unsafe [-Werror=frame-address] Work around this by dereferencing the frame address manually instead. PR: 215600 Reported by: Mark Millard <markmi AT dsl-only DOT net> MFC after: 2 weeks	2017-01-23 04:03:12 +00:00
Mark Johnston	792e2f09ee	Remove the DTRACEHIOC_ADD ioctl. This ioctl has been considered legacy by upstream since the DTrace code was first imported, and is unused. The removal also allows some simplification of dtrace_helper_slurp(). Also remove a bogus copyout in the DTRACEHIOC_ADDDOF handler. Due to a bug, it would overwrite an in-memory copy of the DOF header rather than the passed-in DOF helper. Moreover, DTRACEHIOC_ADDDOF already copies the helper back out automatically since its argument has the IOC_OUT attribute.	2017-01-23 02:21:06 +00:00
Andriy Voskoboinyk	98ff1f7c6e	net80211: fix flags setup for HT40 5GHz channels.	2017-01-22 23:45:59 +00:00
Edward Tomasz Napierala	f308ab7043	Remove max_targets and max_target_id CTL port variables; they were unused. This changes the CTL frontend ABI and thus shouldn't be MFC-ed. Reviewed by: mav@	2017-01-22 20:06:15 +00:00
Konstantin Belousov	3467f88cd6	Add comments explaining unobvious td_critnest adjustments in critical_exit(). Based on the discussion with: jhb Reviewed by: imp Sponsored by: The FreeBSD Foundation Differential revision: D9276 MFC after: 1 week	2017-01-22 19:41:42 +00:00
Konstantin Belousov	25c6816845	More style cleanup. Use ANSI C definition for vn_closefile(). Switch to VNASSERT in _vn_lock(), simplify messages. Sponsored by: The FreeBSD Foundation X-MFC with: r312600, r312601, r312602, r312606	2017-01-22 19:38:45 +00:00
Konstantin Belousov	aec8391d46	Provide fallback VOP methods for crossmp vnode. In particular, crossmp vnode might leak into rename code. PR: 216380 Reported by: fnacl@protonmail.com Sponsored by: The FreeBSD Foundation X-MFC with: r309425	2017-01-22 19:36:02 +00:00
Sean Bruno	db569544d1	igb(4) enable WOL features for this class of devices. PR: 208343 Submitted by: Kaho Tashikazu <kaho@elam.kais.kyoto-u.ac.jp>	2017-01-22 18:04:57 +00:00
Luiz Otavio O Souza	6386d003c9	Be a little more pedantic here, the TRM says the hardware is supposed to only clean the OWNER bit on SOP descriptors. MFC after: 3 days Sponsored by: Rubicon Communications, LLC (Netgate)	2017-01-22 17:24:00 +00:00
Luiz Otavio O Souza	3a3d1c770e	Properly assemble an mbuf chain out of received fragments. Remove the rx_batch hack, it makes no difference now that most of bugs have been sorted out. Sponsored by: Rubicon Communications, LLC (Netgate)	2017-01-22 17:07:37 +00:00
Edward Tomasz Napierala	5c93966020	Remove redundant KASSERT.	2017-01-22 15:35:51 +00:00
Edward Tomasz Napierala	c6646787ab	Add SCSI descriptors for USB Mass Storage. MFC after: 2 weeks Sponsored by: The FreeBSD Foundation	2017-01-22 15:32:17 +00:00
Edward Tomasz Napierala	8acac5a9f5	Improve debugging printf.	2017-01-22 15:27:14 +00:00
Justin Hibbits	3e7e31bda0	Hide the 'MOREARGS' macro, it conflicts with contrib code, and is only used in one file. PR: 211818 Reported by: Mark Millard <markmi AT dsl-only.net> MFC after: 2 weeks	2017-01-22 06:30:55 +00:00
Justin Hibbits	279ae8fec0	Fix r312613. Somehow this slipped through my build testing.	2017-01-22 06:17:31 +00:00
Justin Hibbits	0b58112faf	Don't pass -Wa,-many through clang, the integrated as doesn't support it. Our base binutils sets -many by default anyway, but external gcc may not do this. PR: kern/215948 Submitted by: Mark Millard <markmi AT dsl-only DOT net> Reported by: Mark Millard MFC after: 2 weeks	2017-01-22 06:00:05 +00:00
Justin Hibbits	5c1f5ea426	Fix use of uninitialized variable. I don't know how gcc didn't catch this. This was caught during test building with clang.	2017-01-22 05:49:43 +00:00
Adrian Chadd	a2d74cc30b	[ath] only apply the AR9300 delimiter workaround for the first sub-frame. This is supposed to only be applied to the first subframe and only if RTS/CTS is being done. I'm still not yet checking RTS/CTS exchange status so it's just happening for all subframes on AR9380 and later. This gets MCS23 throughput up from around 250mbit to 303mbit with RTS/CTS protection enabled, and around 330mbit with no HT protection enabled. Now, MCS23 has a PHY rate of 450mbit and we should be seeing closer to 400mbit for a straight one-way UDP test, but this beats the previous maximum throughput. Tested: * AR9380 (STA) -> AR9580 (AP) - STA with the modifications, doing UDP TX test using iperf.	2017-01-22 05:45:42 +00:00
Jason A. Harmening	e0b79e669b	Like r310481 for i386, move the objects used to create temporary mappings for armv6 pmap zero and copy operations to the MD PCPU region. Change sysmap initialization to only allocate KVA pages for CPUs that are actually present. While here, collapse CMAP3 into CMAP2 (their use was mutually exclusive anyway) and "recover" some space in PCPU padding that has always been available due to 64-byte cacheline padding. Reviewed by: skra MFC after: 1 week Differential Revision: https://reviews.freebsd.org/D9172	2017-01-22 00:46:04 +00:00
Adrian Chadd	28dc144e73	[ath] improve the debugging when looking into the maximum A-MPDU size being chosen. This is how I caught the "why are we only sending 8K aggregates?" problem.	2017-01-21 23:35:54 +00:00
Luiz Otavio O Souza	6d800e3c9c	Handle the rx queue stall while reading the packets from NIC (when the descriptor state will not change anymore). This seems to eliminate the race where we can miss a stalled queue under high load. While here remove the unnecessary curly brackets. Reported by: Konstantin Kormashev <konstantin@netgate.com> MFC after: 3 days Sponsored by: Rubicon Communications, LLC (Netgate)	2017-01-21 23:07:15 +00:00
Andriy Voskoboinyk	8f1e113906	ath: adapt LDPC support checks Set both IEEE80211_HTCAP_LDPC and IEEE80211_HTC_TXLDPC capability flags if LDPC is supported + set 'do_ldpc = 1' only when it is not disabled, not just supported. Reviewed by: adrian Differential Revision: https://reviews.freebsd.org/D9277	2017-01-21 21:03:26 +00:00
Mateusz Guzik	eaf0969bda	vfs: fix LK_RETRY logic braino in r312600	2017-01-21 20:34:20 +00:00
Luiz Otavio O Souza	f348557302	Commit the struture changes for the padding of small packets on if_cpsw. Should have been committed together with r312604. MFC with: r312604	2017-01-21 19:56:28 +00:00
Luiz Otavio O Souza	904e8e891b	Simplify the handling of small packets padding in cpsw: - Pad small packets to 60 bytes and not 64 (exclude the CRC bytes); - Pad the packet using m_append(9), if the packet has enough space for padding, which is usually true, it will not be necessary append a newly allocated mbuf to the chain. Suggested by: yongari MFC after: 3 days Sponsored by: Rubicon Communications, LLC (Netgate)	2017-01-21 19:49:39 +00:00
Alexander Motin	0c629e2884	Add initial support for CTL module unloading. It is only a first step and not perfect, but better then nothing. The main blocker is CAM target frontend, that can not be unloaded, since CAM does not have mechanism to unregister periph driver now. MFC after: 2 weeks	2017-01-21 19:38:26 +00:00
Mateusz Guzik	829857c893	vfs: __predict_false the need to handle F_HASLOCK Also reorder the check with DTYPE_VNODE. Passed files are vnodes vast majority of the time, so it is typically true.	2017-01-21 19:01:42 +00:00
Mateusz Guzik	abbc538d9a	vfs: fix whitespace damage in r312600 While here wrap the previously overly long line so that it fits 80 chars.	2017-01-21 18:56:58 +00:00
Mateusz Guzik	1091fb52c1	vfs: refactor _vn_lock Stop testing for LK_RETRY and error multiple times. Also postpone the VI_DOOMED until after LK_RETRY was seen as it reads from the vnode. No functional changes.	2017-01-21 18:38:16 +00:00
Conrad Meyer	21dff1801f	Add remaining ELF compression definitions and structs A follow-up to r300231. Sponsored by: Dell EMC Isilon	2017-01-21 17:39:10 +00:00
Mateusz Guzik	067115e050	vfs: hide the getvnode NULL mp message behind DIAGNOSTIC Since crossmp vnode changes the message was being printed on each boot. Reported by: trasz Discussed with: kib	2017-01-21 16:59:50 +00:00
Andriy Voskoboinyk	3111723c09	rtwn: enable LDPC support where possible Tested with RTL8821AU, STA mode.	2017-01-21 15:03:58 +00:00
Andriy Voskoboinyk	c5bba9da5a	net80211: allow to configure LDPC support Tested with RTL8821AU, STA mode (Tx support only) Reviewed by: adrian Differential Revision: https://reviews.freebsd.org/D9268	2017-01-21 14:19:06 +00:00
Brooks Davis	e0f076ceea	Enable TMPFS on MALTA so we can use it on minimalist disk images without modules. Sponsored by: DARPA, AFRL	2017-01-21 09:08:27 +00:00
Adrian Chadd	0149d3d476	[ath] ensure both iv_ampdu_limit and iv_ampdu_rxmax is set. A recent change enforced the VAP limit as well as the peer limit. I now need to actually set iv_ampdu_limit or we don't transmit more than 8K sized aggregates. This restores the expected (suboptimal, but still much faster) behaviour. Tested: * AR9380, STA mode	2017-01-21 06:53:30 +00:00
Konstantin Belousov	5611aaa195	Use SFENCE for ordering CLFLUSHOPT. SDM states that CLFLUSHOPT instructions can be ordered with other writes by SFENCE, heavier MFENCE is not required. Reviewed by: alc Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2017-01-20 19:08:44 +00:00
Alan Somers	c9bd03d87f	Fix "camcontrol timestamp -s" with LTO-7 drives The length of the scsi_set_timestamp_parameters struct was incorrect. LTO-5 drives don't care, but LTO-7 drives do. Reviewed by: Sam Klopsch MFC after: 2 weeks Sponsored by: Spectra Logic Corp	2017-01-20 17:54:24 +00:00
Hans Petter Selasky	10c8755706	Fix for race leading to endless timer interrupts related to configtimer(). During normal operation "state->nextcallopt" will always be less than or equal to "state->nextcall" and checking only "state->nextcallopt" before calling "callout_process()" is sufficient. However when "configtimer()" is called a race might happen requiring both of these binary times to be checked. Short description of race: 1) A configtimer() call will reset both "state->nextcall" and "state->nextcallopt" to the same binary time. 2) If a "callout_reset()" call happens between "configtimer()" and the next "callout_process()" call, "state->nextcallopt" will get updated and "state->nextcall" will remain at the current time. Refer to logic inside cpu_new_callout(). 3) getnextcpuevent() only respects "state->nextcall" and returns this value over and over again, even if it is in the past, until "now >= state->nextcallopt" becomes true. Then these two time variables are corrected by a "callout_process()" call and the situation goes back to normal. The problem manifests itself in different ways. The common factor is the timer process(es) consume all CPU on one or more CPU cores for a long time, blocking other kernel processes from getting execution time. This can be seen by very high interrupt counts as displayed by "vmstat -i \| grep timer" right after boot. When EARLY_AP_STARTUP was enabled in r310177 the likelyhood of hitting this bug apparently increased. Example output from "vmstat -i" before patch: cpu0:timer 7591 69 cpu9:timer 39031773 358089 cpu4:timer 9359 85 cpu3:timer 9100 83 cpu2:timer 9620 88 Example output from "vmstat -i" after patch: cpu0:timer 4242 34 cpu6:timer 5531 44 cpu3:timer 6450 52 cpu1:timer 4545 36 cpu9:timer 7153 58 Before the patch cpu9 in the example above, was spinning in a loop in order to reach 39 million interrupts just a few seconds after bootup. After the patch the timer interrupt counts are more or less consistent. Discussed with: mav @ Reported by: several people MFC after: 1 week Sponsored by: Mellanox Technologies	2017-01-20 17:40:31 +00:00
Ryan Stone	7d309e8e40	Fix reference to free memory in ixgbe/if_media.c When ixgbe receives an interrupt indicating that a new optical module may have been inserted, it discards all of its current media types by calling ifmedia_removeall() and then creates a new set of media types for the supported media on the new module. However, ifmedia_removeall() was maintaining a pointer to whatever the current media type was before the call to ifmedia_removealL(). The result of this was that any attempt to read the current media type of the interface (e.g. via ifconfig) would return potentially garbage data from free memory (or if one were particularly unlucky on an architecture that does not malloc() from a direct map, page fault the kernel). Fix this by NULL'ing out the current media field in if_media.c, and have ixgbe update the current media type after recreating them. Submitted by: Matt Joras <matt.joras AT gmail DOT com> Reviewed by: sbruno, erj MFC after: 1 week Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D9164	2017-01-20 17:16:48 +00:00
Pedro F. Giffuni	f3a9adb9c7	Addition of clang nullability qualifiers. For consistency with the qualifiers added in r310977, define a new qualifier _Null_unspecified which is also defined in clang 3.7+. Add two new macros: __NULLABILITY_PRAGMA_PUSH __NULLABILITY_PRAGMA_POP These are for use in headers when we want avoid noisy warnings if some pointers are left without nullability annotations. These are added with way ahead of their first use to teach the GCC ports headers of their existance before their first use.	2017-01-20 15:56:40 +00:00
Hans Petter Selasky	e16c241deb	Remove superfluous return statement. MFC after: 1 week Sponsored by: Mellanox Technologies	2017-01-20 15:47:29 +00:00
Hans Petter Selasky	b98ba64027	Allow transmit packet bufring in software to be disabled. - Add new sysctl node to control the transmit packet bufring. - Add optimised version of the transmit routine which output packets directly to the DMA ring instead of using bufring in case the transmit lock is congested. This can reduce the number of taskswitches which in turn influence the overall system CPU usage, depending on the workload. - Add " TX" suffix to debug name for transmit mutexes to silence some witness warnings about aquiring duplicate locks having same name. MFC after: 1 week Sponsored by: Mellanox Technologies Suggested by: gallatin @	2017-01-20 15:45:21 +00:00
Josh Paetzel	f2be81e92c	MFV 312436 6569 large file delete can starve out write ops illumos/illumos-gate@ff5177ee8b `ff5177ee8b` https://www.illumos.org/issues/6569 The core issue I've found is that there is no throttle for how many deletes get assigned to one TXG. As a results when deleting large files we end up filling consecutive TXGs with deletes/frees, then write throttling other (more important) ops. There is an easy test case for this problem. Try deleting several large files (at least 1/2 TB) while you do write ops on the same pool. What we've seen is performance of these write ops (let's call it sideload I/O) would drop to zero. More specifically the problem is that dmu_free_long_range_impl() can/will fill up all of the dirty data in the pool "instantly", before many of the sideload ops can get in. So sideload performance will be impacted until all the files are freed. The solution we have tested at Nexenta (with positive results) creates a relatively simple throttle for how many "free" ops we let into one TXG. However this solution exposes other problems that should also be addressed. If we are to slow down freeing of data that means one has to wait even longer (assuming vnode ref count of 1) to get shell back after an rm or for NFS thread to finish the free-ing op. To avoid this the proposed solution is to call zfs_inactive() async for "large" files. Async freeing then begs for the reclaimed space to be accounted for in the zpool's "freeing" prop. The other issue with having a longer delete is the inability to export/unmount for a longer period of time. The proposed solution is to interrupt freeing of blocks when a fs is unmounted. Author: Alek Pinchuk <alek@nexenta.com> Reviewed by: Matt Ahrens <mahrens@delphix.com> Reviewed by: Sanjay Nadkarni <sanjay.nadkarni@nexenta.com> Reviewed by: Pavel Zakharov <pavel.zakharov@delphix.com> Approved by: Dan McDonald <danmcd@omniti.com> Reviewed by: avg Differential Revision: D9008	2017-01-20 15:01:04 +00:00
Ed Maste	039644eca9	ANSYfy kern_ktrace.c and remove archaic register keyword Sponsored by: The FreeBSD Foundation	2017-01-20 14:59:56 +00:00

1 2 3 4 5 ...

115082 Commits