freebsd-dev

Author	SHA1	Message	Date
John Baldwin	a4284ef768	Add a 10 millisecond delay after sending the initial INIT IPI. This matches the algorithm in the MP specification (1.4). Previously we were sending out the deassert INIT IPI immediately after the initial INIT IPI was sent.	2012-08-13 16:33:22 +00:00
Ed Maste	1221cc675d	Use array notation for consistency.	2012-08-13 16:03:30 +00:00
Andrey V. Elsukov	9131425146	Restore the old behaviour. If requested partition is a BSD slice, but d_partition isn't explicitly set, then try to open BSD label and its first partition.	2012-08-13 13:33:21 +00:00
Andrey V. Elsukov	baf3d8bfeb	Remove colons from the debug message, device name returned by the disk_fmtdev() already has the colons.	2012-08-13 13:10:50 +00:00
Andrey V. Elsukov	42b577fe76	Unbreak booting from the true dedicated disks. When we open the disk, check the type of partition table, that has been detected. If this is BSD label, then we assume this is DD mode. Reported by: dim@	2012-08-13 13:08:30 +00:00
Colin Percival	347c7fd7bf	Build modules along with the XENHVM kernels. No objections from: freebsd-xen mailing list MFC after: 1 week	2012-08-13 07:36:57 +00:00
Matt Jacob	64f202fc7d	Remove extraneous newline. MFC after: 1 month	2012-08-12 20:49:25 +00:00
Matt Jacob	405b7a2903	Fix an oops where we wiped out DMA maps. Don't allocate extended command space for anything less than a 2300. MFC after: 1 month X-MFC: 238869	2012-08-12 20:45:47 +00:00
Hans Petter Selasky	a015554f40	Compile fix. MFC after: 2 weeks	2012-08-12 19:58:56 +00:00
Hans Petter Selasky	a5cf1aaaff	Add support for the so-called streams feature of BULK endpoints in SUPER-speed mode, USB 3.0. This feature has not been tested yet, due to lack of hardware. This feature is useful when implementing protocols like UASP, USB attached SCSI which promises higher USB mass storage throughput. This patch also implements support for hardware processing of endpoints for increased performance. The switching to hardware processing of an endpoint is done via a callback to the USB controller driver. The stream feature is implemented like a variant of a hardware USB protocol. USB controller drivers implementing device mode needs to be updated to implement the new "xfer_stall" USB controller method and remove the "xfer" argument from the "set_stall" method. The API's toward existing USB drivers are preserved. To setup a USB transfer in stream mode, set the "stream_id" field of the USB config structure to the desired value. The maximum number of BULK streams is currently hardcoded and limited to 8 via a define in usb_freebsd.h. All USB drivers should be re-compiled after this change. LibUSB will be updated next week to support streams mode. A new IOCTL to setup BULK streams as already been implemented. The ugen device nodes currently only supports stream ID zero. The FreeBSD version has been bumped. MFC after: 2 weeks	2012-08-12 17:53:06 +00:00
Matt Jacob	5e6609a21a	1. Remove SEN support. I doubt there are any working examples of this hardware still running (close to twenty years now). 2. Quiesece and use ENC_VLOG instead of ENC_LOG for most complaints. That is, they're visible with bootverbose, but otherwise quiesced and not repeatedly spamming messages with constant reminders that hardware in this space is rarely fully compliant. MFC after: 1 month	2012-08-12 17:01:07 +00:00
Matt Jacob	10d4323996	Add missing VERIFY_10 definition. MFC after: 1 month	2012-08-12 16:58:38 +00:00
Andrey V. Elsukov	777085904e	Add another debug message.	2012-08-12 14:32:30 +00:00
Andrey V. Elsukov	e6232c0dea	Add more debug messages.	2012-08-12 14:16:21 +00:00
Adrian Chadd	1762ec944a	Revert the ath_tx_draintxq() method, and instead teach it the minimum necessary to "do" EDMA. It was just using the TX completion status for logging information about the descriptor completion. Since with EDMA we don't know this without checking the TX completion FIFO, we can't provide this information. So don't.	2012-08-12 00:46:15 +00:00
Adrian Chadd	788e6aa99c	Break out ath_draintxq() into a method and un-methodize ath_tx_processq(). Now that I understand what's going on with this, I've realised that it's going to be quite difficult to implement a processq method in the EDMA case. Because there's a separate TX status FIFO, I can't just run processq() on each EDMA TXQ to see what's finished. i have to actually run the TX status queue and handle individual TXQs. So: * unmethodize ath_tx_processq(); * leave ath_tx_draintxq() as a method, as it only uses the completion status for debugging rather than actively completing the frames (ie, all frames here are failed); * Methodize ath_draintxq(). The EDMA ath_draintxq() will have to take care of running the TX completion FIFO before (potentially) freeing frames in the queue. The only two places where ath_tx_draintxq() (on a single TXQ) are used: * ath_draintxq(); and * the CABQ handling in the beacon setup code - it drains the CABQ before populating the CABQ with frames for a new beacon (when doing multi-VAP operation.) So it's quite possible that once I methodize the CABQ and beacon handling, I can just drop ath_tx_draintxq() in its entirety. Finally, it's also quite possible that I can remove ath_tx_draintxq() in the future and just "teach" it to not check the status when doing EDMA.	2012-08-12 00:37:29 +00:00
David Xu	d7f97db7bd	Some style fixes inspired by @bde.	2012-08-11 23:48:39 +00:00
Adrian Chadd	e1252ce1d2	Extend the beacon code slightly to support AP mode beaconing for the EDMA HAL hardware. * The EDMA HAL code assumes the nexttbtt and intval values are in TU/8 units, rather than TU. For now, just "hack" around that here, at least until I code up something to translate it in the HAL. * Setup some different TXQ flags for EDMA hardware. * The EDMA HAL doesn't support setting the first rate series via ath_hal_setuptxdesc() - instead, a call to ath_hal_set11nratescenario() is always required. So for now, just do an 11n rate series setup for EDMA beacon frames. This allows my AR9380 to successfully transmit beacon frames. However, CABQ TX and all normal data frame TX and TX completion is still not functional and will require some more significant code churn to make work.	2012-08-11 23:26:19 +00:00
Adrian Chadd	0655b67b3c	Add the AR9380 HAL to the TX descriptor debugging, in order to dump all of the descriptor contents.	2012-08-11 22:39:27 +00:00
Adrian Chadd	4ddf2cc38c	Add the AR9300 HAL ID in to the 11n check routine. I was having TX hang issues, which I root caused to having the legacy ath_hal_setupxtxdesc() called, rather than the 11n rate scenario setup code. This meant that rate control information wasn't being put into frames, causing the MAC to stall/hang.	2012-08-11 22:25:28 +00:00
Adrian Chadd	3ae723d459	Begin fleshing out the TX FIFO support. * Add ATH_TXQ_FIRST() for easy tasting of what's on the list; * Add an "axq_fifo_depth" for easy tracking of how deep the current FIFO is; * Flesh out the handoff (mcast, hw) functions; * Begin fleshing out a TX ISR proc, which tastes the TX status FIFO. The legacy hardware stuffs the TX completion at the end of the final frame descriptor (or final sub-frame when doing aggregate.) So it's feasible to do a per-TXQ drain and process, as the needed info is right there. For EDMA hardware, there's a separate TX completion FIFO. So the TX process routine needs to read the single FIFO and then process the frames in each hardware queue. This makes it difficult to do a per-queue process, as you'll end up with frames in the TX completion FIFO for a different TXQ to the one you've passed to ath_tx_draintxq() or ath_tx_processq(). Testing: I've tested the TX queue and TX completion code in hostap mode on an AR9380. Beacon frames successfully transmit and the completion routine is called. Occasional data frames end up in TXQ 1 and are also successfully completed. However, this requires some changes to the beacon code path as: * The AR9380 beacon configuration API is now in TU/8, rather than TU; * The AR9380 TX API requires the rate control is setup using a call to setup11nratescenario, rather than having the try0 series setup (rate/tries for the first series); so the beacon won't go out. I'll follow this up with commits to the beacon code.	2012-08-11 22:20:28 +00:00
Alexander Motin	37f4e0254f	Some more minor tunings inspired by bde@.	2012-08-11 20:24:39 +00:00
Alexander Motin	bf89d544d0	Allow idle threads to steal second threads from other cores on systems with 8 or more cores to improve utilization. None of my tests on 2xXeon (2x6x2) system shown any slowdown from mentioned "excess thrashing". Same time in pbzip2 test with number of threads more then number of CPUs I see up to 10% speedup with SMT disabled and up 5% with SMT enabled. Thinking about trashing I was trying to limit that stealing within same last level cache, but got only worse results. Present code any way prefers to steal threads from topologically closer cores. Sponsored by: iXsystems, Inc.	2012-08-11 15:08:19 +00:00
Andrew Turner	d2a3296d1a	Move the decoding of the swi instruction to the syscall function. With the ARM EABI the syscall value will be moved to a register to ease adding thumb support. When this happens decoding of the instruction will no longer be required.	2012-08-11 05:58:56 +00:00
Warner Losh	56c265f2f9	Correct the PLLA setting functions and centralize.	2012-08-11 05:45:19 +00:00
Warner Losh	bcc1a5425a	Update comments about setting PLLA and refernce the tables in the datasheet that express the limits.	2012-08-11 05:12:46 +00:00
Warner Losh	19b89339dc	Don't use C++ comments.	2012-08-11 05:03:30 +00:00
David Xu	e8afbca2bc	tvtohz will print out an error message if a negative value is given to it, avoid this problem by detecting timeout earlier. Reported by: pho	2012-08-11 00:06:56 +00:00
Alexander Motin	579895df01	Some minor tunings/cleanups inspired by bde@ after previous commits: - remove extra dynamic variable initializations; - restore (4BSD) and implement (ULE) hogticks variable setting; - make sched_rr_interval() more tolerant to options; - restore (4BSD) and implement (ULE) kern.sched.quantum sysctl, a more user-friendly wrapper for sched_slice; - tune some sysctl descriptions; - make some style fixes.	2012-08-10 19:02:49 +00:00
Pawel Jakub Dawidek	9d18043979	Always initialize sc_ekey, because as of r238116 it is always used. If GELI provider was created on FreeBSD HEAD r238116 or later (but before this change), it is using very weak keys and the data is not protected. The bug was introduced on 4th July 2012. One can verify if its provider was created with weak keys by running: # geli dump <provider> \| grep version If the version is 7 and the system didn't include this fix when provider was initialized, then the data has to be backed up, underlying provider overwritten with random data, system upgraded and provider recreated. Reported by: Fabian Keil <fk@fabiankeil.de> Tested by: Fabian Keil <fk@fabiankeil.de> Discussed with: so MFC after: 3 days	2012-08-10 18:43:29 +00:00
Alexander Motin	9000aabf3b	sched_rr_interval() seems always returned period in hz ticks, but same always it was used as rate. Fix use side units to period in hz ticks.	2012-08-10 18:19:57 +00:00
Hans Petter Selasky	65b31289ae	Style. MFC after: 2 weeks	2012-08-10 17:43:20 +00:00
Hans Petter Selasky	1216d2c543	Remove unused structure field. MFC after: 2 weeks	2012-08-10 17:42:14 +00:00
Hans Petter Selasky	5805d1782d	Take advantage of new UCOM and bus functionality so that the device_detach() function doesn't block on UCOM device drivers until the TTY handle is closed by the userspace application. This is implemented by a postpone of the softc free where the UCOM structures reside until the TTY references are gone. Discussed with: kib, ed MFC after: 2 weeks	2012-08-10 15:29:41 +00:00
Hans Petter Selasky	8f42c74844	Switch unit management in UCOM to unrhdr. Extend the callback table of UCOM to include a "ucom_free" function pointer which is called when all refs on a UCOM super structure is gone. Implement various helper functions to handle refcounting and draining on the UCOM super structure. Implement macro which can be used in device drivers to avoid module unload before all pending TTY references are gone. The UCOM API is backwards compatible after this change and device drivers require no changes to function with this change. Only a recompilation of UCOM device drivers is required. The FreeBSD version has been bumped in that regard. Discussed with: kib, ed MFC after: 2 weeks	2012-08-10 15:21:12 +00:00
Hans Petter Selasky	ea1bd564ac	Add new device method to free the automatically allocated softc structure which is returned by device_get_softc(). This method can be used to easily implement softc refcounting. This can be desirable when the softc has memory references which are controlled by userspace handles for example. This solves the problem of blocking the caller of device_detach() for a non-deterministic time. Discussed with: kib, ed MFC after: 2 weeks	2012-08-10 15:02:49 +00:00
Hans Petter Selasky	2203949434	Style. MFC after: 2 weeks	2012-08-10 14:51:41 +00:00
Hans Petter Selasky	f83858fe37	Fix spelling. MFC after: 2 weeks	2012-08-10 14:44:04 +00:00
Alexander Motin	d9d6849693	Add missing FAILED event to g_raid_subdisk_event2str() to print it properly in debug messages. Submitted by: Dmitry Luhtionov <dmitryluhtionov@gmail.com>	2012-08-10 13:36:33 +00:00
Alan Cox	bab30462fb	Eliminate an unnecessary acquisition and release of the page queues lock from pmap_pte(). PT_SET_MA() is not a queued mapping update, but instead an immediate mapping update, so the page queues lock is not required here. Reviewed by: cperciva	2012-08-10 05:47:04 +00:00
Alan Cox	c0c5f0df71	Merge r134393 from amd64/i386: The machine-independent parts of the virtual memory system always pass a valid pmap to the pmap functions that require one. Remove the checks for NULL. (These checks have their origins in the Mach pmap.c that was integrated into BSD. None of the new code written specifically for FreeBSD included them.)	2012-08-10 05:00:50 +00:00
Warner Losh	cb8b429ed3	More comments about setting PLLA, or rather that we never do.	2012-08-10 04:48:06 +00:00
Warner Losh	e2af3b9a86	Add charge pump current register.	2012-08-10 04:47:20 +00:00
Alexander Motin	3d7f41175d	Rework r220198 change (by fabient). I believe it solves the problem from the wrong direction. Before it, if preemption and end of time slice happen same time, thread was put to the head of the queue as for only preemption. It could cause single thread to run for indefinitely long time. r220198 handles it by not clearing TDF_NEEDRESCHED in case of preemption. But that causes delayed context switch every time preemption happens, even when not needed. Solve problem by introducing scheduler-specifoc thread flag TDF_SLICEEND, set when thread's time slice is over and it should be put to the tail of queue. Using SW_PREEMPT flag for that purpose as it was before just not enough informative to work correctly. On my tests this by 2-3 times reduces run time deviation (improves fairness) in cases when several threads share one CPU. Reviewed by: fabient MFC after: 2 months Sponsored by: iXsystems, Inc.	2012-08-09 19:26:13 +00:00
Alexander Motin	48317e9e27	SCHED_4BSD scheduling quantum mechanism appears to be broken for some time. With switchticks variable being reset each time thread preempted (that is done regularly by interrupt threads) scheduling quantum may never expire. It was not noticed in time because several other factors still regularly trigger context switches. Handle the problem by replacing that mechanism with its equivalent from SCHED_ULE called time slice. It is effectively the same, just measured in context of stathz instead of hz. Some unification is probably not bad.	2012-08-09 18:09:59 +00:00
Alan Cox	b3ca34cfd2	Merge r132141 and r111272 from amd64/i386: Reduce the size of a PV entry by eliminating pv_ptem. There is no need to store a pointer to the page table page in the PV entry because it is easily computed during the walk down the page table. Eliminate the ptphint from the pmap. Long, long ago, page table pages belonged to a vm object, and we would look up page table pages based upon their offset within this vm object. In those days, this hint may have had tangible benefits. Tested by: jchandra	2012-08-09 16:38:17 +00:00
Ed Maste	2f70fca5ec	Improve lock and unlock symmetry - Move destruction of per-ring locks to netmap_dtor_locked to mirror the initialization that happens in NIOCREGIF. Otherwise unloading a netmap- capable interface that was never put into netmap mode would try to mtx_destroy an uninitialized mutex, and panic. - Destroy core_lock in netmap_detach, mirroring init in netmap_attach. - Also comment out the knlist_destroy for now as there is currently no knlist_init. Sponsored by: ADARA Networks Reviewed by: luigi@	2012-08-09 14:46:52 +00:00
Matt Jacob	94dff77179	More rototilling with target mode in an attemp to get multiple CCB at a time outstanding reliable. It's not there yet, but this is the direction to go in so might as well commit. So far, multiple at a time CCBs work (see ISP_INTERNAL_TARGET test mode), but it fails if there are more downstream than the SIM wants to handle and SRR is sort of confused when this happens, plus it is not entirely quite clear what one does if a CCB/CTIO fails and you have more in flight (that don't fail, say) and more queued up at the SIM level that haven't been started yet. Some of this is driven because there apparently is no flow control to requeue XPT_CONTINUE_IO requests like there are for XPT_SCSI_IO requests. It is also more driven in that the few target mode periph drivers there are are not really set up for handling pushback- heck most of them don't even check for errors (and what would they really do with them anyway? It's the initiator's problem, really....). The data transfer arithmetic has been worked over again to handle multiple outstanding commands, so you have a notion of what's been moved already as well as what's currently in flight. It turns that this led to uncovering a REPORT_LUNS bug in the ISP_INTERNAL_TARGET code which was sending back 24 bytes of rpl data instead of the specified 16. What happened furthermore here is that sending back 16 bytes and reporting an overrun of 8 bytes made the initiator (running FC-Tape aware f/w) mad enough to request, and keep requesting, another FCP response (I guess it didn't like the answer so kept asking for it again). Sponsored by: Spectralogic MFC after: 1 month	2012-08-08 18:24:33 +00:00
Adrian Chadd	2a72e4b222	Fix an incorrect comment.	2012-08-08 17:16:06 +00:00
Ed Maste	0bf8895411	Fix whitespace (missing newline)	2012-08-08 15:28:29 +00:00
Ed Maste	24e57ec96d	Clarify comments about number of tx / rx rings	2012-08-08 15:27:01 +00:00
Alan Cox	663f8700d4	The assertion that I added in r238889 could legitimately fail when a debugger creates a breakpoint. Replace that assertion with a narrower one that still achieves my objective. Reported and tested by: kib	2012-08-08 05:28:30 +00:00
Konstantin Belousov	c0c6e95f7f	Always initialize pl_event. Submitted by: Andrey Zonov <andrey@zonov.org> MFC after: 3 days	2012-08-08 00:20:30 +00:00
Adrian Chadd	0a1a3230b6	Commit device IDs for the (eventually upcoming) AR9380 HAL. Obtained from: Qualcomm Atheros, Linux ath9k	2012-08-07 23:45:43 +00:00
Jim Harris	7bfcb3bb9b	During TSC synchronization test, use rdtsc() rather than rdtsc32(), to protect against 32-bit TSC overflow while the sync test is running. On dual-socket Xeon E5-2600 (SNB) systems with up to 32 threads, there is non-trivial chance (2-3%) that TSC synchronization test fails due to 32-bit TSC overflow while the synchronization test is running. Sponsored by: Intel Reviewed by: jkim Discussed with: jkim, kib	2012-08-07 23:16:11 +00:00
Jim Harris	82a6ae1009	Clone BIO_ORDERED flag, for disk drivers (namely CAM) that try to consume it. Sponsored by: Intel Discussed with: gibbs, scottl	2012-08-07 20:16:10 +00:00
Mikolaj Golub	1d9db37c77	In g_gate_dumpconf() always check the result of g_gate_hold(). This fixes "Negative sc_ref" panic possible when sysctl_kern_geom_confxml() is run simultaneously with destroying GATE device. Reviewed by: pjd MFC after: 3 days	2012-08-07 18:50:33 +00:00
John Baldwin	1710852ebd	Don't try to stop the IPMI watchdog timer if it is not running. Starting or stopping the IPMI watchdog is rather expensive with the current implementation as all IPMI requests are bounced via thread. This is not viable during shutdown or dumps, and this avoids headache in the common case that the watchdog is not enabled. The IPMI watchdog should probably be reworked to not use a separate thread to fix this in the case when the watchdog timer is enabled. MFC after: 2 weeks	2012-08-07 12:40:31 +00:00
Andrey V. Elsukov	06f2b92916	As it turned out, there are some installations, where BSD label contains partitions with type zero. And it has worked. So, allow detect these partitions. Reported by: glebius	2012-08-07 09:22:46 +00:00
Warner Losh	b7e39c683a	Fix obvious problem with emulate_fp sysctl. Submitted by: Paul Ambrose <ambrosehua@gmail.com>	2012-08-07 08:37:35 +00:00
Konstantin Belousov	65211d02c4	Do not apply errata 721 workaround when under hypervisor, since typical hypervisor does not implement access to the required MSR, causing #GP on boot. Reported and tested by: olgeni PR: amd64/170388 MFC after: 3 days	2012-08-07 08:36:10 +00:00
Luigi Rizzo	e5813a3bce	s/lenght/length/ in comments	2012-08-07 07:52:25 +00:00
Sergey Kandaurov	16ec457aeb	Remove duplicate header inclusion of <sys/sysent.h> Discussed with: bz	2012-08-07 05:46:36 +00:00
Alan Cox	18f55b0171	Never sleep on busy pages in vm_pageout_launder(), always skip them. Long ago, sleeping on busy pages in vm_pageout_launder() made sense. The call to vm_pageout_flush() specified asynchronous I/O and sleeping on busy pages blocked vm_pageout_launder() until the flush had completed. However, in CVS revision 1.35 of vm/vm_contig.c, the call to vm_pageout_flush() was changed to request synchronous I/O, but the sleep on busy pages was not removed.	2012-08-07 04:48:14 +00:00
Adrian Chadd	d2da554492	Correct re-initialise the link pointer to be the final descriptor in the last buffer. This fixes traffic stalls that were occuring with stuck beacon events. PR: kern/170433	2012-08-07 00:42:46 +00:00
Adrian Chadd	a176030864	Remove unnecessary debugging printf()s.	2012-08-06 22:54:10 +00:00
Davide Italiano	29f44c62f7	Remove a spurious bace which cause build fail in case BCE_DEBUG option is turned on. Reviewed by: delphij Approved by: gnn (mentor) Sponsored by: Google Summer of Code 2012 MFC after: 1 week	2012-08-06 22:53:24 +00:00
Jack F Vogel	724f79462b	Make the polling interface in igb able to handle multiqueue, and correct the rxdone handling. Update the polling man page to include igb as well. Thanks to Mark Johnston for these changes.	2012-08-06 22:43:49 +00:00
Brooks Davis	12a14de08e	MFP4 change 214967: Make the process of embedding MDROOT images less perilous by makeing the target that links kernel and embedding the image depend on the image. This means, if the image doesn't exist you find out before you try to boot from it and that if you change the image you don't have to touch some random source file to cause a rebuild. Don't hide that we're embedding the image.	2012-08-06 21:24:43 +00:00
Jack F Vogel	6aa4d618ca	Correct the mq_start routine to avoid out-of-order packet delivery, always enqueue when possible. Also correct the DEPLETED test as multiple bits might be set. Thanks to Randall Stewart for the changes!	2012-08-06 20:44:05 +00:00
Dimitry Andric	d265aafb5a	In sys/dev/aic7xxx/aic79xx_osm.c, function ahd_attach(), initialize the 'path' variable to NULL, to avoid using it uninitialized in certain cases. Found by: clang MFC after: 1 week	2012-08-06 20:01:32 +00:00
John Baldwin	ef8881526b	Explicitly enable busmastering on PCI-PCI bridges. Transactions initiated on the secondary side of a bridge will not be propagated to the primary bus unless this is enabled. Busmastering is not enabled by default (we have relied on firmware to set this bit to date). The OS needs to set it for any bridges not configured by system firmware. Tested by: Steve Polyack korvus comcast net MFC after: 2 weeks	2012-08-06 19:49:57 +00:00
Dimitry Andric	daccbb811d	In sys/dev/cxgbe/firmware/t4fw_interface.h, change the enum 'fw_hdr_intfver' into an anonymous enum, which avoids a clang 3.2 warning about all the enum values being the same value. Reviewed by: np MFC after: 1 week	2012-08-06 18:54:17 +00:00
Dimitry Andric	25982a6e2d	In cxgb(4), in function iwch_reregister_phys_mem(), initialize the 'npages' variable to zero, to avoid using it uninitialized in certain cases. Found by: clang Reviewed by: np MFC after: 1 week	2012-08-06 18:51:14 +00:00
Alexander Kabaev	c9516c94b4	Do not add handler to event handlers list until ithread is created. In rare event when fast and ithread interrupts share the same vector and the fast handler was registered first, we can end up trying to schedule the ithread that is not created yet. The kernel built with INVARIANTS then triggers an assertion. Change the order to create the ithread first and only then add the handler that needs it to the interrupt event handlers list. Reviewed by: jhb	2012-08-06 16:37:43 +00:00
Luigi Rizzo	17369272e4	move functions outside the SYSBEGIN/SYSEND block (SYSBEGIN/SYSEND are specific to ipfw/dummynet and are used to emulate sysctl on platforms that do not have them, and they work by creating an array which contains all the sysctl-ed symbols.)	2012-08-06 11:02:23 +00:00
Luigi Rizzo	00c4633285	use FREE_PKT instead of m_freem to free an mbuf. The former is the standard form used in ipfw/dummynet, so that it is easier to remap it to different memory managers depending on the platform.	2012-08-06 10:50:43 +00:00
Michael Tuexen	55b175e747	Fix a bug found by dim@: Don't use an uninitilized variable, if INVARIANTS is on and an illegal packet with destination 0 is received. MFC after: 3 days X-MFC with: 238003	2012-08-06 10:50:23 +00:00
Marius Strobl	60df250ae1	- Merge from NetBSD: When issuing a non-DMA command, make sure to set the "remaining length of command to be transferred via DMA" (sc_cmdlen) to zero up-front, otherwise we might get confused on command competition interrupt (no DMA active but still data left to transfer). - Implement handling of MSG_IGN_WIDE_RESIDUE which some targets produce, as just rejecting these leads to a resend and disconnect loop. Reported and tested by: mjacob MFC after: 3 days	2012-08-06 08:58:54 +00:00
Andrey V. Elsukov	a2bdef2a84	Fix start offset calculation for the EBR partitions.	2012-08-06 08:54:06 +00:00
Xin LI	0af671184d	Add PCI IDs for various new High Point RocketRAID 43xx and 3xxx devices. Obtained from: FreeNAS MFC after: 3 days	2012-08-06 05:27:26 +00:00
Marius Strobl	55afc4edf1	Merge r236494 from x86: Isolate the global TTE list lock from data and other locks to prevent false sharing within the cache. MFC after: 3 days	2012-08-05 22:03:13 +00:00
Marius Strobl	787c338407	Include <vm/vm_param.h> for PA_LOCK_COUNT in order to fix kernel build with options ZFS after r239065.	2012-08-05 20:19:27 +00:00
Eitan Adler	0dfbbace0d	Add support for SIIG Cyber Serial Dual PCI 16C850 Submitted by: David Boyd David.Boyd@insightbb.com Approved by: cperciva MFC after: 3 days	2012-08-05 19:37:18 +00:00
Mikolaj Golub	655f934b78	In tcp timers, check INP_DROPPED flag a little later, after callout_deactivate(), so if INP_DROPPED is set we return with the timer active flag cleared. For me this fixes negative keep timer values reported by `netstat -x' for connections in CLOSE state. Approved by: net (silence) MFC after: 2 weeks	2012-08-05 17:30:17 +00:00
Andrey V. Elsukov	3db1bfd80d	Bump USERBOOT_VERSION. Requested by: dfr	2012-08-05 17:04:58 +00:00
Alan Cox	59fa03faa3	Shave off a few more cycles from the average execution time of pmap_enter() by simplifying the control flow and reducing the live range of "om".	2012-08-05 16:59:02 +00:00
Yoshihiro Takahashi	7361df284b	Reduce diffs against i386.	2012-08-05 14:48:55 +00:00
Andrey V. Elsukov	4c89da6c18	Teach the ZFS use new partitions API when probing. Note: now ZFS does probe only for partitions with type "freebsd-zfs" and "freebsd".	2012-08-05 14:48:28 +00:00
Andrey V. Elsukov	6f6aad97b6	Remove unneeded flag.	2012-08-05 14:38:53 +00:00
Andrey V. Elsukov	a86f714d15	Add offset field to the i386_devdesc structure to be compatible with disk_devdesc structure. Update biosdisk driver to the new disk API.	2012-08-05 14:37:48 +00:00
Konstantin Belousov	1c771f9222	After the PHYS_TO_VM_PAGE() function was de-inlined, the main reason to pull vm_param.h was removed. Other big dependency of vm_page.h on vm_param.h are PA_LOCK* definitions, which are only needed for in-kernel code, because modules use KBI-safe functions to lock the pages. Stop including vm_param.h into vm_page.h. Include vm_param.h explicitely for the kernel code which needs it. Suggested and reviewed by: alc MFC after: 2 weeks	2012-08-05 14:11:42 +00:00
Yoshihiro Takahashi	d68fc999de	Reduce diffs against i386.	2012-08-05 14:11:07 +00:00
Yoshihiro Takahashi	72b179343b	MFi386: the part of r219452 and r236405. - bunch of variables are turned into uint8_t. - Remove unnecessary initializations.	2012-08-05 14:05:11 +00:00
Andrey V. Elsukov	95caf34849	When GPT signature is invalid in the primary GPT header, then try to read backup GPT header. Submitted by: hrs (previous version)	2012-08-05 12:57:38 +00:00
Andrey V. Elsukov	2c6f04dc0a	Introduce new API to work with disks from the loader's drivers. It uses new API from the part.c to work with partition tables. Update userboot's disk driver to use new API. Note that struct loader_callbacks_v1 has changed.	2012-08-05 12:15:15 +00:00
Andrey V. Elsukov	9cef000008	Remove unused variables.	2012-08-05 11:59:46 +00:00
Hans Petter Selasky	3426950e9c	Update the list of devices supported by the FTDI driver. It might be that the wrong UART reference clock will be used for a few of the IDs. It is currently not possible to figure that out because the Linux FTDI driver detects this run-time and not compile time based on the bcdDevice field of the USB device descriptor. Some of the ID's in usbdevs are not sorted according to the product ID value. Please feel free to fix this. I'm out of my xemacs magic today. This syncronises us with the linux kernel at kernel.org (HEAD). MFC after: 2 weeks	2012-08-05 11:50:56 +00:00
Andrey V. Elsukov	a0ff532148	Create the interface to work with various partition tables from the loader(8). The following partition tables are supported: BSD label, GPT, MBR, EBR and VTOC8.	2012-08-05 11:29:44 +00:00
Adrian Chadd	fffbec8618	Migrate the 802.11n ath_hal_chaintxdesc() API to use a buffer/segment array, similar to what filltxdesc() uses. This removes the last reference to ds_data in the TX path outside of debugging statements. These need to be adjusted/fixed. Tested: * AR9280 STA/AP with iperf TCP traffic	2012-08-05 11:24:21 +00:00
Michael Tuexen	63c6726e05	Fix a refcount issue. The called only decrements is stcb is NULL. MFC after: 3 days Discussed with: rrs	2012-08-05 10:47:18 +00:00
Adrian Chadd	46634305f4	Migrate the ath_hal_filltxdesc() API to take a list of buffer/seglen values. The existing API only exposes 'seglen' (the current buffer (segment) length) with the data buffer pointer set in 'ds_data'. This is fine for the legacy DMA engine but it won't work for the EDMA engines. The EDMA engine has a significantly different TX descriptor layout. * The legacy DMA engine had a ds_data pointer at the same offset in the descriptor for both TX and RX buffers; * The EDMA engine has no ds_data for RX - the data is DMAed after the descriptor; * The EDMA engine has support for 4 TX buffer/segment pairs in the TX DMA descriptor; * The EDMA TX completion is in a different FIFO, and the driver will 'link' the status completion entry to a QCU by a "QCU ID". I don't know why it's just not filled in by the hardware, alas. So given that, here are the changes: * Instead of directly fondling 'ds_data' in ath_desc, change the ath_hal_filltxdesc() to take an array of buffer pointers as well as segment len pointers; * The EDMA TX completion status wants a descriptor and queue id. This (for now) uses bf_state.bfs_txq and will extract the hardware QCU ID from that. * .. and this is ugly and wasteful; it should change to just store the QCU in the bf_state and save 3/7 bytes in the process. Now, the weird crap: * The aggregate TX path was using bf_state->bfs_txq for the TXQ, rather than taking a function argument. I've tidied that up. * The multicast queue frames get put on a software TXQ and then that is appended to the hardware CABQ when appropriate. So for now, make sure that bf_state->bfs_txq points at the CABQ when adding frames to the multicast queue. * .. but the multicast queue TX path for now doesn't use the software queue and instead (a) directly sets up the descriptor contents at that point; (b) the frames on the vap->avp_mcastq are then just appended wholesale to the CABQ. So for now, I don't have to worry about making the multicast path work with aggregation or the per-TID software queue. Phew. What's left to do: * I need to modify the 11n ath_hal_chaintxdesc() API to do the same. I'll do that in a subsequent commit. * Remove bf_state.bfs_txq entirely and store the QCU as appropriate. * .. then do the runtime "is this going on the right HWQ?" checks using that, rather than comparing pointer values. Tested on: * AR9280 STA/AP * AR5416 STA/AP	2012-08-05 10:12:27 +00:00
Hans Petter Selasky	7a27d904bd	Minor style nit: Use the interface number from the USB interface descriptor like in the other USB serial drivers. These numbers are not supposed to be different, though in theory they can. Make sure that the driver then uses the interface number given by the USB descriptor, and not the logical index of the USB stack. For the future: Whenever the term "index" is used in the USB code, it refers to a number computed by the USB stack. Whenever the term "number" is used in the USB code, it refers to a number in a USB descriptor. MFC after: 2 weeks	2012-08-05 08:56:29 +00:00
Eitan Adler	edfaa73726	Add additional Perle Speed LE serial cards PR: kern/168816 Submitted by: Dennis Oyama <doyama@perle.com> Approved by: cperciva MFC after: 1 week	2012-08-05 08:10:02 +00:00
Eitan Adler	2bd29f8f66	Remove variables which are initialized but never used thereafter reported by gcc46 warning Reviewed by: scottl Approved by: cperciva MFC after: 1 week	2012-08-05 08:08:34 +00:00
Andreas Tobler	7f7acad770	Fix typo. Reviewed by: jhb	2012-08-05 06:15:12 +00:00
Michael Tuexen	832208514f	Fix a bug reported by Simon L. B. Nielsen: If an SCTP endpoint receives an ASCONF with a wildcard lookup address and incorrect verification tag, the system crashes. MFC after: 3 days.	2012-08-04 20:40:36 +00:00
Konstantin Belousov	0055cbd3c5	Reduce code duplication and exposure of direct access to struct vm_page oflags by providing helper function vm_page_readahead_finish(), which handles completed reads for pages with indexes other then the requested one, for VOP_GETPAGES(). Reviewed by: alc MFC after: 1 week	2012-08-04 18:16:43 +00:00
Konstantin Belousov	843dcea09e	The header uma_int.h is internal uma header, unused by this source file. Do not include it needlessly. Reviewed by: alc MFC after: 1 week	2012-08-04 18:12:54 +00:00
Konstantin Belousov	2ddfc13d8d	Remove verbose unused commented out debugging printf. MFC after: 1 week Reviewed by: alc	2012-08-04 18:10:04 +00:00
Gavin Atkinson	b5ba3bdb25	Support multiple interface devices. The driver had previously hardcoded support for only the first port, but the CP2105 can have multiple ports. Although this allowed the first port to mostly work on multi port devices, there could be issues with this arrangement. Update the man page to reflect support for both ports and the CP2105. Many thanks to Silicon Labs (www.silabs.com) for providing a CP2105-EK dev board for testing. MFC after: 2 weeks	2012-08-04 15:11:36 +00:00
Alexander Motin	2038943013	Particlly MFcalloutng r238425 (by davide): Fix an issue related to old periodic timers. The code in kern_clocksource.c uses interrupt to keep track of time, and this time may not match with binuptime(). In order to address such incoherency, switch periodic timers to binuptime(). Except further calloutng it is needed for already present cyclic subsystem.	2012-08-04 08:06:37 +00:00
Michael Tuexen	173be2b6cd	Testing an interface property should depend on the interface, not on an address. MFC after: 3 days	2012-08-04 08:03:30 +00:00
Alexander Motin	9b71c63a8b	Partialy MFcalloutng r236894 (by davide): ... While here, Bruce Evans told me that "unsigned int" is spelled "u_int" in KNF, so replace it where needed.	2012-08-04 07:46:58 +00:00
Andrew Turner	c75943da26	Correctly return EFAULT in copyin & copyout on a fault. This fixes NFS when running FreeBSD on QEMU emulating a Gumstix board. While here remove the use of a magic number in the not-XScale version. Pointed out by: kib Reviewed by: stas	2012-08-04 05:38:25 +00:00
Andrew Turner	bfba812656	Ensure we align the stack to 8 bytes in system calls. This is not strictly required with the current ABI but will be when we switch to the ARM EABI. The aapcs requires the stack to be 4 byte aligned at all times and 8 byte aligned when calling a public subroutine where the current ABI only requires sp to be a multiple of 4.	2012-08-04 05:31:26 +00:00
Justin Hibbits	d0ec68d41c	Add backlight support for nVidia-based PowerBooks/iBooks/iMacs. Approved by: nwhitehorn (mentor) MFC after: 9.1-RELEASE	2012-08-04 03:05:01 +00:00
Matt Jacob	2468ec31b3	Add detach logic to SBus variant. Obtained from: Marius MFC after: 1 month	2012-08-04 00:00:30 +00:00
Jim Harris	c1d00eabe8	In virstor_ctl_stop(), check for a valid softc before trying to update metadata. Sponsored by: Intel Reported and tested by: Marcelo Gondim <gondim at bsdinfo dot com dot br> PR: kern/170199 MFC after: 3 days	2012-08-03 20:24:16 +00:00
John Baldwin	0046805a58	Correct function name in comment. Submitted by: alc	2012-08-03 18:40:44 +00:00
Alexander Motin	b19ee1c6ef	Microoptimize LAPIC timer routines to avoid reading from hardware during programming using earlier cached values. This makes respective routines to disappear from PMC top and reduces total number of active CPU cycles on idle 24-core system by 10%.	2012-08-03 15:19:59 +00:00
Thomas Quinot	71ee4ef0d9	New command "gmultipath prefer" to force selection of a specified provider in an Active/Passive configuration. Reviewed by: mav MFC after: 4 weeks	2012-08-03 14:55:35 +00:00
Matt Jacob	de0627d021	Oops. We only do allocate room for extended commands and responses for 2300 cards are newer. Sponsored by: Spectralogic Noticed by: Our Friend Manfred MFC after: 1 month X-MFC: 238869	2012-08-03 14:25:35 +00:00
Luigi Rizzo	1463ca4a8f	support building vtnet as an embedded driver	2012-08-03 14:00:26 +00:00
John Baldwin	2db99100a4	Improve the handling of static DMA buffers that use non-default memory attributes (currently just BUS_DMA_NOCACHE): - Don't call pmap_change_attr() on the returned address, instead use kmem_alloc_contig() to ask the VM system for memory with the requested attribute. - As a result, always use kmem_alloc_contig() for non-default memory attributes, even for sub-page allocations. This requires adjusting bus_dmamem_free()'s logic for determining which free routine to use. - For x86, add a new dummy bus_dmamap that is used for static DMA buffers allocated via kmem_alloc_contig(). bus_dmamem_free() can then use the map pointer to determine which free routine to use. - For powerpc, add a new flag to the allocated map (bus_dmamem_alloc() always creates a real map on powerpc) to indicate which free routine should be used. Note that the BUS_DMA_NOCACHE handling in powerpc is currently #ifdef'd out. I have left it disabled but updated it to match x86. Reviewed by: scottl MFC after: 1 month	2012-08-03 13:50:29 +00:00
Alexander Motin	2c2e2be746	Remove duplicate check. Submitted by: Dmitry Luhtionov <dmitryluhtionov@gmail.com>	2012-08-03 12:55:31 +00:00
Alexander Motin	c0722d20d3	Microoptimize time math. As soon as our event periods are always below ome second we may not add intereger parts by using bintime_addx() instead of bintime_add(). Profiling shows handleevents() time redction by 15%.	2012-08-03 09:08:20 +00:00
Alan Cox	369763e31a	Inline vm_page_aflags_clear() and vm_page_aflags_set(). Add comments stating that neither these functions nor the flags that they are used to manipulate are part of the KBI.	2012-08-03 01:48:15 +00:00
Xin LI	0f3fae6159	Correct a typo. Reported by: Sascha Wildner <swildner dragonflybsd org> Reviewed by: scottl MFC after: 3 days	2012-08-03 00:11:13 +00:00
Adrian Chadd	a6e829596d	Fix an issue that crept in with the previous descriptor tidyup. When forming aggregates, the last descriptor was now not being correctly setup - instead, the "setuplasttxdesc" call was being handed the first descriptor in the last subframe, rather than the last descriptor in the last subframe. This showed up as "bad series0 hwrate" messages, as the final descriptor just didn't have any of the rate control information squirreled away. Tested: * AR9280 STA -> 11n AP, iperf TCP	2012-08-02 20:14:45 +00:00
Jaakko Heinonen	8cb51643e4	Disallow sectorsize larger than MAXPHYS and mediasize smaller than sectorsize. PR: 169947 Submitted by: Filip Palian (original version) Reviewed by: kib	2012-08-02 15:05:34 +00:00
Gleb Smirnoff	ea53792942	Fix races between in_lltable_prefix_free(), lla_lookup(), llentry_free() and arptimer(): o Use callout_init_rw() for lle timeout, this allows us safely disestablish them. - This allows us to simplify the arptimer() and make it race safe. o Consistently use ifp->if_afdata_lock to lock access to linked lists in the lle hashes. o Introduce new lle flag LLE_LINKED, which marks an entry that is attached to the hash. - Use LLE_LINKED to avoid double unlinking via consequent calls to llentry_free(). - Mark lle with LLE_DELETED via \|= operation istead of =, so that other flags won't be lost. o Make LLE_ADDREF(), LLE_REMREF() and LLE_FREE_LOCKED() more consistent and provide more informative KASSERTs. The patch is a collaborative work of all submitters and myself. PR: kern/165863 Submitted by: Andrey Zonov <andrey zonov.org> Submitted by: Ryan Stone <rysto32 gmail.com> Submitted by: Eric van Gyzen <eric_van_gyzen dell.com>	2012-08-02 13:57:49 +00:00
Gleb Smirnoff	b1d86af706	The llentry_update() is used only by flowtable and the latter always passes NULL pointer to it. Thus, code can be simplified and function renamed to llentry_alloc() to match rtalloc().	2012-08-02 13:20:44 +00:00
Luigi Rizzo	46f2f751e1	replace __unused with a portable construct; fix a couple of signed/unsigned warnings.	2012-08-02 12:45:13 +00:00
Luigi Rizzo	b3d5301688	fix some signed/unsigned warnings in the netmap code. Unfortunately the original drivers still have a lot of sign conversion/comparison warnings.	2012-08-02 11:59:43 +00:00
Luigi Rizzo	42a3a5bd91	Add a newline on an error message; rename linux functions to avoid confusion; fix error reporting on linux	2012-08-02 07:35:40 +00:00
Sean Bruno	8844c80848	CPU_NEXT() already handles wrapping around to the beginning. Also, in a system with sparse CPU IDs, you can have a valid CPU ID > mp_ncpus (e.g. if you have two CPUs 0 and 4, with mp_maxid == 4 and mp_ncpus == 2). Introduced at svn r235210 Submitted by: jhb@ Reviewed by: jfv@	2012-08-02 00:00:34 +00:00
Tai-hwa Liang	50e91f8ce1	Just like the other file systems found in /sys/fs, g_vfs_open() should be paried with g_vfs_close(). Though g_vfs_close() is a wrapper around g_wither_geom_close(), r206130 added the following test in g_vfs_open(): if (bo->bo_private != vp) return (EBUSY); Which will cause a 'Device busy' error inside reiserfs_mountfs() if the same file system is re-mounted again after umount or mounting failure: (case 1, /dev/ad4s3 is not a valid REISERFS partition) # mount -t reiserfs -o ro /dev/ad4s3 /mnt mount: /dev/ad4s3: Invalid argument # mount -t msdosfs -o ro /dev/ad4s3 /mnt mount: /dev/ad4s3: Device busy (case 2, /dev/ad4s3 is a valid REISERFS partition) # mount -t reiserfs -o ro /dev/ad4s3 /mnt # umount /mnt # mount -t reiserfs -o ro /dev/ad4s3 /mnt mount: /dev/ad4s3: Device busy On the other hand, g_vfs_close() 'fixed' the above cases by doing an extra step to keep 'sc->sc_bo->bo_private' and 'cp->private' pointers synchronised. Reviewed by: kib MFC after: 1 month	2012-08-01 23:05:57 +00:00
Luigi Rizzo	f5705b527d	replace inet_ntoa_r with the more standard inet_ntop(). As discussed on -current, inet_ntoa_r() is non standard, has different arguments in userspace and kernel, and almost unused (no clients in userspace, only net/flowtable.c, net/if_llatbl.c, netinet/in_pcb.c, netinet/tcp_subr.c in the kernel)	2012-08-01 18:52:07 +00:00
Luigi Rizzo	71ca24f182	add a cast to avoid a signed/unsigned warning (to be removed when we will have TUNABLE_UINT constructors)	2012-08-01 18:49:00 +00:00
Konstantin Belousov	e1a18e46e1	Do a trivial reformatting of the comment, to record the proper commit message for r238973: Rdtsc instruction is not synchronized, it seems on some Intel cores it can bypass even the locked instructions. As a result, rdtsc executed on different cores may return unordered TSC values even when the rdtsc appearance in the instruction sequences is provably ordered. Similarly to what has been done in r238755 for TSC synchronization test, add explicit fences right before rdtsc in the timecounters 'get' functions. Intel recommends to use LFENCE, while AMD refers to MFENCE. For VIA follow what Linux does and use LFENCE. With this change, I see no reordered reads of TSC on Nehalem. Change the rmb() to inlined CPUID in the SMP TSC synchronization test. On i386, locked instruction is used for rmb(), and as noted earlier, it is not enough. Since i386 machine may not support SSE2, do simplest possible synchronization with CPUID. MFC after: 1 week Discussed with: avg, bde, jkim	2012-08-01 17:34:43 +00:00
Alexander Motin	61c49b4dd1	Several fixes to allow firmware/BIOS flash access from user-level: - remove special handling of zero length transfers in mpi_pre_fw_upload(); - add missing MPS_CM_FLAGS_DATAIN flag in mpi_pre_fw_upload(); - move mps_user_setup_request() call into proper place; - increase user command timeout from 30 to 60 seconds; - avoid NULL dereference panic in case of firmware crash. Set max DMA segment size to 24bit, as MPI SGE supports it. Use mps_add_dmaseg() to add empty SGE instead of custom code. Tune endianness safety. Reviewed by: Desai, Kashyap <Kashyap.Desai@lsi.com> Sponsored by: iXsystems, Inc.	2012-08-01 17:31:31 +00:00
Konstantin Belousov	814124c33e	diff --git a/sys/x86/x86/tsc.c b/sys/x86/x86/tsc.c index c253a96..3d8bd30 100644 --- a/sys/x86/x86/tsc.c +++ b/sys/x86/x86/tsc.c @@ -82,7 +82,11 @@ static void tsc_freq_changed(void arg, const struct cf_level level, static void tsc_freq_changing(void arg, const struct cf_level level, int status); static unsigned tsc_get_timecount(struct timecounter tc); -static unsigned tsc_get_timecount_low(struct timecounter tc); +static inline unsigned tsc_get_timecount_low(struct timecounter tc); +static unsigned tsc_get_timecount_lfence(struct timecounter tc); +static unsigned tsc_get_timecount_low_lfence(struct timecounter tc); +static unsigned tsc_get_timecount_mfence(struct timecounter tc); +static unsigned tsc_get_timecount_low_mfence(struct timecounter tc); static void tsc_levels_changed(void arg, int unit); static struct timecounter tsc_timecounter = { @@ -262,6 +266,10 @@ probe_tsc_freq(void) (vm_guest == VM_GUEST_NO && CPUID_TO_FAMILY(cpu_id) >= 0x10)) tsc_is_invariant = 1; + if (cpu_feature & CPUID_SSE2) { + tsc_timecounter.tc_get_timecount = + tsc_get_timecount_mfence; + } break; case CPU_VENDOR_INTEL: if ((amd_pminfo & AMDPM_TSC_INVARIANT) != 0 \|\| @@ -271,6 +279,10 @@ probe_tsc_freq(void) (CPUID_TO_FAMILY(cpu_id) == 0xf && CPUID_TO_MODEL(cpu_id) >= 0x3)))) tsc_is_invariant = 1; + if (cpu_feature & CPUID_SSE2) { + tsc_timecounter.tc_get_timecount = + tsc_get_timecount_lfence; + } break; case CPU_VENDOR_CENTAUR: if (vm_guest == VM_GUEST_NO && @@ -278,6 +290,10 @@ probe_tsc_freq(void) CPUID_TO_MODEL(cpu_id) >= 0xf && (rdmsr(0x1203) & 0x100000000ULL) == 0) tsc_is_invariant = 1; + if (cpu_feature & CPUID_SSE2) { + tsc_timecounter.tc_get_timecount = + tsc_get_timecount_lfence; + } break; } @@ -328,16 +344,31 @@ init_TSC(void) #ifdef SMP -/ rmb is required here because rdtsc is not a serializing instruction. / -#define TSC_READ(x) \ -static void \ -tsc_read_##x(void arg) \ -{ \ - uint32_t tsc = arg; \ - u_int cpu = PCPU_GET(cpuid); \ - \ - rmb(); \ - tsc[cpu 3 + x] = rdtsc32(); \ +/* + * RDTSC is not a serializing instruction, and does not drain + * instruction stream, so we need to drain the stream before executing + * it. It could be fixed by use of RDTSCP, except the instruction is + * not available everywhere. + * + * Use CPUID for draining in the boot-time SMP constistency test. The + * timecounters use MFENCE for AMD CPUs, and LFENCE for others (Intel + * and VIA) when SSE2 is present, and nothing on older machines which + * also do not issue RDTSC prematurely. There, testing for SSE2 and + * vendor is too cumbersome, and we learn about TSC presence from + * CPUID. + * + * Do not use do_cpuid(), since we do not need CPUID results, which + * have to be written into memory with do_cpuid(). + / +#define TSC_READ(x) \ +static void \ +tsc_read_##x(void arg) \ +{ \ + uint32_t tsc = arg; \ + u_int cpu = PCPU_GET(cpuid); \ + \ + __asm __volatile("cpuid" : : : "eax", "ebx", "ecx", "edx"); \ + tsc[cpu 3 + x] = rdtsc32(); \ } TSC_READ(0) TSC_READ(1) @@ -487,7 +518,16 @@ init: for (shift = 0; shift < 31 && (tsc_freq >> shift) > max_freq; shift++) ; if (shift > 0) { - tsc_timecounter.tc_get_timecount = tsc_get_timecount_low; + if (cpu_feature & CPUID_SSE2) { + if (cpu_vendor_id == CPU_VENDOR_AMD) { + tsc_timecounter.tc_get_timecount = + tsc_get_timecount_low_mfence; + } else { + tsc_timecounter.tc_get_timecount = + tsc_get_timecount_low_lfence; + } + } else + tsc_timecounter.tc_get_timecount = tsc_get_timecount_low; tsc_timecounter.tc_name = "TSC-low"; if (bootverbose) printf("TSC timecounter discards lower %d bit(s)\n", @@ -599,16 +639,48 @@ tsc_get_timecount(struct timecounter tc __unused) return (rdtsc32()); } -static u_int +static inline u_int tsc_get_timecount_low(struct timecounter tc) { uint32_t rv; __asm __volatile("rdtsc; shrd %%cl, %%edx, %0" - : "=a" (rv) : "c" ((int)(intptr_t)tc->tc_priv) : "edx"); + : "=a" (rv) : "c" ((int)(intptr_t)tc->tc_priv) : "edx"); return (rv); } +static u_int +tsc_get_timecount_lfence(struct timecounter tc __unused) +{ + + lfence(); + return (rdtsc32()); +} + +static u_int +tsc_get_timecount_low_lfence(struct timecounter tc) +{ + + lfence(); + return (tsc_get_timecount_low(tc)); +} + +static u_int +tsc_get_timecount_mfence(struct timecounter tc __unused) +{ + + mfence(); + return (rdtsc32()); +} + +static u_int +tsc_get_timecount_low_mfence(struct timecounter tc) +{ + + mfence(); + return (tsc_get_timecount_low(tc)); +} + uint32_t cpu_fill_vdso_timehands(struct vdso_timehands *vdso_th) {	2012-08-01 17:26:22 +00:00
Konstantin Belousov	0220d04fe3	Add lfence(). MFC after: 1 week	2012-08-01 17:24:53 +00:00
Alan Cox	879eedbc7b	Revise pmap_enter()'s handling of mapping updates that change the PTE's PG_M and PG_RW bits but not the physical page frame. First, only perform vm_page_dirty() on a managed vm_page when the PG_M bit is being cleared. If the updated PTE continues to have PG_M set, then there is no requirement to perform vm_page_dirty(). Second, flush the mapping from the TLB when PG_M alone is cleared, not just when PG_M and PG_RW are cleared. Otherwise, a stale TLB entry may stop PG_M from being set again on the next store to the virtual page. However, since the vm_page's dirty field already shows the physical page as being dirty, no actual harm comes from the PG_M bit not being set. Nonetheless, it is potentially confusing to someone expecting to see the PTE change after a store to the virtual page.	2012-08-01 16:04:13 +00:00
Alexander Motin	1914fdecbe	Fix kernel panic on `camcontrol reset` for specific target, caused by uninitialized cm_targ in mpssas_action_resetdev(). Reviewed by: Desai, Kashyap <Kashyap.Desai@lsi.com> Sponsored by: iXsystems, Inc. MFC after: 3 days	2012-08-01 12:24:13 +00:00
Gleb Smirnoff	b9aee262e5	Some more whitespace cleanup.	2012-08-01 09:00:26 +00:00
Warner Losh	cc90639873	Add the chip select glue.	2012-08-01 01:18:36 +00:00
Adrian Chadd	9f579ef85d	Fix a case of "mis-located braces". PR: kern/170302	2012-08-01 00:18:02 +00:00
Adrian Chadd	af01710118	Allow 802.11n hardware to support multi-rate retry when RTS/CTS is enabled. The legacy (pre-802.11n) hardware doesn't support this - although the AR5212 era hardware supports MRR, it doesn't have all the bits needed to support MRR + RTS/CTS. The AR5416 and later support a packet duration and RTS/CTS flags per rate scenario, so we should support it. Tested: * AR9280, STA PR: kern/170302	2012-07-31 23:54:15 +00:00

1 2 3 4 5 ...

87782 Commits