freebsd-skq

Author	SHA1	Message	Date
attilio	b94ea1b2e3	Really, smb_iod_main() is not totally MPSAFE, so just acquire and drop Giant around it in order to assume MPSAFETY. Reported by: jhb, rwatson Pointy hat to: attilio	2008-03-27 01:23:59 +00:00
phk	c763b22a79	Back in the good old days, PC's had random pieces of rock for frequency generation and what frequency the generated was anyones guess. In general the 32.768kHz RTC clock x-tal was the best, because that was a regular wrist-watch Xtal, whereas the X-tal generating the ISA bus frequency was much lower quality, often costing as much as several cents a piece, so it made good sense to check the ISA bus frequency against the RTC clock. The other relevant property of those machines, is that they typically had no more than 16MB RAM. These days, CPU chips croak if their clocks are not tightly within specs and all necessary frequencies are derived from the master crystal by means if PLL's. Considering that it takes on average 1.5 second to calibrate the frequency of the i8254 counter, that more likely than not, we will not actually use the result of the calibration, and as the final clincher, we seldom use the i8254 for anything besides BEL in syscons anyway, it has become time to drop the calibration code. If you need to tell the system what frequency your i8254 runs, you can do so from the loader using hw.i8254.freq or using the sysctl kern.timecounter.tc.i8254.frequency.	2008-03-26 22:12:00 +00:00
phk	f5d8b74690	Further cleanup of sound generation in syscons: The timer_spkr_() functions take care of the enabling/disabling of the speaker. Test on the existence of timer_spkr_() functions, rather than architectures.	2008-03-26 22:02:51 +00:00
phk	3cbe36127b	Make speaker a pseudo device driver instead of attaching to a PnP id. If somebody cleaned this code up to proper style(9), it could become a great educational starting point for aspiring kernel hackers.	2008-03-26 21:33:41 +00:00
rwatson	61a4ef5ea0	Add a comment explaining that we initialize the 'a' buffer for zero-copy to the store buffer position on the BPF descriptor, and the 'b' buffer as the free buffer in order to fill them in the order documented in bpf(4). MFC after: 4 months Suggested by: csjp	2008-03-26 21:29:13 +00:00
mav	5b9ac353f2	Some minor code and math optimizations.	2008-03-26 21:19:03 +00:00
jhb	20cadd93f0	Fix a nit with the 'nofoo' options where 'foo' is mapped to 'nonofoo' (such as 'atime' vs 'noatime'). The filesystems will always see either 'nofoo' or 'nonofoo', never plain 'foo'. As such, their list of valid mount options should include 'nofoo' instead of 'foo'. With this fix, you can do 'mount -u -o atime' on a FFS filesystem that isn't marked as noatime without getting an error. You can also update a noatime FFS filesystem mounted via mount(2) (e.g. 6.x /sbin/mount binary) to 'atime' using nmount(2) (e.g. 7.x /sbin/mount binary). MFC after: 1 week Reviewed by: crodig	2008-03-26 20:48:07 +00:00
phk	259c5e1579	Remove two variables which are handled MI now.	2008-03-26 20:28:52 +00:00
phk	168398fe50	Eliminate unnecessary #includes	2008-03-26 20:26:12 +00:00
phk	fa71439e44	The "free-lance" timer in the i8254 is only used for the speaker these days, so de-generalize the acquire_timer/release_timer api to just deal with speakers. The new (optional) MD functions are: timer_spkr_acquire() timer_spkr_release() and timer_spkr_setfreq() the last of which configures the timer to generate a tone of a given frequency, in Hz instead of 1/1193182th of seconds. Drop entirely timer2 on pc98, it is not used anywhere at all. Move sysbeep() to kern/tty_cons.c and use the timer_spkr() if they exist, and do nothing otherwise. Remove prototypes and empty acquire-/release-timer() and sysbeep() functions from the non-beeping archs. This eliminate the need for the speaker driver to know about i8254frequency at all. In theory this makes the speaker driver MI, contingent on the timer_spkr_() functions existing but the driver does not know this yet and still attaches to the ISA bus. Syscons is more tricky, in one function, sc_tone(), it knows the hz and things are just fine. In the other function, sc_bell() it seems to get the period from the KDMKTONE ioctl in terms if 1/1193182th second, so we hardcode the 1193182 and leave it at that. It's probably not important. Change a few other sysbeep() uses which obviously knew that the argument was in terms of i8254 frequency, and leave alone those that look like people thought sysbeep() took frequency in hertz. This eliminates the knowledge of i8254_freq from all but the actual clock.c code and the prof_machdep.c on amd64 and i386, where I think it would be smart to ask for help from the timecounters anyway [TBD].	2008-03-26 20:09:21 +00:00
dfr	7ce50c542d	Bump __FreeBSD_version for the addition of 'l_sysid' to the flock structure.	2008-03-26 15:41:00 +00:00
emaste	aa3c79c94c	Add \n to the end of a printf string and remove it from panic strings.	2008-03-26 15:28:56 +00:00
dfr	1c5a20ad66	Regen.	2008-03-26 15:24:02 +00:00
dfr	79d2dfdaa6	Add the new kernel-mode NFS Lock Manager. To use it instead of the user-mode lock manager, build a kernel with the NFSLOCKD option and add '-k' to 'rpc_lockd_flags' in rc.conf. Highlights include: * Thread-safe kernel RPC client - many threads can use the same RPC client handle safely with replies being de-multiplexed at the socket upcall (typically driven directly by the NIC interrupt) and handed off to whichever thread matches the reply. For UDP sockets, many RPC clients can share the same socket. This allows the use of a single privileged UDP port number to talk to an arbitrary number of remote hosts. * Single-threaded kernel RPC server. Adding support for multi-threaded server would be relatively straightforward and would follow approximately the Solaris KPI. A single thread should be sufficient for the NLM since it should rarely block in normal operation. * Kernel mode NLM server supporting cancel requests and granted callbacks. I've tested the NLM server reasonably extensively - it passes both my own tests and the NFS Connectathon locking tests running on Solaris, Mac OS X and Ubuntu Linux. * Userland NLM client supported. While the NLM server doesn't have support for the local NFS client's locking needs, it does have to field async replies and granted callbacks from remote NLMs that the local client has contacted. We relay these replies to the userland rpc.lockd over a local domain RPC socket. * Robust deadlock detection for the local lock manager. In particular it will detect deadlocks caused by a lock request that covers more than one blocking request. As required by the NLM protocol, all deadlock detection happens synchronously - a user is guaranteed that if a lock request isn't rejected immediately, the lock will eventually be granted. The old system allowed for a 'deferred deadlock' condition where a blocked lock request could wake up and find that some other deadlock-causing lock owner had beaten them to the lock. * Since both local and remote locks are managed by the same kernel locking code, local and remote processes can safely use file locks for mutual exclusion. Local processes have no fairness advantage compared to remote processes when contending to lock a region that has just been unlocked - the local lock manager enforces a strict first-come first-served model for both local and remote lockers. Sponsored by: Isilon Systems PR: 95247 107555 115524 116679 MFC after: 2 weeks	2008-03-26 15:23:12 +00:00
phk	632e5d39f7	Rename timer0_max_count to i8254_max_count. Rename timer0_real_max_count to i8254_real_max_count and make it static. Rename timer_freq to i8254_freq and make it a loader tunable.	2008-03-26 15:03:24 +00:00
phk	44bfb30efd	The RTC related pscnt and psdiv variables have no business being public.	2008-03-26 13:25:27 +00:00
phk	6f9c1b7d47	Remove old sysctl stuff which is long gone in other arch's.	2008-03-26 13:03:51 +00:00
brueffer	b64d211df2	Fix some "in in" typos in comments. PR: 121490 Submitted by: Anatoly Borodin <anatoly.borodin@gmail.com> Approved by: rwatson (mentor), jkoshy MFC after: 3 days	2008-03-26 07:32:08 +00:00
alc	0e2f1e0b38	Enable the automatic creation of superpage reservations.	2008-03-26 03:12:00 +00:00
sam	2b9c326fca	split out tty create part of ucom_attach into ucom_attach_tty so derived drivers can use it Submitted by: Jared Go MFC after: 3 weeks	2008-03-25 23:46:24 +00:00
sam	5f88f5ba90	add some CDMA modems Submitted by: Jared Go MFC after: 1 week	2008-03-25 23:35:32 +00:00
scottl	a3b7a4bce8	Implement taskqueue_block() and taskqueue_unblock(). These functions allow the owner of a queue to block and unblock execution of the tasks in the queue while allowing tasks to continue to be added queue. Combining this with taskqueue_drain() allows a queue to be safely disabled. The unblock function may run (or schedule to run) the queue when it is called, just as calling taskqueue_enqueue() would. Reviewed by: jhb, sam	2008-03-25 22:38:45 +00:00
emaste	5e698c9f5e	Add 64-bit array support for RAIDs > 2TB. This corresponds to ~ Adaptec driver build 15317. Tested on: Adaptec 2230S, Firmware 4.2-0 (8205) ICP ICP5085BL, Firmware 5.2-0 (12814) Submitted by: Adaptec	2008-03-25 21:39:06 +00:00
sam	f9c4823d64	add __noinline Submitted by: imp Reviewed by: kan (long ago) MFC after: 3 weeks	2008-03-25 21:30:01 +00:00
sam	d5c642ca44	expose if_purgemaddrs, it will be used by the vap code unless someone redesigns the mcast support code in the next few weeks MFC after: 3 weeks	2008-03-25 21:23:32 +00:00
sam	12e7d1940e	IFM_IEEE80211_IBSSMASTER hasn't been used in many years; replace it with IFM_IEEE80211_WDS which will be used by the forthcoming vap code MFC after: 3 weeks	2008-03-25 21:22:43 +00:00
sam	8e10753c85	enable dynamic addition of "show all" commands MFC after: 3 weeks	2008-03-25 20:36:32 +00:00
jhb	fce41b3b76	Regen.	2008-03-25 19:35:34 +00:00
jhb	a8ff4f0990	Add entries for the cpuset-related system calls. The existing system calls can be used on little endian systems. Pointy hat to: jeff	2008-03-25 19:34:47 +00:00
emaste	4146778caa	Correct data direction flags in aac_bio_command() in the !AAC_FLAGS_RAW_IO && AAC_FLAGS_SG_64BIT case. Submitted by: Adaptec	2008-03-25 18:34:04 +00:00
ru	e9ab62a9ff	Fix build. Reported by: ache, tinderbox	2008-03-25 13:20:52 +00:00
ru	3b1bf8c2e9	Replaced the misleading uses of a historical artefact M_TRYWAIT with M_WAIT. Removed dead code that assumed that M_TRYWAIT can return NULL; it's not true since the advent of MBUMA. Reviewed by: arch There are ongoing disputes as to whether we want to switch to directly using UMA flags M_WAITOK/M_NOWAIT for mbuf(9) allocation.	2008-03-25 09:39:02 +00:00
ru	0655a583e2	Regen after changing prototypes of cpuset_{get,set}affinity().	2008-03-25 09:14:17 +00:00
ru	4feaeed265	Fixed type of the fourth argument of cpuset_{get,set}affinity(2) to be size_t. Prodded by: davidxu	2008-03-25 09:11:53 +00:00
rwatson	59900e5206	Check for a NULL free buffer pointer in BPF before invoking bpf_canfreebuf() in order to avoid potentially calling a non-inlinable but trivial function in zero-copy buffer mode for every packet received when we couldn't free the buffer anyway. MFC after: 4 months	2008-03-25 07:41:33 +00:00
weongyo	9a9594d179	Add support for Marvell Libertas 88W8335 based PCI network adapters. Reviewed by: sam, many wireless people Approved by: thompsa (mentor)	2008-03-25 06:32:33 +00:00
mav	9af7fc155d	Rewrite node to support multiple hooks, alike to ng_l2tp, to use one pair of pptpgre and ksocket nodes for all calls between two peers. This patch modifies node's API by adding new "session_%04x" hook names support, while keeping backward compatibility. Together with appropriate user-level support (by latest mpd5) it gives huge performance benefits for case of multiple active calls between two peers because of avoiding data duplication and extra socket processing. On my benchmarks I have got more then 10 times speedup for the 200 simultaneous PPTP calls between two peers. In conclusion, it allows now to build effective "clients <=> PAC <=> PNS" setups.	2008-03-24 22:55:22 +00:00
jkim	3e99f5d364	Belatedly add BPF_JITTER in NOTES for supported architectures.	2008-03-24 22:23:22 +00:00
jkim	e4afcc95a9	Fix build with option BPF_JITTER.	2008-03-24 22:21:32 +00:00
jkim	f1bce3f01d	Remove redundant inclusions of net/bpfdesc.h.	2008-03-24 22:16:46 +00:00
kmacy	59f40fe008	change inp_wlock_assert to inp_lock_assert	2008-03-24 20:24:04 +00:00
emaste	86c0a7d5de	Diff reduction to Adaptec's driver (around build 15317): catch up with a change in debugging routines. The fwprintf macro in the AAC_DEBUG case (mapping to printf) isn't from the Adaptec driver.	2008-03-24 19:23:33 +00:00
sam	f95a1f76b1	o add M_PROTO[678]; they'll be needed by net80211 vap code o sort mbuf flags together and extend values to 32 bits o write M_COPYFLAGS in terms of M_PROTOFLAGS o move M_COPYFLAGS and M_PROTOFLAGS up to be together with flag defs Reviewed by: rwatson MFC after: 3 weeks	2008-03-24 19:01:29 +00:00
marius	dd3d50596e	- Const'ify the bus_stream_asi and bus_type_asi arrays. - Replace hard-coded functions names missed in bus_machdep.c rev. 1.44 with __func__. - Break some long lines. MFC after: 1 month	2008-03-24 17:57:01 +00:00
marius	9310ab33a4	- Take advantage of bus_dmamap_load_mbuf_sg(9). - Take advantage of m_collapse(9). - Sync with other NIC drivers and prepend a TX mbuf if the first attempt to load it fails with an error other than EFBIG and stop trying instead of freeing it and keeping on trying to enqueue more mbufs. Also ensure the driver queue isn't empty before trying to enqueue mbufs in order to reduce locking operations. - In xl_ifmedia_upd() add a missing XL_UNLOCK(). [1] - Const'ify the xl_devs array. - Remove an outdated comment. PR: 113406 [1] MFC after: 1 month	2008-03-24 17:49:06 +00:00
marius	9813122d2a	- Const'ify the dc_devs array. - Correct the maxsize parameter when creating the mbufs busdma tag to reflect the actual requirement of dc(4). - Move the KASSERT in dc_newbuf() to the right spot. - Also convert the TX side to take advantage of bus_dmamap_load_mbuf_sg(9). - Move the comment regarding dc_start_locked() to the right spot. MFC after: 2 weeks	2008-03-24 17:38:24 +00:00
marius	cf4d38b379	Split the registers into two halves in preparation for SBus support. Obtained from: NetBSD (loosely) MFC after: 2 weeks	2008-03-24 17:23:53 +00:00
emaste	bfdd190b82	Diff reduction to Adaptec driver build 15317 (refactoring and code shuffling): - Resource allocation in aac_alloc (moved from from aac_init) - Interrupt setup in aac_setup_intr (from aac_attach) - Container probing in aac_get_container_info (from aac_startup and aac_handle_aif) - Firmware status check moved to aac_check_firmware from aac_init	2008-03-24 16:38:47 +00:00
bz	e1cf25141c	Fix a bug that when getting/dumping the soft lifetime we reported the hard lifetime instead. MFC after: 3 days	2008-03-24 15:01:20 +00:00
bz	42fbad307b	Import change from KAME, rev. 1.362 kame/kame/sys/netkey/key.c In case of "new SA", we must check the hard lifetime of the old SA to find out if it is not permanent and we can delete it. Submitted by: sakane via gnn MFC after: 3 days	2008-03-24 14:55:09 +00:00
csjp	5c0a194548	Bump the FreeBSD version for zerocopy bpf buffers and changes to the bpf(4) monitoring ABI/structures.	2008-03-24 14:30:01 +00:00
csjp	310e3f93dd	Introduce support for zero-copy BPF buffering, which reduces the overhead of packet capture by allowing a user process to directly "loan" buffer memory to the kernel rather than using read(2) to explicitly copy data from kernel address space. The user process will issue new BPF ioctls to set the shared memory buffer mode and provide pointers to buffers and their size. The kernel then wires and maps the pages into kernel address space using sf_buf(9), which on supporting architectures will use the direct map region. The current "buffered" access mode remains the default, and support for zero-copy buffers must, for the time being, be explicitly enabled using a sysctl for the kernel to accept requests to use it. The kernel and user process synchronize use of the buffers with atomic operations, avoiding the need for system calls under load; the user process may use select()/poll()/kqueue() to manage blocking while waiting for network data if the user process is able to consume data faster than the kernel generates it. Patchs to libpcap are available to allow libpcap applications to transparently take advantage of this support. Detailed information on the new API may be found in bpf(4), including specific atomic operations and memory barriers required to synchronize buffer use safely. These changes modify the base BPF implementation to (roughly) abstrac the current buffer model, allowing the new shared memory model to be added, and add new monitoring statistics for netstat to print. The implementation, with the exception of some monitoring hanges that break the netstat monitoring ABI for BPF, will be MFC'd. Zerocopy bpf buffers are still considered experimental are disabled by default. To experiment with this new facility, adjust the net.bpf.zerocopy_enable sysctl variable to 1. Changes to libpcap will be made available as a patch for the time being, and further refinements to the implementation are expected. Sponsored by: Seccuris Inc. In collaboration with: rwatson Tested by: pwood, gallatin MFC after: 4 months [1] [1] Certain portions will probably not be MFCed, specifically things that can break the monitoring ABI.	2008-03-24 13:49:17 +00:00
kmacy	9fbcabc6c7	remove unneccessary tcbinfo lock acquisitions - set tp to null affter calling enter_timewait as we no longer own the inpcb	2008-03-24 05:21:10 +00:00
jeff	3ad75daf19	- Greatly simplify vget() by removing the guarantee that any new references to a vnode with VI_OWEINACT set will force the vinactive() call. The kernel makes no guarantees about which reference was the last to close a file or when the actual inactive processing will happen. The previous code was designed to preserve existing semantics in the face of shared locks, however, this was unnecessary. Discussed with: mckusick	2008-03-24 04:22:58 +00:00
jeff	1bf44343e2	- Don't acquire the vnode interlock in _vn_lock() unless no lock type is requested. Handle this case specially before the while loop. - Use the held vnode lock to check for VI_DOOMED. The vnode lock and interlock must both be held to set VI_DOOMED so either one held, even shared, is sufficient to check it. No objection by: kib	2008-03-24 04:17:35 +00:00
jeff	955d594912	- Remove an old comment; vnodes have been working without Giant for years now. - Clarify the locking required for VI_DOOMED in preparation for simplifications to vget() and vn_lock().	2008-03-24 04:11:40 +00:00
kmacy	08877248a3	Label inp as unused in the non-INVARIANTS case	2008-03-24 00:29:01 +00:00
peter	112e790f78	First pass at (possibly futile) microoptimizing of cpu_switch. Results are mixed. Some pure context switch microbenchmarks show up to 29% improvement. Pipe based context switch microbenchmarks show up to 7% improvement. Real world tests are far less impressive as they are dominated more by actual work than switch overheads, but depending on the machine in question, workload, kernel options, phase of moon, etc, a few percent gain might be seen. Summary of changes: - don't reload MSR_[FG]SBASE registers when context switching between non-threaded userland apps. These typically cost 120 clock cycles each on an AMD cpu (less on Barcelona/Phenom). Intel cores are probably no faster on this. - The above change only helps unthreaded userland apps that tend to use the same value for gsbase. Threaded apps will get no benefit from this. - reorder things like accessing the pcb to be in memory order, to give prefetching a better chance of working. Operations are now in increasing memory address order, rather than reverse or random. - Push some lesser used code out of the main code paths. Hopefully allowing better code density in cache lines. This is probably futile. - (part 2 of previous item) Reorder code so that branches have a more realistic static branch prediction hint. Both Intel and AMD cpus default to predicting branches to lower memory addresses as being taken, and to higher memory addresses as not being taken. This is overridden by the limited dynamic branch prediction subsystem. A trip through userland might overflow this. - Futule attempt at spreading the use of the results of previous operations in new operations. Hopefully this will allow the cpus to execute in parallel better. - stop wasting 16 bytes at the top of kernel stack, below the PCB. - Never load the userland fs/gsbase registers for kthreads, but preserve curpcb->pcb_[fg]sbase as caches for the cpu. (Thanks Jeff!) Microbenchmarking this code seems to be really sensitive to things like scheduling luck, timing, cache behavior, tlb behavior, kernel options, other random code changes, etc. While it doesn't help heavy userland workloads much, it does help high context switch loads a little, and should help those that involve switching via kthreads a bit more. A special thanks to Kris for the testing and reality checks, and Jeff for tormenting me into doing this. :) This is still work-in-progress.	2008-03-23 23:09:06 +00:00
alc	f9d9755304	Correct an error in pmap_mincore() when applied to a 2MB page mapping: Use PG_PS_FRAME, not PG_FRAME, to obtain the physical address of the 2MB physical page from the PDE.	2008-03-23 23:04:09 +00:00
peter	b238ee1007	Export TDP_KTHREAD to asm files.	2008-03-23 22:46:37 +00:00
peter	1f7e9770bb	Move pcb_flags to make trivially better use of cache lines.	2008-03-23 22:45:51 +00:00
peter	075e9da352	Protect the setting of the fsbase/gsbase MSR registers and the pcb_[fg]sbase values with a critical section, like the rest of the kernel.	2008-03-23 22:44:56 +00:00
kmacy	fb74f62b24	Insulate inpcb consumers outside the stack from the lock type and offset within the pcb by adding accessor functions. Reviewed by: rwatson MFC after: 3 weeks	2008-03-23 22:34:16 +00:00
alc	e702727e2c	To date, we have assumed that the TLB will only set the PG_M bit in a PTE if that PTE has the PG_RW bit set. However, this assumption does not hold on recent processors from Intel. For example, consider a PTE that has the PG_RW bit set but the PG_M bit clear. Suppose this PTE is cached in the TLB and later the PG_RW bit is cleared in the PTE, but the corresponding TLB entry is not (yet) invalidated. Historically, upon a write access using this (stale) TLB entry, the TLB would observe that the PG_RW bit had been cleared and initiate a page fault, aborting the setting of the PG_M bit in the PTE. Now, however, P4- and Core2-family processors will set the PG_M bit before observing that the PG_RW bit is clear and initiating a page fault. In other words, the write does not occur but the PG_M bit is still set. The real impact of this difference is not that great. Specifically, we should no longer assert that any PTE with the PG_M bit set must also have the PG_RW bit set, and we should ignore the state of the PG_M bit unless the PG_RW bit is set. However, these changes enable me to remove a work-around from pmap_promote_pde(), the superpage promotion procedure. (Note: The AMD processors that we have tested, including the latest, the Phenom, still exhibit the historical behavior.) Acknowledgments: After I observed the problem, Stephan (ups) was instrumental in characterizing the exact behavior of Intel's recent TLBs. Tested by: Peter Holm	2008-03-23 20:38:01 +00:00
kib	5ddf5664cc	Yield the cpu in the kernel while iterating the list of the vnodes belonging to the mountpoint. Also, yield when in the softdep_process_worklist() even when we are not going to sleep due to buffer drain. It is believed that the ULE fixed the problem [1], but the yielding seems to be needed at least for the 4BSD case. Discussed: on stable@, with bde Reviewed by: tegge, jeff [1] MFC after: 2 weeks	2008-03-23 13:45:24 +00:00
kib	53a15ee1ea	Prevent the overflow in the calculation of the next page directory. The overflow causes the wraparound with consequent corruption of the (almost) whole address space mapping. As Alan noted, pmap_copy() does not require the wrap-around checks because it cannot be applied to the kernel's pmap. The checks there are included for consistency. Reported and tested by: kris (i386/pmap.c:pmap_remove() part) Reviewed by: alc MFC after: 1 week	2008-03-23 07:07:27 +00:00
yongari	fcd39263e4	MSI handling on some RealTek chips are broken so disable it by default. Reported by: Giulio Ferro ( auryn AT zirakzigil DOT org ) Tested by: Giulio Ferro ( auryn AT zirakzigil DOT org )	2008-03-23 05:35:18 +00:00
yongari	031ecde733	For MSI capable hardwares, enable MSI enable bit in RL_CFG2 register. If MSI was disabled by hw.re.msi_disable tunable expliclty clear the MSI enable bit.	2008-03-23 05:31:35 +00:00
yongari	7fea7ba914	Some RealTek chips are known to be buggy on DAC handling, so disable DAC by default.	2008-03-23 05:13:45 +00:00
yongari	00b0cf0b1a	VLAN hardware tag information should be set for all desciptors of a multi-descriptor transmission attempt. Datasheet said nothing about this requirements. This should fix a long-standing VLAN hardware tagging issues with re(4). Reported by: Giulio Ferro ( auryn AT zirakzigil DOT org ) Tested by: Giulio Ferro ( auryn AT zirakzigil DOT org )	2008-03-23 05:06:16 +00:00
yongari	fd413d352f	Always honor configured VLAN/checksum offload capabilities. Previously re(4) used to blindly enable VLAN hardware tag stripping and Rx checksum offload regardless of enabled optional features of interface.	2008-03-23 04:59:13 +00:00
davidxu	c32a483ae9	Remove commented out code, thread suspension is done in thread library.	2008-03-23 02:03:06 +00:00
jeff	8103d042fb	- Only return 1 from sync_vnode() in cases where the vnode is still at the head of the sync list. This prevents sched_sync() from re-queueing a vnode which may have been freed already. Discussed with: kib	2008-03-23 01:44:28 +00:00
marcel	124e0025d3	Instead of making a single geom_part.ko module, make a module for each partitioning scheme. The gpart code is currently non- optional.	2008-03-23 01:42:47 +00:00
jeff	73b6a5597c	- Pass BO_MTX(bo) to lockmgr in vtruncbuf, we don't own the vnode interlock here anymore. Reported by: kris	2008-03-23 01:42:19 +00:00
marcel	c184f6ced2	Redefine G_PART_SCHEME_DECLARE() from populating a private linker set to declaring a proper module. The module event handler is part of the gpart core and will add the scheme to an internal list on module load and will remove the scheme from the internal list on module unload. This makes it possible to dynamically load and unload partitioning schemes.	2008-03-23 01:31:59 +00:00
marcel	31a163ef06	Add g_retaste(), which given a class will present all non-open providers to it for tasting. This is useful when the class, through means outside the scope of GEOM, can claim providers previously unclaimed. The g_retaste() function posts an event which is handled by the g_retaste_event(). Event suggested by: phk	2008-03-23 01:23:35 +00:00
cognet	4d5f668fc2	We need to prototype _start() as well, as we use it to test if we're running from flash or from RAM. Reported by: imp MFC After: 3 days	2008-03-22 20:34:07 +00:00
qingli	4471734ac4	Reuse the mbuf that was just retrieved from the receive ring if mbuf exhaustion is encountered. There was a fix made previously for this problem but the solution (breaking out of the receive loop) does not seem to work. mbuf reuse strategy is already adopted by other drivers such as if_bge. The problem was recreated and the patch is also verified in the same test environment.	2008-03-22 18:13:39 +00:00
sam	5996854133	add hints to specify how NPE ports are mapped to MAC+PHY; these could be commented out as they just duplicate the defaults that are built into the code Reviewed by: imp MFC after: 1 week	2008-03-22 16:55:51 +00:00
sam	85a6e3f5ef	Improve mac+phy configuration so that hints can be used to describe layouts different than the defaults: o hint.npe.0.mac="A", "B", etc. specifies the window for MAC register accesses o hint.npe.0.mii="A", "B", etc. specifies PHY registers o hint.npe.1.phy=%d specifies the PHY to map to a port This allows devices like NSLU to be setup w/o code changes and will also be used for forthcoming support for more Avila boards. Reviewed by: imp MFC after 1 week	2008-03-22 16:53:28 +00:00
phk	5a1f4173f5	In abort2(2): Accept a NULL arg pointer if nargs == 0	2008-03-22 16:32:52 +00:00
sam	9eb2a09a7e	(finally) add the hal status to the diagnostic generated after a failed ath_hal_reset call MFC after: 3 days	2008-03-22 16:27:47 +00:00
jeff	a9d123c3ab	- Complete part of the unfinished bufobj work by consistently using BO_LOCK/UNLOCK/MTX when manipulating the bufobj. - Create a new lock in the bufobj to lock bufobj fields independently. This leaves the vnode interlock as an 'identity' lock while the bufobj is an io lock. The bufobj lock is ordered before the vnode interlock and also before the mnt ilock. - Exploit this new lock order to simplify softdep_check_suspend(). - A few sync related functions are marked with a new XXX to note that we may not properly interlock against a non-zero bv_cnt when attempting to sync all vnodes on a mountlist. I do not believe this race is important. If I'm wrong this will make these locations easier to find. Reviewed by: kib (earlier diff) Tested by: kris, pho (earlier diff)	2008-03-22 09:15:16 +00:00
alfred	b283b3e59a	Fix a race where timeout/untimeout could cause crashes for Giant locked code. The bug: There exists a race condition for timeout/untimeout(9) due to the way that the softclock thread dequeues timeouts. The softclock thread sets the c_func and c_arg of the callout to NULL while holding the callout lock but not Giant. It then drops the callout lock and acquires Giant. It is at this point where untimeout(9) on another cpu/thread could be called. Since c_arg and c_func are cleared, untimeout(9) does not touch the callout and returns as if the callout is canceled. The softclock then tries to acquire Giant and likely blocks due to the other cpu/thread holding it. The other cpu/thread then likely deallocates the backing store that c_arg points to and finishes working and hence drops Giant. Softclock resumes and acquires giant and calls the function with the now free'd c_arg and we have corruption/crash. The fix: We need to track curr_callout even for timeout(9) (LOCAL_ALLOC) callouts. We need to free the callout after the softclock processes it to deal with the race here. Obtained from: Juniper Networks, iedowse Reviewed by: jhb, iedowse MFC After: 2 weeks.	2008-03-22 07:29:45 +00:00
ambrisko	085cbdfe5d	Add in a compat. mode so you can either open the card's device node or directly open mfi0 and specify the card you want to talk to in the ioctl.	2008-03-22 02:57:49 +00:00
bz	418e4a564c	Add ';' missed with the SYSINIT changes. Not noticed by tb as TCP_SIGNATURE is not in LINT. MFC after: 1 month	2008-03-21 18:31:42 +00:00
remko	29b5baab7c	Add the i915 GME device to DRM. PR: kern/121808 Submitted by: Volker Werth <volker at vwsoft dot com> Approved by: imp (mentor, implicit for trivial changes) MFC after: 3 days	2008-03-21 16:38:42 +00:00
kib	bc4bc893dd	Reduce contention on the vnode interlock by not acquiring the BO_LOCK around the check for the BV_BKGRDINPROG in the brelse() and bqrelse(). See the comment for the explanation why it is safe. Tested by: pho Submitted by: jeff	2008-03-21 12:38:44 +00:00
kib	04661caa35	Reduce the acquisition of the vnode interlock in the ffs_read() and ffs_extread() when setting the IN_ACCESS flag by checking whether the IN_ACCESS is already set. The possible race there is admissible. Tested by: pho Submitted by: jeff	2008-03-21 12:33:00 +00:00
jeff	72142b2fae	- Reduce contention on the global bdonelock and bpinlock by using a pool mutex to protect these sleep/wakeup/counter races. This still is preferable to bloating each bio with a mtx.	2008-03-21 10:00:05 +00:00
jeff	ba540b27d6	- Add a new td flag TDF_NEEDSUSPCHK that is set whenever a thread needs to enter thread_suspend_check(). - Set TDF_ASTPENDING along with TDF_NEEDSUSPCHK so we can move the thread_suspend_check() to ast() rather than userret(). - Check TDF_NEEDSUSPCHK in the sleepq_catch_signals() optimization so that we don't miss a suspend request. If this is set use the expensive signal path. - Set NEEDSUSPCHK when creating a new thread in thr in case the creating thread is due to be suspended as well but has not yet. Reviewed by: davidxu (Authored original patch)	2008-03-21 08:23:25 +00:00
jhb	fbea3b6403	Explicitly use spinlock_enter/exit rather than locking the icu_lock spin lock in the 8259A drivers as these drivers are only used on UP systems. This slightly reduces the penalty of an SMP kernel (such as GENERIC) on a UP x86 machine.	2008-03-20 21:53:27 +00:00
jhb	6cf6d7b22b	Implement a BUS_BIND_INTR() method in the bus interface to bind an IRQ resource to a CPU. The default method is to pass the request up to the parent similar to BUS_CONFIG_INTR() so that all busses don't have to explicitly implement bus_bind_intr. A bus_bind_intr(9) wrapper routine similar to bus_setup/teardown_intr() is added for device drivers to use. Unbinding an interrupt is done by binding it to NOCPU. The IRQ resource must be allocated, but it can happen in any order with respect to bus_setup_intr(). Currently it is only supported on amd64 and i386 via nexus(4) methods that simply call the intr_bind() routine. Tested by: gallatin	2008-03-20 21:24:32 +00:00
sos	f997b9d36a	Unbreak the last commit. Changes from the PM WIP sneaked in and caused compile errors.	2008-03-20 21:21:31 +00:00
kmacy	db590514fa	pay attention to default cluster limits when sizing receive queues	2008-03-20 20:52:37 +00:00
emaste	ae058e4be5	Restore creation of passthrough devices with newer controller firmware by putting the correct size in the fib header. Presumably the older firmware silently ignored a bad size field. (This change tested with a 3805 controller. Passthrough devices were created when running firmware build 12814, but not 15323 or later. With this change they're created for both old and new firmware versions.) Submitted by: Adaptec	2008-03-20 20:33:48 +00:00
emaste	2d11776afc	Add ioctls FSACTL_SEND_LARGE_FIB, FSACTL_SEND_RAW_SRB, FSACTL_LNX_SEND_LARGE_FIB, and FSACTL_LNX_SEND_RAW_SRB, and correct size checks on FIBs passed in from userspace. Both changes were obtained from Adaptec's driver build 15317. Adaptec's commandline RAID tool arcconf uses these ioctls when creating a RAID-10 array (and probably other operations too).	2008-03-20 17:59:19 +00:00
sam	0b0672cdd0	add usb devices and more wlan stuff now that usb is functional MFC after: 1 month	2008-03-20 17:44:58 +00:00
rdivacky	4a8a8b1c08	o Add stub support for some new futex operations, so the annoying message is not printed. o Don't warn about FUTEX_FD not being implemented and return ENOSYS instead of 0 (eg. success). o Clear FUTEX_PRIVATE_FLAG as we actually implement only private futexes so there is no reason to return ENOSYS when app asks for a private futex. We don't reject shared futexes because they worked just fine with our implementation so far. Approved by: kib (mentor) Tested by: bsam MFC after: 1 week	2008-03-20 17:03:55 +00:00
sam	1906c8de60	Workaround design botch in usb: blindly mixing bus_dma with PIO does not work on architectures with a write-back cache as the PIO writes end up in the cache which the sync(BUS_DMASYNC_POSTREAD) in usb_transfer_complete then discards; compensate in the xfer methods that do PIO by pushing the writes out of the cache before usb_transfer_complete is called. This fixes USB on xscale and likely other places. Sponsored by: hobnob Reviewed by: cognet, imp MFC after: 1 month	2008-03-20 16:19:25 +00:00
kib	de73f6b678	Do not dereference cdev->si_cdevsw, use the dev_refthread() to properly obtain the reference. In particular, this fixes the panic reported in the PR. Remove the comments stating that this needs to be done. PR: kern/119422 MFC after: 1 week	2008-03-20 16:08:42 +00:00
sam	c8f58c14c0	Correct cache handling for xfer requests marked URQ_REQUEST: many (if not all uses) involve a read but usbd_start_transfer only does a PREWRITE; change this to BUS_DMASYNC_PREREAD \| BUS_DMASYNC_PREWRITE as I'm not sure if any users do write+read. Reviewed by: cognet, imp MFC after: 1 month	2008-03-20 16:04:13 +00:00
sam	70faf3fbc1	map device 5; the optional USB controller on Gateworks 2348 boards shows up here instead of the minipci slot at J4 Reviewed by: cognet, imp MFC after: 1 week	2008-03-20 15:54:19 +00:00
kib	28174d9ffb	Fix the leak of the vmspace on the fork when the process limits are exceeded. Pointy hat to: me MFC after: 3 days	2008-03-20 15:24:49 +00:00
sos	c69ad0290e	Fix Problem with Intel Matrix RAID. Fix from PR/121899.	2008-03-20 11:54:26 +00:00
kmacy	56b72c6a35	back out last change as Sam believes that it breaks multicast - need to revisit after following up with pyun	2008-03-20 06:19:34 +00:00
jeff	a3f8e0c20d	- Restore runq to manipulating threads directly by putting runq links and rqindex back in struct thread. - Compile kern_switch.c independently again and stop #include'ing it from schedulers. - Remove the ts_thread backpointers and convert most code to go from struct thread to struct td_sched. - Cleanup the ts_flags #define garbage that was causing us to sometimes do things that expanded to td->td_sched->ts_thread->td_flags in 4BSD. - Export the kern.sched sysctl node in sysctl.h	2008-03-20 05:51:16 +00:00
kmacy	8b4fc7299f	Don't re-initialize the interface if it is already running. This one line change makes the following code found in many ethernet device drivers (at least em, igb, ixgbe, and cxgb) gratuitous case SIOCSIFADDR: if (ifa->ifa_addr->sa_family == AF_INET) { /* * XXX * Since resetting hardware takes a very long time * and results in link renegotiation we only * initialize the hardware only when it is absolutely * required. */ ifp->if_flags \|= IFF_UP; if (!(ifp->if_drv_flags & IFF_DRV_RUNNING)) { EM_CORE_LOCK(adapter); em_init_locked(adapter); EM_CORE_UNLOCK(adapter); } arp_ifinit(ifp, ifa); } else error = ether_ioctl(ifp, command, data); break;	2008-03-20 05:35:02 +00:00
kevlo	931fe00266	- Add the Corega CG-WLUSB2GL from NetBSD - Add the Corega CG-WLUSB2GPX	2008-03-20 05:05:37 +00:00
bland	5671837ad2	Improve VT_WAITACTIVE semantics. - Wait for requested vty activation regardless its open state. - Remove redundant console cleanup. Approved by: kib MFC after: 1 week	2008-03-20 04:10:52 +00:00
sam	dc8118c259	add some debug msgs for tracking xfers	2008-03-20 03:11:07 +00:00
sam	478b08fc65	add support for %b printing of request + xfer flags	2008-03-20 03:09:59 +00:00
jeff	4274384df8	- Remove the unused and redundant sched_newproc() function. - Remove the unused and redundant sched_newthread() which peaks into scheduler private structures.	2008-03-20 03:09:15 +00:00
jeff	898428987b	- There is no sense in calling sched_newthread() at thread_init() and thread_fini(). The schedulers initialize themselves properly during sched_fork_thread() anyhow. fini is only called when we're returning the memory to the allocator which surely doesn't care what state the memory is in.	2008-03-20 03:07:57 +00:00
jeff	19aab7bccf	- ULE and 4BSD share only one line of code from sched_newthread() so implement the required pieces in sched_fork_thread(). The td_sched pointer is already setup by thread_init anyway.	2008-03-20 03:06:33 +00:00
jeff	055d72e2b9	- Don't call the empty sched_newproc() function. sched_newproc() already existed as sched_fork() which is a non empty function in both schedulers.	2008-03-20 03:05:17 +00:00
jeff	ded0975003	- Move maybe_preempt() from kern_switch.c to sched_4bsd.c. This is function is only used by 4bsd. - Create a new runq_choose_fuzz() function rather than polluting runq_choose() with 4BSD specific code. - Move the fuzz sysctl into sched_4bsd.c - Remove some dead code from kern_switch.c	2008-03-20 02:14:02 +00:00
jeff	a2c51b3fe4	- Directly include opt_sched.h in sched_4bsd.	2008-03-20 01:32:48 +00:00
jeff	480fb0f310	- Add an option to compile in SCHED_STATS. - Add some more information about SLEEPQUEUE_PROFILING to NOTES.	2008-03-20 01:30:49 +00:00
julian	b8355c8260	Replace really convoluted code that simplifies to "a ^= 0x01;"	2008-03-19 22:29:11 +00:00
kmacy	2deefa8109	fix link management bug and conditionally allow the PHY to be kept on at all times for allowing non-conformant link state checks	2008-03-19 20:56:51 +00:00
alc	caedbf233d	Rename vm_pageq_requeue() to vm_page_requeue() on account of its recent migration to vm/vm_page.c.	2008-03-19 20:24:35 +00:00
dfr	cba668f51c	Fix a regression from the last revision - don't edit the ns_rec list while not holding the lock.	2008-03-19 12:33:25 +00:00
piso	c487d69141	Explicitate the newpacket size. Bug pointed out by: many Pointy hat to: me :(	2008-03-19 11:28:13 +00:00
sobomax	d818a8db68	Revert previous change - it appears that the limit I was hitting was a maxsockets limit, not maxfiles limit. The question remains why those limits are handled differently (with error code for maxfiles but with sleep for maxsokets), but those would be addressed in a separate commit if necessary. Requested by: rwhatson, jeff	2008-03-19 09:58:25 +00:00
pjd	da476acd46	Remove extra uihold() call that accidentally sneak in during perforce change @125544.	2008-03-19 07:52:07 +00:00
jeff	d6d07d2730	- Remove some dead code and comments related to KSE. - Don't set tdq_lowpri on every switch, it should be precisely maintained now. - Add some comments to sched_thread_priority().	2008-03-19 07:36:37 +00:00
jeff	d43ad8d37e	- At the top of sleepq_catch_signals() lock the thread and check TDF_NEEDSIGCHK before doing the very expensive cursig() and related locking. NEEDSIGCHK is updated whenever our signal mask change or when a signal is delivered and should be sufficient to avoid the more expensive tests. This eliminates another source of PROC_LOCK contention in multithreaded programs.	2008-03-19 07:35:14 +00:00
jeff	d4862a02d2	- Remove stale comment. - In the last revision the code was changed to use maxfilesperproc rather than the per-process file limit to restrict the size of the poll array. This eliminates a significant source of process lock contention in multithreaded programs and is cheaper. This had been committed with the wrong batch of changes.	2008-03-19 07:33:16 +00:00
pjd	2023a8c5fd	Oops. Use atomic_add_long() for atomic_fetchadd_long() (not atomic_add_int()) for sparc64 and sun4v. Noticed by: marius	2008-03-19 07:27:24 +00:00
jeff	ea2b75bd30	- Add a facility similar to LOCK_PROFILING under SLEEPQUEUE_PROFILING. Keep a simple (wmesg, count) tuple in a hash to keep track of how many times we sleep at each wait message. We hash on message and not channel. No line number information is given as typically wait messages are not used in more than one place. Identical strings defined at different addresses will show up with seperate counters. - Use debug.sleepq.enable to enable, .reset to reset, and .stats dumps stats. - Do an unsynchronized check in sleepq_switch() prior to switching before calling sleepq_profile() which uses a global lock to synchronize the hash. Only sleeps which actually cause a context switch are counted.	2008-03-19 07:22:07 +00:00
jeff	4cd4553bb5	- Fix the last of the threading bugs that were introduced as far back as 1.38 in 2001. Break out of the FOREACH_THREAD_IN_PROC loop when we've discovered a new proc in the chain. - Increment i and check for maxlockdepth once per matching process not once per thread. This didn't properly terminate the loop before. - Fix a bug which has existed potentially since rev 1.1. waitblock->lf_next can be NULL when a thread has been woken-up but not yet scheduled. Check for this condition rather than blindly dereferencing. Found by: libMicro	2008-03-19 07:13:24 +00:00
jeff	4350e599a3	- Restore the NULL check for td_cpuset. This can happen if a partially constructed thread was torn down as is the case when we fail to allocate a kernel stack.	2008-03-19 06:20:21 +00:00
jeff	46f09d5bc3	- Relax requirements for p_numthreads, p_threads, p_swtick, and p_nice from requiring the per-process spinlock to only requiring the process lock. - Reflect these changes in the proc.h documentation and consumers throughout the kernel. This is a substantial reduction in locking cost for these fields and was made possible by recent changes to threading support.	2008-03-19 06:19:01 +00:00
dfr	f46620ae37	Don't call nfs_realign while holding locks. Reviewed by: kib	2008-03-18 18:42:59 +00:00
jhb	ee328ab541	Catch up to intr_event_create() prototype change. Pointy hat: jhb	2008-03-18 13:31:45 +00:00
lulf	8a5c25a52b	- Fix a memory leak when re-discovering a gvinum configuration. Approved by: pjd (mentor) MFC after: 1 week	2008-03-18 08:48:51 +00:00
adrian	63f525812c	Sign-extend the 48-bit AMD PMC counter before treating it to a 64-bit 2's compliment. The 2's compliment transform is done so a "count down" sampling interval can be converted into a "count up" PMC value. a 2's complimented 'count down' value is written to the PMC counter; then the read-back counter is reverted via another 2's compliment. PR: kern/121660 Reviewed by: jkoshy Approved by: jkoshy MFC after: 1 week	2008-03-18 08:39:11 +00:00
adrian	26e3101bd9	Fix the debugging output - the '0x' was duplicated from the %p option.	2008-03-18 08:36:19 +00:00
alc	4e9b2a2931	Almost seven years ago, vm/vm_page.c was split into three parts: vm/vm_contig.c, vm/vm_page.c, and vm/vm_pageq.c. Today, vm/vm_pageq.c has withered to the point that it contains only four short functions, two of which are only used by vm/vm_page.c. Since I can't foresee any reason for vm/vm_pageq.c to grow, it is time to fold the remaining contents of vm/vm_pageq.c back into vm/vm_page.c. Add some comments. Rename one of the functions, vm_pageq_enqueue(), that is now static within vm/vm_page.c to vm_page_enqueue(). Eliminate PQ_MAXCOUNT as it no longer serves any purpose.	2008-03-18 06:52:15 +00:00
kmacy	8070f0fd71	- Integrate 1.133 vendor driver changes - update some copyrights - add improved support for delayed ack - fix issue with fec	2008-03-18 03:55:12 +00:00
piso	965a19847a	Don't cache ptr to nat rule in case of tablearg argument. Bug spotted by: Dyadchenko Mihail	2008-03-17 23:02:56 +00:00
jhb	c04bb048f6	Simplify the interrupt code a bit: - Always include the ie_disable and ie_eoi methods in 'struct intr_event' and collapse down to one intr_event_create() routine. The disable and eoi hooks simply aren't used currently in the !INTR_FILTER case. - Expand 'disab' to 'disable' in a few places. - Use function casts for arm and i386:intr_eoi_src() instead of wrapper routines since to trim one extra indirection. Compiled on: {arm,amd64,i386,ia64,ppc,sparc64} x {FILTER, !FILTER} Tested on: {amd64,i386} x {FILTER, !FILTER}	2008-03-17 22:42:01 +00:00
piso	10c89ab0d1	Don't abuse stack space while in kernel land, use heap instead.	2008-03-17 22:08:31 +00:00
antoine	a52d65bf2b	Simplify fcntl(SVR4_F_DUP2FD) code now that FreeBSD has F_DUP2FD. Approved by: rwatson (mentor)	2008-03-17 18:27:28 +00:00
scottl	97df7d0709	Locking in the ses_ioctl handler doesn't have to be so strict because the referenced data is only obtained/changed in the device open handler, and the ioctl handler can only run after the open handler. Also fix a few nearby style issues. Submitted by: Matt Jacob	2008-03-17 17:18:16 +00:00
kib	d5211e24af	Fix two races in the handling of the d_gianttrick for the D_NEEDGIANT drivers. In the giant_XXX wrappers for the device methods of the D_NEEDGIANT drivers, do not dereference the cdev->si_devsw. It is racing with the destroy_devl() clearing of the si_devsw. Instead, use the dev_refthread() and return ENXIO for the destroyed device. [1] The check for the D_INIT in the prep_cdevsw() was not synchronized with the call of the fini_cdevsw() in destroy_devl(), that under rapid device creation/destruction may result in the use of uninitialized cdevsw [2]. Change the protocol for the prep_cdevsw(), requiring it to be called under dev_mtx, where the check for D_INIT is done. Do not free the memory allocated for the gianttrick cdevsw while holding the dev_mtx, put it into the free list to be freed later. Reuse the d_gianttrick pointer to keep the size and layout of the struct cdevsw (requested by phk). Free the memory in the dev_unlock_and_free(), and do all the free after the dev_mtx is dropped (suggested by jhb). Reported by: bsdimp + many [1], pho [2] Reviewed by: phk, jhb Tested by: pho MFC after: 1 week	2008-03-17 13:17:10 +00:00
rwatson	d1dffe342c	Fix indentation for a closing brace in in_pcballoc(). MFC after: 3 days	2008-03-17 13:04:56 +00:00
pjd	cf9cd1298d	- There is no more "uidinfo struct" mutex. - The "uidinfo hash" lock is now a rwlock. Reminded by: kib	2008-03-17 11:48:40 +00:00
phk	916647a52a	Add a "spindown" facility to ata-disks: If no requests have been received for a configurable number of seconds, spin the disk down. Spin it back up on the next request. Notice that the timeout is only armed by a request, so to spin down a disk you may have to do: atacontrol spindown ad10 5 dd if=/dev/ad10 of=/dev/null count=1 To disable spindown, set timeout to zero: atacontrol spindown ad10 0 In order to debug any trouble caused, this code is somewhat noisy on the console. Enabling spindown on a disk containing / or /var/log/messages is not going to do anything sensible. Spinning a disk up and down all the time will wear it out, use sensibly. Approved by: sos	2008-03-17 10:33:23 +00:00
phk	341d5f653b	A cautionary XXX comment about seemingly bogus errata checks.	2008-03-17 09:05:15 +00:00
phk	234bd357a6	Increase time we wait for things to settle to 1 millisecond, 10 microseconds is too short. Always set the cpu to the highest frequency so that we get through boot and don't handicap cpus where powerd(8) is not used.	2008-03-17 09:01:43 +00:00
phk	7f4e520791	Revert last commit and stop committing before morning tea.	2008-03-17 09:00:59 +00:00
phk	0ef978b056	Increase time we wait for things to settle to 1 millisecond, 10 microseconds is too short. Always set the cpu to the highest frequency so that we get through boot and don't handicap cpus where powerd(8) is not used.	2008-03-17 08:38:38 +00:00
weongyo	de9984d139	don't set sniffer mode to ON when the driver is running with the monitor mode. This solves a problem that sometimes mangled frames are passed. Submitted by: Werner Backes <werner_at_bit-1.de> Tested by: Werner Backes <werner_at_bit-1.de> PR: kern/121608 Approved by: thompsa (mentor)	2008-03-17 02:30:13 +00:00
thompsa	bcbf813313	Remove extra semicolons. Pointed out by: antoine	2008-03-17 01:26:44 +00:00
marcel	380d312794	Make remote GDB work for AIM processors. For BookE, the kernel will have a special section, named .PPC.EMB.apuinfo, which will tell GDB that a BookE processor is targeted and which will result in GDB using a different register definition. In order to support remote GDB for BookE, we need the GDB stub in the kernel look for that section and use the BookE definitions.	2008-03-17 00:46:52 +00:00
phk	573f4ecf0d	Use correct bitmask for identifying chip family.	2008-03-17 00:36:16 +00:00
mav	b09a1d85ff	Remove impossible (hk_peer == NULL) check from ng_address_hook(). Valid hook can't have NULL peer. Even invalid one can't, as it is resets to deadhook, but not NULL.	2008-03-16 23:12:17 +00:00
mav	c3b3361aa8	Add session ID hashing to speedup incoming packets dispatch in case of many connections working via the same tunnel. For example, in case of full "client <-> LAC <-> LNS" setup.	2008-03-16 21:33:12 +00:00
pjd	4ef010fa26	Whitespace cleanups.	2008-03-16 21:32:20 +00:00
pjd	9123873999	- Use wait-free method to manage ui_sbsize and ui_proccnt fields in the uidinfo structure. This entirely removes contention observed on the ui_mtxp mutex (as it is now gone). - Convert the uihashtbl_mtx mutex to a rwlock, as most of the time we just need to read-lock it. Reviewed by: jhb, jeff, kris & others Tested by: kris	2008-03-16 21:29:02 +00:00
pjd	ea49d310bf	Implement atomic_fetchadd_long() for all architectures and document it. Reviewed by: attilio, jhb, jeff, kris (as a part of the uidinfo_waitfree.patch)	2008-03-16 21:20:50 +00:00
thompsa	bc8c8477a0	Switch the LACP state machine over to its own mutex to protect the internals, this means that it no longer grabs the lagg rwlock. Use two port table arrays which list the active ports for Tx and switch between them with an atomic op. Now the lagg rwlock is only exclusively locked for management (ioctls) and queuing of lacp control frames isnt needed.	2008-03-16 19:25:30 +00:00
rwatson	e7b290ea3d	Consistently use ANSI C declarationsfor all functions in kern_synch.c.	2008-03-16 18:59:21 +00:00
pjd	52f3c4136e	Style fixes.	2008-03-16 18:26:59 +00:00
pjd	d61d590ad7	Fix information leak. We can find PIDs of running processes from within a jail, etc. by simply calling setpriority(PRIO_PROCESS, <PID>, 0) and checking the return value: 0 means that the process exists and -1 that it doesn't exist. Reviewed by: rwatson MFC after: 1 week	2008-03-16 17:55:06 +00:00
alc	0de51cf047	Simplify the inner loop of vm_fault()'s delete-behind heuristic. Instead of checking each page for PG_UNMANAGED, perform a one-time check whether the object is OBJT_PHYS. (PG_UNMANAGED pages only belong to OBJT_PHYS objects.)	2008-03-16 17:37:19 +00:00
pjd	6a68218ce1	Implement soon-to-be-used rw_unlock() macro.	2008-03-16 17:10:52 +00:00
rdivacky	64c7931e65	Regen.	2008-03-16 16:29:37 +00:00
rdivacky	b13a84dcb7	Implement sched_setaffinity and get_setaffinity using real cpu affinity setting primitives. Reviewed by: jeff Approved by: kib (mentor)	2008-03-16 16:27:44 +00:00
rwatson	c1f067745e	Commit SYSINIT() ;-adding patch missed in previous pass. MFC after: 1 month Caught by: tinderbox	2008-03-16 13:02:04 +00:00
rwatson	2cdb34b36a	Remove trailing ';' from C_SYSINIT() macro definition, in keeping with style(9) recommendation that macros not contain the terminating ';', leaving that to the invoker. All SYSINIT() consumers must now provide a trailing ';'. Unlike the change to remove the ';'s from callers, this change shouldn't be MFC'd unless we don't mind requiring source changes to third party modules that might still depend on SYSINIT() providing its own ';'.	2008-03-16 11:01:32 +00:00
rwatson	877d7c65ba	In keeping with style(9)'s recommendations on macros, use a ';' after each SYSINIT() macro invocation. This makes a number of lightweight C parsers much happier with the FreeBSD kernel source, including cflow's prcc and lxr. MFC after: 1 month Discussed with: imp, rink	2008-03-16 10:58:09 +00:00
sobomax	1560402d31	Properly set size of the file_zone to match kern.maxfiles parameter. Otherwise the parameter is no-op, since zone by default limits number of descriptors to some 12K entries. Attempt to allocate more ends up sleeping on zonelimit. MFC after: 2 weeks	2008-03-16 06:21:30 +00:00
pjd	f924bfc1c5	Fix mmap(2) on ZFS after some changes in VM subsystem. Submitted by: alc Reported by: kris (originally) and many others Tested with: fsx MFC after: 1 week	2008-03-15 23:23:04 +00:00
ru	a786aa30ca	Fix panic on e.g. "kldload /dev/null". PR: kern/121427 Reviewed by: sem MFC after: 3 days	2008-03-15 17:40:18 +00:00
imp	be829c21fb	BUS_DMA_ISA is left over from Alpha, and is not used in the tree at all. The reference in ia64 code is due to cutNpaste in its history and can safely be removed. Revired by: cognet, raj, marcel, jhb and maybe one other whom I'm forgetting	2008-03-15 06:44:45 +00:00
emaste	3d21898757	Change spelling and eliminate a typo in comments to reduce diffs with Adaptec's vendor driver. I have some fixes to bring in and this makes ongoing review of the FreeBSD-Adaptec driver diffs easier.	2008-03-14 21:59:11 +00:00
jhb	9c113163fb	Add preliminary support for binding interrupts to CPUs: - Add a new intr_event method ie_assign_cpu() that is invoked when the MI code wishes to bind an interrupt source to an individual CPU. The MD code may reject the binding with an error. If an assign_cpu function is not provided, then the kernel assumes the platform does not support binding interrupts to CPUs and fails all requests to do so. - Bind ithreads to CPUs on their next execution loop once an interrupt event is bound to a CPU. Only shared ithreads are bound. We currently leave private ithreads for drivers using filters + ithreads in the INTR_FILTER case unbound. - A new intr_event_bind() routine is used to bind an interrupt event to a CPU. - Implement binding on amd64 and i386 by way of the existing pic_assign_cpu PIC method. - For x86, provide a 'intr_bind(IRQ, cpu)' wrapper routine that looks up an interrupt source and binds its interrupt event to the specified CPU. MI code can currently (ab)use this by doing: intr_bind(rman_get_start(irq_res), cpu); however, I plan to add a truly MI interface (probably a bus_bind_intr(9)) where the implementation in the x86 nexus(4) driver would end up calling intr_bind() internally. Requested by: kmacy, gallatin, jeff Tested on: {amd64, i386} x {regular, INTR_FILTER}	2008-03-14 19:41:48 +00:00
bz	33dfb1706b	Correct IPsec behaviour with a 'use' level in SP but no SA available. In that case return an continue processing the packet without IPsec. PR: 121384 MFC after: 5 days Reported by: Cyrus Rahman (crahman gmail.com) Tested by: Cyrus Rahman (crahman gmail.com) [slightly older version]	2008-03-14 16:38:11 +00:00
bz	ee90b5b6c8	Remove the "Fast " from the "Fast IPsec: Initialized Security Association Processing." printf. People kept asking questions about this after the IPsec shuffle. This still is the Fast IPsec implementation so no worries that it would be any slower now. There are no functional changes. Discussed with: sam MFC after: 4 days	2008-03-14 16:25:40 +00:00
jkim	90e95d13b0	Add a quirk to ignore ASUS LCM display found on some ASUS laptops.	2008-03-14 15:59:30 +00:00
jhb	c162b59cc2	Make the function prototype for cpu_search() match the declaration so that this still compiles with gcc3.	2008-03-14 15:22:38 +00:00
bz	51315b3d89	Correct reference counting on the SP for outgoing IPv6 IPsec connections. PR: 121374 Reported by: Cyrus Rahman (crahman gmail.com) Tested by: Cyrus Rahman (crahman gmail.com) MFC after: 5 days	2008-03-14 11:55:04 +00:00
bz	f507f0e4fa	#if 0 out a currently unsued (and incomplete) function: ip6_ipsec_mtu(). No need to compile 'dead' code. I am leaving it in because we will have to review the concept and should use the common function in various places. MFC after: 5 days	2008-03-14 11:44:30 +00:00
bz	693055a8ae	Replace the function name in two identical printfs by __func__, __LINE__ so we can distinguish them when people report a problem. PR: 121373 MFC after: 5 days	2008-03-14 11:09:11 +00:00
nyan	5d1ed3596a	Add stub for pc98.	2008-03-14 09:00:04 +00:00
jkoshy	b2f6c016f9	Correct a typo.	2008-03-14 06:16:18 +00:00
jhb	8293fe75e4	Fix a silly bogon which prevented all the CPUs that are tagged as interrupt receivers from being given interrupts if any CPUs in the system were not tagged as interrupt receivers that I introduced when switching the x86 interrupt code to track CPUs via FreeBSD CPU IDs rather than local APIC IDs. In practice this only affects systems with Hyperthreading (though disabling HTT in the BIOS would workaround the issue) as that is the only case currently where one can have CPUs that aren't tagged as interrupt receivers. On a Dell SC1425 test box with 2 x Xeon w/ HTT (so 4 logical CPUs of which 2 were interrupt receivers) the result was that all device interrupts were sent to CPU 0. MFC after: 1 week Pointy hat to: jhb	2008-03-14 03:44:42 +00:00
jhb	64ab71ccbd	Rework how the nexus(4) device works on x86 to better handle the idea of different "platforms" on x86 machines. The existing code already handles having two platforms: ACPI and legacy. However, the existing approach was rather hardcoded and difficult to extend. These changes take the approach that each x86 hardware platform should provide its own nexus(4) driver (it can inherit most of its behavior from the default legacy nexus(4) driver) which is responsible for probing for the platform and performing appropriate platform-specific setup during attach (such as adding a platform-specific bus device). This does mean changing the x86 platform busses to no longer use an identify routine for probing, but to move that logic into their matching nexus(4) driver instead. - Make the default nexus(4) driver in nexus.c on i386 and amd64 handle the legacy platform. It's probe routine now returns BUS_PROBE_GENERIC so it can be overriden. - Expose a nexus_init_resources() routine which initializes the various resource managers so that subclassed nexus(4) drivers can invoke it from their attach routine. - The legacy nexus(4) driver explicitly adds a legacy0 device in its attach routine. - The ACPI driver no longer contains an new-bus identify method. Instead it exposes a public function (acpi_identify()) which is a probe routine that the MD nexus(4) drivers can use to probe for ACPI. All of the probe logic in acpi_probe() is now moved into acpi_identify() and acpi_probe() is just a stub. - On i386 and amd64, an ACPI-specific nexus(4) driver checks for ACPI via acpi_identify() and claims the nexus0 device if the probe succeeds. It then explicitly adds an acpi0 device in its attach routine. - The legacy(4) driver no longer knows anything about the acpi0 device. - On ia64 if acpi_identify() fails you basically end up with no devices. This matches the previous behavior where the old acpi_identify() would fail to add an acpi0 device again leaving you with no devices. Discussed with: imp Silence on: arch@	2008-03-13 20:39:04 +00:00
cokane	1dbd762cae	Replace the non-MPSAFE timeout(9) API in ffs_softdep.c with the MPSAFE callout_* API (e.g. callout_init_mtx(9)). This was one of the numerous items on the http://wiki.freebsd.org/SMPTODO list. Reviewed by: imp, obrien, jhb MFC after: 1 week	2008-03-13 20:15:48 +00:00
jhb	990c3cc04e	Use the SMAP data from the loader if it is provided instead of using virtual 86 mode to query the BIOS directly. This is needed for certain HP machines whose BIOS only provide an SMAP when invoked from real mode. On such machines the loader will be able to query the SMAP successfully due to the recent BTX changes, but the kernel will not. One thing I'm not sure of is if we can skip the INT 12h probe altogether if we have the SMAP from the loader as it seems that we do the INT 12h probe to setup enough state so we can use vm86 to call the BIOS. MFC after: 1 week	2008-03-13 18:56:53 +00:00
obrien	d0848f449c	style(9) & style.Makefile(9) Reviewed by: raj	2008-03-13 17:54:21 +00:00
cokane	029b349a7c	Add the module dependency on the mem(4) module. This will fix the module failing to load on a kernel that has "nodevice mem" in the config. It will now properly bring in the mem(4) module. Submitted by: antoine Reviewed by: imp MFC after: 1 week	2008-03-13 14:08:41 +00:00
kib	be9c86776f	Since version 4.3, gcc changed its behaviour concerning the i386/amd64 ABI and the direction flag, that is it now assumes that the direction flag is cleared at the entry of a function and it doesn't clear once more if needed. This new behaviour conforms to the i386/amd64 ABI. Modify the signal handler frame setup code to clear the DF {e,r}flags bit on the amd64/i386 for the signal handlers. jhb@ noted that it might break old apps if they assumed DF == 1 would be preserved in the signal handlers, but that such apps should be rare and that older versions of gcc would not generate such apps. Submitted by: Aurelien Jarno <aurelien aurel32 net> PR: 121422 Reviewed by: jhb MFC after: 2 weeks	2008-03-13 10:54:38 +00:00
kib	efc24456b8	Add missed parentheses	2008-03-13 09:52:48 +00:00
davidxu	41623df806	Add const qualifier to cpuset mask's pointer, since the cpuset mask should be not changed by the system call.	2008-03-13 02:56:11 +00:00
jeff	a469063987	PR 117603 - Close a sleepqueue signal race by interlocking with the per-process spinlock. This was mistakenly omitted from the thread_lock patch and has been a race since. MFC After: 1 week PR: bin/117603 Reported by: Danny Braniss <danny@cs.huji.ac.il>	2008-03-13 00:46:12 +00:00

... 2 3 4 5 6 ...

66939 Commits