freebsd-skq

Author	SHA1	Message	Date
arr	80237bc833	- Massive style fixup. Reviewed by: mike Approved by: dfr	2002-02-22 04:14:49 +00:00
bp	72144faea4	Add support for iovcnt greater than 1. This should resolve problems with some applications. Obtained from: Darwin project MFC after: 2 weeks	2002-02-21 16:23:38 +00:00
bde	e31f481df2	Fixed some style bugs. Added a comment about a bug in PT_SSTEP. Approved by: des	2002-02-21 04:47:38 +00:00
bde	57654bd7db	Recover bits that were lost in transition in rev.1.76: - P_INMEM checks in all the functions. P_INMEM must be checked because PHOLD() is broken. The old bits had bogus locking (using sched_lock) to lock P_INMEM. After removing the P_INMEM checks, we were left with just the bogus locking. - large comments. They were too large, but better than nothing. Remove obfuscations that were gained in transition in rev.1.76: - PROC_REG_ACTION() is even more of an obfuscation than PROC_ACTION(). The change copies procfs_machdep.c rev.1.22 of i386/procfs_machdep.c verbatim except for "fixing" the old-style function headers and adjusting function names and comments. It doesn't remove the bogus locking. Approved by: des	2002-02-21 04:37:55 +00:00
julian	8ff28c6ae2	Oops, used wrong error value for unimplemented syscalls.	2002-02-20 22:27:09 +00:00
peter	1906d1360c	Tidy up some unused variables	2002-02-20 21:25:44 +00:00
arr	2ea45f625c	- Fix style further by adding parentheses around return values so that they look like: return (val); instead of: return val;	2002-02-20 16:05:30 +00:00
arr	1b20e66ad4	- Style.9 formatting fix; this commit is mostly white space related with the next commit actually doing the: return val; -> return (val); changes. This commit was done in preparation for getting ``struct modules'' locked down. Reviewed by: bde Approved by: dfr	2002-02-20 14:30:02 +00:00
rwatson	5587a218c1	More cleanups relating to vm object allocation failure: make sure we call VOP_CLOSE() with vp unlocked; clean up the return path a little, in as much as our namei/vnode operation return paths can be cleared up. For a return case that was apparently never taken, this sure is ugly. Reviewed by: jeffr	2002-02-20 00:11:57 +00:00
silby	e561ca6dce	A few misc forkbomb defenses: - Leave 10 processes for root-only use, the previous value of 1 was insufficient to run ps ax \| more. - Remove the printing of "proc: table full". When the table really is full, this would flood the screen/logs, making the problem tougher to deal with. - Force any process trying to fork beyond its user's maximum number of processes to sleep for .5 seconds before returning failure. This turns 2000 rampaging fork monsters into 2000 harmlessly snoozing fork monsters. Reviewed by: dillon, peter MFC after: 1 week	2002-02-19 03:15:28 +00:00
julian	ffdde8ed99	Add stub syscalls and definitions for KSE calls. "Book'em Danno"	2002-02-19 02:40:31 +00:00
julian	6ebb003bcc	Add 5 KSE syscalls. Two will be implemented with the next KSE step and the others are reservations for coming code. All will be stubbed in this kernel in the next commit. This will allow people to easily make KSE binaries for userland testing (the syscalls will be in libc) but they will still need a real KSE kernel to test it. (libc looks in /sys to decide what it should add stubs for).	2002-02-19 02:19:36 +00:00
dillon	f2e166dc7a	Load the current timecounter into tc. The timecounter global can change at any time and we do not want to call one timercounter's function with another timecounter's structural pointer. MFC after: 3 days	2002-02-18 19:49:30 +00:00
dillon	7c64675013	Add kern_giant_ucred to instrument Giant around ucred related operations such a getgid(), setgid(), etc...	2002-02-18 17:51:47 +00:00
phk	68389bd8ba	Make v_addpollinfo() visible and non-inline. Have callers only call it as needed. Add necessary call in ufs_kqfilter(). Test-case found by: Andrew Gallatin <gallatin@cs.duke.edu>	2002-02-18 16:18:02 +00:00
rwatson	22b5668b60	Rehash of 1.43: simply remove the comment, since it's highly redundant and only partially correct.	2002-02-18 16:02:24 +00:00
iedowse	269a0d55a1	Add the braces missed by revision 1.131. Pointy hat to: rwatson	2002-02-18 12:46:18 +00:00
phk	e340a5652f	Take the common case of gettimeofday(&tv, NULL) out from under Giant.	2002-02-18 08:40:28 +00:00
phk	68320d04d1	Remove yet a redundant VN_KNOTE() macro.	2002-02-18 08:24:48 +00:00
dillon	15dc81ca3d	The ICANON flag is an lflag, not an iflag. Submitted by: Neelkanth Natu <neelnatu@yahoo.com> MFC after: 3 days	2002-02-18 06:07:11 +00:00
rwatson	43f97fe4ae	When vn_open() is failing because it cannot allocate a vm object, call VOP_CLOSE() on the vnode, so that VOP_OPEN() and VOP_CLOSE() calls are symmetric in all failure cases. This prevents an 'open' reference from being leaked in that unlikely failure scenario.	2002-02-18 00:26:10 +00:00
rwatson	946a79fb74	style(9) prefers formatted comments in '/' ... '/' as opposed to #if 0'd.	2002-02-18 00:23:44 +00:00
rwatson	fc479aab62	Per discussion at BSDCon, note that the vop_getattr locking protocol should require a shared lock, rather than an exclusive lock, which can improve performance. No actual code change here, since a number of VFS locking fixes are in the works.	2002-02-18 00:22:57 +00:00
phk	c2a47cdbe8	Move the stuff related to select and poll out of struct vnode. The use of the zone allocator may or may not be overkill. There is an XXX: over in ufs/ufs/ufs_vnops.c that jlemon may need to revisit. This shaves about 60 bytes of struct vnode which on my laptop means 600k less RAM used for vnodes.	2002-02-17 21:15:36 +00:00
phk	4a29a670db	Remove cache_purgeleafdirs(), it has been #if 0 for quite some time.	2002-02-17 20:40:29 +00:00
deischen	71ac8c05bf	Regenerate these files after change to syscalls.master.	2002-02-17 17:42:47 +00:00
deischen	90f4e08327	Fix prototype to sigreturn to use struct __ucontext instead of ucontext_t.	2002-02-17 17:41:28 +00:00
dillon	683d9b041d	replace the embedded cr_mtx in the ucred structure with cr_mtxp (a mutex pointer), and use the mutex pool routines. This greatly reduces the size of the ucred structure.	2002-02-17 07:30:34 +00:00
julian	abe785e035	If the credential on an incoming thread is correct, don't bother reaquiring it. In the same vein, don't bother dropping the thread cred when goinf ot userland. We are guaranteed to nned it when we come back, (which we are guaranteed to do). Reviewed by: jhb@freebsd.org, bde@freebsd.org (slightly different version)	2002-02-17 01:09:56 +00:00
green	80dcdefa0d	(Doing that whole test-immediately-after-commit-thing like obrien sez:) Forgot to include lock.h and mutex.h for GIANT_REQUIRED.	2002-02-16 17:44:43 +00:00
green	5c8476e5d9	Add revoke_and_destroy_dev(), to be used by devices which decide when they choose to destroy themselves without regard to whether or not they are open.	2002-02-16 17:35:05 +00:00
bde	60f2b0c638	Fixed a typo in rev.1.65 that gave a reference to a nonexistent variable. This was not detected by LINT because LINT is missing COMPAT_SUNOS.	2002-02-15 03:54:01 +00:00
luigi	8129d9e355	Make this compile after changes to kse structures. This escaped because DEVICE_POLLING is disabled in LINT being not compatible with SMP. In fact, it is only a runtime problem, so if we could recognize that we are building a LINT kernel we could as well disable the check for SMP being defined. Reported-by: Joe Clarke	2002-02-15 02:50:07 +00:00
alc	f64c08b326	o Clearing p/td_retval[0] after aio_newproc() is unnecessary. (We stopped calling rfork() to create aio threads in revision 1.46.) o Don't recompute the FILE * when it's already stored in the kernel's AIOCB.	2002-02-12 17:40:41 +00:00
alc	95059f78ce	The previous commit included a change to fill_kinfo_proc() that results in a NULL pointer dereference. Repair this mistake.	2002-02-12 04:21:28 +00:00
luigi	57d5032c48	MFS: synchronize the code with the version in -stable, specifically: + SYSCTL_ULONG -> SYSCTL_UINT + some procedure renaming and variable rearrangement + fix the 'interface going deaf' problem same as in -stable.	2002-02-11 23:56:18 +00:00
julian	37369620df	In a threaded world, differnt priorirites become properties of different entities. Make it so. Reviewed by: jhb@freebsd.org (john baldwin)	2002-02-11 20:37:54 +00:00
obrien	f898027669	Allow one to specify the AWK used in the environment(commandline). Gawk is blowing up when run natively on the sparc64 -- leading to totally bogus kernel values (all "0x0"). Good ole BWK awk works fine however.	2002-02-11 03:54:30 +00:00
phk	6f1345e7e7	GC the unused einval() Obtained from: ~bde/sys.dif.gz	2002-02-10 22:07:41 +00:00
phk	95c1f2fa1e	Style(9) nits. Obtained from: ~bde/sys.dif.gz	2002-02-10 22:04:44 +00:00
rwatson	3accefe9ca	Add a comment indicating that the locking protocol should be updated to be 'L L L' for vop_getattr(). Don't update it yet, because there are still many offenders.	2002-02-10 21:46:16 +00:00
rwatson	5e6a46b8e5	Add a comment indicating that VOP_GETATTR() is called without appropriate locking in the core dump code. This should be fixed.	2002-02-10 21:45:16 +00:00
rwatson	2bbae54d18	Make sure to hold vnode lock when calling into VOP_GETATTR(). Discussed with: mckusick, phk	2002-02-10 21:44:30 +00:00
rwatson	e2e50cbb21	Add a comment indicating that the vnode locking in this section of the kernel linker code may be wrong: it fails to hold a lock across the call to VOP_GETATTR(), and vn_rdwr() with IO_NODELOCKED.	2002-02-10 21:29:02 +00:00
rwatson	aa54f85939	Make sure to grab vnode lock on a vnode before calling VOP_GETATTR() to perform an ownership test in revoke(). This is also required for MAC hooks so that the vnode lock is held during a call to the MAC framework. Release the lock before calling VOP_REVOKE(). Discussed with: phk, mckusick	2002-02-10 20:45:43 +00:00
rwatson	eef638ac93	Remove a stray 'const' that slept into extattr_set_vp(), and could result in compiler warnings.	2002-02-10 05:31:55 +00:00
rwatson	94eec10ab3	Part II: Update system calls for extended attributes. Rebuild of generated files.	2002-02-10 04:44:37 +00:00
rwatson	6ca91a055c	Part I: Update extended attribute API and ABI: o Modify the system call syntax for extattr_{get,set}_{fd,file}() so as not to use the scatter gather API (which appeared not to be used by any consumers, and be less portable), rather, accepts 'data' and 'nbytes' in the style of other simple read/write interfaces. This changes the API and ABI. o Modify system call semantics so that extattr_get_{fd,file}() return a size_t. When performing a read, the number of bytes read will be returned, unless the data pointer is NULL, in which case the number of bytes of data are returned. This changes the API only. o Modify the VOP_GETEXTATTR() vnode operation to accept a *size_t argument so as to return the size, if desirable. If set to NULL, the size will not be returned. o Update various filesystems (pseodofs, ufs) to DTRT. These changes should make extended attributes more useful and more portable. More commits to rebuild the system call files, as well as update userland utilities to follow. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-02-10 04:43:22 +00:00
julian	11fa082235	Replace accidentally removed setrunqueue() solves problem with machines failing to sync in booting. Submitted by: Tor.Egge@cvsup.no.freebsd.org	2002-02-09 01:38:16 +00:00
jhb	340b988d0b	Use the mtx_owner() macro in one spot in _mtx_lock_sleep() to make the code easier to read.	2002-02-09 00:12:53 +00:00
tmm	e5ac0f9e2f	Fix a bug introduced in r. 1.28: when copy{in,out} would fail for an iovec that was not the last one in the uio, the error would be ignored silently. Bug found and fix proposed by: jhb	2002-02-08 20:19:44 +00:00
peter	0c6681c435	Fix broken Giant locking protocol introduced in rev 1.114. You cannot unlock Giant if it is not locked in the first place. This make the nfstat(2) syscall (#278) a nice panic(2) implementation.	2002-02-08 09:16:57 +00:00
peter	3589cfc992	Bah, I managed to turn cosmetic things into real bugs. Fix shadowed variable declarations. :-( Definately not my day today.	2002-02-08 08:56:01 +00:00
rwatson	d45677811a	o Merge various recent fixes from the MAC branch relating to extattrctl(): - Fix null-pointer dereference introduced when snapshotting was introduced. This occured because unlike the previous code, vn_start_write() doesn't always return a non-NULL mp, as filesystems may not support the VOP_GETWRITEMOUNT() call. For now, rely on two pointers, so that vn_finished_write() works properly. - Fix locking problems on exit, introduced at some past time, some when snapshots came in, where a vnode might not be unlocked before being vrele'd in various error situations. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-02-08 05:58:41 +00:00
peter	4f71c6828b	Fix a fatal trap when using ksched_setscheduler() (eg: mozilla, netscape etc) which use: td->td_last_kse->ke_flags \|= KEF_NEEDRESCHED;	2002-02-08 02:56:10 +00:00
julian	6145412058	remove superfluous blank line	2002-02-08 01:38:32 +00:00
peter	4289b50433	Fix a couple of style bugs introduced (or touched by) previous commit.	2002-02-07 23:06:26 +00:00
peter	59b80f6697	Fix a whole bunch of long lines introduced by previous commit by using td = FIRST_THREAD_IN_PROC(p) once, after we have identified the process that we are operating on.	2002-02-07 23:05:40 +00:00
phk	a028cfa65d	Revise timercounters to use binary fixed point format internally. The binary format "bintime" is a 32.64 format, it will go to 64.64 when time_t does. The bintime format is available to consumers of time in the kernel, and is preferable where timeintervals needs to be accumulated. This change simplifies much of the magic math inside the timecounters and improves the frequency and time precision by a couple of bits. I have not been able to measure a performance difference which was not a tiny fraction of the standard deviation on the measurements.	2002-02-07 21:21:55 +00:00
julian	b5eb64d6f0	Pre-KSE/M3 commit. this is a low-functionality change that changes the kernel to access the main thread of a process via the linked list of threads rather than assuming that it is embedded in the process. It IS still embeded there but remove all teh code that assumes that in preparation for the next commit which will actually move it out. Reviewed by: peter@freebsd.org, gallatin@cs.duke.edu, benno rice,	2002-02-07 20:58:47 +00:00
jhb	156f4c8aea	Fixes for alpha pmap on SMP machines: - Create a private list of active pmaps rather than abusing the list of all processes when we need to look up pmaps. The process list needs a sx lock and we can't be getting sx locks in the middle of cpu_switch() (pmap_activate() can call pmap_get_asn() from cpu_switch()). Instead, we protect the list with a spinlock. This also means the list is shorter since a pmap can be used by more than one process and we could (at least in thoery) dink with pmap's more than once, but now we only touch each pmap once when we have to update all of them. - Wrap pmap_activate()'s code to get a new ASN in an explicit critical section so that when it is called while doing an exec() we can't get preempted. - Replace splhigh() in pmap_growkernel() with a critical section to prevent preemption while we are adjusting the kernel page tables. - Fixes abuse of PCPU_GET(), which doesn't return an L-value. - Also adds some slight cleanups to the ASN handling by adding some macros instead of magic numbers in relation to the ASN and ASN generations. Reviewed by: dfr	2002-02-06 04:30:26 +00:00
dillon	9371a9a23b	Allow the kern.maxusers boot tuneable to be set to 0 (previously only the kernel config's maxusers could be set to 0 for autosizing to work). Reviewed by: rwatson, imp MFC after: 3 days	2002-02-06 01:19:19 +00:00
alfred	c6a128a4b9	Fix a race with free'ing vmspaces at process exit when vmspaces are shared. Also introduce vm_endcopy instead of using pointer tricks when initializing new vmspaces. The race occured because of how the reference was utilized: test vmspace reference, possibly block, decrement reference When sharing a vmspace between multiple processes it was possible for two processes exiting at the same time to test the reference count, possibly block and neither one free because they wouldn't see the other's update. Submitted by: green	2002-02-05 21:23:05 +00:00
phk	d0f44978bc	Let the number of timecounters follow hz, otherwise people with HZ=BIGNUM will strain the assumptions behind timecounters to the point where they break. This may or may not help people seeing microuptime() backwards messages. Make the global timecounter variable volatile, it makes no difference in the code GCC generates, but it makes represents the intent correctly. Thanks to: jdp MFC after: 2 weeks	2002-02-05 20:44:56 +00:00
dillon	b3ddc72561	Get rid of the twisted MFREE() macro entirely. Reviewed by: dg, bmilekic MFC after: 3 days	2002-02-05 02:00:56 +00:00
rwatson	10b6b09b25	o Scatter vn_start_write() and vn_finished_write() through ACL code so that it interacts properly with snapshotting. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-02-04 17:58:15 +00:00
rwatson	ca6a9763cb	Note that Kirk apparently missed adding vn_start_write() and friends to kern_acl.c when he added snapshotting. This will need to be added at some point.	2002-02-04 16:41:59 +00:00
mckusick	ca79facdf4	In the routines vrele() and vput(), we must lock the vnode and call VOP_INACTIVE before placing the vnode back on the free list. Otherwise there is a race condition on SMP machines between getnewvnode() locking the vnode to reclaim it and vrele() locking the vnode to inactivate it. This window of vulnerability becomes exaggerated in the presence of filesystems that have been suspended as the inactive routine may need to temporarily release the lock on the vnode to avoid deadlock with the syncer process.	2002-02-02 01:49:18 +00:00
alfred	b616625325	Remove bogus assertion in dup2 that can lead to panics when kernel threads race for a file slot. dup2(2) incorrectly assumes that if it needs to grow the ofiles array that it will get what it wants. This assertion was valid before we allowed shared filedescriptor tables but is now incorrect. The assertion can trigger superfolous panics if the thread doing a dup2 looses a race with another thread while possibly blocked in the MALLOC call in fdalloc. Another thread may grab the slot we are requesting which makes fdalloc return something other than what we asked for, this will triggering the bogus assertion. MFC after: 2 weeks Reviewed by: phk	2002-02-01 19:25:36 +00:00
alfred	74f44e9118	Avoid lock order reversal filedesc/Giant when calling FREE() in fdalloc by unlocking the filedesc before calling FREE(). Submitted by: bde	2002-02-01 19:19:54 +00:00
alfred	bfbf894c82	Don't recurse on filedesc lock in chroot_refuse_vdir_fds(). Noticed by: Michael Nottebrock <michaelnottebrock@gmx.net>	2002-02-01 18:27:16 +00:00
bde	b50e6bc8e5	Regenerate to make osigreturn standard.	2002-02-01 17:41:45 +00:00
bde	c1d433597e	Made osigreturn(2) standard so that SYS_osigreturn can be used in the signal trampoline for old signals. The arches that support old signals currently abuse sigreturn(2) instead. This mainly complicates things and slightly breaks the the new sigreturn(2). COMPAT is too limited to support the correct configuration of osigreturn, and this commit doesn't attempt to fix it; it just moves the bogusness: osigreturn() must now be provided unconditionally even on arches that don't really need it; previously it had to be provided under the bogus condition defined(COMPAT_43).	2002-02-01 17:27:14 +00:00
dillon	8abd6168f2	GC P_BUFEXHAUST leftovers, we've had a new mechanism to avoid buffer cache lockups for over a year now. MFC after: 0 days	2002-01-31 18:39:44 +00:00
alfred	b6dbc86ae0	Remove unused variables in select(2) from previous delta. Pointed out by: bde	2002-01-30 19:48:25 +00:00
bde	e40a861815	Oops, fix previous commit to not generate a C comment in syscall.mk.	2002-01-30 15:12:12 +00:00
bde	7e5d2672ea	Regenerate _after_ the commit to syscalls.master.	2002-01-30 10:29:12 +00:00
bde	a348e6b305	Escape $FreeBSD$ in a different way to avoid using the bogus escapes \$ and \F. Awk just started warning about these.	2002-01-30 10:22:05 +00:00
alfred	b0fc10702a	Attempt to fixup select(2) and poll(2), this should fix some races with other threads as well as speed up the interfaces. To fix the race and accomplish the speedup, remove selholddrop and pollholddrop. The entire concept is somewhat bogus because holding the individual struct file pointers offers us no guarantees that another thread context won't close it on us thereby removing our access to our own reference. Selholddrop and pollholddrop also would do multiple locks and unlocks of mutexes _per-file_ in the fd arrays to be scanned, this needed to be sped up. Instead of using selholddrop and pollholddrop, simply hold the filedesc lock over the selscan and pollscan functions. This should protect us against close(2)'s on the files as reduce the multiple lock/unlock pairs per fd into a single lock over the filedesc.	2002-01-29 22:54:19 +00:00
alfred	b969e5c198	Backout 1.120, EINVAL isn't a proper error return when the passed fd is negative, the 'pointer' referred to by the manpage is actually the struct file's f_offset field. Pointed out by: bde	2002-01-29 17:12:10 +00:00
phk	2566d06fba	Be more conservative about interrupt latency, it aint getting better it seems.	2002-01-25 21:22:34 +00:00
phk	f8c5229a89	Make st_blksize default to PAGE_SIZE instead of zero.	2002-01-25 16:39:57 +00:00
dillon	e1e10af6b7	Make the 'maxusers 0' auto-sizing code slightly more conservative. Change from 1 megabyte of ram per user to 2 megabytes of ram per user, and reduce the cap from 512 to 384. 512 leaves around 240 MB of KVM available while 384 leaves 270 MB of KVM available. Available KVM is important in order to deal with zalloc and kernel malloc area growth. Reviewed by: mckusick MFC: either before 4.5 if re's agree, or after 4.5	2002-01-25 01:54:16 +00:00
phk	cb8828be3d	Yet a bug with extensible sbufs being marked as OVERFLOWED. This time because of a signed/unsigned problem. Approved by: DES	2002-01-24 20:57:56 +00:00
jlemon	b8aecb9e59	Add entry for EVFILT_NETDEV, which was inadverdently omitted back in Sept.	2002-01-24 17:20:55 +00:00
alfred	53eeef7678	in fget() return EINVAL when the descriptor requested is negative.	2002-01-23 08:40:35 +00:00
alfred	8ea3c5cdda	make pread use fget_read instead of holdfp.	2002-01-23 08:22:59 +00:00
dg	83a72ec01c	Fixed bug in calculation of amount of file to send when nbytes !=0 and headers or trailers are supplied. Reported by Vladislav Shabanov <vs@rambler-co.ru>. PR: 33771 Submitted by: Maxim Konovalov <maxim@macomnet.ru> MFC after: 3 days	2002-01-22 17:32:10 +00:00
phk	cbd45b6c66	In certain cases sbuf_printf() and sbuf_vprintf() could mistakely make extendable sbufs as overflowed. Approved by: des	2002-01-22 11:22:55 +00:00
sobomax	4e0549db55	Allow dump device be configured as early as possible using loader(8) tunable. This allows obtaining crash dumps from the panics occured during late stages of kernel initialisation before system enters into single-user mode. MFC after: 2 weeks	2002-01-21 01:16:11 +00:00
alfred	1d6432ede3	use mutex pools for "struct file" locking. fix indentation of FILE_LOCK/UNLOCK macros while I'm here.	2002-01-20 22:58:08 +00:00
alfred	5a34a0d3bb	use mutex pool mutexes for uidinfo locking. replace mutex_lock calls on uidinfo with macro calls: mtx_lock(&uidp->ui_mtx) -> UIDINFO_LOCK(uidp) Terry Lambert <tlambert2@mindspring.com> helped with this.	2002-01-20 22:48:49 +00:00
alc	ca68430b0d	o Remove the unused vestiges of JOBST_JOBQPROC and the per-thread jobtorun queue. o Use TAILQ_EMPTY() instead of TAILQ_FIRST(...) == NULL.	2002-01-20 18:59:58 +00:00
alc	60143f00ec	o Revision 1.99 ("KSE Milestone 2") left the aio daemons sleeping on a process object but changed the corresponding wakeup()s to the thread object. The result was that non-raw aio ops waited for an aio daemon to timeout before action was taken. Now, we sleep on the thread object. PR: kern/34016	2002-01-20 00:52:44 +00:00
dillon	f51ea914df	Remove 'VXLOCK: interlock avoided' warnings. This can now occur in normal operation. The vgonel() code has always called vclean() but until we started proactively freeing vnodes it would never actually be called with a dirty vnode, so this situation did not occur prior to the vnlru() code. Now that we proactively free vnodes when kern.maxvnodes is hit, however, vclean() winds up with work to do and improperly generates the warnings. Reviewed by: peter Approved by: re (for MFC) MFC after: 1 day	2002-01-19 02:14:45 +00:00
alfred	20073b0322	undo a bit of the Giant pushdown. fdrop isn't SMP safe as it may call into the file's close routine which definetly is not SMP safe right now, so we hold Giant over calls to fdrop now.	2002-01-19 01:03:54 +00:00
nik	1d07367781	Explain that the admin can safely power down the system as well as rebooting.	2002-01-18 22:45:29 +00:00
tanimura	d254e72400	Invert the test of sx_xholder for SX_LOCKED. We need to warn if a thread other than the curthread holds an sx. While I am here, break a line at the end of warning.	2002-01-18 09:21:15 +00:00
bde	3dee686619	Uninlined most of the bloated inline functions in <sys/disklabel.h>. Some of them need to become even larger to support devfs.	2002-01-17 18:33:18 +00:00
bde	73ef84f92b	Changed the type of pcb_flags from u_char to u_int and adjusted things. This removes the only atomic operation on a char type in the entire kernel.	2002-01-17 17:49:23 +00:00
alc	babe0aff74	o Eliminate an unused parameter from aio_fphysio().	2002-01-17 17:19:40 +00:00
alfred	b191447bdd	Fix giant handling in pwrite(2), I forgot to release it when finishing the syscall.	2002-01-16 21:33:41 +00:00
arr	1ae1e4e3f2	- Attempt to help declutter kern. sysctl by moving security out from beneath it. Reviewed by: rwatson	2002-01-16 06:55:30 +00:00
jhb	9f04e2aaf9	Bump the limits for determining if we've held a spinlock too long as they seem to be too short for the 500 Mhz DS20 I'm testing on. The rather arbitrary numbers are rather bogus anyways. We should probably have variables for these limits that are calibrated in the MD startup code somehow.	2002-01-15 14:20:33 +00:00
mckusick	b8d6599e4c	When downgrading a filesystem from read-write to read-only, operations involving file removal or file update were not always being fully committed to disk. The result was lost files or corrupted file data. This change ensures that the filesystem is properly synced to disk before the filesystem is down-graded. This delta also fixes a long standing bug in which a file open for reading has been unlinked. When the last open reference to the file is closed, the inode is reclaimed by the filesystem. Previously, if the filesystem had been down-graded to read-only, the inode could not be reclaimed, and thus was lost and had to be later recovered by fsck. With this change, such files are found at the time of the down-grade. Normally they will result in the filesystem down-grade failing with `device busy'. If a forcible down-grade is done, then the affected files will be revoked causing the inode to be released and the open file descriptors to begin failing on attempts to read. Submitted by: "Sam Leffler" <sam@errno.com>	2002-01-15 07:17:12 +00:00
alfred	18fa15ac4c	Push down Giant in dup(2) and dup2(2), Giant is only needed when calling closef() in the case of dup2(2) duping over a descriptor and when fdalloc must grow or free a filedesc.	2002-01-15 00:58:40 +00:00
alfred	c8a759143f	Fix select on fifos. Backout revision 1.56 and 1.57 of fifo_vnops.c. Introduce a new poll op "POLLINIGNEOF" that can be used to ignore EOF on a fifo, POLLIN/POLLRDNORM is converted to POLLINIGNEOF within the FIFO implementation to effect the correct behavior. This should allow one to view a fifo pretty much as a data source rather than worry about connections coming and going. Reviewed by: bde	2002-01-14 22:03:48 +00:00
alfred	13c64df775	Remove a bogus FILEDESC_UNLOCK. Submitted by: tanimura	2002-01-14 19:45:03 +00:00
alc	6a4a71604b	o Correct the initialization of aiolio_zone: Each entry was 16 times larger than necessary. o Move a rarely-used goto label inside a critical section so that we don't perform an splnet() for which there is no corresponding splx(). o Remove unnecessary splnet()/splx() around accesses to kaioinfo::kaio_jobdone in aio_return(). o Use TAILQ_FOREACH for simple cases of iteration over kaioinfo::kaio_jobdone.	2002-01-14 07:26:33 +00:00
alfred	1f82bc18d1	Replace ffind_* with fget calls. Make fget MPsafe. Make fgetvp and fgetsock use the fget subsystem to reduce code bloat. Push giant down in fpathconf().	2002-01-14 00:13:45 +00:00
alfred	5e2f4cf200	Include sys/_lock.h and sys/_mutex.h to reduce namespace pollution. Requested by: jhb	2002-01-13 21:37:49 +00:00
alc	62ca6901d8	o Call the functions registered with at_exec() from exec_new_vmspace() instead of execve(). Otherwise, the possibility still exists for a pending AIO to modify the new address space. Reviewed by: alfred	2002-01-13 19:36:35 +00:00
alfred	f720362ae2	Comment fdrop and fdrop_locked functions.	2002-01-13 12:58:14 +00:00
alfred	b0764e3d9a	Implement ffind_hold using ffind_lock. Recommended by: jhb	2002-01-13 12:57:02 +00:00
alfred	844237b396	SMP Lock struct file, filedesc and the global file list. Seigo Tanimura (tanimura) posted the initial delta. I've polished it quite a bit reducing the need for locking and adapting it for KSE. Locks: 1 mutex in each filedesc protects all the fields. protects "struct file" initialization, while a struct file is being changed from &badfileops -> &pipeops or something the filedesc should be locked. 1 mutex in each struct file protects the refcount fields. doesn't protect anything else. the flags used for garbage collection have been moved to f_gcflag which was the FILLER short, this doesn't need locking because the garbage collection is a single threaded container. could likely be made to use a pool mutex. 1 sx lock for the global filelist. struct file * fhold(struct file fp); / increments reference count on a file / struct file fhold_locked(struct file fp); / like fhold but expects file to locked / struct file ffind_hold(struct thread , int fd); / finds the struct file in thread, adds one reference and returns it unlocked / struct file ffind_lock(struct thread , int fd); / ffind_hold, but returns file locked */ I still have to smp-safe the fget cruft, I'll get to that asap.	2002-01-13 11:58:06 +00:00
mckusick	5c33a3566a	Fix typo so that the delay code introduced in revision 1.60 actually does something. Submitted by: John Baldwin <john@baldwin.cx>	2002-01-12 02:04:15 +00:00
dillon	05b2183d53	Add vlruvp() routine - implements LRU operation for vnode recycling. We calculate a trigger point that both guarentees we will find a sufficient number of vnodes to recycle and prevents us from recycling vnodes with lots of resident pages. This particular section of code is designed to recycle vnodes, not do unnecessary frees of cached VM pages.	2002-01-10 18:31:53 +00:00
iedowse	83b07d10e7	Change dounmount() to return EBUSY in the non-MNT_FORCE case if we can't acquire the mnt_lock without blocking. Normally non-forced unmount attempts return EBUSY quickly if any vnodes are active, so this just extends that behaviour to cover the per-mount mnt_lock too.	2002-01-10 01:59:30 +00:00
rwatson	9f1ff731e4	o Revert kern_sig.c#1.143, as cr_cansignal() doesn't currently permit a number of desirable cases in which SIGIO/SIGURG are delivered. We'll keep tweaking. Reported by: Alexander Kabaev <ak03@gte.com>	2002-01-10 01:25:35 +00:00
kbyanc	779fa7fcc3	Replace spaces after #defines with tabs; this makes all #defines consistent in their adherence with style(9).	2002-01-09 07:29:28 +00:00
alc	13725ff1d5	o Correct a 32/64-bit error in the initialization of aiol_zone, specifically, sizeof(int) is not the size of a pointer.	2002-01-09 06:40:45 +00:00
msmith	c2656ac96b	Add a new sysinit SI_SUB_DEVFS. Devfs hooks into the kernel at SI_ORDER_FIRST, and devices can be created anytime after that. Print a warning if an atttempt is made to create a device too early.	2002-01-09 04:58:49 +00:00
silby	c5438df911	GC fast_vfork; it's not actually referenced anywhere. MFC after: 3 weeks	2002-01-09 04:51:21 +00:00
alfred	11d426818d	Sockets are called 'so' not 'sp'.	2002-01-09 02:47:00 +00:00
silby	4c0cf8914c	Revert 1.81; 1.19 fixed this already in a different way.	2002-01-09 01:45:17 +00:00
alc	938cb766b8	o Add missing synchronization (splnet()/splx()) in aio_free_entry(). o Move the definition of struct aiocblist from sys/aio.h to kern/vfs_aio.c. o Make aio_swake_cb() static.	2002-01-06 21:03:39 +00:00
kbyanc	9af9cb3fe9	* Implement SBUF_AUTOEXTEND flag; sbufs created with this flag are automatically extended to prevent overflow. * Added sbuf_vprintf(); sbuf_printf() is now just a wrapper around sbuf_vprintf(). * Include <stdio.h> and <string.h> when building libsbuf to silence WARNS=4 warnings. Reviewed by: des	2002-01-06 08:38:23 +00:00
silby	719af3e61a	Reorder a calculation in sbreserve so that it does not overflow with multi-megabyte socket buffer sizes. PR: 7420 MFC after: 3 weeks	2002-01-06 06:50:54 +00:00
rwatson	51a1c19396	- Teach SIGIO code to use cr_cansignal() instead of a custom CANSIGIO() macro. As a result, mandatory signal delivery policies will be applied consistently across the kernel. - Note that this subtly changes the protection semantics, and we should watch out for any resulting breakage. Previously, delivery of SIGIO in this circumstance was limited to situations where the subject was privileged, or where one of the subject's (ruid, euid) matched one of the object's (ruid, euid). In the new scenario, subject (ruid, euid) are matched against the object's (ruid, svuid), and the object uid's must be a subset of the subject uid's. Likewise, jail now affects delivery, and special handling for P_SUGID of the object is present. This change can always be reversed or tweaked if it proves to disrupt application behavior substantially. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-01-06 00:54:46 +00:00
rwatson	6b7ac7804d	- Push much of the logic for p_cansignal() behind cr_cansignal, which authorized based on a subject credential rather than a subject process. This will permit the same logic to be reused in situations where only the credential generating the signal is available, such as in the delivery of SIGIO. - Because of two clauses, the automatic success against curproc, and the session semantics for SIGCONT, not all logic can be pushed into cr_cansignal(), but those cases should not apply for most other consumers of cr_cansignal(). - This brings the base system inter-process authorization code more into line with the MAC implementation. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-01-06 00:20:12 +00:00
dwmalone	f974b4f783	Release text vnode in exit() rather than wait(). Occasionally fifesystem problems could prevent the release from completing and this could result in init being blocked indefinitely. This was looked over by Matt ages ago. Approved by: dillon	2002-01-05 21:47:58 +00:00
jhb	b8765de1bf	Fix a bug where the mutex name wasn't always displayed for processes in SMTX in utils such as ps and top. The KI_CTTY flag was assigned to kinfo_proc->ki_kiflag rather than or'd into the flag, thus clobbering any flags set earlier, including KI_MTXBLOCK. Prodding by: peter	2002-01-05 17:18:59 +00:00
peter	5e902a48f6	Fix forward_roundrobin(). It was mistakenly using the cpu number as though it was a mask. As a result, we sent AST IPI's to the wrong cpu and/or left out some. Spotted by: jake	2002-01-05 09:38:47 +00:00
peter	d0a39cc230	Add a per-cpu variable, cpumask, the preshifted equivalent of 1 << cpuid. We use this around the place a lot.	2002-01-05 09:35:50 +00:00
jhb	1ce407b675	Change the preemption code for software interrupt thread schedules and mutex releases to not require flags for the cases when preemption is not allowed: The purpose of the MTX_NOSWITCH and SWI_NOSWITCH flags is to prevent switching to a higher priority thread on mutex releease and swi schedule, respectively when that switch is not safe. Now that the critical section API maintains a per-thread nesting count, the kernel can easily check whether or not it should switch without relying on flags from the programmer. This fixes a few bugs in that all current callers of swi_sched() used SWI_NOSWITCH, when in fact, only the ones called from fast interrupt handlers and the swi_sched of softclock needed this flag. Note that to ensure that swi_sched()'s in clock and fast interrupt handlers do not switch, these handlers have to be explicitly wrapped in critical_enter/exit pairs. Presently, just wrapping the handlers is sufficient, but in the future with the fully preemptive kernel, the interrupt must be EOI'd before critical_exit() is called. (critical_exit() can switch due to a deferred preemption in a fully preemptive kernel.) I've tested the changes to the interrupt code on i386 and alpha. I have not tested ia64, but the interrupt code is almost identical to the alpha code, so I expect it will work fine. PowerPC and ARM do not yet have interrupt code in the tree so they shouldn't be broken. Sparc64 is broken, but that's been ok'd by jake and tmm who will be fixing the interrupt code for sparc64 shortly. Reviewed by: peter Tested on: i386, alpha	2002-01-05 08:47:13 +00:00
jhb	2f03379495	Remove brain damaged code in witness_lock(). We could have easily just used PCPU_GET(spinlocks) w/o needing the w_mtx held. It is more correct to just check td_critnest now though.	2002-01-05 08:29:54 +00:00
jhb	82f83a1cbe	Axe a stale comment. Holding sched_lock across both setrunqueue() and mi_switch() is sufficient.	2002-01-04 10:55:51 +00:00
silby	a239a7e562	Throw the $FreeBSD$s back in, properly escaping them.	2002-01-04 05:27:47 +00:00
silby	6cc0a06d0d	Remove $FreeBSD$s from previous commit; perl thinks that they're something to be interpreted. Urk.	2002-01-04 01:40:50 +00:00
silby	a45db01b69	Solve vnode_if.pl's identity crisis; make sure that it refers to itself as vnode_if.pl instead of vnode_if.sh. PR: 33509 MFC after: 3 weeks	2002-01-03 21:53:09 +00:00
se	4f45ba2c53	Return EBADF in case some vnode field has been reset to a NULL pointer. (There has been some discussion, whether ENOENT or EBADF is more appropriate. I choose the latter, since the operation is not supported on the file descriptor at that time, even if it was, immediately before.) PR: 32681 Reviewed by: dillon, iedowse, ... Approved by: nectar MFC after: 3 days (pending RE approval)	2002-01-03 09:54:24 +00:00
alc	e78b8215cc	o Properly check the file descriptor passed to aio_cancel(2). (Previously, no out-of-bounds check was performed on the file descriptor.) o Eliminate some excessive white space from aio_cancel(2).	2002-01-02 07:04:38 +00:00
jake	92bcc2bcb1	Print parm6 too in the !KTR_EXTEND case.	2002-01-01 21:47:38 +00:00
alc	e5d0c7a325	o Some style(9)-motivated changes to white space.	2002-01-01 00:40:29 +00:00
rwatson	5eea21ccca	o Make the credential used by socreate() an explicit argument to socreate(), rather than getting it implicitly from the thread argument. o Make NFS cache the credential provided at mount-time, and use the cached credential (nfsmount->nm_cred) when making calls to socreate() on initially connecting, or reconnecting the socket. This fixes bugs involving NFS over TCP and ipfw uid/gid rules, as well as bugs involving NFS and mandatory access control implementations. Reviewed by: freebsd-arch	2001-12-31 17:45:16 +00:00
alc	4687eb5aa5	o Correct an off-by-one error in aio_suspend(2). PR: 18350	2001-12-31 03:13:24 +00:00
alc	0fe9459a66	o Use "td->td_proc" instead of "curproc" where possible. o Eliminate the unnecessary initialization of several static variables to zero.	2001-12-31 02:03:39 +00:00
alc	d6b2b75593	Eliminate semexit_hook using at_exit(9) and rm_at_exit(9). Reviewed by: alfred	2001-12-30 18:55:09 +00:00
jake	0bff76ae56	Change traces in hardclock and statclock to use the KTR_CLK trace facility, rather than KTR_INTR.	2001-12-29 08:39:57 +00:00
alfred	f097734c27	Make AIO a loadable module. Remove the explicit call to aio_proc_rundown() from exit1(), instead AIO will use at_exit(9). Add functions at_exec(9), rm_at_exec(9) which function nearly the same as at_exec(9) and rm_at_exec(9), these functions are called on behalf of modules at the time of execve(2) after the image activator has run. Use a modified version of tegge's suggestion via at_exec(9) to close an exploitable race in AIO. Fix SYSCALL_MODULE_HELPER such that it's archetecuterally neutral, the problem was that one had to pass it a paramater indicating the number of arguments which were actually the number of "int". Fix it by using an inline version of the AS macro against the syscall arguments. (AS should be available globally but we'll get to that later.) Add a primative system for dynamically adding kqueue ops, it's really not as sophisticated as it should be, but I'll discuss with jlemon when he's around.	2001-12-29 07:13:47 +00:00
bde	4ac956411e	Fixed an apparent typo ("-" before ":") and an English error (comma splice) in the "already exists" message. Fixed some minor style bugs (KNFization to "return (foo)" had rotted in 2 out of 177 cases).	2001-12-28 18:32:13 +00:00
alfred	3026a8036d	brace by itself after function declaration. Mandated by: style(9) Pointed out by: rwatson	2001-12-27 20:16:21 +00:00
dillon	91aada8d5f	Fix type-o in previous commit (tsleep was using wrong rendezvous point)	2001-12-25 01:23:25 +00:00
bmilekic	965b8e2ef2	On the first day of Christmas bde gave to me: A [hopefully] conforming style(9) revamp of mb_alloc and related code. (This was possible due to bde's remarkable patience.) Submitted by: (in large part) bde Reviewed by: (the other part) bde	2001-12-23 22:04:08 +00:00
bmilekic	54e5874ed2	Move prototype of _mext_free to mbuf.h, where it belongs, because it is used in MEXTFREE and needs to be in scope for external MEXTFREE users. Pointed out by: Chad David <davidc@acns.ab.ca> Confirmed by: bde	2001-12-22 20:09:08 +00:00
tmm	5d1f367b0b	Add a generic __BUS_ACCESSOR macro to construct ivar accessor functions, and a generic resource_list_print_type() function to print all resouces of a certain type in a resource list. Use ulmin()/ulmax() instead of min()/max() in two places to handle u_longs correctly.	2001-12-21 21:45:09 +00:00
tmm	dadac69200	Add a rman_reserve_resource_bound() function that takes an additional argument specifying the boundary for the resource allocation. Use ulmin()/ulmax() instead of min()/max() in some places to correctly deal with the u_long resource range specifications.	2001-12-21 21:40:55 +00:00
peter	d39f9b4517	Avoid an interaction between syncache and accept filters. The syncache code only passed up the connection to the tcp stack when it was complete, so it went directly into the so_comp (complete) queue. However, with accept filters, there is an additional phase before calling it "complete". Reviewed by: jlemon	2001-12-21 04:30:49 +00:00
jhb	2463f40fc3	Introduce a standard name for the lock protecting an interrupt controller and it's associated state variables: icu_lock with the name "icu". This renames the imen_mtx for x86 SMP, but also uses the lock to protect access to the 8259 PIC on x86 UP. This also adds an appropriate lock to the various Alpha chipsets which fixes problems with Alpha SMP machines dropping interrupts with an SMP kernel.	2001-12-20 23:48:31 +00:00
dillon	ac9876d609	Fix a BUF_TIMELOCK race against BUF_LOCK and fix a deadlock in vget() against VM_WAIT in the pageout code. Both fixes involve adjusting the lockmgr's timeout capability so locks obtained with timeouts do not interfere with locks obtained without a timeout. Hopefully MFC: before the 4.5 release	2001-12-20 22:42:27 +00:00
dillon	de48df525f	Calculate whether the sbuf is dynamic before bzero()ing the structure. This fixes a serious memory leak in the sbuf code. MFC after: 3 days	2001-12-19 19:04:57 +00:00
peter	d6d1e90f25	Do not initialize static/global variables to 0. Use bss instead of taking up space in the data section.	2001-12-19 01:35:18 +00:00
peter	12f2610cb5	Use a different mechanism to get the vnlru process to wake up and notice the shutdown request at reboot/halt time. Disable the printf 'vnlru process getting nowhere, pausing...' and instead export the count to the debug.vnlru_nowhere sysctl.	2001-12-19 01:31:12 +00:00
luigi	b6f2ecc1bc	Complete the device polling support by adding a thread in charge of polling interfaces at the lowest possible priority (this might result in softnetisr being scheduled, but there is no risk of livelock because they have a higher priority than this thread).	2001-12-19 00:53:24 +00:00
jhb	5463e6afe5	Return EINVAL if kernel only flags are passed to the rfork syscall rather than silently masking them.	2001-12-19 00:53:23 +00:00
dillon	1750942f6f	This is a forward port of Peter's vlrureclaim() fix, with some minor mods by me to make it more efficient. The original code had serious balancing problems and could also deadlock easily. This code relegates the vnode reclamation to its own kproc and relaxes the vnode reclamation requirements to better maintain kern.maxvnodes. This code still doesn't balance as well as it could, but it does a much better job then the original code. Approved by: re@freebsd.org Obtained from: ps, peter, dillon MFS Assuming: Assuming no problems crop up in Yahoo testing MFC after: 7 days	2001-12-18 20:48:54 +00:00
jhb	3b3c195480	- Change all callers of addupc_task() to check PS_PROFIL explicitly and remove the check from addupc_task(). It would need sched_lock while testing the flag anyways. - Always read sticks while holding sched_lock using a temporary variable where needed. - Always init prticks to 0 in ast() to quiet a warning.	2001-12-18 09:06:10 +00:00
jhb	a3b98398cb	Modify the critical section API as follows: - The MD functions critical_enter/exit are renamed to start with a cpu_ prefix. - MI wrapper functions critical_enter/exit maintain a per-thread nesting count and a per-thread critical section saved state set when entering a critical section while at nesting level 0 and restored when exiting to nesting level 0. This moves the saved state out of spin mutexes so that interlocking spin mutexes works properly. - Most low-level MD code that used critical_enter/exit now use cpu_critical_enter/exit. MI code such as device drivers and spin mutexes use the MI wrappers. Note that since the MI wrappers store the state in the current thread, they do not have any return values or arguments. - mtx_intr_enable() is replaced with a constant CRITICAL_FORK which is assigned to curthread->td_savecrit during fork_exit(). Tested on: i386, alpha	2001-12-18 00:27:18 +00:00
mp	add9abf1bb	Remove whitespace at end of line.	2001-12-16 17:21:16 +00:00
luigi	4893656ff8	Add/correct description for some sysctl variables where it was missing. The description field is unused in -stable, so the MFC there is equivalent to a comment. It can be done at any time, i am just setting a reminder in 45 days when hopefully we are past 4.5-release. MFC after: 45 days	2001-12-16 16:07:20 +00:00
luigi	e39284a688	Add code to export and print the description associated to sysctl variables. Use the -d flag in sysctl(8) to see this information. Possible extensions to sysctl: + report variables that do not have a description + given a name, report the oid it maps to. Note to developers: have a look at your code, there are a number of variables which do not have a description. Note to developers: do we want this in 4.5 ? It is a very small change and very useful for documentation purposes. Suggested by: Orion Hodson	2001-12-16 02:55:41 +00:00
jhb	0c1bfda974	Fix some nits in fork_exit() so it more properly duplicates the backend of mi_switch: - Set the oncpu value for the current thread. - Always set switchticks, not just in the SMP case. - Add a KTR entry for fork_exit that is the same as the "new proc" entry in mi_switch(). - Release sched_lock a bit later like we do with mi_switch().	2001-12-14 23:37:35 +00:00
jlemon	f7af5c6f92	When removing kqueue descriptors from the descriptor table during a fork, update fd_freefile and fd_lastfile as well, to keep things in sync. Pointed out by: Debbie Chu <dchu@juniper.net>	2001-12-14 19:02:57 +00:00
luigi	f8ad22919e	Device Polling code for -current. Non-SMP, i386-only, no polling in the idle loop at the moment. To use this code you must compile a kernel with options DEVICE_POLLING and at runtime enable polling with sysctl kern.polling.enable=1 The percentage of CPU reserved to userland can be set with sysctl kern.polling.user_frac=NN (default is 50) while the remainder is used by polling device drivers and netisr's. These are the only two variables that you should need to touch. There are a few more parameters in kern.polling but the default values are adequate for all purposes. See the code in kern_poll.c for more details on them. Polling in the idle loop will be implemented shortly by introducing a kernel thread which does the job. Until then, the amount of CPU dedicated to polling will never exceed (100-user_frac). The equivalent (actually, better) code for -stable is at http://info.iet.unipi.it/~luigi/polling/ and also supports polling in the idle loop. NOTE to Alpha developers: There is really nothing in this code that is i386-specific. If you move the 2 lines supporting the new option from sys/conf/{files,options}.i386 to sys/conf/{files,options} I am pretty sure that this should work on the Alpha as well, just that I do not have a suitable test box to try it. If someone feels like trying it, I would appreciate it. NOTE to other developers: sure some things could be done better, and as always I am open to constructive criticism, which a few of you have already given and I greatly appreciated. However, before proposing radical architectural changes, please take some time to possibly try out this code, or at the very least read the comments in kern_poll.c, especially re. the reason why I am using a soft netisr and cannot (I believe) replace it with a simple timeout. Quick description of files touched by this commit: sys/conf/files.i386 new file kern/kern_poll.c sys/conf/options.i386 new option sys/i386/i386/trap.c poll in trap (disabled by default) sys/kern/kern_clock.c initialization and hardclock hooks. sys/kern/kern_intr.c minor swi_net changes sys/kern/kern_poll.c the bulk of the code. sys/net/if.h new flag sys/net/if_var.h declaration for functions used in device drivers. sys/net/netisr.h NETISR_POLL sys/dev/fxp/if_fxp.c sys/dev/fxp/if_fxpvar.h sys/pci/if_dc.c sys/pci/if_dcreg.h sys/pci/if_sis.c sys/pci/if_sisreg.h device driver modifications	2001-12-14 17:56:12 +00:00
peter	dd0f3c5ca2	Proper fix for old config setting maxusers to 8.	2001-12-14 09:39:29 +00:00
dillon	8e6d2fbcbd	A slightly different version of the vlrureclaim fix. Reported by: peter, ps	2001-12-14 07:18:31 +00:00
mckusick	d3b383005d	Add disk I/O scheduling for positively niced processes. When a positively niced process requests a disk I/O, make it wait for its nice value of ticks before scheduling its I/O request if there are any other processes with I/O requests in the disk queue. For all the gory details, see the ``Running fsck in the Background'' paper in the Usenix BSDCon 2002 Conference Proceedings, pages 55-64.	2001-12-14 05:50:44 +00:00
dillon	62f062ea62	Too many people are compiling kernels with maxusers set to 0 without the new config. Hack the kernel to force auto-sizing if the old config is used.	2001-12-14 04:01:08 +00:00
dillon	cd4d323ad3	This fixes a large number of bugs in our NFS client side code. A recent commit by Kirk also fixed a softupdates bug that could easily be triggered by server side NFS. * An edge case with shared R+W mmap()'s and truncate whereby the system would inappropriately clear the dirty bits on still-dirty data. (applicable to all filesystems) THIS FIX TEMPORARILY DISABLED PENDING FURTHER TESTING. see vm/vm_page.c line 1641 * The straddle case for VM pages and buffer cache buffers when truncating. (applicable to NFS client side) * Possible SMP database corruption due to vm_pager_unmap_page() not clearing the TLB for the other cpu's. (applicable to NFS client side but could effect all filesystems). Note: not considered serious since the corruption occurs beyond the file EOF. * When flusing a dirty buffer due to B_CACHE getting cleared, we were accidently setting B_CACHE again (that is, bwrite() sets B_CACHE), when we really want it to stay clear after the write is complete. This resulted in a corrupt buffer. (applicable to all filesystems but probably only triggered by NFS) * We have to call vtruncbuf() when ftruncate()ing to remove any buffer cache buffers. This is still tentitive, I may be able to remove it due to the second bug fix. (applicable to NFS client side) * vnode_pager_setsize() race against nfs_vinvalbuf()... we have to set n_size before calling nfs_vinvalbuf or the NFS code may recursively vnode_pager_setsize() to the original value before the truncate. This is what was causing the user mmap bus faults in the nfs tester program. (applicable to NFS client side) * Fix to softupdates (see ufs/ffs/ffs_inode.c 1.73, commit made by Kirk). Testing program written by: Avadis Tevanian, Jr. Testing program supplied by: jkh / Apple (see Dec2001 posting to freebsd-hackers with Subject 'NFS: How to make FreeBS fall on its face in one easy step') MFC after: 1 week	2001-12-14 01:16:57 +00:00
rwatson	9e2b770a8f	o Wording fix in comment. Submitted by: tanimura via p4	2001-12-14 00:38:01 +00:00
peter	a194c44001	If we were called to allocate a vnode that is not associated with a mount point, do not dereference the NULL mp argument.	2001-12-13 23:46:01 +00:00
rwatson	36784fd2c4	o Back out portions of 1.50 and 1.47, eliminating sonewconn3() and always deriving the credential for a newly accepted connection from the listen socket. Previously, the selection of the credential depended on the protocol: UNIX domain sockets would use the connecting process's credential, and protocols supporting a creation of the socket before the receiving end called accept() would use the listening socket. After this change, it is always the listening credential. Reviewed by: green	2001-12-13 22:09:37 +00:00
silby	dc4fed395a	Limit maxprocperuid to 9/10 maxproc, and limit maxfilesperproc to 9/10 maxfiles. This should make local resource exhaustion attacks easier to handle with a non-tweaked setup. MFC after: 3 days	2001-12-13 20:00:45 +00:00
jhb	66ac46bd15	Use a per-thread variable for keeping state when a thread is processing a KTR log entry. Any KTR requests made while working on an entry are ignored/discarded to prevent recursion. This is a better fix for the hack to futz with the CPU mask and call getnanotime() if KTR_LOCK or KTR_WITNESS was on. It also covers the actual formatting of the log entry including dumping it to the display which the earlier hacks did not.	2001-12-13 10:33:20 +00:00
arr	e55fee2143	- Move _jail sysctl node underneath _kern_security in order to standardize where our security related sysctl tuneables are located. Also, this will help if/when we move _security node out from under _kern as to help make _kern less cluttered. Approved by: rwatson Review by: rwatson	2001-12-12 05:23:20 +00:00
jhb	21b6b26912	Overhaul the per-CPU support a bit: - The MI portions of struct globaldata have been consolidated into a MI struct pcpu. The MD per-CPU data are specified via a macro defined in machine/pcpu.h. A macro was chosen over a struct mdpcpu so that the interface would be cleaner (PCPU_GET(my_md_field) vs. PCPU_GET(md.md_my_md_field)). - All references to globaldata are changed to pcpu instead. In a UP kernel, this data was stored as global variables which is where the original name came from. In an SMP world this data is per-CPU and ideally private to each CPU outside of the context of debuggers. This also included combining machine/globaldata.h and machine/globals.h into machine/pcpu.h. - The pointer to the thread using the FPU on i386 was renamed from npxthread to fpcurthread to be identical with other architectures. - Make the show pcpu ddb command MI with a MD callout to display MD fields. - The globaldata_register() function was renamed to pcpu_init() and now init's MI fields of a struct pcpu in addition to registering it with the internal array and list. - A pcpu_destroy() function was added to remove a struct pcpu from the internal array and list. Tested on: alpha, i386 Reviewed by: peter, jake	2001-12-11 23:33:44 +00:00
guido	2e77fc4d02	Fix boot -p for DDBless kernels Pointed out by: John Hay <jhay@icomtek.csir.co.za>	2001-12-11 10:21:26 +00:00
peter	46c0ef263e	Wrap Dangerously Dedicated printf under if (bootverbose)	2001-12-11 05:35:43 +00:00
obrien	41ac252611	Missed an assignment of arg6 in previous commit.	2001-12-10 20:58:39 +00:00
obrien	806dd95941	Adjust for the addition of CTR6.	2001-12-10 20:18:17 +00:00
guido	d779575f78	Add new boot flag to i386 boot: -p. This flag adds a pausing utility. When ran with -p, during the kernel probing phase, the kernel will pause after each line of output. This pausing can be ended with the '.' key, and is automatically suspended when entering ddb. This flag comes in handy at systems without a serial port that either hang during booting or reser. Reviewed by: (partly by jlemon) MFC after: 1 week	2001-12-10 20:02:22 +00:00
obrien	330a1032c1	Update to C99, s/__FUNCTION__/__func__/.	2001-12-10 05:51:45 +00:00
obrien	cca4f7b2d9	Repeat after me -- "Use of ANSI string concatenation can be bad." In this case, C99's __func__ is properly defined as: static const char __func__[] = "function-name"; and GCC 3.1 will not allow it to be used in bogus string concatenation.	2001-12-10 05:40:12 +00:00
alc	a49c1c9183	o Eliminate compilation warnings on 64-bit architectures.	2001-12-10 03:34:06 +00:00
alc	be4cbfd029	o Eliminate unnecessary synchronization from filt_aiodetach(). o The manual page for kevent says that EVFILT_AIO returns under the same conditions as aio_error(). With that in mind, set the data field of the returned struct kevent to the value that would be returned by aio_error(). o Fix two compilation warnings.	2001-12-09 08:16:36 +00:00
dillon	6fe4980d43	Allow maxusers to be specified as 0 in the kernel config, which will cause the system to auto-size to between 32 and 512 depending on the amount of memory. MFC after: 1 week	2001-12-09 01:57:09 +00:00
dillon	6e9238ff3f	The nbuf calculation was assuming that PAGE_SIZE = 4096 bytes, which is bogus. The calculation has been adjusted to use units of kilobytes. Noticed by: Chad David <davidc@acns.ab.ca> MFC after: 1 week	2001-12-08 20:37:08 +00:00
davidc	1d1054c88d	Update the comment about System initialization to reflect the use of DOMAIN_SET(9) instead of SYSINIT for adding domains at startup. Reviewed by: alfred	2001-12-08 04:20:54 +00:00
rwatson	7769631069	o A few more minor whitespace and other style fixes. Submitted by: bde	2001-12-06 21:58:47 +00:00
rwatson	751c41df3a	o Remove unnecessary inclusion of opt_global.h. Submitted by: bde	2001-12-06 21:55:41 +00:00

... 2 3 4 5 6 ...

4664 Commits