freebsd-skq

Author	SHA1	Message	Date
alc	13725ff1d5	o Correct a 32/64-bit error in the initialization of aiol_zone, specifically, sizeof(int) is not the size of a pointer.	2002-01-09 06:40:45 +00:00
msmith	c2656ac96b	Add a new sysinit SI_SUB_DEVFS. Devfs hooks into the kernel at SI_ORDER_FIRST, and devices can be created anytime after that. Print a warning if an atttempt is made to create a device too early.	2002-01-09 04:58:49 +00:00
silby	c5438df911	GC fast_vfork; it's not actually referenced anywhere. MFC after: 3 weeks	2002-01-09 04:51:21 +00:00
alfred	11d426818d	Sockets are called 'so' not 'sp'.	2002-01-09 02:47:00 +00:00
silby	4c0cf8914c	Revert 1.81; 1.19 fixed this already in a different way.	2002-01-09 01:45:17 +00:00
alc	938cb766b8	o Add missing synchronization (splnet()/splx()) in aio_free_entry(). o Move the definition of struct aiocblist from sys/aio.h to kern/vfs_aio.c. o Make aio_swake_cb() static.	2002-01-06 21:03:39 +00:00
kbyanc	9af9cb3fe9	* Implement SBUF_AUTOEXTEND flag; sbufs created with this flag are automatically extended to prevent overflow. * Added sbuf_vprintf(); sbuf_printf() is now just a wrapper around sbuf_vprintf(). * Include <stdio.h> and <string.h> when building libsbuf to silence WARNS=4 warnings. Reviewed by: des	2002-01-06 08:38:23 +00:00
silby	719af3e61a	Reorder a calculation in sbreserve so that it does not overflow with multi-megabyte socket buffer sizes. PR: 7420 MFC after: 3 weeks	2002-01-06 06:50:54 +00:00
rwatson	51a1c19396	- Teach SIGIO code to use cr_cansignal() instead of a custom CANSIGIO() macro. As a result, mandatory signal delivery policies will be applied consistently across the kernel. - Note that this subtly changes the protection semantics, and we should watch out for any resulting breakage. Previously, delivery of SIGIO in this circumstance was limited to situations where the subject was privileged, or where one of the subject's (ruid, euid) matched one of the object's (ruid, euid). In the new scenario, subject (ruid, euid) are matched against the object's (ruid, svuid), and the object uid's must be a subset of the subject uid's. Likewise, jail now affects delivery, and special handling for P_SUGID of the object is present. This change can always be reversed or tweaked if it proves to disrupt application behavior substantially. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-01-06 00:54:46 +00:00
rwatson	6b7ac7804d	- Push much of the logic for p_cansignal() behind cr_cansignal, which authorized based on a subject credential rather than a subject process. This will permit the same logic to be reused in situations where only the credential generating the signal is available, such as in the delivery of SIGIO. - Because of two clauses, the automatic success against curproc, and the session semantics for SIGCONT, not all logic can be pushed into cr_cansignal(), but those cases should not apply for most other consumers of cr_cansignal(). - This brings the base system inter-process authorization code more into line with the MAC implementation. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-01-06 00:20:12 +00:00
dwmalone	f974b4f783	Release text vnode in exit() rather than wait(). Occasionally fifesystem problems could prevent the release from completing and this could result in init being blocked indefinitely. This was looked over by Matt ages ago. Approved by: dillon	2002-01-05 21:47:58 +00:00
jhb	b8765de1bf	Fix a bug where the mutex name wasn't always displayed for processes in SMTX in utils such as ps and top. The KI_CTTY flag was assigned to kinfo_proc->ki_kiflag rather than or'd into the flag, thus clobbering any flags set earlier, including KI_MTXBLOCK. Prodding by: peter	2002-01-05 17:18:59 +00:00
peter	5e902a48f6	Fix forward_roundrobin(). It was mistakenly using the cpu number as though it was a mask. As a result, we sent AST IPI's to the wrong cpu and/or left out some. Spotted by: jake	2002-01-05 09:38:47 +00:00
peter	d0a39cc230	Add a per-cpu variable, cpumask, the preshifted equivalent of 1 << cpuid. We use this around the place a lot.	2002-01-05 09:35:50 +00:00
jhb	1ce407b675	Change the preemption code for software interrupt thread schedules and mutex releases to not require flags for the cases when preemption is not allowed: The purpose of the MTX_NOSWITCH and SWI_NOSWITCH flags is to prevent switching to a higher priority thread on mutex releease and swi schedule, respectively when that switch is not safe. Now that the critical section API maintains a per-thread nesting count, the kernel can easily check whether or not it should switch without relying on flags from the programmer. This fixes a few bugs in that all current callers of swi_sched() used SWI_NOSWITCH, when in fact, only the ones called from fast interrupt handlers and the swi_sched of softclock needed this flag. Note that to ensure that swi_sched()'s in clock and fast interrupt handlers do not switch, these handlers have to be explicitly wrapped in critical_enter/exit pairs. Presently, just wrapping the handlers is sufficient, but in the future with the fully preemptive kernel, the interrupt must be EOI'd before critical_exit() is called. (critical_exit() can switch due to a deferred preemption in a fully preemptive kernel.) I've tested the changes to the interrupt code on i386 and alpha. I have not tested ia64, but the interrupt code is almost identical to the alpha code, so I expect it will work fine. PowerPC and ARM do not yet have interrupt code in the tree so they shouldn't be broken. Sparc64 is broken, but that's been ok'd by jake and tmm who will be fixing the interrupt code for sparc64 shortly. Reviewed by: peter Tested on: i386, alpha	2002-01-05 08:47:13 +00:00
jhb	2f03379495	Remove brain damaged code in witness_lock(). We could have easily just used PCPU_GET(spinlocks) w/o needing the w_mtx held. It is more correct to just check td_critnest now though.	2002-01-05 08:29:54 +00:00
jhb	82f83a1cbe	Axe a stale comment. Holding sched_lock across both setrunqueue() and mi_switch() is sufficient.	2002-01-04 10:55:51 +00:00
silby	a239a7e562	Throw the $FreeBSD$s back in, properly escaping them.	2002-01-04 05:27:47 +00:00
silby	6cc0a06d0d	Remove $FreeBSD$s from previous commit; perl thinks that they're something to be interpreted. Urk.	2002-01-04 01:40:50 +00:00
silby	a45db01b69	Solve vnode_if.pl's identity crisis; make sure that it refers to itself as vnode_if.pl instead of vnode_if.sh. PR: 33509 MFC after: 3 weeks	2002-01-03 21:53:09 +00:00
se	4f45ba2c53	Return EBADF in case some vnode field has been reset to a NULL pointer. (There has been some discussion, whether ENOENT or EBADF is more appropriate. I choose the latter, since the operation is not supported on the file descriptor at that time, even if it was, immediately before.) PR: 32681 Reviewed by: dillon, iedowse, ... Approved by: nectar MFC after: 3 days (pending RE approval)	2002-01-03 09:54:24 +00:00
alc	e78b8215cc	o Properly check the file descriptor passed to aio_cancel(2). (Previously, no out-of-bounds check was performed on the file descriptor.) o Eliminate some excessive white space from aio_cancel(2).	2002-01-02 07:04:38 +00:00
jake	92bcc2bcb1	Print parm6 too in the !KTR_EXTEND case.	2002-01-01 21:47:38 +00:00
alc	e5d0c7a325	o Some style(9)-motivated changes to white space.	2002-01-01 00:40:29 +00:00
rwatson	5eea21ccca	o Make the credential used by socreate() an explicit argument to socreate(), rather than getting it implicitly from the thread argument. o Make NFS cache the credential provided at mount-time, and use the cached credential (nfsmount->nm_cred) when making calls to socreate() on initially connecting, or reconnecting the socket. This fixes bugs involving NFS over TCP and ipfw uid/gid rules, as well as bugs involving NFS and mandatory access control implementations. Reviewed by: freebsd-arch	2001-12-31 17:45:16 +00:00
alc	4687eb5aa5	o Correct an off-by-one error in aio_suspend(2). PR: 18350	2001-12-31 03:13:24 +00:00
alc	0fe9459a66	o Use "td->td_proc" instead of "curproc" where possible. o Eliminate the unnecessary initialization of several static variables to zero.	2001-12-31 02:03:39 +00:00
alc	d6b2b75593	Eliminate semexit_hook using at_exit(9) and rm_at_exit(9). Reviewed by: alfred	2001-12-30 18:55:09 +00:00
jake	0bff76ae56	Change traces in hardclock and statclock to use the KTR_CLK trace facility, rather than KTR_INTR.	2001-12-29 08:39:57 +00:00
alfred	f097734c27	Make AIO a loadable module. Remove the explicit call to aio_proc_rundown() from exit1(), instead AIO will use at_exit(9). Add functions at_exec(9), rm_at_exec(9) which function nearly the same as at_exec(9) and rm_at_exec(9), these functions are called on behalf of modules at the time of execve(2) after the image activator has run. Use a modified version of tegge's suggestion via at_exec(9) to close an exploitable race in AIO. Fix SYSCALL_MODULE_HELPER such that it's archetecuterally neutral, the problem was that one had to pass it a paramater indicating the number of arguments which were actually the number of "int". Fix it by using an inline version of the AS macro against the syscall arguments. (AS should be available globally but we'll get to that later.) Add a primative system for dynamically adding kqueue ops, it's really not as sophisticated as it should be, but I'll discuss with jlemon when he's around.	2001-12-29 07:13:47 +00:00
bde	4ac956411e	Fixed an apparent typo ("-" before ":") and an English error (comma splice) in the "already exists" message. Fixed some minor style bugs (KNFization to "return (foo)" had rotted in 2 out of 177 cases).	2001-12-28 18:32:13 +00:00
alfred	3026a8036d	brace by itself after function declaration. Mandated by: style(9) Pointed out by: rwatson	2001-12-27 20:16:21 +00:00
dillon	91aada8d5f	Fix type-o in previous commit (tsleep was using wrong rendezvous point)	2001-12-25 01:23:25 +00:00
bmilekic	965b8e2ef2	On the first day of Christmas bde gave to me: A [hopefully] conforming style(9) revamp of mb_alloc and related code. (This was possible due to bde's remarkable patience.) Submitted by: (in large part) bde Reviewed by: (the other part) bde	2001-12-23 22:04:08 +00:00
bmilekic	54e5874ed2	Move prototype of _mext_free to mbuf.h, where it belongs, because it is used in MEXTFREE and needs to be in scope for external MEXTFREE users. Pointed out by: Chad David <davidc@acns.ab.ca> Confirmed by: bde	2001-12-22 20:09:08 +00:00
tmm	5d1f367b0b	Add a generic __BUS_ACCESSOR macro to construct ivar accessor functions, and a generic resource_list_print_type() function to print all resouces of a certain type in a resource list. Use ulmin()/ulmax() instead of min()/max() in two places to handle u_longs correctly.	2001-12-21 21:45:09 +00:00
tmm	dadac69200	Add a rman_reserve_resource_bound() function that takes an additional argument specifying the boundary for the resource allocation. Use ulmin()/ulmax() instead of min()/max() in some places to correctly deal with the u_long resource range specifications.	2001-12-21 21:40:55 +00:00
peter	d39f9b4517	Avoid an interaction between syncache and accept filters. The syncache code only passed up the connection to the tcp stack when it was complete, so it went directly into the so_comp (complete) queue. However, with accept filters, there is an additional phase before calling it "complete". Reviewed by: jlemon	2001-12-21 04:30:49 +00:00
jhb	2463f40fc3	Introduce a standard name for the lock protecting an interrupt controller and it's associated state variables: icu_lock with the name "icu". This renames the imen_mtx for x86 SMP, but also uses the lock to protect access to the 8259 PIC on x86 UP. This also adds an appropriate lock to the various Alpha chipsets which fixes problems with Alpha SMP machines dropping interrupts with an SMP kernel.	2001-12-20 23:48:31 +00:00
dillon	ac9876d609	Fix a BUF_TIMELOCK race against BUF_LOCK and fix a deadlock in vget() against VM_WAIT in the pageout code. Both fixes involve adjusting the lockmgr's timeout capability so locks obtained with timeouts do not interfere with locks obtained without a timeout. Hopefully MFC: before the 4.5 release	2001-12-20 22:42:27 +00:00
dillon	de48df525f	Calculate whether the sbuf is dynamic before bzero()ing the structure. This fixes a serious memory leak in the sbuf code. MFC after: 3 days	2001-12-19 19:04:57 +00:00
peter	d6d1e90f25	Do not initialize static/global variables to 0. Use bss instead of taking up space in the data section.	2001-12-19 01:35:18 +00:00
peter	12f2610cb5	Use a different mechanism to get the vnlru process to wake up and notice the shutdown request at reboot/halt time. Disable the printf 'vnlru process getting nowhere, pausing...' and instead export the count to the debug.vnlru_nowhere sysctl.	2001-12-19 01:31:12 +00:00
luigi	b6f2ecc1bc	Complete the device polling support by adding a thread in charge of polling interfaces at the lowest possible priority (this might result in softnetisr being scheduled, but there is no risk of livelock because they have a higher priority than this thread).	2001-12-19 00:53:24 +00:00
jhb	5463e6afe5	Return EINVAL if kernel only flags are passed to the rfork syscall rather than silently masking them.	2001-12-19 00:53:23 +00:00
dillon	1750942f6f	This is a forward port of Peter's vlrureclaim() fix, with some minor mods by me to make it more efficient. The original code had serious balancing problems and could also deadlock easily. This code relegates the vnode reclamation to its own kproc and relaxes the vnode reclamation requirements to better maintain kern.maxvnodes. This code still doesn't balance as well as it could, but it does a much better job then the original code. Approved by: re@freebsd.org Obtained from: ps, peter, dillon MFS Assuming: Assuming no problems crop up in Yahoo testing MFC after: 7 days	2001-12-18 20:48:54 +00:00
jhb	3b3c195480	- Change all callers of addupc_task() to check PS_PROFIL explicitly and remove the check from addupc_task(). It would need sched_lock while testing the flag anyways. - Always read sticks while holding sched_lock using a temporary variable where needed. - Always init prticks to 0 in ast() to quiet a warning.	2001-12-18 09:06:10 +00:00
jhb	a3b98398cb	Modify the critical section API as follows: - The MD functions critical_enter/exit are renamed to start with a cpu_ prefix. - MI wrapper functions critical_enter/exit maintain a per-thread nesting count and a per-thread critical section saved state set when entering a critical section while at nesting level 0 and restored when exiting to nesting level 0. This moves the saved state out of spin mutexes so that interlocking spin mutexes works properly. - Most low-level MD code that used critical_enter/exit now use cpu_critical_enter/exit. MI code such as device drivers and spin mutexes use the MI wrappers. Note that since the MI wrappers store the state in the current thread, they do not have any return values or arguments. - mtx_intr_enable() is replaced with a constant CRITICAL_FORK which is assigned to curthread->td_savecrit during fork_exit(). Tested on: i386, alpha	2001-12-18 00:27:18 +00:00
mp	add9abf1bb	Remove whitespace at end of line.	2001-12-16 17:21:16 +00:00
luigi	4893656ff8	Add/correct description for some sysctl variables where it was missing. The description field is unused in -stable, so the MFC there is equivalent to a comment. It can be done at any time, i am just setting a reminder in 45 days when hopefully we are past 4.5-release. MFC after: 45 days	2001-12-16 16:07:20 +00:00
luigi	e39284a688	Add code to export and print the description associated to sysctl variables. Use the -d flag in sysctl(8) to see this information. Possible extensions to sysctl: + report variables that do not have a description + given a name, report the oid it maps to. Note to developers: have a look at your code, there are a number of variables which do not have a description. Note to developers: do we want this in 4.5 ? It is a very small change and very useful for documentation purposes. Suggested by: Orion Hodson	2001-12-16 02:55:41 +00:00
jhb	0c1bfda974	Fix some nits in fork_exit() so it more properly duplicates the backend of mi_switch: - Set the oncpu value for the current thread. - Always set switchticks, not just in the SMP case. - Add a KTR entry for fork_exit that is the same as the "new proc" entry in mi_switch(). - Release sched_lock a bit later like we do with mi_switch().	2001-12-14 23:37:35 +00:00
jlemon	f7af5c6f92	When removing kqueue descriptors from the descriptor table during a fork, update fd_freefile and fd_lastfile as well, to keep things in sync. Pointed out by: Debbie Chu <dchu@juniper.net>	2001-12-14 19:02:57 +00:00
luigi	f8ad22919e	Device Polling code for -current. Non-SMP, i386-only, no polling in the idle loop at the moment. To use this code you must compile a kernel with options DEVICE_POLLING and at runtime enable polling with sysctl kern.polling.enable=1 The percentage of CPU reserved to userland can be set with sysctl kern.polling.user_frac=NN (default is 50) while the remainder is used by polling device drivers and netisr's. These are the only two variables that you should need to touch. There are a few more parameters in kern.polling but the default values are adequate for all purposes. See the code in kern_poll.c for more details on them. Polling in the idle loop will be implemented shortly by introducing a kernel thread which does the job. Until then, the amount of CPU dedicated to polling will never exceed (100-user_frac). The equivalent (actually, better) code for -stable is at http://info.iet.unipi.it/~luigi/polling/ and also supports polling in the idle loop. NOTE to Alpha developers: There is really nothing in this code that is i386-specific. If you move the 2 lines supporting the new option from sys/conf/{files,options}.i386 to sys/conf/{files,options} I am pretty sure that this should work on the Alpha as well, just that I do not have a suitable test box to try it. If someone feels like trying it, I would appreciate it. NOTE to other developers: sure some things could be done better, and as always I am open to constructive criticism, which a few of you have already given and I greatly appreciated. However, before proposing radical architectural changes, please take some time to possibly try out this code, or at the very least read the comments in kern_poll.c, especially re. the reason why I am using a soft netisr and cannot (I believe) replace it with a simple timeout. Quick description of files touched by this commit: sys/conf/files.i386 new file kern/kern_poll.c sys/conf/options.i386 new option sys/i386/i386/trap.c poll in trap (disabled by default) sys/kern/kern_clock.c initialization and hardclock hooks. sys/kern/kern_intr.c minor swi_net changes sys/kern/kern_poll.c the bulk of the code. sys/net/if.h new flag sys/net/if_var.h declaration for functions used in device drivers. sys/net/netisr.h NETISR_POLL sys/dev/fxp/if_fxp.c sys/dev/fxp/if_fxpvar.h sys/pci/if_dc.c sys/pci/if_dcreg.h sys/pci/if_sis.c sys/pci/if_sisreg.h device driver modifications	2001-12-14 17:56:12 +00:00
peter	dd0f3c5ca2	Proper fix for old config setting maxusers to 8.	2001-12-14 09:39:29 +00:00
dillon	8e6d2fbcbd	A slightly different version of the vlrureclaim fix. Reported by: peter, ps	2001-12-14 07:18:31 +00:00
mckusick	d3b383005d	Add disk I/O scheduling for positively niced processes. When a positively niced process requests a disk I/O, make it wait for its nice value of ticks before scheduling its I/O request if there are any other processes with I/O requests in the disk queue. For all the gory details, see the ``Running fsck in the Background'' paper in the Usenix BSDCon 2002 Conference Proceedings, pages 55-64.	2001-12-14 05:50:44 +00:00
dillon	62f062ea62	Too many people are compiling kernels with maxusers set to 0 without the new config. Hack the kernel to force auto-sizing if the old config is used.	2001-12-14 04:01:08 +00:00
dillon	cd4d323ad3	This fixes a large number of bugs in our NFS client side code. A recent commit by Kirk also fixed a softupdates bug that could easily be triggered by server side NFS. * An edge case with shared R+W mmap()'s and truncate whereby the system would inappropriately clear the dirty bits on still-dirty data. (applicable to all filesystems) THIS FIX TEMPORARILY DISABLED PENDING FURTHER TESTING. see vm/vm_page.c line 1641 * The straddle case for VM pages and buffer cache buffers when truncating. (applicable to NFS client side) * Possible SMP database corruption due to vm_pager_unmap_page() not clearing the TLB for the other cpu's. (applicable to NFS client side but could effect all filesystems). Note: not considered serious since the corruption occurs beyond the file EOF. * When flusing a dirty buffer due to B_CACHE getting cleared, we were accidently setting B_CACHE again (that is, bwrite() sets B_CACHE), when we really want it to stay clear after the write is complete. This resulted in a corrupt buffer. (applicable to all filesystems but probably only triggered by NFS) * We have to call vtruncbuf() when ftruncate()ing to remove any buffer cache buffers. This is still tentitive, I may be able to remove it due to the second bug fix. (applicable to NFS client side) * vnode_pager_setsize() race against nfs_vinvalbuf()... we have to set n_size before calling nfs_vinvalbuf or the NFS code may recursively vnode_pager_setsize() to the original value before the truncate. This is what was causing the user mmap bus faults in the nfs tester program. (applicable to NFS client side) * Fix to softupdates (see ufs/ffs/ffs_inode.c 1.73, commit made by Kirk). Testing program written by: Avadis Tevanian, Jr. Testing program supplied by: jkh / Apple (see Dec2001 posting to freebsd-hackers with Subject 'NFS: How to make FreeBS fall on its face in one easy step') MFC after: 1 week	2001-12-14 01:16:57 +00:00
rwatson	9e2b770a8f	o Wording fix in comment. Submitted by: tanimura via p4	2001-12-14 00:38:01 +00:00
peter	a194c44001	If we were called to allocate a vnode that is not associated with a mount point, do not dereference the NULL mp argument.	2001-12-13 23:46:01 +00:00
rwatson	36784fd2c4	o Back out portions of 1.50 and 1.47, eliminating sonewconn3() and always deriving the credential for a newly accepted connection from the listen socket. Previously, the selection of the credential depended on the protocol: UNIX domain sockets would use the connecting process's credential, and protocols supporting a creation of the socket before the receiving end called accept() would use the listening socket. After this change, it is always the listening credential. Reviewed by: green	2001-12-13 22:09:37 +00:00
silby	dc4fed395a	Limit maxprocperuid to 9/10 maxproc, and limit maxfilesperproc to 9/10 maxfiles. This should make local resource exhaustion attacks easier to handle with a non-tweaked setup. MFC after: 3 days	2001-12-13 20:00:45 +00:00
jhb	66ac46bd15	Use a per-thread variable for keeping state when a thread is processing a KTR log entry. Any KTR requests made while working on an entry are ignored/discarded to prevent recursion. This is a better fix for the hack to futz with the CPU mask and call getnanotime() if KTR_LOCK or KTR_WITNESS was on. It also covers the actual formatting of the log entry including dumping it to the display which the earlier hacks did not.	2001-12-13 10:33:20 +00:00
arr	e55fee2143	- Move _jail sysctl node underneath _kern_security in order to standardize where our security related sysctl tuneables are located. Also, this will help if/when we move _security node out from under _kern as to help make _kern less cluttered. Approved by: rwatson Review by: rwatson	2001-12-12 05:23:20 +00:00
jhb	21b6b26912	Overhaul the per-CPU support a bit: - The MI portions of struct globaldata have been consolidated into a MI struct pcpu. The MD per-CPU data are specified via a macro defined in machine/pcpu.h. A macro was chosen over a struct mdpcpu so that the interface would be cleaner (PCPU_GET(my_md_field) vs. PCPU_GET(md.md_my_md_field)). - All references to globaldata are changed to pcpu instead. In a UP kernel, this data was stored as global variables which is where the original name came from. In an SMP world this data is per-CPU and ideally private to each CPU outside of the context of debuggers. This also included combining machine/globaldata.h and machine/globals.h into machine/pcpu.h. - The pointer to the thread using the FPU on i386 was renamed from npxthread to fpcurthread to be identical with other architectures. - Make the show pcpu ddb command MI with a MD callout to display MD fields. - The globaldata_register() function was renamed to pcpu_init() and now init's MI fields of a struct pcpu in addition to registering it with the internal array and list. - A pcpu_destroy() function was added to remove a struct pcpu from the internal array and list. Tested on: alpha, i386 Reviewed by: peter, jake	2001-12-11 23:33:44 +00:00
guido	2e77fc4d02	Fix boot -p for DDBless kernels Pointed out by: John Hay <jhay@icomtek.csir.co.za>	2001-12-11 10:21:26 +00:00
peter	46c0ef263e	Wrap Dangerously Dedicated printf under if (bootverbose)	2001-12-11 05:35:43 +00:00
obrien	41ac252611	Missed an assignment of arg6 in previous commit.	2001-12-10 20:58:39 +00:00
obrien	806dd95941	Adjust for the addition of CTR6.	2001-12-10 20:18:17 +00:00
guido	d779575f78	Add new boot flag to i386 boot: -p. This flag adds a pausing utility. When ran with -p, during the kernel probing phase, the kernel will pause after each line of output. This pausing can be ended with the '.' key, and is automatically suspended when entering ddb. This flag comes in handy at systems without a serial port that either hang during booting or reser. Reviewed by: (partly by jlemon) MFC after: 1 week	2001-12-10 20:02:22 +00:00
obrien	330a1032c1	Update to C99, s/__FUNCTION__/__func__/.	2001-12-10 05:51:45 +00:00
obrien	cca4f7b2d9	Repeat after me -- "Use of ANSI string concatenation can be bad." In this case, C99's __func__ is properly defined as: static const char __func__[] = "function-name"; and GCC 3.1 will not allow it to be used in bogus string concatenation.	2001-12-10 05:40:12 +00:00
alc	a49c1c9183	o Eliminate compilation warnings on 64-bit architectures.	2001-12-10 03:34:06 +00:00
alc	be4cbfd029	o Eliminate unnecessary synchronization from filt_aiodetach(). o The manual page for kevent says that EVFILT_AIO returns under the same conditions as aio_error(). With that in mind, set the data field of the returned struct kevent to the value that would be returned by aio_error(). o Fix two compilation warnings.	2001-12-09 08:16:36 +00:00
dillon	6fe4980d43	Allow maxusers to be specified as 0 in the kernel config, which will cause the system to auto-size to between 32 and 512 depending on the amount of memory. MFC after: 1 week	2001-12-09 01:57:09 +00:00
dillon	6e9238ff3f	The nbuf calculation was assuming that PAGE_SIZE = 4096 bytes, which is bogus. The calculation has been adjusted to use units of kilobytes. Noticed by: Chad David <davidc@acns.ab.ca> MFC after: 1 week	2001-12-08 20:37:08 +00:00
davidc	1d1054c88d	Update the comment about System initialization to reflect the use of DOMAIN_SET(9) instead of SYSINIT for adding domains at startup. Reviewed by: alfred	2001-12-08 04:20:54 +00:00
rwatson	7769631069	o A few more minor whitespace and other style fixes. Submitted by: bde	2001-12-06 21:58:47 +00:00
rwatson	751c41df3a	o Remove unnecessary inclusion of opt_global.h. Submitted by: bde	2001-12-06 21:55:41 +00:00
rwatson	754ad10054	o Make kern.security.bsd.suser_enabled TUNABLE. Requested by: green	2001-12-05 18:49:20 +00:00
mckusick	f62c954d2f	Update pathnames for creation of tags file.	2001-12-05 01:23:21 +00:00
rwatson	fb311b7cce	o Update an instance of 'unprivileged_procdebug_permitted' missed in the previous commit: the comment should also call it 'unprivileged_proc_debug'.	2001-12-03 19:10:21 +00:00
rwatson	b5de442911	o Introduce pr_mtx into struct prison, providing protection for the mutable contents of struct prison (hostname, securelevel, refcount, pr_linux, ...) o Generally introduce mtx_lock()/mtx_unlock() calls throughout kern/ so as to enforce these protections, in particular, in kern_mib.c protection sysctl access to the hostname and securelevel, as well as kern_prot.c access to the securelevel for access control purposes. o Rewrite linux emulator abstractions for accessing per-jail linux mib entries (osname, osrelease, osversion) so that they don't return a pointer to the text in the struct linux_prison, rather, a copy to an array passed into the calls. Likewise, update linprocfs to use these primitives. o Update in_pcb.c to always use prison_getip() rather than directly accessing struct prison. Reviewed by: jhb	2001-12-03 16:12:27 +00:00
rwatson	de0f8b15da	o Uniformly copy uap arguments into local variables before grabbing giant, and make whitespace more consistent around giant-frobbing.	2001-12-02 15:22:56 +00:00
rwatson	dbe003dc3e	o Remove KSE race in setuid() in which oldcred was preserved before giant was grabbed. This was introduced in 1.101 when the giant pushdown for kern_prot.c was originally performed.	2001-12-02 15:15:29 +00:00
rwatson	8b2ab77900	o General style, formatting, etc, improvements: - uid's -> uids - whitespace improvements, linewrap improvements - reorder copyright more appropriately - remove redundant MP SAFE comments, add one "NOT MPSAFE?" for setgroups(), which seems to be the sole un-changed system call in the file. - clean up securelevel_g?() functions, improve comments. Largely submitted by: bde	2001-12-02 15:07:10 +00:00
alfred	77b8f8139c	make LOCKF_DEBUG kernel option work (sorta) Submitted by: Maxim Konovalov <maxim@macomnet.ru> PR: kern/32267	2001-12-02 12:47:25 +00:00
luigi	0d72b82e2e	vm/vm_kern.c: rate limit (to once per second) diagnostic printf when you run out of mbuf address space. kern/subr_mbuf.c: print a warning message when mb_alloc fails, again rate-limited to at most once per second. This covers other cases of mbuf allocation failures. Probably it also overlaps the one handled in vm/vm_kern.c, so maybe the latter should go away. This warning will let us gradually remove the printf that are scattered across most network drivers to report mbuf allocation failures. Those are potentially dangerous, in that they are not rate-limited and can easily cause systems to panic. Unless there is disagreement (which does not seem to be the case judging from the discussion on -net so far), and because this is sort of a safety bugfix, I plan to commit a similar change to STABLE during the weekend (it affects kern/uipc_mbuf.c there). Discussed-with: jlemon, silby and -net	2001-12-01 00:21:30 +00:00
rwatson	aa8360c1cd	o Introduce kern.security.bsd.unprivileged_read_msgbuf, which allows the administrator to restrict access to the kernel message buffer. It defaults to '1', which permits access, but if set to '0', requires that the process making the sysctl() have appropriate privilege. o Note that for this to be effective, access to this data via system logs derived from /dev/klog must also be limited. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2001-11-30 21:40:52 +00:00
rwatson	68b9d3708b	o Further sysctl name simplification, generally stripping 'permitted', using '_'s more consistently. Discussed with: bde, jhb Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2001-11-30 21:33:16 +00:00
rwatson	e92874bd10	o Move current inhabitants of kern.security to kern.security.bsd, so that new models can inhabit kern.security.<modelname>. o While I'm there, shorten somewhat excessive variable names, and clean things up a little. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2001-11-30 20:58:31 +00:00
rwatson	5682f21557	o Cache req->td->td_proc->p_ucred->cr_prison in pr to improve readability. o Conditionalize only the SYSCTL definitions for the regression tree, not the variables itself, decreasing the number of #ifdef REGRESSIONs scattered in kern_mib.c, and making the code more readable. Sponsored by: DARPA, NAI Labs	2001-11-28 21:22:05 +00:00
jwd	2a6f1a68f9	Return a more meaningful errno when the length of the interpreter exceeds MAXSHELLCMDLEN to avoid secondary /bin/sh execution. Update execve man page to reflect change. Increase MAXSHELLCMDLEN to a slightly more meaningful value. PR: kern/32106 Submitted by: b@etek.chalmers.se Reviewed by: bsd MFC after: 2 weeks	2001-11-28 03:26:58 +00:00
peter	ca5b2bc739	Dont print the sysctl node tree unless you're root. Found by: jkb (Yahoo OS troublemaker)	2001-11-28 03:11:16 +00:00
bmilekic	0dbfbc0131	Context: For an object type, we maintain a variable mb_mapfull. It is 0 by default and is only raised to 1 in one place: when an mb_pop_cont() fails for the first time, on the assumption that the reason for the failure is due to the underlying map for the object (e.g. clust_map, mbuf_map) being exhausted. Problem and Changes: Change how we define "mb_mapfull." It now means: "set to 1 when the first mb_pop_cont() fails only in the kmem_malloc()-ing of the object, and only if the call was with the M_TRYWAIT flag." This is a more conservative definition and should avoid odd [but theoretically possible] situations from occuring. i.e. we had set mb_mapfull to 1 thinking the map for the object was actually exhausted when we _actually_ failed in malloc()ing the space for the bucket structure managing the objects in the page we're allocating.	2001-11-25 04:42:54 +00:00
dfr	194b963c4d	Since we used '#ifdef __i386__', don't close with '#endif /* !__alpha__ */'	2001-11-24 10:11:14 +00:00
obrien	8c5542cd12	Remove the use of _PATH_DEV in the example. The kernel certainly doesn't use _PATH_DEV or even /dev/ to find the device. It cannot, since "/" has not been mounted. Maybe the only affect of using /dev/ is that it gets put in the mounted-from name for "/", so that mount(8), etc., display an absolute path before "/" has been remounted. Many have never bothered typing the full path, and code that constructs a path in rootdevnames[] never bothered to construct a full path, so the example shouldn't have it. Submitted by: bde	2001-11-24 01:34:12 +00:00
peter	43edb17438	Recognize the "fixed" geometry in boot1 so that DD disks are not interpreted as real fdisk tables (and fail).	2001-11-21 08:31:45 +00:00
obrien	33425adab6	We only have slices on i386 and IA-64.	2001-11-20 23:48:00 +00:00
sobomax	23105d4979	Make kevents on pipes work as described in the manpage - when the last reader/writer disconnects, ensure that anybody who is waiting for the kevent on the other end of the pipe gets EV_EOF. MFC after: 2 weeks	2001-11-19 09:25:30 +00:00
dillon	58a458515f	cast hashing index to (int)(intptr_t) for calculation. mtx_init() with MTX_QUIET and MTX_NOWITNESS to avoid bogus warnings	2001-11-19 00:20:36 +00:00
arr	47cd77ddbd	- Ensure that linker file id's are unique, rather than blindly incrementing the value. Reviewed by: dfr, peter	2001-11-18 18:19:35 +00:00
dillon	86ed17d675	Give struct socket structures a ref counting interface similar to vnodes. This will hopefully serve as a base from which we can expand the MP code. We currently do not attempt to obtain any mutex or SX locks, but the door is open to add them when we nail down exactly how that part of it is going to work.	2001-11-17 03:07:11 +00:00
peter	9aa0c95a10	Fix some warnings on 64 bit platforms.	2001-11-17 00:42:02 +00:00
peter	ac0c0d2f8c	utime/stime.tv_sec are elapsed times, not relative to 1970. We can safely print them as longs. Even if ^T overflows after a process has accumulated 68 years of user or system time, it is no big deal.	2001-11-17 00:26:57 +00:00
peter	b15a1b598d	You cannot cast a time_t to quad_t and printf it with %lld. quad_t is 64 bits, not long long.	2001-11-16 23:53:48 +00:00
iedowse	4e3498d275	Fix a number of misspellings of "dependency" and "dependencies" in comments and function names. PR: kern/8589 Submitted by: Rajesh Vaidheeswarran <rv@fore.com>	2001-11-16 21:08:40 +00:00
phk	13cae0ede7	Back out the previous fix to the leading zero problem, I hadn't noticed it in there already. That should teach me to check exit code from cvsup.	2001-11-16 17:07:47 +00:00
phk	045e2cb555	Reject leading zeros in dev_stdclone(). PR: 32019 Submitted by: fenner	2001-11-16 17:05:07 +00:00
joe	0b97e4b5e2	Switch warnings and strict back on again in a way that's compatible with -stable as well as -current. Reviewed by: imp	2001-11-16 02:02:42 +00:00
fenner	45b8f05b03	Do not allow leading zeros on device names in dev_stdclone(). PR: kern/32019 Reviewed by: phk	2001-11-15 23:27:46 +00:00
jhb	ae1274f8d2	Use MTX_QUIET for the lock operations during clock interrupts so their logs don't drown out more useful log messages.	2001-11-15 19:54:48 +00:00
jhb	7225db9bf4	Add a couple of returns to making recovering from a failed witness_assert() more sane in the RESTARTABLE_PANICS case.	2001-11-15 19:46:36 +00:00
jhb	a34999ebf2	Remove definition of witness and comment stating that this file implements witness. Witness moved off to subr_witness.c a while ago.	2001-11-15 19:08:55 +00:00
dillon	e3b965f7d5	remove holdfp() Replace uses of holdfp() with fget() or fgetvp() calls as appropriate introduce fget(), fget_read(), fget_write() - these functions will take a thread and file descriptor and return a file pointer with its ref count bumped. introduce fgetvp(), fgetvp_read(), fgetvp_write() - these functions will take a thread and file descriptor and return a vref()'d vnode. _read() requires that the file pointer be FREAD, _write that it be FWRITE. This continues the cleanup of struct filedesc and struct file access routines which, when are all through with it, will allow us to then make the API calls MP safe and be able to move Giant down into the fo_* functions.	2001-11-14 06:30:36 +00:00
dillon	27124b4079	Create a mutex pool API for short term leaf mutexes. Replace the manual mutex pool in kern_lock.c (lockmgr locks) with the new API. Replace the mutexes embedded in sxlocks with the new API.	2001-11-13 21:55:13 +00:00
jhb	7e0d456cdf	As a followup to the previous fixes to inferior, revert some of the changes in 1.80 that were needed for locking that are no longer needed now that a lock is simply asserted. Submitted by: bde	2001-11-13 16:55:54 +00:00
ps	d745b728a2	Fix a signed bug in the crashdump code for systems with > 2GB of ram. Reviewed by: peter	2001-11-13 01:08:54 +00:00
keramida	e2b354901d	Remove EOL whitespace. Reviewed by: alfred	2001-11-12 20:51:40 +00:00
keramida	d820d4cb55	Make KASSERT's print the values that triggered a panic. Reviewed by: alfred	2001-11-12 20:50:06 +00:00
jhb	c7338726d9	Clean up breakage in inferior() I introduced in 1.92 of kern_proc.c: - Restore inferior() to being iterative rather than recursive. - Assert that the proctree_lock is held in inferior() and change the one caller to get a shared lock of it. This also ensures that we hold the lock after performing the check so the check can't be made invalid out from under us after the check but before we act on it. Requested by: bde	2001-11-12 18:56:49 +00:00
peter	63c937a8f7	Commit the better version that I had a while ago. This has only one reference to curthread. (#define curproc (curthread->td_proc)).	2001-11-12 08:53:34 +00:00
dillon	9a4e2a07a8	When curproc is used repeatedly store curproc into a local variable to reduce generated code. This is a test case.	2001-11-12 08:42:20 +00:00
alfred	015f13094a	turn vn_open() into a wrapper around vn_open_cred() which allows one to perform a vn_open using temporary/other/fake credentials. Modify the nfs client side locking code to use vn_open_cred() passing proc0's ucred instead of the old way which was to temporary raise privs while running vn_open(). This should close the race hopefully.	2001-11-11 22:39:07 +00:00
arr	cd1e73aaef	- No need for resetting values to 0 when M_ZERO flag is used. Approved: jhb	2001-11-10 21:36:56 +00:00
iedowse	8122c9fcb4	Properly sanity-check the old msgbuf structure before we accept it as being valid. Previously only the magic number and the virtual address were checked, but it makes little sense to require that the virtual address is the same (the message buffer is located at the end of physical memory), and checks on the msg_bufx and msg_bufr indices were missing. Submitted by: Bodo Rueskamp <br@clabsms.de> Tripped over during a kernel debugging tutorial given by: grog Reviewed by: grog, dwmalone MFC after: 1 week	2001-11-09 23:58:07 +00:00
dillon	08792e81f7	Placemark an interrupt race in -current which is currently protected by Giant. -stable will get spl*() fixes for the race. Reported by: Rob Anderson <rob@isilon.com> MFC after: 0 days	2001-11-08 18:09:18 +00:00
rwatson	5d0ec904c0	o General style improvemnts. Submitted by: bde	2001-11-08 15:31:19 +00:00
rwatson	2a6a10923a	o Trim trailing whitespace from kern_mib.c, as suggested by bde. Good grief.	2001-11-08 15:20:00 +00:00
rwatson	8cf42b482a	o Replace reference to 'struct proc' with 'struct thread' in 'struct sysctl_req', which describes in-progress sysctl requests. This permits sysctl handlers to have access to the current thread, permitting work on implementing td->td_ucred, migration of suser() to using struct thread to derive the appropriate ucred, and allowing struct thread to be passed down to other code, such as network code where td is not currently available (and curproc is used). o Note: netncp and netsmb are not updated to reflect this change, as they are not currently KSE-adapted. Reviewed by: julian Obtained from: TrustedBSD Project	2001-11-08 02:13:18 +00:00
peter	1a27c90eb8	For what its worth, sync up the type of ps_arg_cache_max (unsigned long) with the sysctl type (signed long).	2001-11-08 00:24:48 +00:00
rwatson	bd13886bd8	o Cache the process's struct prison so as to create a more visually appealing code structure. In particular, s/req->p->p_ucred->cr_prison/pr/ Requested by: imp, jhb, jake, other hangers on	2001-11-06 20:09:33 +00:00
rwatson	835371a313	o Remove a tab missed in the previous whitespace commit.	2001-11-06 19:58:43 +00:00
rwatson	08fb9c82f6	o Remove double-indentation of sysctl_kern_securelvl. This change is consistent with the one other function in the file, and prevents long lines in up-coming changes. This nominally pulls kern_mib.c a little further down the long path to style(9) compliance.	2001-11-06 19:56:58 +00:00
arr	786277e5d2	o No need to set values to 0 when we utilize M_ZERO Approved by: peter	2001-11-05 22:27:46 +00:00
dillon	1147eaf58a	Implement IO_NOWDRAIN and B_NOWDRAIN - prevents the buffer cache from blocking in wdrain during a write. This flag needs to be used in devices whos strategy routines turn-around and issue another high level I/O, such as when MD turns around and issues a VOP_WRITE to vnode backing store, in order to avoid deadlocking the dirty buffer draining code. Remove a vprintf() warning from MD when the backing vnode is found to be in-use. The syncer of buf_daemon could be flushing the backing vnode at the time of an MD operation so the warning is not correct. MFC after: 1 week	2001-11-05 18:48:54 +00:00
rwatson	11bc0f4ff1	Update copyrights to include Thomas Moestl. Submitted by: "Ilmar S. Habibulin" <ilmar@watson.org> Obtained from: TrustedBSD Project	2001-11-05 15:36:24 +00:00
phk	235f3ed483	Define a new mount flag "MNT_JAILDEVFS" Collect the magic combination of flags which can be updated into a macro in sys/mount.h rather than inlining them (twice!) in vfs_syscalls.c	2001-11-05 10:33:45 +00:00
dillon	c9a56085ce	Add mnt_reservedvnlist so we can MFC to 4.x, in order to make all mount structure changes now rather then piecemeal later on. mnt_nvnodelist currently holds all the vnodes under the mount point. This will eventually be split into a 'dirty' and 'clean' list. This way we only break kld's once rather then twice. nvnodelist will eventually turn into the dirty list and should remain compatible with the klds.	2001-11-04 18:55:42 +00:00
peter	1c09a79255	* empty log message *	2001-11-04 18:22:48 +00:00
phk	b102b404f9	Don't call cdevsw_add().	2001-11-04 11:56:22 +00:00
phk	c665837dfd	Rename the top 7 bits if disk minors to spare bits, rather than type bits.	2001-11-04 09:01:07 +00:00
phk	076014359d	Don't choke on old sd%d.ctl devices. Tripped over by: Jos Backus <josb@cncdsl.com>	2001-11-03 23:21:00 +00:00
peter	bd5684dc54	_SIG_MAXSIG (128) is the highest legal signal. The arrays are offset by one - see _SIG_IDX(). Revert part of my mis-correction in kern_sig.c (but signal 0 still has to be allowed) and fix _SIG_VALID() (it was rejecting ignal 128).	2001-11-03 13:26:15 +00:00
peter	43929480b6	Partial reversion of rev 1.138. kill and killpg allow a signal argument of 0. You cannot return EINVAL for signal 0. This broke (in 5 minutes of testing) at least ssh-agent and screen. However, there was a bug in the original code. Signal 128 is not valid. Pointy-hat to: des, jhb	2001-11-03 12:36:16 +00:00
peter	2fc110a60d	FreeBSD/tahoe is not likely for a while.	2001-11-03 08:19:21 +00:00
des	84073a96d1	We have a _SIG_VALID() macro, so use it instead of duplicating the test all over the place. Also replace a printf() + panic() with a KASSERT(). Reviewed by: jhb	2001-11-02 23:50:00 +00:00
rwatson	171c9bdfc7	o Remove (struct proc *p = td->td_proc) indirection in ipcperm(), as suser_td(td) works as well as suser_xxx(NULL, p->p_ucred, 0); This simplifies upcoming changes to suser(), and causes this code to use the right credential (well, largely) once the td->td_ucred changes are complete. There remains some redundancy and oddness in this code, which should be rethought after the next batch of suser and credential changes.	2001-11-02 21:20:05 +00:00
imp	1f54f2c411	Back out the -w, option strict and our($...). They don't work for me and have broken the kernel build.	2001-11-02 21:14:17 +00:00

1 2 3 4 5 ...

4494 Commits