freebsd-skq

Author	SHA1	Message	Date
Robert Watson	0a05006dd2	Add MAC_STATIC, a kernel option that disables internal MAC Framework synchronization protecting against dynamic load and unload of MAC policies, and instead simply blocks load and unload. In a static configuration, this allows you to avoid the synchronization costs associated with introducing dynamicism. Obtained from: TrustedBSD Project Sponsored by: DARPA, McAfee Research	2004-05-03 20:53:05 +00:00
Colin Percival	b62b230461	Fix a race condition which could result in profprocs being decremented more than once if stopprofclock is called multiple times on the same process.	2004-05-03 00:48:11 +00:00
Peter Wemm	e9eabf5983	Checkpoint commit for an alternative WIP kernel module loader that isn't as dependent on binutils features/quirks as the current one. This one loads plain .o files without having to mess with shared object mode. This happens to be essential on amd64, because binutils hasn't implemented all the quirks/features that we need for producing the hack non-PIC shared objects. As it turned out, .o format isn't all that inconvenient after all. It looks like the ability to use the same .o files for linking directly into a static kernel or loading as a module might be worth it. It is still very much a work-in-progress, but it is almost usable. Other changes are still needed in order to use it though, these have not been committed yet. There is still a memory corruption/overrun bug somewhere. For example, test modules load and work, but the machine explodes a few minutes later in vm_forkproc() or the like. Notable missing things include kldxref support, and loader(8) support. I wanted to figure out a working baseline set of code first.	2004-04-30 16:32:40 +00:00
Daniel Eischen	4fc21c0947	Keep track of threads waiting in kse_release() to avoid a race condition where kse_wakeup() doesn't yet see them in (interruptible) sleep queues. Also add an upcall check to sleepqueue_catch_signals() suggested by jhb. This commit should fix recent mysql hangs. Reviewed by: jhb, davidxu Mysql'd by: Robin P. Blanchard <robin.blanchard at gactr uga edu>	2004-04-28 20:36:53 +00:00
David Schultz	06afcd9d10	If the buffer supplied to kenv(KENV_DUMP, ...) isn't big enough, return the number of bytes needed instead of 0. The manpage claims that we do this anyway.	2004-04-28 01:27:33 +00:00
Bosko Milekic	5a59cefcd1	Give jail(8) the feature to allow raw sockets from within a jail, which is less restrictive but allows for more flexible jail usage (for those who are willing to make the sacrifice). The default is off, but allowing raw sockets within jails can now be accomplished by tuning security.jail.allow_raw_sockets to 1. Turning this on will allow you to use things like ping(8) or traceroute(8) from within a jail. The patch being committed is not identical to the patch in the PR. The committed version is more friendly to APIs which pjd is working on, so it should integrate into his work quite nicely. This change has also been presented and addressed on the freebsd-hackers mailing list. Submitted by: Christian S.J. Peron <maneo@bsdpro.com> PR: kern/65800	2004-04-26 19:46:52 +00:00
Pawel Jakub Dawidek	6c0ad4a77a	Always use nd.ni_vp->v_mount as an argument for VFS_QUOTACTL(), just like in RELENG_4. Pointed out by: Alex Lyashkov <umka@sevinter.net>	2004-04-26 15:44:42 +00:00
Hiten Pandya	024035e822	The paper "Hashed Timers and Hierarchical Wheels: Data Structures for the Efficient Implementation of a Timer Facility" was co-author'ed by T. Lauk, not A. Lauk. Adjust nearby whitespace.	2004-04-25 04:10:17 +00:00
Alan Cox	59c8bc40ce	Utilize sf_buf_alloc() rather than pmap_qenter() (and sometimes kmem_alloc_wait()) for mapping the image header. On all machines with a direct virtual-to-physical mapping and SMP/HTT i386s, this is a clear win.	2004-04-23 03:01:40 +00:00
David E. O'Brien	207a6c0dcb	There was a thread on "unusually high load averages" when running under sched_ule, in January 2004. Looking at this, "pagezero" is (one of) the culprit(s). We had no provision for processes with P_NOLOAD set. With pagezero not running at PRI_ITHD, kseq_load_{add,rem} count pagezero as another-normal-process, thus the "expected-plus-one" load reported in the above thread. Submitted by: Nikos Ntarmos <ntarmos@ceid.upatras.gr>	2004-04-22 21:37:46 +00:00
Pawel Jakub Dawidek	0c0c597faa	Look out! vn_start_write() is able to return 0 and NULL 'mp'. Submitted by: Alex Lyashkov <shadow@psoft.net>	2004-04-22 15:40:27 +00:00
Bruce Evans	057e27959f	Include <sys/mutex.h> and its prerequisite <sys/lock.h> instesd of depending on namespace pollution in <sys/vnode.h>. Sorted includes.	2004-04-21 12:10:30 +00:00
Colin Percival	05641e82d7	1. Remove callout_stop binary compatibility. 2. Document that this means that kernel modules must be rebuilt. 3. While I'm here, fix my sorting error in callout.h Requested by: many [1], scottl [2], bde [3]	2004-04-20 15:49:31 +00:00
Mike Makonnen	b9fb5d4286	If you're trying to find out if a thread is valid and in the same process as the current thread it makes absolutely no sense to lock the parent process through the pointer in said thread. Submitted by: pho (with minor correction) Pointy Hat To: mtm	2004-04-19 14:20:01 +00:00
Luigi Rizzo	24665342d3	constify the last argument of m_copyback.	2004-04-18 13:01:28 +00:00
Bruce Evans	7b1fe905ef	Fixed some style bugs in previous commit (mainly an insertion sort error for declarations, and poorly worded messages). Fixed some nearby style bugs (unsorted declarations).	2004-04-17 02:46:05 +00:00
John Baldwin	7870c3c61c	- Enable (unmask) interrupt sources earlier in the ithread loop. Specifically, we used to enable the source after locking sched_lock and just before we had already decided to do a context switch. This meant that an ithread could never process more than one interrupt per context switch. Enabling earlier in the loop before sched_lock is acquired allows an ithread to handle multiple interrupts per context switch if interrupts fire very rapidly. For the case of heavy interrupt load this can reduce the number of context switches (and thus overhead) as well as reduce interrupt latency. - Now that we can handle multiple interrupts per context switch, add simple interrupt storm protection to threaded interrupts. If X number of consecutive interrupts are triggered before the itherad voluntarily yields to another thread, then the interrupt thread will sleep with the associated interrupt source disabled (masked) for 1/10th of a second. The default value of X is 500, but it can be tweaked via the tunable/ sysctl hw.intr_storm_threshold. If an interrupt storm is detected, then a message is output to the kernel console on the first occurrence per interrupt thread. Interrupt storm protection can be disabled completely by setting this value to 0. There is no scientific reasoning for the 1/10th of a second or 500 interrupts values, so they may require tweaking at some point in the future. Tested by: rwatson (an earlier version w/o the storm protection) Tested by: mux (reportedly made a machine with two PCI interrupts storming usable rather than hard locked) Reviewed by: imp	2004-04-16 20:25:40 +00:00
Robert Watson	d54efd4d31	At some point during the history of m_getcl(), MAC support began to unconditionally initialize the mbuf header even if cluster allocation failed, which could result in a NULL pointer dereference in low-memory conditions. PR: kern/65548 Submitted by: Stephan Uphoff <ups@tree.com>	2004-04-16 14:35:11 +00:00
Ruslan Ermilov	61f7581d08	Ensure that the poll_burst <= poll_burst_max constraint really holds. Reviewed by: luigi	2004-04-15 07:38:44 +00:00
Warner Losh	5e1d0a23bc	Fix off by one error, twice. Submitted by: Carlos Velasco (first one), jhb (second one)	2004-04-12 23:02:21 +00:00
Colin Percival	4a3b3dcb55	stop() no longer needs sched_lock held; in fact, holding sched_lock causes a LOR against sleepq. Fix the comment, and fix ptracestop() to pick up sched_lock after stop() rather than before. Reported by: Scott Sipe <cscotts@mindspring.com> Reviewed by: rwatson, jhb	2004-04-12 15:56:05 +00:00
Maxime Henrion	a0b5a67929	Put deprecated sysctl code inside BURN_BRIDGES.	2004-04-11 21:09:22 +00:00
Alan Cox	148b3f62a9	Use vm_page_hold() rather than vm_page_wire() for short-duration page wiring. The reason being that vm_page_hold() is cheaper.	2004-04-11 19:57:11 +00:00
Maxime Henrion	4ddd1e65d4	Remove a comment that complains about the lack of %qd, to justify truncating a rlim_t to a long. We have %qd since some time now. However, the correct format to use here is %jd and a cast to intmax_t, so do this.	2004-04-10 11:08:16 +00:00
Peter Edwards	24554d00bc	Plug minor memory leak of module_t structures when unloading a file from the kernel. Reviewed By: Doug Rabson (dfr@)	2004-04-09 15:27:38 +00:00
Olivier Houchard	d50c87decf	Spell "switches" a more conventional way.	2004-04-09 14:31:29 +00:00
Robert Watson	123f024b24	Compare pointers with NULL rather than using pointers are booleans in if/for statements. Assign pointers to NULL rather than typecast 0. Compare pointers with NULL rather than 0.	2004-04-09 13:23:51 +00:00
Mike Silbersack	e8410540b7	Fix a regression in my change which sends headers along with data; a side effect of that change caused headers to not be sent if a 0 byte file was passed to sendfile. This change fixes that behavior, allowing sendfile to send out the headers even with a 0 byte file again. Noticed by: Dirk Engling	2004-04-08 07:14:34 +00:00
Marcel Moolenaar	ece267ba58	Do not assume that the initial thread (i.e. the thread with the ID equal to the process ID) is still present when we dump a core. It already may have been destroyed. In that case we would end up dereferencing a NULL pointer, so specifically test for that as well. Reported & tested by: Dan Nelson <dnelson@allantgroup.com>	2004-04-08 06:37:00 +00:00
Colin Percival	49a74476a6	Add whitespace before comment blocks. (reported by njl) Remove spurious whitespace, add indent protection, fix punctuation, remove initialization of static variables to zero, put wakeup_ctr and wakeup_needed in the correct order. (reported by bde) This doesn't fix all the style bugs I introduced, but the remaining style bugs make it easier for me to understand what I did here.	2004-04-08 02:03:49 +00:00
Warner Losh	f36cfd49ad	Remove advertising clause from University of California Regent's license, per letter dated July 22, 1999 and email from Peter Wemm, Alan Cox and Robert Watson. Approved by: core, peter, alc, rwatson	2004-04-07 20:46:16 +00:00
Colin Percival	ec513ff759	Fix filt_timer* races: Finish initializing a knote before we pass it to a callout, and use the new callout_drain API to make sure that a callout has finished before we deallocate memory it is using. PR: kern/64121 Discussed with: gallatin	2004-04-07 05:59:57 +00:00
Colin Percival	2c1bb20746	Introduce a callout_drain() function. This acts in the same manner as callout_stop(), except that if the callout being stopped is currently in progress, it blocks attempts to reset the callout and waits until the callout is completed before it returns. This makes it possible to clean up callout-using code safely, e.g., without potentially freeing memory which is still being used by a callout. Reviewed by: mux, gallatin, rwatson, jhb	2004-04-06 23:08:49 +00:00
John Baldwin	9000d57d57	Associate a simple count of waiters with each condition variable. The count is protected by the mutex that protects the condition, so the count does not require any extra locking or atomic operations. It serves as an optimization to avoid calling into the sleepqueue code at all if there are no waiters. Note that the count can get temporarily out of sync when threads sleeping on a condition variable time out or are aborted. However, it doesn't hurt to call the sleepqueue code for either a signal or a broadcast when there are no waiters, and the count is never out of sync in the opposite direction unless we have more than INT_MAX sleeping threads.	2004-04-06 19:17:46 +00:00
John Baldwin	535eb30962	Add a new kernel option MUTEX_WAKE_ALL that changes the mutex unlock code to awaken all waiters when a contested mutex is released instead of just the highest priority waiter. If the various threads are awakened in sequence then each thread may acquire and release the lock in question without contention resulting in fewer expensive unlock and lock operations. This old behavior of waking just the highest priority is still used if this option is specified. Making the algorithm conditional on a kernel option will allows us to benchmark both cases later and determine which one should be used by default. Requested by: tanimura-san	2004-04-06 19:12:24 +00:00
John Baldwin	ef2c0ba7e4	Rename turnstile_wakeup() to turnstile_broadcast() to make the naming more consistent with other APIs. sleepq and cv's use signal/broadcast, and msleep uses wakeup_one/wakeup. Prior to this turnstiles were using a signal/wakeup mixture.	2004-04-06 19:07:21 +00:00
Bruce Evans	295ed75297	Removed some less than useful comments: - don't say what a small subset of the options includes are for. - don't mark up functions which use all their args with /* ARGSUSED */. The markup should have been removed when the unused retval parameter was removed. - don't comment on what routine suser() checks do. Removed nearby excessive vertical whitespace.	2004-04-06 10:05:02 +00:00
Warner Losh	7f8a436ff2	Remove advertising clause from University of California Regent's license, per letter dated July 22, 1999. Approved by: core	2004-04-05 21:03:37 +00:00
Doug Rabson	7d5ea13fcd	Try not to crash instantly when signalling a libthr program to death.	2004-04-05 15:06:01 +00:00
Doug Rabson	e2c8a799c1	Regen.	2004-04-05 10:17:23 +00:00
Doug Rabson	0b0a60fb43	Add lgetfh(2) which is like getfh(2) but doesn't follow symlinks.	2004-04-05 10:15:53 +00:00
Robert Watson	051bbf603a	Detatch incorrect spellings of detach.	2004-04-04 19:15:45 +00:00
Jeff Roberson	37a35e4a60	- Use the proper constant in sched_interact_update(). Previously, SCHED_INTERACT_MAX was used where SCHED_SLP_RUN_MAX was needed. This was causing the interactivity scaler to lose history at a more dramatic rate than intended.	2004-04-04 19:12:56 +00:00
Marcel Moolenaar	8c9b7b2c84	Create NT_PRSTATUS and NT_FPREGSET notes for each and every thread in the process. This is required for proper debugging of corefiles created by 1:1 or M:N threaded processes. Add an XXX comment where we should actually call a function that dumps MD specific notes. An example of a MD specific note is the NT_PRXFPREG note for SSE registers. Since BFD creates non-annotated pseudo-sections for the first PRSTATUS and FPREGSET notes (non-annotated in the sense that the name of the section does not contain the pid/tid), make sure those sections describe the initial thread of the process (i.e. the thread which tid equals the pid). This is not strictly necessary, but makes sure that tools that use the non-annotated section names will not change behaviour due to this change. The practical upshot of this all is that one can see the threads in the debugger when looking at a corefile. For 1:1 threading this means that all threads are visible.	2004-04-03 20:25:41 +00:00
Marcel Moolenaar	fdcac92868	Assign thread IDs to kernel threads. The purpose of the thread ID (tid) is twofold: 1. When a 1:1 or M:N threaded process dumps core, we need to put the register state of each of its kernel threads in the core file. This can only be done by differentiating the pid field in the respective note. For this we need the tid. 2. When thread support is present for remote debugging the kernel with gdb(1), threads need to be identified by an integer due to limitations in the remote protocol. This requires having a tid. To minimize the impact of having thread IDs, threads that are created as part of a fork (i.e. the initial thread in a process) will inherit the process ID (i.e. tid=pid). Subsequent threads will have IDs larger than PID_MAX to avoid interference with the pid allocation algorithm. The assignment of tids is handled by thread_new_tid(). The thread ID allocation algorithm has been written with 3 assumptions in mind: 1. IDs need to be created as fast a possible, 2. Reuse of IDs may happen instantaneously, 3. Someone else will write a better algorithm.	2004-04-03 15:59:13 +00:00
Alan Cox	121230a40d	In some cases, sf_buf_alloc() should sleep with pri PCATCH; in others, it should not. Add a new parameter so that the caller can specify which is the case. Reported by: dillon	2004-04-03 09:16:27 +00:00
Kris Kennaway	c5af600675	Add missing comment terminator.	2004-04-02 04:57:40 +00:00
Julian Elischer	4f73277a35	The comment complained about not having a thread_unlink() and did the work itself, but thread_unink() has existed for a while... use it.	2004-04-02 01:01:34 +00:00
John Baldwin	e43257aa7d	Finish fixing up Alpha to work with an MP safe ptrace(): - ptrace_single_step() is no longer called with the proc lock held, so don't try to unlock it and then relock it. - Push Giant down into proc_rwmem() instead of forcing all the consumers (including Alpha breakpoint support) to explicitly wrap calls to proc_rwmem() with Giant. Tested by: kensmith	2004-04-01 20:56:44 +00:00
Scott Long	cd587b1397	Don't print out 'GIANT-LOCKED' for INTR_FAST drivers.	2004-04-01 07:18:42 +00:00
Pawel Jakub Dawidek	2fc0588da2	Remove sysctl kern.ps_argsopen, it is not very useful, one should use security.bsd.see_other_uids instead. Discussed with: phk, rwatson	2004-04-01 00:10:45 +00:00
Pawel Jakub Dawidek	5e2c0c0b0e	Remove ps_argsopen check. It is was bogus in the past and was corrected not quite well by me - if kern.ps_argsopen was set to 0, users weren't permitted to see arguments of even own processes. But kern.ps_argsopen is going away, so just remove this check and leave security checks for p_cansee() function.	2004-04-01 00:08:20 +00:00
Julian Elischer	4ccbe07e84	Remove unused variable.	2004-03-31 08:20:44 +00:00
Robert Watson	8e44a7ec13	In sofree(), avoid nested declaration and initialization in declaration. Observe that initialization in declaration is frequently incompatible with locking, not just a bad idea due to style(9). Submitted by: bde	2004-03-31 03:48:35 +00:00
Robert Watson	db48c0d254	Export uipc_connect2() from uipc_usrreq.c instead of unp_connect2(), and consume that interface in portalfs and fifofs instead. In the new world order, unp_connect2() assumes that the unpcb mutex is held, whereas uipc_connect2() validates that the passed sockets are UNIX domain sockets, then grabs the mutex. NB: the portalfs and fifofs code gets down and dirty with UNIX domain sockets. Maybe this is a bad thing.	2004-03-31 01:41:30 +00:00
Alan Cox	1dc10fceaa	White space and wording changes to init_param3(). Mostly submitted by: bde	2004-03-30 08:00:11 +00:00
Robert Watson	fc3fcacf52	Prefer NULL to 0 when testing and assigning pointer values.	2004-03-30 02:16:25 +00:00
Peter Wemm	9a6a4cb50d	Shorten some XXXKSE commentry	2004-03-29 22:46:54 +00:00
Peter Wemm	39d3505a30	Kill some XXXKSE's. vnlru/syncer are single threaded.	2004-03-29 22:45:33 +00:00
Peter Wemm	b21126c6b3	Clean up the stub fake vnode locking implemenations. The main reason this stuff was here (NFS) was fixed by Alfred in November. The only remaining consumer of the stub functions was umapfs, which is horribly horribly broken. It has missed out on about the last 5 years worth of maintenence that was done on nullfs (from which umapfs is derived). It needs major work to bring it up to date with the vnode locking protocol. umapfs really needs to find a caretaker to bring it into the 21st century. Functions GC'ed: vop_noislocked, vop_nolock, vop_nounlock, vop_sharedlock.	2004-03-29 22:41:21 +00:00
Robert Watson	181e65db5b	Use a common return path for filt_soread() and filt_sowrite() to simplify the impact of locking on these functions. Submitted by: sam Sponsored by: FreeBSD Foundation	2004-03-29 18:06:15 +00:00
Robert Watson	71c90a2944	In sofree(), moving caching of 'head' from 'so->so_head' to later in the function once it has been determined to be non-NULL to simplify locking on an earlier return.	2004-03-29 17:57:43 +00:00
Robert Watson	5a35e5f9af	If debug.mpsafenet, initialize UNIX domain socket timeouts as MPSAFE; otherwise, assert Giant in the callouts.	2004-03-29 17:00:05 +00:00
Robert Watson	627e4a9973	Conditionally acquire Giant when entering the sockets layer via the socket-specific system calls based on debug.mpsafenet, rather than acquiring Giant unconditionally.	2004-03-29 02:21:56 +00:00
Robert Watson	32903c86e7	Conditionally acquire Giant when entering the socket layer via file descriptor operations based on debug.mpsafenet, rather than acquiring Giant unconditionally.	2004-03-29 01:55:32 +00:00
Robert Watson	74041f5a10	When validating that the length sum in recvit(), we fail to release Giant on an error. Add a Giant acquisition. Reviewed by: sam, bms	2004-03-29 01:37:06 +00:00
Robert Watson	a1288c786e	Conditionally assert Giant in fputsock() based on the value of debug.mpsafenet.	2004-03-29 00:33:02 +00:00
Alan Cox	e3b19536fb	Revise the direct or optimized case to use uiomove_fromphys() by the reader instead of ephemeral mappings using pmap_qenter() by the writer. The writer is still, however, responsible for wiring the pages, just not mapping them. Consequently, the allocation of KVA for the direct case is unnecessary. Remove it and the sysctls limiting it, i.e., kern.ipc.maxpipekvawired and kern.ipc.amountpipekvawired. The number of temporarily wired pages is still, however, limited by kern.ipc.maxpipekva. Note: On platforms lacking a direct virtual-to-physical mapping, uiomove_fromphys() uses sf_bufs to cache ephemeral mappings. Thus, the number of available sf_bufs can influence the performance of pipes on platforms such i386. Surprisingly, I saw the greatest gain from this change on such a machine: lmbench's pipe bandwidth result increased from ~1050MB/s to ~1850MB/s on my 2.4GHz, 400MHz FSB P4 Xeon.	2004-03-27 19:50:23 +00:00
Marcel Moolenaar	b2ae7ed72c	Change the type of the various CPU masks to cpumask_t. Note that as long as there are still explicit uses of int, whether in types or in function names (such as atomic_set_int() in sched_ule.c), we can not change cpumask_t to be anything other than u_int. See also the commit log for sys/sys/types.h, revision 1.84.	2004-03-27 18:21:24 +00:00
Mike Makonnen	a73027fee9	Regen for libthr thread synchronization syscalls.	2004-03-27 14:34:17 +00:00
Mike Makonnen	0af67a2ef9	Use the proc lock to sleep on a libthr umtx.	2004-03-27 14:32:03 +00:00
Mike Makonnen	1713a51661	Separate thread synchronization from signals in libthr. Instead use msleep() and wakeup_one(). Discussed with: jhb, peter, tjr	2004-03-27 14:30:43 +00:00
Pawel Jakub Dawidek	0b68054f9d	- Add a description for vfs.usermount sysctl. - Add the vfs_equalopts() function for mount options comparsion. Now it looks much more clear. - Style fixed. In co-operation with: bde	2004-03-27 08:39:28 +00:00
Pawel Jakub Dawidek	6c8cc8ec4b	- Loudly disallow MNT_SUIDDIR mount flag for unprivileged users mounts. - Style fixed. Submitted by: bde	2004-03-27 08:09:00 +00:00
Pawel Jakub Dawidek	2c6040bbb7	We probably shouldn't allow users to mount file systems with MNT_SUIDDIR. There should be not shell access when SUIDDIR is compiled in, but better be sure. Reviewed by: rwatson	2004-03-26 21:12:14 +00:00
Alan Cox	2b63e7f397	Use uiomove_fromphys() instead of pmap_qenter() and pmap_qremove() in proc_rwmem().	2004-03-24 23:35:04 +00:00
Warner Losh	9fc0327792	Conform to local file sytle and prefer (a && (b & flag)).	2004-03-24 16:49:37 +00:00
David E. O'Brien	0d50bcb36b	Change the !MPSAFE boot string to something that doesn't potentially scare users that the kernel won't run on MP systems.	2004-03-23 01:58:09 +00:00
Alfred Perlstein	12e9993f65	Emit a traceback when witness_trace is set and witness_warn() is called and triggers (typically caused by sleeping with a non-sleepable lock). Reviewed by: jhb	2004-03-23 00:32:27 +00:00
David E. O'Brien	f1c8692d0a	Rather than display which interrupts are MPSAFE, display those that aren't. This way we can take stock of the work to be done. boot -v will note those interrupts that are MPSAFE.	2004-03-22 22:36:11 +00:00
Paul Saab	2eada6bc8e	Remove some netbsd debug code that crept into rev 1.116	2004-03-22 10:17:40 +00:00
David E. O'Brien	b003da7938	Give a more reasonable CPU time to the threads which are using scheduler activation (i.e., applications are using libpthread). This is because SCHED_ULE sometimes puts P_SA processes into ksq_next unnecessarily. Which doesn't give fair amount of CPU time to processes which are using scheduler-activation-based threads when other (semi-)CPU-intensive, non-P_SA processes are running. Further work will no doubt be done by jeffr at a later date. Submitted by: Taku YAMAMOTO <taku@cent.saitama-u.ac.jp> Reviewed by: rwatson, freebsd-current@	2004-03-21 18:53:29 +00:00
Julian Elischer	84eef27df4	Massively up the (artificial) limit on system scope threads in a process from 50 to 500 Also up the number of process scope threads allowed to be in the kernel at one time from 150 to 1500 (per process)	2004-03-21 09:22:38 +00:00
Brian Feldman	150883179a	Add the missing Giant when doing anything with VFS -- in this case, releasing the ktrace vnode.	2004-03-18 18:15:58 +00:00
Jacques Vidrine	3dc19c4677	Verify more bits of the ELF header: the program header table entry size and the ELF version. Also, avoid a potential integer overflow when determining whether the ELF header fits entirely within the first page. Reviewed by: jdp A panic when attempting to execute an ELF binary with a bogus program header table entry size was Reported by: Christer Öberg <christer.oberg@texonet.com>	2004-03-18 16:33:05 +00:00
Alan Cox	9508f75c23	Revise socow_iodone() in light of recent sf_buf changes. Specifically, use sf_buf_free() instead of sf_buf_mext() to consolidate all actions that require the page queues lock in one critical section. While I'm here remove unnecessary splvm() and splx() calls.	2004-03-17 23:25:04 +00:00
John Baldwin	b7e23e826c	- Replace wait1() with a kern_wait() function that accepts the pid, options, status pointer and rusage pointer as arguments. It is up to the caller to copyout the status and rusage to userland if needed. This lets us axe the 'compat' argument and hide all that functionality in owait(), by the way. This also cleans up some locking in kern_wait() since it no longer has to drop locks around copyout() since all the copyout()'s are deferred. - Convert owait(), wait4(), and the various ABI compat wait() syscalls to use kern_wait() rather than wait1() or wait4(). This removes a bit more stackgap usage. Tested on: i386 Compiled on: i386, alpha, amd64	2004-03-17 20:00:00 +00:00
Pawel Jakub Dawidek	9cdb62160b	Fix information leakage. Without this fix it is possible to cheat policies like: - sysctl security.bsd.see_other_[gu]ids=0, - mac_seeotheruids(4), - jail(2) and get full processes list with their arguments. This problem exists from revision 1.62 of kern_proc.c when it was introduced. Reviewed by: nectar, rwatson.	2004-03-17 13:19:43 +00:00
Colin Percival	018e32c194	Adjust the number of processes waiting on a semaphore properly if we're woken up in the middle of sleeping. PR: misc/64347 Reviewed by: tjr MFC after: 7 days	2004-03-17 09:37:13 +00:00
Alan Cox	90ecfebd82	Refactor the existing machine-dependent sf_buf_free() into a machine- dependent function by the same name and a machine-independent function, sf_buf_mext(). Aside from the virtue of making more of the code machine- independent, this change also makes the interface more logical. Before, sf_buf_free() did more than simply undo an sf_buf_alloc(); it also unwired and if necessary freed the page. That is now the purpose of sf_buf_mext(). Thus, sf_buf_alloc() and sf_buf_free() can now be used as a general-purpose emphemeral map cache.	2004-03-16 19:04:28 +00:00
John Baldwin	27de234992	Remove a bogus assertion and readd it in a more correct location. A thread might be enqueued on a sleep queue but not be asleep when the timeout fires if it is blocked on a lock trying to check for pending signals before going to sleep. In the case of fixing up the TDF_TIMEOUT race, however, the thread must be marked asleep. Reported by: kan (the bogus one)	2004-03-16 18:56:22 +00:00
Peter Grehan	721b6196d5	Add powerpc to temporary fix. The new cpu device claims all 'generic' OpenFirmware nexus nodes, since it uses bus_generic_probe. Maybe the cpu device probe should be MD.	2004-03-16 13:34:50 +00:00
David Malone	31c7e8b05b	Nudge Giant as far as I can into kern_open(). Mark open() as MPSAFE. Use kern_open() to implement creat() rather than taking the long route through open(). Mark creat as MPSAFE. While I'm at it, mark nosys() (syscall 0) as MPSAFE, for all the difference it will make.	2004-03-16 10:46:42 +00:00
David Malone	1f325ae35e	Get ready to mark open, creat and nosys as MPSAFE.	2004-03-16 10:41:23 +00:00
Tim J. Robbins	537370d0a4	Make vfs_nmount() public. The Linux emulator needs this in order to mount linprocfs filesystems.	2004-03-16 08:59:37 +00:00
Don Lewis	a961520c13	Rename the wiredlen member of struct sysctl_req to validlen and always set it to avoid the need for a bunch of code that tests whether or not the lock member is set to REQ_WIRED in order to determine which length member should be used. Fix another bug in the oldlen return value code. Fix a potential wired memory leak if a sysctl handler uses sysctl_wire_old_buffer() and returns an EAGAIN error to trigger a retry.	2004-03-16 06:53:03 +00:00
Don Lewis	8ac3e8e940	Don't bother calling vslock() and vsunlock() if oldlen is zero. If vslock() returns ENOMEM, sysctl_wire_old_buffer() should set wiredlen to zero and return zero (success) so that the handler will operate according to sysctl(3): The size of the buffer is given by the location specified by oldlenp before the call, and that location gives the amount of data copied after a successful call and after a call that returns with the error code ENOMEM. The handler will return an ENOMEM error because the zero length buffer will overflow.	2004-03-16 01:28:45 +00:00
John Baldwin	6b55d75c44	Regen for ptrace being safe again.	2004-03-15 18:50:06 +00:00
John Baldwin	8ac61436e6	Drop the proc lock around calls to the MD functions ptrace_single_step(), ptrace_set_pc(), and cpu_ptrace() so that those functions are free to acquire Giant, sleep, etc. We already do a PHOLD/PRELE around them so that it is safe to sleep inside of these routines if necessary. This allows ptrace() to be marked MP safe again as it no longer triggers lock order reversals on Alpha. Tested by: wilko	2004-03-15 18:48:28 +00:00
Pawel Jakub Dawidek	7f4704c01d	Remove sysctl security.jail.list_allowed. This functionality was a misfeature, sysctl was added and turned off by default just to check if nobody complains. Reviewed by: rwatson	2004-03-15 12:10:34 +00:00

1 2 3 4 5 ...

7221 Commits