freebsd-skq

Author	SHA1	Message	Date
John Baldwin	b36f458861	Use the recently added msleep_spin() function to simplify the callout_drain() logic. We no longer need a separate non-spin mutex to do sleep/wakeup with, instead we can now just use the one spin mutex to manage all the callout functionality.	2006-02-23 19:13:12 +00:00
David Xu	7e0221a251	1. Refine kern_sigtimedwait() to remove redundant code. 2. Fix a bug, if thread got a SIGKILL signal, call sigexit() to kill its process. MFC after: 3 days	2006-02-23 09:24:19 +00:00
David Xu	7c9a98f15b	Code cleanup, simply compare with curproc.	2006-02-23 05:50:55 +00:00
Jeff Roberson	8febcfb92f	- Use vfs_ref/rel to protect a mountpoint from going away while VFS_STATFS is being called. Be sure to grab the ref before we unlock the vnode to prevent the mount from disappearing. Tested by: kris	2006-02-23 05:18:07 +00:00
Jeff Roberson	a1db11fc40	- Release the mount ref once the vnode has been recycled rather than once the last reference is dropped. I forgot that vnodes can stick around for a very long time until processes discover that they are dead. This means that a vnode reference is not sufficient to keep the mount referenced and even more code will be required to ref mount points. Discovered by: kris	2006-02-23 05:15:37 +00:00
David Xu	dc94f5e383	Move comments to more accurate place.	2006-02-23 03:42:17 +00:00
David Xu	c008d51784	Fix a sleep queue race for KSE thread. Reviewed by: jhb	2006-02-23 00:13:58 +00:00
John Baldwin	daad1cd74d	Fixup some comments. Mutexes's are locked, not entered for several years now and msleep blocks threads rather than processes.	2006-02-22 20:46:10 +00:00
John Baldwin	06ad42b2f7	Close some races between procfs/ptrace and exit(2): - Reorder the events in exit(2) slightly so that we trigger the S_EXIT stop event earlier. After we have signalled that, we set P_WEXIT and then wait for any processes with a hold on the vmspace via PHOLD to release it. PHOLD now KASSERT()'s that P_WEXIT is clear when it is invoked, and PRELE now does a wakeup if P_WEXIT is set and p_lock drops to zero. - Change proc_rwmem() to require that the processing read from has its vmspace held via PHOLD by the caller and get rid of all the junk to screw around with the vmspace reference count as we no longer need it. - In ptrace() and pseudofs(), treat a process with P_WEXIT set as if it doesn't exist. - Only do one PHOLD in kern_ptrace() now, and do it earlier so it covers FIX_SSTEP() (since on alpha at least this can end up calling proc_rwmem() to clear an earlier single-step simualted via a breakpoint). We only do one to avoid races. Also, by making the EINVAL error for unknown requests be part of the default: case in the switch, the various switch cases can now just break out to return which removes a _lot_ of duplicated PRELE and proc unlocks, etc. Also, it fixes at least one bug where a LWP ptrace command could return EINVAL with the proc lock still held. - Changed the locking for ptrace_single_step(), ptrace_set_pc(), and ptrace_clear_single_step() to always be called with the proc lock held (it was a mixed bag previously). Alpha and arm have to drop the lock while the mess around with breakpoints, but other archs avoid extra lock release/acquires in ptrace(). I did have to fix a couple of other consumers in kern_kse and a few other places to hold the proc lock and PHOLD. Tested by: ps (1 mostly, but some bits of 2-4 as well) MFC after: 1 week	2006-02-22 18:57:50 +00:00
John Baldwin	54690b5679	Don't do a PHOLD() in kthread_create() w/o a matching PRELE() in kthread_exit(). Rather than add the missing PRELE() I chose to just axe the PHOLD() since it was redundant with the P_SYSTEM flag. MFC after: 1 week	2006-02-22 17:21:45 +00:00
John Baldwin	8f95fc2481	Various style and comment fixes. Submitted by: bde	2006-02-22 16:58:48 +00:00
Wayne Salamon	bc5504b942	Add pathname and/or vnode argument auditing for the following system calls: quotactl, statfs, fstatfs, fchdir, chdir, chroot, open, mknod, mkfifo, link, symlink, undelete, unlink, access, eaccess, stat, lstat, pathconf, readlink, chflags, lchflags, fchflags, chmod, lchmod, fchmod, chown, lchown, fchown, utimes, lutimes, futimes, truncate, ftruncate, fsync, rename, mkdir, rmdir, getdirentries, revoke, lgetfh, getfh, extattrctl, extattr_set_file, extattr_set_link, extattr_get_file, extattr_get_link, extattr_delete_file, extattr_delete_link, extattr_list_file, extattr_list_link. In many cases the pathname and vnode auditing is done within namei lookup instead of directly in the system call. Audit the remaining arguments to these system calls: fstatfs, fchdir, open, mknod, chflags, lchflags, fchflags, chmod, lchmod, fchmod, chown, lchown, fchown, futimes, ftruncate, fsync, mkdir, getdirentries.	2006-02-22 16:04:20 +00:00
Jeff Roberson	c5dcb84008	- Revert r1.406 until a solution can be found that doesn't break nfs. The statfs handler in nfs will lock vnodes which may lead to deadlock or recursion. Found by: kris Pointy hat to: me	2006-02-22 09:52:25 +00:00
Jeff Roberson	a4aeaefe5a	- We can not hold a vnode lock while we do a lookup. Search for and load modules prior to looking up the directory which we will cover to avoid this problem in mount. - We must hold the coveredvp locked before we can busy the mountpoint to prevent a lock order reversal with the vfs_busy() in lookup which holds the directory lock prior to doing a vfs_busy(). The directory lock is required to safely clear the v_mountedhere field on the directory. MFC After: 1 week	2006-02-22 06:29:55 +00:00
Jeff Roberson	8a7cd2fdfb	- Grab a mnt ref in vfs_busy() before dropping the interlock. This will prevent the mount point from going away while we're waiting on the lock. The ref does not need to persist once we have the lock because the lock prevents the mount point from being unmounted. MFC After: 1 week	2006-02-22 06:20:12 +00:00
Jeff Roberson	05b6a20a66	- Hold the vnode used in the statfs related functions until we're done with the VFS_STATFS call to prevent the mount from disappearing while we're stating. - Convert these routines to use MPSAFE namei semantics. MFC After: 1 week	2006-02-22 06:19:08 +00:00
David Xu	ba0360b135	Abstract function mqfs_create_node() to create a mqueue node.	2006-02-22 02:38:25 +00:00
David Xu	ad8de0f243	If block size is zero, use normal file operations to do I/O, this eliminates a divided-by-zero fault. Recommended by: phk	2006-02-22 00:05:12 +00:00
John Baldwin	bd106be404	Move the ruadd() in kern_exit() to save our final stats in our child stats even further down in exit1() so that it includes the runtime and tick counts from the final time slice for the dying thread. Reviewed by: phk	2006-02-21 21:48:42 +00:00
John Baldwin	6fc6433ecd	Split calcru() back into a calcru1() function shared with calccru() and a calcru() wrapper that passes a local rusage_ext on the stack that is a snapshot to do the calculations on. Now we can pass p->p_crux to calcru1() in calccru() again which fixes the issues with runtime going backwards messages when dead processes are harvested by init. Reviewed by: phk Tested by: Stefan Ehmann shoesoft at gmx dot net	2006-02-21 21:47:46 +00:00
Andre Oppermann	80444f8803	The sysctls kern.ipc.[max_linkhdr\|max_protohdr\|max_hdr\|max_datalen] can't be changed from userland. Make them read-only and provide descriptions. kern.ipc.max_datalen must never be less than one byte. Enforce this with a panic in net_init_domain(). Sponsored by: TCP/IP Optimization Fundraise 2005 MFC after: 3 days	2006-02-18 17:16:18 +00:00
Andre Oppermann	ec63cb90a3	Replace the 4k fixed sized jumbo mbuf clusters with PAGE_SIZE sized jumbo mbuf clusters. To make the variable size clear they are named MJUMPAGESIZE. Having jumbo clusters with the native PAGE_SIZE is more useful than a fixed 4k size according the device driver writers using this API. The 9k and 16k jumbo mbuf clusters remain unchanged. Requested by: glebius, gallatin Sponsored by: TCP/IP Optimization Fundraise 2005 MFC after: 3 days	2006-02-17 14:14:15 +00:00
Andre Oppermann	a4684d742d	Make sysctl_msec_to_ticks(SYSCTL_HANDLER_ARGS) generally available instead of being private to tcp_timer.c. Sponsored by: TCP/IP Optimization Fundraise 2005 MFC after: 3 days	2006-02-16 15:40:36 +00:00
David Xu	94f0972bec	Fix a long standing race between sleep queue and thread suspension code. When a thread A is going to sleep, it calls sleepq_catch_signals() to detect any pending signals or thread suspension request, if nothing happens, it returns without holding process lock or scheduler lock, this opens a race window which allows thread B to come in and do process suspension work, however since A is still at running state, thread B can do nothing to A, thread A continues, and puts itself into actually sleeping state, but B has never seen it, and it sits there forever until B is woken up by other threads sometimes later(this can be very long delay or never happen). Fix this bug by forcing sleepq_catch_signals to return with scheduler lock held. Fix sleepq_abort() by passing it an interrupted code, previously, it worked as wakeup_one(), and the interruption can not be identified correctly by sleep queue code when the sleeping thread is resumed. Let thread_suspend_check() returns EINTR or ERESTART, so sleep queue no longer has to use SIGSTOP as a hack to build a return value. Reviewed by: jhb MFC after: 1 week	2006-02-15 23:52:01 +00:00
Wayne Salamon	085a0d43ca	Audit the arguments to the ptrace(2) system call. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-02-14 01:18:31 +00:00
Wayne Salamon	bfd7575a39	Audit the arguments to the kill(2) and killpg(2) system calls. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-02-14 01:17:03 +00:00
David Xu	d8267df729	In order to speed up process suspension on MP machine, send IPI to remote CPU. While here, abstract thread suspension code into a function called sig_suspend_threads, the function is called when a process received a STOP signal.	2006-02-13 03:16:55 +00:00
Robert Watson	13f322c2fc	Improve consistency of return() style. MFC after: 3 days	2006-02-12 15:00:27 +00:00
Poul-Henning Kamp	e8444a7e6f	CPU time accounting speedup (step 2) Keep accounting time (in per-cpu) cputicks and the statistics counts in the thread and summarize into struct proc when at context switch. Don't reach across CPUs in calcru(). Add code to calibrate the top speed of cpu_tickrate() for variable cpu_tick hardware (like TSC on power managed machines). Don't enforce monotonicity (at least for now) in calcru. While the calibrated cpu_tickrate ramps up it may not be true. Use 27MHz counter on i386/Geode. Use TSC on amd64 & i386 if present. Use tick counter on sparc64	2006-02-11 09:33:07 +00:00
David Xu	42925630b6	Test before modifying p_sflag to avoid unconditionally cache line ping-pong on SMP.	2006-02-10 14:59:16 +00:00
David Xu	71b7afb2b4	Call thread_stopped in thr_exit to notify parent that the child process is now fully stopped, this was already in kse_exit().	2006-02-10 03:34:29 +00:00
Poul-Henning Kamp	eb2da9a51f	Simplify system time accounting for profiling. Rename struct thread's td_sticks to td_pticks, we will need the other name for more appropriately named use shortly. Reduce it from uint64_t to u_int. Clear td_pticks whenever we enter the kernel instead of recording its value as reference for userret(). Use the absolute value of td->pticks in userret() and eliminate third argument.	2006-02-08 08:09:17 +00:00
Poul-Henning Kamp	5b1a8eb397	Modify the way we account for CPU time spent (step 1) Keep track of time spent by the cpu in various contexts in units of "cputicks" and scale to real-world microsec^H^H^H^H^H^H^H^Hclock_t only when somebody wants to inspect the numbers. For now "cputicks" are still derived from the current timecounter and therefore things should by definition remain sensible also on SMP machines. (The main reason for this first milestone commit is to verify that hypothesis.) On slower machines, the avoided multiplications to normalize timestams at every context switch, comes out as a 5-7% better score on the unixbench/context1 microbenchmark. On more modern hardware no change in performance is seen.	2006-02-07 21:22:02 +00:00
John Baldwin	222fdf4bff	Provide some anti-footshooting. Don't allow the user to set the interval for acctwatch() runs to be negative or zero as this could result in either a possible hang (or panic if INVARIANTS is on). Previously the accounting code handled the <= 0 case by calling acctwatch on every clock tick (eww!) due to an implementation detail of callout_reset(). (Tick counts of <= 0 are converted to 1). MFC after: 3 days	2006-02-07 18:59:47 +00:00
John Baldwin	505a14934e	- Add a kthread to periodically call acctwatch() when accounting is active instead of calling acctwatch() from softclock. The acctwatch() function needs to hold an sx lock and also makes a VFS call, and neither of these are good things (or safe) to do from a callout. The kthread only exists and is running when accounting is turned on; it is started and stopped as needed. I didn't run acctwatch() via the thread taskqueue at Robert's request as he was worried that if the accounting file was over NFS the VFS_STAT() calls might stall other work on the taskqueue. - Add an acct_disable() function to take care of closing the accounting vnode and cleaning up so we don't duplicate the same code in two different places. MFC after: 3 days	2006-02-07 16:04:03 +00:00
John Baldwin	8917b8d28c	- Always call exec_free_args() in kern_execve() instead of doing it in all the callers if the exec either succeeds or fails early. - Move the code to call exit1() if the exec fails after the vmspace is gone to the bottom of kern_execve() to cut down on some code duplication.	2006-02-06 22:06:54 +00:00
John Baldwin	809f984b21	Add a kern_eaccess() function and use it to implement xenix_eaccess() rather than kern_access(). Suggested by: rwatson	2006-02-06 22:00:53 +00:00
John Baldwin	934ba9b2cf	- Move the wakeup() for exiting kthreads out of exit1() and into kthread_exit() as that is cleaner and less obscured. It also does the wakeup sooner. - Add some comments to kthread_exit().	2006-02-06 21:56:13 +00:00
John Baldwin	2c9d9d392a	We don't need the proc lock to check P_KTHREAD on curthread since it is only set before the kthread starts executing and is never cleared.	2006-02-06 21:54:47 +00:00
Olivier Houchard	2a3b10658d	rwlock expects the struct thread to be aligned on 8 bytes, so make sure thread0 is.	2006-02-06 16:03:10 +00:00
Jeff Roberson	04f6d3effa	- Add a ref count to the mount structure. Sleep for up to 3 seconds in vfs_mount_destroy waiting for this ref to hit 0. We don't print an error if we are rebooting as the root mount always retains some refernces by init proc. - Acquire a mnt ref for every vnode allocated to a mount point. Drop this ref only once vdestroy() has been called and the mount has been freed. - No longer NULL the v_mount pointer in delmntque() so that we may release the ref after vgone() has been called. This allows us to guarantee that the mount point structure will be valid until the last vnode has lost its last ref. - Fix a few places that rely on checking v_mount to detect recycling. Sponsored by: Isilon Systems, Inc. MFC After: 1 week	2006-02-06 10:19:50 +00:00
Jeff Roberson	2f0bca553a	- Don't check v_mount for NULL to determine if a vnode has been recycled. Use the more appropriate VI_DOOMED flag instead. Sponsored by: Isilon Systems, Inc. MFC After: 1 week	2006-02-06 10:15:27 +00:00
Jeff Roberson	36a52c3cae	- Add the global 'rebooting' variable that is used to detect when boot() has been called. Sponsored by: Isilon Systems, Inc. MFC After: 1 week	2006-02-06 10:12:00 +00:00
David Xu	ea8e65b0fa	Add members pl_sigmask and pl_siglist into ptrace_lwpinfo to get lwp's signal mask and pending signals.	2006-02-06 09:41:56 +00:00
Robert Watson	9653775b18	Regenerate.	2006-02-06 02:00:32 +00:00
Robert Watson	c983324ef5	Prefer AUE_FOO audit identifiers to AUE_O_FOO, which are largely left over from the Darwin implementation. When we implement a system call as a wrapper to sysctl(), audit it as AUE_SYSCTL. This leads to greater compatibility with Solaris audit trails as sysctl() argument tokens are not the same as the ones for the originaly system calls (i.e., setdomainname()). Replace references to AUE_ events that are equivilent to AUE_NULL with AUE_NULL. In the case of process signal configuration, this is because these events do not require auditing. Move from the Darwin spelling of getsockopt() to the FreeBSD/Solaris one. Audit nmount(). Obtained from: TrustedBSD Project	2006-02-06 02:00:06 +00:00
Robert Watson	89964dd284	When exiting a thread, submit any pending record. Today, we don't audit thread exit, but should that happen, this will prevent unhappiness, as the thread exit system call will never return, and hence not commit the record. Pointed out by/with: cognet Obtained from: TrustedBSD Project	2006-02-06 01:51:08 +00:00
Wayne Salamon	2f8a46d5ff	Audit the arguments (user/group IDs) for the system calls that set these IDs. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-02-06 00:32:33 +00:00
Wayne Salamon	ad20c8f325	Audit the args to rfork(), and the child PID for all fork system calls. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-02-06 00:28:50 +00:00
Wayne Salamon	de3007e8f3	Audit the pid being requested in wait4(). Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-02-06 00:19:09 +00:00

1 2 3 4 5 ...

9107 Commits