freebsd-skq

Author	SHA1	Message	Date
jhb	cfc179a934	- Split out kern_accept(), kern_getpeername(), and kern_getsockname() for use by ABI emulators. - Alter the interface of kern_recvit() somewhat. Specifically, go ahead and hard code UIO_USERSPACE in the uio as that's what all the callers specify. In place, add a new uioseg to indicate what type of pointer is in mp->msg_name. Previously it was always a userland address, but ABI emulators may pass in kernel-side sockaddrs. Also, remove the namelenp field and instead require the two places that used it to explicitly copy mp->msg_namelen out to userland. - Use the patched kern_recvit() to replace svr4_recvit() and the stock kern_sendit() to replace svr4_sendit(). - Use kern_bind() instead of stackgap use in ti_bind(). - Use kern_getpeername() and kern_getsockname() instead of stackgap in svr4_stream_ti_ioctl(). - Use kern_connect() instead of stackgap in svr4_do_putmsg(). - Use kern_getpeername() and kern_accept() instead of stackgap in svr4_do_getmsg(). - Retire the stackgap from SVR4 compat as it is no longer used.	2006-07-10 21:38:17 +00:00
jhb	d05d04f2c7	Explicitly use STAILQ_REMOVE_HEAD() when we know we are removing the head element to avoid confusing Coverity. It's now also easier for humans to parse as well. Found by: Coverity Prevent(tm) CID: 1201	2006-07-10 19:28:57 +00:00
jhb	401a0eaea0	Fix two more instances of using a linker_file_t object in TAILQ() macros after free'ing it. Found by: Coverity Prevent(tm) CID: 1435	2006-07-10 19:13:45 +00:00
jhb	3f18ce7ca9	Don't try to reuse the linker_file structure after we've freed it when throwing out the kld's loaded by the loader that didn't successfully link. Found by: Coverity Prevent(tm) CID: 1435	2006-07-10 19:06:01 +00:00
scottl	483a18cd99	Use a sleep mutex instead of an sx lock for the kernel environment. This allows greater flexibility for drivers that want to query the environment. Reviewed by: jhb, mux	2006-07-09 21:42:58 +00:00
jhb	094306d69d	- Split ioctl() up into ioctl() and kern_ioctl(). The kern_ioctl() assumes that the 'data' pointer is already setup to point to a valid KVM buffer or contains the copied-in data from userland as appropriate (ioctl(2) still does this). kern_ioctl() takes care of looking up a file pointer, implementing FIONCLEX and FIOCLEX, and calling fi_ioctl(). - Use kern_ioctl() to implement xenix_rdchk() instead of using the stackgap and mark xenix_rdchk() MPSAFE.	2006-07-08 20:12:14 +00:00
jhb	e09e5b52db	Add a kern_close() so that the ABIs can close a file descriptor w/o having to populate a close_args struct and change some of the places that do.	2006-07-08 20:03:39 +00:00
jhb	5e8693a976	Rework kern_semctl a bit to always assume the UIO_SYSSPACE case. This mostly consists of pushing a few copyin's and copyout's up into __semctl() as all the other callers were already doing the UIO_SYSSPACE case. This also changes kern_semctl() to set the return value in a passed in pointer to a register_t rather than td->td_retval[0] directly so that callers can only set td->td_retval[0] if all the various copyout's succeed. As a result of these changes, kern_semctl() no longer does copyin/copyout (except for GETALL/SETALL) so simplify the locking to acquire the semakptr mutex before the MAC check and hold it all the way until the end of the big switch statement. The GETALL/SETALL cases have to temporarily drop it while they do copyin/malloc and copyout. Also, simplify the SETALL case to remove handling for a non-existent race condition.	2006-07-08 19:51:38 +00:00
imp	f7c9fd2027	Create bus_enumerate_hinted_children. This routine will allow drivers to use the hinted child system. Bus drivers that use this need to implmenet the bus_hinted_child method, where they actually add the child to their bus, as they see fit. The bus is repsonsible for getting the attribtues for the child, adding it in the right order, etc. ISA hinting will be updated to use this method. MFC After: 3 days	2006-07-08 17:06:15 +00:00
rwatson	e8e07ae08e	Move POSIX.1e-specific utility routines from kern_acl.c to subr_acl_posix1e.c, leaving kern_acl.c containing only ACL system calls and utility routines common across ACL types. Add subr_acl_posix1e.c to the build. Obtained from: TrustedBSD Project	2006-07-06 23:37:39 +00:00
jhb	5e008b18de	- Explicitly acquire Giant around SYSINIT's and SYSUNINIT's since they are not all known to be MPSAFE yet. - Actually remove Giant from the kernel linker by taking it out of the KLD_LOCK() and KLD_UNLOCK() macros. Pointy hat to: jhb (2)	2006-07-06 21:39:39 +00:00
jhb	54c687571c	Add kern_setgroups() and kern_getgroups() and use them to implement ibcs2_[gs]etgroups() rather than using the stackgap. This also makes ibcs2_[gs]etgroups() MPSAFE. Also, it cleans up one bit of weirdness in the old setgroups() where it allocated an entire credential just so it had a place to copy the group list into. Now setgroups just allocates a NGROUPS_MAX array on the stack that it copies into and then passes to kern_setgroups().	2006-07-06 21:32:20 +00:00
wsalamon	3f67fc0a68	Audit the remaining parameters to the extattr system calls. Generate the audit records for those calls. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-07-06 19:33:38 +00:00
rwatson	fba25d1a75	Remove now unneeded opt_mac.h and mac.h includes.	2006-07-06 13:25:51 +00:00
wsalamon	fe0b47d3f1	Regen the system calls files, picking up the extended attr events, and some mount-related changes done previously. Approved by: rwatson (mentor)	2006-07-05 19:24:14 +00:00
kib	95ef2e0daa	Back out my rev. 1.674. The better fix (rev. 1.637) is already in tree. Approved by: kan (mentor)	2006-07-05 16:33:25 +00:00
wsalamon	93c3d5f3e6	Add audit events for the extended attribute system calls. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-07-05 15:46:02 +00:00
maxim	a4a29e2efc	o Fix grammar in the comment, indent macros. No functional changes.	2006-07-02 20:53:52 +00:00
maxim	f9bac2df2c	o Remove rev. 1.57 leftover, not reached code.	2006-07-02 20:49:46 +00:00
maxim	d96c84ab9e	o Fix typo in the comment. PR: kern/99632 Submitted by: clsung	2006-06-30 08:10:55 +00:00
obrien	cbba39186d	Fix building with GCC 4.2: define data types before referring to them.	2006-06-29 19:37:31 +00:00
jhb	7a3e252286	Fix semctl(2) breakage from the previous commit. Previously __semctl() had a local 'semid' variable which was the array index and used uap->semid as the original IPC id. During the kern_semctl() conversion those two variables were collapsed into a single 'semid' variable breaking the places that needed the original IPC ID. To fix, add a new 'semidx' variable to hold the array index and leave 'semid' unmolested as the IPC id. While I'm here, explicitly document that the (undocumented, at least in semctl(2)) SEM_STAT command curiously expects an array index in the 'semid' parameter rather than an IPC id. Submitted by: maxim	2006-06-29 13:58:36 +00:00
davidxu	ce8741b1e7	Fix a bug when accumulating run time, if a thread calls yield() syscall, its run time may be lost.	2006-06-29 12:29:20 +00:00
davidxu	859313b1e9	Fix system load count (noticed by dephij). Remove incorrect comment.	2006-06-29 09:49:00 +00:00
davidxu	9720ffd6e5	Remove unused function declaration. Add else statement in sched_calc_pri. Fix a bug when checking interrupt thread in sched_add.	2006-06-29 05:59:36 +00:00
davidxu	9320568021	Remove load balancer code, since it has serious priority inversion problem which really hurts performance on FreeBSD.	2006-06-29 05:36:34 +00:00
jhb	dff69a853e	- Add a kern_semctl() helper function for __semctl(). It accepts a pointer to a copied-in copy of the 'union semun' and a uioseg to indicate which memory space the 'buf' pointer of the union points to. This is then used in linux_semctl() and svr4_sys_semctl() to eliminate use of the stackgap. - Mark linux_ipc() and svr4_sys_semsys() MPSAFE.	2006-06-27 18:28:50 +00:00
jhb	5ceeece21b	- Expand the scope of Giant some in mount(2) to protect the vfsp structure from going away. mount(2) is now MPSAFE. - Expand the scope of Giant some in unmount(2) to protect the mp structure (or rather, to handle concurrent unmount races) from going away. umount(2) is now MPSAFE, as well as linux_umount() and linux_oldumount(). - nmount(2) and linux_mount() were already MPSAFE.	2006-06-27 14:46:31 +00:00
pjd	97382ba992	Compress direct cr_ruid comparsion and jailed() call to suser_cred(9). Reviewed by: rwatson	2006-06-27 11:32:08 +00:00
pjd	9de1945c3f	Use suser_cred(9) instead of checking cr_uid directly. Reviewed by: rwatson	2006-06-27 11:29:38 +00:00
pjd	03a43a81a3	- Use suser_cred(9) instead of checking cr_ruid directly. - For privileged processes safe two mutex operations. We may want to consider if this is good idea to use SUSER_ALLOWJAIL here, but for now I didn't wanted to change the original behaviour. Reviewed by: rwatson	2006-06-27 11:28:50 +00:00
babkin	f0555f2de9	Backed out the change by request from rwatson. PR: kern/14584	2006-06-26 22:03:22 +00:00
jhb	2afb03cfd2	Address a problem I missed in removing Giant from the kernel linker. Not all of the module event handlers are MP safe yet, so always acquire Giant for now when invoking module event handlers. Eventually we can add an MPSAFE flag or some such and add appropriate locking to all module event handlers.	2006-06-26 18:34:45 +00:00
jhb	d7b0de5b87	Remove duplicate security checks already performed in kern_kldload().	2006-06-26 18:33:32 +00:00
rwatson	e4433ceff4	Trim basically unused 'unp' in uipc_connect().	2006-06-26 16:18:22 +00:00
babkin	3d8be823b0	The common UID/GID space implementation. It has been discussed on -arch in 1999, and there are changes to the sysctl names compared to PR, according to that discussion. The description is in sys/conf/NOTES. Lines in the GENERIC files are added in commented-out form. I'll attach the test script I've used to PR. PR: kern/14584 Submitted by: babkin	2006-06-25 18:37:44 +00:00
iedowse	559e4b8ec0	If linker_release_module() fails then we still hold a reference on the linker_file, so record this by restoring the linker_file pointer in fp->file.	2006-06-25 12:36:21 +00:00
pjd	4a62627c84	Simplify the code and remove two mutex operations. MFC after: 2 weeks	2006-06-24 22:55:43 +00:00
jhb	777260528c	Replace the kld_mtx mutex with a kld_sx sx lock and expand it's scope to protect all linker-related data structures including the contents of linker file objects and the any linker class data as well. Considering how rarely the linker is used I just went with the simple solution of single-threading the whole thing rather than expending a lot of effor on something more fine-grained and complex. Giant is still explicitly acquired while registering and deregistering sysctl's as well as in the elf linker class while calling kmupetext(). The rest of the linker runs without Giant unless it has to acquire Giant while loading files from a non-MPSAFE filesystem.	2006-06-21 20:42:08 +00:00
jhb	acb5f7e773	- Push down Giant in kldfind() and kldsym(). - Remove several goto's by either using direct return's or else clauses.	2006-06-21 20:15:36 +00:00
jhb	210ed4b32d	Whoops, revert accidental commit.	2006-06-21 17:48:59 +00:00
jhb	32a6c3d3d5	Fix two comments and a style fix.	2006-06-21 17:48:03 +00:00
jhb	2c4ca6adca	Various whitespace fixes.	2006-06-21 17:47:45 +00:00
jhb	3f7e224385	Conditionally acquire Giant around VFS operations.	2006-06-20 21:31:38 +00:00
jhb	232160a8fb	- Push Giant down into linker_reference_module(). - Add a new function linker_release_module() as a more intuitive complement to linker_reference_module() that wraps linker_file_unload(). linker_release_module() can either take the module name and version info passed to linker_reference_module() or it can accept the linker file object returned by linker_reference_module().	2006-06-20 20:54:13 +00:00
jhb	4a92b906b8	Make linker_find_file_by_name() and linker_find_file_by_id() static to simplify linker locking. The only external consumers now use linker_file_foreach().	2006-06-20 20:41:15 +00:00
jhb	c9fac30209	- Add a new linker_file_foreach() function that walks the list of linker file objects calling a user-specified predicate function on each object. The iteration terminates either when the entire list has been iterated over or the predicate function returns a non-zero value. linker_file_foreach() returns the value returned by the last invocation of the predicate function. It also accepts a void * context pointer that is passed to the predicate function as well. Using an iterator function avoids exposing linker internals to the rest of the kernel making locking simpler. - Use linker_file_foreach() instead of walking the list of linker files manually to lookup ndis files in ndis(4). - Use linker_file_foreach() to implement linker_hwpmc_list_objects().	2006-06-20 20:37:17 +00:00
jhb	eef5a2aec4	Make linker_file_add_dependency() and linker_load_module() static since only the linker uses them.	2006-06-20 20:18:42 +00:00
jhb	41188bcabe	Don't check if malloc(M_WAITOK) returns NULL.	2006-06-20 20:11:00 +00:00
jhb	70000c1c81	Use 'else' to remove another goto.	2006-06-20 19:49:28 +00:00
jhb	c6a767184d	- Remove some useless variable initializations. - Make some conditional free()'s where the condition was always true unconditional.	2006-06-20 19:32:10 +00:00
gnn	549bd60e43	Properly cast the values of valsize (the size of the value passed in) in setsockopt so that they can be compared correctly against negative values. Passing in a negative value had a rather negative effect on our socket code, making it impossible to open new sockets. PR: 98858 Submitted by: James.Juran@baesystems.com MFC after: 1 week	2006-06-20 12:36:40 +00:00
rwatson	350dce4037	When retrieving SO_ERROR via getsockopt(), hold the socket lock around the retrieval and replacement with 0. MFC after: 1 week	2006-06-18 19:02:49 +00:00
yar	0bf54ce0e0	Add a funny sysctl: debug.kdb.trap_code . It is similar to debug.kdb.trap, except for it tries to cause a page fault via a call to an invalid pointer. This can highlight differences between a fault on data access vs. a fault on code call some CPUs might have. This appeared as a test for a work \ Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-18 12:27:59 +00:00
rwatson	a1677cd654	Remove sbinsertoob(), sbinsertoob_locked(). They violate (and have basically always violated) invariannts of soreceive(), which assume that the first mbuf pointer in a receive socket buffer can't change while the SB_LOCK sleepable lock is held on the socket buffer, which is precisely what these functions do. No current protocols invoke these functions, and removing them will help discourage them from ever being used. I should have removed them years ago, but lost track of it. MFC after: 1 week Prodded almost by accident by: peter	2006-06-17 22:48:34 +00:00
emaste	f932a159ee	Add a description for sysctl -d.	2006-06-17 02:58:18 +00:00
rwatson	83e5d6bce1	Remove unused (and ifdef'd) unp_abort() and unp_drain(). MFC after: 1 month	2006-06-16 22:11:49 +00:00
dwmalone	2e1fa1a727	Add a kern.timecounter.tc sysctl tree that contains the mask, frequency, quality and current value of each available time counter. At the moment all of these are read-only, but it might make sense to make some of these read-write in the future. MFC after: 3 months	2006-06-16 20:29:05 +00:00
yar	d4c85447de	Kill an XXX remark that has been untrue since rev. 1.150 of this file.	2006-06-16 07:36:18 +00:00
csjp	bff7d924ba	Axe Giant from vn_fullpath(9). The vnode -> pathname lookup should be filesystem agnostic. We are not touching any file system specific functions in this code path. Since we have a cache lock, there is really no need to keep Giant around here. This eliminates Giant acquisitions for any syscall which is auditing pathnames. Discussed with: jeff	2006-06-16 05:09:28 +00:00
maxim	cd982b5207	o Expand an exclusive lock scope to prevent a race between two simultaneous module_register(). Original work done by: Alex Lyashkov Reviewed by: jhb MFC after: 2 weeks	2006-06-15 08:53:09 +00:00
davidxu	1689a257a3	Use scheduler API sched_relinquish() to implement yield() syscall.	2006-06-15 06:41:57 +00:00
davidxu	a4976ce481	Add scheduler API sched_relinquish(), the API is used to implement yield() and sched_yield() syscalls. Every scheduler has its own way to relinquish cpu, the ULE and CORE schedulers have two internal run- queues, a timesharing thread which calls yield() syscall should be moved to inactive queue.	2006-06-15 06:37:39 +00:00
davidxu	3cbbc973d7	Clear ke_runq before calling maybe_preempt, this avoids a KASSERT(ke->ke_runq == NULL) panic when the sched_add is recursively called by maybe_preempt. Reported by: Wojciech A. Koszek < dunstan at freebsd dot czest dot pl >	2006-06-14 03:46:03 +00:00
delphij	9cbae6be49	Unexpand an instance of TAILQ_EMPTY()	2006-06-14 03:14:26 +00:00
marcel	c77d37e5b9	Unbreak 64-bit architectures. The 3rd argument to kern_kldload() is a pointer to an integer and td->td_retval[0] is of type register_t. On 64-bit architectures register_t is wider than an integer.	2006-06-14 03:01:06 +00:00
davidxu	ee5438638d	Fox a typo in sched_is_timeshare.	2006-06-13 23:45:59 +00:00
davidxu	8471318e5c	Pass boolean value to __predict_false. Try to keep KSE slot count correct for migrating thread, the count is a bit mess.	2006-06-13 23:01:50 +00:00
jhb	9af5dd68e4	Use kern_kldload() and kern_kldunload() to load and unload modules when we intend for the user to be able to unload them later via kldunload(2) instead of calling linker_load_module() and then directly adjusting the ref count on the linker file structure. This makes the resulting consumer code simpler and cleaner and better hides the linker internals making it possible to sanely lock the linker.	2006-06-13 21:36:23 +00:00
jhb	00130e4f0c	A couple of minor style tweaks.	2006-06-13 21:34:12 +00:00
jhb	620ecf8adf	- Add a kern_kldload() that is most of the previous kldload() and push Giant down in it. - Push Giant down in kern_kldunload() and reorganize it slightly to avoid using gotos. Also, expose this function to the rest of the kernel.	2006-06-13 21:28:18 +00:00
jhb	1484c04623	- Push down Giant some in kldstat(). - Use a 'struct kld_file_stat' on the stack to read data under the lock and then do one copyout() w/o holding the lock at the end to push the data out to userland.	2006-06-13 21:11:12 +00:00
jhb	e88fcff5a4	Unexpand TAILQ_FOREACH() and TAILQ_FOREACH_SAFE().	2006-06-13 20:49:07 +00:00
jhb	77030411c8	Remove some more pointless goto's and don't check to see if malloc(M_WAITOK) returns NULL.	2006-06-13 20:27:23 +00:00
jhb	baafd575e6	Handle the simple case of just dropping a reference near the start of linker_file_unload() instead of in the middle of a bunch of code for the case of dropping the last reference to improve readability and sanity. While I'm here, remove pointless goto's that were just jumping to a return statement.	2006-06-13 19:45:08 +00:00
maxim	0cf435f016	o There are two methods to get a process credentials over the unix sockets: 1) A sender sends SCM_CREDS message to a reciever, struct cmsgcred; 2) A reciever sets LOCAL_CREDS socket option and gets sender credentials in control message, struct sockcred. Both methods use the same control message type SCM_CREDS with the same control message level SOL_SOCKET, so they are indistinguishable for the receiver. A difference in struct cmsgcred and struct sockcred layouts may lead to unwanted effects. Now for sockets with LOCAL_CREDS option remove all previous linked SCM_CREDS control messages and then add a control message with struct sockcred so the process specifically asked for the peer credentials by LOCAL_CREDS option always gets struct sockcred. PR: kern/90800 Submitted by: Andrey Simonenko Regres. tests: tools/regression/sockets/unix_cmsg/ MFC after: 1 month	2006-06-13 14:33:35 +00:00
davidxu	82b666ed4a	Add scheduler CORE, the work I have done half a year ago, recent, I picked it up again. The scheduler is forked from ULE, but the algorithm to detect an interactive process is almost completely different with ULE, it comes from Linux paper "Understanding the Linux 2.6.8.1 CPU Scheduler", although I still use same word "score" as a priority boost in ULE scheduler. Briefly, the scheduler has following characteristic: 1. Timesharing process's nice value is seriously respected, timeslice and interaction detecting algorithm are based on nice value. 2. per-cpu scheduling queue and load balancing. 3. O(1) scheduling. 4. Some cpu affinity code in wakeup path. 5. Support POSIX SCHED_FIFO and SCHED_RR. Unlike scheduler 4BSD and ULE which using fuzzy RQ_PPQ, the scheduler uses 256 priority queues. Unlike ULE which using pull and push, the scheduelr uses pull method, the main reason is to let relative idle cpu do the work, but current the whole scheduler is protected by the big sched_lock, so the benefit is not visible, it really can be worse than nothing because all other cpu are locked out when we are doing balancing work, which the 4BSD scheduelr does not have this problem. The scheduler does not support hyperthreading very well, in fact, the scheduler does not make the difference between physical CPU and logical CPU, this should be improved in feature. The scheduler has priority inversion problem on MP machine, it is not good for realtime scheduling, it can cause realtime process starving. As a result, it seems the MySQL super-smack runs better on my Pentium-D machine when using libthr, despite on UP or SMP kernel.	2006-06-13 13:12:56 +00:00
jhb	efa082aa74	Use fget() in kqueue_register() instead of doing all the work by hand.	2006-06-12 21:46:23 +00:00
imp	a157e1bbd0	Add a convenience function rman_init_from_resource for initializing a rman from a resource. Also, include _bus.h since the implementation of bus_space isn't needed here, just the definitions of the types.	2006-06-12 04:06:21 +00:00
iedowse	d9a00b27e5	Keep firmware images on the list until they have been unregistered with firmware_unregister(). Previously when the last driver reference had been dropped we would clear the list entry under the assumption that the firmware module was about to be unloaded, but this was not true if the firmware image had been loaded manually with kldload. This makes it possible to manually kldload firmware images as a workaround for drivers such as ipw that attempt to load firmware while resuming after a suspend. Reviewed by: mlaier (an earlier version of the patch)	2006-06-10 17:04:07 +00:00
rwatson	120490c1a5	Move some functions and definitions from uipc_socket2.c to uipc_socket.c: - Move sonewconn(), which creates new sockets for incoming connections on listen sockets, so that all socket allocate code is together in uipc_socket.c. - Move 'maxsockets' and associated sysctls to uipc_socket.c with the socket allocation code. - Move kern.ipc sysctl node to uipc_socket.c, add a SYSCTL_DECL() for it to sysctl.h and remove lots of scattered implementations in various IPC modules. - Sort sodealloc() after soalloc() in uipc_socket.c for dependency order reasons. Statisticize soalloc() and sodealloc() as they are now required only in uipc_socket.c, and are internal to the socket implementation. After this change, socket allocation and deallocation is entirely centralized in one file, and uipc_socket2.c consists entirely of socket buffer manipulation and default protocol switch functions. MFC after: 1 month	2006-06-10 14:34:07 +00:00
rwatson	ae84950041	Rearrange code in soalloc() so that it's less indented by returning early if uma_zalloc() from the socket zone fails. No functional change. MFC after: 1 week	2006-06-08 22:33:18 +00:00
kib	241c4b444c	Fix the LOR that occurs when the MAC compiled into the kernel and vnode is destroyed. Reviewed by: rwatson LOR: 189 MFC after: 2 weeks Approved by: kan (mentor)	2006-06-08 07:55:10 +00:00
davidxu	75c47e2f15	Make ke_rqindex unsigned.	2006-06-06 12:26:17 +00:00
rwatson	590882238f	Audit some arguments to nmount(), mount(), umount(). Submitted by: wsalamon Obtained from: TrustedBSD Project	2006-06-05 15:32:07 +00:00
rwatson	20a9b30021	Audit command, uid arguments for quotactl(). Audit the mode argument to mkfifo(). Audit the target path passed to symlink(). Submitted by: wsalamon Obtained from: TrustedBSD Project	2006-06-05 13:34:23 +00:00
rwatson	5250e2999a	Audit path passed to the acct() system call. Obtained from: TrustedBSD Project	2006-06-05 13:02:34 +00:00
jhb	bdcca9d25a	Bah, fix fat finger in last. Invert the ~ on MTX_FLAGMASK as it's non-intuitive for the ~ to be built into the mask. All the users now explicitly ~ the mask. In addition, add MTX_UNOWNED to the mask even though it technically isn't a flag. This should unbreak mtx_owner(). Quickly spotted by: kris	2006-06-03 21:11:33 +00:00
jhb	dc064c4d48	In the case of reentering the debugger due to an attempt to perform a context switch while in the debugger, reenter the debugger sooner before performing any statistics updates.	2006-06-03 20:49:44 +00:00
jhb	9c15984c1b	Simplify mtx_owner() so it only reads m->mtx_lock once.	2006-06-03 20:45:00 +00:00
jhb	4f9dae34b7	Style fix to be more like _mtx_lock_sleep(): use 'while (!foo) { ... }' instead of 'for (;;) { if (foo) break; ... }'.	2006-06-03 20:44:01 +00:00
pjd	deb3721592	Fix a problem introduced in revision 1.220. On mount(2) failure, don't forget to unbusy file system before its destruction. This fixes the following warning on mount failure: Mount point <X> had 1 dangling refs Tested by: wkoszek	2006-06-02 20:29:02 +00:00
ambrisko	f3fbf567ee	Make lio ident more consistant with aio ident.	2006-06-02 17:45:48 +00:00
pjd	f2806f2f0a	Don't forget to unlock kq lock in low memory situations. OK'ed by: jmg	2006-06-02 13:23:39 +00:00
pjd	7563ec071e	Remove confusing done_noglobal label. The KQ_GLOBAL_UNLOCK() macro know how to handle both situations - when kq_global lock is and is not held. OK'ed by: jmg	2006-06-02 13:21:21 +00:00
pjd	e8fbceff26	Use SLIST_FOREACH_SAFE() macro, because knote_drop() can free an element which can be then used to find next element in the list. OK'ed by: jmg	2006-06-02 13:18:59 +00:00
cognet	9a61d7bd72	sched_rem() already sets ke->ke_state to KES_THREAD, so there's no need to redo it.	2006-06-01 22:45:56 +00:00
dds	840cb44e85	Remove two locking assertion entries that: a) were incorrectly written and therefore never compiled into assertions, and b) were incorrectly specified and when compiled resulted in a failed assertion.	2006-05-31 14:06:06 +00:00
dds	27cc870a3f	Assertion code specifications are introduced using special character sequences that are distinct from comments. %% is used for argument locks; %! for pre- and post-conditions.	2006-05-30 20:49:54 +00:00
dds	1694caa758	Remove incorrect lock validation specifications that caused failed assertions with DEBUG_VFS_LOCKS. We should reinstate them with correct specifications, possibly after extendng vnode_if.awk Noted by: truckman@	2006-05-30 20:21:51 +00:00
tegge	0d5f191162	Close race between vmspace_exitfree() and exit1() and races between vmspace_exitfree() and vmspace_free() which could result in the same vmspace being freed twice. Factor out part of exit1() into new function vmspace_exit(). Attach to vmspace0 to allow old vmspace to be freed earlier. Add new function, vmspace_acquire_ref(), for obtaining a vmspace reference for a vmspace belonging to another process. Avoid changing vmspace refcount from 0 to 1 since that could also lead to the same vmspace being freed twice. Change vmtotal() and swapout_procs() to use vmspace_acquire_ref(). Reviewed by: alc	2006-05-29 21:28:56 +00:00
delphij	5835e6264a	Unexpand TAILQ_FIRST(foo) == NULL to TAILQ_EMPTY(foo).	2006-05-29 05:43:26 +00:00
kris	06831bc7bc	Correct typos MFC after: 2 weeks	2006-05-28 22:15:28 +00:00
rwatson	81673aab33	In execve(), audit the path name being executed. In the future, it would also be good to audit the interpreter pathname, if any. Obtained from: TrustedBSD Project	2006-05-28 08:28:47 +00:00
dds	ac5d85085e	Add missing % signs in the lock annotations of the functions: lookup, rename, strategy, islocked The missing % sign meant that the lines were processed as plain comments and the corresponding assertions were never generated.	2006-05-28 07:24:12 +00:00
delphij	9434b0bc2e	extlen and cpp is not used here in linker_search_kld(), so nuke them. Reported by: Mingyan Guo <guomingyan at gmail dot com> MFC After: 2 weeks	2006-05-27 09:21:41 +00:00
phk	014f0df978	If the console has no cncheckc method, use cngetc instead.	2006-05-26 11:00:20 +00:00
phk	9427c39ef5	Don't use CONS_DRIVER() macro to insert dummy element in cons_set	2006-05-26 10:46:38 +00:00
phk	393a50156b	GC the cn_dbctl_t hook for consoles, it is unused. This used to make syscons switch to vty0 when we entered DDB but this was lost in the KDB shuffle. We may want to bring it back down the road but it should be done by calling cn_init_t/cn_term_t instead, possibly with a flag argument saying "Debugger!"	2006-05-26 10:24:00 +00:00
rodrigc	ad98ff7ce6	Add "update" mount option to global_opts array, for use with vfs_filteropt().	2006-05-26 02:38:48 +00:00
rodrigc	055e2abe68	Remove calls to vfs_export() for exporting a filesystem for NFS mounting from individual filesystems. Call it instead in vfs_mount.c, after we call VFS_MOUNT() for a specific filesystem.	2006-05-26 00:32:21 +00:00
rwatson	032282fd7e	Use getsock() and fput() instead of fgetsock() and fputsock() in sendfile(). This causes sendfile() to use the file descriptor reference to the socket instead of bumping the socket reference count, which avoids an additional refcount operation, as well as a potential expensive socket refcount drop, which can lead to contention on the accept mutex. This change also has the side effect of further reducing the number of cases where an in-progress I/O operation can occur on a socket after close, as using the file descriptor refcount prevents the socket from closing while in use. MFC after: 3 months	2006-05-25 15:10:13 +00:00
ups	4eb5a7d9ee	Do not set B_NOCACHE on buffers when releasing them in flushbuflist(). If B_NOCACHE is set the pages of vm backed buffers will be invalidated. However clean buffers can be backed by dirty VM pages so invalidating them can lead to data loss. Add support for flush dirty page in the data invalidation function of some network file systems. This fixes data losses during vnode recycling (and other code paths using invalbuf(,V_SAVE,,*)) for data written using an mmaped file. Collaborative effort by: jhb@,mohans@,peter@,ps@,ups@ Reviewed by: tegge@ MFC after: 7 days	2006-05-25 01:00:35 +00:00
sam	ebcbe046b9	When starting up threads in taskqueue_start_threads create them stopped before adjusting their priority and setting them on the run q so they cannot race for resources (pointed out by njl). While here add a console printf on thread create fails; otherwise noone may notice (e.g. return value is always 0 and caller has no way to verify). Reviewed by: jhb, scottl MFC after: 2 weeks	2006-05-24 22:11:07 +00:00
davidxu	77a76e9949	Don't allow non-root user to set a scheduler policy, otherwise this could be a local DOS. Submitted by: Diane Bruce at db at db.net	2006-05-21 00:40:38 +00:00
davidxu	1ed65d1070	Style fixes. Submitted by: Diane Bruce < db at db dot net >	2006-05-19 06:37:24 +00:00
davidxu	50efb8e314	Move flag TDF_UMTXQ into structure umtxq, this eliminates the requirement of scheduler lock in some umtx code.	2006-05-18 08:43:46 +00:00
phk	2ace540d1f	Make the printfs relating to purging threads from a device less intrusive.	2006-05-17 06:37:14 +00:00
phk	ef310efff8	Since DELAY() was moved, most <machine/clock.h> #includes have been unnecessary.	2006-05-16 14:37:58 +00:00
ps	83af4cc433	Allow concurrent read(2)/readv(2) access to a file. Lock file offset against multiple read calls. Submitted by: ups Obtained from: Yahoo! MFC after: 2 weeks	2006-05-16 07:50:54 +00:00
kbyanc	defe42e909	Restore the ability to mount procfs and fdescfs filesystems via the mount(2) system call: * Add cmount hook to fdescfs and pseudofs (and, by extension, procfs and linprocfs). This (mostly) restores the ability to mount these filesystems using the old mount(2) system call (see below for the rest of the fix). * Remove not-NULL check for the data argument from the mount(2) entry point. Per the mount(2) man page, it is up to the individual filesystem being mounted to verify data. Or, in the case of procfs, etc. the filesystem is free to ignore the data parameter if it does not use it. Enforcing data to be not-NULL in the mount(2) system call entry point prevented passing NULL to filesystems which ignored the data pointer value. Apparently, passing NULL was common practice in such cases, as even our own mount_std(8) used to do it in the pre-nmount(2) world. All userland programs in the tree were converted to nmount(2) long ago, but I've found at least one external program which broke due to this (presumably unintentional) mount(2) API change. One could argue that external programs should also be converted to nmount(2), but then there isn't much point in keeping the mount(2) interface for backward compatibility if it isn't backward compatible.	2006-05-15 19:42:10 +00:00
benno	64e3a72f51	The VERBOSE_SYSINIT stuff sees the DDB define a lot better if we include opt_ddb.h. Spotted by: benno Pointy hat to: benno	2006-05-14 07:11:28 +00:00
rodrigc	c3b6d66647	For nmount(), if "rw" is specified as a mount option, add "noro" to the list of mount options. This allows a read-only mount to be converted to read-write via: mount -u -o rw Requested by: kris	2006-05-14 01:51:38 +00:00
jhb	0f921e0992	Remove various bits of conditional Alpha code and fixup a few comments.	2006-05-12 05:04:46 +00:00
benno	ed056cb6c7	Add a new kernel config option, VERBOSE_SYSINIT. When porting FreeBSD to a new platform, one of the more useful things to do is get mi_startup() to let you know which SYSINIT it's up to. Most people tend to whack a printf in the SYSINIT loop to print the address of the function it's about to call. Going one better, jhb made a version that uses DDB to look up the name of the function and print that instead. This version is essentially his with the addition of some ifdeffery to make it optional and to allow it to work (although using only the function address, not the symbol) if you forgot to enable DDB. All the cool bits by: jhb Approved by: scottl, rink, cognet, imp	2006-05-12 02:01:38 +00:00
phk	d12dd358d4	Remove more straggling CPU_ macro references	2006-05-11 17:53:26 +00:00
davidxu	122715f4f0	Use wakeup_one to avoid thundering herd. Tested by: kris	2006-05-09 13:00:46 +00:00
davidxu	a9ab4c311a	Use a dedicated mutex to protect aio queues, the movation is to reduce lock contention with other parts.	2006-05-09 00:10:11 +00:00
tegge	aa5d948d47	Call vn_finished_write() before calling the coredump handler which will indirectly call vn_start_write() as necessary for each write.	2006-05-07 22:50:22 +00:00
tegge	ce79e019da	Temporarily unlock vnode for new image being executed to avoid lock order reversals that can lead to deadlocks. Normally vn_close(), namei() or vrele() should not be called while holding vnode locks.	2006-05-05 20:25:05 +00:00
pjd	abf5b08807	vn_start_write()/vn_finished_write() is not needed here, because vn_start_write() is always called earlier in the code path and calling the function recursively may lead to a deadlock. Confirmed by: tegge MFC after: 2 weeks	2006-04-29 21:57:38 +00:00
kris	78ed66ae08	Lock giant when assigning ni_vp and keep vfslocked state valid. Committed for: jeff	2006-04-29 07:13:49 +00:00
pjd	3b9d7e4828	vn_start_write() is called only when v_type != VCHR, so corresponding vn_finished_write() should also be called only then. BTW. I fixed two functions here: vn_rdwr() and vn_write(). The latter seems to be unused. MFC after: 3 weeks	2006-04-28 21:54:05 +00:00
rwatson	18166a0696	Also check use_pty in the ptmx clone lookup; this means that when ptmx support is turned off using the sysctl, we no longer even allow the ptmx device to be looked up. Foot provided by: peter	2006-04-28 21:39:57 +00:00
marcel	b2cf5a1a9a	Remove the puc-specific hacks. The puc(4) driver now properly uses the rman(9) interface.	2006-04-28 21:23:09 +00:00
jeff	eee673a6a7	- Add a BO_NEEDSGIANT flag to the bufobj. This flag forces all child buffers to go on the buf daemon's DIRTYGIANT queue. - Set BO_NEEDSGIANT on ffs's devvp since the ffs_copyonwrite handler runs in the context of the buf daemon and may require Giant.	2006-04-28 01:05:31 +00:00
jeff	3450f7fc51	- Consistently track ni_dvp and ni_vp with dvfslocked and vfslocked rather than trying to optimize it into a single lock. This adds more calls to lock giant with non smpsafe filesystems but is the only way to reliably hold the correct lock. - Remove an invalid assert in the mountedhere case in lookup and fix the code to properly deal with the scenario. We can actually have a lookup that returns dp == dvp with mountedhere set with certain unmount races. Tested by: kris Reported by: kris/mohans	2006-04-28 00:59:48 +00:00
jmg	e8b8868e9b	back out for now... revert ccpu to being kern.ccpu...	2006-04-27 17:57:59 +00:00
jmg	af1ed256ae	move remaining sysctl into the kern.sched tree...	2006-04-26 19:42:38 +00:00
jhb	0b071af547	Add some new commands to hopefully make it easier to diagnose lock-related problems in ddb: - "show threadchain [thread]" will start with the specified thread (or the current kdb thread by default) and show it's state. If it is blocked on a lock, it will find the owner of the lock and show its state, etc. - "show allchains" will find all of the threads that are blocked on a lock (but do not have any threads blocked on a lock they hold) and show the resulting thread chain. - "show lockchain <lock>" takes a pointer to a lock_object (such as a mutex or rwlock). If there is a turnstile for that lock, then it will display all the threads blocked on the lock. In addition, for each thread blocked on the lock, it will display any contested locks they hold, and recurse on those locks to show any threads blocked on those locks, etc.	2006-04-25 20:28:17 +00:00
jhb	9a8083cd6c	Use db_lookup_thread() to lookup the thread for the passed in address and change 'show locks' to only list the locks for a given thread rather than for all the threads in the process containing a specified thread.	2006-04-25 20:24:23 +00:00
marius	49cbf99f47	Remove last vestiges of sab(4).	2006-04-25 19:43:53 +00:00
rwatson	dd8ff1c1c5	Extend getsock() to return the struct file flags read while holding the file lock, in the style of fgetsock(). Modify accept1() to use getsock() instead of fgetsock(), relying on the file descriptor reference rather than an acquired socket reference to prevent the listen socket from being destroyed during accept(). This avoids additional reference count operations, which should improve performance, and also avoids accept1() operating on a socket whose file descriptor has been torn down, which may have resulted in protocol shutdown starting. MFC after: 3 months	2006-04-25 11:48:16 +00:00
maxim	b583a2a914	Inherit LOCAL_CREDS option from listen socket for sockets returned by accept(2). PR: kern/90644 Submitted by: Andrey Simonenko OK'ed by: mdodd Tested by: NetBSD regress/sys/kern/unfdpass/unfdpass.c MFC after: 1 month	2006-04-24 19:09:33 +00:00
marcel	2e48f3c964	MFp4: Add the ipend() method to the serdev I/F to allow umbrella drivers to obtain pending interrupt status from subordinate drivers.	2006-04-23 22:12:39 +00:00
rwatson	54902641b6	Assert that sockets passed into soabort() not be SQ_COMP or SQ_INCOMP, since that removal should have been done a layer up. MFC after: 3 months	2006-04-23 18:15:54 +00:00
rwatson	426844655f	Add missing 'not' to SQ_COMP comment. MFC after: 3 months	2006-04-23 15:37:23 +00:00
rwatson	7685218d64	Move handling of SQ_COMP exception case in sofree() to the top of the function along with the remainder of the reference checking code. Move comment from body to header with remainder of comments. Inclusion of a socket in a completed connection queue counts as a true reference, and should not be handled as an under-documented edge case. MFC after: 3 months	2006-04-23 15:33:38 +00:00
jhb	8c0b6ba0a3	Print td_name instead of p_comm if td_name is non-empty for 'show turnstile' and 'show sleepq'.	2006-04-21 20:40:43 +00:00
ps	76acdb6332	Don't try to kill embryonic processes in killpg1(). This prevents a race condition between fork() and kill(pid,sig) with pid < 0 that can cause a kernel panic. Submitted by: up MFC after: 3 weeks	2006-04-21 19:26:21 +00:00
ps	10b2fe8dea	Allow for nmbclusters and maxsockets to be increased via sysctl. An eventhandler is used to update all the various zones that depend on these values.	2006-04-21 09:25:40 +00:00
jmg	c5ebe3714a	const'ify resource_spec to note that we won't be changing anything while releasing resources... also, NULL out the resources as we free them...	2006-04-20 01:44:16 +00:00
imp	8299c3aeb9	r_spare1 and r_spare2 aren't needed. They aren't used. They can't be accessed from outside of subr_rman.c. Remove them. Reviewed by: jmg (in theory)	2006-04-19 21:25:55 +00:00
jhb	99cfbe57d4	Implement rw_try_upgrade() and rw_downgrade(). rw_try_upgrade() makes a single attempt at upgrading a read lock to a write lock, and rw_downgrade() converts curthread's write lock into a read lock.	2006-04-19 21:06:52 +00:00
wkoszek	56c56816b9	'owner' is not used without SMP. Fix kernel build for such kernel configurations. Approved by: jhb	2006-04-18 20:32:42 +00:00
jhb	6a2cfc7f2c	Adaptively spin before blocking on the turnstile if an rwlock is write locked. In general the adaptive spinning is similar to the same code for mutexes with some extra trickiness in rw_wunlock_hard(). Specifically, even though both wait bits might be set and we might have a turnstile with at least one waiting thread, there might not be any threads blocked on the queue we are not waking up (they might all be spinning), and we should only preserve the waiting flag for the queue we aren't waking up if there are in fact threads blocked on that queue. Secondly, there might not be any threads blocked on the queue we have chosen to waken threads from (there might only be threads blocked on the other queue and the threads for this queue are all spinning) in which case we disown the turnstile instead of doing a braodcast and unpend.	2006-04-18 18:27:54 +00:00
jhb	084bf8cc1a	- Bring back turnstile_empty() which can check to see if an individual queue on a turnstile is empty. - Add a turnstile_disown() function that allows a thread to give up ownership of a turnstile w/o waking up any waiters.	2006-04-18 18:16:54 +00:00
delphij	0c90800cc6	In vfs_hash_get(): mount point should never be changed so explicitly constify the mp parameter. Reviewed by: phk	2006-04-18 08:05:08 +00:00
jhb	244b211eab	- Add a rw_wowner() macro that just returns the owner of a write lock and use it in places that only care about the write owner instead of rw_owner() as a baby step towards limited read-lock owner. - Tidy the code that sets the WAITER flag bits to not duplicate a test around the atomic operation and the KTR trace in both of the lock functions.	2006-04-17 21:11:01 +00:00
jhb	963387dcfe	Add a 'show sleepqueue' alias for 'show sleepq' in DDB.	2006-04-17 20:16:32 +00:00
jhb	e37a168f00	Trim trailing whitespace.	2006-04-17 20:14:51 +00:00
jhb	495b6abf1f	Add a new module_file() function that returns the linker_file_t associated with a given module_t. I use this in some the MOD_LOAD event handler for some test kernel modules to ask the kernel linker to look up the linker sets in my test modules. (I use linker sets to generate the list of possible events that I then signal to execute via a sysctl. On non-amd64, ld(8) would resolve the entire linker set, but on amd64 I have to ask the kernel linker to do it for me, and having the kernel linker do it works on all archs.)	2006-04-17 19:44:44 +00:00
jhb	d535a5cb81	Change msleep() and tsleep() to not alter the calling thread's priority if the specified priority is zero. This avoids a race where the calling thread could read a snapshot of it's current priority, then a different thread could change the first thread's priority, then the original thread would call sched_prio() inside msleep() undoing the change made by the second thread. I used a priority of zero as no thread that calls msleep() or tsleep() should be specifying a priority of zero anyway. The various places that passed 'curthread->td_priority' or some variant as the priority now pass 0.	2006-04-17 18:20:38 +00:00
jmg	4a3ab050f7	remove duplicate sizeof vnode entry (debug.sizeof.vnode already existed)... move ncsize into debug.sizeof and rename to namecache...	2006-04-16 18:38:30 +00:00
scottl	1c640d92e9	Take a better stab at making this compile.	2006-04-15 18:54:56 +00:00
scottl	333b9af7b9	Take a stab at making this compile.	2006-04-15 18:04:04 +00:00
jhb	b15516d824	Mark the thread pointer used during an adaptive spin volatile so that the compiler doesn't decide to cache td_state. Cachine the state would cause the spinning thread to not notice when the owning thread stopped executing (if it was preempted for example) which could result in livelock.	2006-04-14 19:51:50 +00:00
jhb	db475d0075	Drop the kqueue global mutex as soon as we are finished with it rather than keeping it locked until we exit the function to optimize the case where the lock would be dropped and later reacquired. The optimization was broken when kevent's were moved from UFS to VFS and the knote list lock for a vnode kevent became the lockmgr vnode lock. If one tried to use a kqueue that contained events for a kqueue fd followed by a vnode, then the kq global lock would end up being held when the vnode lock was acquired which could result in sleeping with a mutex held (and subsequent panics) if the vnode lock was contested. Reviewed by: jmg Tested by: ps (on 6.x) MFC after: 3 days	2006-04-14 14:27:28 +00:00
davidxu	9e131242ef	Clear TDF_SINTR in sleepq_resume_thread, also sleepq_catch_signal does not need to clear it now, this should fix panic when msleep is recursivly called. Patch is slightly adjusted after review. Reviewed by: jhb Tested by: Csaba Henk, csaba-ml at creo.hu MFC after: 3 days	2006-04-13 23:29:25 +00:00
jhb	b0e5efc22c	Turn on ithread_destroy() and call it from intr_event_destroy() to tear down an interrupt event's associated thread (if it has one).	2006-04-13 17:29:04 +00:00
csjp	8ddf669db3	Kill the last Giant acquisition in the exit(2) code. This Giant acquisition doesn't appear to be protecting anything. Most of consumers funsetownlst(9) do not appear to be picking up Giant anywhere. This was originally a part of my Giant exit(2) clean up revision 1.272 but I thought it was a good idea to leave it out until we were able to analyze it better. Tested by: kris MFC after: 3 weeks	2006-04-10 14:07:28 +00:00
pjd	ca3f23ca34	On shutdown try to turn off all swap devices. This way GEOM providers are properly closed on shutdown. Requested by: ru Reviewed by: alc MFC after: 2 weeks	2006-04-10 10:03:41 +00:00
davidxu	0b8fbed4ea	Use proc lock to prevent a thread from exiting, Giant was no longer used to protect thread list.	2006-04-10 04:55:59 +00:00
rwatson	a7901a2d9c	Remove UNIX domain socket raw socket support. This feature is documented as being undocumented in Stevens, and was broken in 1997 during network stack infrastructure work. It is the one remaining (and incorrect) direct protocol reference to raw_usrreq.pru_attach; this is incorrect because the raw socket code assumes that raw_uattach is called only after the protocol has allocated a PCB. MFC after: 3 months	2006-04-09 16:29:47 +00:00
marcel	aa6df7801f	Add the scc_hwmtx spin mutex, defined by scc(4).	2006-04-07 22:15:54 +00:00
jmg	d8abb8a512	spell unlock correctly, this is relatively minor as it's rare someone would provide a lock method, and want the default unlock, but it is a bug... PR: 95356 Submitted by: Stephen Corteselli MFC after: 3 days	2006-04-07 17:21:27 +00:00
jeff	275c043cbe	- VFS_LOCK_GIANT when recycling a vnode via getnewvnode. We may be recycling for an unrelated filesystem. I really don't like potentially acquiring giant in the context of a giantless filesystem but there are reasonable objections to removing the recycling from this path. Sponsored by: Isilon Systems, Inc.	2006-04-04 06:46:10 +00:00
jeff	6862688995	- Properly check against B_DELWRI and B_NEEDSGIANT. This check was incorrectly written and caused some !NEEDSGIANT buffers to be put in the NEEDSGIANT queue. Sponsored by: Isilon Systems, Inc.	2006-04-04 06:44:21 +00:00
marcel	dde58fbc0d	Increment kdb_active after we stopped the other CPUs and decrement kdb_active before we restart them. This avoids false positives on restarted CPUs when they test for kdb_active while kdb_trap() is still finishing up.	2006-04-04 00:40:20 +00:00
marcel	8278e2d5fb	Eliminate HAVE_STOPPEDPCBS. On ia64 the PCPU holds a pointer to the PCB in which the context of stopped CPUs is stored. To access this PCB from KDB, we introduce a new define, called KDB_STOPPEDPCB. The definition, when present, lives in <machine/kdb.h> and abstracts where MD code saves the context. Define KDB_STOPPEDPCB on i386, amd64, alpha and sparc64 in accordance to previous code.	2006-04-03 22:51:47 +00:00
peter	0f363b7d24	Remove the unused sva and eva arguments from pmap_remove_pages().	2006-04-03 21:16:10 +00:00
marcel	a1c5f48a6d	In kdb_trap(), change the type of the local variable 'intr' from int to register_t, as intr_disable() returns the latter and register_t may be wider than int. Pointed out by: marius@	2006-04-03 20:55:52 +00:00
marcel	64ac08d05f	Replace critical_enter() and critical_exit() in kdb_trap() with intr_disable() and intr_restore() resp. Previously, critical regions would have interrupts disabled, but that was changed. Consequently, the debugger could run with interrupts enabled. This could cause problems for the low-level console code where received characters would trigger an interrupt that causes the interrupt handler to read the character instead of the cngetc() function.	2006-04-03 17:48:09 +00:00
jmg	45648c7949	mask out any action when copying the flags from the event to the knote.. Pointed out by: Václav Haisman Submitted by: Dan Nelson (slightly modifed patch) MFC after: 3 days	2006-04-01 20:15:39 +00:00
rwatson	5479e5d692	Chance protocol switch method pru_detach() so that it returns void rather than an error. Detaches do not "fail", they other occur or the protocol flags SS_PROTOREF to take ownership of the socket. soclose() no longer looks at so_pcb to see if it's NULL, relying entirely on the protocol to decide whether it's time to free the socket or not using SS_PROTOREF. so_pcb is now entirely owned and managed by the protocol code. Likewise, no longer test so_pcb in other socket functions, such as soreceive(), which have no business digging into protocol internals. Protocol detach routines no longer try to free the socket on detach, this is performed in the socket code if the protocol permits it. In rts_detach(), no longer test for rp != NULL in detach, and likewise in other protocols that don't permit a NULL so_pcb, reduce the incidence of testing for it during detach. netinet and netinet6 are not fully updated to this change, which will be in an upcoming commit. In their current state they may leak memory or panic. MFC after: 3 months	2006-04-01 15:42:02 +00:00
rwatson	8622e776f9	Change protocol switch pru_abort() API so that it returns void rather than an int, as an error here is not meaningful. Modify soabort() to unconditionally free the socket on the return of pru_abort(), and modify most protocols to no longer conditionally free the socket, since the caller will do this. This commit likely leaves parts of netinet and netinet6 in a situation where they may panic or leak memory, as they have not are not fully updated by this commit. This will be corrected shortly in followup commits to these components. MFC after: 3 months	2006-04-01 15:15:05 +00:00
rwatson	cbb87d3f67	Add comment to accept1() that it should use getsock() instead of fgetsock() to avoid additional mutex operations, and also to avoid use of soref/sorele which are now not preferred. MFC after: 3 months	2006-04-01 11:14:56 +00:00
rwatson	53d8847cf3	Mark fgetsock() and fputsock() as depcrecated: callers should rely on the file descriptor reference, rather than paying additional lock operations to acquire a socket reference from the file descriptor. This will also help to ensure that file descriptor based socket requests are not delivered to a socket after close. Most consumers have already been converted to this model. MFC after: 3 months	2006-04-01 11:09:54 +00:00
rwatson	b7a0ed6a86	Assert so->so_pcb is NULL in sodealloc() -- the protocol state should not be present at this point. We will eventually remove this assert because the socket layer should never look at so_pcb, but for now it's a useful debugging tool. MFC after: 3 months	2006-04-01 10:45:52 +00:00
rwatson	67815b8633	Add a somewhat sizable comment documenting the semantics of various kernel socket calls relating to the creation and destruction of sockets. This will eventually form the foundation of socket(9), but is currently in too much flux to do so. MFC after: 3 months	2006-04-01 10:43:02 +00:00
jeff	db0836bdc3	- Add an assert to vgone. It is illegal to call vgone without a reference to the vnode. Without a reference the vnode will never be vdestroy'd and the memory will never be reclaimed. Sponsored by: Isilon Systems, Inc.	2006-03-31 23:39:26 +00:00
jeff	e943e141d5	- When there are dangling vnodes at unmount print them before we panic. Sponsored by: Isilon Systems, Inc.	2006-03-31 23:38:15 +00:00
jeff	32b1878006	- Release the references acquired by VOP_GETWRITEMOUNT and vfs_getvfs(). Discussed with: tegge Tested by: kris Sponsored by: Isilon Systems, Inc.	2006-03-31 03:54:20 +00:00
jeff	b9e82e7fef	- Hold a reference from the time vfs_busy starts until vfs_unbusy is called. - vfs_getvfs has to return a reference to prevent the returned mountpoint from changing identities. - Release references acquired via vfs_getvfs. Discussed with: tegge Tested by: kris Sponsored by: Isilon Systems, Inc.	2006-03-31 03:53:25 +00:00
jeff	d018a9a820	- GETWRITEMOUNT now returns a referenced mountpoint to prevent its identity from changing. This is possible now that mounts are not freed. Discussed with: tegge Tested by: kris Sponsored by: Isilon Systems, Inc.	2006-03-31 03:52:24 +00:00
jeff	1a43943ef6	- Allocate mounts from a uma zone that uses UMA_ZONE_NOFREE to prevent mount memory from being reclaimed. This resolves a number of race conditions described in vfs_default.c and introduced with the VFS_LOCK_GIANT macros. - Let the mtx and lock remain valid after the mount structure has been freed by using init and fini calls. Technically fini will never be called but is included for completeness. - Consistently use lockmgr directly rather than lockmgr to lock and vfs_unbusy to unlock. Discussed with: tegge Tested by: kris Sponsored by: Isilon Systems, Inc.	2006-03-31 03:49:51 +00:00
jeff	73f46586c6	- LK_RETRY means nothing when passed to VOP_LOCK. Call vn_lock instead. - Move the vn_lock of the dvp until after we've unbusied the filesystem to avoid a LOR with the mount point lock. - In the v_mountedhere while loop we acquire a new instance of giant each time through without releasing the first. This would cause us to leak Giant. Sponsored by: Isilon Systems, Inc.	2006-03-31 02:59:23 +00:00
jeff	2086f279cf	- Add the B_NEEDSGIANT flag which is only set if the vnode that owns a buf requires Giant. It is set in bgetvp and cleared in brelvp. - Create QUEUE_DIRTY_GIANT for dirty buffers that require giant. - In the buf daemon, only grab giant when processing QUEUE_DIRTY_GIANT and only if we think there are buffers in that queue. Sponsored by: Isilon Systems, Inc.	2006-03-31 02:56:30 +00:00
sam	a2a6bebf93	fixup error handling in taskqueue_start_threads: check for kthread_create failing, print a message when we fail for some reason as most callers do not check the return value (e.g. 'cuz they're called from SYSINIT) Reviewed by: scottl MFC after: 1 week	2006-03-30 23:06:59 +00:00
pjd	e23555c1ae	Fix a panic on sparc64 related to inproper aligment - we cannot assume, that 'unsigned char *' argument is 4 byte aligned. MFC after: 3 days	2006-03-30 18:45:50 +00:00

... 2 3 4 5 6 ...

9530 Commits