freebsd-nq

Author	SHA1	Message	Date
David Xu	adc9c950af	Create thread in separated ksegrp, so they always get correct user level priority.	2006-07-10 23:14:07 +00:00
John Baldwin	c870740e09	- Split out kern_accept(), kern_getpeername(), and kern_getsockname() for use by ABI emulators. - Alter the interface of kern_recvit() somewhat. Specifically, go ahead and hard code UIO_USERSPACE in the uio as that's what all the callers specify. In place, add a new uioseg to indicate what type of pointer is in mp->msg_name. Previously it was always a userland address, but ABI emulators may pass in kernel-side sockaddrs. Also, remove the namelenp field and instead require the two places that used it to explicitly copy mp->msg_namelen out to userland. - Use the patched kern_recvit() to replace svr4_recvit() and the stock kern_sendit() to replace svr4_sendit(). - Use kern_bind() instead of stackgap use in ti_bind(). - Use kern_getpeername() and kern_getsockname() instead of stackgap in svr4_stream_ti_ioctl(). - Use kern_connect() instead of stackgap in svr4_do_putmsg(). - Use kern_getpeername() and kern_accept() instead of stackgap in svr4_do_getmsg(). - Retire the stackgap from SVR4 compat as it is no longer used.	2006-07-10 21:38:17 +00:00
John Baldwin	0f8e0c3dd4	Explicitly use STAILQ_REMOVE_HEAD() when we know we are removing the head element to avoid confusing Coverity. It's now also easier for humans to parse as well. Found by: Coverity Prevent(tm) CID: 1201	2006-07-10 19:28:57 +00:00
John Baldwin	0bf8969c60	Fix two more instances of using a linker_file_t object in TAILQ() macros after free'ing it. Found by: Coverity Prevent(tm) CID: 1435	2006-07-10 19:13:45 +00:00
John Baldwin	6b5b470aea	Don't try to reuse the linker_file structure after we've freed it when throwing out the kld's loaded by the loader that didn't successfully link. Found by: Coverity Prevent(tm) CID: 1435	2006-07-10 19:06:01 +00:00
Scott Long	e3546a7549	Use a sleep mutex instead of an sx lock for the kernel environment. This allows greater flexibility for drivers that want to query the environment. Reviewed by: jhb, mux	2006-07-09 21:42:58 +00:00
John Baldwin	d9f4623307	- Split ioctl() up into ioctl() and kern_ioctl(). The kern_ioctl() assumes that the 'data' pointer is already setup to point to a valid KVM buffer or contains the copied-in data from userland as appropriate (ioctl(2) still does this). kern_ioctl() takes care of looking up a file pointer, implementing FIONCLEX and FIOCLEX, and calling fi_ioctl(). - Use kern_ioctl() to implement xenix_rdchk() instead of using the stackgap and mark xenix_rdchk() MPSAFE.	2006-07-08 20:12:14 +00:00
John Baldwin	c1cccebe8b	Add a kern_close() so that the ABIs can close a file descriptor w/o having to populate a close_args struct and change some of the places that do.	2006-07-08 20:03:39 +00:00
John Baldwin	b1ee5b654d	Rework kern_semctl a bit to always assume the UIO_SYSSPACE case. This mostly consists of pushing a few copyin's and copyout's up into __semctl() as all the other callers were already doing the UIO_SYSSPACE case. This also changes kern_semctl() to set the return value in a passed in pointer to a register_t rather than td->td_retval[0] directly so that callers can only set td->td_retval[0] if all the various copyout's succeed. As a result of these changes, kern_semctl() no longer does copyin/copyout (except for GETALL/SETALL) so simplify the locking to acquire the semakptr mutex before the MAC check and hold it all the way until the end of the big switch statement. The GETALL/SETALL cases have to temporarily drop it while they do copyin/malloc and copyout. Also, simplify the SETALL case to remove handling for a non-existent race condition.	2006-07-08 19:51:38 +00:00
Warner Losh	db2bc1bb82	Create bus_enumerate_hinted_children. This routine will allow drivers to use the hinted child system. Bus drivers that use this need to implement the bus_hinted_child method, where they actually add the child to their bus, as they see fit. The bus is responsible for getting the attributes for the child, adding it in the right order, etc. ISA hinting will be updated to use this method. MFC After: 3 days	2006-07-08 17:06:15 +00:00
Robert Watson	e4256d1e8d	Move POSIX.1e-specific utility routines from kern_acl.c to subr_acl_posix1e.c, leaving kern_acl.c containing only ACL system calls and utility routines common across ACL types. Add subr_acl_posix1e.c to the build. Obtained from: TrustedBSD Project	2006-07-06 23:37:39 +00:00
John Baldwin	398c993b2a	- Explicitly acquire Giant around SYSINIT's and SYSUNINIT's since they are not all known to be MPSAFE yet. - Actually remove Giant from the kernel linker by taking it out of the KLD_LOCK() and KLD_UNLOCK() macros. Pointy hat to: jhb (2)	2006-07-06 21:39:39 +00:00
John Baldwin	3cb83e714d	Add kern_setgroups() and kern_getgroups() and use them to implement ibcs2_[gs]etgroups() rather than using the stackgap. This also makes ibcs2_[gs]etgroups() MPSAFE. Also, it cleans up one bit of weirdness in the old setgroups() where it allocated an entire credential just so it had a place to copy the group list into. Now setgroups just allocates a NGROUPS_MAX array on the stack that it copies into and then passes to kern_setgroups().	2006-07-06 21:32:20 +00:00
Wayne Salamon	65ee602e0c	Audit the remaining parameters to the extattr system calls. Generate the audit records for those calls. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-07-06 19:33:38 +00:00
Robert Watson	6435cdafa3	Remove now unneeded opt_mac.h and mac.h includes.	2006-07-06 13:25:51 +00:00
Wayne Salamon	761aed363f	Regen the system calls files, picking up the extended attr events, and some mount-related changes done previously. Approved by: rwatson (mentor)	2006-07-05 19:24:14 +00:00
Konstantin Belousov	c8d3bc1fa3	Back out my rev. 1.674. The better fix (rev. 1.637) is already in tree. Approved by: kan (mentor)	2006-07-05 16:33:25 +00:00
Wayne Salamon	bbe5d0318d	Add audit events for the extended attribute system calls. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-07-05 15:46:02 +00:00
Maxim Konovalov	1f36c876a1	o Fix grammar in the comment, indent macros. No functional changes.	2006-07-02 20:53:52 +00:00
Maxim Konovalov	75d960eb2e	o Remove rev. 1.57 leftover, not reached code.	2006-07-02 20:49:46 +00:00
Maxim Konovalov	e2668f5563	o Fix typo in the comment. PR: kern/99632 Submitted by: clsung	2006-06-30 08:10:55 +00:00
David E. O'Brien	2e4db89cfc	Fix building with GCC 4.2: define data types before referring to them.	2006-06-29 19:37:31 +00:00
John Baldwin	fe95c76276	Fix semctl(2) breakage from the previous commit. Previously __semctl() had a local 'semid' variable which was the array index and used uap->semid as the original IPC id. During the kern_semctl() conversion those two variables were collapsed into a single 'semid' variable breaking the places that needed the original IPC ID. To fix, add a new 'semidx' variable to hold the array index and leave 'semid' unmolested as the IPC id. While I'm here, explicitly document that the (undocumented, at least in semctl(2)) SEM_STAT command curiously expects an array index in the 'semid' parameter rather than an IPC id. Submitted by: maxim	2006-06-29 13:58:36 +00:00
David Xu	5151eeb194	Fix a bug when accumulating run time, if a thread calls yield() syscall, its run time may be lost.	2006-06-29 12:29:20 +00:00
David Xu	d29a8ce69b	Fix system load count (noticed by dephij). Remove incorrect comment.	2006-06-29 09:49:00 +00:00
David Xu	0922ef0c42	Remove unused function declaration. Add else statement in sched_calc_pri. Fix a bug when checking interrupt thread in sched_add.	2006-06-29 05:59:36 +00:00
David Xu	d60003a2e4	Remove load balancer code, since it has serious priority inversion problem which really hurts performance on FreeBSD.	2006-06-29 05:36:34 +00:00
John Baldwin	49d409a108	- Add a kern_semctl() helper function for __semctl(). It accepts a pointer to a copied-in copy of the 'union semun' and a uioseg to indicate which memory space the 'buf' pointer of the union points to. This is then used in linux_semctl() and svr4_sys_semctl() to eliminate use of the stackgap. - Mark linux_ipc() and svr4_sys_semsys() MPSAFE.	2006-06-27 18:28:50 +00:00
John Baldwin	597d608f86	- Expand the scope of Giant some in mount(2) to protect the vfsp structure from going away. mount(2) is now MPSAFE. - Expand the scope of Giant some in unmount(2) to protect the mp structure (or rather, to handle concurrent unmount races) from going away. umount(2) is now MPSAFE, as well as linux_umount() and linux_oldumount(). - nmount(2) and linux_mount() were already MPSAFE.	2006-06-27 14:46:31 +00:00
Pawel Jakub Dawidek	0bd645ae0c	Compress direct cr_ruid comparison and jailed() call to suser_cred(9). Reviewed by: rwatson	2006-06-27 11:32:08 +00:00
Pawel Jakub Dawidek	8838c27693	Use suser_cred(9) instead of checking cr_uid directly. Reviewed by: rwatson	2006-06-27 11:29:38 +00:00
Pawel Jakub Dawidek	2905ade228	- Use suser_cred(9) instead of checking cr_ruid directly. - For privileged processes save two mutex operations. We may want to consider whether it is a good idea to use SUSER_ALLOWJAIL here, but for now I didn't want to change the original behaviour. Reviewed by: rwatson	2006-06-27 11:28:50 +00:00
Sergey Babkin	d81175c738	Backed out the change by request from rwatson. PR: kern/14584	2006-06-26 22:03:22 +00:00
John Baldwin	c94ce032df	Address a problem I missed in removing Giant from the kernel linker. Not all of the module event handlers are MP safe yet, so always acquire Giant for now when invoking module event handlers. Eventually we can add an MPSAFE flag or some such and add appropriate locking to all module event handlers.	2006-06-26 18:34:45 +00:00
John Baldwin	322fb40cbf	Remove duplicate security checks already performed in kern_kldload().	2006-06-26 18:33:32 +00:00
Robert Watson	e83b30bdcb	Trim basically unused 'unp' in uipc_connect().	2006-06-26 16:18:22 +00:00
Sergey Babkin	7a799f1ef0	The common UID/GID space implementation. It has been discussed on -arch in 1999, and there are changes to the sysctl names compared to PR, according to that discussion. The description is in sys/conf/NOTES. Lines in the GENERIC files are added in commented-out form. I'll attach the test script I've used to PR. PR: kern/14584 Submitted by: babkin	2006-06-25 18:37:44 +00:00
Ian Dowse	450ec4ed45	If linker_release_module() fails then we still hold a reference on the linker_file, so record this by restoring the linker_file pointer in fp->file.	2006-06-25 12:36:21 +00:00
Pawel Jakub Dawidek	92c0849935	Simplify the code and remove two mutex operations. MFC after: 2 weeks	2006-06-24 22:55:43 +00:00
John Baldwin	70f3778827	Replace the kld_mtx mutex with a kld_sx sx lock and expand its scope to protect all linker-related data structures including the contents of linker file objects and any linker class data as well. Considering how rarely the linker is used I just went with the simple solution of single-threading the whole thing rather than expending a lot of effort on something more fine-grained and complex. Giant is still explicitly acquired while registering and deregistering sysctl's as well as in the elf linker class while calling kmupetext(). The rest of the linker runs without Giant unless it has to acquire Giant while loading files from a non-MPSAFE filesystem.	2006-06-21 20:42:08 +00:00
John Baldwin	cbda6f950b	- Push down Giant in kldfind() and kldsym(). - Remove several goto's by either using direct return's or else clauses.	2006-06-21 20:15:36 +00:00
John Baldwin	d36e739a0c	Whoops, revert accidental commit.	2006-06-21 17:48:59 +00:00
John Baldwin	9dd44bd79e	Fix two comments and a style fix.	2006-06-21 17:48:03 +00:00
John Baldwin	0df2972736	Various whitespace fixes.	2006-06-21 17:47:45 +00:00
John Baldwin	62d615d508	Conditionally acquire Giant around VFS operations.	2006-06-20 21:31:38 +00:00
John Baldwin	aeeb017bd6	- Push Giant down into linker_reference_module(). - Add a new function linker_release_module() as a more intuitive complement to linker_reference_module() that wraps linker_file_unload(). linker_release_module() can either take the module name and version info passed to linker_reference_module() or it can accept the linker file object returned by linker_reference_module().	2006-06-20 20:54:13 +00:00
John Baldwin	f462ce3edd	Make linker_find_file_by_name() and linker_find_file_by_id() static to simplify linker locking. The only external consumers now use linker_file_foreach().	2006-06-20 20:41:15 +00:00
John Baldwin	932151064a	- Add a new linker_file_foreach() function that walks the list of linker file objects calling a user-specified predicate function on each object. The iteration terminates either when the entire list has been iterated over or the predicate function returns a non-zero value. linker_file_foreach() returns the value returned by the last invocation of the predicate function. It also accepts a void * context pointer that is passed to the predicate function as well. Using an iterator function avoids exposing linker internals to the rest of the kernel making locking simpler. - Use linker_file_foreach() instead of walking the list of linker files manually to lookup ndis files in ndis(4). - Use linker_file_foreach() to implement linker_hwpmc_list_objects().	2006-06-20 20:37:17 +00:00
John Baldwin	aaf3170501	Make linker_file_add_dependency() and linker_load_module() static since only the linker uses them.	2006-06-20 20:18:42 +00:00
John Baldwin	e767366f99	Don't check if malloc(M_WAITOK) returns NULL.	2006-06-20 20:11:00 +00:00
John Baldwin	e5bb3a01d7	Use 'else' to remove another goto.	2006-06-20 19:49:28 +00:00
John Baldwin	73a2437a83	- Remove some useless variable initializations. - Make some conditional free()'s where the condition was always true unconditional.	2006-06-20 19:32:10 +00:00
George V. Neville-Neil	fb11be62a2	Properly cast the values of valsize (the size of the value passed in) in setsockopt so that they can be compared correctly against negative values. Passing in a negative value had a rather negative effect on our socket code, making it impossible to open new sockets. PR: 98858 Submitted by: James.Juran@baesystems.com MFC after: 1 week	2006-06-20 12:36:40 +00:00
Robert Watson	721150ad8f	When retrieving SO_ERROR via getsockopt(), hold the socket lock around the retrieval and replacement with 0. MFC after: 1 week	2006-06-18 19:02:49 +00:00
Yaroslav Tykhiy	42ccd54fec	Add a funny sysctl: debug.kdb.trap_code . It is similar to debug.kdb.trap, except for it tries to cause a page fault via a call to an invalid pointer. This can highlight differences between a fault on data access vs. a fault on code call some CPUs might have. This appeared as a test for a work \ Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-18 12:27:59 +00:00
Robert Watson	cd3a3a269f	Remove sbinsertoob(), sbinsertoob_locked(). They violate (and have basically always violated) invariants of soreceive(), which assume that the first mbuf pointer in a receive socket buffer can't change while the SB_LOCK sleepable lock is held on the socket buffer, which is precisely what these functions do. No current protocols invoke these functions, and removing them will help discourage them from ever being used. I should have removed them years ago, but lost track of it. MFC after: 1 week Prodded almost by accident by: peter	2006-06-17 22:48:34 +00:00
Ed Maste	374875fa56	Add a description for sysctl -d.	2006-06-17 02:58:18 +00:00
Robert Watson	9a44cbf19c	Remove unused (and ifdef'd) unp_abort() and unp_drain(). MFC after: 1 month	2006-06-16 22:11:49 +00:00
David Malone	93ef14a74b	Add a kern.timecounter.tc sysctl tree that contains the mask, frequency, quality and current value of each available time counter. At the moment all of these are read-only, but it might make sense to make some of these read-write in the future. MFC after: 3 months	2006-06-16 20:29:05 +00:00
Yaroslav Tykhiy	be70abccba	Kill an XXX remark that has been untrue since rev. 1.150 of this file.	2006-06-16 07:36:18 +00:00
Christian S.J. Peron	4f0840f348	Axe Giant from vn_fullpath(9). The vnode -> pathname lookup should be filesystem agnostic. We are not touching any file system specific functions in this code path. Since we have a cache lock, there is really no need to keep Giant around here. This eliminates Giant acquisitions for any syscall which is auditing pathnames. Discussed with: jeff	2006-06-16 05:09:28 +00:00
Maxim Konovalov	059d68dea6	o Expand an exclusive lock scope to prevent a race between two simultaneous module_register(). Original work done by: Alex Lyashkov Reviewed by: jhb MFC after: 2 weeks	2006-06-15 08:53:09 +00:00
David Xu	7bb561fbb9	Use scheduler API sched_relinquish() to implement yield() syscall.	2006-06-15 06:41:57 +00:00
David Xu	36ec198bd5	Add scheduler API sched_relinquish(), the API is used to implement yield() and sched_yield() syscalls. Every scheduler has its own way to relinquish cpu, the ULE and CORE schedulers have two internal run- queues, a timesharing thread which calls yield() syscall should be moved to inactive queue.	2006-06-15 06:37:39 +00:00
David Xu	c2c1ab1858	Clear ke_runq before calling maybe_preempt, this avoids a KASSERT(ke->ke_runq == NULL) panic when the sched_add is recursively called by maybe_preempt. Reported by: Wojciech A. Koszek < dunstan at freebsd dot czest dot pl >	2006-06-14 03:46:03 +00:00
Xin LI	6ad26d8376	Unexpand an instance of TAILQ_EMPTY()	2006-06-14 03:14:26 +00:00
Marcel Moolenaar	e1684acf38	Unbreak 64-bit architectures. The 3rd argument to kern_kldload() is a pointer to an integer and td->td_retval[0] is of type register_t. On 64-bit architectures register_t is wider than an integer.	2006-06-14 03:01:06 +00:00
David Xu	2c7cae8042	Fix a typo in sched_is_timeshare.	2006-06-13 23:45:59 +00:00
David Xu	e15abbf251	Pass boolean value to __predict_false. Try to keep the KSE slot count correct for a migrating thread; the count is a bit of a mess.	2006-06-13 23:01:50 +00:00
John Baldwin	edd32c2da2	Use kern_kldload() and kern_kldunload() to load and unload modules when we intend for the user to be able to unload them later via kldunload(2) instead of calling linker_load_module() and then directly adjusting the ref count on the linker file structure. This makes the resulting consumer code simpler and cleaner and better hides the linker internals making it possible to sanely lock the linker.	2006-06-13 21:36:23 +00:00
John Baldwin	b21c9288ce	A couple of minor style tweaks.	2006-06-13 21:34:12 +00:00
John Baldwin	d53885879d	- Add a kern_kldload() that is most of the previous kldload() and push Giant down in it. - Push Giant down in kern_kldunload() and reorganize it slightly to avoid using gotos. Also, expose this function to the rest of the kernel.	2006-06-13 21:28:18 +00:00
John Baldwin	6b3d277ad4	- Push down Giant some in kldstat(). - Use a 'struct kld_file_stat' on the stack to read data under the lock and then do one copyout() w/o holding the lock at the end to push the data out to userland.	2006-06-13 21:11:12 +00:00
John Baldwin	b904477c68	Unexpand TAILQ_FOREACH() and TAILQ_FOREACH_SAFE().	2006-06-13 20:49:07 +00:00
John Baldwin	3a600aeabc	Remove some more pointless goto's and don't check to see if malloc(M_WAITOK) returns NULL.	2006-06-13 20:27:23 +00:00
John Baldwin	2fa6cc80d7	Handle the simple case of just dropping a reference near the start of linker_file_unload() instead of in the middle of a bunch of code for the case of dropping the last reference to improve readability and sanity. While I'm here, remove pointless goto's that were just jumping to a return statement.	2006-06-13 19:45:08 +00:00
Maxim Konovalov	70df31f4de	o There are two methods to get a process credentials over the unix sockets: 1) A sender sends SCM_CREDS message to a receiver, struct cmsgcred; 2) A receiver sets LOCAL_CREDS socket option and gets sender credentials in control message, struct sockcred. Both methods use the same control message type SCM_CREDS with the same control message level SOL_SOCKET, so they are indistinguishable for the receiver. A difference in struct cmsgcred and struct sockcred layouts may lead to unwanted effects. Now for sockets with LOCAL_CREDS option remove all previous linked SCM_CREDS control messages and then add a control message with struct sockcred so the process specifically asked for the peer credentials by LOCAL_CREDS option always gets struct sockcred. PR: kern/90800 Submitted by: Andrey Simonenko Regres. tests: tools/regression/sockets/unix_cmsg/ MFC after: 1 month	2006-06-13 14:33:35 +00:00
David Xu	b41f1452d9	Add scheduler CORE, work I did half a year ago and recently picked up again. The scheduler is forked from ULE, but the algorithm to detect an interactive process is almost completely different from ULE's; it comes from the Linux paper "Understanding the Linux 2.6.8.1 CPU Scheduler", although I still use the same word "score" for a priority boost as in the ULE scheduler. Briefly, the scheduler has the following characteristics: 1. A timesharing process's nice value is seriously respected; the timeslice and interaction-detecting algorithm are based on the nice value. 2. Per-cpu scheduling queue and load balancing. 3. O(1) scheduling. 4. Some cpu affinity code in the wakeup path. 5. Support for POSIX SCHED_FIFO and SCHED_RR. Unlike the 4BSD and ULE schedulers, which use fuzzy RQ_PPQ, the scheduler uses 256 priority queues. Unlike ULE, which uses pull and push, the scheduler uses the pull method; the main reason is to let a relatively idle cpu do the work, but currently the whole scheduler is protected by the big sched_lock, so the benefit is not visible. It really can be worse than nothing because all other cpus are locked out when we are doing balancing work; the 4BSD scheduler does not have this problem. The scheduler does not support hyperthreading very well; in fact, the scheduler does not distinguish between physical and logical CPUs. This should be improved in the future. The scheduler has a priority inversion problem on MP machines; it is not good for realtime scheduling and can cause realtime process starvation. As a result, it seems that MySQL super-smack runs better on my Pentium-D machine when using libthr, regardless of whether the kernel is UP or SMP.	2006-06-13 13:12:56 +00:00
John Baldwin	5c69ad8374	Use fget() in kqueue_register() instead of doing all the work by hand.	2006-06-12 21:46:23 +00:00
Warner Losh	ccdc8d9bff	Add a convenience function rman_init_from_resource for initializing a rman from a resource. Also, include _bus.h since the implementation of bus_space isn't needed here, just the definitions of the types.	2006-06-12 04:06:21 +00:00
Ian Dowse	eb1030c4fd	Keep firmware images on the list until they have been unregistered with firmware_unregister(). Previously when the last driver reference had been dropped we would clear the list entry under the assumption that the firmware module was about to be unloaded, but this was not true if the firmware image had been loaded manually with kldload. This makes it possible to manually kldload firmware images as a workaround for drivers such as ipw that attempt to load firmware while resuming after a suspend. Reviewed by: mlaier (an earlier version of the patch)	2006-06-10 17:04:07 +00:00
Robert Watson	b37ffd3189	Move some functions and definitions from uipc_socket2.c to uipc_socket.c: - Move sonewconn(), which creates new sockets for incoming connections on listen sockets, so that all socket allocate code is together in uipc_socket.c. - Move 'maxsockets' and associated sysctls to uipc_socket.c with the socket allocation code. - Move kern.ipc sysctl node to uipc_socket.c, add a SYSCTL_DECL() for it to sysctl.h and remove lots of scattered implementations in various IPC modules. - Sort sodealloc() after soalloc() in uipc_socket.c for dependency order reasons. Statisticize soalloc() and sodealloc() as they are now required only in uipc_socket.c, and are internal to the socket implementation. After this change, socket allocation and deallocation is entirely centralized in one file, and uipc_socket2.c consists entirely of socket buffer manipulation and default protocol switch functions. MFC after: 1 month	2006-06-10 14:34:07 +00:00
Robert Watson	e02421f3fb	Rearrange code in soalloc() so that it's less indented by returning early if uma_zalloc() from the socket zone fails. No functional change. MFC after: 1 week	2006-06-08 22:33:18 +00:00
Konstantin Belousov	55aef2632f	Fix the LOR that occurs when the MAC compiled into the kernel and vnode is destroyed. Reviewed by: rwatson LOR: 189 MFC after: 2 weeks Approved by: kan (mentor)	2006-06-08 07:55:10 +00:00
David Xu	0ae716e5ee	Make ke_rqindex unsigned.	2006-06-06 12:26:17 +00:00
Robert Watson	7ebfc8df78	Audit some arguments to nmount(), mount(), umount(). Submitted by: wsalamon Obtained from: TrustedBSD Project	2006-06-05 15:32:07 +00:00
Robert Watson	6e79e6f805	Audit command, uid arguments for quotactl(). Audit the mode argument to mkfifo(). Audit the target path passed to symlink(). Submitted by: wsalamon Obtained from: TrustedBSD Project	2006-06-05 13:34:23 +00:00
Robert Watson	d3778141bf	Audit path passed to the acct() system call. Obtained from: TrustedBSD Project	2006-06-05 13:02:34 +00:00
John Baldwin	49b94bfc54	Bah, fix fat finger in last. Invert the ~ on MTX_FLAGMASK as it's non-intuitive for the ~ to be built into the mask. All the users now explicitly ~ the mask. In addition, add MTX_UNOWNED to the mask even though it technically isn't a flag. This should unbreak mtx_owner(). Quickly spotted by: kris	2006-06-03 21:11:33 +00:00
John Baldwin	3ce3f44293	In the case of reentering the debugger due to an attempt to perform a context switch while in the debugger, reenter the debugger sooner before performing any statistics updates.	2006-06-03 20:49:44 +00:00
John Baldwin	315ce35f7b	Simplify mtx_owner() so it only reads m->mtx_lock once.	2006-06-03 20:45:00 +00:00
John Baldwin	f781b5a4bb	Style fix to be more like _mtx_lock_sleep(): use 'while (!foo) { ... }' instead of 'for (;;) { if (foo) break; ... }'.	2006-06-03 20:44:01 +00:00
Pawel Jakub Dawidek	1f58dd4956	Fix a problem introduced in revision 1.220. On mount(2) failure, don't forget to unbusy file system before its destruction. This fixes the following warning on mount failure: Mount point <X> had 1 dangling refs Tested by: wkoszek	2006-06-02 20:29:02 +00:00
Doug Ambrisko	51e37c7f37	Make lio ident more consistent with aio ident.	2006-06-02 17:45:48 +00:00
Pawel Jakub Dawidek	f420242b2b	Don't forget to unlock kq lock in low memory situations. OK'ed by: jmg	2006-06-02 13:23:39 +00:00
Pawel Jakub Dawidek	8ebab14c70	Remove confusing done_noglobal label. The KQ_GLOBAL_UNLOCK() macro knows how to handle both situations - when kq_global lock is and is not held. OK'ed by: jmg	2006-06-02 13:21:21 +00:00
Pawel Jakub Dawidek	241321abc0	Use SLIST_FOREACH_SAFE() macro, because knote_drop() can free an element which can be then used to find next element in the list. OK'ed by: jmg	2006-06-02 13:18:59 +00:00
Olivier Houchard	4bb0f51d1d	sched_rem() already sets ke->ke_state to KES_THREAD, so there's no need to redo it.	2006-06-01 22:45:56 +00:00
Diomidis Spinellis	23efd78d03	Remove two locking assertion entries that: a) were incorrectly written and therefore never compiled into assertions, and b) were incorrectly specified and when compiled resulted in a failed assertion.	2006-05-31 14:06:06 +00:00
Diomidis Spinellis	f69ec7af12	Assertion code specifications are introduced using special character sequences that are distinct from comments. %% is used for argument locks; %! for pre- and post-conditions.	2006-05-30 20:49:54 +00:00
Diomidis Spinellis	b1b4282160	Remove incorrect lock validation specifications that caused failed assertions with DEBUG_VFS_LOCKS. We should reinstate them with correct specifications, possibly after extending vnode_if.awk Noted by: truckman@	2006-05-30 20:21:51 +00:00
Tor Egge	57051fdc4b	Close race between vmspace_exitfree() and exit1() and races between vmspace_exitfree() and vmspace_free() which could result in the same vmspace being freed twice. Factor out part of exit1() into new function vmspace_exit(). Attach to vmspace0 to allow old vmspace to be freed earlier. Add new function, vmspace_acquire_ref(), for obtaining a vmspace reference for a vmspace belonging to another process. Avoid changing vmspace refcount from 0 to 1 since that could also lead to the same vmspace being freed twice. Change vmtotal() and swapout_procs() to use vmspace_acquire_ref(). Reviewed by: alc	2006-05-29 21:28:56 +00:00
Xin LI	56e26c3e7e	Unexpand TAILQ_FIRST(foo) == NULL to TAILQ_EMPTY(foo).	2006-05-29 05:43:26 +00:00
Kris Kennaway	80a8e5da94	Correct typos MFC after: 2 weeks	2006-05-28 22:15:28 +00:00
Robert Watson	4bb260ad78	In execve(), audit the path name being executed. In the future, it would also be good to audit the interpreter pathname, if any. Obtained from: TrustedBSD Project	2006-05-28 08:28:47 +00:00
Diomidis Spinellis	0e1c7fb8ea	Add missing % signs in the lock annotations of the functions: lookup, rename, strategy, islocked The missing % sign meant that the lines were processed as plain comments and the corresponding assertions were never generated.	2006-05-28 07:24:12 +00:00
Xin LI	e38c7f3ef3	extlen and cpp are not used here in linker_search_kld(), so nuke them. Reported by: Mingyan Guo <guomingyan at gmail dot com> MFC After: 2 weeks	2006-05-27 09:21:41 +00:00
Poul-Henning Kamp	9dd2370db6	If the console has no cncheckc method, use cngetc instead.	2006-05-26 11:00:20 +00:00
Poul-Henning Kamp	8aed7613bd	Don't use CONS_DRIVER() macro to insert dummy element in cons_set	2006-05-26 10:46:38 +00:00
Poul-Henning Kamp	16b1613a31	GC the cn_dbctl_t hook for consoles, it is unused. This used to make syscons switch to vty0 when we entered DDB but this was lost in the KDB shuffle. We may want to bring it back down the road but it should be done by calling cn_init_t/cn_term_t instead, possibly with a flag argument saying "Debugger!"	2006-05-26 10:24:00 +00:00
Craig Rodrigues	0c89bb0a02	Add "update" mount option to global_opts array, for use with vfs_filteropt().	2006-05-26 02:38:48 +00:00
Craig Rodrigues	5eb304a91a	Remove calls to vfs_export() for exporting a filesystem for NFS mounting from individual filesystems. Call it instead in vfs_mount.c, after we call VFS_MOUNT() for a specific filesystem.	2006-05-26 00:32:21 +00:00
Robert Watson	20bdac8a4f	Use getsock() and fput() instead of fgetsock() and fputsock() in sendfile(). This causes sendfile() to use the file descriptor reference to the socket instead of bumping the socket reference count, which avoids an additional refcount operation, as well as a potential expensive socket refcount drop, which can lead to contention on the accept mutex. This change also has the side effect of further reducing the number of cases where an in-progress I/O operation can occur on a socket after close, as using the file descriptor refcount prevents the socket from closing while in use. MFC after: 3 months	2006-05-25 15:10:13 +00:00
Stephan Uphoff	dcf67e65d2	Do not set B_NOCACHE on buffers when releasing them in flushbuflist(). If B_NOCACHE is set the pages of vm backed buffers will be invalidated. However clean buffers can be backed by dirty VM pages so invalidating them can lead to data loss. Add support for flushing dirty pages in the data invalidation function of some network file systems. This fixes data losses during vnode recycling (and other code paths using invalbuf(,V_SAVE,,*)) for data written using an mmaped file. Collaborative effort by: jhb@,mohans@,peter@,ps@,ups@ Reviewed by: tegge@ MFC after: 7 days	2006-05-25 01:00:35 +00:00
Sam Leffler	75b773ae3d	When starting up threads in taskqueue_start_threads create them stopped before adjusting their priority and setting them on the run q so they cannot race for resources (pointed out by njl). While here add a console printf on thread create fails; otherwise no one may notice (e.g. return value is always 0 and caller has no way to verify). Reviewed by: jhb, scottl MFC after: 2 weeks	2006-05-24 22:11:07 +00:00
David Xu	f705bbe8b1	Don't allow non-root user to set a scheduler policy, otherwise this could be a local DOS. Submitted by: Diane Bruce at db at db.net	2006-05-21 00:40:38 +00:00
David Xu	f6c040a2c5	Style fixes. Submitted by: Diane Bruce < db at db dot net >	2006-05-19 06:37:24 +00:00
David Xu	7b8d821268	Move flag TDF_UMTXQ into structure umtxq, this eliminates the requirement of scheduler lock in some umtx code.	2006-05-18 08:43:46 +00:00
Poul-Henning Kamp	d595182f0b	Make the printfs relating to purging threads from a device less intrusive.	2006-05-17 06:37:14 +00:00
Poul-Henning Kamp	c40da00ca3	Since DELAY() was moved, most <machine/clock.h> #includes have been unnecessary.	2006-05-16 14:37:58 +00:00
Paul Saab	6befa6ae1b	Allow concurrent read(2)/readv(2) access to a file. Lock file offset against multiple read calls. Submitted by: ups Obtained from: Yahoo! MFC after: 2 weeks	2006-05-16 07:50:54 +00:00
Kelly Yancey	c9ad8a67af	Restore the ability to mount procfs and fdescfs filesystems via the mount(2) system call: * Add cmount hook to fdescfs and pseudofs (and, by extension, procfs and linprocfs). This (mostly) restores the ability to mount these filesystems using the old mount(2) system call (see below for the rest of the fix). * Remove not-NULL check for the data argument from the mount(2) entry point. Per the mount(2) man page, it is up to the individual filesystem being mounted to verify data. Or, in the case of procfs, etc. the filesystem is free to ignore the data parameter if it does not use it. Enforcing data to be not-NULL in the mount(2) system call entry point prevented passing NULL to filesystems which ignored the data pointer value. Apparently, passing NULL was common practice in such cases, as even our own mount_std(8) used to do it in the pre-nmount(2) world. All userland programs in the tree were converted to nmount(2) long ago, but I've found at least one external program which broke due to this (presumably unintentional) mount(2) API change. One could argue that external programs should also be converted to nmount(2), but then there isn't much point in keeping the mount(2) interface for backward compatibility if it isn't backward compatible.	2006-05-15 19:42:10 +00:00
Benno Rice	77fe443878	The VERBOSE_SYSINIT stuff sees the DDB define a lot better if we include opt_ddb.h. Spotted by: benno Pointy hat to: benno	2006-05-14 07:11:28 +00:00
Craig Rodrigues	5250012a1d	For nmount(), if "rw" is specified as a mount option, add "noro" to the list of mount options. This allows a read-only mount to be converted to read-write via: mount -u -o rw Requested by: kris	2006-05-14 01:51:38 +00:00
John Baldwin	73dbd3da73	Remove various bits of conditional Alpha code and fixup a few comments.	2006-05-12 05:04:46 +00:00
Benno Rice	26ab616fdc	Add a new kernel config option, VERBOSE_SYSINIT. When porting FreeBSD to a new platform, one of the more useful things to do is get mi_startup() to let you know which SYSINIT it's up to. Most people tend to whack a printf in the SYSINIT loop to print the address of the function it's about to call. Going one better, jhb made a version that uses DDB to look up the name of the function and print that instead. This version is essentially his with the addition of some ifdeffery to make it optional and to allow it to work (although using only the function address, not the symbol) if you forgot to enable DDB. All the cool bits by: jhb Approved by: scottl, rink, cognet, imp	2006-05-12 02:01:38 +00:00
Poul-Henning Kamp	99ab8292c7	Remove more straggling CPU_ macro references	2006-05-11 17:53:26 +00:00
David Xu	005efcdb0e	Use wakeup_one to avoid thundering herd. Tested by: kris	2006-05-09 13:00:46 +00:00
David Xu	759ccccadb	Use a dedicated mutex to protect aio queues; the motivation is to reduce lock contention with other parts.	2006-05-09 00:10:11 +00:00
Tor Egge	11991ab418	Call vn_finished_write() before calling the coredump handler which will indirectly call vn_start_write() as necessary for each write.	2006-05-07 22:50:22 +00:00
Tor Egge	d302786c87	Temporarily unlock vnode for new image being executed to avoid lock order reversals that can lead to deadlocks. Normally vn_close(), namei() or vrele() should not be called while holding vnode locks.	2006-05-05 20:25:05 +00:00
Pawel Jakub Dawidek	643df192de	vn_start_write()/vn_finished_write() is not needed here, because vn_start_write() is always called earlier in the code path and calling the function recursively may lead to a deadlock. Confirmed by: tegge MFC after: 2 weeks	2006-04-29 21:57:38 +00:00
Kris Kennaway	cef31ff7d9	Lock giant when assigning ni_vp and keep vfslocked state valid. Committed for: jeff	2006-04-29 07:13:49 +00:00
Pawel Jakub Dawidek	122410eea2	vn_start_write() is called only when v_type != VCHR, so corresponding vn_finished_write() should also be called only then. BTW. I fixed two functions here: vn_rdwr() and vn_write(). The latter seems to be unused. MFC after: 3 weeks	2006-04-28 21:54:05 +00:00
Robert Watson	3bf14fd5e9	Also check use_pty in the ptmx clone lookup; this means that when ptmx support is turned off using the sysctl, we no longer even allow the ptmx device to be looked up. Foot provided by: peter	2006-04-28 21:39:57 +00:00
Marcel Moolenaar	8f405ed335	Remove the puc-specific hacks. The puc(4) driver now properly uses the rman(9) interface.	2006-04-28 21:23:09 +00:00
Jeff Roberson	6ca9fcc586	- Add a BO_NEEDSGIANT flag to the bufobj. This flag forces all child buffers to go on the buf daemon's DIRTYGIANT queue. - Set BO_NEEDSGIANT on ffs's devvp since the ffs_copyonwrite handler runs in the context of the buf daemon and may require Giant.	2006-04-28 01:05:31 +00:00
Jeff Roberson	4b5b86816c	- Consistently track ni_dvp and ni_vp with dvfslocked and vfslocked rather than trying to optimize it into a single lock. This adds more calls to lock giant with non smpsafe filesystems but is the only way to reliably hold the correct lock. - Remove an invalid assert in the mountedhere case in lookup and fix the code to properly deal with the scenario. We can actually have a lookup that returns dp == dvp with mountedhere set with certain unmount races. Tested by: kris Reported by: kris/mohans	2006-04-28 00:59:48 +00:00
John-Mark Gurney	5c06d111b8	back out for now... revert ccpu to being kern.ccpu...	2006-04-27 17:57:59 +00:00
John-Mark Gurney	c71ce6a445	move remaining sysctl into the kern.sched tree...	2006-04-26 19:42:38 +00:00
John Baldwin	ae110b53d1	Add some new commands to hopefully make it easier to diagnose lock-related problems in ddb: - "show threadchain [thread]" will start with the specified thread (or the current kdb thread by default) and show its state. If it is blocked on a lock, it will find the owner of the lock and show its state, etc. - "show allchains" will find all of the threads that are blocked on a lock (but do not have any threads blocked on a lock they hold) and show the resulting thread chain. - "show lockchain <lock>" takes a pointer to a lock_object (such as a mutex or rwlock). If there is a turnstile for that lock, then it will display all the threads blocked on the lock. In addition, for each thread blocked on the lock, it will display any contested locks they hold, and recurse on those locks to show any threads blocked on those locks, etc.	2006-04-25 20:28:17 +00:00
John Baldwin	de833b7c0c	Use db_lookup_thread() to lookup the thread for the passed in address and change 'show locks' to only list the locks for a given thread rather than for all the threads in the process containing a specified thread.	2006-04-25 20:24:23 +00:00
Marius Strobl	fa63296aba	Remove last vestiges of sab(4).	2006-04-25 19:43:53 +00:00
Robert Watson	102ea03373	Extend getsock() to return the struct file flags read while holding the file lock, in the style of fgetsock(). Modify accept1() to use getsock() instead of fgetsock(), relying on the file descriptor reference rather than an acquired socket reference to prevent the listen socket from being destroyed during accept(). This avoids additional reference count operations, which should improve performance, and also avoids accept1() operating on a socket whose file descriptor has been torn down, which may have resulted in protocol shutdown starting. MFC after: 3 months	2006-04-25 11:48:16 +00:00
Maxim Konovalov	481f8fe85f	Inherit LOCAL_CREDS option from listen socket for sockets returned by accept(2). PR: kern/90644 Submitted by: Andrey Simonenko OK'ed by: mdodd Tested by: NetBSD regress/sys/kern/unfdpass/unfdpass.c MFC after: 1 month	2006-04-24 19:09:33 +00:00
Marcel Moolenaar	845652dd28	MFp4: Add the ipend() method to the serdev I/F to allow umbrella drivers to obtain pending interrupt status from subordinate drivers.	2006-04-23 22:12:39 +00:00
Robert Watson	0cec9959e8	Assert that sockets passed into soabort() not be SQ_COMP or SQ_INCOMP, since that removal should have been done a layer up. MFC after: 3 months	2006-04-23 18:15:54 +00:00
Robert Watson	28ea180136	Add missing 'not' to SQ_COMP comment. MFC after: 3 months	2006-04-23 15:37:23 +00:00
Robert Watson	6ca35d4b81	Move handling of SQ_COMP exception case in sofree() to the top of the function along with the remainder of the reference checking code. Move comment from body to header with remainder of comments. Inclusion of a socket in a completed connection queue counts as a true reference, and should not be handled as an under-documented edge case. MFC after: 3 months	2006-04-23 15:33:38 +00:00
John Baldwin	f9ab2f134f	Print td_name instead of p_comm if td_name is non-empty for 'show turnstile' and 'show sleepq'.	2006-04-21 20:40:43 +00:00

9478 Commits