freebsd-skq

Author	SHA1	Message	Date
julian	e88eb0a6e9	If the process is a zombie, then you must not try dereference the thread because there isn't one. Of course this code only possibly works for single threaded processes anyhow..	2002-06-30 07:50:22 +00:00
alfred	b7edb33754	Partial backout of 1.318, remove error handling added because it may be incorrect. Requested by: bde	2002-06-30 05:23:58 +00:00
iedowse	b5040e7472	Add a hashdestroy() function to undo the actions of hashinit().	2002-06-30 02:07:26 +00:00
alfred	b3a394a74c	Fix several style bugs: close up the continued line after removing the cast made the line. space before parentheses in indirect function call. Add an addtional error handler case for the results of callback. Submitted by: bde	2002-06-29 17:58:44 +00:00
alfred	ce846a9c49	Unbreak computation of 'smask' that I broke when removing caddr_t. Submitted by: bde	2002-06-29 17:56:34 +00:00
julian	aa2dc0a5d9	Part 1 of KSE-III The ability to schedule multiple threads per process (one one cpu) by making ALL system calls optionally asynchronous. to come: ia64 and power-pc patches, patches for gdb, test program (in tools) Reviewed by: Almost everyone who counts (at various times, peter, jhb, matt, alfred, mini, bernd, and a cast of thousands) NOTE: this is still Beta code, and contains lots of debugging stuff. expect slight instability in signals..	2002-06-29 17:26:22 +00:00
julian	a06b68b34f	Add files that are new for KSE.	2002-06-29 07:04:59 +00:00
obrien	4db8ac83cb	Rename the db command lockedvnodes to lockedvnods so that it fits on the help screen and one doens't think we have a lockedvnodesmap command.	2002-06-29 04:45:09 +00:00
alfred	97873dcbf3	more caddr_t removal.	2002-06-29 02:00:02 +00:00
alfred	d1cbf6a1d1	More caddr_t removal, make fo_ioctl take a void * instead of a caddr_t.	2002-06-29 01:50:25 +00:00
alfred	d92411ce7a	catch up with mextadd callback taking a void argument instead of a caddr_t.	2002-06-29 01:49:22 +00:00
alfred	8cd894ca70	More caddr_t removal. Change struct knote's kn_hook from caddr_t to void *.	2002-06-29 00:29:12 +00:00
alfred	d47feb4376	nuke more instances of caddr_t	2002-06-29 00:02:01 +00:00
alfred	65984a1f06	m_extadd takes a void (freef)(void , void ) now, not a void (freef)(caddr_t, void *).	2002-06-29 00:01:46 +00:00
alfred	b0475cc9d5	remove or replace caddr_t with void. make the mbuf external free function take a void * rather than caddr_t.	2002-06-28 23:48:23 +00:00
alfred	708aac7550	nuke caddr_t.	2002-06-28 23:17:36 +00:00
alfred	ae44f24f87	Remove unneeded casts to caddr_t.	2002-06-28 23:02:38 +00:00
alfred	d1029cb00e	document that the pipe fo_stat routine doesn't need locks because it's a read operation. Requested by: rwatson	2002-06-28 22:35:12 +00:00
jeff	1ed9e0f375	Improve the VOP locking asserts - Add vfs_badlock_print to control whether or not we print lock violations - Add vfs_badlock_panic to control whether we panic on lock violations Both default to on to mimic the original behavior if DEBUG_VFS_LOCKS is on.	2002-06-28 20:58:14 +00:00
iedowse	792737cf4d	In vn_mkdir(), use vrele() instead of vput() on the parent directory vnode in the case that the target exists and is the same vnode as the parent (i.e. "mkdir ."). The namei() call does not leave the vnode locked in this case even though you might expect it to. This bug was mostly harmless in practice because unlocking an already unlocked vnode currently does not trigger any panics or warnings. Reviewed by: jeff	2002-06-28 20:06:47 +00:00
jeff	9fbb8d216f	Clean up vn_rdwr locking. - Do shared locks on read. - Only do vn_{start,finished}_write when writing.	2002-06-28 17:51:11 +00:00
green	62d02a6b93	Fix a case where a vnode got explicitly unlocked after the pointer to it got set to NULL. Revision 1.355: in the box	2002-06-28 16:17:47 +00:00
luigi	f5aea44c67	Remove a printf and add a comment on an assumption that could be occasionally violated by device drivers.	2002-06-27 23:23:04 +00:00
rwatson	c282ad9b24	Fix a bug that prevented the deletion of non-default ACLs from being passed down the VFS stack. While I'm here, replace a '0' with a 'NULL' to make the code more readable. Sponsored by: DARPA, NAI Labs Obtained from: TrustedBSD Project	2002-06-27 19:31:15 +00:00
rwatson	99f90043d5	A bit of whitespace magic.	2002-06-27 19:30:11 +00:00
ken	0d3a835f3f	At long last, commit the zero copy sockets code. MAKEDEV: Add MAKEDEV glue for the ti(4) device nodes. ti.4: Update the ti(4) man page to include information on the TI_JUMBO_HDRSPLIT and TI_PRIVATE_JUMBOS kernel options, and also include information about the new character device interface and the associated ioctls. man9/Makefile: Add jumbo.9 and zero_copy.9 man pages and associated links. jumbo.9: New man page describing the jumbo buffer allocator interface and operation. zero_copy.9: New man page describing the general characteristics of the zero copy send and receive code, and what an application author should do to take advantage of the zero copy functionality. NOTES: Add entries for ZERO_COPY_SOCKETS, TI_PRIVATE_JUMBOS, TI_JUMBO_HDRSPLIT, MSIZE, and MCLSHIFT. conf/files: Add uipc_jumbo.c and uipc_cow.c. conf/options: Add the 5 options mentioned above. kern_subr.c: Receive side zero copy implementation. This takes "disposable" pages attached to an mbuf, gives them to a user process, and then recycles the user's page. This is only active when ZERO_COPY_SOCKETS is turned on and the kern.ipc.zero_copy.receive sysctl variable is set to 1. uipc_cow.c: Send side zero copy functions. Takes a page written by the user and maps it copy on write and assigns it kernel virtual address space. Removes copy on write mapping once the buffer has been freed by the network stack. uipc_jumbo.c: Jumbo disposable page allocator code. This allocates (optionally) disposable pages for network drivers that want to give the user the option of doing zero copy receive. uipc_socket.c: Add kern.ipc.zero_copy.{send,receive} sysctls that are enabled if ZERO_COPY_SOCKETS is turned on. Add zero copy send support to sosend() -- pages get mapped into the kernel instead of getting copied if they meet size and alignment restrictions. uipc_syscalls.c:Un-staticize some of the sf* functions so that they can be used elsewhere. (uipc_cow.c) if_media.c: In the SIOCGIFMEDIA ioctl in ifmedia_ioctl(), avoid calling malloc() with M_WAITOK. Return an error if the M_NOWAIT malloc fails. The ti(4) driver and the wi(4) driver, at least, call this with a mutex held. This causes witness warnings for 'ifconfig -a' with a wi(4) or ti(4) board in the system. (I've only verified for ti(4)). ip_output.c: Fragment large datagrams so that each segment contains a multiple of PAGE_SIZE amount of data plus headers. This allows the receiver to potentially do page flipping on receives. if_ti.c: Add zero copy receive support to the ti(4) driver. If TI_PRIVATE_JUMBOS is not defined, it now uses the jumbo(9) buffer allocator for jumbo receive buffers. Add a new character device interface for the ti(4) driver for the new debugging interface. This allows (a patched version of) gdb to talk to the Tigon board and debug the firmware. There are also a few additional debugging ioctls available through this interface. Add header splitting support to the ti(4) driver. Tweak some of the default interrupt coalescing parameters to more useful defaults. Add hooks for supporting transmit flow control, but leave it turned off with a comment describing why it is turned off. if_tireg.h: Change the firmware rev to 12.4.11, since we're really at 12.4.11 plus fixes from 12.4.13. Add defines needed for debugging. Remove the ti_stats structure, it is now defined in sys/tiio.h. ti_fw.h: 12.4.11 firmware. ti_fw2.h: 12.4.11 firmware, plus selected fixes from 12.4.13, and my header splitting patches. Revision 12.4.13 doesn't handle 10/100 negotiation properly. (This firmware is the same as what was in the tree previously, with the addition of header splitting support.) sys/jumbo.h: Jumbo buffer allocator interface. sys/mbuf.h: Add a new external mbuf type, EXT_DISPOSABLE, to indicate that the payload buffer can be thrown away / flipped to a userland process. socketvar.h: Add prototype for socow_setup. tiio.h: ioctl interface to the character portion of the ti(4) driver, plus associated structure/type definitions. uio.h: Change prototype for uiomoveco() so that we'll know whether the source page is disposable. ufs_readwrite.c:Update for new prototype of uiomoveco(). vm_fault.c: In vm_fault(), check to see whether we need to do a page based copy on write fault. vm_object.c: Add a new function, vm_object_allocate_wait(). This does the same thing that vm_object allocate does, except that it gives the caller the opportunity to specify whether it should wait on the uma_zalloc() of the object structre. This allows vm objects to be allocated while holding a mutex. (Without generating WITNESS warnings.) vm_object_allocate() is implemented as a call to vm_object_allocate_wait() with the malloc flag set to M_WAITOK. vm_object.h: Add prototype for vm_object_allocate_wait(). vm_page.c: Add page-based copy on write setup, clear and fault routines. vm_page.h: Add page based COW function prototypes and variable in the vm_page structure. Many thanks to Drew Gallatin, who wrote the zero copy send and receive code, and to all the other folks who have tested and reviewed this code over the years.	2002-06-26 03:37:47 +00:00
arr	5290d081cc	- Remove Giant acquisition from modevent(), modfnext(), modstat() and modfind(). Giant is no longer needed by these functions for safe execution. Reviewed by: jhb	2002-06-26 00:31:44 +00:00
arr	b6fded0faf	- Alleviate jail() from having the burden of acquiring Giant by simply removing. We can do this since we no longer need Giant to safely execute jail(). Reviewed by: rwatson, jhb	2002-06-26 00:29:01 +00:00
alc	698aabc226	o Eliminate vmspace::vm_minsaddr. It's initialized but never used. o Replace stale comments in vmspace by "const until freed" annotations on some fields.	2002-06-25 18:14:38 +00:00
jake	e102a9b6dd	Add an MD callout like cpu_exit, but which is called after sched_lock is obtained, when all other scheduling activity is suspended. This is needed on sparc64 to deactivate the vmspace of the exiting process on all cpus. Otherwise if another unrelated process gets the exact same vmspace structure allocated to it (same address), its address space will not be activated properly. This seems to fix some spontaneous signal 11 problems with smp on sparc64.	2002-06-24 15:48:02 +00:00
mux	48b8a20cbe	Bring sys/kern/md5c.c in sync with the userland version. Add a comment so that people don't forget to keep the version in src/lib/libmd/md5c.c in sync with this one. This fixes a warning on sparc64. Reviewed by: phk	2002-06-24 14:15:25 +00:00
mckusick	7d70f5926f	Use proper size in bzero of stat structure. Submitted by: Jake Burkholder <jake@locore.ca> Sponsored by: DARPA & NAI Labs.	2002-06-24 07:14:44 +00:00
mini	ef6f2f567d	Remove unused diagnostic function cread_free_thread(). Approved by: alfred	2002-06-24 06:22:00 +00:00
dillon	3bb3ab3df2	I Noticed a defect in the way wakeup() scans the tailq. Tor noticed an even worse defect in wakeup_one(). This patch cleans up both. Submitted by: tegge MFC after: 3 days	2002-06-24 00:14:36 +00:00
mux	abb73c48ca	More 64 bits platforms warning fixes. Reviewed by: rwatson	2002-06-23 18:32:39 +00:00
mckusick	f09d0a42d9	This patch fixes a size problem with the stat structure for 64-bit architectures that was introduced in the UFS2 code merge two days ago. The stat structure change that caused the problem was the addition of the file create time. Submitted by: Bruce Evans <bde@zeta.org.au> Sponsored by: DARPA & NAI Labs.	2002-06-22 22:01:13 +00:00
mux	df8395282d	We don't need to check the return value of malloc() against NULL when the M_WAITOK flag is specified.	2002-06-22 21:44:11 +00:00
dillon	0af4a7d8da	Fix a bug in vfs_bio_clrbuf(). The single-page-clrbuf optimization was improperly clearing more then just the invalid portions of the page. (This bug is not known to have been triggered by anything). Submitted by: tegge MFC after: 7 days	2002-06-22 19:09:35 +00:00
mux	24aca74f2d	o Remove the initialization of unused fields in the struct uio now that we don't use uiomove() anymore. o Enforce stricter checks on the length of the iov's in nmount(2) since we now malloc() them individually and corrupted iov's could make the kernel crash in malloc() with "kmem_map too small". Reviewed by: phk	2002-06-22 18:07:05 +00:00
mini	783d5aaa2c	Always drop the p_args reference we held for copyout, even if we're about to change it. This fixes a leak triggered by setproctitle(3). Approved by: alfred Noticed by: Peter Jeremy <peter.jeremy@alcatel.com.au>	2002-06-22 10:05:50 +00:00
mckusick	88d85c15ef	This commit adds basic support for the UFS2 filesystem. The UFS2 filesystem expands the inode to 256 bytes to make space for 64-bit block pointers. It also adds a file-creation time field, an ability to use jumbo blocks per inode to allow extent like pointer density, and space for extended attributes (up to twice the filesystem block size worth of attributes, e.g., on a 16K filesystem, there is space for 32K of attributes). UFS2 fully supports and runs existing UFS1 filesystems. New filesystems built using newfs can be built in either UFS1 or UFS2 format using the -O option. In this commit UFS1 is the default format, so if you want to build UFS2 format filesystems, you must specify -O 2. This default will be changed to UFS2 when UFS2 proves itself to be stable. In this commit the boot code for reading UFS2 filesystems is not compiled (see /sys/boot/common/ufsread.c) as there is insufficient space in the boot block. Once the size of the boot block is increased, this code can be defined. Things to note: the definition of SBSIZE has changed to SBLOCKSIZE. The header file <ufs/ufs/dinode.h> must be included before <ufs/ffs/fs.h> so as to get the definitions of ufs2_daddr_t and ufs_lbn_t. Still TODO: Verify that the first level bootstraps work for all the architectures. Convert the utility ffsinfo to understand UFS2 and test growfs. Add support for the extended attribute storage. Update soft updates to ensure integrity of extended attribute storage. Switch the current extended attribute interfaces to use the extended attribute storage. Add the extent like functionality (framework is there, but is currently never used). Sponsored by: DARPA & NAI Labs. Reviewed by: Poul-Henning Kamp <phk@freebsd.org>	2002-06-21 06:18:05 +00:00
mux	3770ca4156	Change the way we internally store the mount options to a linked list. This is to allow the merging of the mount options in the MNT_UPDATE case, as the current data structure is unsuitable for this. There are no functional differences in this commit. Reviewed by: phk	2002-06-20 20:03:42 +00:00
alfred	619c88aeeb	Implement SO_NOSIGPIPE option for sockets. This allows one to request that an EPIPE error return not generate SIGPIPE on sockets. Submitted by: lioux Inspired by: Darwin	2002-06-20 18:52:54 +00:00
alfred	4f8d67f852	Don't leak resources if fdcheckstd() fails during exec. Submitted by: Mike Makonnen <makonnen@pacbell.net>	2002-06-20 17:27:28 +00:00
iedowse	ddd0934a7c	Display the mutex name in the ^T status line if the selected thread is blocked on a mutex. Prepend a '*' to distinguish this case as is done in top(1).	2002-06-20 14:03:36 +00:00
peter	72c289faf4	Remove UIO_USERISPACE - we do not support any split instruction/data address space machines (eg: pdp-11) and are not likely to ever do so. Nothing in our kernel sets this.	2002-06-20 07:08:43 +00:00
peter	4830c34648	Move the "- 1" into the RQB_FFS(mask) macro itself so that implementations can provide a base zero ffs function if they wish. This changes #define RQB_FFS(mask) (ffs64(mask)) foo = RQB_FFS(mask) - 1; to #define RQB_FFS(mask) (ffs64(mask) - 1) foo = RQB_FFS(mask); On some platforms we can get the "- 1" for free, eg: those that use the C code for ffs64(). Reviewed by: jake (in principle)	2002-06-20 06:21:20 +00:00
arr	646109847c	- Remove the lock(9) protecting the kernel linker system. - Added a mutex, kld_mtx, to protect the kernel_linker system. Note that while ``classes'' is global (to that file), it is only read only after SI_SUB_KLD, SI_ORDER_ANY. - Add a SYSINIT to flip a flag that disallows class registration after SI_SUB_KLD, SI_ORDER_ANY. Idea for ``classes'' read only by: jake Reviewed by: jake	2002-06-19 21:25:59 +00:00
phk	1f4e9c0c72	Remove the compat bits for the mis-aligned struct disklabel on alpha, people got three times longer than I promised. Sponsored by: DARPA & NAI Labs.	2002-06-19 08:37:02 +00:00
alfred	bfa1cb192c	Squish the "could sleep with process lock" messages caused by calling uifind() with a proc lock held. change_ruid() and change_euid() have been modified to take a uidinfo structure which will be pre-allocated by callers, they will then call uihold() on the uidinfo structure so that the caller's logic is simplified. This allows one to call uifind() before locking the proc struct and thereby avoid a potential blocking allocation with the proc lock held. This may need revisiting, perhaps keeping a spare uidinfo allocated per process to handle this situation or re-examining if the proc lock needs to be held over the entire operation of changing real or effective user id. Submitted by: Don Lewis <dl-freebsd@catspoiler.org>	2002-06-19 06:39:25 +00:00
alfred	90955db2d9	setsugid() touches p->p_flag so assert that the proc is locked.	2002-06-18 22:41:35 +00:00
tanimura	cb3347e926	Remove so*_locked(), which were backed out by mistake.	2002-06-18 07:42:02 +00:00
mux	49532dbc77	Change vfs_copyopt() so that the length argument passed to it must be the exact same size as the mount option. This makes vfs_copyopt() much more useful.	2002-06-14 20:04:21 +00:00
bmilekic	86f2253b6a	Set system_map for both mbuf_map and clust_map to 1, in mbuf_init(). Submitted by: Tor Egge (tegge) Pointed out to me by: hsu	2002-06-13 23:53:42 +00:00
rwatson	c281a7f80e	Regen.	2002-06-13 23:44:50 +00:00
rwatson	8ddeab67a5	Keep POSIX.1e capabilities system call placeholders, but remove definitions.	2002-06-13 23:43:53 +00:00
rwatson	b5791a138c	kern_cap.c no longer needed.	2002-06-13 23:19:34 +00:00
rwatson	4ad90dd62b	opt_cap.c no longer needed	2002-06-13 23:17:39 +00:00
kbyanc	052b70fe67	Make nselcol, the number of select collisions since boot, unsigned as negative collisions simply doesn't make sense. PR: (one small part of) 19720 Approved by: alfred	2002-06-12 02:08:18 +00:00
kbyanc	a46c40d9e1	Time counter stats are unsigned, advertise them to sysctl(8) that way. PR: (one small part of) 19720 Approved by: phk	2002-06-11 19:47:44 +00:00
kbyanc	4bf495ef47	Convert hit and miss counters to unsigned values. Surely negative values for either does not make sense. PR: (one small part of) 19720	2002-06-10 22:40:26 +00:00
jhb	1a2a2fa24a	We no longer need to acqure Giant in ast() for ktrpsig() in postsig() now that ktrace no longer needs Giant.	2002-06-07 05:43:40 +00:00
jhb	11b212e025	- trapsignal() no longer needs to acquire Giant for ktrpsig(). - Catch up to new ktrace API.	2002-06-07 05:43:02 +00:00
jhb	b83763b249	- Proper locking for p_tracep and p_traceflag. - Catch up to new ktrace API.	2002-06-07 05:42:25 +00:00
jhb	eb29fde68b	Properly lock accesses to p_tracep and p_traceflag. Also make a few ktrace-only things #ifdef KTRACE that were not before.	2002-06-07 05:41:27 +00:00
jhb	fd3d90c2c8	- Catch up to new ktrace API. - ktrace trace points in msleep() and cv_wait() no longer need Giant.	2002-06-07 05:39:16 +00:00
jhb	fbebc83b5b	Catch up to changes in ktrace API.	2002-06-07 05:37:18 +00:00
jhb	ab80d12ef1	Overhaul the ktrace subsystem a bit. For the most part, the actual vnode operations to dump a ktrace event out to an output file are now handled asychronously by a ktrace worker thread. This enables most ktrace events to not need Giant once p_tracep and p_traceflag are suitably protected by the new ktrace_lock. There is a single todo list of pending ktrace requests. The various ktrace tracepoints allocate a ktrace request object and tack it onto the end of the queue. The ktrace kernel thread grabs requests off the head of the queue and processes them using the trace vnode and credentials of the thread triggering the event. Since we cannot assume that the user memory referenced when doing a ktrgenio() will be valid and since we can't access it from the ktrace worker thread without a bit of hassle anyways, ktrgenio() requests are still handled synchronously. However, in order to ensure that the requests from a given thread still maintain relative order to one another, when a synchronous ktrace event (such as a genio event) is triggered, we still put the request object on the todo list to synchronize with the worker thread. The original thread blocks atomically with putting the item on the queue. When the worker thread comes across an asynchronous request, it wakes up the original thread and then blocks to ensure it doesn't manage to write a later event before the original thread has a chance to write out the synchronous event. When the original thread wakes up, it writes out the synchronous using its own context and then finally wakes the worker thread back up. Yuck. The sychronous events aren't pretty but they do work. Since ktrace events can be triggered in fairly low-level areas (msleep() and cv_wait() for example) the ktrace code is designed to use very few locks when posting an event (currently just the ktrace_mtx lock and the vnode interlock to bump the refcoun on the trace vnode). This also means that we can't allocate a ktrace request object when an event is triggered. Instead, ktrace request objects are allocated from a pre-allocated pool and returned to the pool after a request is serviced. The size of this pool defaults to 100 objects, which is about 13k on an i386 kernel. The size of the pool can be adjusted at compile time via the KTRACE_REQUEST_POOL kernel option, at boot time via the kern.ktrace_request_pool loader tunable, or at runtime via the kern.ktrace_request_pool sysctl. If the pool of request objects is exhausted, then a warning message is printed to the console. The message is rate-limited in that it is only printed once until the size of the pool is adjusted via the sysctl. I have tested all kernel traces but have not tested user traces submitted by utrace(2), though they should work fine in theory. Since a ktrace request has several properties (content of event, trace vnode, details of originating process, credentials for I/O, etc.), I chose to drop the first argument to the various ktrfoo() functions. Currently the functions just assume the event is posted from curthread. If there is a great desire to do so, I suppose I could instead put back the first argument but this time make it a thread pointer instead of a vnode pointer. Also, KTRPOINT() now takes a thread as its first argument instead of a process. This is because the check for a recursive ktrace event is now per-thread instead of process-wide. Tested on: i386 Compiles on: sparc64, alpha	2002-06-07 05:32:59 +00:00
jhb	165d918ce2	Change the all locks list from a STAILQ to a TAILQ. This bloats struct lock_object by another pointer (though all of lock_object should be conditional on LOCK_DEBUG anyways) in exchange for an O(1) TAILQ_REMOVE() in witness_destroy() (called for every mtx_destroy() and sx_destroy()) instead of an O(n) STAILQ_REMOVE. Since WITNESS is so dog slow as it is, the speed-up is worth the space cost. Suggested by: iedowse	2002-06-06 20:51:04 +00:00
davidc	b44a13481e	s/!SIGNOTEMPY/SIGISEMPTY/ Reviewed by: marcel, jhb, alfred	2002-06-06 19:12:41 +00:00
jhb	de3e290d8f	Handle "dead" witnesses better in the situation of several short term locks being created and destroyed without a single long-term one around to ensure the witness associated with that group of locks stays alive. The pipe mutexes are an example of this group. For a dead witness we no longer clear the witness name. Instead, when looking up the witness for a lock, if a dead witness' (a witness with a refcount of 0) w_name pointer is identical to the witness name of the lock then we revive that witness instead of using a new witness for the lock. This results in far fewer dead witness objects and also better preserves locking orders over the long term resulting in more correct lock order checking. Note that we can't ever derefence w_name of a dead witness since we don't know if the string it is pointing to has been free()'d or kldunload()'d out from under us.	2002-06-06 19:04:38 +00:00
des	936333132d	Move some sysctls from the debug tree to the vfs tree.	2002-06-06 15:50:22 +00:00
des	8aef2ace20	Gratuitous whitespace cleanup.	2002-06-06 15:46:38 +00:00
phk	b98dc1ffce	Use "bwrbg" as description when we sleep for background writing, "biord" was misleading in every possible way.	2002-06-06 08:56:10 +00:00
bde	f55264a991	Fixed overflow in the bounds checking in dscheck(). It assumed that daadr_t is no larger than a long, and some other relatively harmless things (blush). Overflow for subtracting a daddr_t from a u_long caused "truncation" of the i/o for attempts to access blocks beyond the end of the actually cause expansion of the i/o to a preposterous size.	2002-06-06 00:35:07 +00:00
jhb	4a77bedabf	Replace thread_runnable() with thread_running() as the latter is more accurate. Suggested by: julian	2002-06-04 22:36:24 +00:00
jhb	408adb7287	Optimize the adaptive mutex spin a bit. Use a simple while loop with simple reads (and on IA32, a "pause" instruction for each interation of the loop) to spin until either the mutex owner field changes, or the lock owner stops executing. Suggested by: tanimura Tested on: i386	2002-06-04 21:53:48 +00:00
jhb	1ba6786436	Add a private thread_runnable() macro to make the code more readable and make the KSE diff easier to maintain.	2002-06-04 21:50:02 +00:00
des	7464466a40	ANSIfy the one remaining K&R function.	2002-06-02 21:57:28 +00:00
des	f58932ded5	Whitespace nits.	2002-06-02 21:55:58 +00:00
des	a79d7499e2	Add support for 'j' flag. Simplify the size modifier code and reduce code duplication. Also add support for 'n' specifier. Reviewed by: bde	2002-06-02 21:54:55 +00:00
schweikh	28bcbfe85d	Fix typo in the BSD copyright: s/withough/without/ Spotted and suggested by: des MFC after: 3 weeks	2002-06-02 20:05:59 +00:00
mike	1b681bdeaa	Add POSIX.1-2001 WCONTINUED option for waitpid(2). A proc flag (P_CONTINUED) is set when a stopped process receives a SIGCONT and cleared after it has notified a parent process that has requested notification via waitpid(2) with WCONTINUED specified in its options operand. The status value can be checked with the new WIFCONTINUED() macro. Reviewed by: jake	2002-06-01 18:37:46 +00:00
archie	f138a74bc8	Fix a bug in m_split(): the "m->m_ext.ext_size" field of an mbuf was being set to zero. This field indicates the total space in the external buffer and therefore should not be modified after the external buffer is added. Add a comment warning that the mbufs returned by m_split() might be read-only. Fix M_TRAILINGSPACE() to return zero if !M_WRITABLE(m). Reviewed by: freebsd-net Obtained from: Vernier Networks, Inc. MFC after: 1 week	2002-05-31 22:09:57 +00:00
des	ad1dca67b1	Nit: kern.ttys is of type S,xtty, not S,tty.	2002-05-31 16:11:49 +00:00
tanimura	e6fa9b9e92	Back out my lats commit of locking down a socket, it conflicts with hsu's work. Requested by: hsu	2002-05-31 11:52:35 +00:00
robert	c2b2e9b3bc	- Replace the bandaid introduced in revision 1.110 with a better solution. - Add braces for a ``for'' statement containing a single multi-line statement.	2002-05-31 09:41:09 +00:00
phk	53143bb2c1	Mistyped and lost a '&' in previous commit.	2002-05-30 16:26:39 +00:00
phk	559ad51949	Don't forget to factor in the boottime when we calculate PPS timestamps. Submitted by: Akira Watanabe <akira@myaw.ei.meisei-u.ac.jp>	2002-05-30 10:34:01 +00:00
jeff	feec324370	Record the file, line, and pid of the last successful shared lock holder. This is useful as a last effort in debugging file system deadlocks. This is enabled via 'options DEBUG_LOCKS'	2002-05-30 05:55:22 +00:00
julian	304195369e	CURSIG() is not a macro so rename it cursig(). Obtained from: KSE tree	2002-05-29 23:44:32 +00:00
julian	200eddc848	diff reduction from KSE to keep WW-III from happenning on -current	2002-05-29 20:40:50 +00:00
des	577e468b90	Add some checks to prevent NULL dereferences. Submitted by: jhay	2002-05-28 14:29:56 +00:00
mux	8ba975c439	Remove a duplicated vfs_freeopts() that I introduced in last revision.	2002-05-28 13:27:55 +00:00
des	35f5a040c8	Add NAI copyright.	2002-05-28 06:53:41 +00:00
marcel	58435e6cb7	Add uuidgen(2) and uuidgen(1). The uuidgen command, by means of the uuidgen syscall, generates one or more Universally Unique Identifiers compatible with OSF/DCE 1.1 version 1 UUIDs. From the Perforce logs (change 11995): Round of cleanups: o Give uuidgen() the correct prototype in syscalls.master o Define struct uuid according to DCE 1.1 in sys/uuid.h o Use struct uuid instead of uuid_t. The latter is defined in sys/uuid.h but should not be used in kernel land. o Add snprintf_uuid(), printf_uuid() and sbuf_printf_uuid() to kern_uuid.c for use in the kernel (currently geom_gpt.c). o Rename the non-standard struct uuid in kern/kern_uuid.c to struct uuid_private and give it a slightly better definition for better byte-order handling. See below. o In sys/gpt.h, fix the broken uuid definitions to match the now compliant struct uuid definition. See below. o In usr.bin/uuidgen/uuidgen.c catch up with struct uuid change. A note about byte-order: The standard failed to provide a non-conflicting and unambiguous definition for the binary representation. My initial implementation always wrote the timestamp as a 64-bit little-endian (2s-complement) integral. The clock sequence was always written as a 16-bit big-endian (2s-complement) integral. After a good nights sleep and couple of Pan Galactic Gargle Blasters (not necessarily in that order :-) I reread the spec and came to the conclusion that the time fields are always written in the native by order, provided the the low, mid and hi chopping still occurs. The spec mentions that you "might need to swap bytes if you talk to a machine that has a different byte-order". The clock sequence is always written in big-endian order (as is the IEEE 802 address) because its division is resulting in bytes, making the ordering unambiguous.	2002-05-28 06:16:08 +00:00
marcel	e2eeb62542	Add syscall uuidgen() for generating Univerally Unique Identifiers (UUIDs). On ia64 UUIDs, aka GUIDs, are used by EFI and the firmware among others. To create GUID Partition Tables (GPTs), we need to be able to generate UUIDs.	2002-05-28 05:58:06 +00:00
des	e332aae785	Introduce struct xtty, used when exporting tty information to userland. Make kern.ttys export a struct xtty rather than struct tty. Since struct tty is no longer exposed to userland, remove the dev_t / udev_t hack. Sponsored by: DARPA, NAI Labs	2002-05-28 05:40:53 +00:00
alc	afb615dae0	o Remove some unnecessary casting from and add some necessary casting to aio_suspend() and lio_listio(). Submitted by: bde	2002-05-25 18:39:42 +00:00
des	324a67fe9d	ANSIfy (significant portions were already partly ANSIfied)	2002-05-25 15:52:53 +00:00
des	94fe5108ff	Remove register.	2002-05-25 15:44:38 +00:00
des	f1297851a7	Automated whitespace cleanup.	2002-05-25 15:43:06 +00:00
jake	88bdee3b2f	Make the run queue parameters machine dependent. Optimize 64 bit architectures by using a 64 bit word for the bit array which keeps track of non-empty queues. Reviewed by: peter	2002-05-25 01:12:23 +00:00
peter	c952c3ce19	Fix warnings. Also, removed an unused variable that I found that was just initialized and never used afterwards.	2002-05-24 06:06:18 +00:00
mux	334d1908ec	Style nit, no functional changes.	2002-05-23 23:22:22 +00:00
mux	67080508a8	Slightly change the way we pass mount options to the filesystem VFS_NMOUNT operations. Reviewed by: phk	2002-05-23 23:02:19 +00:00
ume	a58ed55860	In m_aux_delete, no need to chase beyond victim. Submitted by: archie Obtained from: KAME	2002-05-23 15:59:48 +00:00
jhb	bd383063f6	Minor nit: get p pointer in msleep() from td->td_proc (where td == curthread) rather than from curproc.	2002-05-23 04:14:18 +00:00
jhb	2d4c041eb3	Whitespace: trim a trailing tab.	2002-05-23 04:12:28 +00:00
des	2fda28e6ab	Make the counters uintmax_ts, and use %ju rather than %llu.	2002-05-23 03:08:42 +00:00
jhb	096c0249dc	Rename pause() to ia32_pause() so it doesn't conflict with the pause() function defined in <unistd.h>. I didn't #ifdef _KERNEL it because the mutex implementation in libpthread will probably need this.	2002-05-22 20:32:39 +00:00
jhb	2f66cc911b	Rename cpu_pause() to pause(). Originally I was going to make this an MI API with empty cpu_pause() functions on other arch's, but this functionality is definitely unique to IA-32, so I decided to leave it as i386-only and wrap it in #ifdef's. I should have dropped the cpu_ prefix when I made that decision. Requested by: bde	2002-05-22 13:19:22 +00:00
jhb	3b7890a56f	Add appropriate IA32 "pause" instructions to improve performanec on Pentium 4's and newer IA32 processors. The "pause" instruction has been verified by Intel to be a NOP on all currently existing IA32 processors prior to the Pentium 4.	2002-05-21 22:26:35 +00:00
arr	8f86bf993e	- td will never be NULL, so the call to soalloc() in socreate() will always be passed a 1; we can, however, use M_NOWAIT to indicate this. - Check so against NULL since it's a pointer to a structure.	2002-05-21 21:30:44 +00:00
jhb	0ceb358d5c	Fix an old cut 'n' paste bug inherited from BSD/OS: don't increment 'i' twice once we are in the long wait stage of spinning on a spin mutex.	2002-05-21 21:27:05 +00:00
arr	8bb819d225	- OR the flag variable with M_ZERO so that the uma_zalloc() handles the zero'ing out of the allocated memory. Also removed the logical bzero that followed.	2002-05-21 21:18:41 +00:00
jhb	6190f4162b	Whitespace fixup, properly indent the body of an else clause.	2002-05-21 21:13:27 +00:00
jhb	d3398f2f58	Add code to make default mutexes adaptive if the ADAPTIVE_MUTEXES kernel option is used (not on by default). - In the case of trying to lock a mutex, if the MTX_CONTESTED flag is set, then we can safely read the thread pointer from the mtx_lock member while holding sched_lock. We then examine the thread to see if it is currently executing on another CPU. If it is, then we keep looping instead of blocking. - In the case of trying to unlock a mutex, it is now possible for a mutex to have MTX_CONTESTED set in mtx_lock but to not have any threads actually blocked on it, so we need to handle that case. In that case, we just release the lock as if MTX_CONTESTED was not set and return. - We do not adaptively spin on Giant as Giant is held for long times and it slows SMP systems down to a crawl (it was taking several minutes, like 5-10 or so for my test alpha and sparc64 SMP boxes to boot up when they adaptively spinned on Giant). - We only compile in the code to do this for SMP kernels, it doesn't make sense for UP kernels. Tested on: i386, alpha, sparc64	2002-05-21 20:47:11 +00:00
jhb	fd74bc1d8e	Optimize spin mutexes for UP kernels without debugging to just enter and exit critical sections. We only contest on a spin mutex on an SMP kernel running on an SMP machine.	2002-05-21 20:34:28 +00:00
jhb	a4a680304c	In witness_unlock(), when updating a lock list entry bucket, decrement the count of lock list entries after we fixup the bucket of lock list entries. In theory we can remove the intr_disable/intr_restore() calls now.	2002-05-20 19:16:22 +00:00
jake	dca97f2341	Add a bandaid so that sysctl kern.malloc works on sparc64.	2002-05-20 18:29:37 +00:00
jhb	4423d1f90a	- Allow witness_sleep() to be called when witness hasn't been initialized yet. We just return without performing any checks. - Don't explicitly enter and exit critical sections when walking lock lists. We don't need a critical section to walk the list of sleep locks for a thread. We check to see if a spin lock list is empty before we walk it. If the list is empty we don't need to walk it. If it isn't then we already hold at least one spin lock and are already in a critical section and thus don't need our own explicit critical section.	2002-05-20 17:49:46 +00:00
jhb	bb678d578d	Fix the td_intr_nesting_level check to work ok if a flag like M_ZERO is passed in with M_WAITOK to malloc().	2002-05-20 17:46:57 +00:00
silby	85e17a3398	Subtle fix to the accept filter LRU code. In some cases, a newly initialized socket with no qlimit was being passed in. In order to handle this case properly, we must not use >= when comparing queue sizes to qlimit. As a result of this improper handling, a panic could result in certain cases. PR: 38325 MFC after: 3 days	2002-05-20 17:34:31 +00:00
mux	85aa3f836d	Change two vput() that should have been vrele(). Submitted by: iedowse	2002-05-20 14:59:43 +00:00
tanimura	92d8381dd5	Lock down a socket, milestone 1. o Add a mutex (sb_mtx) to struct sockbuf. This protects the data in a socket buffer. The mutex in the receive buffer also protects the data in struct socket. o Determine the lock strategy for each members in struct socket. o Lock down the following members: - so_count - so_options - so_linger - so_state o Remove *_locked() socket APIs. Make the following socket APIs touching the members above now require a locked socket: - sodisconnect() - soisconnected() - soisconnecting() - soisdisconnected() - soisdisconnecting() - sofree() - soref() - sorele() - sorwakeup() - sotryfree() - sowakeup() - sowwakeup() Reviewed by: alfred	2002-05-20 05:41:09 +00:00
marcel	1f1f792674	All signals can be sent to the inferior process when it's restarted, not just the legacy ones. PR: 33299 Submitted by: Alexander N. Kabaev <ak03@gte.com>	2002-05-19 01:37:43 +00:00
jhb	b6d6774e76	Change p_can{debug,see,sched,signal}()'s first argument to be a thread pointer instead of a proc pointer and require the process pointed to by the second argument to be locked. We now use the thread ucred reference for the credential checks in p_can*() as a result. p_canfoo() should now no longer need Giant.	2002-05-19 00:14:50 +00:00
jhb	7df8f89185	Now that daddr_t has grown up, use %lld to printf it and cast it to long long.	2002-05-18 23:46:04 +00:00
phk	c506e4337e	Use btodb() macro. Sponsored by: DARPA & NAI Labs.	2002-05-18 09:34:09 +00:00
eric	4579e1dcd0	Separate "seperate" from kernel source.	2002-05-16 22:43:20 +00:00
trhodes	28d42899b7	More s/file system/filesystem/g	2002-05-16 21:28:32 +00:00
mux	84d9baf797	o Fix vfs_copyopt(), the first argument to bcopy() is the source, not the destination. o Remove some code from vfs_getopt() which was making the interface more complicated to use for a very slight gain.	2002-05-16 17:09:41 +00:00
rwatson	61d5a9043f	p_cansignal() returns an errno value; at some point, the check for inter-process signalling ceased to preserve and return that value, instead always returning EPERM. This meant that it was possible to "probe" the pid space for processes that were not otherwise visible. This change reverts that reversion. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-05-14 23:07:15 +00:00
jeff	ba85b0e087	Disable the shared locking namei() code for now. It breaks several stacking filesystems. This is on hold until the rest of VFS Locking is reviewed and deemed safe. It can be enabled with 'options LOOKUP_SHARED'.	2002-05-14 21:59:49 +00:00
des	f2d1d92921	Remove a printf(3) argument with no corresponding format specifier.	2002-05-14 18:28:06 +00:00
phk	8536ea3cdb	Make daddr_t and u_daddr_t 64bits wide. Retire daddr64_t and use daddr_t instead. Sponsored by: DARPA & NAI Labs.	2002-05-14 11:09:43 +00:00
phk	02fe70f68e	Retire the bogus uses of the disklabel field d_sbsize and begin to initialize it to zero so we don't have to have everbody and their aunt including FFS specific header files. Sponsored by: DARPA & NAI Labs.	2002-05-12 20:49:41 +00:00
marcel	6683d5d11c	Fix alpha build. The alpha has dumpsys implemented. While here, revert the condition to list the machines for which dumpsys has not been implemented. Reported by: wilko	2002-05-12 18:27:28 +00:00
silby	f3419e2e8f	Change the mbuf exhaustion warning message to match the message in -stable.	2002-05-09 20:21:07 +00:00
mini	b6d1cd6b33	Remove trace_req(). Reviewed by: alfred, jhb, peter	2002-05-09 04:13:41 +00:00
alc	eff7d93533	o Correct an error made in revision 1.65: In readv(), if uap->iovcnt is out-of-range, drop the file reference before returning. (This error also exists in the RELENG_4 branch.) o Eliminate the acquisition and release of Giant in readv() now that malloc() and free() are callable without Giant.	2002-05-09 02:30:41 +00:00
alfred	8de609e473	expand_name fixes: .) don't use MAXPATHLEN + 1, fix logic to compensate. .) style(9) function parameters. .) fix line wrapping. .) remove duplicated error and string handling code. .) don't NUL terminate already NUL terminated string. .) all string length variables changed from int to size_t. .) constify variables. .) catch when corename would be truncated. .) cast pid_t and uid_t args for format string. .) add parens around return arguments. Help and suggestions from: bde	2002-05-08 09:06:47 +00:00
jake	4b2b9b41e7	Remove runq_findproc. This never worked right in the first place and can be prohibitively expensive.	2002-05-08 04:39:49 +00:00
alfred	c4da65d875	M_ZERO the temp buffer in expand_name() otherwise if an error occurs while logging we may pass a non NUL terminated string to log(9) for a %s format arg.	2002-05-07 23:37:07 +00:00
peter	890d39a38c	Re-remove kern_random.c and svr4_signal.c. Somehow dillon managed to keep on committing to these while they were in the Attic after they had been removed. I think this was because he had the file checked out and already 'modified' while markm cvs rm'ed them, and cvs screws up when trying to "merge" the modifications with the "rm". And after that the client state was sufficiently hosed to keep it messed up. Yay CVS! (CVS is very fragile for adding and removing files remotely) The existence of these files was pointed out by: ru	2002-05-07 21:54:47 +00:00
tanimura	9070f27e7d	Do not forget to increase the number of completely connected sockets in soisconnected_locked(). Forgotten by: tanimura	2002-05-07 16:17:44 +00:00
jeff	74069a30ee	Switch from just holding the interlock to holding the standard lock throughout getnewvnode(). This is safer. In the future, we should investigate requiring only the interlock to get the vnode object.	2002-05-07 02:44:06 +00:00
alfred	d1e340364b	Make funsetown() take a 'struct sigio **' so that the locking can be done internally. Ensure that no one can fsetown() to a dying process/pgrp. We need to check the process for P_WEXIT to see if it's exiting. Process groups are already safe because there is no such thing as a pgrp zombie, therefore the proctree lock completely protects the pgrp from having sigio structures associated with it after it runs funsetownlst. Add sigio lock to witness list under proctree and allproc, but over proc and pgrp. Seigo Tanimura helped with this.	2002-05-06 19:31:28 +00:00
jhb	1641885111	When checking to see if the init process calls exit1(), compare p to the initproc proc pointer instead of checking to see if the pid is 1. Submitted by: bde	2002-05-06 17:07:10 +00:00

1 2 3 4 5 ...

5095 Commits