freebsd-dev

Author	SHA1	Message	Date
Robert Watson	4499985ef2	Reserve system call numbers for the following system calls: __mac_get_pid Retrieve MAC label of a process by pid Similar to __mac_get_proc() except that the target process of the operation is explicitly specified rather than assuming curthread. __mac_get_link Retrieve MAC label of a path with NOFOLLOW __mac_set_link Set MAC label of a path with NOFOLLOW extattr_set_link Set EAs on a path with NOFOLLOW extattr_get_link Retrieve EAs on a path with NOFOLLOW extattr_delete_link Delete EAs on a path with NOFOLLOW These calls are similar to __mac_get_file(), __mac_set_file(), extattr_set_file(), extattr_get_file(), and extattr_delete_file(), except that they do not follow symlinks. The distinction between these calls is similar to lchown() vs chown(). Implementations to follow. Obtained from: TrustedBSD Project Sponsored by: DARPA, Network Associates Laboratories	2002-10-01 02:35:59 +00:00
Juli Mallett	a88b260a86	Back out code changes that snuck into the previous forced commit.	2002-10-01 00:16:17 +00:00
Juli Mallett	226e1171e1	(Forced commit, to clarify previous commit of ksiginfo/signal queue code.) I've added a structure, kernel-private, to represent a pending or in-delivery signal, called `ksiginfo'. It is roughly analogous to the basic information that is exported by the POSIX interface 'siginfo_t', but more basic. I've added functions to allocate these structures, and further to wrap all signal operations using them. Once the operations are wrapped, I've added a TailQ (see queue(3)) of these structures to 'struct proc', and all pending signals are in that TailQ. When a signal is being delivered, it is dequeued from the list. Once I finish the spreading of ksiginfo throughout the tree, the dequeued structure will be delivered to the process in question, whereas currently and normally, the signal number is what is used.	2002-10-01 00:07:28 +00:00
John Baldwin	dc183990ca	- Add a new per-process flag PS_XCPU to indicate that at least one thread has exceeded its CPU time limit. - In mi_switch(), set PS_XCPU when the CPU time limit is exceeded. - Perform actual CPU time limit exceeded work in ast() when PS_XCPU is set. Requested by: many	2002-09-30 21:13:54 +00:00
John Baldwin	f4cd8f9ff4	Change p_cpulimit to be in seconds instead of microseconds. Since p_runtime now is a bintime, it is no longer an optimization to store p_cpulimit as microseconds. Suggested by: phk	2002-09-30 21:08:38 +00:00
Robert Watson	0626774f08	Move vnode MAC label initialization to after the release of the vnode interlock in getnewvnode() to avoid possible sleeps while holding the mutex. Note that the warning from Witness is a slight false positive since we know there will be no contention on the interlock since we haven't made the vnode available for use yet, but the theory is not a bad one. Obtained from: TrustedBSD Project Sponsored by: DARPA, Network Associates Laboratories	2002-09-30 20:51:48 +00:00
Robert Watson	c031391bd5	Add tunables for the existing sysctl twiddles for pipe and vm enforcement so they can be disabled prior to kernel start. Obtained from: TrustedBSD Project Sponsored by: DARPA, Network Associates Laboratories	2002-09-30 20:50:00 +00:00
Juli Mallett	1226f694e6	First half of implementation of ksiginfo, signal queues, and such. This gets signals operating based on a TailQ, and is good enough to run X11, GNOME, and do job control. There are some intricate parts which could be more refined to match the sigset_t versions, but those require further evaluation of directions in which our signal system can expand and contract to fit our needs. After this has been in the tree for a while, I will make in kernel API changes, most notably to trapsignal(9) and sendsig(9), to use ksiginfo more robustly, such that we can actually pass information with our (queued) signals to the userland. That will also result in using a struct ksiginfo pointer, rather than a signal number, in a lot of kern_sig.c, to refer to an individual pending signal queue member, but right now there is no defined behaviour for such. CODAFS is unfinished in this regard because the logic is unclear in some places. Sponsored by: New Gold Technology Reviewed by: bde, tjr, jake [an older version, logic similar]	2002-09-30 20:20:22 +00:00
Poul-Henning Kamp	50c2233141	Plug memory leaks. Detected by: FlexeLint Approved by: jhb	2002-09-30 19:19:47 +00:00
Julian Elischer	2735483034	uh, commit all of the patch	2002-09-29 23:28:58 +00:00
Julian Elischer	e081731767	commit the version I actually tested.. Submitted by: davidxu	2002-09-29 23:23:25 +00:00
Julian Elischer	9eb1fdea37	Implement basic KSE loaning. This stops a hread that is blocked in BOUND mode from stopping another thread from completing a syscall, and this allows it to release its resources etc. Probably more related commits to follow (at least one I know of) Initial concept by: julian, dillon Submitted by: davidxu	2002-09-29 23:04:34 +00:00
David E. O'Brien	21b68415cd	Fix style nit where conditionally compiled code was unconditionalized, but style(9) was consulted. Submitted by: bde	2002-09-29 04:47:41 +00:00
Julian Elischer	0cd3964f6d	lock proc while calling psignal (plus related cleanups) Submitted by: davidxu	2002-09-29 02:48:37 +00:00
Poul-Henning Kamp	651dde1b81	Move includ of <sys/bus_priate.h> later to get semantic identity of device_t the same throughout kernel. This is a very fine point of C which fortunatly does not make any difference in normal circumstances but which due to the pervasiveness of device_t in the kernel can make a lint barf a lot.	2002-09-28 21:38:35 +00:00
Poul-Henning Kamp	2f9752e95e	Change a return to a break so the local buffers get properly freeed. Spotte by: FlexeLint Reviewed by: rwatson	2002-09-28 21:34:31 +00:00
Poul-Henning Kamp	53cc479393	Remove unused includes. Clarify the intention of a while(); Move a local variable to avoid potential name-confusion.	2002-09-28 17:46:30 +00:00
Poul-Henning Kamp	37c841831f	Be consistent about "static" functions: if the function is marked static in its prototype, mark it static at the definition too. Inspired by: FlexeLint warning #512	2002-09-28 17:15:38 +00:00
Poul-Henning Kamp	54286a04c5	Correctly order VI_UNLOCK(), local variables and block comment.	2002-09-28 12:15:44 +00:00
Julian Elischer	165d2b993c	Rewrite the kse_create() function to better aproach the semantics we have specified in the design.	2002-09-28 08:44:31 +00:00
Jake Burkholder	169d513cb4	Add a workaround for what seems to be confusion between binutils and the sparc v9 ABI. The Elf_Rela records for local symbols appear to already have the symbol's value added in to the addend field, even though the ABI specifies we need to lookup the symbol and add its value too. This breaks text relocations in klds because the symbol's value is added twice, and the resulting address points off into nowhere land, so for now just use the addend. Tested by: rwatson	2002-09-27 23:12:53 +00:00
Poul-Henning Kamp	ca916247cd	Rename struct specinfo to the more appropriate struct cdev. Agreed on: jake, rwatson, jhb	2002-09-27 18:27:10 +00:00
Julian Elischer	3d0586d4f2	Redo how completing threads pass their state to userland if they are not going to cross over themselves. Also change how the list of completed user threads is tracked and passed to the KSE. This is not a change in design but rather the implementation of what was originally envisionned.	2002-09-27 07:11:11 +00:00
Poul-Henning Kamp	3c275c19c4	Under DIAGNOSTIC, complain if ENOIOCTL leaks out through VOP_IOCTL().	2002-09-26 21:21:13 +00:00
Poul-Henning Kamp	089cf428da	Make biowait() check bio_error before the BIO_ERROR flag, to propery catch internal GEOM use of bio_error. Sponsored by: DARPA & NAI Labs.	2002-09-26 16:32:14 +00:00
Jeff Roberson	a414302f90	- Export the alq daemon thread pointer. - Don't log ktr events from the alq daemon.	2002-09-26 07:38:56 +00:00
Jeff Roberson	6423c9433c	- Move ASSERT_VOP_LOCK functionality into functions in vfs_subr.c - Make the VI asserts more orthogonal to the rest of the asserts by using a new, common vfs_badlock() function and adding a 'str' arg. - Adjust generated ASSERTS to match the new prototype. - Adjust explicit ASSERTS to match the new prototype.	2002-09-26 04:48:44 +00:00
Jeff Roberson	6fc15f9bdf	- We don't need any automated lock checking for vop_islocked.	2002-09-26 00:31:16 +00:00
Archie Cobbs	89def71cbd	Make the following name changes to KSE related functions, etc., to better represent their purpose and minimize namespace conflicts: kse_fn_t -> kse_func_t struct thread_mailbox -> struct kse_thr_mailbox thread_interrupt() -> kse_thr_interrupt() kse_yield() -> kse_release() kse_new() -> kse_create() Add missing declaration of kse_thr_interrupt() to <sys/kse.h>. Regenerate the various generated syscall files. Minor style fixes. Reviewed by: julian	2002-09-25 18:10:42 +00:00
Bruce Evans	ac0653dcc8	Round up instead of towards 0 in clock_getres() so that a resolution of 0 is never returned. PR: 41781 MFC after: 3 days	2002-09-25 12:00:38 +00:00
Jeff Roberson	6cb8bf2027	- Lock down the syncer with sync_mtx. - Enable vfs_badlock_mutex by default. - Assert that the vp is locked in VOP_UNLOCK. - Use standard interlock macros in remaining code. - Correct a race in getnewvnode(). - Lock access to v_numoutput with interlock. - Lock access to buf lists and splay tree with interlock. - Add VOP and VI asserts. - Lock b_vnbufs with the vnode interlock. - Add vrefcnt() for callers who want to retreive the vnode ref without holding a lock. Add a comment that describes when this is safe. - Add vholdl() and vdropl() so that callers who already own the interlock can avoid race conditions and unnecessary unlocking. - Move the VOP_GETATTR() in vflush() into the WRITECLOSE conditional case. - Hold the interlock before droping the mntlist_mtx in vflush() to avoid a race. - Fix locking in vfs_msync().	2002-09-25 02:22:21 +00:00
Jeff Roberson	d40a8125f5	- Properly lock v_vflags in getdirents().	2002-09-25 02:13:38 +00:00
Jeff Roberson	d64370cb30	- Use incore() where no other interlock locking is necessary. - Lock access to numoutput.	2002-09-25 02:12:32 +00:00
Jeff Roberson	b7227b7712	- Lock accesses to v_numoutput. - Lock calls to gbincore.	2002-09-25 02:11:37 +00:00
Jeff Roberson	609058e884	- Don't protect mountedhere with the vn interlock. - Protect mountedhere with the vn lock.	2002-09-25 01:44:21 +00:00
Jeff Roberson	3cc511c528	- Use the standard vp interlock macros.	2002-09-25 01:42:24 +00:00
Julian Elischer	ed32df81e8	Don't use local variable 'p' in a debug statement.. we removed it.	2002-09-23 14:06:12 +00:00
Julian Elischer	10b33e6b2c	oops don't do dthe copy range in a new KSE. There isn't one any more.	2002-09-23 14:01:01 +00:00
Julian Elischer	253fdd5ba9	slightly clean up the thread_userret() and thread_consider_upcall() calls. also some slight changes for TDF_BOUND testing and small style changes Should ONLY affect KSE programs Submitted by: davidxu	2002-09-23 06:14:30 +00:00
Julian Elischer	acb460624e	Add code to create > 1 KSe per process. (support code not yet complete) Submitted by: davidxu	2002-09-23 06:10:24 +00:00
Julian Elischer	33c06e1d3e	Indentation does not define a block.. you need breces {} as well.. also add a mutex assert. (threaded path only) Submitted by: davidxu	2002-09-23 05:27:30 +00:00
Jeff Roberson	9e9256e252	- Hold the credential of the caller and use it in all subsequent vn ops. - Get rid of the ill conceived aq_td field. Suggested by: rwatson	2002-09-23 05:20:00 +00:00
Jeff Roberson	abee588b36	- Add support for logging KTR via ALQ. This is optional and enabled by the KTR_ALQ config option.	2002-09-22 07:13:45 +00:00
Jeff Roberson	c76e20451c	- Tell witness about ALQ's spin lock.	2002-09-22 07:11:57 +00:00
Jeff Roberson	9405072a95	- Add an asynchronous fixed length record logging mechanism called ALQ (Asynch. Logging Queues). ALQ supports many seperate queues with different record and buffer sizes. It opens and logs to any vnode so it can be used with character devices as well as regular files. Reviewed in part by: phk, jake, markm	2002-09-22 07:11:14 +00:00
Jake Burkholder	98f93c07a5	Removed unneeded include (missed in last revision).	2002-09-22 06:05:23 +00:00
Jake Burkholder	e3b6e33c07	Moved netisr code from kern/kern_intr.c to net/netisr.c as threatened in a comment.	2002-09-22 05:56:41 +00:00
Jake Burkholder	05ba50f522	Use the fields in the sysentvec and in the vm map header in place of the constants VM_MIN_ADDRESS, VM_MAXUSER_ADDRESS, USRSTACK and PS_STRINGS. This is mainly so that they can be variable even for the native abi, based on different machine types. Get stack protections from the sysentvec too. This makes it trivial to map the stack non-executable for certain abis, on machines that support it.	2002-09-21 22:07:17 +00:00
Poul-Henning Kamp	66cdbc28d0	Assert my copyright on this file (using the default 2-clause BSD). The vast majority of the contents is from my keyboard and no significant pieces remain of the former copyright holders code.	2002-09-20 22:26:27 +00:00
Poul-Henning Kamp	7812d86f03	(This commit touches about 15 disk device drivers in a very consistent and predictable way, and I apologize if I have gotten it wrong anywhere, getting prior review on a patch like this is not feasible, considering the number of people involved and hardware availability etc.) If struct disklabel is the messenger: kill the messenger. Inside struct disk we had a struct disklabel which disk drivers used to communicate certain metrics to the disklayer above (GEOM or the disk mini-layer). This commit changes this communication to use four explicit fields instead. Amongst the benefits is that the fields do not get overwritten by wrong or bogus on-disk disklabels. Once that is clear, <sys/disk.h> which is included in the drivers no longer need to pull <sys/disklabel.h> and <sys/diskslice.h> in, the few places that needs them, have gotten explicit #includes for them. The disklabel inside struct disk is now only for internal use in the disk mini-layer, so instead of embedding it, we malloc it as we need it. This concludes (modulus any mistakes) the series of disklabel related commits. I belive it all amounts to a NOP for all the rest of you :-) Sponsored by: DARPA & NAI Labs.	2002-09-20 19:36:05 +00:00
Poul-Henning Kamp	6fb3d70418	For reasons now lost in historical fog, the bounds_check_with_label() function were put in i386/i386/machdep.c from where it has been cut and pasted to other architectures with only minor corruption. Disklabel is really a MI format in many ways, at least it certainly is when you operate on struct disklabel. Put bounds_check_with_label() back in subr_disklabel.c where it belongs. Sponsored by: DARPA & NAI Labs.	2002-09-20 17:51:00 +00:00
Poul-Henning Kamp	2e45c1b191	We don't need the <sys/disklabel.h> include for alpha anymore. Sponsored by: DARPA & NAI Labs.	2002-09-20 17:45:44 +00:00
Poul-Henning Kamp	2382fb0a84	Make FreeBSD "struct disklabel" agnostic, step 312 of 723: Rename bioqdisksort() to bioq_disksort(). Keep a #define around to avoid changing all diskdrivers right now. Move it from subr_disklabel.c to subr_disk.c. Move prototype from <sys/disklabel.h> to <sys/bio.h> Sponsored by: DARPA and NAI Labs.	2002-09-20 14:14:37 +00:00
Poul-Henning Kamp	f90c382c0c	Make FreeBSD "struct disklabel" agnostic, step 311 of 723: Rename diskerr() to disk_err() for naming consistency. Drop the by now entirely useless struct disklabel argument. Add a flag argument for new-line termination. Fix a couple of printf-format-casts to %j instead of %l. Correctly print the name of all bio commands. Move the function from subr_disklabel.c to subr_disk.c, and from <sys/disklabel.h> to <sys/disk.h>. Use the new disk_err() throughout, #include <sys/disk.h> as needed. Bump __FreeBSD_version for the sake of the aac disk drivers #ifdefs. Remove unused disklabel members of softc for aac, amr and mlx, which seem to originally have been intended for diskerr() use, but which only rotted and got Copy&Pasted at least two times to many. Sponsored by: DARPA & NAI Labs.	2002-09-20 12:52:03 +00:00
Poul-Henning Kamp	837c5e5c2b	Remove unused variable.	2002-09-20 09:33:30 +00:00
Poul-Henning Kamp	46714777f5	Retire now unused DIOCGDVIRGIN kludge. Sponsored by: DARPA & NAI Labs.	2002-09-20 09:31:14 +00:00
Maxime Henrion	e2587e98e5	Switch to using strlcpy() in several places. It seems there were cases where we could get unterminated strings before.	2002-09-19 18:54:22 +00:00
John Baldwin	e485b64b08	Add ability to dump stacktraces on kernel panics when DDB is compiled into the kernel. By default this is turned off since otherwise it could scroll valuable panic messages off of the screen. This option can be turned on by the DDB_TRACE kernel option as well as the debug.trace_on_panic sysctl. Also, fix the DDB_UNATTENDED option to use its own header instead of abusing opt_ddb.h. This way turning that one option on or off doesn't force you to recompile all of ddb. Requested by: many (1), bde (2) - I know bde prefers !abusing option headers in general but can't remember if he as brought up this specific case.	2002-09-19 18:49:46 +00:00
Don Lewis	fa288043e2	VOP_FSYNC() requires that it's vnode argument be locked, which nfs_link() wasn't doing. Rather than just lock and unlock the vnode around the call to VOP_FSYNC(), implement rwatson's suggestion to lock the file vnode in kern_link() before calling VOP_LINK(), since the other filesystems also locked the file vnode right away in their link methods. Remove the locking and and unlocking from the leaf filesystem link methods. Reviewed by: rwatson, bde (except for the unionfs_link() changes)	2002-09-19 13:32:45 +00:00
Julian Elischer	4a3276d4a4	While well intentionned the check to see it there is a packet header and return that length, was misguided. The check itself didn't take into account the fact that the mbuf pointer pased in may be null, and the function is defined specifically for cases where the caller knows what it wants. Rather than fix the check I'm removing it as phk suggested. Submitted by: phk@freebsd.org	2002-09-19 08:28:41 +00:00
Julian Elischer	4a49235b89	fix style.. Return in the kernel always has () around the arguments.	2002-09-19 03:18:44 +00:00
Julian Elischer	1494277d50	Compiler was correct: m WAS being used uninitialized..	2002-09-19 03:15:39 +00:00
Darren Reed	e62497713c	If M_PKTHDR is set then we don't need to do a loop to find the total length.	2002-09-19 01:21:24 +00:00
Alfred Perlstein	3ffb9fadc8	Regen for added syscalls.	2002-09-19 00:48:57 +00:00
Alfred Perlstein	6d5dec35b7	Add the rest of the kernel support for the sem_ API in kern/uipc_sem.c. Option 'P1003_1B_SEMAPHORES' to compile them in, or load the "sem" module to activate them. Have kern/makesyscalls.sh emit an include for sys/_semaphore.h into sysproto.h to pull in the typedef for semid_t. Add the syscalls to the syscall table as module stubs.	2002-09-19 00:43:32 +00:00
Alfred Perlstein	efaa658806	Bring in my implementation of kernel support for posix realtime semaphores that are shareable between processes. There will be a cleanup shortly along with the necessary changes made to libc, libc_r, libpthread as well as the hooks into sys/conf and sys/modules.	2002-09-18 22:47:42 +00:00
Robert Watson	cc51a2b55e	Remove un-needed stack variable 'ops'. Obtained from: TrustedBSD Project Sponsored by: DARPA, Network Associates Laboratories	2002-09-18 22:35:02 +00:00
Bosko Milekic	bd395ae8f6	style nit: unsigned -> u_int in the kernel, particularly to stay consistent in this file, and keep m_length() and m_fixhdr() consistent with their prototypes in mbuf.h Inspired by: bde	2002-09-18 22:33:52 +00:00
Nate Lawson	86ed6d45ac	Remove any VOP_PRINT that redundantly prints the tag. Move lockmgr_printinfo() into vprint() for everyone's benefit. Suggested by: bde	2002-09-18 20:42:04 +00:00
Poul-Henning Kamp	7ed60de837	Use m_length() instead of home-rolled versions.	2002-09-18 19:44:14 +00:00
Poul-Henning Kamp	4e4425d486	Make m_length() and m_fixhdr() return unsigned. Suggested by: arr	2002-09-18 19:42:06 +00:00
Poul-Henning Kamp	ac6e585d24	Introduce the m_length() function which will return the accumulated length of an mbuf-chain and optionally a pointer to the last mbuf.	2002-09-18 14:57:35 +00:00
Poul-Henning Kamp	3f2e06c5e1	Move m_fixhdr() from "mbchain" to "mbuf" where it belongs.	2002-09-18 13:41:37 +00:00
Jeff Roberson	99571dc345	- Split UMA_ZFLAG_OFFPAGE into UMA_ZFLAG_OFFPAGE and UMA_ZFLAG_HASH. - Remove all instances of the mallochash. - Stash the slab pointer in the vm page's object pointer when allocating from the kmem_obj. - Use the overloaded object pointer to find slabs for malloced memory.	2002-09-18 08:26:30 +00:00
Robert Watson	ca7850c313	Add a toggle to disable VM enforcement. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-09-18 02:02:08 +00:00
Robert Watson	b88c98f6b1	At the cost of seeming a little gauche, make use of more traditional alphabetization for mac_enforce_pipe sysctl. Obtained from: TrustedBSD Project Sponsored by: DAPRA, NAI Labs	2002-09-18 02:00:19 +00:00
Robert Watson	289c6dea76	Don't call VOP_LEASE() while holding the accounting mutex.	2002-09-18 01:56:13 +00:00
Peter Wemm	acaa156683	Argh. I've been reading makefiles for too long. Change comment to a C-style comment.	2002-09-17 07:41:30 +00:00
Peter Wemm	1e19df3303	Stub out the calls to get_mcontext and set_mcontext which only exist on i386. This stuff should not be prototyped in MD inludes if the interface is expected to be MI.	2002-09-17 07:40:15 +00:00
Peter Wemm	66422f5b7a	Initiate deorbit burn for the i386-only a.out related support. Moves are under way to move the remnants of the a.out toolchain to ports. As the comment in src/Makefile said, this stuff is deprecated and one should not expect this to remain beyond 4.0-REL. It has already lasted WAY beyond that. Notable exceptions: gcc - I have not touched the a.out generation stuff there. ldd/ldconfig - still have some code to interface with a.out rtld. old as/ld/etc - I have not removed these yet, pending their move to ports. some includes - necessary for ldd/ldconfig for now. Tested on: i386 (extensively), alpha	2002-09-17 01:49:00 +00:00
Jonathan Mini	c76e33b681	Add kernel support needed for the KSE-aware libpthread: - Use ucontext_t's to store KSE thread state. - Synthesize state for the UTS upon each upcall, rather than saving and copying a trapframe. - Deliver signals to KSE-aware processes via upcall. - Rename kse mailbox structure fields to be more BSD-like. - Store the UTS's stack in struct proc in a stack_t. Reviewed by: bde, deischen, julian Approved by: -arch	2002-09-16 19:26:48 +00:00
Poul-Henning Kamp	7b08810243	Add a cast to make this file compile in userland on sparc64 without warnings.	2002-09-16 18:45:18 +00:00
Thomas Moestl	dde1c2c0d6	fcntl(..., F_SETLKW, ...) takes a pointer to a struct flock just like F_SETLK does, so it also needs this structure copied in in fnctl() before calling kern_fcntl().	2002-09-16 01:05:15 +00:00
Julian Elischer	4f0db5e08c	Allocate KSEs and KSEGRPs separatly and remove them from the proc structure. next step is to allow > 1 to be allocated per process. This would give multi-processor threads. (when the rest of the infrastructure is in place) While doing this I noticed libkvm and sys/kern/kern_proc.c:fill_kinfo_proc are diverging more than they should.. corrective action needed soon.	2002-09-15 23:52:25 +00:00
Poul-Henning Kamp	f986355c0e	s/Danglish/English/ Some style issues. Change the timeout to be hz/10 instead of hz. Brucification by: bde.	2002-09-15 17:52:35 +00:00
Poul-Henning Kamp	028e9e5902	Un-inline the non-trivial "trivial" bio* functions. Untangle devstat_end_transaction_bio()	2002-09-14 19:34:11 +00:00
Warner Losh	bed807193b	Add additional information returned to userland by the device querying functions. We add pnpinfo, locationinfo, devflags (the newbus flags on the device), flags (the flags that device_get_flags returns) and state to the list of things we return. pnpinfo and locationinfo are place holders at the moment that will be filled in by the device's parent (optionally). Userland programs will likely use this information from time to time and take appropriate actions. Improvements to devinfo to follow.	2002-09-14 19:15:12 +00:00
Nate Lawson	c1e2d3866f	Move setugidsafety() call outside of process lock. This prevents a lock recursion when closef() calls pfind() which also wants the proc lock. This case only occurred when setugidsafety() needed to close unsafe files. Reviewed by: truckman	2002-09-14 18:55:11 +00:00
Maxime Henrion	8179ab99d6	Remove a conditional #include <sys/kernel.h>, it is already included unconditionally before. Submitted by: Olivier Houchard <cognet@ci0.org>	2002-09-14 14:44:41 +00:00
Nate Lawson	06be2aaa83	Remove all use of vnode->v_tag, replacing with appropriate substitutes. v_tag is now const char * and should only be used for debugging. Additionally: 1. All users of VT_NTS now check vfsconf->vf_type VFCF_NETWORK 2. The user of VT_PROCFS now checks for the new flag VV_PROCDEP, which is propagated by pseudofs to all child vnodes if the fs sets PFS_PROCDEP. Suggested by: phk Reviewed by: bde, rwatson (earlier version)	2002-09-14 09:02:28 +00:00
Julian Elischer	e6e24ff9cd	Apparently something down in the guts of vm/uvm still needs giant Obtained from: mini via P4 KSE tree.	2002-09-14 06:23:43 +00:00
Thomas Moestl	4e115a85ab	Fix fcntl(..., F_GETOWN, ...) and fcntl(..., F_SETOWN, ...) on sparc64 by not passing a pointer to a register_t or intptr_t when the code in the lower layers expects one to an int.	2002-09-13 15:15:16 +00:00
Poul-Henning Kamp	c7143e7150	Oops, broke the build there. Uninline biodone() now that it is non-trivial. Introduce biowait() function. Currently there is a race condition and the mitigation is a timeout/retry. It is not obvious what kind of locking (if any) is suitable for BIO_DONE, since the majority of users take are of this themselves, and only a few places actually rely on the wakeup. Sponsored by: DARPA & NAI Labs.	2002-09-13 11:28:31 +00:00
Don Lewis	28b325aa60	Drop the proc lock while calling fdcheckstd() which may block to allocate memory. Reviewed by: jhb	2002-09-13 09:31:56 +00:00
Bruce Evans	f70de49661	Fixed style bugs in resource_list_add_next().	2002-09-12 13:45:38 +00:00
Andrew R. Reiter	b4dcc46af5	- Fix two obvious locking bugs; 1) returning with lock held when it needed to be dropped, 2) attempting to lock acct_mtx while already holding it. Sorry to those who experienced pain. - Added two comments referring to two areas in which acct_mtx is held over vnode operations that might sleep. Patch in the works for this.	2002-09-12 05:00:32 +00:00
John Baldwin	c9e7d28e26	- Change utrace ktrace events to malloc the work buffer before getting a request structure. - Re-optimize the case of utrace being disabled by doing an explicit KTRPOINT check instead of relying on the one in ktr_getrequest() so that we don't waste time on a malloc in the non-tracing case. - Change utrace() to return an error if the copyin() fails. Before it would just ignore the request but still return success. This last is a change in behavior and can be backed out if necessary.	2002-09-11 21:00:56 +00:00
John Baldwin	1d3ab18279	Remove support for synchronous ktrace requests now that none exist anymore. They were an ugly, gross hack.	2002-09-11 20:58:10 +00:00
John Baldwin	b92584a689	- Change ktrace genio events to only copy up to ktr_geniosize bytes of a transfer to a malloc'd buffer and use that bufer for the ktrace event. This means that genio ktrace events no longer need to be synchronous. - Now that ktr_buffer isn't overloaded to sometimes point to a cached uio pointer for genio requests and always points to a malloc'd buffer if not NULL, free the buffer in ktr_freerequest() instead of in ktr_writerequest(). This closes a memory leak for ktrace events that used a malloc'd buffer that had their vnode ripped out from under them while they were on the todo list. Suggested by: bde (1, in principle)	2002-09-11 20:56:05 +00:00
John Baldwin	12301fc3c7	- Add a kern.ktrace sysctl node. - Rename kern.ktrace_request_pool tunable/sysctl to kern.ktrace.request_pool. - Add a variable to control the max amount of data to log for genio events. This variable is tunable via the tunable/sysctl kern.ktrace.genio_size and defaults to one page.	2002-09-11 20:49:55 +00:00
John Baldwin	4b3aac3d4e	Change namei and syscall ktrace events to malloc work buffers before obtaining a ktr_request structure from the free pool so we can avoid starving other threads of ktr_request structures.	2002-09-11 20:46:50 +00:00
Julian Elischer	85e40eaf26	Indentation does not make a block.. need curly braces too. Submitted by: Eagle-eyes evans <bde@freebsd.org>	2002-09-11 18:15:26 +00:00
Julian Elischer	71fad9fdee	Completely redo thread states. Reviewed by: davidxu@freebsd.org	2002-09-11 08:13:56 +00:00
Bruce Evans	527eee2d40	Include <vm/uma.h> instead of depending on namespace pollution in <sys/malloc.h>. Sorted includes as much as possible. Removed banal comment(s) attached to includes.	2002-09-11 07:13:28 +00:00
Warner Losh	74014b7f0a	Clarify the return value from child_present.	2002-09-11 04:22:10 +00:00
Andrew R. Reiter	4f39d5d511	- Lock down the accounting code globals with a subsystem mutex. Reviewed by: jhb, mdodd	2002-09-11 04:10:41 +00:00
Bruce Evans	e5d6cd0c98	Include <sys/malloc.h> instead of depending on namespace pollution 2 layers deep in <sys/proc.h> or <sys/vnode.h>. Removed unused includes. Sorted includes.	2002-09-10 11:57:02 +00:00
Bruce Evans	d3a7b5e70e	vfs_syscalls.c: Changed rename(2) to follow the letter of the POSIX spec. POSIX requires rename() to have no effect if its args "resolve to the same existing file". I think "file" can only reasonably be read as referring to the inode, although the rationale and "resolve" seem to say that sameness is at the level of (resolved) directory entries. ext2fs_vnops.c, ufs_vnops.c: Replaced code that gave the historical BSD behaviour of removing one link name by checks that this code is now unreachable. This fixes some races. All vnodes needed to be unlocked for the removal, and locking at another level using something like IN_RENAME was not even attempted, so it was possible for rename(x, y) to return with both x and y removed even without any unlink(2) syscalls (one process can remove x using rename(x, y) and another process can remove y using rename(y, x)). Prodded by: alfred MFC after: 8 weeks PR: 42617	2002-09-10 11:09:13 +00:00
Robert Watson	c0f3990523	Add security.mac.mmap_revocation, a flag indicating whether we should revoke access to memory maps on a process label change. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-09-09 17:12:24 +00:00
Robert Watson	1614003510	Minor code sync to MAC tree: push Giant locking up from mac_cred_mmapped_drop_perms() to the caller. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-09-09 17:10:16 +00:00
Poul-Henning Kamp	5ea98f59b9	Fix a logic bug in the struct dev_t allocation code. Spotted by: Neelkanth Natu <neelnatu@yahoo.com>	2002-09-08 15:15:12 +00:00
Jake Burkholder	c0d676c068	Make this driver work a whole lot better. - Get the initial mode from the prom settings and don't clobber the mode on open. - Copy output into an internal ring buffer instead of accessing the tty outq directly in the interrupt handler. This fixes a problem where garbage would show up in the output stream. - Reset the console port completely and reprogram all the parameters before enabling it. This fixes seemingly random hangs on startup when using a fast interrupt handler. - Add minimal locking in place of spls. - Remove dead code and minor cleanups.	2002-09-08 04:45:16 +00:00
Peter Wemm	d0ca7c29dc	Do not blow up when we walk off the end of the brands list. Found by: kris, jake	2002-09-08 02:17:44 +00:00
Peter Wemm	a9f9df5daf	Tidy up some loose ends that bde pointed out. caddr_t bad, ok? Move fill_kinfo_proc to before we copy the results instead of after the copy and too late. There is still more to do here.	2002-09-07 22:31:44 +00:00
Peter Wemm	1ed8cb4870	Remove bogus fill_kinfo_proc() before ptrace_set_pc(). There was no need for this. Submitted by: bde	2002-09-07 22:18:19 +00:00
Peter Wemm	99a17113cd	The true value of how the kernel was configured for KSTACK_PAGES was not available at module compile time. Do not #include the bogus opt_kstack_pages.h at this point and instead refer to the variables that are also exported via sysctl.	2002-09-07 22:15:47 +00:00
Peter Wemm	b9f009b08d	Make UAREA_PAGES and KSTACK_PAGES visible to userland via sysctl, like PS_STRINGS and USRSTACK is. This is necessary in order to decode a.out core dumps. kern_proc.c was already referring to both of these values but was missing the #include "opt_kstack_pages.h". Make the sysctl variables visible so that certain kld modules can see how their parent kernel was configured.	2002-09-07 22:11:45 +00:00
Julian Elischer	c0698d32ce	fix braino.. was clearing part of wrong thread structure..	2002-09-07 12:58:44 +00:00
Julian Elischer	9b0e281b69	fix misplaced schedlock Submitted by: davidxu@freebsd.org	2002-09-07 01:48:53 +00:00
Peter Wemm	710ded3ac5	Collect the a.out coredump code into the calling functions. XXX why does pecoff dump in a.out format?	2002-09-07 01:23:51 +00:00
Robert Watson	6f22742b25	Minor spelling tweak: assume "his" is actually "This".	2002-09-06 13:22:44 +00:00
Julian Elischer	1faf202ea9	Use UMA as a complex object allocator. The process allocator now caches and hands out complete process structures including substructures . i.e. it get's the process structure with the first thread (and soon KSE) already allocated and attached, all in one hit. For the average non threaded program (non KSE that is) the allocated thread and its stack remain attached to the process, even when the process is unused and in the process cache. This saves having to allocate and attach it later, effectively bringing us (hopefully) close to the efficiency of pre-KSE systems where these were a single structure. Reviewed by: davidxu@freebsd.org, peter@freebsd.org	2002-09-06 07:00:37 +00:00
David Xu	65c17e749b	Remove extra ';'	2002-09-06 00:18:52 +00:00
Poul-Henning Kamp	e1657bbb97	Introduce the VOP_OPENEXTATTR() and VOP_CLOSEEXTATTR() methods. Together these two implement a simple transcation style grouping for modifications of extended attributes on a vnode. VOP_CLOSEEXTATTR() takes a boolean "commit" argument, which determines if the aggregate changes are attempted written or not. A commit will fail if any of the VOP_SETEXTATTR() calls since the VOP_OPENEXTATTR() have failed to meet their objective or if the flush to disk fails. The default operations for these two VOP's is to return EOPNOTSUPP. This API may still be subject to change. Sponsored by: DARPA & NAI Labs	2002-09-05 20:56:14 +00:00
Poul-Henning Kamp	f8b663614d	Fix an inherited style bug: compare with NOCRED instead of NULL. Sponsored by: DARPA & NAI Labs.	2002-09-05 20:46:19 +00:00
Poul-Henning Kamp	c1a925a637	Introduce new extattr_check_cred() function which implements the canonical crential washing for extended attributes. Sponsored by: DARPA & NAI Labs.	2002-09-05 20:38:57 +00:00
Mitsuru IWASAKI	2894f9d0a7	Add debug.rman_debug sysctl MIB and loader tunable instead of broken RMAN_DEBUG option. This would be useful for debugging resource manager code.	2002-09-05 11:45:02 +00:00
Poul-Henning Kamp	32c6c4780a	Fix a format buglet. Spotted by: iedowse	2002-09-05 11:42:03 +00:00
David Xu	1279572a92	s/SGNL/SIG/ s/SNGL/SINGLE/ s/SNGLE/SINGLE/ Fix abbreviation for P_STOPPED_* etc flags, in original code they were inconsistent and difficult to distinguish between them. Approved by: julian (mentor)	2002-09-05 07:30:18 +00:00
Bruce Evans	b656366b46	Include <sys/malloc.h> instead of depending on namespace pollution 2 layers deep in <sys/proc.h> or <sys/vnode.h>. Removed unused includes. Fixed some printf format errors (1 fatal on i386's; 1 fatal on alphas; 1 not fatal on any supported machine).	2002-09-05 07:02:43 +00:00
Ian Dowse	012e544f12	Split up ptrace() into a wrapper that does the copying to and from user space and a kern_ptrace() implementation. Use the kern_*() version in the Linux emulation code to remove more stack gap uses. Approved by: des	2002-09-05 01:02:50 +00:00
Poul-Henning Kamp	b336df68d1	Under DIAGNOSTIC, complain if a timeout(9) routine took more than 1msec.	2002-09-04 20:05:00 +00:00
Poul-Henning Kamp	e46eeb89b9	Do not employ timecounter hardware if our hz does not support their correct rewinding.	2002-09-04 19:32:18 +00:00
Poul-Henning Kamp	e7fa55af89	Give up on calling tc_ticktock() from a timeout, we have timeout functions which run for several milliseconds at a time and getting in queue behind one or more of those makes us miss our rewind. Instead call it from hardclock() like we used to do, but retain the prescaler so we still cope with high HZ values.	2002-09-04 10:15:19 +00:00
Matthew Dillon	21c2d0479c	Alright, fix the problems with the elf loader for the Alpha. It turns out that there is no easy way to discern the difference between a text segment and a data segment through the read-only OR execute attribute in the elf segment header, so revert the algorithm to what it was before. Neither can we account for multiple data load segments in the vmspace structure (at least not without more work), due to assumptions obreak() makes in regards to the data start and data size fields. Retain RLIMIT_VMEM checking by using a local variable to track the total bytes of data being loaded. Reviewed by: peter X-MFC after: ASAP	2002-09-04 04:42:12 +00:00
Peter Wemm	9782ecbab0	Make the text segment locating heuristics from rev 1.121 more reliable so that it works on the Alpha. This defines the segment that the entry point exists in as 'text' and any others (usually one) as data. Submitted by: tmm Tested on: i386, alpha	2002-09-03 21:18:17 +00:00
John Baldwin	5fc3031366	- Change falloc() to acquire an fd from the process table last so that it can do it w/o needing to hold the filelist_lock sx lock. - fdalloc() doesn't need Giant to call free() anymore. It also doesn't need to drop and reacquire the filedesc lock around free() now as a result. - Try to make the code that copies fd tables when extending the fd table in fdalloc() a bit more readable by performing assignments in separate statements. This is still a bit ugly though. - Use max() instead of an if statement so to figure out the starting point in the search-for-a-free-fd loop in fdalloc() so it reads better next to the min() in the previous line. - Don't grow nfiles in steps up to the size needed if we dup2() to some really large number. Go ahead and double 'nfiles' in a loop prior to doing the malloc(). - malloc() doesn't need Giant now. - Use malloc() and free() instead of MALLOC() and FREE() in fdalloc(). - Check to see if the size we are going to grow to is too big, not if the current size of the fd table is too big in the loop in fdalloc(). This means if we are out of space or if dup2() requests too high of a fd, then we will return an error before we go off and try to allocate some huge table and copy the existing table into it. - Move all of the logic for dup'ing a file descriptor into do_dup() instead of putting some of it in do_dup() and duplicating other parts in four different places. This makes dup(), dup2(), and fcntl(F_DUPFD) basically wrappers of do_dup now. fcntl() still has an extra check since it uses a different error return value in one case then the other functions. - Add a KASSERT() for an assertion that may not always be true where the fdcheckstd() function assumes that falloc() returns the fd requested and not some other fd. I think that the assertion is always true because we are always single-threaded when we get to this point, but if one was using rfork() and another process sharing the fd table were playing with the fd table, there might could be a problem. - To handle the problem of a file descriptor we are dup()'ing being closed out from under us in dup() in general, do_dup() now obtains a reference on the file in question before calling fdalloc(). If after the call to fdalloc() the file for the fd we are dup'ing is a different file, then we drop our reference on the original file and return EBADF. This race was only handled in the dup2() case before and would just retry the operation. The error return allows the user to know they are being stupid since they have a locking bug in their app instead of dup'ing some other descriptor and returning it to them. Tested on: i386, alpha, sparc64	2002-09-03 20:16:31 +00:00
John Baldwin	0d975d6341	Add some KASSERT()'s to ensure that we don't perform spin mutex ops on sleep mutexes and vice versa. WITNESS normally should catch this but not everyone uses WITNESS so this is a fallback to catch nasty but easy to do bugs.	2002-09-03 18:25:16 +00:00
David Xu	35c32a76f9	In the kernel code, we have the tsleep() call with the PCATCH argument. PCATCH means 'if we get a signal, interrupt me!" and tsleep returns either EINTR or ERESTART depending on the circumstances. ERESTART is "special" because it causes the system call to fail, but right as it returns back to userland it tells the trap handler to move %eip back a bit so that userland will immediately re-run the syscall. This is a syscall restart. It only works for things like read() etc where nothing has changed yet. Note that userland is tricked into restarting the syscall by the kernel. The kernel doesn't actually do the restart. It is deadly for things like select, poll, nanosleep etc where it might cause the elapsed time to be reset and start again from scratch. So those syscalls do this to prevent userland rerunning the syscall: if (error == ERESTART) error = EINTR; Fake "signals" like SIGTSTP from ^Z etc do not normally invoke userland signal handlers. But, in -current, the PCATCH is being triggered and tsleep is returning ERESTART, and the syscall is aborted even though no userland signal handler was run. That is the fault here. We're triggering the PCATCH in cases that we shouldn't. ie: it is being triggered on any signal processing, rather than the case where the signal is posted to userland. --- Peter The work of psignal() is a patchwork of special case required by the process debugging and job-control facilities... --- Kirk McKusick "The design and impelementation of the 4.4BSD Operating system" Page 105 in STABLE source, when psignal is posting a STOP signal to sleeping process and the signal action of the process is SIG_DFL, system will directly change the process state from SSLEEP to SSTOP, and when SIGCONT is posted to the stopped process, if it finds that the process is still on sleep queue, the process state will be restored to SSLEEP, and won't wakeup the process. this commit mimics the behaviour in STABLE source tree. Reviewed by: Jon Mini, Tim Robbins, Peter Wemm Approved by: julian@freebsd.org (mentor)	2002-09-03 12:56:01 +00:00
Ian Dowse	48b52b7a32	Split up __getcwd so that kernel callers of the internal version can specify whether the buffer is in user or system space.	2002-09-02 22:40:30 +00:00
Ian Dowse	49c2ff159f	Split fcntl() into a wrapper and a kernel-callable kern_fcntl() implementation. The wrapper is responsible for copying additional structure arguments (struct flock) to and from userland.	2002-09-02 22:24:14 +00:00
Matthew Dillon	05ef87980a	Grammer cleanup	2002-09-02 17:27:30 +00:00
David Xu	67bdda9718	fix bogus CTR3 message. Reviewed by: julian@freebsd.org (mentor)	2002-09-02 07:55:06 +00:00
Jake Burkholder	5fe3ed629a	Moved elf brand identification into a function. Fully identify the brand early in the process of loading an elf file, so that we can identify the sysentvec, and so that we do not continue if we do not have a brand (and thus a sysentvec). Use the values in the sysentvec for the page size and vm ranges unconditionally, since they are all filled in now.	2002-09-02 04:50:57 +00:00
Alan Cox	8a59b15cd4	o Synchronize updates to struct vm_page::cow with the page queues lock.	2002-09-02 04:04:12 +00:00
Jake Burkholder	8cf034521b	Fixed more indentation bugs.	2002-09-02 02:41:26 +00:00
Jake Burkholder	f36ba45234	Added fields for VM_MIN_ADDRESS, PS_STRINGS and stack protections to sysentvec. Initialized all fields of all sysentvecs, which will allow them to be used instead of constants in more places. Provided stack fixup routines for emulations that previously used the default.	2002-09-01 21:41:24 +00:00
Ian Dowse	8f19eb88df	Split out a number of mostly VFS and signal related syscalls into a kernel-internal kern_*() version and a wrapper that is called via the syscall vector table. For paths and structure pointers, the internal version either takes a uio_seg parameter or requires the caller to copyin() the data to kernel memory as appropiate. This will permit emulation layers to use these syscalls without having to copy out translated arguments to the stack gap. Discussed on: -arch Review/suggestions: bde, jhb, peter, marcel	2002-09-01 20:37:28 +00:00
Matthew Dillon	cac4515267	Implement data, text, and vmem limit checking in the elf loader and svr4 compat code. Clean up accounting for multiple segments. Part 1/2. Submitted by: Andrey Alekseyev <uitm@zenon.net> (with some modifications) MFC after: 3 days	2002-08-30 18:09:46 +00:00
Peter Wemm	447b3772dc	Change hw.physmem and hw.usermem to unsigned long like they used to be in the original hardwired sysctl implementation. The buf size calculator still overflows an integer on machines with large KVA (eg: ia64) where the number of pages does not fit into an int. Use 'long' there. Change Maxmem and physmem and related variables to 'long', mostly for completeness. Machines are not likely to overflow 'int' pages in the near term, but then again, 640K ought to be enough for anybody. This comes for free on 32 bit machines, so why not?	2002-08-30 04:04:37 +00:00

1 2 3 4 5 ...

5547 Commits