freebsd-dev

Author	SHA1	Message	Date
John Baldwin	c3f52eedeb	Protect p_stat with the sched_lock. Reviewed by: jake	2000-12-02 01:58:15 +00:00
Eivind Eklund	b8c8516a7f	More paranoia against overflows	2000-11-08 21:53:05 +00:00
Eivind Eklund	ab3240e198	Fix overflow from jail hostname. Bug found by: Esa Etelavuori <eetelavu@cc.hut.fi>	2000-11-01 19:38:08 +00:00
Alfred Perlstein	e5b7b6b78e	return correct type for process directory entries, DT_DIR not DT_REG	2000-10-05 23:19:51 +00:00
Dag-Erling Smørgrav	18203af447	Remove a comment that has been not only obsolete but patently wrong for the last 31 revisions (almost three years).	2000-09-04 18:18:17 +00:00
Robert Watson	84a5637620	o Simplify if/then clause equating ESRCH with ENOENT when hiding a process Submitted by: des	2000-09-01 18:41:32 +00:00
Robert Watson	ca94dd37a3	o Make procfs use vaccess() for procfs_access() DAC and super-user checks, rather than implementing its own {uid,gid,other} checks against vnode mode. Similar change to linprocfs currently under review. Obtained from: TrustedBSD Project	2000-09-01 13:41:41 +00:00
Robert Watson	387d2c036b	o Centralize inter-process access control, introducing: int p_can(p1, p2, operation, privused) which allows specification of subject process, object process, inter-process operation, and an optional call-by-reference privused flag, allowing the caller to determine if privilege was required for the call to succeed. This allows jail, kern.ps_showallprocs and regular credential-based interaction checks to occur in one block of code. Possible operations are P_CAN_SEE, P_CAN_SCHED, P_CAN_KILL, and P_CAN_DEBUG. p_can currently breaks out as a wrapper to a series of static function checks in kern_prot, which should not be invoked directly. o Commented out capabilities entries are included for some checks. o Update most inter-process authorization to make use of p_can() instead of manual checks, PRISON_CHECK(), P_TRESPASS(), and kern.ps_showallprocs. o Modify suser{,_xxx} to use const arguments, as it no longer modifies process flags due to the disabling of ASU. o Modify some checks/errors in procfs so that ENOENT is returned instead of ESRCH, further improving concealment of processes that should not be visible to other processes. Also introduce new access checks to improve hiding of processes for procfs_lookup(), procfs_getattr(), procfs_readdir(). Correct a bug reported by bp concerning not handling the CREATE case in procfs_lookup(). Remove volatile flag in procfs that caused apparently spurious qualifier warnigns (approved by bde). o Add comment noting that ktrace() has not been updated, as its access control checks are different from ptrace(), whereas they should probably be the same. Further discussion should happen on this topic. Reviewed by: bde, green, phk, freebsd-security, others Approved by: bde Obtained from: TrustedBSD Project	2000-08-30 04:49:09 +00:00
Poul-Henning Kamp	39f70682ae	Introduce vop_stdinactive() and make it the default if no vop_inactive is declared. Sort and prune a few vop_op[].	2000-08-18 10:01:02 +00:00
Poul-Henning Kamp	eb95c536ad	Remove unneeded #include <sys/kernel.h>	2000-04-29 15:36:14 +00:00
Brian Feldman	b7db19017b	Move procfs_fullpath() to vfs_cache.c, with a rename to textvp_fullpath(). There's no excuse to have code in synthetic filestores that allows direct references to the textvp anymore. Feature requested by: msmith Feature agreed to by: warner Move requested by: phk Move agreed to by: bde	2000-04-26 11:57:45 +00:00
Brian Feldman	28749c79c8	Quiet an unused variable warning by commenting out a variable declaration that goes with a commented out statement.	2000-04-22 17:58:40 +00:00
Brian Feldman	bb529be712	There's no reason to make "file" 0500 rather than 0555.	2000-04-22 04:01:54 +00:00
Brian Feldman	081d7b00c7	Welcome back our old friend from procfs, "file"!	2000-04-22 03:44:41 +00:00
Peter Wemm	c447342094	Change #ifdef KERNEL to #ifdef _KERNEL in the public headers. "KERNEL" is an application space macro and the applications are supposed to be free to use it as they please (but cannot). This is consistant with the other BSD's who made this change quite some time ago. More commits to come.	1999-12-29 05:07:58 +00:00
Peter Wemm	e0f9d2869a	Fix typo "," vs ";" PR: 15696 Submitted by: Takashi Okumura <taka@cs.pitt.edu>	1999-12-27 16:03:38 +00:00
Eivind Eklund	a6b174a83b	Include vm/vm_extern.h to get at prototypes	1999-12-20 18:26:58 +00:00
Robert Watson	91f37dcba1	Second pass commit to introduce new ACL and Extended Attribute system calls, vnops, vfsops, both in /kern, and to individual file systems that require a vfsop_ array entry. Reviewed by: eivind	1999-12-19 06:08:07 +00:00
Eivind Eklund	762e6b856c	Introduce NDFREE (and remove VOP_ABORTOP)	1999-12-15 23:02:35 +00:00
Peter Wemm	99b30c79b0	Don't simulate a pseudo address-space beyond VM_MAXUSER_ADDRESS that maps onto the upages. We used to use this extensively, particularly for ps and gdb. Both of these have been "fixed". ps gets the p_stats via eproc along with all the other stats, and gdb uses the regs, fpregs etc files. Once apon a time the UPAGES were mapped here, but that changed back in January '96. This essentially kills my revisions 1.16 and 1.17. The 2-page "hole" above the stack can be reclaimed now.	1999-12-11 10:21:34 +00:00
Poul-Henning Kamp	5ecdb702b0	Remove unused #includes. Obtained from: http://bogon.freebsd.dk/include	1999-12-08 08:59:40 +00:00
Poul-Henning Kamp	a8704f8999	Add a sysctl to control if argv is disclosed to the world: kern.ps_argsopen It defaults to 1 which means that all users can see all argvs in ps(1). Reviewed by: Warner	1999-11-26 08:27:16 +00:00
Poul-Henning Kamp	a9e0361b4a	Introduce the new function p_trespass(struct proc p1, struct proc p2) which returns zero or an errno depending on the legality of p1 trespassing on p2. Replace kern_sig.c:CANSIGNAL() with call to p_trespass() and one extra signal related check. Replace procfs.h:CHECKIO() macros with calls to p_trespass(). Only show command lines to process which can trespass on the target process.	1999-11-21 19:03:20 +00:00
Poul-Henning Kamp	da654d9070	s/p_cred->pc_ucred/p_ucred/g	1999-11-21 12:38:21 +00:00
Sean Eric Fagan	13baacebcb	A process should be able to examine itself.	1999-11-20 18:22:14 +00:00
Poul-Henning Kamp	6153cb2048	Make proc/*/cmdline use the cached argv if available. Submitted by: Paul Saab <paul@mu.org> Reviewed by: phk	1999-11-17 21:35:07 +00:00
Poul-Henning Kamp	3cf5d0fd07	The function `procfs_getattr()' in procfs doesn't set the value of vap->va_fsid, so we cannot get valid information about procfs. Submitted by: SAWADA Mizuki miz@pa.aix.or.jp Reviewed by: phk PR: 1654	1999-11-17 21:33:25 +00:00
Alan Cox	b561683329	Passing "0" or "FALSE" as the fourth argument to vm_fault is wrong. It should be "VM_FAULT_NORMAL".	1999-11-09 01:44:28 +00:00
Sean Eric Fagan	75bd443641	Explain why Warner is right, and I am wrong, in the removing of the file object. Also explain some possible directions to re-implement it -- I'm not sure it should be, given the minimal application use. (Other than having the debugger automatically access the symbols for a process, the main use I'd found was with some minor accounting ability, but _that_ depends on it being in the filesystem space; an ioctl access method would be useless in that case.) This is a code-less change; only a comment has been added.	1999-11-08 05:13:54 +00:00
Sean Eric Fagan	900e2da760	Make an incredibly stupid change because Warner threatened to do it and continue doing it despite objections by me (the principal author). Note that this doesn't fix the real problem -- the real problem is generally bad setup by ignorant users, and education is the right way to fix it. So while this doesn't actually solve the prolem mentioned in the complaint (since it's still possible to do it via other methods, although they mostly involve a bit more complicity), and there are better methods to do this, nobody was willing or able to provide me with a real world example that couldn't be worked around using the existing permissions and group mechanism. And therefore, security by removing features is the method of the day. I only had three applications that used it, in any event. One of them would have made debugging easier, but I still haven't finished it, and won't now, so it doesn't really matter.	1999-11-07 07:52:02 +00:00
Poul-Henning Kamp	923502ff91	useracc() the prequel: Merge the contents (less some trivial bordering the silly comments) of <vm/vm_prot.h> and <vm/vm_inherit.h> into <vm/vm.h>. This puts the #defines for the vm_inherit_t and vm_prot_t types next to their typedefs. This paves the road for the commit to follow shortly: change useracc() to use VM_PROT_{READ\|WRITE} rather than B_{READ\|WRITE} as argument.	1999-10-29 18:09:36 +00:00
Marcel Moolenaar	2c42a14602	sigset_t change (part 2 of 5) ----------------------------- The core of the signalling code has been rewritten to operate on the new sigset_t. No methodological changes have been made. Most references to a sigset_t object are through macros (see signalvar.h) to create a level of abstraction and to provide a basis for further improvements. The NSIG constant has not been changed to reflect the maximum number of signals possible. The reason is that it breaks programs (especially shells) which assume that all signals have a non-null name in sys_signame. See src/bin/sh/trap.c for an example. Instead _SIG_MAXSIG has been introduced to hold the maximum signal possible with the new sigset_t. struct sigprop has been moved from signalvar.h to kern_sig.c because a) it is only used there, and b) access must be done though function sigprop(). The latter because the table doesn't holds properties for all signals, but only for the first NSIG signals. signal.h has been reorganized to make reading easier and to add the new and/or modified structures. The "old" structures are moved to signalvar.h to prevent namespace polution. Especially the coda filesystem suffers from the change, because it contained lines like (p->p_sigmask == SIGIO), which is easy to do for integral types, but not for compound types. NOTE: kdump (and port linux_kdump) must be recompiled. Thanks to Garrett Wollman and Daniel Eischen for pressing the importance of changing sigreturn as well.	1999-09-29 15:03:48 +00:00
Alfred Perlstein	c24fda81c9	Seperate the export check in VFS_FHTOVP, exports are now checked via VFS_CHECKEXP. Add fh(open\|stat\|stafs) syscalls to allow userland to query filesystems based on (network) filehandle. Obtained from: NetBSD	1999-09-11 00:46:08 +00:00
Alfred Perlstein	5a5fccc8e7	All unimplemented VFS ops now have entries in kern/vfs_default.c that return reasonable defaults. This avoids confusing and ugly casting to eopnotsupp or making dummy functions. Bogus casting of filesystem sysctls to eopnotsupp() have been removed. This should make *_vfsops.c more readable and reduce bloat. Reviewed by: msmith, eivind Approved by: phk Tested by: Jeroen Ruigrok/Asmodai <asmodai@wxs.nl>	1999-09-07 22:42:38 +00:00
Peter Wemm	c3aac50f28	$Id$ -> $FreeBSD$	1999-08-28 01:08:13 +00:00
Marcel Moolenaar	63a9927353	Let processes retrieve their argv through procfs. Revert to the original behaviour in all other cases. Submitted by: Andrew Gordon <arg@arg1.demon.co.uk>	1999-08-19 19:41:08 +00:00
Bruce Evans	dd26feb90b	Fixed printf format errors (%qu -> %llu; the arg was already unsigned long long to hide problems on alphas).	1999-08-08 13:43:51 +00:00
Poul-Henning Kamp	46dcdb370e	Allow jailed proccesses to open non-process vnodes like the root of the fs.	1999-07-09 21:31:44 +00:00
Peter Wemm	ebce412ca2	Use %q rather than rolling a custom routine.	1999-07-09 17:56:59 +00:00
Jonathan Lemon	d6c4f01106	Support for i386 hardware breakpoints. Submitted by: Brian Dean <brdean@unx.sas.com>	1999-07-09 04:18:32 +00:00
Jonathan Lemon	ab001a72be	Implement support for hardware debug registers on the i386. Submitted by: Brian Dean <brdean@unx.sas.com>	1999-07-09 04:16:00 +00:00
Poul-Henning Kamp	7a7404d275	Eliminate the bogus procfs private almost struct dirent structure. Spotted by: Lars Hamren Reviewed by: bde	1999-06-13 20:53:16 +00:00
Dmitrij Tejblum	9d3a442583	Don't call calcru() on a swapped-out process. calcru() access p_stats, which is in U-area.	1999-05-22 20:10:31 +00:00
Poul-Henning Kamp	a6d3121589	Make the type and map files claim 0 bytes size. Tar doesn't get confused now, but doesn't store any data eiter. I wonder if we shouldn't claim to be fifos instead...	1999-05-04 08:01:55 +00:00
Poul-Henning Kamp	8902608d57	Add even more () to CHECKIO which by now feels positively LISPish. Submitted by: bde Reviewed by: phk	1999-05-04 08:00:10 +00:00
Poul-Henning Kamp	d37ed5a03a	Add a new "file" to procfs: "rlimit" which shows the resource limits for the process. PR: 11342 Submitted by: Adrian Chadd adrian@freebsd.org Reviewed by: phk	1999-04-30 13:04:21 +00:00
Poul-Henning Kamp	75c1354190	This Implements the mumbled about "Jail" feature. This is a seriously beefed up chroot kind of thing. The process is jailed along the same lines as a chroot does it, but with additional tough restrictions imposed on what the superuser can do. For all I know, it is safe to hand over the root bit inside a prison to the customer living in that prison, this is what it was developed for in fact: "real virtual servers". Each prison has an ip number associated with it, which all IP communications will be coerced to use and each prison has its own hostname. Needless to say, you need more RAM this way, but the advantage is that each customer can run their own particular version of apache and not stomp on the toes of their neighbors. It generally does what one would expect, but setting up a jail still takes a little knowledge. A few notes: I have no scripts for setting up a jail, don't ask me for them. The IP number should be an alias on one of the interfaces. mount a /proc in each jail, it will make ps more useable. /proc/<pid>/status tells the hostname of the prison for jailed processes. Quotas are only sensible if you have a mountpoint per prison. There are no privisions for stopping resource-hogging. Some "#ifdef INET" and similar may be missing (send patches!) If somebody wants to take it from here and develop it into more of a "virtual machine" they should be most welcome! Tools, comments, patches & documentation most welcome. Have fun... Sponsored by: http://www.rndassociates.com/ Run for almost a year by: http://www.servetheweb.com/	1999-04-28 11:38:52 +00:00
Poul-Henning Kamp	1c308b817a	Change suser_xxx() to suser() where it applies.	1999-04-27 12:21:16 +00:00
Poul-Henning Kamp	f711d546d2	Suser() simplification: 1: s/suser/suser_xxx/ 2: Add new function: suser(struct proc ), prototyped in <sys/proc.h>. 3: s/suser_xxx($[a-zA-Z0-9_]$->p_ucred, \&\1->p_acflag)/suser(\1)/ The remaining suser_xxx() calls will be scrutinized and dealt with later. There may be some unneeded #include <sys/cred.h>, but they are left as an exercise for Bruce. More changes to the suser() API will come along with the "jail" code.	1999-04-27 11:18:52 +00:00
Luoqi Chen	b1028ad122	Hide access to vmspace:vm_pmap with inline function vmspace_pmap(). This is the preparation step for moving pmap storage out of vmspace proper. Reviewed by: Alan Cox <alc@cs.rice.edu> Matthew Dillion <dillon@apollo.backplane.com>	1999-02-19 14:25:37 +00:00
Matthew Dillon	9fdfe602fc	Remove MAP_ENTRY_IS_A_MAP 'share' maps. These maps were once used to attempt to optimize forks but were essentially given-up on due to problems and replaced with an explicit dup of the vm_map_entry structure. Prior to the removal, they were entirely unused.	1999-02-07 21:48:23 +00:00
John Polstra	b7429e253a	Correct a format mismatch on 64-bit architectures. This should fix the erroneous values in the procfs "map" file on the Alpha.	1999-02-05 06:18:54 +00:00
Matthew Dillon	831a80b0d5	Fix warnings in preparation for adding -Wall -Wcast-qual to the kernel compile	1999-01-27 22:42:27 +00:00
Matthew Dillon	1c7c3c6a86	This is a rather large commit that encompasses the new swapper, changes to the VM system to support the new swapper, VM bug fixes, several VM optimizations, and some additional revamping of the VM code. The specific bug fixes will be documented with additional forced commits. This commit is somewhat rough in regards to code cleanup issues. Reviewed by: "John S. Dyson" <root@dyson.iquest.net>, "David Greenman" <dg@root.com>	1999-01-21 08:29:12 +00:00
Peter Wemm	75ba77578f	A partial implementation of the procfs cmdline pseudo-file. This is enough to satisfy things like StarOffice. This is a hack, but doing it properly would be a LOT of work, and would require extensive grovelling around in the user address space to find the argv[]. Obtained from: Mostly from Andrzej Bialecki <abial@nask.pl>.	1999-01-05 03:53:06 +00:00
Archie Cobbs	2127f26023	Examine all occurrences of sprintf(), strcat(), and str[n]cpy() for possible buffer overflow problems. Replaced most sprintf()'s with snprintf(); for others cases, added terminating NUL bytes where appropriate, replaced constants like "16" with sizeof(), etc. These changes include several bug fixes, but most changes are for maintainability's sake. Any instance where it wasn't "immediately obvious" that a buffer overflow could not occur was made safer. Reviewed by: Bruce Evans <bde@zeta.org.au> Reviewed by: Matthew Dillon <dillon@apollo.backplane.com> Reviewed by: Mike Spengler <mks@networkcs.com>	1998-12-04 22:54:57 +00:00
David Greenman	730075613a	Added a second argument, "activate" to the vm_page_unwire() call so that the caller can select either inactive or active queue to put the page on.	1998-10-28 13:37:02 +00:00
Bruce Evans	8994ca3ce9	Removed statically configured mount type numbers (MOUNT_) and all references to them. The change a couple of days ago to ignore these numbers in statically configured vfsconf structs was slightly premature because the cd9660, cfs, devfs, ext2fs, nfs vfs's still used MOUNT_ instead of the number in their vfsconf struct.	1998-09-07 13:17:06 +00:00
Alexander Langer	f35f7d0dfd	Style fixes and a bug fix: don't remove the exit handler if unmount fails. Submitted by: bde	1998-07-27 22:47:17 +00:00
Alexander Langer	3f47ee5c4d	A better solution to the rm_at_exit problem: Register the exit function during first mount. Unregister the exit function at last unmount. Concept by: sef Reviewed by: sef Implemented by: alex	1998-07-27 01:07:01 +00:00
Alexander Langer	ca2be56ff9	Override the default VFS LKM dispatch functions so that a module unload function can be provided (this is necessary to unregister the at_exit handler).	1998-07-25 15:52:44 +00:00
Bruce Evans	a23d65bfc8	Cast pointers to uintptr_t/intptr_t instead of to u_long/long, respectively. Most of the longs should probably have been u_longs, but this changes is just to prevent warnings about casts between pointers and integers of different sizes, not to fix poorly chosen types.	1998-07-15 02:32:35 +00:00
Bruce Evans	ac1e407b32	Fixed printf format errors.	1998-07-11 07:46:16 +00:00
Bruce Evans	96eb19e1a3	Quick fix for type mismatches which were fatal if longs aren't 32 bits. We used a private, wrong, version of `struct dirent' to help break getdirentries(), and we use a silly check that the size of this struct is a power of 2 to help break mount() if getdirentries() would not work. This fix just changes the struct to match `struct dirent' (except for the name length).	1998-07-07 04:08:44 +00:00
Dmitrij Tejblum	6bfc1a02b1	Remove "not hungly" panics. Cookies now used by the linux and ibcs2 emulators. The emulators assume that filesystem may just ignore cookies, and handle this case correctly. So we just ignore cookies. Also sync *_readdir "prototypes" with reality.	1998-06-25 16:54:41 +00:00
Bruce Evans	a395dbb153	Avoid a 64-bit division in procfs_readdir(). Fixed related overflows. Check args using the same expression as in fdesc and kernfs. The check was actually already correct, modulo overflow. It could be tightened up to either allow huge (aligned) offsets, treating them as EOF, or disallow all offsets beyond EOF. Didn't fix invalid address calculation &foo[i] where i may be out of bounds. Didn't fix shooting of foot using a private unportable dirent struct.	1998-06-14 12:53:39 +00:00
Peter Wemm	7a204420d3	Don't silently accept attempts to change flags where they are not supported.	1998-06-10 06:34:57 +00:00
Doug Rabson	ecbb00a262	This commit fixes various 64bit portability problems required for FreeBSD/alpha. The most significant item is to change the command argument to ioctl functions from int to u_long. This change brings us inline with various other BSD versions. Driver writers may like to use (__FreeBSD_version == 300003) to detect this change. The prototype FreeBSD/alpha machdep will follow in a couple of days time.	1998-06-07 17:13:14 +00:00
Tor Egge	afc6ea238f	Disallow reading the current kernel stack. Only the user structure and the current registers should be accessible. Reviewed by: David Greenman <dg@root.com>	1998-05-19 00:00:14 +00:00
Mike Smith	79cc756d8b	As described by the submitter: Reverse the VFS_VRELE patch. Reference counting of vnodes does not need to be done per-fs. I noticed this while fixing vfs layering violations. Doing reference counting in generic code is also the preference cited by John Heidemann in recent discussions with him. The implementation of alternative vnode management per-fs is still a valid requirement for some filesystems but will be revisited sometime later, most likely using a different framework. Submitted by: Michael Hancock <michaelh@cet.co.jp>	1998-05-06 05:29:41 +00:00
John Dyson	c0877f103f	Tighten up management of memory and swap space during map allocation, deallocation cycles. This should provide a measurable improvement on swap and memory allocation on loaded systems. It is unlikely a complete solution. Also, provide more map info with procfs. Chuck Cranor spurred on this improvement.	1998-04-29 04:28:22 +00:00
Dag-Erling Smørgrav	dc73342347	Seventy-odd "its" / "it's" typos in comments fixed as per kern/6108.	1998-04-17 22:37:19 +00:00
Poul-Henning Kamp	a0502b19d4	Add two new functions, get{micro\|nano}time. They are atomic, but return in essence what is in the "time" variable. gettime() is now a macro front for getmicrotime(). Various patches to use the two new functions instead of the various hacks used in their absence. Some puntuation and grammer patches from Bruce. A couple of XXX comments.	1998-03-26 20:54:05 +00:00
Mike Smith	34bdbbd0de	The intent is to get rid of WILLRELE in vnode_if.src by making a complement to all ops that return a vpp, VFS_VRELE. This is initially only for file systems that implement the following ops that do a WILLRELE: vop_create, vop_whiteout, vop_mknod, vop_remove, vop_link, vop_rename, vop_mkdir, vop_rmdir, vop_symlink This is initial DNA that doesn't do anything yet. VFS_VRELE is implemented but not called. A default vfs_vrele was created for fs implementations that use the standard vnode management routines. VFS_VRELE implementations were made for the following file systems: Standard (vfs_vrele) ffs mfs nfs msdosfs devfs ext2fs Custom union umapfs Just EOPNOTSUPP fdesc procfs kernfs portal cd9660 These implementations may change as VOP changes are implemented. In the next phase, in the vop implementations calls to vrele and the vrele part of vput will be moved to the top layer vfs_vnops and made visible to all layers. vput will be replaced by unlock in these cases. Unlocking will still be done in the per fs layer but the refcount decrement will be triggered at the top because it doesn't hurt to hold a vnode reference a little longer. This will have minimal impact on the structure of the existing code. This will only be done for vnode arguments that are released by the various fs vop implementations. Wider use of VFS_VRELE will likely require restructuring of the code. Reviewed by: phk, dyson, terry et. al. Submitted by: Michael Hancock <michaelh@cet.co.jp>	1998-03-01 22:46:53 +00:00
Eivind Eklund	303b270b0a	Staticize.	1998-02-09 06:11:36 +00:00
Eivind Eklund	0b08f5f737	Back out DIAGNOSTIC changes.	1998-02-06 12:14:30 +00:00
Eivind Eklund	47cfdb166d	Turn DIAGNOSTIC into a new-style option.	1998-02-04 22:34:03 +00:00
John Dyson	2d8acc0f4a	VM level code cleanups. 1) Start using TSM. Struct procs continue to point to upages structure, after being freed. Struct vmspace continues to point to pte object and kva space for kstack. u_map is now superfluous. 2) vm_map's don't need to be reference counted. They always exist either in the kernel or in a vmspace. The vmspaces are managed by reference counts. 3) Remove the "wired" vm_map nonsense. 4) No need to keep a cache of kernel stack kva's. 5) Get rid of strange looking ++var, and change to var++. 6) Change more data structures to use our "zone" allocator. Added struct proc, struct vmspace and struct vnode. This saves a significant amount of kva space and physical memory. Additionally, this enables TSM for the zone managed memory. 7) Keep ioopt disabled for now. 8) Remove the now bogus "single use" map concept. 9) Use generation counts or id's for data structures residing in TSM, where it allows us to avoid unneeded restart overhead during traversals, where blocking might occur. 10) Account better for memory deficits, so the pageout daemon will be able to make enough memory available (experimental.) 11) Fix some vnode locking problems. (From Tor, I think.) 12) Add a check in ufs_lookup, to avoid lots of unneeded calls to bcmp. (experimental.) 13) Significantly shrink, cleanup, and make slightly faster the vm_fault.c code. Use generation counts, get rid of unneded collpase operations, and clean up the cluster code. 14) Make vm_zone more suitable for TSM. This commit is partially as a result of discussions and contributions from other people, including DG, Tor Egge, PHK, and probably others that I have forgotten to attribute (so let me know, if I forgot.) This is not the infamous, final cleanup of the vnode stuff, but a necessary step. Vnode mgmt should be correct, but things might still change, and there is still some missing stuff (like ioopt, and physical backing of non-merged cache files, debugging of layering concepts.)	1998-01-22 17:30:44 +00:00
John Dyson	95e5e988e0	Make our v_usecount vnode reference count work identically to the original BSD code. The association between the vnode and the vm_object no longer includes reference counts. The major difference is that vm_object's are no longer freed gratuitiously from the vnode, and so once an object is created for the vnode, it will last as long as the vnode does. When a vnode object reference count is incremented, then the underlying vnode reference count is incremented also. The two "objects" are now more intimately related, and so the interactions are now much less complex. When vnodes are now normally placed onto the free queue with an object still attached. The rundown of the object happens at vnode rundown time, and happens with exactly the same filesystem semantics of the original VFS code. There is absolutely no need for vnode_pager_uncache and other travesties like that anymore. A side-effect of these changes is that SMP locking should be much simpler, the I/O copyin/copyout optimizations work, NFS should be more ponderable, and further work on layered filesystems should be less frustrating, because of the totally coherent management of the vnode objects and vnodes. Please be careful with your system while running this code, but I would greatly appreciate feedback as soon a reasonably possible.	1998-01-06 05:26:17 +00:00
Sean Eric Fagan	dd30ff81d9	Use CHECKIO in procfs_ioctl() to ensure that any changes in UID/GID result in the expected failure.	1998-01-06 01:37:12 +00:00
Bruce Evans	a954e88d0b	Fixed a missing/misplaced/misstyled prototype.	1997-12-30 08:46:44 +00:00
Bruce Evans	675ea6f083	Unspammed nested include of <vm/vm_zone.h>.	1997-12-27 02:56:39 +00:00
Sean Eric Fagan	d5f81602a7	Clear the p_stops field on change of user/group id, unless the correct flag is set in the p_pfsflags field. This, essentially, prevents an SUID proram from hanging after being traced. (E.g., "truss /usr/bin/rlogin" would fail, but leave rlogin in a stopevent state.) Yet another case where procctl is (hopefully ;)) no longer needed in the general case. Reviewed by: bde (thanks bruce :))	1997-12-20 03:05:47 +00:00
Sean Eric Fagan	d7b7dcba41	Change the ioctls for procfs around a bit; in particular, whever possible, change from ioctl(fd, PIOC<foo>, &i); to ioctl(fd, PIOC<foo>, i); This is going from the _IOW to _IO ioctl macro. The kernel, procctl, and truss must be in synch for it all to work (not doing so will get errors about inappropriate ioctl's, fortunately). Hopefully I didn't forget anything :).	1997-12-13 03:13:49 +00:00
Sean Eric Fagan	ba9d19e99b	Fix a problem with procfs_exit() that resulted in missing some procfs nodes; this also apparantly caused a panic in some circumstances. Also, since procfs_exit() is getting rid of the nodes when a process exits, don't bother checking for the process' existance in procfs_inactive().	1997-12-12 03:33:43 +00:00
Sean Eric Fagan	7486d3658f	Code to prevent a panic caused by procfs_exit(). Note that i don't know what is teh root cause -- but, sometimes, a procfs vnode in pfshead is apparantly corrupt (or a UFS vnode instead). Without this patch, I can get it to panic by doing (in csh) while (1) ps auxwww end and it will panic when the PID's wrap. With it, it does not panic. Yes -- I know that this is NOT the right way to fix it. But I haven't been able to get it to panic yet (which confuses me). I am going to be looking into the vgone() code now, as that may be a part of it.	1997-12-09 05:03:41 +00:00
Sean Eric Fagan	bfcfe2e1b1	A couple of fixes from bruce: first of all, psignal is a void (stupid me; unfortunately, also makes it hard ot check for errors); second, I had managed to forget a change to PIOCSFL (it should be _IOW, not _IOR) I had in my local copy, and Bruce called me on it. Submitted by: bde	1997-12-08 22:09:39 +00:00
Sean Eric Fagan	847e5f5f9a	Use at_exit() to invoke procfs_exit() instead of calling it directly. Note that an unload facility should be used to call rm_at_exit() (if procfs is being loaded as an LKM and is subsequently removed), but it was non-obvious how to do this in the VFS framework. Reviewed by: Julian Elischer	1997-12-08 01:06:36 +00:00
Sean Eric Fagan	d99616387e	Clear the stop events and wakeup the process on teh last close of the procfs/mem file. While this doesn't prevent an unkillable process, it means that a broken truss prorgam won't do it accidently now (well, there's a small window of opportunity). Note that this requires the change to truss I am about to commit.	1997-12-07 04:01:03 +00:00
Sean Eric Fagan	2a024a2b05	Changes to allow event-based process monitoring and control.	1997-12-06 04:11:14 +00:00
Bruce Evans	1cd52ec333	Don't include <sys/lock.h> in headers when only `struct simplelock' is required. Fixed everything that depended on the pollution.	1997-12-05 19:55:52 +00:00
Tor Egge	b872e9c03f	Don't try to obtain an excluive lock on the vm map, since a deadlock might occur if the process owning the map is wiring pages.	1997-11-14 22:57:46 +00:00
Poul-Henning Kamp	4a11ca4e29	Remove a bunch of variables which were unused both in GENERIC and LINT. Found by: -Wunused	1997-11-07 08:53:44 +00:00
Bruce Evans	521166c5b2	KNFize rev.1.31.	1997-10-27 15:39:01 +00:00
Bruce Evans	ef91bd5734	Removed unused #includes. The need for most of them went away with recent changes (docluster* and vfs improvements).	1997-10-27 13:33:47 +00:00
Poul-Henning Kamp	dba3870c10	VFS interior redecoration. Rename vn_default_error to vop_defaultop all over the place. Move vn_bwrite from vfs_bio.c to vfs_default.c and call it vop_stdbwrite. Use vop_null instead of nullop. Move vop_nopoll from vfs_subr.c to vfs_default.c Move vop_sharedlock from vfs_subr.c to vfs_default.c Move vop_nolock from vfs_subr.c to vfs_default.c Move vop_nounlock from vfs_subr.c to vfs_default.c Move vop_noislocked from vfs_subr.c to vfs_default.c Use vop_ebadf instead of *_ebadf. Add vop_defaultop for getpages on master vnode in MFS.	1997-10-26 20:55:39 +00:00
Poul-Henning Kamp	e9565321ea	VFS clean up "hekto commit" 1. Add defaults for more VOPs VOP_LOCK vop_nolock VOP_ISLOCKED vop_noislocked VOP_UNLOCK vop_nounlock and remove direct reference in filesystems. 2. Rename the nfsv2 vnop tables to improve sorting order.	1997-10-16 22:01:05 +00:00
Poul-Henning Kamp	987f569678	Another VFS cleanup "kilo commit" 1. Remove VOP_UPDATE, it is (also) an UFS/{FFS,LFS,EXT2FS,MFS} intereface function, and now lives in the ufsmount structure. 2. Remove VOP_SEEK, it was unused. 3. Add mode default vops: VOP_ADVLOCK vop_einval VOP_CLOSE vop_null VOP_FSYNC vop_null VOP_IOCTL vop_enotty VOP_MMAP vop_einval VOP_OPEN vop_null VOP_PATHCONF vop_einval VOP_READLINK vop_einval VOP_REALLOCBLKS vop_eopnotsupp And remove identical functionality from filesystems 4. Add vop_stdpathconf, which returns the canonical stuff. Use it in the filesystems. (XXX: It's probably wrong that specfs and fifofs sets this vop, shouldn't it come from the "host" filesystem, for instance ufs or cd9660 ?) 5. Try to make system wide VOP functions have vop_* names. 6. Initialize the um_* vectors in LFS. (Recompile your LKMS!!!)	1997-10-16 20:32:40 +00:00
Poul-Henning Kamp	cec0f20ce7	VFS mega cleanup commit (x/N) 1. Add new file "sys/kern/vfs_default.c" where default actions for VOPs go. Implement proper defaults for ABORTOP, BWRITE, LEASE, POLL, REVOKE and STRATEGY. Various stuff spread over the entire tree belongs here. 2. Change VOP_BLKATOFF to a normal function in cd9660. 3. Kill VOP_BLKATOFF, VOP_TRUNCATE, VOP_VFREE, VOP_VALLOC. These are private interface functions between UFS and the underlying storage manager layer (FFS/LFS/MFS/EXT2FS). The functions now live in struct ufsmount instead. 4. Remove a kludge of VOP_ functions in all filesystems, that did nothing but obscure the simplicity and break the expandability. If a filesystem doesn't implement VOP_FOO, it shouldn't have an entry for it in its vnops table. The system will try to DTRT if it is not implemented. There are still some cruft left, but the bulk of it is done. 5. Fix another VCALL in vfs_cache.c (thanks Bruce!)	1997-10-16 10:50:27 +00:00
Poul-Henning Kamp	6a525123aa	Hmm, realign the vnops into two columns.	1997-10-15 10:05:29 +00:00
Poul-Henning Kamp	539ef70c2d	Stylistic overhaul of vnops tables. 1. Remove comment stating the blatantly obvious. 2. Align in two columns. 3. Sort all but the default element alphabetically. 4. Remove XXX comments pointing out entries not needed.	1997-10-15 09:22:02 +00:00
John Dyson	99448ed11d	Change the M_NAMEI allocations to use the zone allocator. This change plus the previous changes to use the zone allocator decrease the useage of malloc by half. The Zone allocator will be upgradeable to be able to use per CPU-pools, and has more intelligent usage of SPLs. Additionally, it has reasonable stats gathering capabilities, while making most calls inline.	1997-09-21 04:24:27 +00:00
Peter Wemm	a6aeade2c4	Convert select -> poll. Delete 'always succeed' select/poll handlers, replaced with generic call. Flag missing vnode op table entries.	1997-09-14 02:58:12 +00:00
Bruce Evans	bea0f0be7b	Some staticized variables were still declared to be extern.	1997-09-07 05:27:26 +00:00
Garrett Wollman	57bf258e3d	Fix all areas of the system (or at least all those in LINT) to avoid storing socket addresses in mbufs. (Socket buffers are the one exception.) A number of kernel APIs needed to get fixed in order to make this happen. Also, fix three protocol families which kept PCBs in mbufs to not malloc them instead. Delete some old compatibility cruft while we're at it, and add some new routines in the in_cksum family.	1997-08-16 19:16:27 +00:00
Sean Eric Fagan	66ad7973ce	Check permissions for fp regs as well as normal regs.	1997-08-12 05:23:51 +00:00
Sean Eric Fagan	49356a1f4a	Fix procfs security hole -- check permissions on meaningful I/Os (namely, reading/writing of mem and regs). Also have to check for the requesting process being group KMEM -- this is a bit of a hack, but ps et al need it. Reviewed by: davidg	1997-08-12 04:34:30 +00:00
Bruce Evans	1fd0b0588f	Removed unused #includes.	1997-08-02 14:33:27 +00:00
Alexander Langer	2c39c8177b	Style fix my previous commit.	1997-06-26 16:12:53 +00:00
Alexander Langer	8a77722ad7	Block all write operations to /proc/1/* when securelevel > 0. The additional check in procfs_ctl.c could be backed out, but I'm leaving it in for good measure. Reviewed by: Theo de Raadt <deraadt@OpenBSD.org>	1997-06-21 16:09:49 +00:00
Alexander Langer	898fcb3b84	Removed bogon from previous commit: doubly included sys/systm.h.	1997-04-27 21:32:21 +00:00
Alexander Langer	ee7877dfec	Prevent debugger attachment to init when securelevel > 0. Noticed by: Brian Buchanan <brian@wasteland.calbbs.com>	1997-04-27 19:02:37 +00:00
John Dyson	7cac787bee	Fix both a problem with accessing backing objects, and also release the process map on nonexistant pages. PR: kern/3327 Submitted by: Tor Egge <Tor.Egge@idi.ntnu.no>	1997-04-20 17:12:11 +00:00
John Dyson	a04c970a7a	Fix the gdb executable modify problem. Thanks to the detective work by Alan Cox <alc@cs.rice.edu>, and his description of the problem. The bug was primarily in procfs_mem, but the mistake likely happened due to the lack of vm system support for the operation. I added better support for selective marking of page dirty flags so that vm_map_pageable(wiring) will not cause this problem again. The code in procfs_mem is now less bogus (but maybe still a little so.)	1997-04-06 02:29:45 +00:00
Bruce Evans	fce002fdef	Don't include <sys/ioctl.h> in the kernel. Stage 1: don't include it when it is not used. In most cases, the reasons for including it went away when the special ioctl headers became self-sufficient.	1997-03-24 11:25:10 +00:00
Bruce Evans	90106ad38b	Fixed missing initialisation of vp->v_type for types Pfile and Pmem in procfs_allocvp(). This fixes at least stat() of /proc//mem. stat() of /proc//file already worked. I think procfs_allocvp() isn't actually called for type Pfile.	1997-03-08 16:06:34 +00:00
Bruce Evans	a3d13e5054	Fixed procfs's locking vops. They were missed in the Lite2 merge, partly because the #define's for them were moved to a different file. At least the null VOP_LOCK() no longer works, since vclean() expects VOP_LOCK( ..., LK_DRAIN \| LK_INTERLOCK, ...) to clear the interlock. This probably only matters when simple_lock() is not null, i.e., when there are multiple CPUs or SIMPLELOCK_DEBUG is defined.	1997-02-24 16:44:11 +00:00
Peter Wemm	6875d25465	Back out part 1 of the MCFH that changed $Id$ to $FreeBSD$. We are not ready for it yet.	1997-02-22 09:48:43 +00:00
Mike Pritchard	724ab19569	Add function prototypes for most of the new Lite2 functions. Also made a few of the miscfs routines static to be consistent. Some modules simply required some additional #includes to remove -Wall warnings.	1997-02-12 06:52:51 +00:00
John Dyson	996c772f58	This is the kernel Lite/2 commit. There are some requisite userland changes, so don't expect to be able to run the kernel as-is (very well) without the appropriate Lite/2 userland changes. The system boots and can mount UFS filesystems. Untested: ext2fs, msdosfs, NFS Known problems: Incorrect Berkeley ID strings in some files. Mount_std mounts will not work until the getfsent library routine is changed. Reviewed by: various people Submitted by: Jeffery Hsu <hsu@freebsd.org>	1997-02-10 02:22:35 +00:00
John Dyson	afa07f7e83	Change the map entry flags from bitfields to bitmasks. Allows for some code simplification.	1997-01-16 04:16:22 +00:00
Jordan K. Hubbard	1130b656e5	Make the long-awaited change from $Id$ to $FreeBSD$ This will make a number of things easier in the future, as well as (finally!) avoiding the Id-smashing problem which has plagued developers for so long. Boy, I'm glad we're not using sup anymore. This update would have been insane otherwise.	1997-01-14 07:20:47 +00:00
John Dyson	d700038439	Fix a potential deadlock from the previous commit.	1996-10-30 03:52:57 +00:00
John Dyson	56673451a7	Fix the /proc/???/map file so that it is possible to read an arbitrarily large process map. Another commit will follow to fix a problem just found during this one... Sorry!!! :-(.	1996-10-30 03:45:00 +00:00
John Dyson	c864a7c0a9	Fix setting breakpoints in shared regions.	1996-10-24 02:47:05 +00:00
Bruce Evans	b71fec07db	Eliminated nested include of <sys/unistd.h> in <sys/file.h> in the kernel. Include it directly in the few places where it is used. Reduced some #includes of <sys/file.h> to #includes of <sys/fcntl.h> or nothing.	1996-09-03 14:25:27 +00:00
Bruce Evans	e0c95ed947	Fixed the easy cases of const poisoning in the kernel. Cosmetic.	1996-08-31 16:52:44 +00:00
John Dyson	3c087a2f30	Modify slightly the output from the map file in /proc. Now the executable bit is shown.	1996-07-27 19:47:04 +00:00
John Dyson	34e95a26ec	Under certain circumstances, reading the /proc/*/map file can crash the system. Nonexistant objects were not handled correctly.	1996-07-27 18:28:10 +00:00
John Dyson	688bbd5b76	Implement locking for pfs nodes, when at the leaf. Concurrent access to information from a single process causes hangs. Specifically, this fixes problems (hangs) with concurrent ps commands, when the system is under heavy memory load. Reviewed by: davidg	1996-07-02 13:38:10 +00:00
John Dyson	c6c4b08e59	Fix a serious problem, with a window where an object lock is needed, but not there. The extent of the object lock is expanded to be over the range that it is needed. Additionally, clean up the code so that it conforms to better coding style.	1996-07-02 01:40:52 +00:00
John Dyson	32acf9a16a	Add procfs_type.c to the repository.	1996-06-18 05:22:45 +00:00
John Dyson	6ead3edd9c	Clean-up the new VM map procfs code, and also add support for executable format file "etype". It contains a description of the binary type for a process.	1996-06-18 05:16:00 +00:00
John Dyson	975dcaa94f	This file is the "meat" of the process address space capability. If you would like other things added, just ask!!! It might be pretty easy to add.	1996-06-17 22:53:27 +00:00
John Dyson	9353aadd89	Add a feature to procfs to allow display of the process address map with multiple entries as follows: start address, end address, resident pages in range, private pages in range, RW/RO, COW or not, (vnode/device/swap/default).	1996-06-17 22:43:36 +00:00
David Greenman	2f9bae59d6	Moved the fsnode MALLOC to before the call to getnewvnode() so that the process won't possibly block before filling in the fsnode pointer (v_data) which might be dereferenced during a sync since the vnode is put on the mnt_vnodelist by getnewvnode. Pointed out by Matt Day <mday@artisoft.com>	1996-06-12 03:37:57 +00:00
John Dyson	dc48d43a6c	Properly lock the vm space when accessing the memory in a process. This fix could solve some "interesting" problems that could happen during process rundown.	1996-06-11 23:52:27 +00:00
Jeffrey Hsu	5a18c60a63	For Lite2: proc LIST changes. Reviewed by: davidg & bde	1996-03-11 06:13:09 +00:00
Wolfram Schneider	1d08058f65	add ruid and rgid to file 'status'	1996-02-02 05:19:20 +00:00
Peter Wemm	f863805df0	This time, really make the procfs work when reading stuff from the UPAGES. This is a really ugly bandaid on the problem, but it works well enough for 'ps -u' to start working again. The problem was caused by the user address space shrinking by a little bit and the UPAGES being "cast off" to become a seperate entity rather than being at the top of the process's vmspace. That optimization was part of John's most recent VM speedups. Now, rather than decoding the VM space, it merely ensures the pages are in core and accesses them the same way the ptrace(PT_READ_U..) code does, ie: off the p->p_addr pointer.	1996-01-25 06:05:38 +00:00
Peter Wemm	0cc7521383	Major fixes for procfs.. Implement a "variable" directory structure. Files that do not make sense for the given process do not "appear" and cannot be opened. For example, "system" processes do not have "file", "regs" or "fpregs", because they do not have a user area. "attempt" to fill in the user area of a given process when it is being accessed via /proc/pid/mem (the user struct is just after VM_MAXUSER_ADDRESS in the process address space.) Dont do IO to the U area while it's swapped, hold it in place if possible. Lock off access to the "ctl" file if it's done a setuid like the other pseudo-files in there.	1996-01-24 18:41:41 +00:00
John Dyson	bd7e5f992e	Eliminated many redundant vm_map_lookup operations for vm_mmap. Speed up for vfs_bio -- addition of a routine bqrelse to greatly diminish overhead for merged cache. Efficiency improvement for vfs_cluster. It used to do alot of redundant calls to cluster_rbuild. Correct the ordering for vrele of .text and release of credentials. Use the selective tlb update for 486/586/P6. Numerous fixes to the size of objects allocated for files. Additionally, fixes in the various pagers. Fixes for proper positioning of vnode_pager_setsize in msdosfs and ext2fs. Fixes in the swap pager for exhausted resources. The pageout code will not as readily thrash. Change the page queue flags (PG_ACTIVE, PG_INACTIVE, PG_FREE, PG_CACHE) into page queue indices (PQ_ACTIVE, PQ_INACTIVE, PQ_FREE, PQ_CACHE), thereby improving efficiency of several routines. Eliminate even more unnecessary vm_page_protect operations. Significantly speed up process forks. Make vm_object_page_clean more efficient, thereby eliminating the pause that happens every 30seconds. Make sequential clustered writes B_ASYNC instead of B_DELWRI even in the case of filesystems mounted async. Fix a panic with busy pages when write clustering is done for non-VMIO buffers.	1996-01-19 04:00:31 +00:00
Bruce Evans	f2c6b65bab	Fixed 1TB filesize changes. Some pindexes had bogus names and types but worked because vm_pindex_t is indistinuishable from vm_offset_t.	1995-12-17 07:19:58 +00:00
John Dyson	a316d390bd	Changes to support 1Tb filesizes. Pages are now named by an (object,index) pair instead of (object,offset) pair.	1995-12-11 04:58:34 +00:00
David Greenman	efeaf95a41	Untangled the vm.h include file spaghetti.	1995-12-07 12:48:31 +00:00
Bruce Evans	9b5e8b3aa5	Added prototypes. Removed some unnecessary #includes.	1995-12-03 14:54:48 +00:00
Bruce Evans	fcf985ad5d	Moved declarations for static functions to the correct place (not in a header). Removed stupid comments.	1995-11-16 11:39:11 +00:00
Bruce Evans	d77206b33b	Fixed the type of procfs_sync(). Trailing args were missing. Fixed the type of procfs_fhtovp(). The args had little resemblance to the correct ones. Added prototypes.	1995-11-16 11:34:51 +00:00
Bruce Evans	f57e65478d	Introduced a type `vop_t' for vnode operation functions and used it 1138 times (:-() in casts and a few more times in declarations. This change is null for the i386. The type has to be `typedef int vop_t(void *)' and not `typedef int vop_t()' because `gcc -Wstrict-prototypes' warns about the latter. Since vnode op functions are called with args of different (struct pointer) types, neither of these function types is any use for type checking of the arg, so it would be preferable not to use the complete function type, especially since using the complete type requires adding 1138 casts to avoid compiler warnings and another 40+ casts to reverse the function pointer conversions before calling the functions.	1995-11-09 08:17:23 +00:00
Poul-Henning Kamp	159a024357	Make a lot of private stuff static. Should anybody out there wonder about this vendetta against global variables, it is basically to make it more visible what our interfaces in the kernel really are. I'm almost convinced we should have a #define PUBLIC /* public interface */ and use it in the #includes...	1995-11-07 13:39:31 +00:00

1 2 3 4 5 ...

279 Commits