freebsd-skq

Author	SHA1	Message	Date
phk	8e3c3eafed	useracc() the prequel: Merge the contents (less some trivial bordering the silly comments) of <vm/vm_prot.h> and <vm/vm_inherit.h> into <vm/vm.h>. This puts the #defines for the vm_inherit_t and vm_prot_t types next to their typedefs. This paves the road for the commit to follow shortly: change useracc() to use VM_PROT_{READ\|WRITE} rather than B_{READ\|WRITE} as argument.	1999-10-29 18:09:36 +00:00
marcel	d5e8d714b9	sigset_t change (part 2 of 5) ----------------------------- The core of the signalling code has been rewritten to operate on the new sigset_t. No methodological changes have been made. Most references to a sigset_t object are through macros (see signalvar.h) to create a level of abstraction and to provide a basis for further improvements. The NSIG constant has not been changed to reflect the maximum number of signals possible. The reason is that it breaks programs (especially shells) which assume that all signals have a non-null name in sys_signame. See src/bin/sh/trap.c for an example. Instead _SIG_MAXSIG has been introduced to hold the maximum signal possible with the new sigset_t. struct sigprop has been moved from signalvar.h to kern_sig.c because a) it is only used there, and b) access must be done though function sigprop(). The latter because the table doesn't holds properties for all signals, but only for the first NSIG signals. signal.h has been reorganized to make reading easier and to add the new and/or modified structures. The "old" structures are moved to signalvar.h to prevent namespace polution. Especially the coda filesystem suffers from the change, because it contained lines like (p->p_sigmask == SIGIO), which is easy to do for integral types, but not for compound types. NOTE: kdump (and port linux_kdump) must be recompiled. Thanks to Garrett Wollman and Daniel Eischen for pressing the importance of changing sigreturn as well.	1999-09-29 15:03:48 +00:00
alfred	b9136a6115	Seperate the export check in VFS_FHTOVP, exports are now checked via VFS_CHECKEXP. Add fh(open\|stat\|stafs) syscalls to allow userland to query filesystems based on (network) filehandle. Obtained from: NetBSD	1999-09-11 00:46:08 +00:00
alfred	e16a3900a7	All unimplemented VFS ops now have entries in kern/vfs_default.c that return reasonable defaults. This avoids confusing and ugly casting to eopnotsupp or making dummy functions. Bogus casting of filesystem sysctls to eopnotsupp() have been removed. This should make *_vfsops.c more readable and reduce bloat. Reviewed by: msmith, eivind Approved by: phk Tested by: Jeroen Ruigrok/Asmodai <asmodai@wxs.nl>	1999-09-07 22:42:38 +00:00
peter	3b842d34e8	$Id$ -> $FreeBSD$	1999-08-28 01:08:13 +00:00
marcel	44e41869e3	Let processes retrieve their argv through procfs. Revert to the original behaviour in all other cases. Submitted by: Andrew Gordon <arg@arg1.demon.co.uk>	1999-08-19 19:41:08 +00:00
bde	557aa1ee53	Fixed printf format errors (%qu -> %llu; the arg was already unsigned long long to hide problems on alphas).	1999-08-08 13:43:51 +00:00
phk	841e614610	Allow jailed proccesses to open non-process vnodes like the root of the fs.	1999-07-09 21:31:44 +00:00
peter	4c303ef929	Use %q rather than rolling a custom routine.	1999-07-09 17:56:59 +00:00
jlemon	68eae7f53f	Support for i386 hardware breakpoints. Submitted by: Brian Dean <brdean@unx.sas.com>	1999-07-09 04:18:32 +00:00
jlemon	fb9048b07b	Implement support for hardware debug registers on the i386. Submitted by: Brian Dean <brdean@unx.sas.com>	1999-07-09 04:16:00 +00:00
phk	b4c0c39b6e	Eliminate the bogus procfs private almost struct dirent structure. Spotted by: Lars Hamren Reviewed by: bde	1999-06-13 20:53:16 +00:00
dt	20219ea966	Don't call calcru() on a swapped-out process. calcru() access p_stats, which is in U-area.	1999-05-22 20:10:31 +00:00
phk	29d1bb2312	Make the type and map files claim 0 bytes size. Tar doesn't get confused now, but doesn't store any data eiter. I wonder if we shouldn't claim to be fifos instead...	1999-05-04 08:01:55 +00:00
phk	49cc1d7cc9	Add even more () to CHECKIO which by now feels positively LISPish. Submitted by: bde Reviewed by: phk	1999-05-04 08:00:10 +00:00
phk	22dd8b1a32	Add a new "file" to procfs: "rlimit" which shows the resource limits for the process. PR: 11342 Submitted by: Adrian Chadd adrian@freebsd.org Reviewed by: phk	1999-04-30 13:04:21 +00:00
phk	ca21a25f17	This Implements the mumbled about "Jail" feature. This is a seriously beefed up chroot kind of thing. The process is jailed along the same lines as a chroot does it, but with additional tough restrictions imposed on what the superuser can do. For all I know, it is safe to hand over the root bit inside a prison to the customer living in that prison, this is what it was developed for in fact: "real virtual servers". Each prison has an ip number associated with it, which all IP communications will be coerced to use and each prison has its own hostname. Needless to say, you need more RAM this way, but the advantage is that each customer can run their own particular version of apache and not stomp on the toes of their neighbors. It generally does what one would expect, but setting up a jail still takes a little knowledge. A few notes: I have no scripts for setting up a jail, don't ask me for them. The IP number should be an alias on one of the interfaces. mount a /proc in each jail, it will make ps more useable. /proc/<pid>/status tells the hostname of the prison for jailed processes. Quotas are only sensible if you have a mountpoint per prison. There are no privisions for stopping resource-hogging. Some "#ifdef INET" and similar may be missing (send patches!) If somebody wants to take it from here and develop it into more of a "virtual machine" they should be most welcome! Tools, comments, patches & documentation most welcome. Have fun... Sponsored by: http://www.rndassociates.com/ Run for almost a year by: http://www.servetheweb.com/	1999-04-28 11:38:52 +00:00
phk	c85ea5017c	Change suser_xxx() to suser() where it applies.	1999-04-27 12:21:16 +00:00
phk	16e3fbd2c1	Suser() simplification: 1: s/suser/suser_xxx/ 2: Add new function: suser(struct proc ), prototyped in <sys/proc.h>. 3: s/suser_xxx($[a-zA-Z0-9_]$->p_ucred, \&\1->p_acflag)/suser(\1)/ The remaining suser_xxx() calls will be scrutinized and dealt with later. There may be some unneeded #include <sys/cred.h>, but they are left as an exercise for Bruce. More changes to the suser() API will come along with the "jail" code.	1999-04-27 11:18:52 +00:00
luoqi	082d37c1ac	Hide access to vmspace:vm_pmap with inline function vmspace_pmap(). This is the preparation step for moving pmap storage out of vmspace proper. Reviewed by: Alan Cox <alc@cs.rice.edu> Matthew Dillion <dillon@apollo.backplane.com>	1999-02-19 14:25:37 +00:00
dillon	98732ec693	Remove MAP_ENTRY_IS_A_MAP 'share' maps. These maps were once used to attempt to optimize forks but were essentially given-up on due to problems and replaced with an explicit dup of the vm_map_entry structure. Prior to the removal, they were entirely unused.	1999-02-07 21:48:23 +00:00
jdp	de45986f5a	Correct a format mismatch on 64-bit architectures. This should fix the erroneous values in the procfs "map" file on the Alpha.	1999-02-05 06:18:54 +00:00
dillon	dbf5cd2b57	Fix warnings in preparation for adding -Wall -Wcast-qual to the kernel compile	1999-01-27 22:42:27 +00:00
dillon	df24433bbe	This is a rather large commit that encompasses the new swapper, changes to the VM system to support the new swapper, VM bug fixes, several VM optimizations, and some additional revamping of the VM code. The specific bug fixes will be documented with additional forced commits. This commit is somewhat rough in regards to code cleanup issues. Reviewed by: "John S. Dyson" <root@dyson.iquest.net>, "David Greenman" <dg@root.com>	1999-01-21 08:29:12 +00:00
peter	1c3fe295c3	A partial implementation of the procfs cmdline pseudo-file. This is enough to satisfy things like StarOffice. This is a hack, but doing it properly would be a LOT of work, and would require extensive grovelling around in the user address space to find the argv[]. Obtained from: Mostly from Andrzej Bialecki <abial@nask.pl>.	1999-01-05 03:53:06 +00:00
archie	982e80577d	Examine all occurrences of sprintf(), strcat(), and str[n]cpy() for possible buffer overflow problems. Replaced most sprintf()'s with snprintf(); for others cases, added terminating NUL bytes where appropriate, replaced constants like "16" with sizeof(), etc. These changes include several bug fixes, but most changes are for maintainability's sake. Any instance where it wasn't "immediately obvious" that a buffer overflow could not occur was made safer. Reviewed by: Bruce Evans <bde@zeta.org.au> Reviewed by: Matthew Dillon <dillon@apollo.backplane.com> Reviewed by: Mike Spengler <mks@networkcs.com>	1998-12-04 22:54:57 +00:00
dg	20b2c33d9a	Added a second argument, "activate" to the vm_page_unwire() call so that the caller can select either inactive or active queue to put the page on.	1998-10-28 13:37:02 +00:00
bde	e170b2ba75	Removed statically configured mount type numbers (MOUNT_) and all references to them. The change a couple of days ago to ignore these numbers in statically configured vfsconf structs was slightly premature because the cd9660, cfs, devfs, ext2fs, nfs vfs's still used MOUNT_ instead of the number in their vfsconf struct.	1998-09-07 13:17:06 +00:00
alex	d48cc1feee	Style fixes and a bug fix: don't remove the exit handler if unmount fails. Submitted by: bde	1998-07-27 22:47:17 +00:00
alex	be85829abc	A better solution to the rm_at_exit problem: Register the exit function during first mount. Unregister the exit function at last unmount. Concept by: sef Reviewed by: sef Implemented by: alex	1998-07-27 01:07:01 +00:00
alex	4ab63c85e2	Override the default VFS LKM dispatch functions so that a module unload function can be provided (this is necessary to unregister the at_exit handler).	1998-07-25 15:52:44 +00:00
bde	863d5c8b68	Cast pointers to uintptr_t/intptr_t instead of to u_long/long, respectively. Most of the longs should probably have been u_longs, but this changes is just to prevent warnings about casts between pointers and integers of different sizes, not to fix poorly chosen types.	1998-07-15 02:32:35 +00:00
bde	f0b863f4b5	Fixed printf format errors.	1998-07-11 07:46:16 +00:00
bde	c0cb9588c8	Quick fix for type mismatches which were fatal if longs aren't 32 bits. We used a private, wrong, version of `struct dirent' to help break getdirentries(), and we use a silly check that the size of this struct is a power of 2 to help break mount() if getdirentries() would not work. This fix just changes the struct to match `struct dirent' (except for the name length).	1998-07-07 04:08:44 +00:00
dt	c6c8ca45cd	Remove "not hungly" panics. Cookies now used by the linux and ibcs2 emulators. The emulators assume that filesystem may just ignore cookies, and handle this case correctly. So we just ignore cookies. Also sync *_readdir "prototypes" with reality.	1998-06-25 16:54:41 +00:00
bde	a336cb95ff	Avoid a 64-bit division in procfs_readdir(). Fixed related overflows. Check args using the same expression as in fdesc and kernfs. The check was actually already correct, modulo overflow. It could be tightened up to either allow huge (aligned) offsets, treating them as EOF, or disallow all offsets beyond EOF. Didn't fix invalid address calculation &foo[i] where i may be out of bounds. Didn't fix shooting of foot using a private unportable dirent struct.	1998-06-14 12:53:39 +00:00
peter	e18ee5bdc0	Don't silently accept attempts to change flags where they are not supported.	1998-06-10 06:34:57 +00:00
dfr	1d5f38ac22	This commit fixes various 64bit portability problems required for FreeBSD/alpha. The most significant item is to change the command argument to ioctl functions from int to u_long. This change brings us inline with various other BSD versions. Driver writers may like to use (__FreeBSD_version == 300003) to detect this change. The prototype FreeBSD/alpha machdep will follow in a couple of days time.	1998-06-07 17:13:14 +00:00
tegge	9fdbafa2fe	Disallow reading the current kernel stack. Only the user structure and the current registers should be accessible. Reviewed by: David Greenman <dg@root.com>	1998-05-19 00:00:14 +00:00
msmith	c645da3999	As described by the submitter: Reverse the VFS_VRELE patch. Reference counting of vnodes does not need to be done per-fs. I noticed this while fixing vfs layering violations. Doing reference counting in generic code is also the preference cited by John Heidemann in recent discussions with him. The implementation of alternative vnode management per-fs is still a valid requirement for some filesystems but will be revisited sometime later, most likely using a different framework. Submitted by: Michael Hancock <michaelh@cet.co.jp>	1998-05-06 05:29:41 +00:00
dyson	b5a79794cd	Tighten up management of memory and swap space during map allocation, deallocation cycles. This should provide a measurable improvement on swap and memory allocation on loaded systems. It is unlikely a complete solution. Also, provide more map info with procfs. Chuck Cranor spurred on this improvement.	1998-04-29 04:28:22 +00:00
des	396b114475	Seventy-odd "its" / "it's" typos in comments fixed as per kern/6108.	1998-04-17 22:37:19 +00:00
phk	00475b662a	Add two new functions, get{micro\|nano}time. They are atomic, but return in essence what is in the "time" variable. gettime() is now a macro front for getmicrotime(). Various patches to use the two new functions instead of the various hacks used in their absence. Some puntuation and grammer patches from Bruce. A couple of XXX comments.	1998-03-26 20:54:05 +00:00
msmith	950d32131b	The intent is to get rid of WILLRELE in vnode_if.src by making a complement to all ops that return a vpp, VFS_VRELE. This is initially only for file systems that implement the following ops that do a WILLRELE: vop_create, vop_whiteout, vop_mknod, vop_remove, vop_link, vop_rename, vop_mkdir, vop_rmdir, vop_symlink This is initial DNA that doesn't do anything yet. VFS_VRELE is implemented but not called. A default vfs_vrele was created for fs implementations that use the standard vnode management routines. VFS_VRELE implementations were made for the following file systems: Standard (vfs_vrele) ffs mfs nfs msdosfs devfs ext2fs Custom union umapfs Just EOPNOTSUPP fdesc procfs kernfs portal cd9660 These implementations may change as VOP changes are implemented. In the next phase, in the vop implementations calls to vrele and the vrele part of vput will be moved to the top layer vfs_vnops and made visible to all layers. vput will be replaced by unlock in these cases. Unlocking will still be done in the per fs layer but the refcount decrement will be triggered at the top because it doesn't hurt to hold a vnode reference a little longer. This will have minimal impact on the structure of the existing code. This will only be done for vnode arguments that are released by the various fs vop implementations. Wider use of VFS_VRELE will likely require restructuring of the code. Reviewed by: phk, dyson, terry et. al. Submitted by: Michael Hancock <michaelh@cet.co.jp>	1998-03-01 22:46:53 +00:00
eivind	d7a6ab2803	Staticize.	1998-02-09 06:11:36 +00:00
eivind	4547a09753	Back out DIAGNOSTIC changes.	1998-02-06 12:14:30 +00:00
eivind	c552a9a1c3	Turn DIAGNOSTIC into a new-style option.	1998-02-04 22:34:03 +00:00
dyson	197bd655c4	VM level code cleanups. 1) Start using TSM. Struct procs continue to point to upages structure, after being freed. Struct vmspace continues to point to pte object and kva space for kstack. u_map is now superfluous. 2) vm_map's don't need to be reference counted. They always exist either in the kernel or in a vmspace. The vmspaces are managed by reference counts. 3) Remove the "wired" vm_map nonsense. 4) No need to keep a cache of kernel stack kva's. 5) Get rid of strange looking ++var, and change to var++. 6) Change more data structures to use our "zone" allocator. Added struct proc, struct vmspace and struct vnode. This saves a significant amount of kva space and physical memory. Additionally, this enables TSM for the zone managed memory. 7) Keep ioopt disabled for now. 8) Remove the now bogus "single use" map concept. 9) Use generation counts or id's for data structures residing in TSM, where it allows us to avoid unneeded restart overhead during traversals, where blocking might occur. 10) Account better for memory deficits, so the pageout daemon will be able to make enough memory available (experimental.) 11) Fix some vnode locking problems. (From Tor, I think.) 12) Add a check in ufs_lookup, to avoid lots of unneeded calls to bcmp. (experimental.) 13) Significantly shrink, cleanup, and make slightly faster the vm_fault.c code. Use generation counts, get rid of unneded collpase operations, and clean up the cluster code. 14) Make vm_zone more suitable for TSM. This commit is partially as a result of discussions and contributions from other people, including DG, Tor Egge, PHK, and probably others that I have forgotten to attribute (so let me know, if I forgot.) This is not the infamous, final cleanup of the vnode stuff, but a necessary step. Vnode mgmt should be correct, but things might still change, and there is still some missing stuff (like ioopt, and physical backing of non-merged cache files, debugging of layering concepts.)	1998-01-22 17:30:44 +00:00
dyson	cb2800cd94	Make our v_usecount vnode reference count work identically to the original BSD code. The association between the vnode and the vm_object no longer includes reference counts. The major difference is that vm_object's are no longer freed gratuitiously from the vnode, and so once an object is created for the vnode, it will last as long as the vnode does. When a vnode object reference count is incremented, then the underlying vnode reference count is incremented also. The two "objects" are now more intimately related, and so the interactions are now much less complex. When vnodes are now normally placed onto the free queue with an object still attached. The rundown of the object happens at vnode rundown time, and happens with exactly the same filesystem semantics of the original VFS code. There is absolutely no need for vnode_pager_uncache and other travesties like that anymore. A side-effect of these changes is that SMP locking should be much simpler, the I/O copyin/copyout optimizations work, NFS should be more ponderable, and further work on layered filesystems should be less frustrating, because of the totally coherent management of the vnode objects and vnodes. Please be careful with your system while running this code, but I would greatly appreciate feedback as soon a reasonably possible.	1998-01-06 05:26:17 +00:00
sef	082257799e	Use CHECKIO in procfs_ioctl() to ensure that any changes in UID/GID result in the expected failure.	1998-01-06 01:37:12 +00:00
bde	05d3a8c532	Fixed a missing/misplaced/misstyled prototype.	1997-12-30 08:46:44 +00:00
bde	3c1b6940fc	Unspammed nested include of <vm/vm_zone.h>.	1997-12-27 02:56:39 +00:00
sef	f4669f67bc	Clear the p_stops field on change of user/group id, unless the correct flag is set in the p_pfsflags field. This, essentially, prevents an SUID proram from hanging after being traced. (E.g., "truss /usr/bin/rlogin" would fail, but leave rlogin in a stopevent state.) Yet another case where procctl is (hopefully ;)) no longer needed in the general case. Reviewed by: bde (thanks bruce :))	1997-12-20 03:05:47 +00:00
sef	f13ddbc865	Change the ioctls for procfs around a bit; in particular, whever possible, change from ioctl(fd, PIOC<foo>, &i); to ioctl(fd, PIOC<foo>, i); This is going from the _IOW to _IO ioctl macro. The kernel, procctl, and truss must be in synch for it all to work (not doing so will get errors about inappropriate ioctl's, fortunately). Hopefully I didn't forget anything :).	1997-12-13 03:13:49 +00:00
sef	9fd84ad693	Fix a problem with procfs_exit() that resulted in missing some procfs nodes; this also apparantly caused a panic in some circumstances. Also, since procfs_exit() is getting rid of the nodes when a process exits, don't bother checking for the process' existance in procfs_inactive().	1997-12-12 03:33:43 +00:00
sef	f33f94c5ae	Code to prevent a panic caused by procfs_exit(). Note that i don't know what is teh root cause -- but, sometimes, a procfs vnode in pfshead is apparantly corrupt (or a UFS vnode instead). Without this patch, I can get it to panic by doing (in csh) while (1) ps auxwww end and it will panic when the PID's wrap. With it, it does not panic. Yes -- I know that this is NOT the right way to fix it. But I haven't been able to get it to panic yet (which confuses me). I am going to be looking into the vgone() code now, as that may be a part of it.	1997-12-09 05:03:41 +00:00
sef	ed33823bf0	A couple of fixes from bruce: first of all, psignal is a void (stupid me; unfortunately, also makes it hard ot check for errors); second, I had managed to forget a change to PIOCSFL (it should be _IOW, not _IOR) I had in my local copy, and Bruce called me on it. Submitted by: bde	1997-12-08 22:09:39 +00:00
sef	287c0e3604	Use at_exit() to invoke procfs_exit() instead of calling it directly. Note that an unload facility should be used to call rm_at_exit() (if procfs is being loaded as an LKM and is subsequently removed), but it was non-obvious how to do this in the VFS framework. Reviewed by: Julian Elischer	1997-12-08 01:06:36 +00:00
sef	3442d56966	Clear the stop events and wakeup the process on teh last close of the procfs/mem file. While this doesn't prevent an unkillable process, it means that a broken truss prorgam won't do it accidently now (well, there's a small window of opportunity). Note that this requires the change to truss I am about to commit.	1997-12-07 04:01:03 +00:00
sef	c7d273eccb	Changes to allow event-based process monitoring and control.	1997-12-06 04:11:14 +00:00
bde	efd51d84cf	Don't include <sys/lock.h> in headers when only `struct simplelock' is required. Fixed everything that depended on the pollution.	1997-12-05 19:55:52 +00:00
tegge	5ba8a227e1	Don't try to obtain an excluive lock on the vm map, since a deadlock might occur if the process owning the map is wiring pages.	1997-11-14 22:57:46 +00:00
phk	4d26888936	Remove a bunch of variables which were unused both in GENERIC and LINT. Found by: -Wunused	1997-11-07 08:53:44 +00:00
bde	2fa5d41745	KNFize rev.1.31.	1997-10-27 15:39:01 +00:00
bde	e38eb2dd99	Removed unused #includes. The need for most of them went away with recent changes (docluster* and vfs improvements).	1997-10-27 13:33:47 +00:00
phk	e3cdaf12b2	VFS interior redecoration. Rename vn_default_error to vop_defaultop all over the place. Move vn_bwrite from vfs_bio.c to vfs_default.c and call it vop_stdbwrite. Use vop_null instead of nullop. Move vop_nopoll from vfs_subr.c to vfs_default.c Move vop_sharedlock from vfs_subr.c to vfs_default.c Move vop_nolock from vfs_subr.c to vfs_default.c Move vop_nounlock from vfs_subr.c to vfs_default.c Move vop_noislocked from vfs_subr.c to vfs_default.c Use vop_ebadf instead of *_ebadf. Add vop_defaultop for getpages on master vnode in MFS.	1997-10-26 20:55:39 +00:00
phk	f82436f706	VFS clean up "hekto commit" 1. Add defaults for more VOPs VOP_LOCK vop_nolock VOP_ISLOCKED vop_noislocked VOP_UNLOCK vop_nounlock and remove direct reference in filesystems. 2. Rename the nfsv2 vnop tables to improve sorting order.	1997-10-16 22:01:05 +00:00
phk	373a865574	Another VFS cleanup "kilo commit" 1. Remove VOP_UPDATE, it is (also) an UFS/{FFS,LFS,EXT2FS,MFS} intereface function, and now lives in the ufsmount structure. 2. Remove VOP_SEEK, it was unused. 3. Add mode default vops: VOP_ADVLOCK vop_einval VOP_CLOSE vop_null VOP_FSYNC vop_null VOP_IOCTL vop_enotty VOP_MMAP vop_einval VOP_OPEN vop_null VOP_PATHCONF vop_einval VOP_READLINK vop_einval VOP_REALLOCBLKS vop_eopnotsupp And remove identical functionality from filesystems 4. Add vop_stdpathconf, which returns the canonical stuff. Use it in the filesystems. (XXX: It's probably wrong that specfs and fifofs sets this vop, shouldn't it come from the "host" filesystem, for instance ufs or cd9660 ?) 5. Try to make system wide VOP functions have vop_* names. 6. Initialize the um_* vectors in LFS. (Recompile your LKMS!!!)	1997-10-16 20:32:40 +00:00
phk	d166441755	VFS mega cleanup commit (x/N) 1. Add new file "sys/kern/vfs_default.c" where default actions for VOPs go. Implement proper defaults for ABORTOP, BWRITE, LEASE, POLL, REVOKE and STRATEGY. Various stuff spread over the entire tree belongs here. 2. Change VOP_BLKATOFF to a normal function in cd9660. 3. Kill VOP_BLKATOFF, VOP_TRUNCATE, VOP_VFREE, VOP_VALLOC. These are private interface functions between UFS and the underlying storage manager layer (FFS/LFS/MFS/EXT2FS). The functions now live in struct ufsmount instead. 4. Remove a kludge of VOP_ functions in all filesystems, that did nothing but obscure the simplicity and break the expandability. If a filesystem doesn't implement VOP_FOO, it shouldn't have an entry for it in its vnops table. The system will try to DTRT if it is not implemented. There are still some cruft left, but the bulk of it is done. 5. Fix another VCALL in vfs_cache.c (thanks Bruce!)	1997-10-16 10:50:27 +00:00
phk	92eeb70dc6	Hmm, realign the vnops into two columns.	1997-10-15 10:05:29 +00:00
phk	26130e0b77	Stylistic overhaul of vnops tables. 1. Remove comment stating the blatantly obvious. 2. Align in two columns. 3. Sort all but the default element alphabetically. 4. Remove XXX comments pointing out entries not needed.	1997-10-15 09:22:02 +00:00
dyson	e64b1984f9	Change the M_NAMEI allocations to use the zone allocator. This change plus the previous changes to use the zone allocator decrease the useage of malloc by half. The Zone allocator will be upgradeable to be able to use per CPU-pools, and has more intelligent usage of SPLs. Additionally, it has reasonable stats gathering capabilities, while making most calls inline.	1997-09-21 04:24:27 +00:00
peter	ce7feabb13	Convert select -> poll. Delete 'always succeed' select/poll handlers, replaced with generic call. Flag missing vnode op table entries.	1997-09-14 02:58:12 +00:00
bde	e499dfd06d	Some staticized variables were still declared to be extern.	1997-09-07 05:27:26 +00:00
wollman	4542c1cf5d	Fix all areas of the system (or at least all those in LINT) to avoid storing socket addresses in mbufs. (Socket buffers are the one exception.) A number of kernel APIs needed to get fixed in order to make this happen. Also, fix three protocol families which kept PCBs in mbufs to not malloc them instead. Delete some old compatibility cruft while we're at it, and add some new routines in the in_cksum family.	1997-08-16 19:16:27 +00:00
sef	47bfe6b362	Check permissions for fp regs as well as normal regs.	1997-08-12 05:23:51 +00:00
sef	ea579a477d	Fix procfs security hole -- check permissions on meaningful I/Os (namely, reading/writing of mem and regs). Also have to check for the requesting process being group KMEM -- this is a bit of a hack, but ps et al need it. Reviewed by: davidg	1997-08-12 04:34:30 +00:00
bde	9195bd1ec7	Removed unused #includes.	1997-08-02 14:33:27 +00:00
alex	f3cccc0874	Style fix my previous commit.	1997-06-26 16:12:53 +00:00
alex	793295a94d	Block all write operations to /proc/1/* when securelevel > 0. The additional check in procfs_ctl.c could be backed out, but I'm leaving it in for good measure. Reviewed by: Theo de Raadt <deraadt@OpenBSD.org>	1997-06-21 16:09:49 +00:00
alex	c38f6b729e	Removed bogon from previous commit: doubly included sys/systm.h.	1997-04-27 21:32:21 +00:00
alex	ced78602fe	Prevent debugger attachment to init when securelevel > 0. Noticed by: Brian Buchanan <brian@wasteland.calbbs.com>	1997-04-27 19:02:37 +00:00
dyson	5cd49302ac	Fix both a problem with accessing backing objects, and also release the process map on nonexistant pages. PR: kern/3327 Submitted by: Tor Egge <Tor.Egge@idi.ntnu.no>	1997-04-20 17:12:11 +00:00
dyson	22d3427970	Fix the gdb executable modify problem. Thanks to the detective work by Alan Cox <alc@cs.rice.edu>, and his description of the problem. The bug was primarily in procfs_mem, but the mistake likely happened due to the lack of vm system support for the operation. I added better support for selective marking of page dirty flags so that vm_map_pageable(wiring) will not cause this problem again. The code in procfs_mem is now less bogus (but maybe still a little so.)	1997-04-06 02:29:45 +00:00
bde	117209856b	Don't include <sys/ioctl.h> in the kernel. Stage 1: don't include it when it is not used. In most cases, the reasons for including it went away when the special ioctl headers became self-sufficient.	1997-03-24 11:25:10 +00:00
bde	0309fd38a3	Fixed missing initialisation of vp->v_type for types Pfile and Pmem in procfs_allocvp(). This fixes at least stat() of /proc//mem. stat() of /proc//file already worked. I think procfs_allocvp() isn't actually called for type Pfile.	1997-03-08 16:06:34 +00:00
bde	652d0883c7	Fixed procfs's locking vops. They were missed in the Lite2 merge, partly because the #define's for them were moved to a different file. At least the null VOP_LOCK() no longer works, since vclean() expects VOP_LOCK( ..., LK_DRAIN \| LK_INTERLOCK, ...) to clear the interlock. This probably only matters when simple_lock() is not null, i.e., when there are multiple CPUs or SIMPLELOCK_DEBUG is defined.	1997-02-24 16:44:11 +00:00
peter	94b6d72794	Back out part 1 of the MCFH that changed $Id$ to $FreeBSD$. We are not ready for it yet.	1997-02-22 09:48:43 +00:00
mpp	50f9d7b978	Add function prototypes for most of the new Lite2 functions. Also made a few of the miscfs routines static to be consistent. Some modules simply required some additional #includes to remove -Wall warnings.	1997-02-12 06:52:51 +00:00
dyson	10f666af84	This is the kernel Lite/2 commit. There are some requisite userland changes, so don't expect to be able to run the kernel as-is (very well) without the appropriate Lite/2 userland changes. The system boots and can mount UFS filesystems. Untested: ext2fs, msdosfs, NFS Known problems: Incorrect Berkeley ID strings in some files. Mount_std mounts will not work until the getfsent library routine is changed. Reviewed by: various people Submitted by: Jeffery Hsu <hsu@freebsd.org>	1997-02-10 02:22:35 +00:00
dyson	52f682b582	Change the map entry flags from bitfields to bitmasks. Allows for some code simplification.	1997-01-16 04:16:22 +00:00
jkh	808a36ef65	Make the long-awaited change from $Id$ to $FreeBSD$ This will make a number of things easier in the future, as well as (finally!) avoiding the Id-smashing problem which has plagued developers for so long. Boy, I'm glad we're not using sup anymore. This update would have been insane otherwise.	1997-01-14 07:20:47 +00:00
dyson	89da5daba5	Fix a potential deadlock from the previous commit.	1996-10-30 03:52:57 +00:00
dyson	0cadb84d6a	Fix the /proc/???/map file so that it is possible to read an arbitrarily large process map. Another commit will follow to fix a problem just found during this one... Sorry!!! :-(.	1996-10-30 03:45:00 +00:00
dyson	1b9def83a5	Fix setting breakpoints in shared regions.	1996-10-24 02:47:05 +00:00
bde	51ff523803	Eliminated nested include of <sys/unistd.h> in <sys/file.h> in the kernel. Include it directly in the few places where it is used. Reduced some #includes of <sys/file.h> to #includes of <sys/fcntl.h> or nothing.	1996-09-03 14:25:27 +00:00
bde	070eb30ca6	Fixed the easy cases of const poisoning in the kernel. Cosmetic.	1996-08-31 16:52:44 +00:00
dyson	844ec1f2c0	Modify slightly the output from the map file in /proc. Now the executable bit is shown.	1996-07-27 19:47:04 +00:00
dyson	3695a87050	Under certain circumstances, reading the /proc/*/map file can crash the system. Nonexistant objects were not handled correctly.	1996-07-27 18:28:10 +00:00
dyson	835bc69189	Implement locking for pfs nodes, when at the leaf. Concurrent access to information from a single process causes hangs. Specifically, this fixes problems (hangs) with concurrent ps commands, when the system is under heavy memory load. Reviewed by: davidg	1996-07-02 13:38:10 +00:00
dyson	66550c3ffb	Fix a serious problem, with a window where an object lock is needed, but not there. The extent of the object lock is expanded to be over the range that it is needed. Additionally, clean up the code so that it conforms to better coding style.	1996-07-02 01:40:52 +00:00
dyson	da96599602	Add procfs_type.c to the repository.	1996-06-18 05:22:45 +00:00
dyson	01d0667f09	Clean-up the new VM map procfs code, and also add support for executable format file "etype". It contains a description of the binary type for a process.	1996-06-18 05:16:00 +00:00
dyson	435fbbfe95	This file is the "meat" of the process address space capability. If you would like other things added, just ask!!! It might be pretty easy to add.	1996-06-17 22:53:27 +00:00
dyson	c74503c0b7	Add a feature to procfs to allow display of the process address map with multiple entries as follows: start address, end address, resident pages in range, private pages in range, RW/RO, COW or not, (vnode/device/swap/default).	1996-06-17 22:43:36 +00:00
dg	5026fc1c36	Moved the fsnode MALLOC to before the call to getnewvnode() so that the process won't possibly block before filling in the fsnode pointer (v_data) which might be dereferenced during a sync since the vnode is put on the mnt_vnodelist by getnewvnode. Pointed out by Matt Day <mday@artisoft.com>	1996-06-12 03:37:57 +00:00
dyson	94ca54ddd2	Properly lock the vm space when accessing the memory in a process. This fix could solve some "interesting" problems that could happen during process rundown.	1996-06-11 23:52:27 +00:00
hsu	0c2c8ab30d	For Lite2: proc LIST changes. Reviewed by: davidg & bde	1996-03-11 06:13:09 +00:00
wosch	ec52b9e55d	add ruid and rgid to file 'status'	1996-02-02 05:19:20 +00:00
peter	eb3b1f018c	This time, really make the procfs work when reading stuff from the UPAGES. This is a really ugly bandaid on the problem, but it works well enough for 'ps -u' to start working again. The problem was caused by the user address space shrinking by a little bit and the UPAGES being "cast off" to become a seperate entity rather than being at the top of the process's vmspace. That optimization was part of John's most recent VM speedups. Now, rather than decoding the VM space, it merely ensures the pages are in core and accesses them the same way the ptrace(PT_READ_U..) code does, ie: off the p->p_addr pointer.	1996-01-25 06:05:38 +00:00
peter	7ce7970731	Major fixes for procfs.. Implement a "variable" directory structure. Files that do not make sense for the given process do not "appear" and cannot be opened. For example, "system" processes do not have "file", "regs" or "fpregs", because they do not have a user area. "attempt" to fill in the user area of a given process when it is being accessed via /proc/pid/mem (the user struct is just after VM_MAXUSER_ADDRESS in the process address space.) Dont do IO to the U area while it's swapped, hold it in place if possible. Lock off access to the "ctl" file if it's done a setuid like the other pseudo-files in there.	1996-01-24 18:41:41 +00:00
dyson	8fc8a772af	Eliminated many redundant vm_map_lookup operations for vm_mmap. Speed up for vfs_bio -- addition of a routine bqrelse to greatly diminish overhead for merged cache. Efficiency improvement for vfs_cluster. It used to do alot of redundant calls to cluster_rbuild. Correct the ordering for vrele of .text and release of credentials. Use the selective tlb update for 486/586/P6. Numerous fixes to the size of objects allocated for files. Additionally, fixes in the various pagers. Fixes for proper positioning of vnode_pager_setsize in msdosfs and ext2fs. Fixes in the swap pager for exhausted resources. The pageout code will not as readily thrash. Change the page queue flags (PG_ACTIVE, PG_INACTIVE, PG_FREE, PG_CACHE) into page queue indices (PQ_ACTIVE, PQ_INACTIVE, PQ_FREE, PQ_CACHE), thereby improving efficiency of several routines. Eliminate even more unnecessary vm_page_protect operations. Significantly speed up process forks. Make vm_object_page_clean more efficient, thereby eliminating the pause that happens every 30seconds. Make sequential clustered writes B_ASYNC instead of B_DELWRI even in the case of filesystems mounted async. Fix a panic with busy pages when write clustering is done for non-VMIO buffers.	1996-01-19 04:00:31 +00:00
bde	ca36bb34d1	Fixed 1TB filesize changes. Some pindexes had bogus names and types but worked because vm_pindex_t is indistinuishable from vm_offset_t.	1995-12-17 07:19:58 +00:00
dyson	601ed1a4c0	Changes to support 1Tb filesizes. Pages are now named by an (object,index) pair instead of (object,offset) pair.	1995-12-11 04:58:34 +00:00
dg	c30f46c534	Untangled the vm.h include file spaghetti.	1995-12-07 12:48:31 +00:00
bde	64a1fca498	Added prototypes. Removed some unnecessary #includes.	1995-12-03 14:54:48 +00:00
bde	1f7a6b421f	Moved declarations for static functions to the correct place (not in a header). Removed stupid comments.	1995-11-16 11:39:11 +00:00
bde	536440376f	Fixed the type of procfs_sync(). Trailing args were missing. Fixed the type of procfs_fhtovp(). The args had little resemblance to the correct ones. Added prototypes.	1995-11-16 11:34:51 +00:00
bde	449a11eb88	Introduced a type `vop_t' for vnode operation functions and used it 1138 times (:-() in casts and a few more times in declarations. This change is null for the i386. The type has to be `typedef int vop_t(void *)' and not `typedef int vop_t()' because `gcc -Wstrict-prototypes' warns about the latter. Since vnode op functions are called with args of different (struct pointer) types, neither of these function types is any use for type checking of the arg, so it would be preferable not to use the complete function type, especially since using the complete type requires adding 1138 casts to avoid compiler warnings and another 40+ casts to reverse the function pointer conversions before calling the functions.	1995-11-09 08:17:23 +00:00
phk	06201558ed	Make a lot of private stuff static. Should anybody out there wonder about this vendetta against global variables, it is basically to make it more visible what our interfaces in the kernel really are. I'm almost convinced we should have a #define PUBLIC /* public interface */ and use it in the #includes...	1995-11-07 13:39:31 +00:00
dyson	0dbcd1947b	Removal of unnecessary usage of PG_COPYONWRITE.	1995-10-23 04:28:59 +00:00
mpp	325b318783	Change procfs_lookup to not allow delete/rename operations to prevent panics when a user tries to remove/rename the contents of /proc/###/*. Obtained from: 4.4BSD-lite2	1995-09-02 18:28:48 +00:00
dg	3a38ab9625	Be careful not to dereference NULL credentials pointers when doing the getattr function.	1995-08-11 07:26:26 +00:00
bde	81e1e32f6c	Don't include <sys/tty.h> in drivers that aren't tty drivers or in general files that don't depend on the internals of <sys/tty.h>	1995-07-16 10:13:08 +00:00
dg	c8b0a7332c	NOTE: libkvm, w, ps, 'top', and any other utility which depends on struct proc or any VM system structure will have to be rebuilt!!! Much needed overhaul of the VM system. Included in this first round of changes: 1) Improved pager interfaces: init, alloc, dealloc, getpages, putpages, haspage, and sync operations are supported. The haspage interface now provides information about clusterability. All pager routines now take struct vm_object's instead of "pagers". 2) Improved data structures. In the previous paradigm, there is constant confusion caused by pagers being both a data structure ("allocate a pager") and a collection of routines. The idea of a pager structure has escentially been eliminated. Objects now have types, and this type is used to index the appropriate pager. In most cases, items in the pager structure were duplicated in the object data structure and thus were unnecessary. In the few cases that remained, a un_pager structure union was created in the object to contain these items. 3) Because of the cleanup of #1 & #2, a lot of unnecessary layering can now be removed. For instance, vm_object_enter(), vm_object_lookup(), vm_object_remove(), and the associated object hash list were some of the things that were removed. 4) simple_lock's removed. Discussion with several people reveals that the SMP locking primitives used in the VM system aren't likely the mechanism that we'll be adopting. Even if it were, the locking that was in the code was very inadequate and would have to be mostly re-done anyway. The locking in a uni-processor kernel was a no-op but went a long way toward making the code difficult to read and debug. 5) Places that attempted to kludge-up the fact that we don't have kernel thread support have been fixed to reflect the reality that we are really dealing with processes, not threads. The VM system didn't have complete thread support, so the comments and mis-named routines were just wrong. We now use tsleep and wakeup directly in the lock routines, for instance. 6) Where appropriate, the pagers have been improved, especially in the pager_alloc routines. Most of the pager_allocs have been rewritten and are now faster and easier to maintain. 7) The pagedaemon pageout clustering algorithm has been rewritten and now tries harder to output an even number of pages before and after the requested page. This is sort of the reverse of the ideal pagein algorithm and should provide better overall performance. 8) Unnecessary (incorrect) casts to caddr_t in calls to tsleep & wakeup have been removed. Some other unnecessary casts have also been removed. 9) Some almost useless debugging code removed. 10) Terminology of shadow objects vs. backing objects straightened out. The fact that the vm_object data structure escentially had this backwards really confused things. The use of "shadow" and "backing object" throughout the code is now internally consistent and correct in the Mach terminology. 11) Several minor bug fixes, including one in the vm daemon that caused 0 RSS objects to not get purged as intended. 12) A "default pager" has now been created which cleans up the transition of objects to the "swap" type. The previous checks throughout the code for swp->pg_data != NULL were really ugly. This change also provides the rudiments for future backing of "anonymous" memory by something other than the swap pager (via the vnode pager, for example), and it allows the decision about which of these pagers to use to be made dynamically (although will need some additional decision code to do this, of course). 13) (dyson) MAP_COPY has been deprecated and the corresponding "copy object" code has been removed. MAP_COPY was undocumented and non- standard. It was furthermore broken in several ways which caused its behavior to degrade to MAP_PRIVATE. Binaries that use MAP_COPY will continue to work correctly, but via the slightly different semantics of MAP_PRIVATE. 14) (dyson) Sharing maps have been removed. It's marginal usefulness in a threads design can be worked around in other ways. Both #12 and #13 were done to simplify the code and improve readability and maintain- ability. (As were most all of these changes) TODO: 1) Rewrite most of the vnode pager to use VOP_GETPAGES/PUTPAGES. Doing this will reduce the vnode pager to a mere fraction of its current size. 2) Rewrite vm_fault and the swap/vnode pagers to use the clustering information provided by the new haspage pager interface. This will substantially reduce the overhead by eliminating a large number of VOP_BMAP() calls. The VOP_BMAP() filesystem interface should be improved to provide both a "behind" and "ahead" indication of contiguousness. 3) Implement the extended features of pager_haspage in swap_pager_haspage(). It currently just says 0 pages ahead/behind. 4) Re-implement the swap device (swstrategy) in a more elegant way, perhaps via a much more general mechanism that could also be used for disk striping of regular filesystems. 5) Do something to improve the architecture of vm_object_collapse(). The fact that it makes calls into the swap pager and knows too much about how the swap pager operates really bothers me. It also doesn't allow for collapsing of non-swap pager objects ("unnamed" objects backed by other pagers).	1995-07-13 08:48:48 +00:00
dg	6c5617ad78	Killed the "probably_never" ifdef'd code.	1995-06-28 04:51:06 +00:00
rgrimes	c86f0c7a71	Remove trailing whitespace.	1995-05-30 08:16:23 +00:00
dg	a4ec59246f	Fixed panic that resulted from mmaping files in kernfs and procfs. A regular user could panic the machine with a simple "tail /proc/curproc/mem" command. The problem was twofold: both kernfs and procfs didn't fill in the mnt_stat statfs struct (which would later lead to an integer divide fault in the vnode pager), and kernfs bogusly paniced if a bmap was attempted. Reviewed by: John Dyson	1995-05-25 01:35:24 +00:00
rgrimes	0e1db07cf9	Fix -Wformat warnings from LINT kernel.	1995-05-11 19:26:53 +00:00
dg	b7ad7fee08	For P_SUGID processes, we must also change ownership of the mem file to root so that group kmem can still get to it. SIGH	1995-04-15 03:20:31 +00:00
dg	f3c66b4569	Retain group kmem readability for P_SUGID processes.	1995-04-15 02:50:13 +00:00
dg	719b4da2ee	Made /proc/n/mem file group kmem and group readable. Needed to fix ps so that it doesn't need to be setuid root.	1995-04-15 02:30:17 +00:00
wollman	8e37e9f479	Add four more filesystem flags: VFCF_NETWORK (this FS goes over the net) VFCF_READONLY (read-write mounts do not make any sense) VFCF_SYNTHETIC (data in this FS is not real) VFCF_LOOPBACK (this FS aliases something else) cd9660 is readonly; nullfs, umapfs, and union are loopback; NFS is netowkr; procfs, kernfs, and fdesc are synthetic.	1995-03-16 20:23:48 +00:00
bde	289f11acb4	Add and move declarations to fix all of the warnings from `gcc -Wimplicit' (except in netccitt, netiso and netns) and most of the warnings from `gcc -Wnested-externs'. Fix all the bugs found. There were no serious ones.	1995-03-16 18:17:34 +00:00
dg	127fed2efa	Make sure process isn't swapped when messing with it. Added missing newline to log() call.	1995-02-20 15:53:33 +00:00
dg	8bb9dc4a89	Fixed bmap run-length brokeness. Use bmap run-length extension when doing clustered paging. Submitted by: John Dyson	1995-02-03 06:46:28 +00:00
dg	6272368331	Initialize map start hint to vm_map_find()...not doing so will cause it to fail if the random thing on the stack happens to be too large. Submitted by: David Jones <dej@qpoint.torfree.net>	1995-01-05 03:59:38 +00:00
ache	15cd9ece1a	Fix problem when attached process detached Submitted by: Gary Jennejohn	1994-12-31 12:26:50 +00:00
dg	3826d4d70e	Fixed bug I just introduced that would have allowed a user to clobber his kernel stack.	1994-10-18 04:40:41 +00:00
dg	57a3f6cf30	Allow upages to be paged in/accessed. Submitted by: John Dyson	1994-10-18 04:26:53 +00:00
phk	c0f48e68d2	Cosmetics. reduce the noise from gcc -Wall.	1994-10-10 07:55:48 +00:00
dg	2add6128e2	Use tsleep() rather than sleep so that 'ps' is more informative about the wait.	1994-10-06 21:07:04 +00:00
dg	c60a583f86	1) Added "." and ".." entries. 2) Fixed directory size to return something reasonable. 3) Disabled "file" until the code is completed. 4) Corrected directory link counts.	1994-09-24 17:01:05 +00:00
wollman	c289ac89a1	Implemented loadable VFS modules, and made most existing filesystems loadable. (NFS is a notable exception.)	1994-09-21 03:47:43 +00:00
bde	9400f03fed	Supply prototypes for some functions that were implicitly declared and fix the resulting warnings.	1994-09-15 19:47:47 +00:00
wollman	f9fc827448	Fix up some sloppy coding practices: - Delete redundant declarations. - Add -Wredundant-declarations to Makefile.i386 so they don't come back. - Delete sloppy COMMON-style declarations of uninitialized data in header files. - Add a few prototypes. - Clean up warnings resulting from the above. NB: ioconf.c will still generate a redundant-declaration warning, which is unavoidable unless somebody volunteers to make `config' smarter.	1994-08-18 22:36:09 +00:00
dg	8d205697aa	Added $Id$	1994-08-02 07:55:43 +00:00
rgrimes	2469c867a1	The big 4.4BSD Lite to FreeBSD 2.0.0 (Development) patch. Reviewed by: Rodney W. Grimes Submitted by: John Dyson and David Greenman	1994-05-25 09:21:21 +00:00
rgrimes	27464aaa8e	BSD 4.4 Lite Kernel Sources	1994-05-24 10:09:53 +00:00

... 3 4 5 6 7

349 Commits