freebsd-dev

Author	SHA1	Message	Date
Dag-Erling Smørgrav	8d5f9fac24	In procfs_readdir(), when the directory being read was a process directory, the target process was being held locked during the uiomove() call. If the process calling readdir() was the same as the target process (for instance 'ls /proc/curproc/'), and uiomove() caused a page fault, the result would be a proc lock recursion. I have no idea how long this has been broken - possibly ever since pfind() was changed to lock the process it returns. Also replace the one and only call to procfs_findtextvp() with a direct test of td->td_proc->p_textvp.	2001-10-07 19:37:13 +00:00
Dag-Erling Smørgrav	b84ce33438	Add a PFS_DISABLED flag; pfs_visible() automatically returns 0 if it is set on the node in question. Also add two API functions for setting and clearing this flag; setting it also reclaims all vnodes associated with the node.	2001-10-02 22:22:42 +00:00
Dag-Erling Smørgrav	b7004390b3	Only print "XXX (un)registered" message if bootverbose.	2001-10-02 22:21:07 +00:00
Dag-Erling Smørgrav	24efa9d3fa	[the previous commit to pseudofs_vncache.c got the wrong log message] YA pseudofs megacommit, part 2: - Merge the pfs_vnode and pfs_vdata structures, and make the vnode cache a doubly-linked list. This eliminates the need to walk the list in pfs_vncache_free(). - Add an exit callout which revokes vnodes associated with the process that just exited. Since it needs to lock the cache when it does this, pfs_vncache_mutex needs MTX_RECURSE.	2001-10-01 04:26:33 +00:00
Dag-Erling Smørgrav	198bc14b1d	YA pseudofs megacommit, part 1: - Add a third callback to the pfs_node structure. This one simply returns non-zero if the specified requesting process is allowed to access the specified node for the specified target process. This is used in addition to the usual permission checks, e.g. when certain files don't make sense for certain (system) processes. - Make sure that pfs_lookup() and pfs_readdir() don't yap about files which aren't pfs_visible(). Also check pfs_visible() before performing reads and writes, to prevent the kind of races reported in SA-00:77 and SA-01:55 (fork a child, open /proc/child/ctl, have that child fork a setuid binary, and assume control of it). - Add some more trace points.	2001-10-01 04:22:20 +00:00
Dag-Erling Smørgrav	7d8f809f00	pseudofs.h: - Rearrange the flag constants a little to simplify specifying and testing for readability and writeability. pseudofs_vnops.c: - Track the aforementioned change. - Add checks to pfs_open() to prevent opening read-only files for writing or vice versa (pfs_{read,write} would block the actual reads and writes, but it's still a bug to allow the open() to succeed). Also, return EOPNOTSUPP if the caller attempts to lock the file. - Add more trace points.	2001-09-30 19:41:29 +00:00
Poul-Henning Kamp	40739c02ae	The behaviour of whiteout'ing symlinks was too confusing; instead, remove them when asked to.	2001-09-30 08:43:33 +00:00
Dag-Erling Smørgrav	80a3cef87d	Pseudofs take 2: - Remove hardcoded uid, gid, mode from struct pfs_node; make pfs_getattr() smart enough to get it right most of the time, and allow for callbacks to handle the remaining cases. Rework the definition macros to match. - Add lots of (conditional) debugging output. - Fix a long-standing bug inherited from procfs: don't pretend to be a read-only file system. Instead, return EOPNOTSUPP for operations we truly can't support and allow others to fail silently. In particular, pfs_lookup() now treats CREATE as LOOKUP. This may need more work. - In pfs_lookup(), if the parent node is process-dependent, check that the process in question still exists. - Implement pfs_open() - its only current function is to check that the process opening the file can see the process it belongs to. - Finish adding support for writeable nodes. - Bump module version number. - Introduce lots of new bugs.	2001-09-29 00:49:29 +00:00
Dag-Erling Smørgrav	b4056ade84	The previous commit introduced some references to "curproc" which should have been references to "curthread". Correct this.	2001-09-28 12:36:54 +00:00
Robert Watson	f86cf763ef	o Modify generic specfs device open access control checks to use securelevel_ge() instead of direct securelevel variable checks. Obtained from: TrustedBSD Project	2001-09-26 20:18:26 +00:00
Bill Fenner	bd5b9e17b0	Fix (typo? pasteo?): panic("ffs_mountroot..." -> panic("ntfs_mountroot...")	2001-09-26 00:36:33 +00:00
Dag-Erling Smørgrav	8712e867e1	Clean up my source tree to avoid getting hit too badly by the next KSE or whatever mega-commit. This goes some way towards adding support for writeable files (needed by procfs).	2001-09-25 13:25:30 +00:00
Mike Barcroft	3273a63ed9	A process name may contain whitespace and unprintable characters, so convert those characters to octal notation. Also convert backslashes to octal notation to avoid confusion. Reviewed by: des MFC after: 1 week	2001-09-25 04:42:40 +00:00
John Baldwin	bce94723a4	Use the passed in thread to selrecord() instead of curthread.	2001-09-21 22:26:51 +00:00
Robert Watson	3f9e888ebe	o Remove redundant securelevel/pid1 check in procfs_rw() -- this protection is enforced at the individual method layer using p_candebug(). Obtained from: TrustedBSD Project	2001-09-18 19:53:10 +00:00
Julian Elischer	7405406837	fix typo pointed out by: jhb	2001-09-13 21:59:40 +00:00
John Baldwin	f1cbf4f92c	Restore these files to being portable: - Use some simple #define's at the top of the files for proc -> thread changes instead of having lots of needless #ifdef's in the code. - Don't try to use struct thread in !FreeBSD code. - Don't use a few struct lwp's in some of the NetBSD code since it isn't in their HEAD. The new diff relative to before KSE is now significantly smaller and easier to maintain.	2001-09-12 23:39:36 +00:00
Julian Elischer	b40ce4165d	KSE Milestone 2. Note: ALL MODULES MUST BE RECOMPILED. Make the kernel aware that there are smaller units of scheduling than the process (but only allow one thread per process at this time). This is functionally equivalent to the previous -current except that there is a thread associated with each process. Sorry john! (your next MFC will be a doosie!) Reviewed by: peter@freebsd.org, dillon@freebsd.org X-MFC after: ha ha ha ha	2001-09-12 08:38:13 +00:00
Kris Kennaway	bf61e26696	Fix some signed/unsigned integer confusion, and add bounds checking of arguments to some functions. Obtained from: NetBSD Reviewed by: peter MFC after: 2 weeks	2001-09-10 11:28:07 +00:00
Semen Ustimenko	cc6b9b02be	Stole unicode translation table from mount_msdos. Add kernel code to support this translation. MFC after: 2 weeks	2001-09-08 23:03:52 +00:00
Semen Ustimenko	0895d6c389	Fix opening particular file's attributes (as described in man page). This is useful for debug purposes. MFC after: 2 weeks	2001-09-08 22:59:12 +00:00
Semen Ustimenko	ebcc9d9c8c	Reference devvp on ntnode creation and dereference on removal. Previous code led to page faults because i_devvp went zero after VOP_RECLAIM, but ntnode was reused (not reclaimed). MFC after: 2 weeks	2001-09-08 22:57:03 +00:00
Semen Ustimenko	831aac011e	Fix errors and warnings when compiling with NTFS_DEBUG > 1 MFC after: 2 weeks	2001-09-08 22:53:27 +00:00
Andrey A. Chernov	159247784c	smbfs_advlock: simplify overflow checks (copy from kern_lockf.c) minor formatting issues to minimize differences	2001-08-29 18:59:04 +00:00
Andrey A. Chernov	fcbe9614ef	Cosmetique & style fixes from bde	2001-08-26 10:28:58 +00:00
Andrey A. Chernov	5215e1ea12	Copy from kern_lockf.c: remove extra check	2001-08-24 10:22:16 +00:00
Andrey A. Chernov	2a31175b6e	Copy yet one check for SEEK_END overflow	2001-08-23 17:12:42 +00:00
Andrey A. Chernov	ea4313e351	Copy my newly introduced l_len<0 'oops' fix from kern_lockf.c	2001-08-23 16:06:14 +00:00
Andrey A. Chernov	e3e2c03de3	Copy POSIX l_len<0 handling from kern_lockf.c	2001-08-23 15:44:24 +00:00
Andrey A. Chernov	bbf6984cec	Cosmetique: correct English in comments non-cosmetique: add missing break; - original code was broken here	2001-08-23 14:45:31 +00:00
Andrey A. Chernov	fb2f187058	Move <machine/> after <sys/> Pointed by: bde	2001-08-23 13:27:58 +00:00
Andrey A. Chernov	4779017439	adv. lock: copy EOVERFLOW handling code from main variant fix type of 'size' arg	2001-08-23 08:54:22 +00:00
Boris Popov	798bb23e93	Use proper endian conversion. Obtained from: Mac OS X MFC after: 1 week	2001-08-21 08:27:47 +00:00
Boris Popov	3419dc99dd	Return proper length of _PC_NAME_MAX value if long names support is enabled. Obtained from: Mac OS X MFC after: 1 week	2001-08-21 08:25:09 +00:00
Poul-Henning Kamp	12d1aec26f	linux ls fails on DEVFS /dev because linux_getdents fails because linux_getdents uses VOP_READDIR( ..., &ncookies, &cookies ) instead of VOP_READDIR( ..., NULL, NULL ) because it seems to need the offsets for linux_dirent and sizeof(dirent) != sizeof(linux_dirent)... PR: 29467 Submitted by: Michael Reifenberger <root@nihil.plaut.de> Reviewed by: phk	2001-08-14 06:42:32 +00:00
Robert Watson	7d69e57088	Remove dangling prototype for the now defunct procfs_kmemaccess() call. Obtained from: TrustedBSD Project	2001-08-03 17:51:05 +00:00
Robert Watson	436b89d434	Collapse a Pmem case in with the other debugging files case for procfs, as there are now "unusual" protection properties to Pmem that differ from the other files. While I'm at it, introduce proc locking for the other files, which was previously present only in the Pmem case. Obtained from: TrustedBSD Project	2001-08-03 17:20:34 +00:00
Robert Watson	57de737e82	Remove read permission for group on the /proc/*/mem file, since kmem no longer requires access. Reviewed by: tmm Obtained from: TrustedBSD Project	2001-08-03 17:15:40 +00:00
Robert Watson	f2e6be5865	Prior to support for almost all ps activity via sysctl, ps used procfs, and so special-casing was introduced to provide extra procfs privilege to the kmem group. With the advent of non-setgid kmem ps, this code is no longer required, and in fact, is potentially harmful as it allocates privilege to a gid that is increasingly less meaningful. Knowledge of specific gid's in kernel is also generally bad precedent, as the kernel security policy doesn't distinguish gid's specifically, only uid 0. This commit removes reference to kmem in procfs, both in terms of access control decisions, and the applying of gid kmem to the /proc/*/mem file, simplifying the associated code considerably. Processes are still permitted to access the mem file based on the debugging policy, so ps -e still works fine for normal processes and use. Reviewed by: tmm Obtained from: TrustedBSD Project	2001-08-03 17:13:23 +00:00
Assar Westerlund	ac01ecd9fb	remove support for creating files and directories from msdosfs_mknod	2001-07-19 19:15:42 +00:00
John Baldwin	7063595315	Grab the process lock around psignal(). Noticed by: tanimura	2001-07-18 19:17:36 +00:00
Robert Watson	a0f75161f9	o Replace calls to p_can(..., P_CAN_xxx) with calls to p_canxxx(). The p_can(...) construct was a premature (and, it turns out, awkward) abstraction. The individual calls to p_canxxx() better reflect differences between the inter-process authorization checks, such as differing checks based on the type of signal. This has a side effect of improving code readability. o Replace direct credential authorization checks in ktrace() with invocation of p_candebug(), while maintaining the special case check of KTR_ROOT. This allows ktrace() to "play more nicely" with new mandatory access control schemes, as well as making its authorization checks consistent with other "debugging class" checks. o Eliminate "privused" construct for p_can*() calls which allowed the caller to determine if privilege was required for successful evaluation of the access control check. This primitive is currently unused, and as such, serves only to complicate the API. Approved by: ({procfs,linprocfs} changes) des Obtained from: TrustedBSD Project	2001-07-05 17:10:46 +00:00
John Baldwin	4a370459cc	- Update the vmmeter statistics for vnode pageins and pageouts in getpages/putpages. - Use vm_page_undirty() instead of messing with pages' dirty fields directly.	2001-07-04 19:55:01 +00:00
Matthew Dillon	0cddd8f023	With Alfred's permission, remove vm_mtx in favor of a fine-grained approach (this commit is just the first stage). Also add various GIANT_ macros to formalize the removal of Giant, making it easy to test in a more piecemeal fashion. These macros will allow us to test fine-grained locks to a degree before removing Giant, and also after, and to remove Giant in a piecemeal fashion via sysctl's on those subsystems which the authors believe can operate without Giant.	2001-07-04 16:20:28 +00:00
John Baldwin	797c3dba25	Fix a mntvnode and vnode interlock reversal.	2001-06-28 03:52:04 +00:00
John Baldwin	805d90f763	Protect the mnt_vnode list with the mntvnode lock.	2001-06-28 03:50:17 +00:00
Dag-Erling Smørgrav	56fe60b131	#if 0 out pfs_null() to silence the warning about it not being referenced.	2001-06-15 12:30:46 +00:00
Peter Wemm	70439d2750	Fix warning: 568: warning: `portal_badop' defined but not used	2001-06-15 00:38:03 +00:00
Peter Wemm	f14f48a226	Fix warning (exposed NetBSD code): 94: warning: `ntfs_bmap' declared `static' but never defined	2001-06-15 00:32:07 +00:00
Peter Wemm	e75a45be56	Fix warnings (mostly harmless, due to struct bio being embedded in buf): 738: warning: passing arg 1 of `biodone' from incompatible pointer type 745: warning: passing arg 1 of `biodone' from incompatible pointer type	2001-06-15 00:30:27 +00:00
Peter Wemm	42c187b77e	Fix warning: 552: warning: `fdesc_badop' defined but not used	2001-06-15 00:27:21 +00:00
Peter Wemm	13f961dbfd	Warning fix: coda_fbsd.c:113: warning: unused variable `ret'	2001-06-15 00:02:27 +00:00
Boris Popov	4587152a71	Coda does not call vop_defaultop(), so add necessary calls for VM objects. Submitted by: Greg Troxel <gdt@ir.bbn.com> MFC after: 2 days	2001-06-14 09:28:30 +00:00
Matt Jacob	aa56d911a6	the last argument to copyinstr is of type size_t, not u_int	2001-06-13 18:58:11 +00:00
Peter Wemm	f41325db5f	With this commit, I hereby pronounce gensetdefs past its use-by date. Replace the a.out emulation of 'struct linker_set' with something a little more flexible. <sys/linker_set.h> now provides macros for accessing elements and completely hides the implementation. The linker_set.h macros have been on the back burner in various forms since 1998 and has ideas and code from Mike Smith (SET_FOREACH()), John Polstra (ELF clue) and myself (cleaned up API and the conversion of the rest of the kernel to use it). The macros declare a strongly typed set. They return elements with the type that you declare the set with, rather than a generic void *. For ELF, we use the magic ld symbols (__start_<setname> and __stop_<setname>). Thanks to Richard Henderson <rth@redhat.com> for the trick about how to force ld to provide them for kld's. For a.out, we use the old linker_set struct. NOTE: the item lists are no longer null terminated. This is why the code impact is high in certain areas. The runtime linker has a new method to find the linker set boundaries depending on which backend format is in use. linker sets are still module/kld unfriendly and should never be used for anything that may be modular one day. Reviewed by: eivind	2001-06-13 10:58:39 +00:00
Dag-Erling Smørgrav	21ceb6efa2	For some reason, though the module builds just fine without <sys/lock.h>, LINT fails to build without it.	2001-06-11 15:04:48 +00:00
Dag-Erling Smørgrav	b27acc8dd1	Bail out if the fill function failed.	2001-06-10 21:39:01 +00:00
Dag-Erling Smørgrav	7005ce8a5f	Whoops, some of my test code snuck in here.	2001-06-10 21:37:11 +00:00
Dag-Erling Smørgrav	497806b394	Argh. Fix braino in previous commit.	2001-06-10 18:54:04 +00:00
Dag-Erling Smørgrav	1828efef8d	Add a 'flags' argument to the PFS_PROCDIR macro.	2001-06-10 18:52:55 +00:00
Dag-Erling Smørgrav	649ad985c9	Add support for process-dependent directories. This means that save for the lack of a man page, pseudofs is mostly complete now.	2001-06-10 18:39:21 +00:00
Dag-Erling Smørgrav	1e4ebf4e8d	Blah, not my day. This file needs <sys/mutex.h> now.	2001-06-10 10:42:55 +00:00
Dag-Erling Smørgrav	ec09e7f25c	Remember to unlock the process pfind() returns.	2001-06-10 10:42:01 +00:00
Dag-Erling Smørgrav	49fa664f4e	Add missing #include of <sys/mutex.h>.	2001-06-10 10:36:16 +00:00
Dag-Erling Smørgrav	31f73b3fcd	Catch up with the change in sbuf_new's prototype.	2001-06-10 10:34:21 +00:00
Jonathan Lemon	2247e23a97	The kq write filter was hooked up to the wrong socket, and thus was not behaving correctly. Fix by attaching to the correct socket. Also call so{rw}wakeup in addition to the fifo wakeup, so that any kqfilters attached to the socket buffer get poked.	2001-06-06 17:38:36 +00:00
Seigo Tanimura	326f419bb9	Lock VM Giant prior to locking a vm map. Spotted by: Daniel Rock <D.Rock@t-online.de> Tested by: David Wolfskill <david@catwhisker.org>, Sean Eric Fagan <sef@kithrup.com>	2001-06-06 04:13:11 +00:00
Shafeeq Sinnamohideen	ba8aae1baf	Now works again and as a module and with devfs. Used the bpf & tun drivers as examples as to what is necessary for devfs.	2001-06-05 19:45:16 +00:00
Brian Somers	51716196a4	Support /dev/tun cloning. Ansify if_tun.c while I'm there. Only tun0 -> tun32767 may now be opened as struct ifnet's if_unit is a short. It's now possible to open /dev/tun and get a handle back for an available tun device (use devname to find out what you got). The implementation uses rman by popular demand (and against my judgement) to track opened devices and uses the new dev_depends() to ensure that all make_dev()d devices go away before the module is unloaded. Reviewed by: phk	2001-06-01 15:51:10 +00:00
Ruslan Ermilov	4ccd754686	- VFS_SET(msdos) -> VFS_SET(msdosfs) - msdos.ko -> msdosfs.ko - mount_msdos(8) -> mount_msdosfs(8) - "msdos" -> "msdosfs" compatibility glue in mount(8)	2001-06-01 10:57:26 +00:00
Poul-Henning Kamp	e33457d7eb	Don't copy the trailing zero in readlink, it confuses namei(). PR: 27656	2001-05-26 20:07:57 +00:00
Ruslan Ermilov	8a8402d3a5	- sys/n[tw]fs moved to sys/fs/n[tw]fs - /usr/include/n[tw]fs moved to /usr/include/fs/n[tw]fs	2001-05-26 11:57:45 +00:00
Poul-Henning Kamp	3344c5a17e	Create a general facility for making dev_t's depend on another dev_t. The dev_depends(dev_t, dev_t) function is for tying them to each other. When destroy_dev() is called on a dev_t, all dev_t's depending on it will also be destroyed (depth first order). Rewrite the make_dev_alias() to use this dependency facility. kern/subr_disk.c: Make the disk mini-layer use dependencies to make sure all relevant dev_t's are removed when the disk disappears. Make the disk mini-layer precreate some magic sub devices which the disk/slice/label code expects to be there. kern/subr_disklabel.c: Remove some now unneeded variables. kern/subr_diskmbr.c: Remove some ancient, commented out code. kern/subr_diskslice.c: Minor cleanup. Use name from dev_t instead of dsname()	2001-05-26 08:27:58 +00:00
Robert Watson	b1fc0ec1a7	o Merge contents of struct pcred into struct ucred. Specifically, add the real uid, saved uid, real gid, and saved gid to ucred, as well as the pcred->pc_uidinfo, which was associated with the real uid, only rename it to cr_ruidinfo so as not to conflict with cr_uidinfo, which corresponds to the effective uid. o Remove p_cred from struct proc; add p_ucred to struct proc, replacing the original macro that pointed p->p_ucred to p->p_cred->pc_ucred. o Universally update code so that it makes use of ucred instead of pcred, p->p_ucred instead of p->p_pcred, cr_ruidinfo instead of p_uidinfo, cr_{r,sv}{u,g}id instead of p_*, etc. o Remove pcred0 and its initialization from init_main.c; initialize cr_ruidinfo there. o Restructure many credential modification chunks to always crdup while we figure out locking and optimizations; generally speaking, this means moving to a structure like this: newcred = crdup(oldcred); ... p->p_ucred = newcred; crfree(oldcred); It's not race-free, but better than nothing. There are also races in sys_process.c, all inter-process authorization, fork, exec, and exit. o Remove sigio->sio_ruid since sigio->sio_ucred now contains the ruid; remove comments indicating that the old arrangement was a problem. o Restructure exec1() a little to use newcred/oldcred arrangement, and use improved uid management primitives. o Clean up exit1() so as to do less work in credential cleanup due to pcred removal. o Clean up fork1() so as to do less work in credential cleanup and allocation. o Clean up ktrcanset() to take into account changes, and move to using suser_xxx() instead of performing a direct uid==0 comparison. o Improve commenting in various kern_prot.c credential modification calls to better document current behavior. In a couple of places, current behavior is a little questionable and we need to check POSIX.1 to make sure it's "right". More commenting work still remains to be done. o Update credential management calls, such as crfree(), to take into account new ruidinfo reference. o Modify or add the following uid and gid helper routines: change_euid() change_egid() change_ruid() change_rgid() change_svuid() change_svgid() In each case, the call now acts on a credential not a process, and as such no longer requires more complicated process locking/etc. They now assume the caller will do any necessary allocation of an exclusive credential reference. Each is commented to document its reference requirements. o CANSIGIO() is simplified to require only credentials, not processes and pcreds. o Remove lots of (p_pcred==NULL) checks. o Add an XXX to authorization code in nfs_lock.c, since it's questionable, and needs to be considered carefully. o Simplify posix4 authorization code to require only credentials, not processes and pcreds. Note that this authorization, as well as CANSIGIO(), needs to be updated to use the p_cansignal() and p_cansched() centralized authorization routines, as they currently do not take into account some desirable restrictions that are handled by the centralized routines, as well as being inconsistent with other similar authorization instances. o Update libkvm to take these changes into account. Obtained from: TrustedBSD Project Reviewed by: green, bde, jhb, freebsd-arch, freebsd-audit	2001-05-25 16:59:11 +00:00
Ruslan Ermilov	1166fb516b	- sys/msdosfs moved to sys/fs/msdosfs - msdos.ko renamed to msdosfs.ko - /usr/include/msdosfs moved to /usr/include/fs/msdosfs	2001-05-25 08:14:14 +00:00
Ruslan Ermilov	c7b23e0fb4	Actually rename FDESC, PORTAL, UMAP and UNION file systems. OK'ed by: bp	2001-05-24 15:20:11 +00:00
Ruslan Ermilov	c99d12581a	mount_umap(8) -> mount_umapfs(8).	2001-05-24 13:20:41 +00:00
Ruslan Ermilov	57a523ae6b	mount_null(8) -> mount_nullfs(8).	2001-05-24 13:17:47 +00:00
John Baldwin	c7f52620e0	Don't acquire/release Giant around some of the places that need it in spec_getpages(). Instead, assert that Giant is held by the caller.	2001-05-23 22:20:29 +00:00
Poul-Henning Kamp	5a9300c451	Change the way deletes are managed in DEVFS. This fixes a number of warnings relating to removed cloned devices. It also makes it possible to recreate deleted devices with mknod(2). The major/minor arguments are ignored.	2001-05-23 17:48:20 +00:00
Ruslan Ermilov	99d300a1ec	- FDESC, FIFO, NULL, PORTAL, PROC, UMAP and UNION file systems were repo-copied from sys/miscfs to sys/fs. - Renamed the following file systems and their modules: fdesc -> fdescfs, portal -> portalfs, union -> unionfs. - Renamed corresponding kernel options: FDESC -> FDESCFS, PORTAL -> PORTALFS, UNION -> UNIONFS. - Install header files for the above file systems. - Removed bogus -I${.CURDIR}/../../sys CFLAGS from userland Makefiles.	2001-05-23 09:42:29 +00:00
John Baldwin	2178ff8b9f	Sort includes from previous commit.	2001-05-21 23:19:50 +00:00
Alfred Perlstein	2395531439	Introduce a global lock for the vm subsystem (vm_mtx). vm_mtx does not recurse and is required for most low level vm operations. faults can not be taken without holding Giant. Memory subsystems can now call the base page allocators safely. Almost all atomic ops were removed as they are covered under the vm mutex. Alpha and ia64 now need to catch up to i386's trap handlers. FFS and NFS have been tested, other filesystems will need minor changes (grabbing the vm lock when twiddling page properties). Reviewed (partially) by: jake, jhb	2001-05-19 01:28:09 +00:00
Boris Popov	10fa1684ed	Currently there is no way to tell if a write operation invoked via vn_start_write() on the given vnode will be successful. VOP_LEASE() may help to solve this problem, but its return value is ignored nearly everywhere. For now just assume that the missing upper layer on write means insufficient access rights (which is correct for most cases).	2001-05-18 07:43:13 +00:00
Boris Popov	f3d1ec67b2	VOP getwritemount() can be invoked on vnodes with VFREE flag set (used in snapshots code). At this point upper vp may not exist.	2001-05-17 04:58:25 +00:00
Boris Popov	3413421bda	Use vop_*vobject() VOPs to get reference to VM object from upper or lower fs.	2001-05-17 04:52:57 +00:00
Boris Popov	9dbd7336ee	Do not leave an extra reference on vnode. PR: kern/27250 Submitted by: "Vladimir B. Grebenschikov" <vova@express.ru> MFC after: 2 weeks	2001-05-17 04:40:01 +00:00
Ian Dowse	0864ef1e8a	Change the second argument of vflush() to an integer that specifies the number of references on the filesystem root vnode to be both expected and released. Many filesystems hold an extra reference on the filesystem root vnode, which must be accounted for when determining if the filesystem is busy and then released if it isn't busy. The old `skipvp' approach required individual filesystem xxx_unmount functions to re-implement much of vflush()'s logic to deal with the root vnode. All 9 filesystems that hold an extra reference on the root vnode got the logic wrong in the case of forced unmounts, so `umount -f' would always fail if there were any extra root vnode references. Fix this issue centrally in vflush(), now that we can. This commit also fixes a vnode reference leak in devfs, which could result in idle devfs filesystems that refuse to unmount. Reviewed by: phk, bp	2001-05-16 18:04:37 +00:00
Poul-Henning Kamp	f73cbde4cf	After a successful poll of the cloning functions, match on the returned dev_t rather than the original name. This allows cloning from one name to another which is useful for /dev/tty and later for the pty's.	2001-05-14 08:20:46 +00:00
Poul-Henning Kamp	ab9f3b292e	Convert DEVFS from an "opt-in" to an "opt-out" option. If for some reason DEVFS is undesired, the "NODEVFS" option is needed now. Pending any significant issues, DEVFS will be made mandatory in -current on July 1st so that we can start reaping the full benefits of having it.	2001-05-13 20:52:40 +00:00
John Baldwin	b012b205a7	GC prototype for procfs_bmap() missed during a previous commit.	2001-05-11 23:37:37 +00:00
Poul-Henning Kamp	6bd2ea83ef	Remove unneeded devfs_badop() Noticed by: rwatson	2001-05-06 17:40:34 +00:00
Boris Popov	d759827bd9	Convert vnode_pager_freepage() to vm_free_page(). Forgotten by: alfred	2001-05-03 09:00:54 +00:00
Poul-Henning Kamp	a62615e59b	Implement vop_std{get|put}pages() and add them to the default vop[]. Un-copy&paste all the VOP_{GET|PUT}PAGES() functions which do nothing but the default.	2001-05-01 08:34:45 +00:00
Mark Murray	fb919e4d5a	Undo part of the tangle of having sys/lock.h and sys/mutex.h included in other "system" header files. Also help the deprecation of lockmgr.h by making it a sub-include of sys/lock.h and removing sys/lockmgr.h from kernel .c files. Sort sys/*.h includes where possible in affected files. OK'ed by: bde (with reservations)	2001-05-01 08:13:21 +00:00
Poul-Henning Kamp	e9d19a117e	Uncut&paste some bogus use of VOP_BMAP in cd9660::VOP_STRATEGY. XXX mark some stuff which looks like further cut&paste junk.	2001-04-30 21:23:05 +00:00
Poul-Henning Kamp	c1acc01996	Uncut&paste some bogus use of VOP_BMAP in hpfs::VOP_STRATEGY. At the same time, eliminate uninitialized use of a vnode pointer. Interesting GCC didn't spot this.	2001-04-30 21:21:53 +00:00
Bruce Evans	438abdb9c6	Backed out previous commit. It caused massive filesystem corruption, not to mention a compile-time warning about the critical function becoming unused, by replacing spec_bmap() with vop_stdbmap(). ntfs seems to have the same bug. The factor for converting specfs block numbers to physical block numbers is 1, but vop_stdbmap() uses the bogus factor btodb(ap->a_vp->v_mount->mnt_stat.f_iosize), which is 16 for ffs with the default block size of 8K. This factor is bogus even for vop_stdbmap() -- the correct factor is related to the filesystem blocksize, which is not necessarily the same as the optimal i/o size. vop_stdbmap() was apparently cloned from nfs where these sizes happen to be the same. There may also be a problem with a_vp->v_mount being null. spec_bmap() still checks for this, but I think the checks in specfs are dead code which used to support block devices.	2001-04-30 14:35:35 +00:00
Poul-Henning Kamp	b7ebffbc08	Add a vop_stdbmap(), and make it part of the default vop vector. Make 7 filesystems which don't really know about VOP_BMAP rely on the default vector, rather than more or less complete local vop_nopbmap() implementations.	2001-04-29 11:48:41 +00:00
Greg Lehey	60fb0ce365	Revert consequences of changes to mount.h, part 2. Requested by: bde	2001-04-29 02:45:39 +00:00
Poul-Henning Kamp	a13234bb35	Move the netexport structure from the fs-specific mount structure to struct mount. This makes the "struct netexport *" parameter to the vfs_export and vfs_checkexport interface unneeded. Consequently, all non-stacking filesystems can use vfs_stdcheckexp(). At the same time, make it a pointer to a struct netexport in struct mount, so that we can remove the bogus AF_MAX and #include <net/radix.h> from <sys/mount.h>	2001-04-25 07:07:52 +00:00
John Baldwin	33a9ed9d0e	Change the pfind() and zpfind() functions to lock the process that they find before releasing the allproc lock and returning. Reviewed by: -smp, dfr, jake	2001-04-24 00:51:53 +00:00
Matt Jacob	2b4169610b	fix it so it compiles again	2001-04-23 18:51:54 +00:00
Matt Jacob	3be6e0c249	add this ridiculous include foo so it will compile again	2001-04-23 18:14:41 +00:00
Greg Lehey	d98dc34f52	Correct #includes to work with fixed sys/mount.h.	2001-04-23 09:05:15 +00:00
Greg Lehey	97d5f7bb3b	Correct #includes to work with fixed sys/mount.h.	2001-04-23 08:28:44 +00:00
Alfred Perlstein	d8d5fa8805	vnode_pager_freepage() is really vm_page_free() in disguise, nuke vnode_pager_freepage() and replace all calls to it with vm_page_free()	2001-04-19 06:18:23 +00:00
Poul-Henning Kamp	f84e29a06c	This patch removes the VOP_BWRITE() vector. VOP_BWRITE() was a hack which made it possible for NFS client side to use struct buf with non-bio backing. This patch takes a more general approach and adds a bp->b_op vector where more methods can be added. The success of this patch depends on bp->b_op being initialized in all relevant places, for some value of "relevant" which is not easy to determine. For now the buffers have grown a b_magic element which will make such issues a tiny bit easier to debug.	2001-04-17 08:56:39 +00:00
Boris Popov	0fdabd3a45	Move VT_SMBFS definition to the proper place. Undefine VI_LOCK/VI_UNLOCK.	2001-04-13 11:26:54 +00:00
Boris Popov	681a5bbef2	Import kernel part of SMB/CIFS requester. Add smbfs(CIFS) filesystem. Userland part will be in the ports tree for a while. Obtained from: smbfs-1.3.7-dev package.	2001-04-10 07:59:06 +00:00
Dag-Erling Smørgrav	9733a80839	Let pseudofs into the warmth of the FreeBSD CVS repo. It's not finished yet (I still have to find a way to implement process-dependent nodes without consuming too much memory, and the permission system needs tightening up), but it's becoming hard to work on without a repo (I've accidentally almost nuked it once already), and it works (except for the lack of process-dependent nodes, that is). I was supposed to commit this a week ago, but timed out waiting for jkh to reply to some questions I had. Pass him a spoonful of bad karma :)	2001-04-07 19:51:12 +00:00
John Baldwin	0316f71d56	- Various style fixes. - Fix a silly bug so that we return the actual error code if a procfs attach fails rather than always returning 0. Reported by: bde	2001-03-29 18:10:46 +00:00
John Baldwin	1005a129e5	Convert the allproc and proctree locks from lockmgr locks to sx locks.	2001-03-28 11:52:56 +00:00
John Baldwin	f34fa851e0	Catch up to header include changes: - <sys/mutex.h> now requires <sys/systm.h> - <sys/mutex.h> and <sys/sx.h> now require <sys/lock.h>	2001-03-28 09:17:56 +00:00
Poul-Henning Kamp	f83880518b	Send the remains (such as I have located) of "block major numbers" to the bit-bucket.	2001-03-26 12:41:29 +00:00
Boris Popov	6306f8dad3	Add dependency on libmchain module. Spotted by: Andrzej Tobola <san@iem.pw.edu.pl>	2001-03-22 06:51:53 +00:00
Robert Watson	70f3685105	o Change the API and ABI of the Extended Attribute kernel interfaces to introduce a new argument, "namespace", rather than relying on a first- character namespace indicator. This is in line with more recent thinking on EA interfaces on various mailing lists, including the posix1e, Linux acl-devel, and trustedbsd-discuss forums. Two namespaces are defined by default, EXTATTR_NAMESPACE_SYSTEM and EXTATTR_NAMESPACE_USER, where the primary distinction lies in the access control model: user EAs are accessible based on the normal MAC and DAC file/directory protections, and system attributes are limited to kernel-originated or appropriately privileged userland requests. o These API changes occur at several levels: the namespace argument is introduced in the extattr_{get,set}_file() system call interfaces, at the vnode operation level in the vop_{get,set}extattr() interfaces, and in the UFS extended attribute implementation. Changes are also introduced in the VFS extattrctl() interface (system call, VFS, and UFS implementation), where the arguments are modified to include a namespace field, as well as modified to advoid direct access to userspace variables from below the VFS layer (in the style of recent changes to mount by adrian@FreeBSD.org). This required some cleanup and bug fixing regarding VFS locks and the VFS interface, as a vnode pointer may now be optionally submitted to the VFS_EXTATTRCTL() call. Updated documentation for the VFS interface will be committed shortly. o In the near future, the auto-starting feature will be updated to search two sub-directories to the ".attribute" directory in appropriate file systems: "user" and "system" to locate attributes intended for those namespaces, as the single filename is no longer sufficient to indicate what namespace the attribute is intended for. Until this is committed, all attributes auto-started by UFS will be placed in the EXTATTR_NAMESPACE_SYSTEM namespace. 
o The default POSIX.1e attribute names for ACLs and Capabilities have been updated to no longer include the '$' in their filename. As such, if you're using these features, you'll need to rename the attribute backing files to the same names without '$' symbols in front. o Note that these changes will require changes in userland, which will be committed shortly. These include modifications to the extended attribute utilities, as well as to libutil for new namespace string conversion routines. Once the matching userland changes are committed, a buildworld is recommended to update all the necessary include files and verify that the kernel and userland environments are in sync. Note: If you do not use extended attributes (most people won't), upgrading is not imperative although since the system call API has changed, the new userland extended attribute code will no longer compile with old include files. o Couple of minor cleanups while I'm there: make more code compilation conditional on FFS_EXTATTR, which should recover a bit of space on kernels running without EA's, as well as update copyright dates. Obtained from: TrustedBSD Project	2001-03-15 02:54:29 +00:00
Maxim Sobolev	a7436e684a	Add missed MODULE_VERSION() call, so loading of unicode conversion routine works properly. Clue beaten in by: des	2001-03-11 15:28:42 +00:00
Boris Popov	e3c805cd07	Do not kill vnodes after rename. This can cause deadlocks in the deadfs. Noticed by: Matthew N. Dodd <winter@jurai.net>	2001-03-11 11:51:42 +00:00
Boris Popov	c35e8e54cd	Add a mount time option which slightly relaxes checks for valid Joliet extensions. PR: kern/23315 Reviewed by: adrian	2001-03-11 10:05:08 +00:00
Boris Popov	1db5c04bc0	Slightly reorganize allocation of new vnode. Use bit NVOLUME to detect vnodes which represent volumes (before it was done via strcmp()). Turn n_refparent into bit in the n_flag field.	2001-03-10 05:39:03 +00:00
Boris Popov	d691852ce6	Synch with changes in the NCP requester.	2001-03-10 05:31:22 +00:00
Kirk McKusick	589c7af992	Fixes to track snapshot copy-on-write checking in the specinfo structure rather than assuming that the device vnode would reside in the FFS filesystem (which is obviously a broken assumption with the device filesystem).	2001-03-07 07:09:55 +00:00
John Baldwin	19eb87d22a	Grab the process lock while calling psignal and before calling psignal.	2001-03-07 03:37:06 +00:00
John Baldwin	931cccf603	Proc locking identical to that of linprocfs' vnops except that we hold the proc lock while calling psignal.	2001-03-07 03:15:05 +00:00
John Baldwin	30ac5d0f9e	Protect read to p_pptr with proc lock rather than proctree lock.	2001-03-07 03:10:20 +00:00
John Baldwin	c65c565b44	Proc locking. Lock around psignal() and also ensure both an exclusive proctree lock and the process lock are held when updating p_pptr and p_oppid. When we are just reading p_pptr we only need the proc lock and not a proctree lock as well.	2001-03-07 03:09:40 +00:00
John Baldwin	0087374731	Protect p_flag with the proc lock.	2001-03-07 02:07:56 +00:00
Boris Popov	1cebc48fb3	A name of the file can change while its id stays the same. So, we have to update it as well. Remove unused function.	2001-03-06 09:59:18 +00:00
Doug Rabson	a76decc6f7	Remove the copyinstr call which was trying to copy the pathname in from user space. It has already been copied in and mp->mnt_stat.f_mntonname has already been initialised by the caller. This fixes a panic on the alpha caused by the fact that the variable 'size' wasn't initialised because the call to copyinstr() bailed out with an EFAULT error.	2001-03-03 15:15:33 +00:00
Adrian Chadd	f3a90da995	Reviewed by: jlemon An initial tidyup of the mount() syscall and VFS mount code. This code replaces the earlier work done by jlemon in an attempt to make linux_mount() work. * the guts of the mount work has been moved into vfs_mount(). * move `type', `path' and `flags' from being userland variables into being kernel variables in vfs_mount(). `data' remains a pointer into userspace. * Attempt to verify the `type' and `path' strings passed to vfs_mount() aren't too long. * rework mount() and linux_mount() to take the userland parameters (besides data, as mentioned) and pass kernel variables to vfs_mount(). (linux_mount() already did this, I've just tidied it up a little more.) * remove the copyin() stuff for `path'. `data' still requires copyin() since its a pointer into userland. * set `mount->mnt_statf_mntonname' in vfs_mount() rather than in each filesystem. This variable is generally initialised with `path', and each filesystem can override it if they want to. * NOTE: f_mntonname is intiailised with "/" in the case of a root mount.	2001-03-01 21:00:17 +00:00
Alfred Perlstein	8283130be4	Display the Joliet Extension 'level' in the log message. PR: kern/24998	2001-02-23 03:43:05 +00:00
Robert Watson	91421ba234	o Move per-process jail pointer (p->pr_prison) to inside of the subject credential structure, ucred (cr->cr_prison). o Allow jail inheritence to be a function of credential inheritence. o Abstract prison structure reference counting behind pr_hold() and pr_free(), invoked by the similarly named credential reference management functions, removing this code from per-ABI fork/exit code. o Modify various jail() functions to use struct ucred arguments instead of struct proc arguments. o Introduce jailed() function to determine if a credential is jailed, rather than directly checking pointers all over the place. o Convert PRISON_CHECK() macro to prison_check() function. o Move jail() function prototypes to jail.h. o Emulate the P_JAILED flag in fill_kinfo_proc() and no longer set the flag in the process flags field itself. o Eliminate that "const" qualifier from suser/p_can/etc to reflect mutex use. Notes: o Some further cleanup of the linux/jail code is still required. o It's now possible to consider resolving some of the process vs credential based permission checking confusion in the socket code. o Mutex protection of struct prison is still not present, and is required to protect the reference count plus some fields in the structure. Reviewed by: freebsd-arch Obtained from: TrustedBSD Project	2001-02-21 06:39:57 +00:00
Poul-Henning Kamp	3e8bea9634	Remove a debug printf.	2001-02-18 09:16:49 +00:00
Jonathan Lemon	608a3ce62a	Extend kqueue down to the device layer. Backwards compatible approach suggested by: peter	2001-02-15 16:34:11 +00:00
Maxim Sobolev	c4fefc4887	Add a hook for loading of a Unicode -> char conversion routine as a kld at a run-time. This is temporary solution until proper kernel Unicode interfaces are in place and as such was purposely designed to be as tiny as possible (3 lines of the code not counting comments). The port with conversion routines for the most popular single-byte languages will be added later today Reviewed by: bp, "Michael C . Wu" <keichii@iteration.net> Approved by: bp	2001-02-13 11:48:31 +00:00
Bosko Milekic	9ed346bab0	Change and clean the mutex lock interface. mtx_enter(lock, type) becomes: mtx_lock(lock) for sleep locks (MTX_DEF-initialized locks) mtx_lock_spin(lock) for spin locks (MTX_SPIN-initialized) similarily, for releasing a lock, we now have: mtx_unlock(lock) for MTX_DEF and mtx_unlock_spin(lock) for MTX_SPIN. We change the caller interface for the two different types of locks because the semantics are entirely different for each case, and this makes it explicitly clear and, at the same time, it rids us of the extra `type' argument. The enter->lock and exit->unlock change has been made with the idea that we're "locking data" and not "entering locked code" in mind. Further, remove all additional "flags" previously passed to the lock acquire/release routines with the exception of two: MTX_QUIET and MTX_NOSWITCH The functionality of these flags is preserved and they can be passed to the lock/unlock routines by calling the corresponding wrappers: mtx_{lock, unlock}_flags(lock, flag(s)) and mtx_{lock, unlock}_spin_flags(lock, flag(s)) for MTX_DEF and MTX_SPIN locks, respectively. Re-inline some lock acq/rel code; in the sleep lock case, we only inline the _obtain_lock()s in order to ensure that the inlined code fits into a cache line. In the spin lock case, we inline recursion and actually only perform a function call if we need to spin. This change has been made with the idea that we generally tend to avoid spin locks and that also the spin locks that we do have and are heavily used (i.e. sched_lock) do recurse, and therefore in an effort to reduce function call overhead for some architectures (such as alpha), we inline recursion for this case. Create a new malloc type for the witness code and retire from using the M_DEV type. The new type is called M_WITNESS and is only declared if WITNESS is enabled. 
Begin cleaning up some machdep/mutex.h code - specifically updated the "optimized" inlined code in alpha/mutex.h and wrote MTX_LOCK_SPIN and MTX_UNLOCK_SPIN asm macros for the i386/mutex.h as we presently need those. Finally, caught up to the interface changes in all sys code. Contributors: jake, jhb, jasone (in no particular order)	2001-02-09 06:11:45 +00:00
Jeroen Ruigrok van der Werven	1a6e52d0e9	Fix typo: seperate -> separate. Seperate does not exist in the english language.	2001-02-06 11:21:58 +00:00
Poul-Henning Kamp	37d4006626	Another round of the <sys/queue.h> FOREACH transmogriffer. Created with: sed(1) Reviewed by: md5(1)	2001-02-04 16:08:18 +00:00
Poul-Henning Kamp	fc2ffbe604	Mechanical change to use <sys/queue.h> macro API instead of fondling implementation details. Created with: sed(1) Reviewed by: md5(1)	2001-02-04 13:13:25 +00:00
Poul-Henning Kamp	ef9e85abba	Use <sys/queue.h> macro API.	2001-02-04 12:37:48 +00:00
Poul-Henning Kamp	b99cfaf32c	Remove a DIAGNOSTIC check which belongs in <sys/queue.h> if anyplace at all.	2001-02-04 11:53:51 +00:00
Poul-Henning Kamp	4b1c62b3f2	At the point in time where most devices are created, we don't know what time it is because boottime is not yet initialized. Finagle the relevant fields when we get the chance.	2001-02-02 22:54:41 +00:00
Poul-Henning Kamp	ecde9a6dae	Only superuser can create symlinks. Give symlinks mode 755 by default to avoid triggering alert eyes. (the mode isn't used on symlinks)	2001-02-02 18:35:29 +00:00
Peter Wemm	2508f69037	Zap last remaining references to (and a use of) simple_locks.	2001-01-31 04:29:52 +00:00
Poul-Henning Kamp	4997ad7c1f	Add a BUF_KERNPROC() in the BIO_DELETE path. This seems to fix the problem which md(4) backed filesystems exposed.	2001-01-30 10:06:08 +00:00
Poul-Henning Kamp	aadf265525	Fix two minor nits. Existences revealed, but no details offered by: bp	2001-01-30 08:39:52 +00:00
Matthew Dillon	2a9737202a	This patch reestablishes the spec_fsync() guarentee that synchronous fsyncs, which typically occur during unmounting, will drain all dirty buffers even if it takes multiple passes to do so. The guarentee was mangled by the last patch which solved a problem due to -current disabling interrupts while holding giant (which caused an infinite spin loop waiting for I/O to complete). -stable does not have either patch, but has a similar bug in the original spec_fsync() code which is triggered by a bug in the softupdates umount code, a fix for which will be committed to -current as soon as Kirk stamps it. Then both solutions will be MFC'd to -stable. -stable currently suffers from a combination of the softupdates bug and a small window of opportunity in the original spec_fsync() code, and -stable also suffers from the spin-loop bug but since interrupts are enabled the spin resolves itself in a few milliseconds.	2001-01-29 08:19:28 +00:00
John Baldwin	ba88dfc733	Back out proc locking to protect p_ucred for obtaining additional references along with the actual obtaining of additional references.	2001-01-27 00:01:31 +00:00
Jason Evans	1b367556b5	Convert all simplelocks to mutexes and remove the simplelock implementations.	2001-01-24 12:35:55 +00:00

1011 Commits