freebsd-skq

Author	SHA1	Message	Date
kris	bd6f9cb9b6	Fix some signed/unsigned integer confusion, and add bounds checking of arguments to some functions. Obtained from: NetBSD Reviewed by: peter MFC after: 2 weeks	2001-09-10 11:28:07 +00:00
semenu	8c98d68610	Stole unicode translation table from mount_msdos. Add kernel code to support this translation. MFC after: 2 weeks	2001-09-08 23:03:52 +00:00
semenu	4c1e708040	Fix opening particular file's attributes (as described in man page). This is useful for debug purposes. MFC after: 2 weeks	2001-09-08 22:59:12 +00:00
semenu	4d49e9cd2b	Reference devvp on ntnode creation and dereference on removal. Previous code led to page faults because i_devvp went zero after VOP_RECLAIM, but ntnode was reused (not reclaimed). MFC after: 2 weeks	2001-09-08 22:57:03 +00:00
semenu	72ce608646	Fix errors and warnings when compiling with NTFS_DEBUG > 1 MFC after: 2 weeks	2001-09-08 22:53:27 +00:00
ache	adf3e081dc	smbfs_advlock: simplify overflow checks (copy from kern_lockf.c) minor formatting issues to minimize differences	2001-08-29 18:59:04 +00:00
ache	eb7b8850bf	Cosmetique & style fixes from bde	2001-08-26 10:28:58 +00:00
ache	ee0b05dda7	Copy from kern_lockf.c: remove extra check	2001-08-24 10:22:16 +00:00
ache	640689c04d	Copy yet one check for SEEK_END overflow	2001-08-23 17:12:42 +00:00
ache	f8ef40b0cb	Copy my newly introduced l_len<0 'oops' fix from kern_lockf.c	2001-08-23 16:06:14 +00:00
ache	e932bf8680	Copy POSIX l_len<0 handling from kern_lockf.c	2001-08-23 15:44:24 +00:00
ache	f79f77e5f9	Cosmetique: correct English in comments non-cosmetique: add missing break; - original code was broken here	2001-08-23 14:45:31 +00:00
ache	2879f02ee4	Move <machine/> after <sys/> Pointed by: bde	2001-08-23 13:27:58 +00:00
ache	9b50a3bc04	adv. lock: copy EOVERFLOW handling code from main variant fix type of 'size' arg	2001-08-23 08:54:22 +00:00
bp	555767b49c	Use proper endian conversion. Obtained from: Mac OS X MFC after: 1 week	2001-08-21 08:27:47 +00:00
bp	078a22aae0	Return proper length of _PC_NAME_MAX value if long names support is enabled. Obtained from: Mac OS X MFC after: 1 week	2001-08-21 08:25:09 +00:00
phk	14112e9d98	linux ls fails on DEVFS /dev because linux_getdents fails because linux_getdents uses VOP_READDIR( ..., &ncookies, &cookies ) instead of VOP_READDIR( ..., NULL, NULL ) because it seems to need the offsets for linux_dirent and sizeof(dirent) != sizeof(linux_dirent)... PR: 29467 Submitted by: Michael Reifenberger <root@nihil.plaut.de> Reviewed by: phk	2001-08-14 06:42:32 +00:00
rwatson	18a29c5f33	Remove dangling prototype for the now defunct procfs_kmemaccess() call. Obtained from: TrustedBSD Project	2001-08-03 17:51:05 +00:00
rwatson	90376d10b0	Collapse a Pmem case in with the other debugging files case for procfs, as there are now "unusual" protection properties to Pmem that differ from the other files. While I'm at it, introduce proc locking for the other files, which was previously present only in the Pmem case. Obtained from: TrustedBSD Project	2001-08-03 17:20:34 +00:00
rwatson	9b545cd960	Remove read permission for group on the /proc/*/mem file, since kmem no longer requires access. Reviewed by: tmm Obtained from: TrustedBSD Project	2001-08-03 17:15:40 +00:00
rwatson	306deb3ae6	Prior to support for almost all ps activity via sysctl, ps used procfs, and so special-casing was introduced to provide extra procfs privilege to the kmem group. With the advent of non-setgid kmem ps, this code is no longer required, and in fact, is potentially harmful as it allocates privilege to a gid that is increasingly less meaningful. Knowledge of specific gid's in kernel is also generally bad precedent, as the kernel security policy doesn't distinguish gid's specifically, only uid 0. This commit removes reference to kmem in procfs, both in terms of access control decisions, and the applying of gid kmem to the /proc/*/mem file, simplifying the associated code considerably. Processes are still permitted to access the mem file based on the debugging policy, so ps -e still works fine for normal processes and use. Reviewed by: tmm Obtained from: TrustedBSD Project	2001-08-03 17:13:23 +00:00
assar	34e9a6f370	remove support for creating files and directories from msdosfs_mknod	2001-07-19 19:15:42 +00:00
jhb	99b71dfa30	Grab the process lock around psignal(). Noticed by: tanimura	2001-07-18 19:17:36 +00:00
rwatson	da1a848c61	o Replace calls to p_can(..., P_CAN_xxx) with calls to p_canxxx(). The p_can(...) construct was a premature (and, it turns out, awkward) abstraction. The individual calls to p_canxxx() better reflect differences between the inter-process authorization checks, such as differing checks based on the type of signal. This has a side effect of improving code readability. o Replace direct credential authorization checks in ktrace() with invocation of p_candebug(), while maintaining the special case check of KTR_ROOT. This allows ktrace() to "play more nicely" with new mandatory access control schemes, as well as making its authorization checks consistent with other "debugging class" checks. o Eliminate "privused" construct for p_can*() calls which allowed the caller to determine if privilege was required for successful evaluation of the access control check. This primitive is currently unused, and as such, serves only to complicate the API. Approved by: ({procfs,linprocfs} changes) des Obtained from: TrustedBSD Project	2001-07-05 17:10:46 +00:00
jhb	ff7f1b3c5b	- Update the vmmeter statistics for vnode pageins and pageouts in getpages/putpages. - Use vm_page_undirty() instead of messing with pages' dirty fields directly.	2001-07-04 19:55:01 +00:00
dillon	e028603b7e	With Alfred's permission, remove vm_mtx in favor of a fine-grained approach (this commit is just the first stage). Also add various GIANT_ macros to formalize the removal of Giant, making it easy to test in a more piecemeal fashion. These macros will allow us to test fine-grained locks to a degree before removing Giant, and also after, and to remove Giant in a piecemeal fashion via sysctl's on those subsystems which the authors believe can operate without Giant.	2001-07-04 16:20:28 +00:00
jhb	0bd9d86c0a	Fix a mntvnode and vnode interlock reversal.	2001-06-28 03:52:04 +00:00
jhb	54c05ef2f2	Protect the mnt_vnode list with the mntvnode lock.	2001-06-28 03:50:17 +00:00
des	cf75a40bc5	#if 0 out pfs_null() to silence the warning about it not being referenced.	2001-06-15 12:30:46 +00:00
peter	376d88fdd4	Fix warning: 568: warning: `portal_badop' defined but not used	2001-06-15 00:38:03 +00:00
peter	07d710ea39	Fix warning (exposed NetBSD code): 94: warning: `ntfs_bmap' declared `static' but never defined	2001-06-15 00:32:07 +00:00
peter	8c7837e7b6	Fix warnings (mostly harmless, due to struct bio being embedded in buf): 738: warning: passing arg 1 of `biodone' from incompatible pointer type 745: warning: passing arg 1 of `biodone' from incompatible pointer type	2001-06-15 00:30:27 +00:00
peter	964d84d5a2	Fix warning: 552: warning: `fdesc_badop' defined but not used	2001-06-15 00:27:21 +00:00
peter	c810b4bfde	Warning fix: coda_fbsd.c:113: warning: unused variable `ret'	2001-06-15 00:02:27 +00:00
bp	230e46d127	Coda does not call vop_defaultop(), so add necessary calls for VM objects. Submitted by: Greg Troxel <gdt@ir.bbn.com> MFC after: 2 days	2001-06-14 09:28:30 +00:00
mjacob	a14c63022e	the last argument to copyinstr is of type size_t, not u_int	2001-06-13 18:58:11 +00:00
peter	f10fa038c1	With this commit, I hereby pronounce gensetdefs past its use-by date. Replace the a.out emulation of 'struct linker_set' with something a little more flexible. <sys/linker_set.h> now provides macros for accessing elements and completely hides the implementation. The linker_set.h macros have been on the back burner in various forms since 1998 and has ideas and code from Mike Smith (SET_FOREACH()), John Polstra (ELF clue) and myself (cleaned up API and the conversion of the rest of the kernel to use it). The macros declare a strongly typed set. They return elements with the type that you declare the set with, rather than a generic void *. For ELF, we use the magic ld symbols (__start_<setname> and __stop_<setname>). Thanks to Richard Henderson <rth@redhat.com> for the trick about how to force ld to provide them for kld's. For a.out, we use the old linker_set struct. NOTE: the item lists are no longer null terminated. This is why the code impact is high in certain areas. The runtime linker has a new method to find the linker set boundaries depending on which backend format is in use. linker sets are still module/kld unfriendly and should never be used for anything that may be modular one day. Reviewed by: eivind	2001-06-13 10:58:39 +00:00
des	cd17b04723	For some reason, though the module builds just fine without <sys/lock.h>, LINT fails to build without it.	2001-06-11 15:04:48 +00:00
des	169656d24e	Bail out if the fill function failed.	2001-06-10 21:39:01 +00:00
des	2002923bda	Whoops, some of my test code snuck in here.	2001-06-10 21:37:11 +00:00
des	7711a01ae2	Argh. Fix braino in previous commit.	2001-06-10 18:54:04 +00:00
des	7a9137328d	Add a 'flags' argument to the PFS_PROCDIR macro.	2001-06-10 18:52:55 +00:00
des	da96d2410a	Add support for process-dependent directories. This means that save for the lack of a man page, pseudofs is mostly complete now.	2001-06-10 18:39:21 +00:00
des	c48f6bb4db	Blah, not my day. This file needs <sys/mutex.h> now.	2001-06-10 10:42:55 +00:00
des	937629ee8a	Remember to unlock the process pfind() returns.	2001-06-10 10:42:01 +00:00
des	33f35efc4a	Add missing #include of <sys/mutex.h>.	2001-06-10 10:36:16 +00:00
des	2516536ce6	Catch up with the change in sbuf_new's prototype.	2001-06-10 10:34:21 +00:00
jlemon	60545edbcb	The kq write filter was hooked up to the wrong socket, and thus was not behaving correctly. Fix by attaching to the correct socket. Also call so{rw}wakeup in addition to the fifo wakeup, so that any kqfilters attached to the socket buffer get poked.	2001-06-06 17:38:36 +00:00
tanimura	f16ee52380	Lock VM Giant prior to locking a vm map. Spotted by: Daniel Rock <D.Rock@t-online.de> Tested by: David Wolfskill <david@catwhisker.org>, Sean Eric Fagan <sef@kithrup.com>	2001-06-06 04:13:11 +00:00
shafeeq	8c780c8d9a	Now works again and as a module and with devfs. Used the bpf & tun drivers as examples as to what is necessary for devfs.	2001-06-05 19:45:16 +00:00
brian	18d829816a	Support /dev/tun cloning. Ansify if_tun.c while I'm there. Only tun0 -> tun32767 may now be opened as struct ifnet's if_unit is a short. It's now possible to open /dev/tun and get a handle back for an available tun device (use devname to find out what you got). The implementation uses rman by popular demand (and against my judgement) to track opened devices and uses the new dev_depends() to ensure that all make_dev()d devices go away before the module is unloaded. Reviewed by: phk	2001-06-01 15:51:10 +00:00
ru	0c44ad95b8	- VFS_SET(msdos) -> VFS_SET(msdosfs) - msdos.ko -> msdosfs.ko - mount_msdos(8) -> mount_msdosfs(8) - "msdos" -> "msdosfs" compatibility glue in mount(8)	2001-06-01 10:57:26 +00:00
phk	89034502d1	Don't copy the trailing zero in readlink, it confuses namei(). PR: 27656	2001-05-26 20:07:57 +00:00
ru	05f3be90b2	- sys/n[tw]fs moved to sys/fs/n[tw]fs - /usr/include/n[tw]fs moved to /usr/include/fs/n[tw]fs	2001-05-26 11:57:45 +00:00
phk	2072a71f0e	Create a general facility for making dev_t's depend on another dev_t. The dev_depends(dev_t, dev_t) function is for tying them to each other. When destroy_dev() is called on a dev_t, all dev_t's depending on it will also be destroyed (depth first order). Rewrite the make_dev_alias() to use this dependency facility. kern/subr_disk.c: Make the disk mini-layer use dependencies to make sure all relevant dev_t's are removed when the disk disappears. Make the disk mini-layer precreate some magic sub devices which the disk/slice/label code expects to be there. kern/subr_disklabel.c: Remove some now unneeded variables. kern/subr_diskmbr.c: Remove some ancient, commented out code. kern/subr_diskslice.c: Minor cleanup. Use name from dev_t instead of dsname()	2001-05-26 08:27:58 +00:00
rwatson	f504530d9f	o Merge contents of struct pcred into struct ucred. Specifically, add the real uid, saved uid, real gid, and saved gid to ucred, as well as the pcred->pc_uidinfo, which was associated with the real uid, only rename it to cr_ruidinfo so as not to conflict with cr_uidinfo, which corresponds to the effective uid. o Remove p_cred from struct proc; add p_ucred to struct proc, replacing the original macro that pointed p->p_ucred to p->p_cred->pc_ucred. o Universally update code so that it makes use of ucred instead of pcred, p->p_ucred instead of p->p_pcred, cr_ruidinfo instead of p_uidinfo, cr_{r,sv}{u,g}id instead of p_*, etc. o Remove pcred0 and its initialization from init_main.c; initialize cr_ruidinfo there. o Restructure many credential modification chunks to always crdup while we figure out locking and optimizations; generally speaking, this means moving to a structure like this: newcred = crdup(oldcred); ... p->p_ucred = newcred; crfree(oldcred); It's not race-free, but better than nothing. There are also races in sys_process.c, all inter-process authorization, fork, exec, and exit. o Remove sigio->sio_ruid since sigio->sio_ucred now contains the ruid; remove comments indicating that the old arrangement was a problem. o Restructure exec1() a little to use newcred/oldcred arrangement, and use improved uid management primitives. o Clean up exit1() so as to do less work in credential cleanup due to pcred removal. o Clean up fork1() so as to do less work in credential cleanup and allocation. o Clean up ktrcanset() to take into account changes, and move to using suser_xxx() instead of performing a direct uid==0 comparison. o Improve commenting in various kern_prot.c credential modification calls to better document current behavior. In a couple of places, current behavior is a little questionable and we need to check POSIX.1 to make sure it's "right". More commenting work still remains to be done. 
o Update credential management calls, such as crfree(), to take into account new ruidinfo reference. o Modify or add the following uid and gid helper routines: change_euid() change_egid() change_ruid() change_rgid() change_svuid() change_svgid() In each case, the call now acts on a credential not a process, and as such no longer requires more complicated process locking/etc. They now assume the caller will do any necessary allocation of an exclusive credential reference. Each is commented to document its reference requirements. o CANSIGIO() is simplified to require only credentials, not processes and pcreds. o Remove lots of (p_pcred==NULL) checks. o Add an XXX to authorization code in nfs_lock.c, since it's questionable, and needs to be considered carefully. o Simplify posix4 authorization code to require only credentials, not processes and pcreds. Note that this authorization, as well as CANSIGIO(), needs to be updated to use the p_cansignal() and p_cansched() centralized authorization routines, as they currently do not take into account some desirable restrictions that are handled by the centralized routines, as well as being inconsistent with other similar authorization instances. o Update libkvm to take these changes into account. Obtained from: TrustedBSD Project Reviewed by: green, bde, jhb, freebsd-arch, freebsd-audit	2001-05-25 16:59:11 +00:00
ru	8094d979ca	- sys/msdosfs moved to sys/fs/msdosfs - msdos.ko renamed to msdosfs.ko - /usr/include/msdosfs moved to /usr/include/fs/msdosfs	2001-05-25 08:14:14 +00:00
ru	1fdf57d327	Actually rename FDESC, PORTAL, UMAP and UNION file systems. OK'ed by: bp	2001-05-24 15:20:11 +00:00
ru	ddb4a4df8f	mount_umap(8) -> mount_umapfs(8).	2001-05-24 13:20:41 +00:00
ru	739b6e17bf	mount_null(8) -> mount_nullfs(8).	2001-05-24 13:17:47 +00:00
jhb	67d46f1c41	Don't acquire/release Giant around some of the places that need it in spec_getpages(). Instead, assert that Giant is held by the caller.	2001-05-23 22:20:29 +00:00
phk	c8d4d78546	Change the way deletes are managed in DEVFS. This fixes a number of warnings relating to removed cloned devices. It also makes it possible to recreate deleted devices with mknod(2). The major/minor arguments are ignored.	2001-05-23 17:48:20 +00:00
ru	35437d86aa	- FDESC, FIFO, NULL, PORTAL, PROC, UMAP and UNION file systems were repo-copied from sys/miscfs to sys/fs. - Renamed the following file systems and their modules: fdesc -> fdescfs, portal -> portalfs, union -> unionfs. - Renamed corresponding kernel options: FDESC -> FDESCFS, PORTAL -> PORTALFS, UNION -> UNIONFS. - Install header files for the above file systems. - Removed bogus -I${.CURDIR}/../../sys CFLAGS from userland Makefiles.	2001-05-23 09:42:29 +00:00
jhb	a445507567	Sort includes from previous commit.	2001-05-21 23:19:50 +00:00
alfred	a3f0842419	Introduce a global lock for the vm subsystem (vm_mtx). vm_mtx does not recurse and is required for most low level vm operations. faults can not be taken without holding Giant. Memory subsystems can now call the base page allocators safely. Almost all atomic ops were removed as they are covered under the vm mutex. Alpha and ia64 now need to catch up to i386's trap handlers. FFS and NFS have been tested, other filesystems will need minor changes (grabbing the vm lock when twiddling page properties). Reviewed (partially) by: jake, jhb	2001-05-19 01:28:09 +00:00
bp	85b499e6f8	Currently there is no way to tell if write operation invoked via vn_start_write() on the given vnode will be successful. VOP_LEASE() may help to solve this problem, but its return value ignored nearly everywhere. For now just assume that the missing upper layer on write means insufficient access rights (which is correct for most cases).	2001-05-18 07:43:13 +00:00
bp	d0e9de0900	VOP getwritemount() can be invoked on vnodes with VFREE flag set (used in snapshots code). At this point upper vp may not exist.	2001-05-17 04:58:25 +00:00
bp	61ab4d4ff9	Use vop_*vobject() VOPs to get reference to VM object from upper or lower fs.	2001-05-17 04:52:57 +00:00
bp	f9fd9a4dce	Do not leave an extra reference on vnode. PR: kern/27250 Submitted by: "Vladimir B. Grebenschikov" <vova@express.ru> MFC after: 2 weeks	2001-05-17 04:40:01 +00:00
iedowse	dafd513732	Change the second argument of vflush() to an integer that specifies the number of references on the filesystem root vnode to be both expected and released. Many filesystems hold an extra reference on the filesystem root vnode, which must be accounted for when determining if the filesystem is busy and then released if it isn't busy. The old `skipvp' approach required individual filesystem xxx_unmount functions to re-implement much of vflush()'s logic to deal with the root vnode. All 9 filesystems that hold an extra reference on the root vnode got the logic wrong in the case of forced unmounts, so `umount -f' would always fail if there were any extra root vnode references. Fix this issue centrally in vflush(), now that we can. This commit also fixes a vnode reference leak in devfs, which could result in idle devfs filesystems that refuse to unmount. Reviewed by: phk, bp	2001-05-16 18:04:37 +00:00
phk	4fe46b461d	After a successful poll of the cloning functions, match on the returned dev_t rather than the original name. This allows cloning from one name to another which is useful for /dev/tty and later for the pty's.	2001-05-14 08:20:46 +00:00
phk	0e2026a179	Convert DEVFS from an "opt-in" to an "opt-out" option. If for some reason DEVFS is undesired, the "NODEVFS" option is needed now. Pending any significant issues, DEVFS will be made mandatory in -current on july 1st so that we can start reaping the full benefits of having it.	2001-05-13 20:52:40 +00:00
jhb	9f9b0c09e4	GC prototype for procfs_bmap() missed during a previous commit.	2001-05-11 23:37:37 +00:00
phk	11b5378677	Remove unneeded devfs_badop() Noticed by: rwatson	2001-05-06 17:40:34 +00:00
bp	11e8bff787	Convert vnode_pager_freepage() to vm_free_page(). Forgotten by: alfred	2001-05-03 09:00:54 +00:00
phk	5948c9ed5b	Implement vop_std{get\|put}pages() and add them to the default vop[]. Un-copy&paste all the VOP_{GET\|PUT}PAGES() functions which do nothing but the default.	2001-05-01 08:34:45 +00:00
markm	bcca5847d5	Undo part of the tangle of having sys/lock.h and sys/mutex.h included in other "system" header files. Also help the deprecation of lockmgr.h by making it a sub-include of sys/lock.h and removing sys/lockmgr.h from kernel .c files. Sort sys/*.h includes where possible in affected files. OK'ed by: bde (with reservations)	2001-05-01 08:13:21 +00:00
phk	661864d53b	Uncut&paste some bogus use of VOP_BMAP in cd9660::VOP_STRATEGY. XXX mark some stuff which looks like further cut&paste junk.	2001-04-30 21:23:05 +00:00
phk	781e8e2614	Uncut&paste some bogus use of VOP_BMAP in hpfs::VOP_STRATEGY. At the same time, eliminate uninitialized use of a vnode pointer. Interesting GCC didn't spot this.	2001-04-30 21:21:53 +00:00
bde	e7467b0a7b	Backed out previous commit. It caused massive filesystem corruption, not to mention a compile-time warning about the critical function becoming unused, by replacing spec_bmap() with vop_stdbmap(). ntfs seems to have the same bug. The factor for converting specfs block numbers to physical block numbers is 1, but vop_stdbmap() uses the bogus factor btodb(ap->a_vp->v_mount->mnt_stat.f_iosize), which is 16 for ffs with the default block size of 8K. This factor is bogus even for vop_stdbmap() -- the correct factor is related to the filesystem blocksize which is not necessarily the same as the optimal i/o size. vop_stdbmap() was apparently cloned from nfs where these sizes happen to be the same. There may also be a problem with a_vp->v_mount being null. spec_bmap() still checks for this, but I think the checks in specfs are dead code which used to support block devices.	2001-04-30 14:35:35 +00:00
phk	608c1caf3b	Add a vop_stdbmap(), and make it part of the default vop vector. Make 7 filesystems which don't really know about VOP_BMAP rely on the default vector, rather than more or less complete local vop_nopbmap() implementations.	2001-04-29 11:48:41 +00:00
grog	4b9d9cbaac	Revert consequences of changes to mount.h, part 2. Requested by: bde	2001-04-29 02:45:39 +00:00
phk	cdc83afc7f	Move the netexport structure from the fs-specific mount structure to struct mount. This makes the "struct netexport *" parameter to the vfs_export and vfs_checkexport interface unneeded. Consequently, all non-stacking filesystems can use vfs_stdcheckexp(). At the same time, make it a pointer to a struct netexport in struct mount, so that we can remove the bogus AF_MAX and #include <net/radix.h> from <sys/mount.h>	2001-04-25 07:07:52 +00:00
jhb	9c03a8ae91	Change the pfind() and zpfind() functions to lock the process that they find before releasing the allproc lock and returning. Reviewed by: -smp, dfr, jake	2001-04-24 00:51:53 +00:00
mjacob	e13477a78e	fix it so it compiles again	2001-04-23 18:51:54 +00:00
mjacob	053acf45ec	add this ridiculous include foo so it will compile again	2001-04-23 18:14:41 +00:00
grog	1f5de30718	Correct #includes to work with fixed sys/mount.h.	2001-04-23 09:05:15 +00:00
grog	a943ac2de3	Correct #includes to work with fixed sys/mount.h.	2001-04-23 08:28:44 +00:00
alfred	1e50a5f33e	vnode_pager_freepage() is really vm_page_free() in disguise, nuke vnode_pager_freepage() and replace all calls to it with vm_page_free()	2001-04-19 06:18:23 +00:00
phk	378e561228	This patch removes the VOP_BWRITE() vector. VOP_BWRITE() was a hack which made it possible for NFS client side to use struct buf with non-bio backing. This patch takes a more general approach and adds a bp->b_op vector where more methods can be added. The success of this patch depends on bp->b_op being initialized all relevant places for some value of "relevant" which is not easy to determine. For now the buffers have grown a b_magic element which will make such issues a tiny bit easier to debug.	2001-04-17 08:56:39 +00:00
bp	f9931b90b2	Move VT_SMBFS definition to the proper place. Undefine VI_LOCK/VI_UNLOCK.	2001-04-13 11:26:54 +00:00
bp	a414f03f5d	Import kernel part of SMB/CIFS requester. Add smbfs(CIFS) filesystem. Userland part will be in the ports tree for a while. Obtained from: smbfs-1.3.7-dev package.	2001-04-10 07:59:06 +00:00
des	ee97bef8dd	Let pseudofs into the warmth of the FreeBSD CVS repo. It's not finished yet (I still have to find a way to implement process-dependent nodes without consuming too much memory, and the permission system needs tightening up), but it's becoming hard to work on without a repo (I've accidentally almost nuked it once already), and it works (except for the lack of process-dependent nodes, that is). I was supposed to commit this a week ago, but timed out waiting for jkh to reply to some questions I had. Pass him a spoonful of bad karma :)	2001-04-07 19:51:12 +00:00
jhb	35bb2b40ac	- Various style fixes. - Fix a silly bug so that we return the actual error code if a procfs attach fails rather than always returning 0. Reported by: bde	2001-03-29 18:10:46 +00:00
jhb	79cf991a6b	Convert the allproc and proctree locks from lockmgr locks to sx locks.	2001-03-28 11:52:56 +00:00
jhb	b47bfbe544	Catch up to header include changes: - <sys/mutex.h> now requires <sys/systm.h> - <sys/mutex.h> and <sys/sx.h> now require <sys/lock.h>	2001-03-28 09:17:56 +00:00
phk	c47745e977	Send the remains (such as I have located) of "block major numbers" to the bit-bucket.	2001-03-26 12:41:29 +00:00
bp	dc983aecd7	Add dependency on libmchain module. Spotted by: Andrzej Tobola <san@iem.pw.edu.pl>	2001-03-22 06:51:53 +00:00
rwatson	f773ff5a87	o Change the API and ABI of the Extended Attribute kernel interfaces to introduce a new argument, "namespace", rather than relying on a first-character namespace indicator. This is in line with more recent thinking on EA interfaces on various mailing lists, including the posix1e, Linux acl-devel, and trustedbsd-discuss forums. Two namespaces are defined by default, EXTATTR_NAMESPACE_SYSTEM and EXTATTR_NAMESPACE_USER, where the primary distinction lies in the access control model: user EAs are accessible based on the normal MAC and DAC file/directory protections, and system attributes are limited to kernel-originated or appropriately privileged userland requests. o These API changes occur at several levels: the namespace argument is introduced in the extattr_{get,set}_file() system call interfaces, at the vnode operation level in the vop_{get,set}extattr() interfaces, and in the UFS extended attribute implementation. Changes are also introduced in the VFS extattrctl() interface (system call, VFS, and UFS implementation), where the arguments are modified to include a namespace field, as well as modified to avoid direct access to userspace variables from below the VFS layer (in the style of recent changes to mount by adrian@FreeBSD.org). This required some cleanup and bug fixing regarding VFS locks and the VFS interface, as a vnode pointer may now be optionally submitted to the VFS_EXTATTRCTL() call. Updated documentation for the VFS interface will be committed shortly. o In the near future, the auto-starting feature will be updated to search two sub-directories to the ".attribute" directory in appropriate file systems: "user" and "system" to locate attributes intended for those namespaces, as the single filename is no longer sufficient to indicate what namespace the attribute is intended for. Until this is committed, all attributes auto-started by UFS will be placed in the EXTATTR_NAMESPACE_SYSTEM namespace. 
o The default POSIX.1e attribute names for ACLs and Capabilities have been updated to no longer include the '$' in their filename. As such, if you're using these features, you'll need to rename the attribute backing files to the same names without '$' symbols in front. o Note that these changes will require changes in userland, which will be committed shortly. These include modifications to the extended attribute utilities, as well as to libutil for new namespace string conversion routines. Once the matching userland changes are committed, a buildworld is recommended to update all the necessary include files and verify that the kernel and userland environments are in sync. Note: If you do not use extended attributes (most people won't), upgrading is not imperative although since the system call API has changed, the new userland extended attribute code will no longer compile with old include files. o Couple of minor cleanups while I'm there: make more code compilation conditional on FFS_EXTATTR, which should recover a bit of space on kernels running without EA's, as well as update copyright dates. Obtained from: TrustedBSD Project	2001-03-15 02:54:29 +00:00
sobomax	9695e56e6c	Add missed MODULE_VERSION() call, so loading of unicode conversion routine works properly. Clue beaten in by: des	2001-03-11 15:28:42 +00:00
bp	968f03fddd	Do not kill vnodes after rename. This can cause deadlocks in the deadfs. Noticed by: Matthew N. Dodd <winter@jurai.net>	2001-03-11 11:51:42 +00:00
bp	c259a60fbb	Add a mount time option which slightly relaxes checks for valid Joliet extensions. PR: kern/23315 Reviewed by: adrian	2001-03-11 10:05:08 +00:00
bp	cc5c440cbf	Slightly reorganize allocation of new vnode. Use bit NVOLUME to detect vnodes which represent volumes (before it was done via strcmp()). Turn n_refparent into bit in the n_flag field.	2001-03-10 05:39:03 +00:00
bp	a7f5447c8f	Synch with changes in the NCP requester.	2001-03-10 05:31:22 +00:00
mckusick	61db3f4296	Fixes to track snapshot copy-on-write checking in the specinfo structure rather than assuming that the device vnode would reside in the FFS filesystem (which is obviously a broken assumption with the device filesystem).	2001-03-07 07:09:55 +00:00
jhb	9cd254601b	Grab the process lock while calling psignal and before calling psignal.	2001-03-07 03:37:06 +00:00
jhb	23113ee580	Proc locking identical to that of linprocfs' vnops except that we hold the proc lock while calling psignal.	2001-03-07 03:15:05 +00:00
jhb	47cd1b179f	Protect read to p_pptr with proc lock rather than proctree lock.	2001-03-07 03:10:20 +00:00
jhb	2c951b9c74	Proc locking. Lock around psignal() and also ensure both an exclusive proctree lock and the process lock are held when updating p_pptr and p_oppid. When we are just reading p_pptr we only need the proc lock and not a proctree lock as well.	2001-03-07 03:09:40 +00:00
jhb	6958204c78	Protect p_flag with the proc lock.	2001-03-07 02:07:56 +00:00
bp	342407e6c4	A name of the file can change while its id stays the same. So, we have to update it as well. Remove unused function.	2001-03-06 09:59:18 +00:00
dfr	182c65b1d9	Remove the copyinstr call which was trying to copy the pathname in from user space. It has already been copied in and mp->mnt_stat.f_mntonname has already been initialised by the caller. This fixes a panic on the alpha caused by the fact that the variable 'size' wasn't initialised because the call to copyinstr() bailed out with an EFAULT error.	2001-03-03 15:15:33 +00:00
adrian	4018955334	Reviewed by: jlemon An initial tidyup of the mount() syscall and VFS mount code. This code replaces the earlier work done by jlemon in an attempt to make linux_mount() work. * the guts of the mount work has been moved into vfs_mount(). * move `type', `path' and `flags' from being userland variables into being kernel variables in vfs_mount(). `data' remains a pointer into userspace. * Attempt to verify the `type' and `path' strings passed to vfs_mount() aren't too long. * rework mount() and linux_mount() to take the userland parameters (besides data, as mentioned) and pass kernel variables to vfs_mount(). (linux_mount() already did this, I've just tidied it up a little more.) * remove the copyin() stuff for `path'. `data' still requires copyin() since it's a pointer into userland. * set `mount->mnt_stat.f_mntonname' in vfs_mount() rather than in each filesystem. This variable is generally initialised with `path', and each filesystem can override it if they want to. * NOTE: f_mntonname is initialised with "/" in the case of a root mount.	2001-03-01 21:00:17 +00:00
alfred	642141e5c9	Display the Joliet Extension 'level' in the log message. PR: kern/24998	2001-02-23 03:43:05 +00:00
rwatson	ab5676fc87	o Move per-process jail pointer (p->pr_prison) to inside of the subject credential structure, ucred (cr->cr_prison). o Allow jail inheritance to be a function of credential inheritance. o Abstract prison structure reference counting behind pr_hold() and pr_free(), invoked by the similarly named credential reference management functions, removing this code from per-ABI fork/exit code. o Modify various jail() functions to use struct ucred arguments instead of struct proc arguments. o Introduce jailed() function to determine if a credential is jailed, rather than directly checking pointers all over the place. o Convert PRISON_CHECK() macro to prison_check() function. o Move jail() function prototypes to jail.h. o Emulate the P_JAILED flag in fill_kinfo_proc() and no longer set the flag in the process flags field itself. o Eliminate that "const" qualifier from suser/p_can/etc to reflect mutex use. Notes: o Some further cleanup of the linux/jail code is still required. o It's now possible to consider resolving some of the process vs credential based permission checking confusion in the socket code. o Mutex protection of struct prison is still not present, and is required to protect the reference count plus some fields in the structure. Reviewed by: freebsd-arch Obtained from: TrustedBSD Project	2001-02-21 06:39:57 +00:00
phk	15fc7bce14	Remove a debug printf.	2001-02-18 09:16:49 +00:00
jlemon	11781a7431	Extend kqueue down to the device layer. Backwards compatible approach suggested by: peter	2001-02-15 16:34:11 +00:00
sobomax	20103ed026	Add a hook for loading of a Unicode -> char conversion routine as a kld at run-time. This is a temporary solution until proper kernel Unicode interfaces are in place and as such was purposely designed to be as tiny as possible (3 lines of the code not counting comments). The port with conversion routines for the most popular single-byte languages will be added later today Reviewed by: bp, "Michael C . Wu" <keichii@iteration.net> Approved by: bp	2001-02-13 11:48:31 +00:00
bmilekic	f364d4ac36	Change and clean the mutex lock interface. mtx_enter(lock, type) becomes: mtx_lock(lock) for sleep locks (MTX_DEF-initialized locks) mtx_lock_spin(lock) for spin locks (MTX_SPIN-initialized) similarly, for releasing a lock, we now have: mtx_unlock(lock) for MTX_DEF and mtx_unlock_spin(lock) for MTX_SPIN. We change the caller interface for the two different types of locks because the semantics are entirely different for each case, and this makes it explicitly clear and, at the same time, it rids us of the extra `type' argument. The enter->lock and exit->unlock change has been made with the idea that we're "locking data" and not "entering locked code" in mind. Further, remove all additional "flags" previously passed to the lock acquire/release routines with the exception of two: MTX_QUIET and MTX_NOSWITCH The functionality of these flags is preserved and they can be passed to the lock/unlock routines by calling the corresponding wrappers: mtx_{lock, unlock}_flags(lock, flag(s)) and mtx_{lock, unlock}_spin_flags(lock, flag(s)) for MTX_DEF and MTX_SPIN locks, respectively. Re-inline some lock acq/rel code; in the sleep lock case, we only inline the _obtain_lock()s in order to ensure that the inlined code fits into a cache line. In the spin lock case, we inline recursion and actually only perform a function call if we need to spin. This change has been made with the idea that we generally tend to avoid spin locks and that also the spin locks that we do have and are heavily used (i.e. sched_lock) do recurse, and therefore in an effort to reduce function call overhead for some architectures (such as alpha), we inline recursion for this case. Create a new malloc type for the witness code and retire from using the M_DEV type. The new type is called M_WITNESS and is only declared if WITNESS is enabled. Begin cleaning up some machdep/mutex.h code - specifically updated the "optimized" inlined code in alpha/mutex.h and wrote MTX_LOCK_SPIN and MTX_UNLOCK_SPIN asm macros for the i386/mutex.h as we presently need those. Finally, caught up to the interface changes in all sys code. Contributors: jake, jhb, jasone (in no particular order)	2001-02-09 06:11:45 +00:00
asmodai	2f1d3e2cdf	Fix typo: seperate -> separate. Seperate does not exist in the english language.	2001-02-06 11:21:58 +00:00
phk	709379c1ae	Another round of the <sys/queue.h> FOREACH transmogriffer. Created with: sed(1) Reviewed by: md5(1)	2001-02-04 16:08:18 +00:00
phk	e87f7a15ad	Mechanical change to use <sys/queue.h> macro API instead of fondling implementation details. Created with: sed(1) Reviewed by: md5(1)	2001-02-04 13:13:25 +00:00
phk	f3b4fbe35f	Use <sys/queue.h> macro API.	2001-02-04 12:37:48 +00:00
phk	236808f33a	Remove a DIAGNOSTIC check which belongs in <sys/queue.h> if anyplace at all.	2001-02-04 11:53:51 +00:00
phk	99d7a44ee7	At the point in time where most devices are created, we don't know what time it is because boottime is not yet initialized. Finagle the relevant fields when we get the chance.	2001-02-02 22:54:41 +00:00
phk	766147079e	Only superuser can create symlinks. Give symlinks mode 755 by default to avoid triggering alert eyes. (the mode isn't used on symlinks)	2001-02-02 18:35:29 +00:00
peter	6150a50174	Zap last remaining references to (and a use of) simple_locks.	2001-01-31 04:29:52 +00:00
phk	3ed24cd17e	Add a BUF_KERNPROC() in the BIO_DELETE path. This seems to fix the problem which md(4) backed filesystems exposed.	2001-01-30 10:06:08 +00:00
phk	006cf45cd7	Fix two minor nits. Existences revealed, but no details offered by: bp	2001-01-30 08:39:52 +00:00
dillon	11fb1bf637	This patch reestablishes the spec_fsync() guarantee that synchronous fsyncs, which typically occur during unmounting, will drain all dirty buffers even if it takes multiple passes to do so. The guarantee was mangled by the last patch which solved a problem due to -current disabling interrupts while holding giant (which caused an infinite spin loop waiting for I/O to complete). -stable does not have either patch, but has a similar bug in the original spec_fsync() code which is triggered by a bug in the softupdates umount code, a fix for which will be committed to -current as soon as Kirk stamps it. Then both solutions will be MFC'd to -stable. -stable currently suffers from a combination of the softupdates bug and a small window of opportunity in the original spec_fsync() code, and -stable also suffers from the spin-loop bug but since interrupts are enabled the spin resolves itself in a few milliseconds.	2001-01-29 08:19:28 +00:00
jhb	b6baa60b1e	Back out proc locking to protect p_ucred for obtaining additional references along with the actual obtaining of additional references.	2001-01-27 00:01:31 +00:00
jasone	8d2ec1ebc4	Convert all simplelocks to mutexes and remove the simplelock implementations.	2001-01-24 12:35:55 +00:00
jhb	963052ead7	- Catch up to proc flag changes.	2001-01-24 11:20:05 +00:00
jhb	810630fa41	The lock being destroyed was misnamed, not unused. Add the lockdestroy() back in but with the proper name so that this compiles. Submitted by: jasone	2001-01-24 02:18:54 +00:00
jhb	f540aca984	Proc locking to protect p_ucred while we obtain additional references.	2001-01-24 00:26:19 +00:00
jhb	24fda4f13e	- Remove unused header include. - Use queue macros.	2001-01-23 22:38:38 +00:00
jhb	e7cd4ee729	Proc locking to protect p_ucred while we obtain an additional reference.	2001-01-23 22:38:15 +00:00
jhb	c55210afc5	- FreeBSD doesn't have an abortop vnop as far as I can tell, so #ifdef references to the hpf op out. - Remove a lockdestroy() on a non-existent variable.	2001-01-23 22:37:30 +00:00
peter	35aab82743	Fix breakage uncovered by LINT - don't refer to undefined variables in KASSERT()	2001-01-17 01:10:23 +00:00
wollman	73868ac960	Delete unused #include <sys/select.h>.	2001-01-09 04:32:24 +00:00
wollman	8a0e4fd3b6	Don't compile a dead variable declaration.	2001-01-09 04:24:43 +00:00
phk	5de479435a	Use macro API to <sys/queue.h>	2000-12-31 10:24:19 +00:00
dillon	41fd6873a8	Fix a lockup problem that occurs with 'cvs update'. specfs's fsync can get into the same sort of infinite loop that ffs's fsync used to get into, probably due to background bitmap writes. The solution is the same.	2000-12-30 23:32:24 +00:00
dillon	fd223545d4	This implements a better launder limiting solution. There was a solution in 4.2-REL which I ripped out in -stable and -current when implementing the low-memory handling solution. However, maxlaunder turns out to be the saving grace in certain very heavily loaded systems (e.g. newsreader box). The new algorithm limits the number of pages laundered in the first pageout daemon pass. If that is not sufficient then successive passes will be run without any limit. Write I/O is now pipelined using two sysctls, vfs.lorunningspace and vfs.hirunningspace. This prevents excessive buffered writes in the disk queues which cause long (multi-second) delays for reads. It leads to more stable (less jerky) and generally faster I/O streaming to disk by allowing required read ops (e.g. for indirect blocks and such) to occur without interrupting the write stream, among other things. NOTE: eventually, filesystem write I/O pipelining needs to be done on a per-device basis. At the moment it is globalized.	2000-12-26 19:41:38 +00:00
jake	fa7a58ab48	Protect proc.p_pptr and proc.p_children/p_sibling with the proctree_lock. linprocfs not locked pending response from informal maintainer. Reviewed by: jhb, -smp@	2000-12-23 19:43:10 +00:00
jhb	e086882f91	When p_ucred is passed to the venus daemon, first grab the proc lock to protect the p_ucred pointer, obtain a separate reference to the ucred, release the lock, and then pass in the new ucred reference.	2000-12-15 00:12:30 +00:00
rwatson	22e2a46873	o Tighten restrictions on use of /proc/pid/ctl and move access checks in ctl to using centralized p_can() inter-process access control interface. Reviewed by: sef	2000-12-13 04:28:24 +00:00
jake	a4ad237eaa	- Change the allproc_lock to use a macro, ALLPROC_LOCK(how), instead of explicit calls to lockmgr. Also provides macros for the flags passed to specify shared, exclusive or release which map to the lockmgr flags. This is so that the use of lockmgr can be easily replaced with optimized reader-writer locks. - Add some locking that I missed the first time.	2000-12-13 00:17:05 +00:00
des	7f632ed13a	Add a module version (so that linprocfs can properly depend on procfs)	2000-12-09 13:17:51 +00:00
dwmalone	dd75d1d73b	Convert more malloc+bzero to malloc+M_ZERO. Submitted by: josh@zipperup.org Submitted by: Robert Drehmel <robd@gmx.net>	2000-12-08 21:51:06 +00:00
phk	e0196ec99c	staticize.	2000-12-08 15:07:24 +00:00
jhb	f31d014094	Protect accesses to member of struct proc with the proc lock.	2000-12-06 01:45:20 +00:00
jhb	ad7f89f777	Protect p_stat with the sched_lock. Reviewed by: jake	2000-12-02 01:58:15 +00:00
jlemon	f3b673b4a9	Update to reflect the disappearance of getsock(). Found by: LINT	2000-11-25 07:16:06 +00:00
bp	d1e0950f7e	Use vop_defaultop() instead of ntfs_bypass(). PR: kern/22756	2000-11-18 02:47:12 +00:00
mckusick	0263b689c1	Missed conversion of CIRCLEQ => TAILQ for mount list.	2000-11-14 06:38:18 +00:00
eivind	17ab837520	More paranoia against overflows	2000-11-08 21:53:05 +00:00
bp	099f33073e	v_interlock is a mutex now, not simple lock.	2000-11-04 02:42:11 +00:00
phk	4e063f5534	Take VBLK devices further out of their misery. This should fix the panic I introduced in my previous commit on this topic.	2000-11-02 21:14:13 +00:00
eivind	3b7fec2c02	Fix overflow from jail hostname. Bug found by: Esa Etelavuori <eetelavu@cc.hut.fi>	2000-11-01 19:38:08 +00:00
eivind	1afa7eea27	Give vop_mmap an untimely death. The opportunity to give it a timely death timed out in 1996.	2000-11-01 17:57:24 +00:00
dwmalone	e401b83c31	Make malloc use M_ZERO in some more locations. Don't check for a null pointer if malloc called with M_WAITOK. Submitted by: josh@zipperup.org Submitted by: Robert Drehmel <robd@gmx.net> Approved by: bp	2000-10-29 16:14:28 +00:00
phk	ff5cdfae2d	Move suser() and suser_xxx() prototypes and a related #define from <sys/proc.h> to <sys/systm.h>. Correctly document the #includes needed in the manpage. Add one now needed #include of <sys/systm.h>. Remove the consequent 48 unused #includes of <sys/proc.h>.	2000-10-29 16:06:56 +00:00
phk	f82e4ca62c	Weaken a bogus dependency on <sys/proc.h> in <sys/buf.h> by #ifdef'ing the offending inline function (BUF_KERNPROC) on it being #included already. I'm not sure BUF_KERNPROC() is even the right thing to do or in the right place or implemented the right way (inline vs normal function). Remove consequently unneeded #includes of <sys/proc.h>	2000-10-29 14:54:55 +00:00
phk	94a5006c9a	Remove unneeded #include <sys/proc.h> lines.	2000-10-29 13:57:19 +00:00
bp	bf8c7dab48	Rev 1.41 was committed from wrong diff, now do it right.	2000-10-22 16:15:12 +00:00
bp	f20992328c	Release and unlock vnode if resource deadlock detected.	2000-10-22 15:40:22 +00:00
bp	a74bc23d1f	Update stale comment. PR: kern/21805	2000-10-22 14:24:30 +00:00
bp	038c55d50e	Remove de_lock field from denode structure and make msdosfs PDIRUNLOCK aware.	2000-10-22 14:22:17 +00:00
bp	b9d830d3e7	Fix nullfs breakage caused by incomplete migration of v_interlock from simple_lock to mutex. Reset LK_INTERLOCK flag when interlock released manually.	2000-10-15 06:25:42 +00:00
chris	15107f5de5	o Move from Alfred Perlstein's "exclusion" technique of handling special file types to requiring all file types to properly implement fo_stat. This makes any new file type additions much easier as this code no longer has to be modified to accommodate it. o Instead of using curproc in fdesc_allocvp, pass a `struct proc' pointer as a new fifth parameter.	2000-10-09 20:06:13 +00:00
eivind	4a39f454a0	Blow away the v_specmountpoint define, replacing it with what it was defined as (rdev->si_mountpoint)	2000-10-09 17:31:39 +00:00
phk	25e67656df	Don't hold an extra reference to vnodes. Devfs vnodes are sufficiently cheap to setup that it doesn't really matter that we recycle device vnodes at kleenex speed. Implement first cut try at killing cloned devices when they are not needed anymore. For now only the bpf driver is involved in this experiment. Cloned devices can set the SI_CHEAPCLONE flag which allows us to destroy_dev() it when the vcount() drops to zero and the vnode is reclaimed. For now it's a requirement that the driver doesn't keep persistent state from close to (re)open. Some whitespace changes.	2000-10-09 14:18:07 +00:00
alfred	1e98080e99	return correct type for process directory entries, DT_DIR not DT_REG	2000-10-05 23:19:51 +00:00
bde	c58e848e28	Forward-declare struct mbuf so that this file is less self-insufficient -- don't depend on garbage in <sys/mount.h>. mbufs aren't actually used here either. They should have been completely removed from filesystem interfaces when they were removed from the interfaces to convert between file handles and vnodes.	2000-10-05 11:58:22 +00:00
jasone	4e290e67b7	Convert lockmgr locks from using simple locks to using mutexes. Add lockdestroy() and appropriate invocations, which corresponds to lockinit() and must be called to clean up after a lockmgr lock is no longer needed.	2000-10-04 01:29:17 +00:00
bp	e9f8d8bbf5	Make cd9660 filesystem PDIRUNLOCK aware. Now it can be used in vnode stacks and nullfs mounts. Remove now unnecessary i_lock field from the iso_node structure.	2000-10-03 04:39:50 +00:00
bp	87071b03a6	Prevent dereference of NULL pointer when null_lock() and null_unlock() called and there is no underlying vnode.	2000-10-03 04:25:53 +00:00
bp	af5c59dc4f	Protect hash data with lock manager instead of home grown one. Replace shared lock on vnode with exclusive one. It shouldn't impact performance as NCP protocol doesn't support outstanding requests. Do not hold simple lock on vnode for long period of time. Add functionality to the nwfs_print() routine.	2000-10-02 09:49:04 +00:00
bp	72e68d3b76	Get rid of the legacy __P() macro. Remove 'register' keywords.	2000-10-02 09:29:59 +00:00
peter	991f1fafc7	PDIRUNLOCK now exists on FreeBSD. Remove the (now incorrect) redefinition.	2000-10-02 04:47:19 +00:00
bp	c2ae01d2e9	Fix vnode locking bugs in the nullfs. Add correct support for v_object management, so mmap() operation should work properly. Add support for extattrctl() routine (submitted by semenu). At this point nullfs can be considered as functional and much more stable. In fact, it should behave as a "hard" "symlink" to underlying filesystem. Reviewed in general by: mckusick, dillon Parts of logic obtained from: NetBSD	2000-09-25 15:38:32 +00:00
phk	56aecf1ece	Ignore attempts to set flags to zero. This quenches a syslog warning from login(1).	2000-09-18 09:40:01 +00:00
phk	d927c81a82	Add canonical checks to devfs_setattr().	2000-09-16 12:06:58 +00:00
jhb	f94cd225a3	Use size_t instead of u_int for 4th argument to copyinstr().	2000-09-12 22:39:34 +00:00
jasone	769e0f974d	Major update to the way synchronization is done in the kernel. Highlights include: * Mutual exclusion is used instead of spl(). See mutex(9). (Note: The alpha port is still in transition and currently uses both.) Per-CPU idle processes. * Interrupts are run in their own separate kernel threads and can be preempted (i386 only). Partially contributed by: BSDi (BSD/OS) Submissions by (at least): cp, dfr, dillon, grog, jake, jhb, sheldonh	2000-09-07 01:33:02 +00:00
phk	c9cb5c289d	Add refcounts to the "global" DEVFS inode slots, this allows us to recycle inodes after a destroy_dev() but not until all mounts have picked up the change. Add support for an overflow table for DEVFS inodes. The static table defaults to 1024 inodes, if that fills, an overflow table of 32k inodes is allocated. Both numbers can be changed at compile time, the size of the overflow table also with the sysctl vfs.devfs.noverflow. Use atomic instructions to barrier between make_dev()/destroy_dev() and the mounts. Add lockmgr() locking of directories for operations accessing or modifying the directory TAILQs. Various nitpicking here and there.	2000-09-06 11:26:43 +00:00
bp	64ac0aa678	Various cleanups towards making nullfs functional (it is still broken at this point): Replace all '#ifdef DEBUG' with '#ifdef NULLFS_DEBUG' and add NULLFSDEBUG macro. Protect nullfs hash table with lockmgr. Use proper order of operations when freeing mnt_data. Return correct fsid in the null_getattr(). Add null_open() function to catch MNT_NODEV (obtained from NetBSD). Add null_rename() to catch cross-fs rename operations (submitted by Ustimenko Semen <semen@iclub.nsu.ru>) Remove duplicate $FreeBSD$ tags.	2000-09-05 09:02:07 +00:00
bp	7106b8bf8a	Get rid of the __P() macros. Encouraged by: peter	2000-09-05 07:54:39 +00:00
phk	06c7160c02	Off by one error. Submitted by: des	2000-09-04 18:24:30 +00:00
des	571c2eccf9	Remove a comment that has been not only obsolete but patently wrong for the last 31 revisions (almost three years).	2000-09-04 18:18:17 +00:00
phk	e47f61e183	Avoid the modules madness I inadvertently introduced by making the cloning infrastructure standard in kern_conf. Modules are now the same with or without devfs support. If you need to detect if devfs is present, in modules or elsewhere, check the integer variable "devfs_present". This happily removes an ugly hack from kern/vfs_conf.c. This forces a rename of the eventhandler and the standard clone helper function. Include <sys/eventhandler.h> in <sys/conf.h>: it's a helper #include like <sys/queue.h> Remove all #includes of opt_devfs.h they no longer matter.	2000-09-02 19:17:34 +00:00
rwatson	e95936f6dd	o Simplify if/then clause equating ESRCH with ENOENT when hiding a process Submitted by: des	2000-09-01 18:41:32 +00:00
rwatson	544bd25255	o Make procfs use vaccess() for procfs_access() DAC and super-user checks, rather than implementing its own {uid,gid,other} checks against vnode mode. Similar change to linprocfs currently under review. Obtained from: TrustedBSD Project	2000-09-01 13:41:41 +00:00
rwatson	3dc6d2b9ea	o Centralize inter-process access control, introducing: int p_can(p1, p2, operation, privused) which allows specification of subject process, object process, inter-process operation, and an optional call-by-reference privused flag, allowing the caller to determine if privilege was required for the call to succeed. This allows jail, kern.ps_showallprocs and regular credential-based interaction checks to occur in one block of code. Possible operations are P_CAN_SEE, P_CAN_SCHED, P_CAN_KILL, and P_CAN_DEBUG. p_can currently breaks out as a wrapper to a series of static function checks in kern_prot, which should not be invoked directly. o Commented out capabilities entries are included for some checks. o Update most inter-process authorization to make use of p_can() instead of manual checks, PRISON_CHECK(), P_TRESPASS(), and kern.ps_showallprocs. o Modify suser{,_xxx} to use const arguments, as it no longer modifies process flags due to the disabling of ASU. o Modify some checks/errors in procfs so that ENOENT is returned instead of ESRCH, further improving concealment of processes that should not be visible to other processes. Also introduce new access checks to improve hiding of processes for procfs_lookup(), procfs_getattr(), procfs_readdir(). Correct a bug reported by bp concerning not handling the CREATE case in procfs_lookup(). Remove volatile flag in procfs that caused apparently spurious qualifier warnings (approved by bde). o Add comment noting that ktrace() has not been updated, as its access control checks are different from ptrace(), whereas they should probably be the same. Further discussion should happen on this topic. Reviewed by: bde, green, phk, freebsd-security, others Approved by: bde Obtained from: TrustedBSD Project	2000-08-30 04:49:09 +00:00
rwatson	e54ea574fa	o Restructure vaccess() so as to check for DAC permission to modify the object before falling back on privilege. Make vaccess() accept an additional optional argument, privused, to determine whether privilege was required for vaccess() to return 0. Add commented out capability checks for reference. Rename some variables to make it more clear which modes/uids/etc are associated with the object, and which with the access mode. o Update file system use of vaccess() to pass NULL as the optional privused argument. Once additional patches are applied, suser() will no longer set ASU, so privused will permit passing of privilege information up the stack to the caller. Reviewed by: bde, green, phk, -security, others Obtained from: TrustedBSD Project	2000-08-29 14:45:49 +00:00
phk	1109c83215	Reorder vop's alphabetically. Smarter use of devfs_allocv() (from bp@) Introduce devfs_find() ".." fixes to devfs_lookup (from bp@)	2000-08-27 14:46:36 +00:00
phk	c1421f6ef5	Minor cleanups to devfs_readdir(); Add devfs_read() for directories. (inspired by bp@)	2000-08-26 16:20:57 +00:00
bde	bfd8253e34	Quick fix for msdosfs_write() on alphas and other machines with either longs larger than 32 bits or strict alignment requirements. pm_fatmask had type u_long, but it must have a type that has precisely 32 bits and this type must be no smaller than int, so that ~pmp->pm_fatmask has no bits above the 31st set. Otherwise, comparisons between (cn | ~pmp->pm_fatmask) and magic 32-bit "cluster" numbers always fail. The correct fix is to use the C99 type uint_least32_t and mask with 0xffffffff. The quick fix is to use u_int32_t and assume that ints have msdosfs metadata is riddled with unaligned fields, and on alphas, unaligned_fixup() apparently has problems fixing up the unaligned accesses caused by this. The quick fix is to not comment out the NetBSD code that sort of handles this, and define UNALIGNED_ACCESS on i386's so that the code doesn't change on i386's. The correct fix would define UNALIGNED_ACCESS in a central machine-dependent header and maybe add some extra cases to unaligned_fixup(). UNALIGNED_ACCESS is also tested in isofs. Submitted by: parts by Mark Abene <phiber@radicalmedia.com> PR: 19086	2000-08-25 09:03:58 +00:00
phk	ec761116e2	Fix panic when removing open device (found by bp@) Implement subdirs. Build the full "devicename" for cloning functions. Fix panic when deleted device goes away. Collapse devfs_dir and devfs_dirent structures. Add proper cloning to the /dev/fd* "device-"driver. Fix a bug in make_dev_alias() handling which made aliases appear multiple times. Use devfs_clone to implement getdiskbyname() Make specfs maintain the stat(2) timestamps per dev_t	2000-08-24 15:36:55 +00:00
phk	323f259948	Fix devfs_access() bug on directories. Remove unused #includes. Bug spotted by: markm	2000-08-21 14:45:19 +00:00
phk	b648921acc	Remove all traces of Julian's DEVFS (incl from kern/subr_diskslice.c) Remove old DEVFS support fields from dev_t. Make uid, gid & mode members of dev_t and set them in make_dev(). Use correct uid, gid & mode in make_dev in disk minilayer. Add support for registering alias names for a dev_t using the new function make_dev_alias(). These will show up as symlinks in DEVFS. Use makedev() rather than make_dev() for MFS's magic devices to prevent DEVFS from noticing this abuse. Add a field for DEVFS inode number in dev_t. Add new DEVFS in fs/devfs. Add devfs cloning to: disk minilayer (ie: ad(4), sd(4), cd(4) etc etc) md(4), tun(4), bpf(4), fd(4) If DEVFS add -d flag to /sbin/init's args to make it mount devfs. Add commented out DEVFS to GENERIC	2000-08-20 21:34:39 +00:00
phk	3d2aecdc81	Centralize the canonical vop_access user/group/other check in vaccess(). Discussed with: bde	2000-08-20 08:36:26 +00:00
phk	6dde24da5e	Introduce vop_stdinactive() and make it the default if no vop_inactive is declared. Sort and prune a few vop_op[].	2000-08-18 10:01:02 +00:00
sheldonh	eba01e2cbc	Rename the loadable nullfs kernel module: null -> nullfs	2000-07-28 11:54:09 +00:00
mckusick	acc66855bf	This patch corrects the first round of panics and hangs reported with the new snapshot code. Update addaliasu to correctly implement the semantics of the old checkalias function. When a device vnode first comes into existence, check to see if an anonymous vnode for the same device was created at boot time by bdevvp(). If so, adopt the bdevvp vnode rather than creating a new vnode for the device. This corrects a problem which caused the kernel to panic when taking a snapshot of the root filesystem. Change the calling convention of vn_write_suspend_wait() to be the same as vn_start_write(). Split out softdep_flushworklist() from softdep_flushfiles() so that it can be used to clear the work queue when suspending filesystem operations. Access to buffers becomes recursive so that snapshots can recursively traverse their indirect blocks using ffs_copyonwrite() when checking for the need for copy on write when flushing one of their own indirect blocks. This eliminates a deadlock between the syncer daemon and a process taking a snapshot. Ensure that softdep_process_worklist() can never block because of a snapshot being taken. This eliminates a problem with buffer starvation. Cleanup change in ffs_sync() which did not synchronously wait when MNT_WAIT was specified. The result was an unclean filesystem panic when doing forcible unmount with heavy filesystem I/O in progress. Return a zero'ed block when reading a block that was not in use at the time that a snapshot was taken. Normally, these blocks should never be read. However, the readahead code will occasionally read them which can cause unexpected behavior. Clean up the debugging code that ensures that no blocks be written on a filesystem while it is suspended. Snapshots must explicitly label the blocks that they are writing during the suspension so that they do not cause a `write on suspended filesystem' panic. Reorganize ffs_copyonwrite() to eliminate a deadlock and also to prevent a race condition that would permit the same block to be copied twice. This change eliminates an unexpected soft updates inconsistency in fsck caused by the double allocation. Use bqrelse rather than brelse for buffers that will be needed soon again by the snapshot code. This improves snapshot performance.	2000-07-24 05:28:33 +00:00
dwmalone	729fe7fb1f	Certain error conditions cause msdosfs_rename() to decrement the vnode reference count on 'fdvp' more times than it should. PR: 17347 Submitted by: Ian Dowse <iedowse@maths.tcd.ie> Approved by: bde	2000-07-14 11:52:56 +00:00
mckusick	a3d0c189ea	Add snapshots to the fast filesystem. Most of the changes support the gating of system calls that cause modifications to the underlying filesystem. The gating can be enabled by any filesystem that needs to consistently suspend operations by adding the vop_stdgetwritemount to their set of vnops. Once gating is enabled, the function vfs_write_suspend stops all new write operations to a filesystem, allows any filesystem modifying system calls already in progress to complete, then sync's the filesystem to disk and returns. The function vfs_write_resume allows the suspended write operations to begin again. Gating is not added by default for all filesystems as for SMP systems it adds two extra locks to such critical kernel paths as the write system call. Thus, gating should only be added as needed. Details on the use and current status of snapshots in FFS can be found in /sys/ufs/ffs/README.snapshot so for brevity and timeliness are not included here. Unless and until you create a snapshot file, these changes should have no effect on your system (famous last words).	2000-07-11 22:07:57 +00:00
phk	e5de271d47	Previous commit changing SYSCTL_HANDLER_ARGS violated KNF. Pointed out by: bde	2000-07-04 11:25:35 +00:00
phk	f101401a90	Pull the rug under block mode devices. They return ENXIO on open(2) now.	2000-07-03 13:48:37 +00:00
phk	61ff05be25	Style police catches up with rev 1.26 of src/sys/sys/sysctl.h: Sanitize SYSCTL_HANDLER_ARGS so that simplistic tools can grog our sources: -sysctl_vm_zone SYSCTL_HANDLER_ARGS +sysctl_vm_zone (SYSCTL_HANDLER_ARGS)	2000-07-03 09:35:31 +00:00
bp	579668ebe9	Fix memory leakage on module unload. Spotted by: fixed INVARIANTS code	2000-06-29 01:19:12 +00:00
bp	6c6297b200	Fix memory leakage on module unload. Spotted by: fixed INVARIANTS code	2000-06-29 01:12:47 +00:00
chris	6e95d4a6c3	fdesc_getattr: Don't fake any file types, just set vap->va_type to IFTOVT(stb.st_mode). If something does not report its mode, vap->va_type is set to VNON accordingly.	2000-06-28 19:18:25 +00:00
alfred	6a77970fb2	by changing the logic here we can support dynamic additions of new filetypes. Reviewed by: green	2000-06-27 22:46:35 +00:00
alfred	6887475162	if there are leading zeros fail the lookup Pointed out by: Alexander Viro <viro@math.psu.edu>	2000-06-27 21:37:17 +00:00
bp	860493e205	Remove obsolete comment. Submitted by: Marius Bendiksen <mbendiks@eunet.no>	2000-06-25 02:29:45 +00:00
chris	0790e5cf47	Rename the `VRXEC' macro used to clear read and exec bits to `FDRX' so as not to impede upon VFS namespace.	2000-06-20 20:34:11 +00:00
phk	4ec91666fa	Virtualizes & untangles the bioops operations vector. Ref: Message-ID: <18317.961014572@critter.freebsd.dk> To: current@	2000-06-16 08:48:51 +00:00
chris	b598f843e4	Remove unused include <sys/socketvar.h>.	2000-06-15 20:13:51 +00:00
chris	ea41821d31	Replace vattr_null() with VATTR_NULL() and do not explicitly set vattr fields to VNOVAL afterwards.	2000-06-15 17:19:22 +00:00
jmb	777866439c	before this commit, specfs reported disk partitions using decimal major and minor numbers. "ls -l" reports disk partitions using decimal major numbers and hex minor numbers. make specfs use decimal major numbers and hex minor numbers, just like "ls -l"	2000-06-12 10:20:18 +00:00
chris	5895c7a8d4	Instead of completely disallowing VOP_SETATTR, just do it where there is an underlying vnode. Suggested by: bde	2000-06-06 00:35:39 +00:00
chris	ccec07bebe	Update the comment for fdesc_setattr to reflect that we no longer actually setattr() on underlying vnodes.	2000-06-02 07:08:18 +00:00
chris	571f018249	- Do not allow VOP_SETATTR to modify underlying vnodes at all. This caused problems when fetch(1) was passed `-o -'. The rationale of this change is that applications attempting to change underlying vnodes for /dev/fd nodes are improperly written and the use of this interface should not ever have been encouraged. Proper alternatives are fchmod, fchown and others. PR: 18952 - Remove stale, unused fdescnode->fd_link structure member.	2000-06-02 07:02:45 +00:00
jake	961b97d434	Back out the previous change to the queue(3) interface. It was not discussed and should probably not happen. Requested by: msmith and others	2000-05-26 02:09:24 +00:00
jake	d93fbc9916	Change the way that the queue(3) structures are declared; don't assume that the type argument to _HEAD and _ENTRY is a struct. Suggested by: phk Reviewed by: phk Approved by: mdodd	2000-05-23 20:41:01 +00:00
chris	9af0c6c060	Adapt fdesc to be mounted on /dev/fd and remove fd, stdin, stdout and stderr nodes. More specific items of this patch: o Removed support for symbolic links, and the need for fdesc_readlink(). o Put all the code from fdesc_attr() into fdesc_getattr() and removed fdesc_attr(). This also made it easier to properly give all nodes unique inode numbers. o The removal of all non-fd nodes allowed the removal of the fdesc_read(), fdesc_write(), and fdesc_ioctl() nodes, since we no longer have nodes that get special handling. o Correct the component name validity-checking in fdesc_lookup(). It previously detected the end of the string by checking for a terminating NUL, now it uses cnp->cn_namelen. o Handle kqueue files as FIFOs. This is probably the closest file type to represent this type of file there is, and it is unfortunately not very representative of a kqueue. Creation time is not supported by kqueue, so ctime, mtime and atime are all set to the current time when getattr() was called. o Also set st_[mca]time to the current time since there's no data in socket structures that can be used to fill this in (FIFOs). o Simplify fdesc_readdir() since it only has to report the numbered fd nodes. Add `.' and `..' directory links as well. o Remove read bits from directories as they tend to confuse programs like tar(1). Reviewed by: phk Discussed with: bde (earlier on, not quite review)	2000-05-11 22:10:51 +00:00
phk	bddf428952	Change the "bdev-whiner" to whine when open is attempted and extend the deadline a month.	2000-05-09 18:53:57 +00:00
phk	36c3965ff9	Separate the struct bio related stuff out of <sys/buf.h> into <sys/bio.h>. <sys/bio.h> is now a prerequisite for <sys/buf.h> but it shall not be made a nested include according to bde's teachings on the subject of nested includes. Diskdrivers and similar stuff below specfs::strategy() should no longer need to include <sys/buf.h> unless they need caching of data. Still a few bogus uses of struct buf to track down. Repocopy by: peter	2000-05-05 09:59:14 +00:00
phk	62efea1e92	Remove 42 unneeded #include <sys/ioccom.h>. ioccom.h defines only implementation detail, and should therefore only be included from the #include which defines the ioctl tags, in other words: never include it from *.c	2000-05-03 07:31:38 +00:00
peter	22f6069a2a	Add $FreeBSD$	2000-05-01 20:32:07 +00:00
phk	10914aa708	Remove unneeded #include <vm/vm_zone.h> Generated by: src/tools/tools/kerninclude	2000-04-30 18:52:11 +00:00
phk	ce2aa22c93	Remove unneeded #include <sys/kernel.h>	2000-04-29 15:36:14 +00:00
peter	ff69b85a83	nwfs depends on ncp	2000-04-29 13:34:28 +00:00
green	6bad412525	Move procfs_fullpath() to vfs_cache.c, with a rename to textvp_fullpath(). There's no excuse to have code in synthetic filestores that allows direct references to the textvp anymore. Feature requested by: msmith Feature agreed to by: warner Move requested by: phk Move agreed to by: bde	2000-04-26 11:57:45 +00:00
green	aa6d0cfe54	Quiet an unused variable warning by commenting out a variable declaration that goes with a commented out statement.	2000-04-22 17:58:40 +00:00
green	365f24a27a	There's no reason to make "file" 0500 rather than 0555.	2000-04-22 04:01:54 +00:00
green	d6606f6ffa	Welcome back our old friend from procfs, "file"!	2000-04-22 03:44:41 +00:00
phk	6be1308ad1	Remove ~25 unneeded #include <sys/conf.h> Remove ~60 unneeded #include <sys/malloc.h>	2000-04-19 14:58:28 +00:00
phk	75e82c815e	Remove unneeded <sys/buf.h> includes. Due to some interesting cpp tricks in lockmgr, the LINT kernel shrinks by 924 bytes.	2000-04-18 15:15:39 +00:00
jlemon	c41c876463	Introduce kqueue() and kevent(), a kernel event notification facility.	2000-04-16 18:53:38 +00:00
phk	aaaef0b54e	Complete the bio/buf divorce for all code below devfs::strategy Exceptions: Vinum untouched. This means that it cannot be compiled. Greg Lehey is on the case. CCD not converted yet, casts to struct buf (still safe) atapi-cd casts to struct buf to examine B_PHYS	2000-04-15 05:54:02 +00:00
rwatson	a0dd5ab0fd	Introduce extended attribute support for FFS, allowing arbitrary (name, value) pairs to be associated with inodes. This support is used for ACLs, MAC labels, and Capabilities in the TrustedBSD security extensions, which are currently under development. In this implementation, attributes are backed to data vnodes in the style of the quota support in FFS. Support for FFS extended attributes may be enabled using the FFS_EXTATTR kernel option (disabled by default). Userland utilities and man pages will be committed in the next batch. VFS interfaces and man pages have been in the repo since 4.0-RELEASE and are unchanged. o ufs/ufs/extattr.h: UFS-specific extattr defines o ufs/ufs/ufs_extattr.c: bulk of support routines o ufs/{ufs,ffs,mfs}/*.[ch]: hooks and extattr.h includes o contrib/softupdates/ffs_softdep.c: extattr.h includes o conf/options, conf/files, i386/conf/LINT: added FFS_EXTATTR o coda/coda_vfsops.c: XXX required extattr.h due to ufsmount.h (This should not be the case, and will be fixed in a future commit) Currently attributes are not supported in MFS. This will be fixed. Reviewed by: adrian, bp, freebsd-fs, other unthanked souls Obtained from: TrustedBSD Project	2000-04-15 03:34:27 +00:00
bp	8ba252e4e1	Try to obtain timezone offset from an environment of mount program. This helps in cases where CMOS clock set to UTC time.	2000-04-05 10:44:04 +00:00
phk	8ee11d587f	Move B_ERROR flag to b_ioflags and call it BIO_ERROR. (Much of this done by script) Move B_ORDERED flag to b_ioflags and call it BIO_ORDERED. Move b_pblkno and b_iodone_chain to struct bio while we transition, they will be obsoleted once bio structs chain/stack. Add bio_queue field for struct bio aware disksort. Address a lot of stylistic issues brought up by bde.	2000-04-02 15:24:56 +00:00
dillon	8fb4c6b599	Commit the buffer cache cleanup patch to 4.x and 5.x. This patch fixes a fragmentation problem due to geteblk() reserving too much space for the buffer and imposes a larger granularity (16K) on KVA reservations for the buffer cache to avoid fragmentation issues. The buffer cache size calculations have been redone to simplify them (fewer defines, better comments, less chance of running out of KVA). The geteblk() fix solves a performance problem that DG was able to reproduce. This patch does not completely fix the KVA fragmentation problems, but it goes a long way. Mostly Reviewed by: bde and others Approved by: jkh	2000-03-27 21:29:33 +00:00
phk	5df766a0f8	Rename the existing BUF_STRATEGY() to DEV_STRATEGY() substitute BUF_WRITE(foo) for VOP_BWRITE(foo->b_vp, foo) substitute BUF_STRATEGY(foo) for VOP_STRATEGY(foo->b_vp, foo) This patch is machine generated except for the ccd.c and buf.h parts.	2000-03-20 11:29:10 +00:00
phk	a246e10f55	Remove B_READ, B_WRITE and B_FREEBUF and replace them with a new field in struct buf: b_iocmd. The b_iocmd is enforced to have exactly one bit set. B_WRITE was bogusly defined as zero giving rise to obvious coding mistakes. Also eliminate the redundant struct buf flag B_CALL, it can just as efficiently be done by comparing b_iodone to NULL. Should you get a panic or drop into the debugger, complaining about "b_iocmd", don't continue. It is likely to write on your disk where it should have been reading. This change is a step in the direction towards a stackable BIO capability. A lot of this patch was machine generated (Thanks to style(9) compliance!) Vinum users: Greg has not had time to test this yet, be careful.	2000-03-20 10:44:49 +00:00
phk	6b3385b773	Eliminate the undocumented, experimental, non-delivering and highly dangerous MAX_PERF option.	2000-03-16 08:51:55 +00:00

1093 Commits