freebsd-dev

Author	SHA1	Message	Date
Ruslan Ermilov	c7b23e0fb4	Actually rename FDESC, PORTAL, UMAP and UNION file systems. OK'ed by: bp	2001-05-24 15:20:11 +00:00
Ruslan Ermilov	c99d12581a	mount_umap(8) -> mount_umapfs(8).	2001-05-24 13:20:41 +00:00
Ruslan Ermilov	57a523ae6b	mount_null(8) -> mount_nullfs(8).	2001-05-24 13:17:47 +00:00
John Baldwin	c7f52620e0	Don't acquire/release Giant around some of the places that need it in spec_getpages(). Instead, assert that Giant is held by the caller.	2001-05-23 22:20:29 +00:00
Poul-Henning Kamp	5a9300c451	Change the way deletes are managed in DEVFS. This fixes a number of warnings relating to removed cloned devices. It also makes it possible to recreate deleted devices with mknod(2). The major/minor arguments are ignored.	2001-05-23 17:48:20 +00:00
Ruslan Ermilov	99d300a1ec	- FDESC, FIFO, NULL, PORTAL, PROC, UMAP and UNION file systems were repo-copied from sys/miscfs to sys/fs. - Renamed the following file systems and their modules: fdesc -> fdescfs, portal -> portalfs, union -> unionfs. - Renamed corresponding kernel options: FDESC -> FDESCFS, PORTAL -> PORTALFS, UNION -> UNIONFS. - Install header files for the above file systems. - Removed bogus -I${.CURDIR}/../../sys CFLAGS from userland Makefiles.	2001-05-23 09:42:29 +00:00
John Baldwin	2178ff8b9f	Sort includes from previous commit.	2001-05-21 23:19:50 +00:00
Alfred Perlstein	2395531439	Introduce a global lock for the vm subsystem (vm_mtx). vm_mtx does not recurse and is required for most low level vm operations. faults can not be taken without holding Giant. Memory subsystems can now call the base page allocators safely. Almost all atomic ops were removed as they are covered under the vm mutex. Alpha and ia64 now need to catch up to i386's trap handlers. FFS and NFS have been tested, other filesystems will need minor changes (grabbing the vm lock when twiddling page properties). Reviewed (partially) by: jake, jhb	2001-05-19 01:28:09 +00:00
Boris Popov	10fa1684ed	Currently there is no way to tell if a write operation invoked via vn_start_write() on the given vnode will be successful. VOP_LEASE() may help to solve this problem, but its return value is ignored nearly everywhere. For now just assume that the missing upper layer on write means insufficient access rights (which is correct for most cases).	2001-05-18 07:43:13 +00:00
Boris Popov	f3d1ec67b2	VOP getwritemount() can be invoked on vnodes with VFREE flag set (used in snapshots code). At this point upper vp may not exist.	2001-05-17 04:58:25 +00:00
Boris Popov	3413421bda	Use vop_*vobject() VOPs to get reference to VM object from upper or lower fs.	2001-05-17 04:52:57 +00:00
Boris Popov	9dbd7336ee	Do not leave an extra reference on vnode. PR: kern/27250 Submitted by: "Vladimir B. Grebenschikov" <vova@express.ru> MFC after: 2 weeks	2001-05-17 04:40:01 +00:00
Ian Dowse	0864ef1e8a	Change the second argument of vflush() to an integer that specifies the number of references on the filesystem root vnode to be both expected and released. Many filesystems hold an extra reference on the filesystem root vnode, which must be accounted for when determining if the filesystem is busy and then released if it isn't busy. The old `skipvp' approach required individual filesystem xxx_unmount functions to re-implement much of vflush()'s logic to deal with the root vnode. All 9 filesystems that hold an extra reference on the root vnode got the logic wrong in the case of forced unmounts, so `umount -f' would always fail if there were any extra root vnode references. Fix this issue centrally in vflush(), now that we can. This commit also fixes a vnode reference leak in devfs, which could result in idle devfs filesystems that refuse to unmount. Reviewed by: phk, bp	2001-05-16 18:04:37 +00:00
Poul-Henning Kamp	f73cbde4cf	After a successful poll of the cloning functions, match on the returned dev_t rather than the original name. This allows cloning from one name to another which is useful for /dev/tty and later for the pty's.	2001-05-14 08:20:46 +00:00
Poul-Henning Kamp	ab9f3b292e	Convert DEVFS from an "opt-in" to an "opt-out" option. If for some reason DEVFS is undesired, the "NODEVFS" option is needed now. Pending any significant issues, DEVFS will be made mandatory in -current on July 1st so that we can start reaping the full benefits of having it.	2001-05-13 20:52:40 +00:00
John Baldwin	b012b205a7	GC prototype for procfs_bmap() missed during a previous commit.	2001-05-11 23:37:37 +00:00
Poul-Henning Kamp	6bd2ea83ef	Remove unneeded devfs_badop() Noticed by: rwatson	2001-05-06 17:40:34 +00:00
Boris Popov	d759827bd9	Convert vnode_pager_freepage() to vm_free_page(). Forgotten by: alfred	2001-05-03 09:00:54 +00:00
Poul-Henning Kamp	a62615e59b	Implement vop_std{get|put}pages() and add them to the default vop[]. Un-copy&paste all the VOP_{GET|PUT}PAGES() functions which do nothing but the default.	2001-05-01 08:34:45 +00:00
Mark Murray	fb919e4d5a	Undo part of the tangle of having sys/lock.h and sys/mutex.h included in other "system" header files. Also help the deprecation of lockmgr.h by making it a sub-include of sys/lock.h and removing sys/lockmgr.h from kernel .c files. Sort sys/*.h includes where possible in affected files. OK'ed by: bde (with reservations)	2001-05-01 08:13:21 +00:00
Poul-Henning Kamp	e9d19a117e	Uncut&paste some bogus use of VOP_BMAP in cd9660::VOP_STRATEGY. XXX mark some stuff which looks like further cut&paste junk.	2001-04-30 21:23:05 +00:00
Poul-Henning Kamp	c1acc01996	Uncut&paste some bogus use of VOP_BMAP in hpfs::VOP_STRATEGY. At the same time, eliminate uninitialized use of a vnode pointer. Interesting GCC didn't spot this.	2001-04-30 21:21:53 +00:00
Bruce Evans	438abdb9c6	Backed out previous commit. It caused massive filesystem corruption, not to mention a compile-time warning about the critical function becoming unused, by replacing spec_bmap() with vop_stdbmap(). ntfs seems to have the same bug. The factor for converting specfs block numbers to physical block numbers is 1, but vop_stdbmap() uses the bogus factor btodb(ap->a_vp->v_mount->mnt_stat.f_iosize), which is 16 for ffs with the default block size of 8K. This factor is bogus even for vop_stdbmap() -- the correct factor is related to the filesystem blocksize which is not necessarily the same as the optimal i/o size. vop_stdbmap() was apparently cloned from nfs where these sizes happen to be the same. There may also be a problem with a_vp->v_mount being null. spec_bmap() still checks for this, but I think the checks in specfs are dead code which used to support block devices.	2001-04-30 14:35:35 +00:00
Poul-Henning Kamp	b7ebffbc08	Add a vop_stdbmap(), and make it part of the default vop vector. Make 7 filesystems which don't really know about VOP_BMAP rely on the default vector, rather than more or less complete local vop_nopbmap() implementations.	2001-04-29 11:48:41 +00:00
Greg Lehey	60fb0ce365	Revert consequences of changes to mount.h, part 2. Requested by: bde	2001-04-29 02:45:39 +00:00
Poul-Henning Kamp	a13234bb35	Move the netexport structure from the fs-specific mount structure to struct mount. This makes the "struct netexport *" parameter to the vfs_export and vfs_checkexport interface unneeded. Consequently, all non-stacking filesystems can use vfs_stdcheckexp(). At the same time, make it a pointer to a struct netexport in struct mount, so that we can remove the bogus AF_MAX and #include <net/radix.h> from <sys/mount.h>	2001-04-25 07:07:52 +00:00
John Baldwin	33a9ed9d0e	Change the pfind() and zpfind() functions to lock the process that they find before releasing the allproc lock and returning. Reviewed by: -smp, dfr, jake	2001-04-24 00:51:53 +00:00
Matt Jacob	2b4169610b	fix it so it compiles again	2001-04-23 18:51:54 +00:00
Matt Jacob	3be6e0c249	add this ridiculous include foo so it will compile again	2001-04-23 18:14:41 +00:00
Greg Lehey	d98dc34f52	Correct #includes to work with fixed sys/mount.h.	2001-04-23 09:05:15 +00:00
Greg Lehey	97d5f7bb3b	Correct #includes to work with fixed sys/mount.h.	2001-04-23 08:28:44 +00:00
Alfred Perlstein	d8d5fa8805	vnode_pager_freepage() is really vm_page_free() in disguise, nuke vnode_pager_freepage() and replace all calls to it with vm_page_free()	2001-04-19 06:18:23 +00:00
Poul-Henning Kamp	f84e29a06c	This patch removes the VOP_BWRITE() vector. VOP_BWRITE() was a hack which made it possible for NFS client side to use struct buf with non-bio backing. This patch takes a more general approach and adds a bp->b_op vector where more methods can be added. The success of this patch depends on bp->b_op being initialized in all relevant places for some value of "relevant" which is not easy to determine. For now the buffers have grown a b_magic element which will make such issues a tiny bit easier to debug.	2001-04-17 08:56:39 +00:00
Boris Popov	0fdabd3a45	Move VT_SMBFS definition to the proper place. Undefine VI_LOCK/VI_UNLOCK.	2001-04-13 11:26:54 +00:00
Boris Popov	681a5bbef2	Import kernel part of SMB/CIFS requester. Add smbfs(CIFS) filesystem. Userland part will be in the ports tree for a while. Obtained from: smbfs-1.3.7-dev package.	2001-04-10 07:59:06 +00:00
Dag-Erling Smørgrav	9733a80839	Let pseudofs into the warmth of the FreeBSD CVS repo. It's not finished yet (I still have to find a way to implement process-dependent nodes without consuming too much memory, and the permission system needs tightening up), but it's becoming hard to work on without a repo (I've accidentally almost nuked it once already), and it works (except for the lack of process-dependent nodes, that is). I was supposed to commit this a week ago, but timed out waiting for jkh to reply to some questions I had. Pass him a spoonful of bad karma :)	2001-04-07 19:51:12 +00:00
John Baldwin	0316f71d56	- Various style fixes. - Fix a silly bug so that we return the actual error code if a procfs attach fails rather than always returning 0. Reported by: bde	2001-03-29 18:10:46 +00:00
John Baldwin	1005a129e5	Convert the allproc and proctree locks from lockmgr locks to sx locks.	2001-03-28 11:52:56 +00:00
John Baldwin	f34fa851e0	Catch up to header include changes: - <sys/mutex.h> now requires <sys/systm.h> - <sys/mutex.h> and <sys/sx.h> now require <sys/lock.h>	2001-03-28 09:17:56 +00:00
Poul-Henning Kamp	f83880518b	Send the remains (such as I have located) of "block major numbers" to the bit-bucket.	2001-03-26 12:41:29 +00:00
Boris Popov	6306f8dad3	Add dependency on libmchain module. Spotted by: Andrzej Tobola <san@iem.pw.edu.pl>	2001-03-22 06:51:53 +00:00
Robert Watson	70f3685105	o Change the API and ABI of the Extended Attribute kernel interfaces to introduce a new argument, "namespace", rather than relying on a first-character namespace indicator. This is in line with more recent thinking on EA interfaces on various mailing lists, including the posix1e, Linux acl-devel, and trustedbsd-discuss forums. Two namespaces are defined by default, EXTATTR_NAMESPACE_SYSTEM and EXTATTR_NAMESPACE_USER, where the primary distinction lies in the access control model: user EAs are accessible based on the normal MAC and DAC file/directory protections, and system attributes are limited to kernel-originated or appropriately privileged userland requests. o These API changes occur at several levels: the namespace argument is introduced in the extattr_{get,set}_file() system call interfaces, at the vnode operation level in the vop_{get,set}extattr() interfaces, and in the UFS extended attribute implementation. Changes are also introduced in the VFS extattrctl() interface (system call, VFS, and UFS implementation), where the arguments are modified to include a namespace field, as well as modified to avoid direct access to userspace variables from below the VFS layer (in the style of recent changes to mount by adrian@FreeBSD.org). This required some cleanup and bug fixing regarding VFS locks and the VFS interface, as a vnode pointer may now be optionally submitted to the VFS_EXTATTRCTL() call. Updated documentation for the VFS interface will be committed shortly. o In the near future, the auto-starting feature will be updated to search two sub-directories to the ".attribute" directory in appropriate file systems: "user" and "system" to locate attributes intended for those namespaces, as the single filename is no longer sufficient to indicate what namespace the attribute is intended for. Until this is committed, all attributes auto-started by UFS will be placed in the EXTATTR_NAMESPACE_SYSTEM namespace. o The default POSIX.1e attribute names for ACLs and Capabilities have been updated to no longer include the '$' in their filename. As such, if you're using these features, you'll need to rename the attribute backing files to the same names without '$' symbols in front. o Note that these changes will require changes in userland, which will be committed shortly. These include modifications to the extended attribute utilities, as well as to libutil for new namespace string conversion routines. Once the matching userland changes are committed, a buildworld is recommended to update all the necessary include files and verify that the kernel and userland environments are in sync. Note: If you do not use extended attributes (most people won't), upgrading is not imperative although since the system call API has changed, the new userland extended attribute code will no longer compile with old include files. o Couple of minor cleanups while I'm there: make more code compilation conditional on FFS_EXTATTR, which should recover a bit of space on kernels running without EA's, as well as update copyright dates. Obtained from: TrustedBSD Project	2001-03-15 02:54:29 +00:00
Maxim Sobolev	a7436e684a	Add missed MODULE_VERSION() call, so loading of unicode conversion routine works properly. Clue beaten in by: des	2001-03-11 15:28:42 +00:00
Boris Popov	e3c805cd07	Do not kill vnodes after rename. This can cause deadlocks in the deadfs. Noticed by: Matthew N. Dodd <winter@jurai.net>	2001-03-11 11:51:42 +00:00
Boris Popov	c35e8e54cd	Add a mount time option which slightly relaxes checks for valid Joliet extensions. PR: kern/23315 Reviewed by: adrian	2001-03-11 10:05:08 +00:00
Boris Popov	1db5c04bc0	Slightly reorganize allocation of new vnode. Use bit NVOLUME to detect vnodes which represent volumes (before it was done via strcmp()). Turn n_refparent into bit in the n_flag field.	2001-03-10 05:39:03 +00:00
Boris Popov	d691852ce6	Synch with changes in the NCP requester.	2001-03-10 05:31:22 +00:00
Kirk McKusick	589c7af992	Fixes to track snapshot copy-on-write checking in the specinfo structure rather than assuming that the device vnode would reside in the FFS filesystem (which is obviously a broken assumption with the device filesystem).	2001-03-07 07:09:55 +00:00
John Baldwin	19eb87d22a	Grab the process lock while calling psignal and before calling psignal.	2001-03-07 03:37:06 +00:00
John Baldwin	931cccf603	Proc locking identical to that of linprocfs' vnops except that we hold the proc lock while calling psignal.	2001-03-07 03:15:05 +00:00
John Baldwin	30ac5d0f9e	Protect read to p_pptr with proc lock rather than proctree lock.	2001-03-07 03:10:20 +00:00
John Baldwin	c65c565b44	Proc locking. Lock around psignal() and also ensure both an exclusive proctree lock and the process lock are held when updating p_pptr and p_oppid. When we are just reading p_pptr we only need the proc lock and not a proctree lock as well.	2001-03-07 03:09:40 +00:00
John Baldwin	0087374731	Protect p_flag with the proc lock.	2001-03-07 02:07:56 +00:00
Boris Popov	1cebc48fb3	A name of the file can change while its id stays the same. So, we have to update it as well. Remove unused function.	2001-03-06 09:59:18 +00:00
Doug Rabson	a76decc6f7	Remove the copyinstr call which was trying to copy the pathname in from user space. It has already been copied in and mp->mnt_stat.f_mntonname has already been initialised by the caller. This fixes a panic on the alpha caused by the fact that the variable 'size' wasn't initialised because the call to copyinstr() bailed out with an EFAULT error.	2001-03-03 15:15:33 +00:00
Adrian Chadd	f3a90da995	Reviewed by: jlemon An initial tidyup of the mount() syscall and VFS mount code. This code replaces the earlier work done by jlemon in an attempt to make linux_mount() work. * the guts of the mount work has been moved into vfs_mount(). * move `type', `path' and `flags' from being userland variables into being kernel variables in vfs_mount(). `data' remains a pointer into userspace. * Attempt to verify the `type' and `path' strings passed to vfs_mount() aren't too long. * rework mount() and linux_mount() to take the userland parameters (besides data, as mentioned) and pass kernel variables to vfs_mount(). (linux_mount() already did this, I've just tidied it up a little more.) * remove the copyin() stuff for `path'. `data' still requires copyin() since it's a pointer into userland. * set `mount->mnt_stat.f_mntonname' in vfs_mount() rather than in each filesystem. This variable is generally initialised with `path', and each filesystem can override it if they want to. * NOTE: f_mntonname is initialised with "/" in the case of a root mount.	2001-03-01 21:00:17 +00:00
Alfred Perlstein	8283130be4	Display the Joliet Extension 'level' in the log message. PR: kern/24998	2001-02-23 03:43:05 +00:00
Robert Watson	91421ba234	o Move per-process jail pointer (p->pr_prison) to inside of the subject credential structure, ucred (cr->cr_prison). o Allow jail inheritance to be a function of credential inheritance. o Abstract prison structure reference counting behind pr_hold() and pr_free(), invoked by the similarly named credential reference management functions, removing this code from per-ABI fork/exit code. o Modify various jail() functions to use struct ucred arguments instead of struct proc arguments. o Introduce jailed() function to determine if a credential is jailed, rather than directly checking pointers all over the place. o Convert PRISON_CHECK() macro to prison_check() function. o Move jail() function prototypes to jail.h. o Emulate the P_JAILED flag in fill_kinfo_proc() and no longer set the flag in the process flags field itself. o Eliminate that "const" qualifier from suser/p_can/etc to reflect mutex use. Notes: o Some further cleanup of the linux/jail code is still required. o It's now possible to consider resolving some of the process vs credential based permission checking confusion in the socket code. o Mutex protection of struct prison is still not present, and is required to protect the reference count plus some fields in the structure. Reviewed by: freebsd-arch Obtained from: TrustedBSD Project	2001-02-21 06:39:57 +00:00
Poul-Henning Kamp	3e8bea9634	Remove a debug printf.	2001-02-18 09:16:49 +00:00
Jonathan Lemon	608a3ce62a	Extend kqueue down to the device layer. Backwards compatible approach suggested by: peter	2001-02-15 16:34:11 +00:00
Maxim Sobolev	c4fefc4887	Add a hook for loading of a Unicode -> char conversion routine as a kld at a run-time. This is temporary solution until proper kernel Unicode interfaces are in place and as such was purposely designed to be as tiny as possible (3 lines of the code not counting comments). The port with conversion routines for the most popular single-byte languages will be added later today Reviewed by: bp, "Michael C . Wu" <keichii@iteration.net> Approved by: bp	2001-02-13 11:48:31 +00:00
Bosko Milekic	9ed346bab0	Change and clean the mutex lock interface. mtx_enter(lock, type) becomes: mtx_lock(lock) for sleep locks (MTX_DEF-initialized locks) mtx_lock_spin(lock) for spin locks (MTX_SPIN-initialized) similarly, for releasing a lock, we now have: mtx_unlock(lock) for MTX_DEF and mtx_unlock_spin(lock) for MTX_SPIN. We change the caller interface for the two different types of locks because the semantics are entirely different for each case, and this makes it explicitly clear and, at the same time, it rids us of the extra `type' argument. The enter->lock and exit->unlock change has been made with the idea that we're "locking data" and not "entering locked code" in mind. Further, remove all additional "flags" previously passed to the lock acquire/release routines with the exception of two: MTX_QUIET and MTX_NOSWITCH The functionality of these flags is preserved and they can be passed to the lock/unlock routines by calling the corresponding wrappers: mtx_{lock, unlock}_flags(lock, flag(s)) and mtx_{lock, unlock}_spin_flags(lock, flag(s)) for MTX_DEF and MTX_SPIN locks, respectively. Re-inline some lock acq/rel code; in the sleep lock case, we only inline the _obtain_lock()s in order to ensure that the inlined code fits into a cache line. In the spin lock case, we inline recursion and actually only perform a function call if we need to spin. This change has been made with the idea that we generally tend to avoid spin locks and that also the spin locks that we do have and are heavily used (i.e. sched_lock) do recurse, and therefore in an effort to reduce function call overhead for some architectures (such as alpha), we inline recursion for this case. Create a new malloc type for the witness code and retire from using the M_DEV type. The new type is called M_WITNESS and is only declared if WITNESS is enabled. Begin cleaning up some machdep/mutex.h code - specifically updated the "optimized" inlined code in alpha/mutex.h and wrote MTX_LOCK_SPIN and MTX_UNLOCK_SPIN asm macros for the i386/mutex.h as we presently need those. Finally, caught up to the interface changes in all sys code. Contributors: jake, jhb, jasone (in no particular order)	2001-02-09 06:11:45 +00:00
Jeroen Ruigrok van der Werven	1a6e52d0e9	Fix typo: seperate -> separate. Seperate does not exist in the english language.	2001-02-06 11:21:58 +00:00
Poul-Henning Kamp	37d4006626	Another round of the <sys/queue.h> FOREACH transmogriffer. Created with: sed(1) Reviewed by: md5(1)	2001-02-04 16:08:18 +00:00
Poul-Henning Kamp	fc2ffbe604	Mechanical change to use <sys/queue.h> macro API instead of fondling implementation details. Created with: sed(1) Reviewed by: md5(1)	2001-02-04 13:13:25 +00:00
Poul-Henning Kamp	ef9e85abba	Use <sys/queue.h> macro API.	2001-02-04 12:37:48 +00:00
Poul-Henning Kamp	b99cfaf32c	Remove a DIAGNOSTIC check which belongs in <sys/queue.h> if anyplace at all.	2001-02-04 11:53:51 +00:00
Poul-Henning Kamp	4b1c62b3f2	At the point in time where most devices are created, we don't know what time it is because boottime is not yet initialized. Finagle the relevant fields when we get the chance.	2001-02-02 22:54:41 +00:00
Poul-Henning Kamp	ecde9a6dae	Only superuser can create symlinks. Give symlinks mode 755 by default to avoid triggering alert eyes. (the mode isn't used on symlinks)	2001-02-02 18:35:29 +00:00
Peter Wemm	2508f69037	Zap last remaining references to (and a use of) simple_locks.	2001-01-31 04:29:52 +00:00
Poul-Henning Kamp	4997ad7c1f	Add a BUF_KERNPROC() in the BIO_DELETE path. This seems to fix the problem which md(4) backed filesystems exposed.	2001-01-30 10:06:08 +00:00
Poul-Henning Kamp	aadf265525	Fix two minor nits. Existences revealed, but no details offered by: bp	2001-01-30 08:39:52 +00:00
Matthew Dillon	2a9737202a	This patch reestablishes the spec_fsync() guarantee that synchronous fsyncs, which typically occur during unmounting, will drain all dirty buffers even if it takes multiple passes to do so. The guarantee was mangled by the last patch which solved a problem due to -current disabling interrupts while holding giant (which caused an infinite spin loop waiting for I/O to complete). -stable does not have either patch, but has a similar bug in the original spec_fsync() code which is triggered by a bug in the softupdates umount code, a fix for which will be committed to -current as soon as Kirk stamps it. Then both solutions will be MFC'd to -stable. -stable currently suffers from a combination of the softupdates bug and a small window of opportunity in the original spec_fsync() code, and -stable also suffers from the spin-loop bug but since interrupts are enabled the spin resolves itself in a few milliseconds.	2001-01-29 08:19:28 +00:00
John Baldwin	ba88dfc733	Back out proc locking to protect p_ucred for obtaining additional references along with the actual obtaining of additional references.	2001-01-27 00:01:31 +00:00
Jason Evans	1b367556b5	Convert all simplelocks to mutexes and remove the simplelock implementations.	2001-01-24 12:35:55 +00:00
John Baldwin	b939335607	- Catch up to proc flag changes.	2001-01-24 11:20:05 +00:00
John Baldwin	54bd3c0306	The lock being destroyed was misnamed, not unused. Add the lockdestroy() back in but with the proper name so that this compiles. Submitted by: jasone	2001-01-24 02:18:54 +00:00
John Baldwin	cfb4c0b4f9	Proc locking to protect p_ucred while we obtain additional references.	2001-01-24 00:26:19 +00:00
John Baldwin	d19a727628	- Remove unused header include. - Use queue macros.	2001-01-23 22:38:38 +00:00
John Baldwin	1aab03a584	Proc locking to protect p_ucred while we obtain an additional reference.	2001-01-23 22:38:15 +00:00
John Baldwin	f5343b3219	- FreeBSD doesn't have an abortop vnop as far as I can tell, so #ifdef references to the hpf op out. - Remove a lockdestroy() on a non-existent variable.	2001-01-23 22:37:30 +00:00
Peter Wemm	10cf882b4f	Fix breakage uncovered by LINT - don't refer to undefined variables in KASSERT()	2001-01-17 01:10:23 +00:00
Garrett Wollman	d31a0944a1	Delete unused #include <sys/select.h>.	2001-01-09 04:32:24 +00:00
Garrett Wollman	b7ef0b1281	Don't compile a dead variable declaration.	2001-01-09 04:24:43 +00:00
Poul-Henning Kamp	49851cc706	Use macro API to <sys/queue.h>	2000-12-31 10:24:19 +00:00
Matthew Dillon	08c0a67b2e	Fix a lockup problem that occurs with 'cvs update'. specfs's fsync can get into the same sort of infinite loop that ffs's fsync used to get into, probably due to background bitmap writes. The solution is the same.	2000-12-30 23:32:24 +00:00
Matthew Dillon	2b6b0df712	This implements a better launder limiting solution. There was a solution in 4.2-REL which I ripped out in -stable and -current when implementing the low-memory handling solution. However, maxlaunder turns out to be the saving grace in certain very heavily loaded systems (e.g. newsreader box). The new algorithm limits the number of pages laundered in the first pageout daemon pass. If that is not sufficient then successive passes will be run without any limit. Write I/O is now pipelined using two sysctls, vfs.lorunningspace and vfs.hirunningspace. This prevents excessive buffered writes in the disk queues which cause long (multi-second) delays for reads. It leads to more stable (less jerky) and generally faster I/O streaming to disk by allowing required read ops (e.g. for indirect blocks and such) to occur without interrupting the write stream, among other things. NOTE: eventually, filesystem write I/O pipelining needs to be done on a per-device basis. At the moment it is globalized.	2000-12-26 19:41:38 +00:00
Jake Burkholder	98f03f9030	Protect proc.p_pptr and proc.p_children/p_sibling with the proctree_lock. linprocfs not locked pending response from informal maintainer. Reviewed by: jhb, -smp@	2000-12-23 19:43:10 +00:00
John Baldwin	ea10b6b78e	When p_ucred is passed to the venus daemon, first grab the proc lock to protect the p_ucred pointer, obtain a separate reference to the ucred, release the lock, and then pass in the new ucred reference.	2000-12-15 00:12:30 +00:00
Robert Watson	f6a99e61c5	o Tighten restrictions on use of /proc/pid/ctl and move access checks in ctl to using centralized p_can() inter-process access control interface. Reviewed by: sef	2000-12-13 04:28:24 +00:00
Jake Burkholder	c0c2557090	- Change the allproc_lock to use a macro, ALLPROC_LOCK(how), instead of explicit calls to lockmgr. Also provides macros for the flags passed to specify shared, exclusive or release which map to the lockmgr flags. This is so that the use of lockmgr can be easily replaced with optimized reader-writer locks. - Add some locking that I missed the first time.	2000-12-13 00:17:05 +00:00
Dag-Erling Smørgrav	668891c57b	Add a module version (so that linprocfs can properly depend on procfs)	2000-12-09 13:17:51 +00:00
David Malone	7cc0979fd6	Convert more malloc+bzero to malloc+M_ZERO. Submitted by: josh@zipperup.org Submitted by: Robert Drehmel <robd@gmx.net>	2000-12-08 21:51:06 +00:00
Poul-Henning Kamp	2a0436783d	staticize.	2000-12-08 15:07:24 +00:00
John Baldwin	f3a16450d0	Protect accesses to member of struct proc with the proc lock.	2000-12-06 01:45:20 +00:00
John Baldwin	c3f52eedeb	Protect p_stat with the sched_lock. Reviewed by: jake	2000-12-02 01:58:15 +00:00
Jonathan Lemon	747fa57549	Update to reflect the disappearance of getsock(). Found by: LINT	2000-11-25 07:16:06 +00:00
Boris Popov	f2b1e0d206	Use vop_defaultop() instead of ntfs_bypass(). PR: kern/22756	2000-11-18 02:47:12 +00:00
Kirk McKusick	dc029a4bd7	Missed conversion of CIRCLEQ => TAILQ for mount list.	2000-11-14 06:38:18 +00:00
Eivind Eklund	b8c8516a7f	More paranoia against overflows	2000-11-08 21:53:05 +00:00
Boris Popov	1c4c2d8235	v_interlock is a mutex now, not simple lock.	2000-11-04 02:42:11 +00:00
Poul-Henning Kamp	1d7e3e42e7	Take VBLK devices further out of their misery. This should fix the panic I introduced in my previous commit on this topic.	2000-11-02 21:14:13 +00:00
Eivind Eklund	ab3240e198	Fix overflow from jail hostname. Bug found by: Esa Etelavuori <eetelavu@cc.hut.fi>	2000-11-01 19:38:08 +00:00
Eivind Eklund	e3c4036b18	Give vop_mmap an untimely death. The opportunity to give it a timely death timed out in 1996.	2000-11-01 17:57:24 +00:00
David Malone	da71e9a21b	Make malloc use M_ZERO in some more locations. Don't check for a null pointer if malloc called with M_WAITOK. Submitted by: josh@zipperup.org Submitted by: Robert Drehmel <robd@gmx.net> Approved by: bp	2000-10-29 16:14:28 +00:00
Poul-Henning Kamp	cf9fa8e725	Move suser() and suser_xxx() prototypes and a related #define from <sys/proc.h> to <sys/systm.h>. Correctly document the #includes needed in the manpage. Add one now needed #include of <sys/systm.h>. Remove the consequent 48 unused #includes of <sys/proc.h>.	2000-10-29 16:06:56 +00:00
Poul-Henning Kamp	9f69a4578a	Weaken a bogus dependency on <sys/proc.h> in <sys/buf.h> by #ifdef'ing the offending inline function (BUF_KERNPROC) on it being #included already. I'm not sure BUF_KERNPROC() is even the right thing to do or in the right place or implemented the right way (inline vs normal function). Remove consequently unneeded #includes of <sys/proc.h>	2000-10-29 14:54:55 +00:00
Poul-Henning Kamp	53ce36d17a	Remove unneeded #include <sys/proc.h> lines.	2000-10-29 13:57:19 +00:00
Boris Popov	6716c905c9	Rev 1.41 was committed from wrong diff, now do it right.	2000-10-22 16:15:12 +00:00
Boris Popov	3ae19dd8cd	Release and unlock vnode if resource deadlock detected.	2000-10-22 15:40:22 +00:00
Boris Popov	b1b494a765	Update stale comment. PR: kern/21805	2000-10-22 14:24:30 +00:00
Boris Popov	e7b1ac75dd	Remove de_lock field from denode structure and make msdosfs PDIRUNLOCK aware.	2000-10-22 14:22:17 +00:00
Boris Popov	d45a191e99	Fix nullfs breakage caused by incomplete migration of v_interlock from simple_lock to mutex. Reset LK_INTERLOCK flag when interlock released manually.	2000-10-15 06:25:42 +00:00
Chris Costello	3157714de7	o Move from Alfred Perlstein's "exclusion" technique of handling special file types to requiring all file types to properly implement fo_stat. This makes any new file type additions much easier as this code no longer has to be modified to accommodate it. o Instead of using curproc in fdesc_allocvp, pass a `struct proc' pointer as a new fifth parameter.	2000-10-09 20:06:13 +00:00
Eivind Eklund	7eb9fca557	Blow away the v_specmountpoint define, replacing it with what it was defined as (rdev->si_mountpoint)	2000-10-09 17:31:39 +00:00
Poul-Henning Kamp	8abea41d80	Don't hold an extra reference to vnodes. Devfs vnodes are sufficiently cheap to setup that it doesn't really matter that we recycle device vnodes at kleenex speed. Implement first cut try at killing cloned devices when they are not needed anymore. For now only the bpf driver is involved in this experiment. Cloned devices can set the SI_CHEAPCLONE flag which allows us to destroy_dev() it when the vcount() drops to zero and the vnode is reclaimed. For now it's a requirement that the driver doesn't keep persistent state from close to (re)open. Some whitespace changes.	2000-10-09 14:18:07 +00:00
Alfred Perlstein	e5b7b6b78e	return correct type for process directory entries, DT_DIR not DT_REG	2000-10-05 23:19:51 +00:00
Bruce Evans	253315a5a3	Forward-declare struct mbuf so that this file is less self-insufficient -- don't depend on garbage in <sys/mount.h>. mbufs aren't actually used here either. They should have been completely removed from filesystem interfaces when they were removed from the interfaces to convert between file handles and vnodes.	2000-10-05 11:58:22 +00:00
Jason Evans	a18b1f1d4d	Convert lockmgr locks from using simple locks to using mutexes. Add lockdestroy() and appropriate invocations, which corresponds to lockinit() and must be called to clean up after a lockmgr lock is no longer needed.	2000-10-04 01:29:17 +00:00
Boris Popov	beafe997c7	Make cd9660 filesystem PDIRUNLOCK aware. Now it can be used in vnode stacks and nullfs mounts. Remove now unnecessary i_lock field from the iso_node structure.	2000-10-03 04:39:50 +00:00
Boris Popov	611025ec57	Prevent dereference of NULL pointer when null_lock() and null_unlock() called and there is no underlying vnode.	2000-10-03 04:25:53 +00:00
Boris Popov	5c4db877e4	Protect hash data with lock manager instead of home grown one. Replace shared lock on vnode with exclusive one. It shouldn't impact performance as NCP protocol doesn't support outstanding requests. Do not hold simple lock on vnode for long period of time. Add functionality to the nwfs_print() routine.	2000-10-02 09:49:04 +00:00
Boris Popov	7f66accf02	Get rid from the legacy __P() macro. Remove 'register' keywords.	2000-10-02 09:29:59 +00:00
Peter Wemm	6adbf7beeb	PDIRUNLOCK now exists on FreeBSD. Remove the (now incorrect) redefinition.	2000-10-02 04:47:19 +00:00
Boris Popov	4451405fbd	Fix vnode locking bugs in the nullfs. Add correct support for v_object management, so mmap() operation should work properly. Add support for extattrctl() routine (submitted by semenu). At this point nullfs can be considered as functional and much more stable. In fact, it should behave as a "hard" "symlink" to underlying filesystem. Reviewed in general by: mckusick, dillon Parts of logic obtained from: NetBSD	2000-09-25 15:38:32 +00:00
Poul-Henning Kamp	eb7ba7f95c	Ignore attempts to set flags to zero. This quenches a syslog warning from login(1).	2000-09-18 09:40:01 +00:00
Poul-Henning Kamp	c80d29139c	Add canonical checks to devfs_setattr().	2000-09-16 12:06:58 +00:00
John Baldwin	b570da11fe	Use size_t instead of u_int for 4th argument to copyinstr().	2000-09-12 22:39:34 +00:00
Jason Evans	0384fff8c5	Major update to the way synchronization is done in the kernel. Highlights include: * Mutual exclusion is used instead of spl(). See mutex(9). (Note: The alpha port is still in transition and currently uses both.) Per-CPU idle processes. * Interrupts are run in their own separate kernel threads and can be preempted (i386 only). Partially contributed by: BSDi (BSD/OS) Submissions by (at least): cp, dfr, dillon, grog, jake, jhb, sheldonh	2000-09-07 01:33:02 +00:00
Poul-Henning Kamp	93bcdfe270	Add refcounts to the "global" DEVFS inode slots, this allows us to recycle inodes after a destroy_dev() but not until all mounts have picked up the change. Add support for an overflow table for DEVFS inodes. The static table defaults to 1024 inodes, if that fills, an overflow table of 32k inodes is allocated. Both numbers can be changed at compile time, the size of the overflow table also with the sysctl vfs.devfs.noverflow. Use atomic instructions to barrier between make_dev()/destroy_dev() and the mounts. Add lockmgr() locking of directories for operations accessing or modifying the directory TAILQs. Various nitpicking here and there.	2000-09-06 11:26:43 +00:00
Boris Popov	8da8066061	Various cleanups towards make nullfs functional (it is still broken at this point): Replace all '#ifdef DEBUG' with '#ifdef NULLFS_DEBUG' and add NULLFSDEBUG macro. Protect nullfs hash table with lockmgr. Use proper order of operations when freeing mnt_data. Return correct fsid in the null_getattr(). Add null_open() function to catch MNT_NODEV (obtained from NetBSD). Add null_rename() to catch cross-fs rename operations (submitted by Ustimenko Semen <semen@iclub.nsu.ru>) Remove duplicate $FreeBSD$ tags.	2000-09-05 09:02:07 +00:00
Boris Popov	7da1e3f0b0	Get rid from the __P() macros. Encouraged by: peter	2000-09-05 07:54:39 +00:00
Poul-Henning Kamp	7665e72021	Off by one error. Submitted by: des	2000-09-04 18:24:30 +00:00
Dag-Erling Smørgrav	18203af447	Remove a comment that has been not only obsolete but patently wrong for the last 31 revisions (almost three years).	2000-09-04 18:18:17 +00:00
Poul-Henning Kamp	db90128160	Avoid the modules madness I inadvertently introduced by making the cloning infrastructure standard in kern_conf. Modules are now the same with or without devfs support. If you need to detect if devfs is present, in modules or elsewhere, check the integer variable "devfs_present". This happily removes an ugly hack from kern/vfs_conf.c. This forces a rename of the eventhandler and the standard clone helper function. Include <sys/eventhandler.h> in <sys/conf.h>: it's a helper #include like <sys/queue.h> Remove all #includes of opt_devfs.h they no longer matter.	2000-09-02 19:17:34 +00:00
Robert Watson	84a5637620	o Simplify if/then clause equating ESRCH with ENOENT when hiding a process Submitted by: des	2000-09-01 18:41:32 +00:00
Robert Watson	ca94dd37a3	o Make procfs use vaccess() for procfs_access() DAC and super-user checks, rather than implementing its own {uid,gid,other} checks against vnode mode. Similar change to linprocfs currently under review. Obtained from: TrustedBSD Project	2000-09-01 13:41:41 +00:00
Robert Watson	387d2c036b	o Centralize inter-process access control, introducing: int p_can(p1, p2, operation, privused) which allows specification of subject process, object process, inter-process operation, and an optional call-by-reference privused flag, allowing the caller to determine if privilege was required for the call to succeed. This allows jail, kern.ps_showallprocs and regular credential-based interaction checks to occur in one block of code. Possible operations are P_CAN_SEE, P_CAN_SCHED, P_CAN_KILL, and P_CAN_DEBUG. p_can currently breaks out as a wrapper to a series of static function checks in kern_prot, which should not be invoked directly. o Commented out capabilities entries are included for some checks. o Update most inter-process authorization to make use of p_can() instead of manual checks, PRISON_CHECK(), P_TRESPASS(), and kern.ps_showallprocs. o Modify suser{,_xxx} to use const arguments, as it no longer modifies process flags due to the disabling of ASU. o Modify some checks/errors in procfs so that ENOENT is returned instead of ESRCH, further improving concealment of processes that should not be visible to other processes. Also introduce new access checks to improve hiding of processes for procfs_lookup(), procfs_getattr(), procfs_readdir(). Correct a bug reported by bp concerning not handling the CREATE case in procfs_lookup(). Remove volatile flag in procfs that caused apparently spurious qualifier warnings (approved by bde). o Add comment noting that ktrace() has not been updated, as its access control checks are different from ptrace(), whereas they should probably be the same. Further discussion should happen on this topic. Reviewed by: bde, green, phk, freebsd-security, others Approved by: bde Obtained from: TrustedBSD Project	2000-08-30 04:49:09 +00:00
Robert Watson	012c643d3e	o Restructure vaccess() so as to check for DAC permission to modify the object before falling back on privilege. Make vaccess() accept an additional optional argument, privused, to determine whether privilege was required for vaccess() to return 0. Add commented out capability checks for reference. Rename some variables to make it more clear which modes/uids/etc are associated with the object, and which with the access mode. o Update file system use of vaccess() to pass NULL as the optional privused argument. Once additional patches are applied, suser() will no longer set ASU, so privused will permit passing of privilege information up the stack to the caller. Reviewed by: bde, green, phk, -security, others Obtained from: TrustedBSD Project	2000-08-29 14:45:49 +00:00
Poul-Henning Kamp	c32d0a1dcd	Reorder vop's alphabetically. Smarter use of devfs_allocv() (from bp@) Introduce devfs_find() ".." fixes to devfs_lookup (from bp@)	2000-08-27 14:46:36 +00:00
Poul-Henning Kamp	749e1537ec	Minor cleanups to devfs_readdir(); Add devfs_read() for directories. (inspired by bp@)	2000-08-26 16:20:57 +00:00
Bruce Evans	ff4ad0c4d8	Quick fix for msdosfs_write() on alphas and other machines with either longs larger than 32 bits or strict alignment requirements. pm_fatmask had type u_long, but it must have a type that has precisely 32 bits and this type must be no smaller than int, so that ~pmp->pm_fatmask has no bits above the 31st set. Otherwise, comparisons between (cn | ~pmp->pm_fatmask) and magic 32-bit "cluster" numbers always fail. The correct fix is to use the C99 type uint_least32_t and mask with 0xffffffff. The quick fix is to use u_int32_t and assume that ints have msdosfs metadata is riddled with unaligned fields, and on alphas, unaligned_fixup() apparently has problems fixing up the unaligned accesses caused by this. The quick fix is to not comment out the NetBSD code that sort of handles this, and define UNALIGNED_ACCESS on i386's so that the code doesn't change on i386's. The correct fix would define UNALIGNED_ACCESS in a central machine-dependent header and maybe add some extra cases to unaligned_fixup(). UNALIGNED_ACCESS is also tested in isofs. Submitted by: parts by Mark Abene <phiber@radicalmedia.com> PR: 19086	2000-08-25 09:03:58 +00:00
Poul-Henning Kamp	a481b90b82	Fix panic when removing open device (found by bp@) Implement subdirs. Build the full "devicename" for cloning functions. Fix panic when deleted device goes away. Collapse devfs_dir and devfs_dirent structures. Add proper cloning to the /dev/fd* "device-"driver. Fix a bug in make_dev_alias() handling which made aliases appear multiple times. Use devfs_clone to implement getdiskbyname() Make specfs maintain the stat(2) timestamps per dev_t	2000-08-24 15:36:55 +00:00
Poul-Henning Kamp	fcc9b84ca5	Fix devfs_access() bug on directories. Remove unused #includes. Bug spotted by: markm	2000-08-21 14:45:19 +00:00
Poul-Henning Kamp	3f54a085a6	Remove all traces of Julians DEVFS (incl from kern/subr_diskslice.c) Remove old DEVFS support fields from dev_t. Make uid, gid & mode members of dev_t and set them in make_dev(). Use correct uid, gid & mode in make_dev in disk minilayer. Add support for registering alias names for a dev_t using the new function make_dev_alias(). These will show up as symlinks in DEVFS. Use makedev() rather than make_dev() for MFSs magic devices to prevent DEVFS from noticing this abuse. Add a field for DEVFS inode number in dev_t. Add new DEVFS in fs/devfs. Add devfs cloning to: disk minilayer (ie: ad(4), sd(4), cd(4) etc etc) md(4), tun(4), bpf(4), fd(4) If DEVFS add -d flag to /sbin/inits args to make it mount devfs. Add commented out DEVFS to GENERIC	2000-08-20 21:34:39 +00:00
Poul-Henning Kamp	e39c53eda5	Centralize the canonical vop_access user/group/other check in vaccess(). Discussed with: bde	2000-08-20 08:36:26 +00:00
Poul-Henning Kamp	39f70682ae	Introduce vop_stdinactive() and make it the default if no vop_inactive is declared. Sort and prune a few vop_op[].	2000-08-18 10:01:02 +00:00
Sheldon Hearn	1b2fbe6ff9	Rename the loadable nullfs kernel module: null -> nullfs	2000-07-28 11:54:09 +00:00
Kirk McKusick	9b97113391	This patch corrects the first round of panics and hangs reported with the new snapshot code. Update addaliasu to correctly implement the semantics of the old checkalias function. When a device vnode first comes into existence, check to see if an anonymous vnode for the same device was created at boot time by bdevvp(). If so, adopt the bdevvp vnode rather than creating a new vnode for the device. This corrects a problem which caused the kernel to panic when taking a snapshot of the root filesystem. Change the calling convention of vn_write_suspend_wait() to be the same as vn_start_write(). Split out softdep_flushworklist() from softdep_flushfiles() so that it can be used to clear the work queue when suspending filesystem operations. Access to buffers becomes recursive so that snapshots can recursively traverse their indirect blocks using ffs_copyonwrite() when checking for the need for copy on write when flushing one of their own indirect blocks. This eliminates a deadlock between the syncer daemon and a process taking a snapshot. Ensure that softdep_process_worklist() can never block because of a snapshot being taken. This eliminates a problem with buffer starvation. Cleanup change in ffs_sync() which did not synchronously wait when MNT_WAIT was specified. The result was an unclean filesystem panic when doing forcible unmount with heavy filesystem I/O in progress. Return a zero'ed block when reading a block that was not in use at the time that a snapshot was taken. Normally, these blocks should never be read. However, the readahead code will occasionally read them which can cause unexpected behavior. Clean up the debugging code that ensures that no blocks be written on a filesystem while it is suspended. Snapshots must explicitly label the blocks that they are writing during the suspension so that they do not cause a `write on suspended filesystem' panic. Reorganize ffs_copyonwrite() to eliminate a deadlock and also to prevent a race condition that would permit the same block to be copied twice. This change eliminates an unexpected soft updates inconsistency in fsck caused by the double allocation. Use bqrelse rather than brelse for buffers that will be needed soon again by the snapshot code. This improves snapshot performance.	2000-07-24 05:28:33 +00:00
David Malone	4ebb509c1f	Certain error conditions cause msdosfs_rename() to decrement the vnode reference count on 'fdvp' more times than it should. PR: 17347 Submitted by: Ian Dowse <iedowse@maths.tcd.ie> Approved by: bde	2000-07-14 11:52:56 +00:00


936 Commits