freebsd-dev

Author	SHA1	Message	Date
Poul-Henning Kamp	45c26fa2b6	Improve the vfs_hash() API: vput() the unneeded vnode centrally to avoid replicating the vput in all the filesystems.	2005-03-15 20:00:03 +00:00
Jeff Roberson	bc855512c8	- Assume that all lower filesystems now support proper locking. Assert that they set v->v_vnlock. This is true for all filesystems in the tree. - Remove all uses of LK_THISLAYER. If the lower layer is locked, the null layer is locked. We only use vget() to get a reference now. null essentially does no locking. This fixes LOOKUP_SHARED with nullfs. - Remove the special LK_DRAIN considerations, I do not believe this is needed now as LK_DRAIN doesn't destroy the lower vnode's lock, and it's hardly used anymore. - Add one well commented hack to prevent the lowervp from going away while we're in it's VOP_LOCK routine. This can only happen if we're forcibly unmounted while some callers are waiting in the lock. In this case the lowervp could be recycled after we drop our last ref in null_reclaim(). Prevent this with a vhold().	2005-03-15 13:49:33 +00:00
Poul-Henning Kamp	7649bbb0b0	Disable two users of findcdev. They do the wrong thing now and will need to be fixed. In both cases the API should be reengineered to do something (more) sensible.	2005-03-15 12:39:30 +00:00
Jeff Roberson	9feb7408f8	- We have to transfer lockers after reseting our vnlock pointer. Sponsored by: Isilon Systems, Inc.	2005-03-15 11:28:45 +00:00
Poul-Henning Kamp	46d7d4a332	Don't export major,minor, instead export tty name.	2005-03-15 11:05:11 +00:00
Poul-Henning Kamp	6bc6a87cc9	Print devtoname() instead of minor().	2005-03-15 10:01:31 +00:00
Poul-Henning Kamp	40d04a26a0	Fix typo: pointers are not boolean in style(9).	2005-03-15 10:01:14 +00:00
Poul-Henning Kamp	e82ef95c11	Simplify the vfs_hash calling convention.	2005-03-15 08:07:07 +00:00
Dag-Erling Smørgrav	0e3b5c73b2	Hook pfs_lookup() up to vfs_cachedlookup_desc instead of vfs_lookup_desc, as suggested by Matt's comment. Also fix some style and paranoia issues. The entire function could benefit from review by a VFS guru. MFC after: 6 weeks	2005-03-14 16:24:50 +00:00
Dag-Erling Smørgrav	bc593ccd83	Fix two long-standing bugs in pfs_readdir(): Since we used an sbuf of size resid to accumulate dirents, we would end up returning one byte short when we had enough dirents to fill or exceed the size of the sbuf (the last byte being lost to bogus NUL termination) causing the next call to return EINVAL due to an unaligned offset. This went undetected for a long time because I did most of my testing in single-user mode, where there are rarely enough processes to fill the 4096-byte buffer ls(1) uses. The most common symptom of this bug is that tab completion of /proc or /compat/linux/proc does not work properly when many processes are running. Also, a check near the top would return EINVAL if resid was smaller than PFS_DELEN, even if it was 0, which is frequently the case and perfectly allowable. Change the test so that it returns 0 if resid is 0. MFC after: 2 weeks	2005-03-14 16:21:32 +00:00
Dag-Erling Smørgrav	cb5abc7d2d	If PSEUDOFS_TRACE is defined, create a sysctl knob to enable / disable pseudofs call tracing.	2005-03-14 16:06:47 +00:00
Dag-Erling Smørgrav	de52d21a02	fbsdidize.	2005-03-14 15:54:11 +00:00
Poul-Henning Kamp	2f00593534	Use vfs_hash instead of home-rolled.	2005-03-14 14:41:37 +00:00
Poul-Henning Kamp	dfb9f846e9	Use vfs_hash instead of home-rolled.	2005-03-14 13:22:41 +00:00
Poul-Henning Kamp	4e94fafc4f	Use vfs_hash instead of home-rolled. Correct locking around g_vfs_close()	2005-03-14 12:29:39 +00:00
Poul-Henning Kamp	a30fc63b19	Use vfs_hash instead of home-rolling.	2005-03-14 12:24:35 +00:00
Jeff Roberson	c1e7e9ba9b	- VOP_INACTIVE should no longer drop the vnode lock. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:18:47 +00:00
Jeff Roberson	8da0046596	- The VI_DOOMED flag now signals the end of a vnode's relationship with the filesystem. Check that rather than VI_XLOCK. - VOP_INACTIVE should no longer drop the vnode lock. - The vnode lock is required around calls to vrecycle() and vgone(). Sponsored by: Isilon Systems, Inc.	2005-03-13 12:18:25 +00:00
Jeff Roberson	c0f681c21d	- The VI_DOOMED flag now signals the end of a vnode's relationship with the filesystem. Check that rather than VI_XLOCK. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:14:56 +00:00
Jeff Roberson	172ffe319a	- The c_lock in the coda node does not offer any features over the standard vnode lock. Remove the c_lock and use the vn lock in its place. - Keep the coda lock functions so that the debugging information is preserved, but call directly to the vop_std*lock routines for the real functionality. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:09:34 +00:00
Jeff Roberson	3100b70037	- Deadfs may now use the standard vop lock, get rid of dead_lock(). - We no longer have to take the XLOCK state into consideration in any routines. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:06:20 +00:00
David E. O'Brien	bdc172ab8f	Used unsigned version. Submitted by: jmallett	2005-03-12 06:06:04 +00:00
David E. O'Brien	fb2eece6d2	Fix kernel build on 64-bit machines.	2005-03-12 03:50:39 +00:00
Nate Lawson	d81812be67	Correct a last-minute thinko. Instead of copying the nul with the string, nul-terminate the dp->d_name directly and only copy the string.	2005-03-11 23:35:23 +00:00
Nate Lawson	4cdb148352	The mbnambuf routines combine multiple substrings into a single long filename. Each substring is indexed by the windows ID, a sequential one-based value. The previous code was extremely slow, doing a malloc/strcpy/free for each substring. This code optimizes these routines with this in mind, using the ID to index into a single array and concatenating each WIN_CHARS chunk at once. (The last chunk is variable-length.) This code has been tested as working on an FS with difficult filename sizes (255, 13, 26, etc.) It gives a 77.1% decrease in profiled time (total across all functions) and a 73.7% decrease in wall time. Test was "ls -laR > /dev/null". Per-function time savings: mbnambuf_init: -90.7% mbnambuf_write: -18.7% mbnambuf_flush: -67.1% MFC after: 1 month	2005-03-11 23:27:45 +00:00
Poul-Henning Kamp	2647407860	One more bit of the major/minor patch to make ttyname happy as well.	2005-03-10 18:49:17 +00:00
Poul-Henning Kamp	b43ab0e378	Try to fix the mess I made of devname, with the minimal subset of the larger minor/major patch which was posted for testing.	2005-03-10 18:21:34 +00:00
Poul-Henning Kamp	f5af7353c0	Remove kernelside support for devfs rules filtering on major numbers.	2005-03-08 19:51:27 +00:00
Poul-Henning Kamp	a24042b727	Avoid a couple of mutex operations in the process exit path for the common case where procfs have never been mounted. OK'ed by: des	2005-03-01 12:20:49 +00:00
Poul-Henning Kamp	7ce296cf04	Remove debug printout of major/minor numbers, print name instead.	2005-02-27 21:16:26 +00:00
Sam Leffler	3cdbd5fb04	remove dead code Submitted by: Coverity Prevent analysis tool	2005-02-22 19:02:24 +00:00
Poul-Henning Kamp	0454a53d65	We may not have an actual cdev at this point.	2005-02-22 18:17:31 +00:00
Poul-Henning Kamp	aa2f6ddc3f	Reap more benefits from DEVFS: List devfs_dirents rather than vnodes off their shared struct cdev, this saves a pointer field in the vnode at the expense of a field in the devfs_dirent. There are often 100 times more vnodes so this is bargain. In addition it makes it harder for people to try to do stypid things like "finding the vnode from cdev". Since DEVFS handles all VCHR nodes now, we can do the vnode related cleanup in devfs_reclaim() instead of in dev_rel() and vgonel(). Similarly, we can do the struct cdev related cleanup in dev_rel() instead of devfs_reclaim(). rename idestroy_dev() to destroy_devl() for consistency. Add LIST_ENTRY de_alias to struct devfs_dirent. Remove v_specnext from struct vnode. Change si_hlist to si_alist in struct cdev. String new devfs vnodes' devfs_dirent on si_alist when we create them and take them off in devfs_reclaim(). Fix devfs_revoke() accordingly. Also don't clear fields devfs_reclaim() will clear when called from vgone(); Let devfs_reclaim() call dev_rel() instead of vgonel(). Move the usecount tracking from dev_rel() to devfs_reclaim(), and let dev_rel() take a struct cdev argument instead of vnode. Destroy SI_CHEAPCLONE devices in dev_rel() (instead of devfs_reclaim()) when they are no longer used. (This should maybe happen in devfs_close() instead.)	2005-02-22 15:51:07 +00:00
Poul-Henning Kamp	5a98dd4df5	vp->v_id is a private field for the vfs namecache and it is a big mistake that NFS ever started using it and an even bigger that it got copied&pasted to nwfs and smbfs. Replace with use of vhold()/vdrop().	2005-02-22 15:06:30 +00:00
Poul-Henning Kamp	f69d42a1d2	Use vn_printf() instead of home-rolling.	2005-02-22 14:58:59 +00:00
Poul-Henning Kamp	1a1457d427	Make dev_ref() require the dev_lock() to be held and use it from devfs instead of directly frobbing the si_refcount.	2005-02-22 14:41:04 +00:00
David Schultz	0e2b18143f	Replace the workaround for a deadlock bug in Coda with a different workaround that does not rely on vfs_start().	2005-02-20 23:01:57 +00:00
Robert Watson	1bfca411a6	Remove basically unused root_vp pointer in udfmount. MFC after: 1 week Discussed with: scottl	2005-02-18 11:47:51 +00:00
Robert Watson	5d0c377bfe	Conditionalize cd9660 chattiness regarding the nature of the file system mounted (is it Joliet, RockRidge, High Sierra) based on bootverbose. Most file systems don't generate log messages based on details of the file system superblock, and these log messages disrupt sysinstall output during a new install from CD. We may want to explore exposing this status information using nmount() at some point. MFC after: 3 days	2005-02-18 10:49:55 +00:00
Poul-Henning Kamp	4d8ac58b05	Introduce vx_wait{l}() and use it instead of home-rolled versions.	2005-02-17 10:49:51 +00:00
Poul-Henning Kamp	5ece08f57a	Make a SYSCTL_NODE static	2005-02-10 12:23:29 +00:00
Poul-Henning Kamp	66ae53f804	make M_NTFSMNT and ntfs_calccfree() static	2005-02-10 12:09:49 +00:00
Poul-Henning Kamp	9def42f333	Make fdesc_root static	2005-02-10 12:09:15 +00:00
Poul-Henning Kamp	f70f851c60	Make smbfs_debuglevel private.	2005-02-10 12:07:02 +00:00
Poul-Henning Kamp	271c679c17	don't call vprint with NULL.	2005-02-10 12:06:34 +00:00
Poul-Henning Kamp	87c045d5a2	Statize malloc types. Don't call vprint with NULL.	2005-02-10 12:05:06 +00:00
Poul-Henning Kamp	df32e67c73	Statize devfs_ops_f	2005-02-10 12:04:26 +00:00
Poul-Henning Kamp	c711aea6ca	Make a bunch of malloc types static. Found by: src/tools/tools/kernxref	2005-02-10 12:02:37 +00:00
Nate Lawson	2a05fbb949	Unroll the loop for calculating the 8.3 filename checksum. In testing on my P3, microbenchmarks show the unrolled version is 78x faster. In actual use (recursive ls), this gives an average of 9% improvement in system time and 2% improvement in wall time.	2005-02-08 07:51:14 +00:00
Poul-Henning Kamp	61f9cf813f	Remove vop_destroyvobject()	2005-02-07 09:23:34 +00:00
Poul-Henning Kamp	7bf4b73d6c	Deimplement vop_destroyvobject()	2005-02-07 08:23:36 +00:00
Poul-Henning Kamp	49829f2ec5	Remove vop_destroyvobject() initialization.	2005-02-07 08:04:24 +00:00
Peter Edwards	72b3e305af	Unbreak a few filesystems for which vnode_create_vobject() wasn't being called in "open", causing mmap() to fail. Where possible, pass size of file to vnode_create_vobject() rather than having it find it out the hard way via VOP_LOOKUP Reviewed by: phk	2005-01-29 16:23:39 +00:00
Poul-Henning Kamp	a369f34d76	Make filesystems get rid of their own vnodes vnode_pager object in VOP_RECLAIM().	2005-01-28 14:42:17 +00:00
Poul-Henning Kamp	d4eb29ba71	Remove unused argument to vrecycle()	2005-01-28 13:08:21 +00:00
Peter Edwards	174d6a9f73	Make NTFS at least minimally usable after bufobj and GEOM fallout. mmap() on NTFS files was hosed, returning pages offset from the start of the disk rather than the start of the file. (ie, "cp" of a 1-block file would get you a copy of the boot sector, not the data in the file.) The solution isn't ideal, but gives a functioning filesystem. Cached vnode lookup was also broken, resulting in vnode haemorrhage. A lookup on the same file twice would give you two vnodes, and the resulting cached pages. Just recently, mmap() was broken due to a lack of a call to vnode_create_vobject() in ntfs_open(). Discussed with: phk@	2005-01-27 13:50:27 +00:00
Poul-Henning Kamp	84a6975215	Introduce and use g_vfs_close().	2005-01-25 15:52:04 +00:00
Poul-Henning Kamp	729fcf7efb	Take VOP_GETVOBJECT() out to pasture. We use the direct pointer now.	2005-01-25 00:42:16 +00:00
Poul-Henning Kamp	69816ea35e	Kill VOP_CREATEVOBJECT(), it is now the responsibility of the filesystem for a given vnode to create a vnode_pager object if one is needed.	2005-01-25 00:12:24 +00:00
Poul-Henning Kamp	2a967a99c3	Don't implement vop_createvobject(), vop_open() and vop_close() manages this for nullfs now.	2005-01-24 23:54:45 +00:00
Poul-Henning Kamp	dcff5b1440	Don't call VOP_CREATEVOBJECT(), it's the responsibility of the filesystem which owns the vnode.	2005-01-24 23:53:54 +00:00
Poul-Henning Kamp	c683c4ee04	Add null_open() and null_close() which calls null_bypass() and managed the v_object pointer.	2005-01-24 22:56:24 +00:00
Poul-Henning Kamp	625d4bc03a	Create a vp->v_object in VFS_FHTOVP() if we want to be exportable with NFS. We are moving responsibility for creating the vnode_pager object into the filesystems which own the vnode, and this is one of the places we have to cover. We call vnode_create_vobject() directly because we own the vnode. If we can get the size easily, pass it as an argument to save the call to VOP_GETATTR() in vnode_create_vobject()	2005-01-24 21:51:19 +00:00
Poul-Henning Kamp	35764be39e	Kill the VV_OBJBUF and test the v_object for NULL instead.	2005-01-24 13:13:57 +00:00
Poul-Henning Kamp	d34dd851b8	Remove "register" keywords.	2005-01-24 12:37:51 +00:00
Poul-Henning Kamp	a515233f47	Style: Remove the commented out vop_foo_args replicas.	2005-01-24 11:49:41 +00:00
Poul-Henning Kamp	303793b564	whitespace nit	2005-01-19 09:07:56 +00:00
Poul-Henning Kamp	5873f57b29	Remove unused coda_fbsd_getpages()	2005-01-19 08:24:53 +00:00
Scott Long	a4d629e32d	Fix an incorrect cast. Submitted by: Andriy Gapon MFC-after: 3 days.	2005-01-18 10:15:23 +00:00
Scott Long	444acc1655	NULL-terminate the . and .. directory entries. Apparently some tools ignore d_namlen and assume that d_name is null-terminated. Submitted by: Andriy Gapon	2005-01-14 16:35:34 +00:00
Scott Long	43bc24bf5a	Replace the min() macro with a test that doesn't truncate the 64-bit values that are used. Thanks to Bruce Evans for pointing this out.	2005-01-14 16:24:31 +00:00
Poul-Henning Kamp	e50508df66	Eliminate unused and constant arguments to smbfs_vinvalbuf()	2005-01-14 08:52:55 +00:00
Poul-Henning Kamp	bf0063b87d	Eliminate constant and unused arguments to nwfs_vinvalbuf()	2005-01-14 08:09:42 +00:00
Poul-Henning Kamp	7c0745eeae	Eliminate unused and unnecessary "cred" argument from vinvalbuf()	2005-01-14 07:33:51 +00:00
Poul-Henning Kamp	83c6439714	Whitespace in vop_vector{} initializations.	2005-01-13 18:59:48 +00:00
Poul-Henning Kamp	e39db32ab0	Ditch vfs_object_create() and make the callers call VOP_CREATEVOBJECT() directly.	2005-01-13 12:25:19 +00:00
Poul-Henning Kamp	63f89abf4a	Change the generated VOP_ macro implementations to improve type checking and KASSERT coverage. After this check there is only one "nasty" cast in this code but there is a KASSERT to protect against the wrong argument structure behind that cast. Un-inlining the meat of VOP_FOO() saves 35kB of text segment on a typical kernel with no change in performance. We also now run the checking and tracing on VOP's which have been layered by nullfs, umapfs, deadfs or unionfs. Add new (non-inline) VOP_FOO_AP() functions which take a "struct foo_args" argument and does everything the VOP_FOO() macros used to do with checks and debugging code. Add KASSERT to VOP_FOO_AP() check for argument type being correct. Slim down VOP_FOO() inline functions to just stuff arguments into the struct foo_args and call VOP_FOO_AP(). Put function pointer to VOP_FOO_AP() into vop_foo_desc structure and make VCALL() use it instead of the current offsetoff() hack. Retire vcall() which implemented the offsetoff() Make deadfs and unionfs use VOP_FOO_AP() calls instead of VCALL(), we know which specific call we want already. Remove unneeded arguments to VCALL() in nullfs and umapfs bypass functions. Remove unused vdesc_offset and VOFFSET(). Generally improve style/readability of the generated code.	2005-01-13 07:53:01 +00:00
Scott Long	9d32fde894	Use off_t when passing and calculating file offsets. While a single extent in UDF is only 32 bits, multiple extents can exist in a file. Also clean up some minor whitespace problems. Submitted by: John Wehle	2005-01-12 06:42:13 +00:00
Scott Long	d1022c068e	Don't allow reads past the end of a file. Submitted by: John Wehle, Andriy Gapon MFC After: 3 days	2005-01-12 06:17:01 +00:00
Poul-Henning Kamp	7164e8f291	Silently ignore forced argument to unmount.	2005-01-11 12:02:26 +00:00
Poul-Henning Kamp	0391e5a151	Wrap the bufobj operations in macros: BO_STRATEGY() and BO_WRITE()	2005-01-11 09:10:46 +00:00
Poul-Henning Kamp	8df6bac4c7	Remove the unused credential argument from VOP_FSYNC() and VFS_SYNC(). I'm not sure why a credential was added to these in the first place, it is not used anywhere and it doesn't make much sense: The credentials for syncing a file (ability to write to the file) should be checked at the system call level. Credentials for syncing one or more filesystems ("none") should be checked at the system call level as well. If the filesystem implementation needs a particular credential to carry out the syncing it would logically have to the cached mount credential, or a credential cached along with any delayed write data. Discussed with: rwatson	2005-01-11 07:36:22 +00:00
Poul-Henning Kamp	b630d6f15a	whitespace	2005-01-10 13:09:33 +00:00
Robert Watson	f644bbc45c	Annotate that pfs_exit() always acquires and releases two mutexes for every process exist, even if procfs isn't mounted. And one of those mutexes is Giant. No immediate thoughts on fixing this.	2005-01-08 04:56:38 +00:00
Warner Losh	86cb007f9f	/* -> /*- for copyright notices, minor format tweaks as necessary	2005-01-06 22:18:23 +00:00
Warner Losh	d167cf6f3a	/* -> /*- for copyright notices, minor format tweaks as necessary	2005-01-06 18:10:42 +00:00
Warner Losh	5de2b5750c	Start each of the license/copyright comments with /*-	2005-01-05 23:35:00 +00:00
Poul-Henning Kamp	59f69ba49f	Unsupport forceful unmounts of DEVFS. After disscussing things I have decided to take the easy and consistent 90% solution instead of aiming for the very involved 99% solution. If we allow forceful unmounts of DEVFS we need to decide how to handle the devices which are in use through this filesystem at the time. We cannot just readopt the open devices in the main /dev instance since that would open us to security issues. For the majority of the devices, this is relatively straightforward as we can just pretend they got revoke(2)'ed. Some devices get tricky: /dev/console and /dev/tty for instance does a sort of recursive open of the real console device. Other devices may be mmap'ed (kill the processes ?). And then there are disk devices which are mounted. The correct thing here would be to recursively unmount the filesystems mounte from devices from our DEVFS instance (forcefully) and if this succeeds, complete the forcefully unmount of DEVFS. But if one of the forceful unmounts fail we cannot complete the forceful unmount of DEVFS, but we are likely to already have severed a lot of stuff in the process of trying. Event attempting this would be a lot of code for a very far out corner-case which most people would never see or get in touch with. It's just not worth it.	2005-01-04 07:52:26 +00:00
Poul-Henning Kamp	50a36c111f	Be consistent about flag values passed to device drivers read/write methods: Read can see O_NONBLOCK and O_DIRECT. Write can see O_NONBLOCK, O_DIRECT and O_FSYNC. In addition O_DIRECT is shadowed as IO_DIRECT for now for backwards compatibility.	2004-12-22 17:05:44 +00:00
Poul-Henning Kamp	10eee285f7	Shuffle numeric values of the IO_* flags to match the O_* flags from fcntl.h. This is in preparation for making the flags passed to device drivers be consistently from fcntl.h for all entrypoints. Today open, close and ioctl uses fcntl.h flags, while read and write uses vnode.h flags.	2004-12-22 16:25:50 +00:00
Poul-Henning Kamp	e87047b437	We can only ever get to vgonechrl() from a devfs vnode, so we do not need to reassign the vp->v_op to devfs_specops, we know that is the value already. Make devfs_specops private to devfs.	2004-12-20 21:34:29 +00:00
Poul-Henning Kamp	2c0220129d	Add a couple of KASSERTS to try to diagnose a problem reported.	2004-12-20 21:12:11 +00:00
Poul-Henning Kamp	2a9e0c3216	Be a bit more assertive about vnode bypass.	2004-12-14 09:32:18 +00:00
Suleiman Souhlal	3d96167a54	Exporting of NTFS filesystem broke in rev 1.70. Fix it. Approved by: phk, grehan (mentor)	2004-12-13 16:21:48 +00:00
Poul-Henning Kamp	5cb471d04d	Don't forget to bypass vnodes in corner cases. Found by: kkenn and ports/shell/zsh Thanks to: jeffr	2004-12-13 10:07:57 +00:00
Poul-Henning Kamp	1dc4727ea3	Another FNONBLOCK -> O_NONBLOCK. Don't unconditionally set IO_UNIT to device drivers in write: nobody checks it, and since it was always set it did not carry information anyway.	2004-12-13 07:41:19 +00:00
Poul-Henning Kamp	ab9caf9d67	Use O_NONBLOCK instead of FNONBLOCK alias.	2004-12-13 07:37:29 +00:00
Poul-Henning Kamp	f0d5cba935	Explicit panic in vop_read/vop_write for devices	2004-12-13 07:13:21 +00:00
Poul-Henning Kamp	dce357b112	Explicitly panic vop_read/vop_write on fifos.	2004-12-13 07:07:50 +00:00
Poul-Henning Kamp	e98fdc0d03	Don't deref NULL if no charset-conversion is specified. Return correct vnode in vop_bmap()	2004-12-12 12:02:34 +00:00
Poul-Henning Kamp	269c902f17	Handle MNT_UPDATE export requests first and return so we do not interpret the rest of the msdosfs_args structure. Detected by: marcel	2004-12-11 20:37:48 +00:00
Poul-Henning Kamp	708394ec72	typo	2004-12-11 12:45:24 +00:00
Poul-Henning Kamp	6366900a0f	First save from editor, then commit.	2004-12-07 15:25:36 +00:00
Poul-Henning Kamp	5c83b5551c	Fix exports.	2004-12-07 15:13:35 +00:00
Poul-Henning Kamp	20a92a18f1	The remaining part of nmount/omount/rootfs mount changes. I cannot sensibly split the conversion of the remaining three filesystems out from the root mounting changes, so in one go: cd9660: Convert to nmount. Add omount compat shims. Remove dedicated rootfs mounting code. Use vfs_mountedfrom() Rely on vfs_mount.c calling VFS_STATFS() nfs(client): Convert to nmount (the simple way, mount_nfs(8) is still necessary). Add omount compat shims. Drop COMPAT_PRELITE2 mount arg compatibility. ffs: Convert to nmount. Add omount compat shims. Remove dedicated rootfs mounting code. Use vfs_mountedfrom() Rely on vfs_mount.c calling VFS_STATFS() Remove vfs_omount() method, all filesystems are now converted. Remove MNTK_WANTRDWR, handling RO/RW conversions is a filesystem task, and they all do it now. Change rootmounting to use DEVFS trampoline: vfs_mount.c: Mount devfs on /. Devfs needs no 'from' so this is clean. symlink /dev to /. This makes it possible to lookup /dev/foo. Mount "real" root filesystem on /. Surgically move the devfs mountpoint from under the real root filesystem onto /dev in the real root filesystem. Remove now unnecessary getdiskbyname(). kern_init.c: Don't do devfs mounting and rootvnode assignment here, it was already handled by vfs_mount.c. Remove now unused bdevvp(), addaliasu() and addalias(). Put the few necessary lines in devfs where they belong. This eliminates the second-last source of bogo vnodes, leaving only the lemming-syncer. Remove rootdev variable, it doesn't give meaning in a global context and was not trustworth anyway. Correct information is provided by statfs(/).	2004-12-07 08:15:41 +00:00
Poul-Henning Kamp	def91cf267	Use vfs_mountedfrom(). Since VFS_STATFS() always calls the filesystem with mp->mnt_stat now, the vfs_statfs method is now a no-op. Explain this in a comment.	2004-12-06 20:52:46 +00:00
Poul-Henning Kamp	1a6cf6a3ad	Trust vfs_mount to call VFS_STATFS() on all mounts.	2004-12-06 20:31:36 +00:00
Poul-Henning Kamp	d14c8441e9	Convert to nmount. Add omount compat. Unpropagate the sm_args function into the runtime part.	2004-12-06 20:31:08 +00:00
Poul-Henning Kamp	bd50907c91	Convert to nmount. Add omount compat. Use vfs_mountedon(). Rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:23:51 +00:00
Poul-Henning Kamp	c4048cf07f	Convert to nmount. Add omount compat. Same comment about charset conversions apply. Use vfs_mountedfrom(). Rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:22:16 +00:00
Poul-Henning Kamp	55dca57ef2	Convert to nmount. Add backwards compat cmount method. Same comment as msdosfs applies: It would be nice if we had generic option names for charset conversions. Use vfs_mountefrom(). Rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:14:20 +00:00
Poul-Henning Kamp	526463736e	Convert nwfs to nmount, but take the low road: There is no way this is ever going to work without a dedicated mount_nwfs(8) program so simply stick struct nwfs_args into a nmount argument and leave it at that.	2004-12-06 20:11:56 +00:00
Alexander Kabaev	f6968c4a99	Fix a typo in PFS_TRACE. PR: kern/74461 Submitted by: Craig Rodrigues <rodrigc at crodrigues.org>	2004-12-06 20:07:17 +00:00
Poul-Henning Kamp	935ab476fa	ufs vfs_mountedon(), rely on vfs_mount.c calling VFS_STATFS()	2004-12-06 20:03:58 +00:00
Poul-Henning Kamp	7ab8c8c03c	Use vfs_mountedfrom(), rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:02:13 +00:00
Poul-Henning Kamp	a1f5fe1538	Use vfs_mountedfrom() and rely on vfs_mount.c to call VFS_STATFS()	2004-12-06 19:54:31 +00:00
Poul-Henning Kamp	7df2fc80f8	Convert coda to nmount.	2004-12-06 19:46:02 +00:00
Poul-Henning Kamp	6a4b48f488	Convert msdosfs to nmount. Add a vfs_cmount() function which converts omount argument stucture to nmount arguments. Convert vfs_omount() to vfs_mount() and parse nmount arguments. This is 100% compatible with existing userland. Later on, but before userland gets converted to nmount we may want to revisit the names of the mountoptions, for instance it may make sense to use consistent options for charset conversion etc.	2004-12-06 19:05:48 +00:00
Poul-Henning Kamp	bed8b887ea	Fix warning	2004-12-06 12:34:28 +00:00
Poul-Henning Kamp	743312367a	VFS_STATFS(mp, ...) is mostly called with &mp->mnt_stat, but a few cases doesn't. Most of the implementations have grown weeds for this so they copy some fields from mnt_stat if the passed argument isn't that. Fix this the cleaner way: Always call the implementation on mnt_stat and copy that in toto to the VFS_STATFS argument if different.	2004-12-05 22:41:02 +00:00
Poul-Henning Kamp	91e691c2d5	Remove embryonic rootfs mounting facility. In the near future rootfs mounting will not require special handling in the filesystems.	2004-12-04 09:57:38 +00:00
Poul-Henning Kamp	4b44037433	Remove the de_devvp and stop VREF'ing it for every vnode we create.	2004-12-02 10:09:33 +00:00
Poul-Henning Kamp	aec0fb7b40	Back when VOP_* was introduced, we did not have new-style struct initializations but we did have lofty goals and big ideals. Adjust to more contemporary circumstances and gain type checking. Replace the entire vop_t frobbing thing with properly typed structures. The only casualty is that we can not add a new VOP_ method with a loadable module. History has not given us reason to belive this would ever be feasible in the the first place. Eliminate in toto VOCALL(), vop_t, VNODEOP_SET() etc. Give coda correct prototypes and function definitions for all vop_()s. Generate a bit more data from the vnode_if.src file: a struct vop_vector and protype typedefs for all vop methods. Add a new vop_bypass() and make vop_default be a pointer to another struct vop_vector. Remove a lot of vfs_init since vop_vector is ready to use from the compiler. Cast various vop_mumble() to void * with uppercase name, for instance VOP_PANIC, VOP_NULL etc. Implement VCALL() by making vdesc_offset the offsetof() the relevant function pointer in vop_vector. This is disgusting but since the code is generated by a script comparatively safe. The alternative for nullfs etc. would be much worse. Fix up all vnode method vectors to remove casts so they become typesafe. (The bulk of this is generated by scripts)	2004-12-01 23:16:38 +00:00
Colin Percival	691b3b0df9	Fix unvalidated pointer dereference. This is FreeBSD-SA-04:17.procfs.	2004-12-01 21:33:02 +00:00
Poul-Henning Kamp	22408f729e	hpfs_lookup() should have a vop_cachedlookup_t prototype an corresponding argument.	2004-12-01 20:24:01 +00:00
Poul-Henning Kamp	0731e6dfb7	Correctly prototype union_write with vop_write_t, not vop_read_t.	2004-12-01 19:15:00 +00:00
Poul-Henning Kamp	6fde64c778	Mechanically change prototypes for vnode operations to use the new typedefs.	2004-12-01 12:24:41 +00:00
Poul-Henning Kamp	ce59d2149d	Ignore MNT_NODEV, it is implicit in choice of filesystem these days.	2004-11-26 07:37:42 +00:00
Poul-Henning Kamp	c96c1bebe3	Eliminate null_open() and use instead null_bypass(). Null_open() was only here to handle MNT_NODEV, but since that does not affect any filesystems anymore, it could only have any effect if you nullfs mounted a devfs but didn't want devices to show up. If you need that, there are easier ways.	2004-11-26 07:18:28 +00:00
Poul-Henning Kamp	964ebefd8d	Use system wide no-op vfs_start function.	2004-11-25 09:11:27 +00:00
Poul-Henning Kamp	75ad04b4f6	Add dropped implementation of ioctl for fifos.	2004-11-18 17:18:11 +00:00
Poul-Henning Kamp	003e18aef4	Make vnode bypass for fifos (read, write, poll) mandatory.	2004-11-17 07:30:02 +00:00
Poul-Henning Kamp	ea566ae2a5	Make vnode bypass for devices mandatory.	2004-11-17 07:18:49 +00:00
Poul-Henning Kamp	8352b1925d	Make vnode bypass the default for devices. Can be disabled in case of problems with vfs.devfs.fops=0 in loader.conf	2004-11-15 22:11:09 +00:00
Poul-Henning Kamp	d6d64f0f2c	Add file ops to fifofs so that we can bypass vnodes (and Giant) for the heavy-duty operations (read, write, poll/select, kqueue). Disabled for now, enable with "vfs.fifofs.fops=1" in loader.conf.	2004-11-15 14:51:44 +00:00
Poul-Henning Kamp	9c83534dd8	Make VOP_BMAP return a struct bufobj for the underlying storage device instead of a vnode for it. The vnode_pager does not and should not have any interest in what the filesystem uses for backend. (vfs_cluster doesn't use the backing store argument.)	2004-11-15 09:18:27 +00:00
Poul-Henning Kamp	49b7607eba	Integrate most of vop_revoke() into devfs_revoke() where it belongs.	2004-11-13 23:37:29 +00:00
Poul-Henning Kamp	aac5167c38	Add the devfs_fp_check() function which helps us get from a struct file to a cdev and a devsw, doing all the relevant checks along the way. Add the check to see if fp->f_vnode->v_rdev differs from our cached fp->f_data copy of our cdev. If it does the device was revoked and we return ENXIO.	2004-11-13 23:21:54 +00:00
Poul-Henning Kamp	ecbcedb99f	VOP_REVOKE() is only ever for VCHR vnodes, so unionfs does not need a vop_revoke() method.	2004-11-13 22:56:26 +00:00
Poul-Henning Kamp	1ecf144493	fifos doesn't need a vop_lookup, the default will do fine.	2004-11-13 18:51:13 +00:00
Poul-Henning Kamp	124e4c3be8	Introduce an alias for FILEDESC_{UN}LOCK() with the suffix _FAST. Use this in all the places where sleeping with the lock held is not an issue. The distinction will become significant once we finalize the exact lock-type to use for this kind of case.	2004-11-13 11:53:02 +00:00
Tom Rhodes	18192f69c7	Remove stale comment after previous commit. Noticed by: pjd	2004-11-09 23:19:21 +00:00
Poul-Henning Kamp	282d0382ac	Detect root mount attempts on the flag, not on the NULL path.	2004-11-09 22:21:52 +00:00
Poul-Henning Kamp	64042a76b6	Refuse attempts to mount root filesystem	2004-11-09 22:21:10 +00:00
Poul-Henning Kamp	b0aed5267e	Refuse attemps to mount root filesystem	2004-11-09 22:14:57 +00:00
Poul-Henning Kamp	56dd3a6182	Add optional device vnode bypass to DEVFS. The tunable vfs.devfs.fops controls this feature and defaults to off. When enabled (vfs.devfs.fops=1 in loader), device vnodes opened through a filedescriptor gets a special fops vector which instead of the detour through the vnode layer goes directly to DEVFS. Amongst other things this allows us to run Giant free read/write to device drivers which have been weaned off D_NEEDGIANT. Currently this means /dev/null, /dev/zero, disks, (and maybe the random stuff ?) On a 700MHz K7 machine this doubles the speed of dd if=/dev/zero of=/dev/null bs=1 count=1000000 This roughly translates to shaving 2usec of each read/write syscall. The poll/kqfilter paths need more work before they are giant free, this work is ongoing in p4::phk_bufwork Please test this and report any problems, LORs etc.	2004-11-08 10:46:47 +00:00
Poul-Henning Kamp	5349c79d75	Properly implement a default version of VOP_GETWRITEMOUNT. Remove improper access to vop_stdgetwritemount() which should and will instead rely on the VOP default path.	2004-11-06 11:41:22 +00:00
Poul-Henning Kamp	ecc14aae12	Add back securelevel check for disks. XXX: This should live in geom_dev.c but we don't have access to the cred there. XXX: XXX: This may not matter anymore since filesystems use geom_vfs.	2004-11-04 09:17:55 +00:00
Poul-Henning Kamp	c7aaa71ce3	s/ffs/ntfs/ Fix error handling to not use VOP_CLOSE() on the disk. Spotted by: tegge	2004-11-04 07:18:54 +00:00
Poul-Henning Kamp	e1c6cbef33	Make a more whole-hearted attempt at GEOM'ifying NTFS. I must have been sleepy when I did the first pass. Spotted by: tegge	2004-11-03 21:36:41 +00:00
Poul-Henning Kamp	4cea3289da	Don't give disks special treatment, they don't come this way anymore.	2004-10-29 11:10:55 +00:00
Poul-Henning Kamp	c108bb741c	Remove VOP_SPECSTRATEGY() from the system.	2004-10-29 10:59:28 +00:00
Poul-Henning Kamp	5cdfa40c6b	Move NTFS to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:43:45 +00:00
Poul-Henning Kamp	a96d2ea768	Move HPFS to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:43:07 +00:00
Poul-Henning Kamp	bf7e2ae1c4	Move CD9660 to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:41:44 +00:00
Poul-Henning Kamp	429c018a9f	Move UDF to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:40:58 +00:00
Poul-Henning Kamp	9a135592e2	Move MSDOSFS to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:40:14 +00:00
Poul-Henning Kamp	6afb3b1c37	Give dev_strategy() an explict cdev argument in preparation for removing buf->b-dev. Put a bio between the buf passed to dev_strategy() and the device driver strategy routine in order to not clobber fields in the buf. Assert copyright on vfs_bio.c and update copyright message to canonical text. There is no legal difference between John Dysons two-clause abbreviated BSD license and the canonical text.	2004-10-29 07:16:37 +00:00
Poul-Henning Kamp	f00f5d71c2	Reduce the locking activity by epsilon by checking VNON condition before releasing the mountlock.	2004-10-28 08:22:11 +00:00
Poul-Henning Kamp	45628dd373	What can I say: don't allow people to mount DEVFS with option "nodev".	2004-10-28 06:03:25 +00:00
Poul-Henning Kamp	d83b7498a4	Eliminate unnecessary KASSERTs. Don't use bp->b_vp in VOP_STRATEGY: the vnode is passed in as an argument.	2004-10-27 06:48:21 +00:00
Poul-Henning Kamp	5d9d81e7ea	Put the I/O block size in bufobj->bo_bsize. We keep si_bsize_phys around for now as that is the simplest way to pull the number out of disk device drivers in devfs_open(). The correct solution would be to do an ioctl(DIOCGSECTORSIZE), but the point is probably mooth when filesystems sit on GEOM, so don't bother for now.	2004-10-26 07:39:12 +00:00
Poul-Henning Kamp	156cb26583	Loose the v_dirty* and v_clean* alias macros. Check the count field where we just want to know the full/empty state, rather than using TAILQ_EMPTY() or TAILQ_FIRST().	2004-10-25 09:14:03 +00:00
Poul-Henning Kamp	ff7c5a4880	Alas, poor SPECFS! -- I knew him, Horatio; A filesystem of infinite jest, of most excellent fancy: he hath taught me lessons a thousand times; and now, how abhorred in my imagination it is! my gorge rises at it. Here were those hacks that I have curs'd I know not how oft. Where be your kludges now? your workarounds? your layering violations, that were wont to set the table on a roar? Move the skeleton of specfs into devfs where it now belongs and bury the rest.	2004-10-22 09:59:37 +00:00
John Baldwin	78c85e8dfc	Rework how we store process times in the kernel such that we always store the raw values including for child process statistics and only compute the system and user timevals on demand. - Fix the various kern_wait() syscall wrappers to only pass in a rusage pointer if they are going to use the result. - Add a kern_getrusage() function for the ABI syscalls to use so that they don't have to play stackgap games to call getrusage(). - Fix the svr4_sys_times() syscall to just call calcru() to calculate the times it needs rather than calling getrusage() twice with associated stackgap, etc. - Add a new rusage_ext structure to store raw time stats such as tick counts for user, system, and interrupt time as well as a bintime of the total runtime. A new p_rux field in struct proc replaces the same inline fields from struct proc (i.e. p_[isu]ticks, p_[isu]u, and p_runtime). A new p_crux field in struct proc contains the "raw" child time usage statistics. ruadd() has been changed to handle adding the associated rusage_ext structures as well as the values in rusage. Effectively, the values in rusage_ext replace the ru_utime and ru_stime values in struct rusage. These two fields in struct rusage are no longer used in the kernel. - calcru() has been split into a static worker function calcru1() that calculates appropriate timevals for user and system time as well as updating the rux_[isu]u fields of a passed in rusage_ext structure. calcru() uses a copy of the process' p_rux structure to compute the timevals after updating the runtime appropriately if any of the threads in that process are currently executing. It also now only locks sched_lock internally while doing the rux_runtime fixup. calcru() now only requires the caller to hold the proc lock and calcru1() only requires the proc lock internally. calcru() also no longer allows callers to ask for an interrupt timeval since none of them actually did. - calcru() now correctly handles threads executing on other CPUs. - A new calccru() function computes the child system and user timevals by calling calcru1() on p_crux. Note that this means that any code that wants child times must now call this function rather than reading from p_cru directly. This function also requires the proc lock. - This finishes the locking for rusage and friends so some of the Giant locks in exit1() and kern_wait() are now gone. - The locking in ttyinfo() has been tweaked so that a shared lock of the proctree lock is used to protect the process group rather than the process group lock. By holding this lock until the end of the function we now ensure that the process/thread that we pick to dump info about will no longer vanish while we are trying to output its info to the console. Submitted by: bde (mostly) MFC after: 1 month	2004-10-05 18:51:11 +00:00
Takanori Watanabe	6e4c3467ce	Minor Bug fix. Some file was not translated.	2004-10-05 16:53:37 +00:00
Takanori Watanabe	919f5630ec	Fix unionfs problems when a directory is mounted on other directory with different file systems. This may cause ill things with my previous fix. Now it translate fsid of direct child of mount point directory only. Pointed out by: Uwe Doering	2004-10-05 05:59:29 +00:00
Takanori Watanabe	d354520ebc	Fix a problem when you try to mount a directory on another directory belongs to the same filesystem. In this problem, getcwd(3) will fail. I found the problem two years ago and I have forgotten to merge. http://docs.FreeBSD.org/cgi/mid.cgi?200202251435.XAA91094	2004-10-02 17:17:04 +00:00
David Schultz	616b5f90d3	Don't PHOLD() the target process in procfs, since this is already done in pseudofs. Moreover, PHOLD() may block between the p_candebug() access check and the actual operation.	2004-10-01 05:01:17 +00:00
Poul-Henning Kamp	891822a853	XXX mark two places where we do not hold a threadcount on the dev when frobbing the cdevsw. In both cases we examine only the cdevsw and it is a good question if we weren't better off copying those properties into the cdev in the first place. This question will be revisited.	2004-09-24 08:32:36 +00:00
Poul-Henning Kamp	9bd188b936	Hold proper thread count while frobbing drivers ioctl.	2004-09-24 07:24:02 +00:00
Poul-Henning Kamp	bd8a0d70f4	Remove devsw() call missed in last commit.	2004-09-24 07:08:33 +00:00
Poul-Henning Kamp	5ef8cac184	Use def_re[fl]thread(). Retire various old compatibility helpers.	2004-09-24 05:58:06 +00:00
Poul-Henning Kamp	1a52a73d68	Eliminate DEV_STRATEGY() macro: call dev_strategy() directly. Make dev_strategy() handle errors and departing devices properly.	2004-09-23 14:45:04 +00:00
Poul-Henning Kamp	d0c90fe668	Do not use devsw() but si_devsw direction. This is still bogus but a fair bit less so.	2004-09-23 12:19:24 +00:00
Poul-Henning Kamp	a0e78d2eb0	Do not refcount the cdevsw, but rather maintain a cdev->si_threadcount of the number of threads which are inside whatever is behind the cdevsw for this particular cdev. Make the device mutex visible through dev_lock() and dev_unlock(). We may want finer granularity later. Replace spechash_mtx use with dev_lock()/dev_unlock().	2004-09-23 07:17:41 +00:00
Poul-Henning Kamp	bc710003ac	Pointy hat please! Refuse VCHR not VREG.	2004-09-22 18:18:26 +00:00
Poul-Henning Kamp	a367987828	De support opening device nodes on CD9660 filesystems. They are still visible, they can still be seen, but they cannot be opened. Use DEVFS for that.	2004-09-21 08:42:37 +00:00
Poul-Henning Kamp	d705e025d0	The getpages VOP was a good stab at getting scatter/gather I/O without too much kernel copying, but it is not the right way to do it, and it is in the way for straightening out the buffer cache. The right way is to pass the VM page array down through the struct bio to the disk device driver and DMA directly in to/out off the physical memory. Once the VM/buf thing is sorted out it is next on the list. Retire most of vnode method. ffs_getpages(). It is not clear if what is left shouldn't be in the default implementation which we now fall back to. Retire specfs_getpages() as well, as it has no users now.	2004-09-19 08:14:55 +00:00
Poul-Henning Kamp	08dbd671ff	Remove unused B_WRITEINPROG flag	2004-09-15 21:49:22 +00:00
Poul-Henning Kamp	883d3c0c07	Remove the buffercache/vnode side of BIO_DELETE processing in preparation for integration of p4::phk_bufwork. In the future, local filesystems will talk to GEOM directly and they will consequently be able to issue BIO_DELETE directly. Since the removal of the fla driver, BIO_DELETE has effectively been a no-op anyway.	2004-09-13 06:50:42 +00:00
Tim J. Robbins	d676af371d	Reduce the size of struct defid's defid_dirclust, defid_dirofs and (disabled) defid_gen members from u_long to u_int32_t so that alignment requirements don't cause the structure to become larger than struct fid on LP64 platforms. This fixes NFS exports of msdos filesystems on at least amd64. PR: 71173	2004-09-08 13:03:19 +00:00
Tim J. Robbins	6a5bf04a5b	Merge from NetBSD: Fix a problem in previous: we can't blindly assume that we have wincnt entries available at the offset the file has been found. If the dos directory entry is not preceded by appropriate number of long name entries (happens e.g. when the filesystem is corrupted, or when the filename complies to DOS rules and doesn't use any long name entry), we would overwrite random directory entries. There are still some problems, the whole thing has to be revisited and solved right. Submitted by: Xin LI	2004-09-08 11:25:41 +00:00
Tim J. Robbins	d23af19a71	Merge from NetBSD: Fix a panic that occurred when trying to traverse a corrupt msdosfs filesystem. With this particular corruption, the code in pcbmap() would compute an offset into an array that was way out of bounds, so check the bounds before trying to access and return an error if the offset would be out of bounds. Submitted by: Xin LI	2004-09-08 10:57:09 +00:00
Poul-Henning Kamp	1affa3adc8	Create simple function init_va_filerev() for initializing a va_filerev field. Replace three instances of longhaired initialization va_filerev fields. Added XXX comment wondering why we don't use random bits instead of uptime of the system for this purpose.	2004-09-07 09:17:05 +00:00
Poul-Henning Kamp	066a8fea81	Explicitly pass vnode to smbfs_doio() function.	2004-09-07 08:53:28 +00:00
Poul-Henning Kamp	7ee3985c57	Explicitly pass the vnode to the nw_doio() function.	2004-09-07 08:53:03 +00:00
Tim J. Robbins	82c0aec8de	Temporarily back out revision 1.77. This changed cd9660_getattr() and cd9660_readdir() to return the address of the file's first data block as the inode number instead of the address of the directory entry, but neglected to update cd9660_vget_internal() for the new inode numbering scheme. Since the NFS server calls VFS_VGET (cd9660_vget()) with inode numbers returned through VOP_READDIR (cd9660_readdir()) when servicing a READDIRPLUS request, these two interfaces must agree on the numbering scheme; failure to do so caused panics and/or bogus information about the entries to be returned to clients using READDIRPLUS (Solaris, FreeBSD w/ mount -o rdirplus). PR: 63446	2004-09-05 11:18:53 +00:00
Robert Watson	10b7196db4	Back out pseudo_vnops.c:1.45, which was a workaround for pfind() returning incompletely initialized processes. This problem was eliminated by kern_proc.c:1.215, which causes pfind() not to return processes in the PRS_NEW state.	2004-09-02 16:04:09 +00:00
Brooks Davis	b443062227	General modernization of coda: - Ditch NVCODA - Don't use a static major - Don't declare functions extern Reviewed by: peter	2004-09-01 01:19:52 +00:00
Peter Wemm	f37a929ca1	Kill count device support from config. I've changed the last few remaining consumers to have the count passed as an option. This is i4b, pc98/wdc, and coda. Bump configvers.h from 500013 to 600000. Remove heuristics that tried to parse "device ed5" as 5 units of the ed device. This broke things like the snd_emu10k1 device, which required quotes to make it parse right. The no-longer-needed quotes have been removed from NOTES, GENERIC etc. eg, I've removed the quotes from: device snd_maestro device "snd_maestro3" device snd_mss I believe everything will still compile and work after this.	2004-08-30 23:03:58 +00:00
Tim J. Robbins	db575a8507	Remove bogus vrele() call added in previous.	2004-08-27 11:24:31 +00:00
Tim J. Robbins	1a9415af17	Improve the robustness of MSDOSFSMNT_KICONV handling: - Use copyinstr() to read cs_win, cs_dos, cs_local strings from the mount argument structure instead of reading through user-space pointers(!). - When mounting a filesystem, or updating an existing mount, only try to update the iconv handles from the information in the mount argument structure if the structure itself has the MSDOSFSMNT_KICONV flag set. - Attempt to handle failure of update_mp() in the MNT_UPDATE case.	2004-08-26 13:16:44 +00:00
Dag-Erling Smørgrav	c9b9a82654	Release the vnode cache mutex when calling vgone(), since vgone() may sleep. This makes pfs_exit() even less efficient than before, but on the bright side, the vnode cache mutex no longer needs to be recursive.	2004-08-15 21:58:02 +00:00
John-Mark Gurney	ad3b9257c2	Add locking to the kqueue subsystem. This also makes the kqueue subsystem a more complete subsystem, and removes the knowlege of how things are implemented from the drivers. Include locking around filter ops, so a module like aio will know when not to be unloaded if there are outstanding knotes using it's filter ops. Currently, it uses the MTX_DUPOK even though it is not always safe to aquire duplicate locks. Witness currently doesn't support the ability to discover if a dup lock is ok (in some cases). Reviewed by: green, rwatson (both earlier versions)	2004-08-15 06:24:42 +00:00
Robert Watson	d990378077	Commit a work-around for a more general bug involving process state: check whether p_ucred is NULL or not in pfs_getattr() before dereferencing the credential, and return ENOENT if there wasn't one. This is a symptom of a larger problem, wherein pfind() can return references to incompletely initialized processes, and we instead ought to not return them, or check the process state before acting on the process. Reported by: kris Discussed with: tjr, others	2004-08-13 20:27:56 +00:00
Poul-Henning Kamp	7ac439fec4	use bufdone() not biodone().	2004-08-08 13:23:05 +00:00
Poul-Henning Kamp	a8687125e5	Use bufdone(), not biodone().	2004-08-08 13:20:43 +00:00
Poul-Henning Kamp	e83f142353	Push all changes to disk before downgrading a mount from rw to ro.	2004-08-07 22:05:12 +00:00
Poul-Henning Kamp	5e8c582ac2	Put a version element in the VFS filesystem configuration structure and refuse initializing filesystems with a wrong version. This will aid maintenance activites on the 5-stable branch. s/vfs_mount/vfs_omount/ s/vfs_nmount/vfs_mount/ Name our filesystems mount function consistently. Eliminate the namiedata argument to both vfs_mount and vfs_omount. It was originally there to save stack space. A few places abused it to get hold of some credentials to pass around. Effectively it is unused. Reorganize the root filesystem selection code.	2004-07-30 22:08:52 +00:00
Poul-Henning Kamp	d634f69316	Remove global variable rootdevs and rootvp, they are unused as such. Add local rootvp variables as needed. Remove checks for miniroot's in the swappartition. We never did that and most of the filesystems could never be used for that, but it had still been copy&pasted all over the place.	2004-07-28 20:21:04 +00:00
Alexander Kabaev	1f74490224	Avoid casts as lvalues.	2004-07-28 06:30:43 +00:00
Alexander Kabaev	fa2edf7bcd	Avoid casts as lvalues.	2004-07-28 06:05:41 +00:00
Colin Percival	56f21b9d74	Rename suser_cred()'s PRISON_ROOT flag to SUSER_ALLOWJAIL. This is somewhat clearer, but more importantly allows for a consistent naming scheme for suser_cred flags. The old name is still defined, but will be removed in a few days (unless I hear any complaints...) Discussed with: rwatson, scottl Requested by: jhb	2004-07-26 07:24:04 +00:00
Robert Watson	de592112e1	In devfs_allocv(), rather than assigning 'td = curthread', assert that the caller passes in a td that is curthread, and consistently pass 'td' into vget(). Remove some bogus logic that passed in td or curthread conditional on td being non-NULL, which seems redundant in the face of the earlier assignment of td to curthread if td is NULL. In devfs_symlink(), cache the passed thread in 'td' so we don't have to keep retrieving it from the 'ap' structure, and assert that td is curthread (since we dereference it to get thread-local td_ucred). Use 'td' in preference to curthread for later lockmgr calls, since they are equal.	2004-07-22 17:03:14 +00:00
Poul-Henning Kamp	3e019deaed	Do a pass over all modules in the kernel and make them return EOPNOTSUPP for unknown events. A number of modules return EINVAL in this instance, and I have left those alone for now and instead taught MOD_QUIESCE to accept this as "didn't do anything".	2004-07-15 08:26:07 +00:00
Poul-Henning Kamp	d521eae21d	Another LINT compilation fix	2004-07-13 09:47:27 +00:00
Poul-Henning Kamp	a546742dd7	Make LINT compile	2004-07-13 09:46:46 +00:00
Robert Watson	ee3c585ae5	Remove 'td = curthread' that shadows the arguments to coda_root(). Missed by: alfred	2004-07-12 14:11:26 +00:00
Alfred Perlstein	f257b7a54b	Make VFS_ROOT() and vflush() take a thread argument. This is to allow filesystems to decide based on the passed thread which vnode to return. Several filesystems used curthread, they now use the passed thread.	2004-07-12 08:14:09 +00:00
Marcel Moolenaar	e2fb7a7be0	Update for the KDB framework: o Call kdb_enter() instead of Debugger().	2004-07-10 21:21:13 +00:00
Marcel Moolenaar	4ea4f1f97e	Update for the KDB framework: o Call kdb_enter() instead of Debugger(). o Make debugging code conditional upon KDB instead of DDB.	2004-07-10 21:20:11 +00:00
Dag-Erling Smørgrav	195a6b21e4	Accumulate directory entries in a fixed-length sbuf, and uiomove them in one go before returning. This avoids calling uiomove() while holding allproc_lock. Don't adjust uio->uio_offset manually, uiomove() does that for us. Don't drop allproc_lock before calling panic(). Suggested by: alfred	2004-07-09 11:43:37 +00:00
Poul-Henning Kamp	e3c5a7a4dd	When we traverse the vnodes on a mountpoint we need to look out for our cached 'next vnode' being removed from this mountpoint. If we find that it was recycled, we restart our traversal from the start of the list. Code to do that is in all local disk filesystems (and a few other places) and looks roughly like this: MNT_ILOCK(mp); loop: for (vp = TAILQ_FIRST(&mp...); (vp = nvp) != NULL; nvp = TAILQ_NEXT(vp,...)) { if (vp->v_mount != mp) goto loop; MNT_IUNLOCK(mp); ... MNT_ILOCK(mp); } MNT_IUNLOCK(mp); The code which takes vnodes off a mountpoint looks like this: MNT_ILOCK(vp->v_mount); ... TAILQ_REMOVE(&vp->v_mount->mnt_nvnodelist, vp, v_nmntvnodes); ... MNT_IUNLOCK(vp->v_mount); ... vp->v_mount = something; (Take a moment and try to spot the locking error before you read on.) On a SMP system, one CPU could have removed nvp from our mountlist but not yet gotten to assign a new value to vp->v_mount while another CPU simultaneously get to the top of the traversal loop where it finds that (vp->v_mount != mp) is not true despite the fact that the vnode has indeed been removed from our mountpoint. Fix: Introduce the macro MNT_VNODE_FOREACH() to traverse the list of vnodes on a mountpoint while taking into account that vnodes may be removed from the list as we go. This saves approx 65 lines of duplicated code. Split the insmntque() which potentially moves a vnode from one mount point to another into delmntque() and insmntque() which does just what the names say. Fix delmntque() to set vp->v_mount to NULL while holding the mountpoint lock.	2004-07-04 08:52:35 +00:00
Poul-Henning Kamp	bffd1b7af4	Remove "register" keyword and trailing white space.	2004-07-03 16:56:45 +00:00
Tim J. Robbins	3bc482ec1c	By popular request, add a workaround that allows large (>128GB or so) FAT32 filesystems to be mounted, subject to some fairly serious limitations. This works by extending the internal pseudo-inode-numbers generated from the file's starting cluster number to 64-bits, then creating a table mapping these into arbitrary 32-bit inode numbers, which can fit in struct dirent's d_fileno and struct vattr's va_fileid fields. The mappings do not persist across unmounts or reboots, so it's not possible to export these filesystems through NFS. The mapping table may grow to be rather large, and may grow large enough to exhaust kernel memory on filesystems with millions of files. Don't enable this option unless you understand the consequences.	2004-07-03 13:22:38 +00:00
Robert Watson	1cf30f57e1	Remove spls from portal_open(). Acquire socket lock while sleeping waiting for the socket to connect and use msleep() on the socket mute rather than tsleep(). Acquire socket buffer mutexes around read-modify-write of socket buffer flags.	2004-06-24 00:47:23 +00:00
Scott Long	bf1c3ddd68	Make the udf_vnops side endian clean.	2004-06-23 21:49:03 +00:00
Scott Long	89ec2c3c42	First half of making UDF be endian-clean. This addresses the vfsops side.	2004-06-23 19:36:09 +00:00
Bruce Evans	587a4462c9	Include <sys/mutex.h> and its prerequisite <sys/lock.h> instead of depending on namespace pollution in <sys/vnode.h> for the definition of mutex interfaces used in SOCKBUF_*LOCK(). Sorted includes. Removed unused includes.	2004-06-23 06:47:49 +00:00
Robert Watson	7d84f9d293	Remove unlocked read annotation for sbspace(); the read is locked.	2004-06-23 00:35:50 +00:00
Poul-Henning Kamp	9d96090725	Reduce a fair bit of the atomics because we are now called with a lock from kern_conf.c and cdev's act a lot more like real objects these days.	2004-06-18 08:08:47 +00:00
Robert Watson	c012260726	Merge some additional leaf node socket buffer locking from rwatson_netperf: Introduce conditional locking of the socket buffer in fifofs kqueue filters; KNOTE() will be called holding the socket buffer locks in fifofs, but sometimes the kqueue() system call will poll using the same entry point without holding the socket buffer lock. Introduce conditional locking of the socket buffer in the socket kqueue filters; KNOTE() will be called holding the socket buffer locks in the socket code, but sometimes the kqueue() system call will poll using the same entry points without holding the socket buffer lock. Simplify the logic in sodisconnect() since we no longer need spls. NOTE: To remove conditional locking in the kqueue filters, it would make sense to use a separate kqueue API entry into the socket/fifo code when calling from the kqueue() system call.	2004-06-18 02:57:55 +00:00
Robert Watson	9535efc00d	Merge additional socket buffer locking from rwatson_netperf: - Lock down low hanging fruit use of sb_flags with socket buffer lock. - Lock down low hanging fruit use of so_state with socket lock. - Lock down low hanging fruit use of so_options. - Lock down low-hanging fruit use of sb_lowwat and sb_hiwat with socket buffer lock. - Annotate situations in which we unlock the socket lock and then grab the receive socket buffer lock, which are currently actually the same lock. Depending on how we want to play our cards, we may want to coallesce these lock uses to reduce overhead. - Convert a if()->panic() into a KASSERT relating to so_state in soaccept(). - Remove a number of splnet()/splx() references. More complex merging of socket and socket buffer locking to follow.	2004-06-17 22:48:11 +00:00
Poul-Henning Kamp	f3732fd15b	Second half of the dev_t cleanup. The big lines are: NODEV -> NULL NOUDEV -> NODEV udev_t -> dev_t udev2dev() -> findcdev() Various minor adjustments including handling of userland access to kernel space struct cdev etc.	2004-06-17 17:16:53 +00:00
Poul-Henning Kamp	89c9c53da0	Do the dreaded s/dev_t/struct cdev */ Bump __FreeBSD_version accordingly.	2004-06-16 09:47:26 +00:00
Julian Elischer	fa88511615	Nice, is a property of a process as a whole.. I mistakenly moved it to the ksegroup when breaking up the process structure. Put it back in the proc structure.	2004-06-16 00:26:31 +00:00
Robert Watson	7721f5d760	Grab the socket buffer send or receive mutex when performing a read-modify-write on the sb_state field. This commit catches only the "easy" ones where it doesn't interact with as yet unmerged locking.	2004-06-15 03:51:44 +00:00
Robert Watson	c0b99ffa02	The socket field so_state is used to hold a variety of socket related flags relating to several aspects of socket functionality. This change breaks out several bits relating to send and receive operation into a new per-socket buffer field, sb_state, in order to facilitate locking. This is required because, in order to provide more granular locking of sockets, different state fields have different locking properties. The following fields are moved to sb_state: SS_CANTRCVMORE (so_state) SS_CANTSENDMORE (so_state) SS_RCVATMARK (so_state) Rename respectively to: SBS_CANTRCVMORE (so_rcv.sb_state) SBS_CANTSENDMORE (so_snd.sb_state) SBS_RCVATMARK (so_rcv.sb_state) This facilitates locking by isolating fields to be located with other identically locked fields, and permits greater granularity in socket locking by avoiding storing fields with different locking semantics in the same short (avoiding locking conflicts). In the future, we may wish to coallesce sb_state and sb_flags; for the time being I leave them separate and there is no additional memory overhead due to the packing/alignment of shorts in the socket buffer structure.	2004-06-14 18:16:22 +00:00
Don Lewis	866046f5a6	Add MSG_NBIO flag option to soreceive() and sosend() that causes them to behave the same as if the SS_NBIO socket flag had been set for this call. The SS_NBIO flag for ordinary sockets is set by fcntl(fd, F_SETFL, O_NONBLOCK). Pass the MSG_NBIO flag to the soreceive() and sosend() calls in fifo_read() and fifo_write() instead of frobbing the SS_NBIO flag on the underlying socket for each I/O operation. The O_NONBLOCK flag is a property of the descriptor, and unlike ordinary sockets, fifos may be referenced by multiple descriptors.	2004-06-01 01:18:51 +00:00
Poul-Henning Kamp	5dba30f15a	add missing #include <sys/module.h>	2004-05-30 20:27:19 +00:00
Don Lewis	2526dc2b61	Switch from using the vnode interlock to a private mutex in fifo_open() to avoid lock order problems when manipulating the sockets associated with the fifo. Minor optimization of a couple of calls to fifo_cleanup() from fifo_open().	2004-05-17 20:16:40 +00:00
Alan Cox	5a32489377	Make vm_page's PG_ZERO flag immutable between the time of the page's allocation and deallocation. This flag's principal use is shortly after allocation. For such cases, clearing the flag is pointless. The only unusual use of PG_ZERO is in vfs_bio_clrbuf(). However, allocbuf() never requests a prezeroed page. So, vfs_bio_clrbuf() never sees a prezeroed page. Reviewed by: tegge@	2004-05-06 05:03:23 +00:00
Poul-Henning Kamp	bc20ced763	Do not drop Giant around the poll method yet, we're not ready for it.	2004-04-12 21:52:52 +00:00
Warner Losh	f36cfd49ad	Remove advertising clause from University of California Regent's license, per letter dated July 22, 1999 and email from Peter Wemm, Alan Cox and Robert Watson. Approved by: core, peter, alc, rwatson	2004-04-07 20:46:16 +00:00
Pawel Jakub Dawidek	c5b7c33bc8	Remove ps_argsopen from this check, because of two reasons: 1. This check if wrong, because it is true by default (kern.ps_argsopen is 1 by default) (p_cansee() is not even checked). 2. Sysctl kern.ps_argsopen is going away.	2004-04-01 00:04:23 +00:00
Robert Watson	db48c0d254	Export uipc_connect2() from uipc_usrreq.c instead of unp_connect2(), and consume that interface in portalfs and fifofs instead. In the new world order, unp_connect2() assumes that the unpcb mutex is held, whereas uipc_connect2() validates that the passed sockets are UNIX domain sockets, then grabs the mutex. NB: the portalfs and fifofs code gets down and dirty with UNIX domain sockets. Maybe this is a bad thing.	2004-03-31 01:41:30 +00:00
Scott Long	744bb56d95	Catch all cases where bread() returns an error and a valid bp, and release the bp. Obtained from: DragonFlyBSD	2004-03-30 05:01:48 +00:00
Peter Wemm	b21126c6b3	Clean up the stub fake vnode locking implemenations. The main reason this stuff was here (NFS) was fixed by Alfred in November. The only remaining consumer of the stub functions was umapfs, which is horribly horribly broken. It has missed out on about the last 5 years worth of maintenence that was done on nullfs (from which umapfs is derived). It needs major work to bring it up to date with the vnode locking protocol. umapfs really needs to find a caretaker to bring it into the 21st century. Functions GC'ed: vop_noislocked, vop_nolock, vop_nounlock, vop_sharedlock.	2004-03-29 22:41:21 +00:00
Robert Watson	0ff34b5e1c	Don't reject FAT file systems with a number of "Heads" greater than 255; USB keychains exist that use 256 as the number of heads. This check has also been removed in Darwin (along with most of the other head/sector sanity checks).	2004-03-14 23:28:12 +00:00
Brian Feldman	6fedf94775	When taking event callbacks (like process_exit) out from under Giant, those which do not lock Giant themselves will be exposed. Unbreak pfs_exit().	2004-03-14 15:57:45 +00:00
Poul-Henning Kamp	ceb58ca58f	When I was a kid my work table was one cluttered mess an cleaning it up were a rather overwhelming task. I soon learned that if you don't know where you're going to store something, at least try to pile it next to something slightly related in the hope that a pattern emerges. Apply the same principle to the ffs/snapshot/softupdates code which have leaked into specfs: Add yet a buf-quasi-method and call it from the only two places I can see it can make a difference and implement the magic in ffs_softdep.c where it belongs. It's not pretty, but at least it's one less layer violated.	2004-03-11 18:50:33 +00:00
Poul-Henning Kamp	651b11eaf2	Remove unused second arg to vfinddev(). Don't call addaliasu() on VBLK nodes.	2004-03-11 16:33:11 +00:00
Poul-Henning Kamp	39a78f8cf4	Don't call devsw() more than we need to, and in particular do not expose ourselves to device removal by not checking for it the second time. Use count_dev(dev) rather than vcount(vp)	2004-03-10 20:56:28 +00:00
Scott Long	c80a90c51f	Change __FUNCTION__ to __func__ Submitted by: Stefan Farfeleder	2004-03-03 07:43:34 +00:00
Robert Watson	746e5bf09b	Rename dup_sockaddr() to sodupsockaddr() for consistency with other functions in kern_socket.c. Rename the "canwait" field to "mflags" and pass M_WAITOK and M_NOWAIT in from the caller context rather than "1" or "0". Correct mflags pass into mac_init_socket() from previous commit to not include M_ZERO. Submitted by: sam	2004-03-01 03:14:23 +00:00
Poul-Henning Kamp	ad3917e8e6	Do not attempt to open NODEV	2004-02-24 09:59:35 +00:00
Tim J. Robbins	f71cc8681b	Fix comment containing vop_readdir_args contents: a_cookies is really u_long ** not u_long *.	2004-02-23 01:43:00 +00:00
Tim J. Robbins	224f219b31	cookies is an array of u_long, not u_int, so MALLOC() it accordingly. Allocating it with the wrong size could have caused corruption on 64-bit architectures.	2004-02-23 01:40:46 +00:00
Bruce Evans	0ef0dd6f85	Fixed a serious off by 1 error. The cluster-in-use bitmap was overrun by 1 u_int if the number of clusters was 1 more than a multiple of (8 * sizeof(u_int)). The bitmap is malloced and large (often huge), so fatal overrun probably only occurred if the number of clusters was 1 more than 1 multiple of PAGE_SIZE/8.	2004-02-21 22:47:19 +00:00

... 3 4 5 6 7 ...

1845 Commits