freebsd-dev

Author	SHA1	Message	Date
Poul-Henning Kamp	3c925ad2aa	Ditch crummy fattime <--> timespec conversion functions	2006-10-24 11:55:18 +00:00
Poul-Henning Kamp	4a4cd136b4	Drop crummy fattime to timespec conversion routines. Leave a XXX here for anybody able to test.	2006-10-24 11:43:41 +00:00
Poul-Henning Kamp	3c960d9379	Replace slightly crummy fattime<->timespec conversion functions.	2006-10-24 11:14:05 +00:00
Robert Watson	aed5570872	Complete break-out of sys/sys/mac.h into sys/security/mac/mac_framework.h begun with a repo-copy of mac.h to mac_framework.h. sys/mac.h now contains the userspace and user<->kernel API and definitions, with all in-kernel interfaces moved to mac_framework.h, which is now included across most of the kernel instead. This change is the first step in a larger cleanup and sweep of MAC Framework interfaces in the kernel, and will not be MFC'd. Obtained from: TrustedBSD Project Sponsored by: SPARTA	2006-10-22 11:52:19 +00:00
Tom Rhodes	94a28290c1	Fake the link count until we have no choice but to load data from the MFT. PR: 86965 Submitted by: Lowell Gilbert <lgfbsd@be-well.ilk.org>	2006-10-21 08:17:17 +00:00
Konstantin Belousov	16f50bcd80	Update the access and modification times for dev while still holding thread reference on it. Reviewed by: tegge Approved by: pjd (mentor)	2006-10-20 08:03:42 +00:00
Konstantin Belousov	1663075c64	Fix the race between devfs_fp_check and devfs_reclaim. Derefence the vnode' v_rdev and increment the dev threadcount , as well as clear it (in devfs_reclaim) under the dev_lock(). Reviewed by: tegge Approved by: pjd (mentor)	2006-10-20 07:59:50 +00:00
Konstantin Belousov	828d6d12da	Properly lock the vnode around vgone() calls. Unlock the vnode in devfs_close() while calling into the driver d_close() routine. devfs_revoke() changes by: ups Reviewed and bugfixes by: tegge Tested by: mbr, Peter Holm Approved by: pjd (mentor) MFC after: 1 week	2006-10-18 11:17:14 +00:00
Poul-Henning Kamp	e5037a18a9	Use utc_offset() where applicable, and hide the internals of it as static variables.	2006-10-02 18:23:37 +00:00
Poul-Henning Kamp	f645b0b51c	First part of a little cleanup in the calendar/timezone/RTC handling. Move relevant variables to <sys/clock.h> and fix #includes as necessary. Use libkern's much more time- & spamce-efficient BCD routines.	2006-10-02 12:59:59 +00:00
Ruslan Ermilov	9fddcc6661	Fix our ioctl(2) implementation when the argument is "int". New ioctls passing integer arguments should use the _IOWINT() macro. This fixes a lot of ioctl's not working on sparc64, most notable being keyboard/syscons ioctls. Full ABI compatibility is provided, with the bonus of fixing the handling of old ioctls on sparc64. Reviewed by: bde (with contributions) Tested by: emax, marius MFC after: 1 week	2006-09-27 19:57:02 +00:00
Tor Egge	5da56ddb21	Use mount interlock to protect all changes to mnt_flag and mnt_kern_flag. This eliminates a race where MNT_UPDATE flag could be lost when nmount() raced against sync(), sync_fsync() or quotactl().	2006-09-26 04:12:49 +00:00
Konstantin Belousov	af72db7175	Fix the bug in rev. 1.134. In devfs_allocv_drop_refs(), when not_found == 2 and drop_dm_lock is true, no unlocking shall be attempted. The lock is already dropped and memory is freed. Found with: Coverity Prevent(tm) CID: 1536 Approved by: pjd (mentor)	2006-09-19 14:03:02 +00:00
Konstantin Belousov	e7f9b74438	Resolve the devfs deadlock caused by LOR between devfs_mount->dm_lock and vnode lock in devfs_allocv. Do this by temporary dropping dm_lock around vnode locking. For safe operation, add hold counters for both devfs_mount and devfs_dirent, and DE_DOOMED flag for devfs_dirent. The facilities allow to continue after dropping of the dm_lock, by making sure that referenced memory does not disappear. Reviewed by: tegge Tested by: kris Approved by: kan (mentor) PR: kern/102335	2006-09-18 13:23:08 +00:00
Warner Losh	b4583894aa	Put the osta.c license on osta.h. The license is the same. Approved by: scottl@	2006-09-12 19:02:34 +00:00
Warner Losh	1a3c917f9d	while (0); -> while (0) in multi-line macros	2006-08-17 22:50:33 +00:00
Alan Cox	5786be7cc7	Introduce a field to struct vm_page for storing flags that are synchronized by the lock on the object containing the page. Transition PG_WANTED and PG_SWAPINPROG to use the new field, eliminating the need for holding the page queues lock when setting or clearing these flags. Rename PG_WANTED and PG_SWAPINPROG to VPO_WANTED and VPO_SWAPINPROG, respectively. Eliminate the assertion that the page queues lock is held in vm_page_io_finish(). Eliminate the acquisition and release of the page queues lock around calls to vm_page_io_finish() in kern_sendfile() and vfs_unbusy_pages().	2006-08-09 17:43:27 +00:00
Yaroslav Tykhiy	776fc0e90e	Commit the results of the typo hunt by Darren Pilgrim. This change affects documentation and comments only, no real code involved. PR: misc/101245 Submitted by: Darren Pilgrim <darren pilgrim bitfreak org> Tested by: md5(1) MFC after: 1 week	2006-08-04 07:56:35 +00:00
Xin LI	bcc4260f3b	When the volume is being downgraded from a read-write mode, mark it as clean. PR: kern/85366 Submitted by: Dan Lukes <dan at obluda dot cz> MFC After: 2 weeks	2006-08-03 03:55:52 +00:00
Yaroslav Tykhiy	69f0212f52	In udf_find_partmaps(), when we find a type 1 partition map, we have to skip the actual type 1 length (6 bytes). With this change, it is now possible to correctly spot the VAT partition map in certain discs. Submitted by: Pedro Martelletto <pedro@ambientworks.net>	2006-07-25 14:15:50 +00:00
John Baldwin	c2de792e32	Update comment.	2006-07-18 22:29:54 +00:00
John Baldwin	fe78538353	Lock the smb share before doing a 'put' on it in smbfs_unmount(). Tested by: "Jiawei Ye" <leafy7382 at gmail>	2006-07-17 16:13:42 +00:00
Poul-Henning Kamp	9c499ad92f	Remove the NDEVFSINO and NDEVFSOVERFLOW options which no longer exists in DEVFS. Remove the opt_devfs.h file now that it is empty.	2006-07-17 09:07:02 +00:00
Stephan Uphoff	56eeb277cb	Add vnode interlocking to devfs. This prevents race conditions that can cause pagefaults or devfs to use arbitrary vnodes. MFC after: 1 week	2006-07-12 20:25:35 +00:00
John Baldwin	c1cccebe8b	Add a kern_close() so that the ABIs can close a file descriptor w/o having to populate a close_args struct and change some of the places that do.	2006-07-08 20:03:39 +00:00
Robert Watson	be54a5eeb3	Remove unneeded mac.h include. MFC after: 3 days	2006-07-06 13:25:01 +00:00
Robert Watson	2551d4f66e	Remove now unneeded opt_mac.h and mac.h includes. MFC after: 3 days	2006-07-06 13:24:22 +00:00
Robert Watson	83ff52a7f3	Use #include "", not #include <> for opt_foo.h. MFC after: 3 days	2006-07-06 13:22:08 +00:00
Alexander Leidinger	85646f7eb1	Correctly calculate a buffer length. It was off by one so a read() returned one byte less than needed. This is a RELENG_x_y candidate, since it fixes a problem with Oracle 10. Noticed by: Dmitry Ganenko <dima@apk-inform.com> Testcase by: Dmitry Ganenko <dima@apk-inform.com> Reviewed by: des Submitted by: rdivacky Sponsored by: Google SoC 2006 MFC after: 1 week	2006-06-27 20:21:38 +00:00
Scott Long	09e19031ab	Fix a memory leak and a nested 'for' loop in the spare table handling. Submitted by: Pedro Martelletto	2006-06-26 03:21:19 +00:00
Guy Helmer	3266c22854	Upon further review, DES prefers this change over that in revision 1.13 to resolve the directory access problem for processes with P_SUGID flag set. Suggested by: des	2006-06-05 16:41:27 +00:00
Craig Rodrigues	829b898c7c	mount_msdosfs.c: - remove call to getmntopts(), and just pass -o options to nmount(). This removes some confusion as to what options msdosfs can parse, by pushing the responsibility of option parsing to the VFS and FS specific code in the kernel. msdosfs_vfsops.c: - add "force" and "sync" to msdosfs_opts. They used to be specified in mount_msdosfs.c, so move them here. It's not clear whethere these options should be placed into global_opts in vfs_mount.c or not. Motivated by: marcus	2006-06-01 02:25:00 +00:00
Colin Percival	72f6a0fa7a	Enable inadvertantly disabled "securenet" access controls in ypserv. [1] Correct a bug in the handling of backslash characters in smbfs which can allow an attacker to escape from a chroot(2). [2] Security: FreeBSD-SA-06:15.ypserv [1] Security: FreeBSD-SA-06:16.smbfs [2]	2006-05-31 22:32:22 +00:00
Craig Rodrigues	05c0f5c1e2	Remove incorrect null_checkexp() routine. This will allow the NFS server to call vfs_stdcheckexp() on the exported nullfs filesystem, not the underlying filesystem being nullfs mounted. If the lower filesystem was not NFS exported, then the NFS exported null filesystem would not work. Pointed out by: scottl PR: kern/87906 MFC after: 1 week	2006-05-28 22:45:52 +00:00
Craig Rodrigues	ebbf93fd4c	Modify MNT_UPDATE behavior for nullfs so that it does not return EOPNOTSUPP if an "export" parameter was passed in. This should allow nullfs mounts to be NFS exported. PR: kern/87906 MFC after: 1 week	2006-05-28 20:09:18 +00:00
Craig Rodrigues	23badd1016	Remove calls to vfs_export() for exporting a filesystem for NFS mounting from individual filesystems. Call it instead in vfs_mount.c, after we call VFS_MOUNT() for a specific filesystem.	2006-05-26 01:21:51 +00:00
Craig Rodrigues	5eb304a91a	Remove calls to vfs_export() for exporting a filesystem for NFS mounting from individual filesystems. Call it instead in vfs_mount.c, after we call VFS_MOUNT() for a specific filesystem.	2006-05-26 00:32:21 +00:00
Stephan Uphoff	6c1b7d16c2	Call vm_object_page_clean() with the object lock held. Submitted by: kensmith@ Reviewed by: mohans@ MFC after: 6 days	2006-05-25 17:16:11 +00:00
Stephan Uphoff	dcf67e65d2	Do not set B_NOCACHE on buffers when releasing them in flushbuflist(). If B_NOCACHE is set the pages of vm backed buffers will be invalidated. However clean buffers can be backed by dirty VM pages so invalidating them can lead to data loss. Add support for flush dirty page in the data invalidation function of some network file systems. This fixes data losses during vnode recycling (and other code paths using invalbuf(,V_SAVE,,*)) for data written using an mmaped file. Collaborative effort by: jhb@,mohans@,peter@,ps@,ups@ Reviewed by: tegge@ MFC after: 7 days	2006-05-25 01:00:35 +00:00
Guy Helmer	e06dbd3229	Revision 1.4 set access for all sensitive files in /proc/<PID> to mode 0 if a process's uid or gid has changed, but the /proc/<PID> directory itself was also set to mode 0. Assuming this doesn't open any security holes, open access to the /proc/<PID> directory for users other than root to read or search the directory. Reviewed by: des (back in February) MFC after: 3 weeks	2006-05-24 14:03:51 +00:00
Poul-Henning Kamp	c40da00ca3	Since DELAY() was moved, most <machine/clock.h> #includes have been unnecessary.	2006-05-16 14:37:58 +00:00
Kelly Yancey	c9ad8a67af	Restore the ability to mount procfs and fdescfs filesystems via the mount(2) system call: * Add cmount hook to fdescfs and pseudofs (and, by extension, procfs and linprocfs). This (mostly) restores the ability to mount these filesystems using the old mount(2) system call (see below for the rest of the fix). * Remove not-NULL check for the data argument from the mount(2) entry point. Per the mount(2) man page, it is up to the individual filesystem being mounted to verify data. Or, in the case of procfs, etc. the filesystem is free to ignore the data parameter if it does not use it. Enforcing data to be not-NULL in the mount(2) system call entry point prevented passing NULL to filesystems which ignored the data pointer value. Apparently, passing NULL was common practice in such cases, as even our own mount_std(8) used to do it in the pre-nmount(2) world. All userland programs in the tree were converted to nmount(2) long ago, but I've found at least one external program which broke due to this (presumably unintentional) mount(2) API change. One could argue that external programs should also be converted to nmount(2), but then there isn't much point in keeping the mount(2) interface for backward compatibility if it isn't backward compatible.	2006-05-15 19:42:10 +00:00
Pawel Jakub Dawidek	00a480ac5c	Remove unused prototypes.	2006-04-12 12:17:29 +00:00
Jeff Roberson	23b77994f2	- Add a bogus vhold/vdrop around vgone() in devfs_revoke. Without this the vnode is never recycled. It is bogus because the reference really should be associated with the devfs dirent.	2006-03-31 23:37:29 +00:00
Tor Egge	87f0769a57	Call vn_start_write() before locking vnode.	2006-03-19 20:45:06 +00:00
Robert Watson	eca7e73743	Add a_fdidx to comment prototype for fifo_open(). MFC after: 3 days Submitted by: Kostik Belousov <kostikbel at gmail dot com>	2006-03-15 10:15:35 +00:00
Robert Watson	945a519a23	If fifo_open() is called with a negative file descriptor, return EINVAL rather than panicking later. This can occur if the kernel calls vn_open() on a fifo, as there will be no associated file descriptor, and therefore the file descriptor operations cannot be modified to point to the fifo operation set. MFC after: 3 days Reported by: Martin <nakal at nurfuerspam dot de> PR: 94278	2006-03-14 19:29:45 +00:00
Joerg Wunsch	f7d5a5328f	When encountering a ISO_SUSP_CFLAG_ROOT element in Rock Ridge processing, this actually means there's a double slash recorded in the symbolic link's path name. We used to start over from / then, which caused link targets like ../../bsdi.1.0/include//pathnames.h to be interpreted as /pathnahes.h. This is both contradictionary to our conventional slash interpretation, as well as potentially dangerous. The right thing to do is (obviously) to just ignore that element. bde once pointed out that mistake when he noticed it on the 4.4BSD-Lite2 CD-ROM, and asked me for help. Reviewed by: bde (about half a year ago) MFC after: 3 days	2006-03-13 22:32:33 +00:00
Jeff Roberson	4bf5133b1f	- Define a null_getwritemount to get the mount-point for the lower filesystem so that nullfs doesn't permit you to circumvent snapshots. Discussed with: tegge Sponsored by: Isilon Systems, Inc.	2006-03-12 04:58:18 +00:00
Kris Kennaway	7d65872fff	Correct the vnode locking in fdescfs. PR: kern/93905 Submitted by: Kostik Belousov <kostikbel@gmail.com> Reviewed by: jeff MFC After: 1 week	2006-02-28 00:05:44 +00:00
Yaroslav Tykhiy	82967ff0b8	CODA_COMPAT_5 may not be defined unconditionally in the coda5 module. Otherwise a kernel build would break in the coda5 module if the main kernel conf file enabled CODA_COMPAT_5, too. Redefined symbols are strictly disallowed by -Werror. To overcome this issue, introduce a different symbol indicating coda5 build, CODA5_MODULE, and translate it to CODA_COMPAT_5 appropriately in /sys/coda/coda.h. MFC after: 3 days	2006-02-27 12:04:13 +00:00
John Baldwin	06ad42b2f7	Close some races between procfs/ptrace and exit(2): - Reorder the events in exit(2) slightly so that we trigger the S_EXIT stop event earlier. After we have signalled that, we set P_WEXIT and then wait for any processes with a hold on the vmspace via PHOLD to release it. PHOLD now KASSERT()'s that P_WEXIT is clear when it is invoked, and PRELE now does a wakeup if P_WEXIT is set and p_lock drops to zero. - Change proc_rwmem() to require that the processing read from has its vmspace held via PHOLD by the caller and get rid of all the junk to screw around with the vmspace reference count as we no longer need it. - In ptrace() and pseudofs(), treat a process with P_WEXIT set as if it doesn't exist. - Only do one PHOLD in kern_ptrace() now, and do it earlier so it covers FIX_SSTEP() (since on alpha at least this can end up calling proc_rwmem() to clear an earlier single-step simualted via a breakpoint). We only do one to avoid races. Also, by making the EINVAL error for unknown requests be part of the default: case in the switch, the various switch cases can now just break out to return which removes a _lot_ of duplicated PRELE and proc unlocks, etc. Also, it fixes at least one bug where a LWP ptrace command could return EINVAL with the proc lock still held. - Changed the locking for ptrace_single_step(), ptrace_set_pc(), and ptrace_clear_single_step() to always be called with the proc lock held (it was a mixed bag previously). Alpha and arm have to drop the lock while the mess around with breakpoints, but other archs avoid extra lock release/acquires in ptrace(). I did have to fix a couple of other consumers in kern_kse and a few other places to hold the proc lock and PHOLD. Tested by: ps (1 mostly, but some bits of 2-4 as well) MFC after: 1 week	2006-02-22 18:57:50 +00:00
John Baldwin	f8e3eeb519	Change pfs_visible() to optionally return a pointer to the process associated with the passed in pfs_node. If it does return a pointer, it keeps the process locked. This allows a lot of places that were calling pfind() again right after pfs_visible() to not have to do that and avoids races since we don't drop the proc lock just to turn around and lock it again. This will become more important with future changes to fix races between procfs/ptrace and exit(2). Also, removed a duplicate pfs_visible() call in pfs_getextattr(). Reviewed by: des MFC after: 1 week	2006-02-22 17:24:54 +00:00
John Baldwin	7a61c1a3cb	Hold the proc lock while calling proc_sstep() since the function asserts it and remove a PRELE() that didn't have a matching PHOLD(). The calling code already has a PHOLD anyway. MFC after: 1 week	2006-02-22 17:20:37 +00:00
Jeff Roberson	f50b03bfd6	- We must hold a reference to a vnode before calling vgone() otherwise it may not be removed from the freelist. MFC After: 1 week Found by: kris	2006-02-22 09:05:40 +00:00
Jeff Roberson	f5cacb3964	- spell VOP_LOCK(vp, LK_RELEASE... VOP_UNLOCK(vp,... so that asserts in vop_lock_post do not trigger. - Rearrange null_inactive to null_hashrem earlier so there is no chance of finding the null node on the hash list after the locks have been switched. - We should never have a NULL lowervp in null_reclaim() so there is no need to handle this situation. panic instead. MFC After: 1 week	2006-02-22 06:17:31 +00:00
Jeff Roberson	9c12e63100	- Assert that the lowervp is locked in null_hashget(). - Simplify the logic dealing with recycled vnodes in null_hashget() and null_hashins(). Since we hold the lower node locked in both cases the null node can not be undergoing recycling unless reclaim somehow called null_nodeget(). The logic that was in place was not safe and was essentially dead code. MFC After: 1 week	2006-02-22 06:15:12 +00:00
Jeff Roberson	578abc8e54	- Deadfs should not use the std GETWRITEMOUNT routine. Add one that always returns NULL. MFC After: 1 week	2006-02-22 06:11:59 +00:00
John Baldwin	ccabcacb30	Correctly set MNTK_MPSAFE flag from the lower vnode's mount rather than always turning it on along with any flags set in the lower mount. Tested by: kris Reviewed by: jeff MFC after: 3 days	2006-02-10 18:06:49 +00:00
Jeff Roberson	fbf586bd40	- No need to WANTPARENT when we're just going to vrele it in a deadlock prone way later. Reported by: kkenn MFC After: 3 days	2006-02-07 11:31:32 +00:00
Will Andrews	937a238777	Make UDF endian-safe. Submitted by: Pedro Martelletto <pedro@ambientworks.net> (via scottl) Tested on: sparc64	2006-02-03 15:25:52 +00:00
Jeff Roberson	89b0e10910	- Reorder calls to vrele() after calls to vput() when the vrele is a directory. vrele() may lock the passed vnode, which in these cases would give an invalid lock order of child -> parent. These situations are deadlock prone although do not typically deadlock because the vrele is typically not releasing the last reference to the vnode. Users of vrele must consider it as a call to vn_lock() and order it appropriately. MFC After: 1 week Sponsored by: Isilon Systems, Inc. Tested by: kkenn	2006-02-01 00:25:26 +00:00
Jeff Roberson	3b77d80cdd	- Remove a stale comment. This function was rewritten to be SMP safe some time ago. Sponsored by: Isilon Systems, Inc.	2006-01-30 08:24:14 +00:00
Tom Rhodes	9fc31f8a5f	Update incorrect comments here, there should not be a call to panic() over fs corruption. Discussed with: alfred, phk	2006-01-23 17:45:57 +00:00
Max Khon	710a9accfe	Do not assume that `char direntry::deExtension[3]' starts right after `char direntry::deName[8]' and access deExtension[] explicitly. Found by: Coverity Prevent(tm) CID: 350, 351, 352	2006-01-22 21:09:38 +00:00
Robert Watson	0bdfeca765	Convert last four functions in coda_vnops.c to ANSI C function declarations. I knew I would get to fix something in Coda eventually. MFC after: 1 week	2006-01-21 19:51:47 +00:00
Alfred Perlstein	92e73f5711	I ran into an nfs client panic a couple of times in a row over the last few days. I tracked it down to the fact that nfs_reclaim() is setting vp->v_data to NULL _before_ calling vnode_destroy_object(). After silence from the mailing list I checked further and discovered that ufs_reclaim() is unique among FreeBSD filesystems for calling vnode_destroy_object() early, long before tossing v_data or much of anything else, for that matter. The rest, including NFS, appear to be identical, as if they were just clones of one original routine. The enclosed patch fixes all file systems in essentially the same way, by moving the call to vnode_destroy_object() to early in the routine (before the call to vfs_hash_remove(), if any). I have only tested NFS, but I've now run for over eighteen hours with the patch where I wouldn't get past four or five without it. Submitted by: Frank Mayhar Requested by: Mohan Srinivasan MFC After: 1 week	2006-01-17 17:29:03 +00:00
Tor Egge	82be0a5a24	Add marker vnodes to ensure that all vnodes associated with the mount point are iterated over when using MNT_VNODE_FOREACH. Reviewed by: truckman	2006-01-09 20:42:19 +00:00
Maxim Konovalov	036cd12a8d	o Fix typo in the define: s/MRAK_INT_GEN/MARK_INT_GEN/. The typo was harmless because the define is not used in coda_vfsops.c. Submitted by: Hugo Meiland	2006-01-09 18:07:06 +00:00
Maxim Konovalov	98a95f61fa	o Typo in the debug message: s/skiped/skipped. PR: kern/91346 Submitted by: Gavin Atkinson	2006-01-05 13:39:23 +00:00
Robert Watson	8f0d99d790	When returning EIO from DEVFSIO_RADD ioctl, drop the exclusive rule lock. Otherwise the system comes to a rather sudden and grinding halt. MFC after: 1 week	2006-01-03 09:49:10 +00:00
Tom Rhodes	09c00166e4	Make tv_sec a time_t on all platforms but alpha. Brings us more in line with POSIX. This also makes the struct correct we ever implement an i386-time64 architecture. Not that we need too. Reviewed by: imp, brooks Approved by: njl (acpica), des (no objects, touches procfs) Tested with: make universe	2005-12-24 22:22:17 +00:00
Dag-Erling Smørgrav	0430a5e289	Eradicate caddr_t from the VFS API.	2005-12-14 00:49:52 +00:00
Tai-hwa Liang	8bfc230455	Recent nmount(2) adoption in mount_smbfs(8) did not flag the "long" option since mount_smbfs(8) assumed long name mounting by default unless "-n long" was explicitly specified. Rather than supplying a "long" option in mount_smbfs(8), this commit brings back the original behaviour by associating SMBFS_MOUNT_NO_LONG with the "nolong" option. This should fix the broken long file names on smbfs people observed recently. Reported by: Vladimir Grebenschikov <vova at fbsd dot ru> Reviewed by: phk Tested by: Slawa Olhovchenkov <slw at zxy dot spb dot ru>	2005-12-05 19:05:06 +00:00
Ruslan Ermilov	342ed5d948	Fix -Wundef warnings found when compiling i386 LINT, GENERIC and custom kernels.	2005-12-05 11:58:35 +00:00
Ruslan Ermilov	3238c6bd33	Fix -Wundef from compiling the amd64 LINT.	2005-12-04 10:06:06 +00:00
Ruslan Ermilov	f4e9888107	Fix -Wundef.	2005-12-04 02:12:43 +00:00
Boris Popov	cc518d3b67	Fix interaction with Windows 2000/XP based servers: If the complete reply on the TRANS2_FIND_FIRST2 request fits exactly into one responce packet, then next call to TRANS2_FIND_NEXT2 will return zero entries and server will close current transaction. To avoid subsequent errors we should not perform FIND_CLOSE2 request. PR: kern/78953 Submitted by: Jim Carroll	2005-11-22 07:13:00 +00:00
Craig Rodrigues	d75b2048db	Properly parse the nowin95 mount option. Tested by: Rainer Hurling <rhurlin at gwdg dot de>	2005-11-19 16:38:39 +00:00
Craig Rodrigues	4ab125739b	Add "shortnames" and "longnames" mount options which are synonyms for "shortname" and "longname" mount options. The old (before nmount()) mount_msdosfs program accepted "shortnames" and "longnames", but the kernel nmount() checked for "shortname" and "longname". So, make the kernel accept "shortnames", "longnames", "shortname", "longname" for forwards and backwarsd compatibility. Discovered by: Rainer Hurling <rhurlin at gwdg dot de>	2005-11-18 22:34:31 +00:00
Craig Rodrigues	43fa5bf534	- Add errmsg to the list of smbfs mount options. - Use vfs_mount_error() to propagate smbfs mount errors back to userspace. Reviewed by: bp (smbfs maintainer)	2005-11-16 02:26:25 +00:00
Doug White	16e35dcc39	This is a workaround for a complicated issue involving VFS cookies and devfs. The PR and patch have the details. The ultimate fix requires architectural changes and clarifications to the VFS API, but this will prevent the system from panicking when someone does "ls /dev" while running in a shell under the linuxulator. This issue affects HEAD and RELENG_6 only. PR: 88249 Submitted by: "Devon H. O'Dell" <dodell@ixsystems.com> MFC after: 3 days	2005-11-09 22:03:50 +00:00
Robert Watson	5bb84bc84b	Normalize a significant number of kernel malloc type names: - Prefer '_' to ' ', as it results in more easily parsed results in memory monitoring tools such as vmstat. - Remove punctuation that is incompatible with using memory type names as file names, such as '/' characters. - Disambiguate some collisions by adding subsystem prefixes to some memory types. - Generally prefer lower case to upper case. - If the same type is defined in multiple architecture directories, attempt to use the same name in additional cases. Not all instances were caught in this change, so more work is required to finish this conversion. Similar changes are required for UMA zone names.	2005-10-31 15:41:29 +00:00
Poul-Henning Kamp	3b72f38b5e	Use correct cirteria for determining which directory entries we can purge right away and which we merely can hide. Beaten into my skull by: kris	2005-10-18 20:21:25 +00:00
Dag-Erling Smørgrav	a92fef8afc	Implement the full range of ISO9660 number conversion routines in iso.h. MFC after: 2 weeks	2005-10-18 13:35:08 +00:00
Craig Rodrigues	c583f369a7	Unconditionally mount a CD9660 filesystem as read-only, instead of returning EROFS if we forget to mount it as read-only.	2005-10-17 03:29:53 +00:00
Craig Rodrigues	b137e1c8ba	Use the actual sector size of the media instead of hard-coding it to 2048. This eliminates KASSERTs in GEOM if we accidentally mount an audio CD as a cd9660 filesystem.	2005-10-17 03:27:35 +00:00
Craig Rodrigues	073833a420	Unconditionally mount a UDF filesystem as read-only, instead of returning an EROFS if we forget to mount it as read-only.	2005-10-17 03:07:36 +00:00
Florent Thoumie	86391603da	- Fix typo. Approved by: ssouhlal MFC after: 1 week	2005-10-17 00:04:35 +00:00
Don Lewis	8bcc0d3f95	Update nwfs_lookup() to match the current cache_lookup() API. cache_lookup() has returned a ref'ed and locked vnode since vfs_cache.c:1.96, dated Tue Mar 29 12:59:06 2005 UTC. This change is similar to the change made to smbfs_lookup() in smbfs_vnops.c:1.58. Tested by: "Antony Mawer" ant AT mawer.org MFC after: 2 weeks	2005-10-16 21:54:35 +00:00
Kris Kennaway	3554cddbfa	Reflect mpsafety of the underlying filesystem in the nullfs image. I benchmarked this by simultaneously extracting 4 large tarballs (basically world images) on a 4-processor AMD64 system, in a malloc-backed md. With this patch, system time was reduced by 43%, and wall clock time by 33%. Submitted by: jeff MFC after: 1 week	2005-10-16 21:45:25 +00:00
Don Lewis	d31c91fbcf	Apply the same fix to a potential race in the ISDOTDOT code in cd9660_lookup() that was used to fix an actual race in ufs_lookup.c:1.78. This is not currently a hazard, but the bug would be activated by marking cd9660 as MPSAFE. Requested by: bde	2005-10-16 21:41:54 +00:00
Yaroslav Tykhiy	10d645b7e5	In preparation for making the modules actually use opt_*.h files provided in the kernel build directory, fix modules that were failing to build this way due to not quite correct kernel option usage. In particular: ng_mppc.c uses two complementary options, both of which are listed in sys/conf/files. Ideally, there should be a separate option for including ng_mppc.c in kernel build, but now only NETGRAPH_MPPC_ENCRYPTION is usable anyway, the other one requires proprietary files. nwfs and smbfs were trying to ensure they were built with proper network components, but the check was rather questionable. Discussed with: ru	2005-10-14 23:17:45 +00:00
David Xu	9104847f21	1. Change prototype of trapsignal and sendsig to use ksiginfo_t *, most changes in MD code are trivial, before this change, trapsignal and sendsig use discrete parameters, now they uses member fields of ksiginfo_t structure. For sendsig, this change allows us to pass POSIX realtime signal value to user code. 2. Remove cpu_thread_siginfo, it is no longer needed because we now always generate ksiginfo_t data and feed it to libpthread. 3. Add p_sigqueue to proc structure to hold shared signals which were blocked by all threads in the proc. 4. Add td_sigqueue to thread structure to hold all signals delivered to thread. 5. i386 and amd64 now return POSIX standard si_code, other arches will be fixed. 6. In this sigqueue implementation, pending signal set is kept as before, an extra siginfo list holds additional siginfo_t data for signals. kernel code uses psignal() still behavior as before, it won't be failed even under memory pressure, only exception is when deleting a signal, we should call sigqueue_delete to remove signal from sigqueue but not SIGDELSET. Current there is no kernel code will deliver a signal with additional data, so kernel should be as stable as before, a ksiginfo can carry more information, for example, allow signal to be delivered but throw away siginfo data if memory is not enough. SIGKILL and SIGSTOP have fast path in sigqueue_add, because they can not be caught or masked. The sigqueue() syscall allows user code to queue a signal to target process, if resource is unavailable, EAGAIN will be returned as specification said. Just before thread exits, signal queue memory will be freed by sigqueue_flush. Current, all signals are allowed to be queued, not only realtime signals. Earlier patch reviewed by: jhb, deischen Tested on: i386, amd64	2005-10-14 12:43:47 +00:00
Craig Rodrigues	a3d7f575c0	- Do not hardcode the bsize to a sectorsize of 2048, even though the UDF specification specifies a logical sectorsize of 2048. Instead, get it from GEOM. - When reading the UDF Anchor Volume Descriptor, use the logical sectorsize of 2048 when calculating the offset to read from, but use the actual sectorsize to determine how much to read. - works with reading a DVD disk and a DVD disk image file via mdconfig - correctly returns EINVAL if we try to mount_udf an audio CD, instead of panicking inside GEOM when INVARIANTS is set	2005-10-09 04:45:33 +00:00
Pawel Jakub Dawidek	8597a1c5b2	We don't need 'imp' here.	2005-10-07 10:30:47 +00:00
Robert Watson	2affdbee3e	Second attempt at a work-around for fifo-related socket panics during make -j with high levels of parallelism: acquire Giant in fifo I/O routines. Discussed with: ups MFC after: 3 days	2005-10-01 20:15:41 +00:00
Poul-Henning Kamp	73a2c3a32e	The NWFS code in RELENG_6 is broken due to a typo in sys/fs/nwfs/nwfs_vfsop= s.c, introduced with the conversion to nmount with revision 1.38. This causes mount_nwfs to fail with the error message: mount_nwfs: mount error: /mnt/netware: syserr = No such file or directo= ry This is caused by a typo on line 178, which specifies "nwfw_args" rather than "nwfs_args". Submitted by: Antony Mawer <gnats@mawer.org> Fat fingers: phk PR: 86757 MFC: 3 days	2005-09-30 18:21:05 +00:00
Peter Edwards	20c5ba3685	Remove checks for BOOTSIG[23] from FAT32 bootblocks. There seems to be very little documentary evidence outside this implementation to suggest a these checks are neccessary, and more than one camera-formatted flash disk fails the check, but mounts successfully on most other systems. Reviewed By: bde@	2005-09-29 14:09:46 +00:00
Robert Watson	a0e81bce69	Back out fifo_vnops.c:1.127, which introduced an sx lock around I/O on a fifo. While this did indeed close the race, confirming suspicions about the nature of the problem, it causes difficulties with blocking I/O on fifos. Discussed with: ups Also spotted by: Peter Holm <peter at holm dot cc>	2005-09-27 16:45:22 +00:00
Robert Watson	454c3d13be	Assert v_fifoinfo is non-NULL in fifo_close() in order to catch non-conforming cases sooner. MFC after: 3 days Reported by: Peter Holm <peter at holm dot cc>	2005-09-26 08:17:03 +00:00
Robert Watson	ee47648770	Lock the read socket receive buffer when frobbing the sb_state flag on that socket during open, not the write socket receive buffer. This might explain clearing of the sb_state SB_LOCK flag seen occasionally in soreceive() on fifos. MFC after: 3 days Spotted by: ups	2005-09-25 19:52:09 +00:00
Poul-Henning Kamp	e515ee7832	Make rule zero really magical, that way we don't have to do anything when we mount and get zero cost if no rules are used in a mountpoint. Add code to deref rules on unmount. Switch from SLIST to TAILQ. Drop SYSINIT, use SX_SYSINIT and static initializer of TAILQ instead. Drop goto, a break will do. Reduce double pointers to single pointers. Combine reaping and destroying rulesets. Avoid memory leaks in a some error cases.	2005-09-24 07:03:09 +00:00
Robert Watson	5d3df5cc1b	For reasons of consistency (and necessity), assert an exclusive vnode lock on the fifo vnode in fifo_open(): we rely on the vnode lock to serialize access to v_fifoinfo. MFC after: 3 days	2005-09-23 12:39:51 +00:00
Robert Watson	7028887eac	Add fi_sx, an sx lock to serialize I/O operations on the socket pair underlying the POSIX fifo implementation. In 6.x/7.x, fifo access is moved from the VFS layer, where it was serialized using the vnode lock, to the file descriptor layer, where access is protected by a reference count but not serialized. This exposed socket buffer locking to high levels of parallelism in specific fifo workloads, such as make -j 32, which expose as yet unresolved socket buffer bugs. fi_sx re-adds serialization about the read and write routines, although not paths that simply test socket buffer mbuf queue state, such as the poll and kqueue methods. This restores the extra locking cost previously present in some cases, but is an effective workaround for the instability that has been experienced. This workaround should be removed once the bug in socket buffer handling has been fixed. Reported by: kris, jhb, Julien Gabel <jpeg at thilelli dot net>, Peter Holm <peter at holm dot cc>, others MFC after: 3 days	2005-09-22 10:51:12 +00:00
Poul-Henning Kamp	e606a3c63e	Rewamp DEVFS internals pretty severely [1]. Give DEVFS a proper inode called struct cdev_priv. It is important to keep in mind that this "inode" is shared between all DEVFS mountpoints, therefore it is protected by the global device mutex. Link the cdev_priv's into a list, protected by the global device mutex. Keep track of each cdev_priv's state with a flag bit and of references from mountpoints with a dedicated usecount. Reap the benefits of much improved kernel memory allocator and the generally better defined device driver APIs to get rid of the tables of pointers + serial numbers, their overflow tables, the atomics to muck about in them and all the trouble that resulted in. This makes RAM the only limit on how many devices we can have. The cdev_priv is actually a super struct containing the normal cdev as the "public" part, and therefore allocation and freeing has moved to devfs_devs.c from kern_conf.c. The overall responsibility is (to be) split such that kern/kern_conf.c is the stuff that deals with drivers and struct cdev and fs/devfs handles filesystems and struct cdev_priv and their private liason exposed only in devfs_int.h. Move the inode number from cdev to cdev_priv and allocate inode numbers properly with unr. Local dirents in the mountpoints (directories, symlinks) allocate inodes from the same pool to guarantee against overlaps. Various other fields are going to migrate from cdev to cdev_priv in the future in order to hide them. A few fields may migrate from devfs_dirent to cdev_priv as well. Protect the DEVFS mountpoint with an sx lock instead of lockmgr, this lock also protects the directory tree of the mountpoint. Give each mountpoint a unique integer index, allocated with unr. Use it into an array of devfs_dirent pointers in each cdev_priv. Initially the array points to a single element also inside cdev_priv, but as more devfs instances are mounted, the array is extended with malloc(9) as necessary when the filesystem populates its directory tree. Retire the cdev alias lists, the cdev_priv now know about all the relevant devfs_dirents (and their vnodes) and devfs_revoke() will pick them up from there. We still spelunk into other mountpoints and fondle their data without 100% good locking. It may make better sense to vector the revoke event into the tty code and there do a destroy_dev/make_dev on the tty's devices, but that's for further study. Lots of shuffling of stuff and churn of bits for no good reason[2]. XXX: There is still nothing preventing the dev_clone EVENTHANDLER from being invoked at the same time in two devfs mountpoints. It is not obvious what the best course of action is here. XXX: comment out an if statement that lost its body, until I can find out what should go there so it doesn't do damage in the meantime. XXX: Leave in a few extra malloc types and KASSERTS to help track down any remaining issues. Much testing provided by: Kris Much confusion caused by (races in): md(4) [1] You are not supposed to understand anything past this point. [2] This line should simplify life for the peanut gallery.	2005-09-19 19:56:48 +00:00
Robert Watson	526e258d3a	Assert that (vp) is locked in fifo_close(), since we rely on the exclusive vnode lock to synchronize the reference counts on struct fifoinfo. MFC after: 3 days	2005-09-18 10:44:50 +00:00
Poul-Henning Kamp	59307b0dfe	Don't attempt to recurse lockmgr, it doesn't like it.	2005-09-15 21:16:43 +00:00
Alexander Kabaev	d11c07ba56	Handle a race condition where NULLFS vnode can be cleaned while threads can still be asleep waiting for lowervp lock. Tested by: kkenn Discussed with: ssouhlal, jeffr	2005-09-15 19:21:26 +00:00
Robert Watson	ca17bccaa1	The socket pointers in fifoinfo are not permitted to be NULL, so don't check if they are, it just confuses the fifo code more. MFC after: 3 days	2005-09-15 15:45:34 +00:00
Poul-Henning Kamp	214c8ff0e4	Various minor polishing.	2005-09-15 10:28:19 +00:00
Poul-Henning Kamp	6556102dcb	Protect the devfs rule internal global lists with a sx lock, the per mount locks are not enough. Finer granularity (x)locking could be implemented, but I prefer to keep it simple for now.	2005-09-15 08:50:16 +00:00
Poul-Henning Kamp	ab32e95296	Absolve devfs_rule.c from locking responsibility and call it with all necessary locking held.	2005-09-15 08:36:37 +00:00
Poul-Henning Kamp	5e080af41f	Close a race which could result in unwarranted "ruleset %d already running" panics. Previously, recursion through the "include" feature was prevented by marking each ruleset as "running" when applied. This doesn't work for the case where two DEVFS instances try to apply the same ruleset at the same time. Instead introduce the sysctl vfs.devfs.rule_depth (default == 1) which limits how many levels of "include" we will traverse. Be aware that traversal of "include" is recursive and kernel stack size is limited. MFC: after 3 days	2005-09-15 06:57:28 +00:00
Robert Watson	447bbaa2cf	Trim down now (believed to be) unused fifo_ioctl() and fifo_kqfilter() VOP implementations, since they in theory are used only on open file descriptors, in which case the ioctls are via fifo_ioctl_f() and kqueue requests are via fifo_kqfilter_f(). Generate warnings if they are entered for now. These printf() calls should become panic() calls. Annotate and re-implement fifo_ioctl_f(): don't arbitrarily forward ioctls to the socket layer, only forward the ones we explicitly support for fifos. In the case of FIONREAD, don't forward the request to the write socket on a read-write fifo, or the read result is overwritten. Annotate a nasty case for the undefined POSIX O_RDWR on fifos, in which failure of the second ioctl will result in the socket pair being in an inconsistent state. Assert copyright as I find myself rewriting non-trivial parts of fifofs. MFC after: 3 days	2005-09-13 17:46:48 +00:00
Robert Watson	8a22e151be	As a result of kqueue locking work, socket buffer locks will always be held when entering a kqueue filter for fifos via a socket buffer event: as such, assert the lock unconditionally rather than acquiring it conditionall. MFC after: 3 days	2005-09-13 10:39:24 +00:00
Robert Watson	db7a6c2f43	Annotate two issues: 1) fifo_kqfilter() is not actually ever used, it likely should be GC'd. 2) fifo_kqfilter_f() doesn't implement EVFILT_VNODE, so detecting events on the underlying vnode for a fifo no longer works (it did in 4.x). Likely, fifo_kqfilter_f() should forward the request to the VFS using fp->f_vnode, which would work once fifo_kqfilter() was detached from the vnode operation vector (removing the fifo override). Discussed with: phk	2005-09-13 09:23:22 +00:00
Robert Watson	88f39e8e95	Introduce no-op nosup fifo kqueue filter and detach routine, which are used when a read filter is requested on a write-only fifo descriptor, or a write filter is requested on a read-only fifo descriptor. This permits the filters to be registered, but never raises the event, which causes kqueue behavior for fifos to more closely match similar semantics for poll and select, which permit testing for the condition even though the condition will never be raised, and is consistent with POSIX's notion that a fifo has identical semantics to a one-way IPC channel created using pipe() on most operating systems. The fifo regression test suite can now run to completion on HEAD without errors. MFC after: 3 days	2005-09-12 19:59:12 +00:00
Robert Watson	48afebb83d	When a request is made to register a filter on a fifo that doesn't apply to the fifo (i.e., not EVFILT_READ or EVFILT_WRITE), reject it as EINVAL, not by returning 1 (EPERM). MFC after: 3 days	2005-09-12 18:07:49 +00:00
Robert Watson	114538d85b	Remove DFLAG_SEEKABLE from fifo file descriptors: fifos are not seekable according to POSIX, not to mention the fact that it doesn't make sense (and hence isn't really implemented). This causes the fifo_misc regression test to succeed.	2005-09-12 12:15:12 +00:00
Robert Watson	6dd84b0bdc	Only poll the fifo for read events if the fifo is attached to a readable file descriptor. Otherwise, the read end of a fifo might return that it is writable (which it isn't). Only poll the fifo for write events if the fifo attached to a writable file descriptor. Otherwise, the write end of a fifo might return that it is readable (which it isn't). In the event that a file is FREAD\|FWRITE (which is allowed by POSIX, but has undefined behavior), we poll for both. MFC after: 3 days	2005-09-12 10:16:18 +00:00
Robert Watson	845e8e827b	After going to some trouble to identify only the write-related events to poll the write socket for, the fifo polling code proceeded to poll for the complete set of events. Use 'levents' instead of 'events' as the argument to poll, and only poll the write socket if there is interest in write events. MFC after: 3 days	2005-09-12 10:13:15 +00:00
Robert Watson	ab5182012a	When a writer opens a fifo, wake up the read socket for read, not the write socket. MFC after: 3 days	2005-09-12 10:07:21 +00:00
Robert Watson	a1b9943657	Add an assertion that fifo_open() doesn't race against other threads while sleeping to allocate fifo state: due to using the vnode lock to serialize access to a fifo during open, it shouldn't happen (tm). MFC after: 3 days	2005-09-12 10:06:38 +00:00
Robert Watson	ba9eeb43fe	Rather than reaching into the internals of the UNIX domain socket code by calling uipc_connect2() to connect two socket endpoints to create a fifo, call soconnect2(). MFC after: 3 days	2005-09-12 10:05:08 +00:00
Poul-Henning Kamp	21806f30bc	Clean up prototypes.	2005-09-12 08:03:15 +00:00
Craig Rodrigues	b575132598	Cast bf_sysid to const char * when passing it to strncmp(), because strncmp does not take an unsigned char *. Eliminates warning with GCC 4.0.	2005-09-11 16:02:14 +00:00
Craig Rodrigues	2a3e0acc5d	Do not declare M_NTFSMNT with extern linkage here, since it is defined with static linkage in ntfs_vfsops.c. Fixes compilation with GCC 4.0.	2005-09-11 15:57:07 +00:00
David E. O'Brien	5ddf29857e	Ensure the full value is written into inode variables. PR: 85503 Submitted by: Dmitry Pryanishnikov <dmitry@atlantis.dp.ua>	2005-09-07 10:32:58 +00:00
Suleiman Souhlal	68da388325	Unbreak hpfs/ntfs/udf/ext2fs/reiserfs mounting. Another pointyhat to: ssouhlal	2005-09-03 20:23:41 +00:00
Suleiman Souhlal	44bd2bc19a	Unbreak the build. Pointyhat to: ssouhlal	2005-09-03 00:40:19 +00:00
Suleiman Souhlal	cdeb72045b	Use vput() instead of vrele() in null_reclaim() since the lower vnode is locked. MFC after: 3 days	2005-09-02 15:49:55 +00:00
Suleiman Souhlal	75d7ba93af	*_mountfs() (if the filesystem mounts from a device) needs devvp to be locked, so lock it. Glanced at by: phk MFC after: 3 days	2005-09-02 15:27:23 +00:00
Poul-Henning Kamp	80447bf701	Add a missing dev_relthread() call. Remove unused variable. Spotted by: Hans Petter Selasky <hselasky@c2i.net>	2005-08-29 11:14:18 +00:00
Poul-Henning Kamp	516ad423b1	Handle device drivers with D_NEEDGIANT in a way which does not penalize the 'good' drivers: Allocate a shadow cdevsw and populate it with wrapper functions which grab Giant	2005-08-17 08:19:52 +00:00
Poul-Henning Kamp	31cc57cdbd	Collect the devfs related sysctls in one place	2005-08-16 19:25:02 +00:00
Poul-Henning Kamp	9c0af1310c	Create a new internal .h file to communicate very private stuff from kern_conf.c to devfs. For now just two prototypes, more to come.	2005-08-16 19:08:01 +00:00
Poul-Henning Kamp	d785dfefa4	Eliminate effectively unused dm_basedir field from devfs_mount.	2005-08-15 19:40:53 +00:00
Peter Grehan	14dcd40fde	- restore the ability to mount cd9660 filesystems as root by inverting some of the options test, specifically the joliet and rockridge tests. Since the root mount callchain doesn't go through cd9660_cmount, the default mount options aren't set. Rather than having the main codepath assume the options are there, test for the absence of the inverted optioin e.g. instead of vfs_flagopt(.. "joliet" ..), test for !vfs_flagopt(.. "nojoliet" ..) This works for root mount, non-root mount and future nmount cases. - in cd9660_cmount, remove inadvertent setting of "gens" when "extatt" was set. Reported by: grehan, Dario Freni <saturnero at freesbie org> Tested by: Dario Freni Not objected to by: phk MFC after: 3 days	2005-08-14 04:19:36 +00:00
Dag-Erling Smørgrav	8ab2a64d2f	Eliminate an unnecessary bcopy().	2005-08-12 12:22:05 +00:00
David E. O'Brien	c11ba30c9a	Remove public declarations of variables that were forgotten when they were made static.	2005-08-10 07:10:02 +00:00
David E. O'Brien	cec9a4bf57	Remove the need to forward declare statics by moving them around.	2005-08-10 07:08:14 +00:00
Robert Watson	6a113b3de7	Merge the dev_clone and dev_clone_cred event handlers into a single event handler, dev_clone, which accepts a credential argument. Implementors of the event can ignore it if they're not interested, and most do. This avoids having multiple event handler types and fall-back/precedence logic in devfs. This changes the kernel API for /dev cloning, and may affect third party packages containg cloning kernel modules. Requested by: phk MFC after: 3 days	2005-08-08 19:55:32 +00:00
Kris Kennaway	e29c976a58	devfs is not yet fully MPSAFE - for example, multiple concurrent devfs(8) processes can cause a panic when operating on rulesets. Approved by: phk	2005-07-29 23:00:56 +00:00
Simon L. B. Nielsen	02a4be3f74	Correct devfs ruleset bypass. Submitted by: csjp Reviewed by: phk Security: FreeBSD-SA-05:17.devfs Approved by: cperciva	2005-07-20 13:34:16 +00:00
R. Imura	697ab829fc	[1] unix2doschr() If a character cannot be converted to DOS code page, unix2doschr() returned `0'. As a result, unix2dosfn() was forced to return `0', so we saw a file which was composed of these characters as `Invalid argument'. To correct this, if a character can be converted to Unicode, unix2doschr() now returns `1' which is a magic number to make unix2dosfn() know that the character must be converted to `_'. [2] unix2dosfn() The above-mentioned solution only works if a file has both of Unicode name and DOS code page name. Unicode name would not be recorded if file name can be settled within 11 bytes (DOS short name) and if no conversion from Unix charset to DOS code page has occurred. Thus, FreeBSD can create a file which has only short name, but there is no guarantee that the short name contains allways valid characters because we leave it to people by using mount_msdosfs(8) to select which conversion is used between DOS code page and unix charset. To avoid this, Unicode file name should be recorded unless a character is an ascii character. This is the way Windows XP do. PR: 77074 [1] MFC after: 1 week	2005-07-17 07:10:05 +00:00
Robert Watson	d26dd2d99e	When devfs cloning takes place, provide access to the credential of the process that caused the clone event to take place for the device driver creating the device. This allows cloned device drivers to adapt the device node based on security aspects of the process, such as the uid, gid, and MAC label. - Add a cred reference to struct cdev, so that when a device node is instantiated as a vnode, the cloning credential can be exposed to MAC. - Add make_dev_cred(), a version of make_dev() that additionally accepts the credential to stick in the struct cdev. Implement it and make_dev() in terms of a back-end make_dev_credv(). - Add a new event handler, dev_clone_cred, which can be registered to receive the credential instead of dev_clone, if desired. - Modify the MAC entry point mac_create_devfs_device() to accept an optional credential pointer (may be NULL), so that MAC policies can inspect and act on the label or other elements of the credential when initializing the skeleton device protections. - Modify tty_pty.c to register clone_dev_cred and invoke make_dev_cred(), so that the pty clone credential is exposed to the MAC Framework. While currently primarily focussed on MAC policies, this change is also a prerequisite for changes to allow ptys to be instantiated with the UID of the process looking up the pty. This requires further changes to the pty driver -- in particular, to immediately recycle pty nodes on last close so that the credential-related state can be recreated on next lookup. Submitted by: Andrew Reisse <andrew.reisse@sparta.com> Obtained from: TrustedBSD Project Sponsored by: SPAWAR, SPARTA MFC after: 1 week MFC note: Merge to 6.x, but not 5.x for ABI reasons	2005-07-14 10:22:09 +00:00
Seigo Tanimura	045f25a28d	Regrab dvp only when ISDOTDOT. Approved by: re (scottl)	2005-07-09 13:52:49 +00:00
Jeff Roberson	8b3676f1a1	- Since we don't hold a usecount in pfs_exit we have to get a holdcnt prior to calling vgone() to prevent any races. Sponsored by: Isilon Systems, Inc. Approved by: re (vfs blanket)	2005-07-07 07:33:10 +00:00
Peter Wemm	62919d788b	Jumbo-commit to enhance 32 bit application support on 64 bit kernels. This is good enough to be able to run a RELENG_4 gdb binary against a RELENG_4 application, along with various other tools (eg: 4.x gcore). We use this at work. ia32_reg.[ch]: handle the 32 bit register file format, used by ptrace, procfs and core dumps. procfs_regs.c: vary the format of proc/XXX/regs depending on the client and target application. procfs_map.c: Don't print a 64 bit value to 32 bit consumers, or their sscanf fails. They expect an unsigned long. imgact_elf.c: produce a valid 32 bit coredump for 32 bit apps. sys_process.c: handle 32 bit consumers debugging 32 bit targets. Note that 64 bit consumers can still debug 32 bit targets. IA64 has got stubs for ia32_reg.c. Known limitations: a 5.x/6.x gdb uses get/setcontext(), which isn't implemented in the 32/64 wrapper yet. We also make a tiny patch to gdb pacify it over conflicting formats of ld-elf.so.1. Approved by: re	2005-06-30 07:49:22 +00:00
Peter Wemm	2de92a386e	Conditionally weaken sys_generic.c rev 1.136 to allow certain dubious ioctl numbers in backwards compatability mode. eg: an IOC_IN ioctl with a size of zero. Traditionally this was what you did before IOC_VOID existed, and we had some established users of this in the tree, namely procfs. Certain 3rd party drivers with binary userland components also have this too. This is necessary to have 4.x and 5.x binaries use these ioctl's. We found this at work when trying to run 4.x binaries. Approved by: re	2005-06-30 00:19:08 +00:00
R. Imura	181fc3c6ea	Avoid casting from (int ) to (size_t ) in order to fix udf_iconv on amd64. Reviewed by: scottl MFC after: 2 weeks	2005-06-05 02:09:48 +00:00
Craig Rodrigues	fd225fe4a3	Do not declare a struct as extern, and then implement it as static in the same file. This is not legal C, and GCC 4.0 will issue an error. Reviewed by: phk Approved by: das (mentor)	2005-05-31 14:50:49 +00:00
Christian Brueffer	befb7f333f	Fix three typos in comments. Two of them obtained from OpenBSD. MFC after: 3 days	2005-05-11 21:10:35 +00:00
Alexander Kabaev	42e1d99cc8	Do not dereference dvp pointer before doing a NULL check. Noticed by: Coverity Prevent analysis tool.	2005-05-11 19:08:38 +00:00
Eric Anholt	1493ed4108	Staticize a symbol used only in this file. PR: kern/43613 Submitted by: Matt Emmerton, matt at gsicomp dot on dot ca	2005-05-06 20:47:09 +00:00
Robert Drehmel	9c0c1ab87d	The printf(9) `%p' conversion specifier puts an "0x" in front of the pointer value. Therefore, remove the "0x" from the format string.	2005-05-06 00:15:57 +00:00
Robert Drehmel	e7aabf96a4	Fix our NTFS readdir function. To check a directory's in-use bitmap bit by bit, we use a pointer to an 8 bit wide unsigned value. The index used to dereference this pointer is calculated by shifting the bit index right 3 bits. Then we do a logical AND with the bit# represented by the lower 3 bits of the bit index. This is an idiomatic way of iterating through a bit map with simple bitwise operations. This commit fixes the bug that we only checked bits 3:0 of each 8 bit chunk, because we only used bits 1:0 of the bit index for the bit# in the current 8 bit value. This resulted in files not being returned by getdirentries(2). Change the type of the bit map pointer from `char ' to `u_int8_t '.	2005-05-06 00:06:06 +00:00
Takanori Watanabe	1e8a69609e	Fix breakage on alpha. Pointed out by: hrs via IRC	2005-05-05 07:02:51 +00:00
Takanori Watanabe	4ebd3ea1f6	Make smbfs capable to use 16bit char set in filenames. PR:78110	2005-05-04 15:05:46 +00:00
Jeff Roberson	d65736a1c0	- Set the v_object pointer after a successful VOP_OPEN(). This isn't a perfect solution as the lower vm object can change at unpredictable times if our lower vp happens to be on another unionfs, etc. Submitted by: Oleg Sharoiko <os@rsu.ru>	2005-05-03 11:05:33 +00:00
Jeff Roberson	7b6b7657d2	- In devfs_open() and devfs_close() grab Giant if the driver sets NEEDGIANT. We still have to DROP_GIANT and PICKUP_GIANT when NEEDGIANT is not set because vfs is still sometime entered with Giant held.	2005-05-01 00:56:34 +00:00
Dag-Erling Smørgrav	4cd27a97bc	Fix an old pasto.	2005-04-30 16:27:20 +00:00
Jeff Roberson	cd360e947b	- Mark devfs as MNTK_MPSAFE as I belive it does not require Giant. Sponsored by: Isilon Systems, Inc. Agreed in principle by: phk	2005-04-30 11:24:17 +00:00
Jeff Roberson	568556d720	- Fix several locking problems in unionfs_mount so that it will come closer to passing DEBUG_VFS_LOCKS.	2005-04-27 09:07:13 +00:00
Jeff Roberson	189dd72df3	- Pass the ISOPEN flag down to our lower filesystems. - Remove an erroneous VOP lock assert.	2005-04-27 09:06:06 +00:00
Jeff Roberson	7fd2deacb4	- As this is presently the one and only place where duplicate acquires of the vnode interlock are allowed mark it by passing MTX_DUPOK to this lock operation only. Sponsored by: Isilon Systems, Inc.	2005-04-22 22:42:44 +00:00
David Schultz	23e8fcaf66	Disable negative name caching for msdosfs to work around a bug. Since the name cache is case-sensitive and msdosfs isn't, creating a file 'foo' won't invalidate a negative entry for 'FOO'. There are similar problems related to 8.3 filenames. A better solution is to override VOP_LOOKUP with a method that canonicalizes the name, then calls vfs_cache_lookup(). Unfortunately, it's not quite that simple because vfs_cache_lookup() will call msdosfs_lookup() on a cache miss, and msdosfs_lookup() needs a way to get at the original component name.	2005-04-16 23:47:19 +00:00
Nate Lawson	58ad326be6	Fix mbnambuf support for multi-byte characters. If a substring is larger than WIN_CHARS bytes, we shift the suffix (previous substrings) upwards by the amount this substring exceeds its WIN_CHARS slot. Profiling shows this change is indistinguishable from the previous code at 95% confidence. This bug would result in attempts to access or create files or directories with multi-byte characters returning an error but no data loss. Reported and tested by: avatar MFC after: 3 days	2005-04-16 01:49:50 +00:00
Christian Brueffer	9f07f44971	Correct typo. Obtained from: OpenBSD	2005-04-14 14:40:09 +00:00
Jeff Roberson	4585e3ac5a	- Change all filesystems and vfs_cache to relock the dvp once the child is locked in the ISDOTDOT case. Se vfs_lookup.c r1.79 for details. Sponsored by: Isilon Systems, Inc.	2005-04-13 10:59:09 +00:00
Jeff Roberson	8e82c4cd5f	- Clear VI_OWEINACT before calling vget() with no lock type. We know the node is actually already locked, and VOP_INACTIVE is not desirable in this case.	2005-04-11 11:17:20 +00:00
Jeff Roberson	316ec7bb7f	- Honor the flags argument passed to null_root(). The filesystem below us will decide whether or not to grab a real shared lock.	2005-04-11 11:16:29 +00:00
Xin LI	e8943128a9	Initialize vp before using it. Failing to do this can cause instant panic when trying to access a file on mounted smbfs. Submitted by: takawata at jp freebsd org	2005-04-10 03:17:42 +00:00
Poul-Henning Kamp	f4b423ae60	Give msdosfs a unique inode number which is really the byteoffset of the directory entry. This solves the corruption problem I belive. Regression test script by: silby	2005-04-07 07:55:37 +00:00
Jeff Roberson	9370c333ce	- Fix union's assumptions about when the dvp is unlocked. It is only unlocked in the ISDOTDOT case now, not for all !ISLASTCN lookups.	2005-04-04 09:36:26 +00:00
Poul-Henning Kamp	f4f6abcb4e	Explicitly hold a reference to the cdev we have just cloned. This closes the race where the cdev was reclaimed before it ever made it back to devfs lookup.	2005-03-31 12:19:44 +00:00
Poul-Henning Kamp	9477d73e32	cdev (still) needs per instance uid/gid/mode Add unlocked version of dev_ref() Clean up various stuff in sys/conf.h	2005-03-31 10:29:57 +00:00
Poul-Henning Kamp	eb151cb989	Rename dev_ref() to dev_refl()	2005-03-31 06:51:54 +00:00
Jeff Roberson	ea124bf597	- LK_NOPAUSE is a nop now. Sponsored by: Isilon Systems, Inc.	2005-03-31 04:27:49 +00:00
Jeff Roberson	da1c9cb2b5	- Remove wantparent, it is no longer necessary. An assert in vfs_lookup.c prevents any callers from doing a modifying op without LOCKPARENT or WANTPARENT.	2005-03-29 13:09:42 +00:00
Jeff Roberson	fcc9c112cf	- Remove wantparent, it is no longer necessary. An assert in vfs_lookup.c prevents any callers from doing a DELETE or RENAME without locking the parent.	2005-03-29 13:04:00 +00:00
Jeff Roberson	5c5e51fd9a	- cache_lookup() now locks the new vnode for us to prevent some races. Remove redundant code. Sponsored by: Isilon Systems, Inc.	2005-03-29 13:00:37 +00:00
Jeff Roberson	654f669c9a	- Correct the dprintf format int the _lookup routine. Spotted by: pjd	2005-03-28 14:26:01 +00:00
Jeff Roberson	e4fefa9bd5	- Garbage collect an unused variable.	2005-03-28 13:45:09 +00:00
Jeff Roberson	b2255473fb	- Don't panic if we can't lock a child in lookup, return an error instead. - Only unlock the directory if this is a DOTDOT lookup. Previously this code could have deadlocked if there was a DOTDOT lookup with LOCKPARENT set and another thread was locking the other way up the tree. Sponsored by: Isilon Systems, Inc.	2005-03-28 13:39:16 +00:00
Jeff Roberson	e32addd40d	- Remove unnecessary LOCKPARENT manipulation. Sponsored by: Isilon Systems, Inc.	2005-03-28 13:29:15 +00:00
Jeff Roberson	ce5846dc19	- nwfs_lookup() is no longer responsible for unlocking the dvp, this is handled in vfs_lookup.c. This code was missing PDIRUNLOCK use prior to the removal of PDIRUNLOCK in rev 1.73 of vfs_lookup.c. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:46:33 +00:00
Jeff Roberson	7539637508	- hpfs_lookup() is no longer responsible for unlocking the dvp, this is handled in vfs_lookup.c. This code was missing PDIRUNLOCK use prior to the removal of PDIRUNLOCK in rev 1.73 of vfs_lookup.c. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:40:59 +00:00
Jeff Roberson	eddcb03d02	- We no longer have to bother with PDIRUNLOCK, lookup() handles it for us. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:34:36 +00:00
Jeff Roberson	27ad03cb5d	- We no longer have to bother with PDIRUNLOCK, lookup() handles it for us. - In the ISDOTDOT case we have to unlock the dvp before locking the child, if this fails we must relock dvp before returning an error. This was missing before. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:31:57 +00:00
Jeff Roberson	f6576f194e	- We no longer have to bother with PDIRUNLOCK, lookup() handles it for us. - Network filesystems are written with a special idiom that checks the cache first, and may even unlock dvp before discovering that a network round-trip is required to resolve the name. I believe dvp is prevented from being recycled even in the forced unmount case by the shared lock on the mount point. If not, this code should grow checks for VI_DOOMED after it relocks dvp or it will access NULL v_data fields. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:29:58 +00:00
Jeff Roberson	7d2832e654	- Pass LK_EXCLUSIVE as the lock type to vget in vfs_hash_insert().	2005-03-25 10:51:55 +00:00
Jeff Roberson	a176ceb322	- Update vfs_root implementations to match the new prototype. None of these filesystems will support shared locks until they are explicitly modified to do so. Careful review must be done to ensure that this is safe for each individual filesystem. Sponsored by: Isilon Systems, Inc.	2005-03-24 07:39:03 +00:00
Jeff Roberson	d9b2d9f7a2	- Update vfs_root implementations to match the new prototype. None of these filesystems will support shared locks until they are explicitly modified to do so. Careful review must be done to ensure that this is safe for each individual filesystem. Sponsored by: Isilon Systems, Inc.	2005-03-24 07:36:16 +00:00
Poul-Henning Kamp	7f661c6ba1	Use subr_unit	2005-03-19 08:22:36 +00:00
Poul-Henning Kamp	c049546e16	Also remember to set the fsid here.	2005-03-17 15:15:29 +00:00
Poul-Henning Kamp	e3b803b148	Forgot to replace code to set fsid in vop_getattr.	2005-03-17 14:43:40 +00:00
Poul-Henning Kamp	800b42bde0	Prepare for the final onslaught on devices: Move uid/gid/mode from cdev to cdevsw. Add kind field to use for devd(8) later. Bump both D_VERSION and __FreeBSD_version	2005-03-17 12:07:00 +00:00
Jeff Roberson	ba73105324	- Lock the clearing of v_data so it is safe to inspect it with the interlock. Sponsored by: Isilon Systems, Inc.	2005-03-17 12:00:05 +00:00
Poul-Henning Kamp	51f5ce0c8c	Add two arguments to the vfs_hash() KPI so that filesystems which do not have unique hashes (NFS) can also use it.	2005-03-16 11:20:51 +00:00
Poul-Henning Kamp	9ed94841d9	Remove unused file	2005-03-16 11:10:38 +00:00
Poul-Henning Kamp	fd475cc19d	Remove inode fields previously used for private inode hash tables.	2005-03-16 08:09:52 +00:00
Poul-Henning Kamp	beddd41467	XXX: unnecessary pointer in inode.	2005-03-16 07:21:38 +00:00
Poul-Henning Kamp	7e1dd21ccf	Don't store the disk cdev in all inodes.	2005-03-16 07:17:39 +00:00
Poul-Henning Kamp	e0251bbbe7	Don't hold a reference to the disk vnode for each inode. Eliminate cdev and vnode pointer to the disk from the inodes, the mount holds everything we need.	2005-03-15 21:09:52 +00:00
Poul-Henning Kamp	3b97f388d8	Eliminate cdev pointer in inodes, they're not used or needed. The cdev could have been pulled out of the mountpoint cheaper back when it was used anyway.	2005-03-15 20:57:25 +00:00
Poul-Henning Kamp	de68347b1b	Don't hold a reference on the disk vnode for each inode.	2005-03-15 20:50:58 +00:00
Poul-Henning Kamp	45c26fa2b6	Improve the vfs_hash() API: vput() the unneeded vnode centrally to avoid replicating the vput in all the filesystems.	2005-03-15 20:00:03 +00:00
Jeff Roberson	bc855512c8	- Assume that all lower filesystems now support proper locking. Assert that they set v->v_vnlock. This is true for all filesystems in the tree. - Remove all uses of LK_THISLAYER. If the lower layer is locked, the null layer is locked. We only use vget() to get a reference now. null essentially does no locking. This fixes LOOKUP_SHARED with nullfs. - Remove the special LK_DRAIN considerations, I do not believe this is needed now as LK_DRAIN doesn't destroy the lower vnode's lock, and it's hardly used anymore. - Add one well commented hack to prevent the lowervp from going away while we're in it's VOP_LOCK routine. This can only happen if we're forcibly unmounted while some callers are waiting in the lock. In this case the lowervp could be recycled after we drop our last ref in null_reclaim(). Prevent this with a vhold().	2005-03-15 13:49:33 +00:00
Poul-Henning Kamp	7649bbb0b0	Disable two users of findcdev. They do the wrong thing now and will need to be fixed. In both cases the API should be reengineered to do something (more) sensible.	2005-03-15 12:39:30 +00:00
Jeff Roberson	9feb7408f8	- We have to transfer lockers after reseting our vnlock pointer. Sponsored by: Isilon Systems, Inc.	2005-03-15 11:28:45 +00:00
Poul-Henning Kamp	46d7d4a332	Don't export major,minor, instead export tty name.	2005-03-15 11:05:11 +00:00
Poul-Henning Kamp	6bc6a87cc9	Print devtoname() instead of minor().	2005-03-15 10:01:31 +00:00
Poul-Henning Kamp	40d04a26a0	Fix typo: pointers are not boolean in style(9).	2005-03-15 10:01:14 +00:00
Poul-Henning Kamp	e82ef95c11	Simplify the vfs_hash calling convention.	2005-03-15 08:07:07 +00:00
Dag-Erling Smørgrav	0e3b5c73b2	Hook pfs_lookup() up to vfs_cachedlookup_desc instead of vfs_lookup_desc, as suggested by Matt's comment. Also fix some style and paranoia issues. The entire function could benefit from review by a VFS guru. MFC after: 6 weeks	2005-03-14 16:24:50 +00:00
Dag-Erling Smørgrav	bc593ccd83	Fix two long-standing bugs in pfs_readdir(): Since we used an sbuf of size resid to accumulate dirents, we would end up returning one byte short when we had enough dirents to fill or exceed the size of the sbuf (the last byte being lost to bogus NUL termination) causing the next call to return EINVAL due to an unaligned offset. This went undetected for a long time because I did most of my testing in single-user mode, where there are rarely enough processes to fill the 4096-byte buffer ls(1) uses. The most common symptom of this bug is that tab completion of /proc or /compat/linux/proc does not work properly when many processes are running. Also, a check near the top would return EINVAL if resid was smaller than PFS_DELEN, even if it was 0, which is frequently the case and perfectly allowable. Change the test so that it returns 0 if resid is 0. MFC after: 2 weeks	2005-03-14 16:21:32 +00:00
Dag-Erling Smørgrav	cb5abc7d2d	If PSEUDOFS_TRACE is defined, create a sysctl knob to enable / disable pseudofs call tracing.	2005-03-14 16:06:47 +00:00
Dag-Erling Smørgrav	de52d21a02	fbsdidize.	2005-03-14 15:54:11 +00:00
Poul-Henning Kamp	2f00593534	Use vfs_hash instead of home-rolled.	2005-03-14 14:41:37 +00:00
Poul-Henning Kamp	dfb9f846e9	Use vfs_hash instead of home-rolled.	2005-03-14 13:22:41 +00:00
Poul-Henning Kamp	4e94fafc4f	Use vfs_hash instead of home-rolled. Correct locking around g_vfs_close()	2005-03-14 12:29:39 +00:00
Poul-Henning Kamp	a30fc63b19	Use vfs_hash instead of home-rolling.	2005-03-14 12:24:35 +00:00
Jeff Roberson	c1e7e9ba9b	- VOP_INACTIVE should no longer drop the vnode lock. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:18:47 +00:00
Jeff Roberson	8da0046596	- The VI_DOOMED flag now signals the end of a vnode's relationship with the filesystem. Check that rather than VI_XLOCK. - VOP_INACTIVE should no longer drop the vnode lock. - The vnode lock is required around calls to vrecycle() and vgone(). Sponsored by: Isilon Systems, Inc.	2005-03-13 12:18:25 +00:00
Jeff Roberson	c0f681c21d	- The VI_DOOMED flag now signals the end of a vnode's relationship with the filesystem. Check that rather than VI_XLOCK. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:14:56 +00:00
Jeff Roberson	172ffe319a	- The c_lock in the coda node does not offer any features over the standard vnode lock. Remove the c_lock and use the vn lock in its place. - Keep the coda lock functions so that the debugging information is preserved, but call directly to the vop_std*lock routines for the real functionality. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:09:34 +00:00
Jeff Roberson	3100b70037	- Deadfs may now use the standard vop lock, get rid of dead_lock(). - We no longer have to take the XLOCK state into consideration in any routines. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:06:20 +00:00
David E. O'Brien	bdc172ab8f	Used unsigned version. Submitted by: jmallett	2005-03-12 06:06:04 +00:00
David E. O'Brien	fb2eece6d2	Fix kernel build on 64-bit machines.	2005-03-12 03:50:39 +00:00
Nate Lawson	d81812be67	Correct a last-minute thinko. Instead of copying the nul with the string, nul-terminate the dp->d_name directly and only copy the string.	2005-03-11 23:35:23 +00:00
Nate Lawson	4cdb148352	The mbnambuf routines combine multiple substrings into a single long filename. Each substring is indexed by the windows ID, a sequential one-based value. The previous code was extremely slow, doing a malloc/strcpy/free for each substring. This code optimizes these routines with this in mind, using the ID to index into a single array and concatenating each WIN_CHARS chunk at once. (The last chunk is variable-length.) This code has been tested as working on an FS with difficult filename sizes (255, 13, 26, etc.) It gives a 77.1% decrease in profiled time (total across all functions) and a 73.7% decrease in wall time. Test was "ls -laR > /dev/null". Per-function time savings: mbnambuf_init: -90.7% mbnambuf_write: -18.7% mbnambuf_flush: -67.1% MFC after: 1 month	2005-03-11 23:27:45 +00:00
Poul-Henning Kamp	2647407860	One more bit of the major/minor patch to make ttyname happy as well.	2005-03-10 18:49:17 +00:00
Poul-Henning Kamp	b43ab0e378	Try to fix the mess I made of devname, with the minimal subset of the larger minor/major patch which was posted for testing.	2005-03-10 18:21:34 +00:00
Poul-Henning Kamp	f5af7353c0	Remove kernelside support for devfs rules filtering on major numbers.	2005-03-08 19:51:27 +00:00
Poul-Henning Kamp	a24042b727	Avoid a couple of mutex operations in the process exit path for the common case where procfs have never been mounted. OK'ed by: des	2005-03-01 12:20:49 +00:00
Poul-Henning Kamp	7ce296cf04	Remove debug printout of major/minor numbers, print name instead.	2005-02-27 21:16:26 +00:00
Sam Leffler	3cdbd5fb04	remove dead code Submitted by: Coverity Prevent analysis tool	2005-02-22 19:02:24 +00:00
Poul-Henning Kamp	0454a53d65	We may not have an actual cdev at this point.	2005-02-22 18:17:31 +00:00
Poul-Henning Kamp	aa2f6ddc3f	Reap more benefits from DEVFS: List devfs_dirents rather than vnodes off their shared struct cdev, this saves a pointer field in the vnode at the expense of a field in the devfs_dirent. There are often 100 times more vnodes so this is bargain. In addition it makes it harder for people to try to do stypid things like "finding the vnode from cdev". Since DEVFS handles all VCHR nodes now, we can do the vnode related cleanup in devfs_reclaim() instead of in dev_rel() and vgonel(). Similarly, we can do the struct cdev related cleanup in dev_rel() instead of devfs_reclaim(). rename idestroy_dev() to destroy_devl() for consistency. Add LIST_ENTRY de_alias to struct devfs_dirent. Remove v_specnext from struct vnode. Change si_hlist to si_alist in struct cdev. String new devfs vnodes' devfs_dirent on si_alist when we create them and take them off in devfs_reclaim(). Fix devfs_revoke() accordingly. Also don't clear fields devfs_reclaim() will clear when called from vgone(); Let devfs_reclaim() call dev_rel() instead of vgonel(). Move the usecount tracking from dev_rel() to devfs_reclaim(), and let dev_rel() take a struct cdev argument instead of vnode. Destroy SI_CHEAPCLONE devices in dev_rel() (instead of devfs_reclaim()) when they are no longer used. (This should maybe happen in devfs_close() instead.)	2005-02-22 15:51:07 +00:00
Poul-Henning Kamp	5a98dd4df5	vp->v_id is a private field for the vfs namecache and it is a big mistake that NFS ever started using it and an even bigger that it got copied&pasted to nwfs and smbfs. Replace with use of vhold()/vdrop().	2005-02-22 15:06:30 +00:00
Poul-Henning Kamp	f69d42a1d2	Use vn_printf() instead of home-rolling.	2005-02-22 14:58:59 +00:00
Poul-Henning Kamp	1a1457d427	Make dev_ref() require the dev_lock() to be held and use it from devfs instead of directly frobbing the si_refcount.	2005-02-22 14:41:04 +00:00
David Schultz	0e2b18143f	Replace the workaround for a deadlock bug in Coda with a different workaround that does not rely on vfs_start().	2005-02-20 23:01:57 +00:00
Robert Watson	1bfca411a6	Remove basically unused root_vp pointer in udfmount. MFC after: 1 week Discussed with: scottl	2005-02-18 11:47:51 +00:00
Robert Watson	5d0c377bfe	Conditionalize cd9660 chattiness regarding the nature of the file system mounted (is it Joliet, RockRidge, High Sierra) based on bootverbose. Most file systems don't generate log messages based on details of the file system superblock, and these log messages disrupt sysinstall output during a new install from CD. We may want to explore exposing this status information using nmount() at some point. MFC after: 3 days	2005-02-18 10:49:55 +00:00
Poul-Henning Kamp	4d8ac58b05	Introduce vx_wait{l}() and use it instead of home-rolled versions.	2005-02-17 10:49:51 +00:00
Poul-Henning Kamp	5ece08f57a	Make a SYSCTL_NODE static	2005-02-10 12:23:29 +00:00
Poul-Henning Kamp	66ae53f804	make M_NTFSMNT and ntfs_calccfree() static	2005-02-10 12:09:49 +00:00
Poul-Henning Kamp	9def42f333	Make fdesc_root static	2005-02-10 12:09:15 +00:00
Poul-Henning Kamp	f70f851c60	Make smbfs_debuglevel private.	2005-02-10 12:07:02 +00:00
Poul-Henning Kamp	271c679c17	don't call vprint with NULL.	2005-02-10 12:06:34 +00:00
Poul-Henning Kamp	87c045d5a2	Statize malloc types. Don't call vprint with NULL.	2005-02-10 12:05:06 +00:00
Poul-Henning Kamp	df32e67c73	Statize devfs_ops_f	2005-02-10 12:04:26 +00:00
Poul-Henning Kamp	c711aea6ca	Make a bunch of malloc types static. Found by: src/tools/tools/kernxref	2005-02-10 12:02:37 +00:00
Nate Lawson	2a05fbb949	Unroll the loop for calculating the 8.3 filename checksum. In testing on my P3, microbenchmarks show the unrolled version is 78x faster. In actual use (recursive ls), this gives an average of 9% improvement in system time and 2% improvement in wall time.	2005-02-08 07:51:14 +00:00
Poul-Henning Kamp	61f9cf813f	Remove vop_destroyvobject()	2005-02-07 09:23:34 +00:00
Poul-Henning Kamp	7bf4b73d6c	Deimplement vop_destroyvobject()	2005-02-07 08:23:36 +00:00
Poul-Henning Kamp	49829f2ec5	Remove vop_destroyvobject() initialization.	2005-02-07 08:04:24 +00:00
Peter Edwards	72b3e305af	Unbreak a few filesystems for which vnode_create_vobject() wasn't being called in "open", causing mmap() to fail. Where possible, pass size of file to vnode_create_vobject() rather than having it find it out the hard way via VOP_LOOKUP Reviewed by: phk	2005-01-29 16:23:39 +00:00
Poul-Henning Kamp	a369f34d76	Make filesystems get rid of their own vnodes vnode_pager object in VOP_RECLAIM().	2005-01-28 14:42:17 +00:00
Poul-Henning Kamp	d4eb29ba71	Remove unused argument to vrecycle()	2005-01-28 13:08:21 +00:00
Peter Edwards	174d6a9f73	Make NTFS at least minimally usable after bufobj and GEOM fallout. mmap() on NTFS files was hosed, returning pages offset from the start of the disk rather than the start of the file. (ie, "cp" of a 1-block file would get you a copy of the boot sector, not the data in the file.) The solution isn't ideal, but gives a functioning filesystem. Cached vnode lookup was also broken, resulting in vnode haemorrhage. A lookup on the same file twice would give you two vnodes, and the resulting cached pages. Just recently, mmap() was broken due to a lack of a call to vnode_create_vobject() in ntfs_open(). Discussed with: phk@	2005-01-27 13:50:27 +00:00
Poul-Henning Kamp	84a6975215	Introduce and use g_vfs_close().	2005-01-25 15:52:04 +00:00
Poul-Henning Kamp	729fcf7efb	Take VOP_GETVOBJECT() out to pasture. We use the direct pointer now.	2005-01-25 00:42:16 +00:00
Poul-Henning Kamp	69816ea35e	Kill VOP_CREATEVOBJECT(), it is now the responsibility of the filesystem for a given vnode to create a vnode_pager object if one is needed.	2005-01-25 00:12:24 +00:00
Poul-Henning Kamp	2a967a99c3	Don't implement vop_createvobject(), vop_open() and vop_close() manages this for nullfs now.	2005-01-24 23:54:45 +00:00
Poul-Henning Kamp	dcff5b1440	Don't call VOP_CREATEVOBJECT(), it's the responsibility of the filesystem which owns the vnode.	2005-01-24 23:53:54 +00:00
Poul-Henning Kamp	c683c4ee04	Add null_open() and null_close() which calls null_bypass() and managed the v_object pointer.	2005-01-24 22:56:24 +00:00
Poul-Henning Kamp	625d4bc03a	Create a vp->v_object in VFS_FHTOVP() if we want to be exportable with NFS. We are moving responsibility for creating the vnode_pager object into the filesystems which own the vnode, and this is one of the places we have to cover. We call vnode_create_vobject() directly because we own the vnode. If we can get the size easily, pass it as an argument to save the call to VOP_GETATTR() in vnode_create_vobject()	2005-01-24 21:51:19 +00:00
Poul-Henning Kamp	35764be39e	Kill the VV_OBJBUF and test the v_object for NULL instead.	2005-01-24 13:13:57 +00:00
Poul-Henning Kamp	d34dd851b8	Remove "register" keywords.	2005-01-24 12:37:51 +00:00
Poul-Henning Kamp	a515233f47	Style: Remove the commented out vop_foo_args replicas.	2005-01-24 11:49:41 +00:00
Poul-Henning Kamp	303793b564	whitespace nit	2005-01-19 09:07:56 +00:00
Poul-Henning Kamp	5873f57b29	Remove unused coda_fbsd_getpages()	2005-01-19 08:24:53 +00:00
Scott Long	a4d629e32d	Fix an incorrect cast. Submitted by: Andriy Gapon MFC-after: 3 days.	2005-01-18 10:15:23 +00:00
Scott Long	444acc1655	NULL-terminate the . and .. directory entries. Apparently some tools ignore d_namlen and assume that d_name is null-terminated. Submitted by: Andriy Gapon	2005-01-14 16:35:34 +00:00
Scott Long	43bc24bf5a	Replace the min() macro with a test that doesn't truncate the 64-bit values that are used. Thanks to Bruce Evans for pointing this out.	2005-01-14 16:24:31 +00:00
Poul-Henning Kamp	e50508df66	Eliminate unused and constant arguments to smbfs_vinvalbuf()	2005-01-14 08:52:55 +00:00
Poul-Henning Kamp	bf0063b87d	Eliminate constant and unused arguments to nwfs_vinvalbuf()	2005-01-14 08:09:42 +00:00
Poul-Henning Kamp	7c0745eeae	Eliminate unused and unnecessary "cred" argument from vinvalbuf()	2005-01-14 07:33:51 +00:00
Poul-Henning Kamp	83c6439714	Whitespace in vop_vector{} initializations.	2005-01-13 18:59:48 +00:00
Poul-Henning Kamp	e39db32ab0	Ditch vfs_object_create() and make the callers call VOP_CREATEVOBJECT() directly.	2005-01-13 12:25:19 +00:00
Poul-Henning Kamp	63f89abf4a	Change the generated VOP_ macro implementations to improve type checking and KASSERT coverage. After this check there is only one "nasty" cast in this code but there is a KASSERT to protect against the wrong argument structure behind that cast. Un-inlining the meat of VOP_FOO() saves 35kB of text segment on a typical kernel with no change in performance. We also now run the checking and tracing on VOP's which have been layered by nullfs, umapfs, deadfs or unionfs. Add new (non-inline) VOP_FOO_AP() functions which take a "struct foo_args" argument and does everything the VOP_FOO() macros used to do with checks and debugging code. Add KASSERT to VOP_FOO_AP() check for argument type being correct. Slim down VOP_FOO() inline functions to just stuff arguments into the struct foo_args and call VOP_FOO_AP(). Put function pointer to VOP_FOO_AP() into vop_foo_desc structure and make VCALL() use it instead of the current offsetoff() hack. Retire vcall() which implemented the offsetoff() Make deadfs and unionfs use VOP_FOO_AP() calls instead of VCALL(), we know which specific call we want already. Remove unneeded arguments to VCALL() in nullfs and umapfs bypass functions. Remove unused vdesc_offset and VOFFSET(). Generally improve style/readability of the generated code.	2005-01-13 07:53:01 +00:00
Scott Long	9d32fde894	Use off_t when passing and calculating file offsets. While a single extent in UDF is only 32 bits, multiple extents can exist in a file. Also clean up some minor whitespace problems. Submitted by: John Wehle	2005-01-12 06:42:13 +00:00
Scott Long	d1022c068e	Don't allow reads past the end of a file. Submitted by: John Wehle, Andriy Gapon MFC After: 3 days	2005-01-12 06:17:01 +00:00
Poul-Henning Kamp	7164e8f291	Silently ignore forced argument to unmount.	2005-01-11 12:02:26 +00:00
Poul-Henning Kamp	0391e5a151	Wrap the bufobj operations in macros: BO_STRATEGY() and BO_WRITE()	2005-01-11 09:10:46 +00:00
Poul-Henning Kamp	8df6bac4c7	Remove the unused credential argument from VOP_FSYNC() and VFS_SYNC(). I'm not sure why a credential was added to these in the first place, it is not used anywhere and it doesn't make much sense: The credentials for syncing a file (ability to write to the file) should be checked at the system call level. Credentials for syncing one or more filesystems ("none") should be checked at the system call level as well. If the filesystem implementation needs a particular credential to carry out the syncing it would logically have to the cached mount credential, or a credential cached along with any delayed write data. Discussed with: rwatson	2005-01-11 07:36:22 +00:00
Poul-Henning Kamp	b630d6f15a	whitespace	2005-01-10 13:09:33 +00:00
Robert Watson	f644bbc45c	Annotate that pfs_exit() always acquires and releases two mutexes for every process exist, even if procfs isn't mounted. And one of those mutexes is Giant. No immediate thoughts on fixing this.	2005-01-08 04:56:38 +00:00
Warner Losh	86cb007f9f	/* -> /*- for copyright notices, minor format tweaks as necessary	2005-01-06 22:18:23 +00:00
Warner Losh	d167cf6f3a	/* -> /*- for copyright notices, minor format tweaks as necessary	2005-01-06 18:10:42 +00:00
Warner Losh	5de2b5750c	Start each of the license/copyright comments with /*-	2005-01-05 23:35:00 +00:00
Poul-Henning Kamp	59f69ba49f	Unsupport forceful unmounts of DEVFS. After disscussing things I have decided to take the easy and consistent 90% solution instead of aiming for the very involved 99% solution. If we allow forceful unmounts of DEVFS we need to decide how to handle the devices which are in use through this filesystem at the time. We cannot just readopt the open devices in the main /dev instance since that would open us to security issues. For the majority of the devices, this is relatively straightforward as we can just pretend they got revoke(2)'ed. Some devices get tricky: /dev/console and /dev/tty for instance does a sort of recursive open of the real console device. Other devices may be mmap'ed (kill the processes ?). And then there are disk devices which are mounted. The correct thing here would be to recursively unmount the filesystems mounte from devices from our DEVFS instance (forcefully) and if this succeeds, complete the forcefully unmount of DEVFS. But if one of the forceful unmounts fail we cannot complete the forceful unmount of DEVFS, but we are likely to already have severed a lot of stuff in the process of trying. Event attempting this would be a lot of code for a very far out corner-case which most people would never see or get in touch with. It's just not worth it.	2005-01-04 07:52:26 +00:00
Poul-Henning Kamp	50a36c111f	Be consistent about flag values passed to device drivers read/write methods: Read can see O_NONBLOCK and O_DIRECT. Write can see O_NONBLOCK, O_DIRECT and O_FSYNC. In addition O_DIRECT is shadowed as IO_DIRECT for now for backwards compatibility.	2004-12-22 17:05:44 +00:00
Poul-Henning Kamp	10eee285f7	Shuffle numeric values of the IO_* flags to match the O_* flags from fcntl.h. This is in preparation for making the flags passed to device drivers be consistently from fcntl.h for all entrypoints. Today open, close and ioctl uses fcntl.h flags, while read and write uses vnode.h flags.	2004-12-22 16:25:50 +00:00
Poul-Henning Kamp	e87047b437	We can only ever get to vgonechrl() from a devfs vnode, so we do not need to reassign the vp->v_op to devfs_specops, we know that is the value already. Make devfs_specops private to devfs.	2004-12-20 21:34:29 +00:00
Poul-Henning Kamp	2c0220129d	Add a couple of KASSERTS to try to diagnose a problem reported.	2004-12-20 21:12:11 +00:00
Poul-Henning Kamp	2a9e0c3216	Be a bit more assertive about vnode bypass.	2004-12-14 09:32:18 +00:00
Suleiman Souhlal	3d96167a54	Exporting of NTFS filesystem broke in rev 1.70. Fix it. Approved by: phk, grehan (mentor)	2004-12-13 16:21:48 +00:00
Poul-Henning Kamp	5cb471d04d	Don't forget to bypass vnodes in corner cases. Found by: kkenn and ports/shell/zsh Thanks to: jeffr	2004-12-13 10:07:57 +00:00
Poul-Henning Kamp	1dc4727ea3	Another FNONBLOCK -> O_NONBLOCK. Don't unconditionally set IO_UNIT to device drivers in write: nobody checks it, and since it was always set it did not carry information anyway.	2004-12-13 07:41:19 +00:00
Poul-Henning Kamp	ab9caf9d67	Use O_NONBLOCK instead of FNONBLOCK alias.	2004-12-13 07:37:29 +00:00
Poul-Henning Kamp	f0d5cba935	Explicit panic in vop_read/vop_write for devices	2004-12-13 07:13:21 +00:00
Poul-Henning Kamp	dce357b112	Explicitly panic vop_read/vop_write on fifos.	2004-12-13 07:07:50 +00:00
Poul-Henning Kamp	e98fdc0d03	Don't deref NULL if no charset-conversion is specified. Return correct vnode in vop_bmap()	2004-12-12 12:02:34 +00:00
Poul-Henning Kamp	269c902f17	Handle MNT_UPDATE export requests first and return so we do not interpret the rest of the msdosfs_args structure. Detected by: marcel	2004-12-11 20:37:48 +00:00
Poul-Henning Kamp	708394ec72	typo	2004-12-11 12:45:24 +00:00
Poul-Henning Kamp	6366900a0f	First save from editor, then commit.	2004-12-07 15:25:36 +00:00
Poul-Henning Kamp	5c83b5551c	Fix exports.	2004-12-07 15:13:35 +00:00
Poul-Henning Kamp	20a92a18f1	The remaining part of nmount/omount/rootfs mount changes. I cannot sensibly split the conversion of the remaining three filesystems out from the root mounting changes, so in one go: cd9660: Convert to nmount. Add omount compat shims. Remove dedicated rootfs mounting code. Use vfs_mountedfrom() Rely on vfs_mount.c calling VFS_STATFS() nfs(client): Convert to nmount (the simple way, mount_nfs(8) is still necessary). Add omount compat shims. Drop COMPAT_PRELITE2 mount arg compatibility. ffs: Convert to nmount. Add omount compat shims. Remove dedicated rootfs mounting code. Use vfs_mountedfrom() Rely on vfs_mount.c calling VFS_STATFS() Remove vfs_omount() method, all filesystems are now converted. Remove MNTK_WANTRDWR, handling RO/RW conversions is a filesystem task, and they all do it now. Change rootmounting to use DEVFS trampoline: vfs_mount.c: Mount devfs on /. Devfs needs no 'from' so this is clean. symlink /dev to /. This makes it possible to lookup /dev/foo. Mount "real" root filesystem on /. Surgically move the devfs mountpoint from under the real root filesystem onto /dev in the real root filesystem. Remove now unnecessary getdiskbyname(). kern_init.c: Don't do devfs mounting and rootvnode assignment here, it was already handled by vfs_mount.c. Remove now unused bdevvp(), addaliasu() and addalias(). Put the few necessary lines in devfs where they belong. This eliminates the second-last source of bogo vnodes, leaving only the lemming-syncer. Remove rootdev variable, it doesn't give meaning in a global context and was not trustworth anyway. Correct information is provided by statfs(/).	2004-12-07 08:15:41 +00:00
Poul-Henning Kamp	def91cf267	Use vfs_mountedfrom(). Since VFS_STATFS() always calls the filesystem with mp->mnt_stat now, the vfs_statfs method is now a no-op. Explain this in a comment.	2004-12-06 20:52:46 +00:00
Poul-Henning Kamp	1a6cf6a3ad	Trust vfs_mount to call VFS_STATFS() on all mounts.	2004-12-06 20:31:36 +00:00
Poul-Henning Kamp	d14c8441e9	Convert to nmount. Add omount compat. Unpropagate the sm_args function into the runtime part.	2004-12-06 20:31:08 +00:00
Poul-Henning Kamp	bd50907c91	Convert to nmount. Add omount compat. Use vfs_mountedon(). Rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:23:51 +00:00
Poul-Henning Kamp	c4048cf07f	Convert to nmount. Add omount compat. Same comment about charset conversions apply. Use vfs_mountedfrom(). Rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:22:16 +00:00
Poul-Henning Kamp	55dca57ef2	Convert to nmount. Add backwards compat cmount method. Same comment as msdosfs applies: It would be nice if we had generic option names for charset conversions. Use vfs_mountefrom(). Rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:14:20 +00:00
Poul-Henning Kamp	526463736e	Convert nwfs to nmount, but take the low road: There is no way this is ever going to work without a dedicated mount_nwfs(8) program so simply stick struct nwfs_args into a nmount argument and leave it at that.	2004-12-06 20:11:56 +00:00
Alexander Kabaev	f6968c4a99	Fix a typo in PFS_TRACE. PR: kern/74461 Submitted by: Craig Rodrigues <rodrigc at crodrigues.org>	2004-12-06 20:07:17 +00:00
Poul-Henning Kamp	935ab476fa	ufs vfs_mountedon(), rely on vfs_mount.c calling VFS_STATFS()	2004-12-06 20:03:58 +00:00
Poul-Henning Kamp	7ab8c8c03c	Use vfs_mountedfrom(), rely on vfs_mount.c calling VFS_STATFS().	2004-12-06 20:02:13 +00:00
Poul-Henning Kamp	a1f5fe1538	Use vfs_mountedfrom() and rely on vfs_mount.c to call VFS_STATFS()	2004-12-06 19:54:31 +00:00
Poul-Henning Kamp	7df2fc80f8	Convert coda to nmount.	2004-12-06 19:46:02 +00:00
Poul-Henning Kamp	6a4b48f488	Convert msdosfs to nmount. Add a vfs_cmount() function which converts omount argument stucture to nmount arguments. Convert vfs_omount() to vfs_mount() and parse nmount arguments. This is 100% compatible with existing userland. Later on, but before userland gets converted to nmount we may want to revisit the names of the mountoptions, for instance it may make sense to use consistent options for charset conversion etc.	2004-12-06 19:05:48 +00:00
Poul-Henning Kamp	bed8b887ea	Fix warning	2004-12-06 12:34:28 +00:00
Poul-Henning Kamp	743312367a	VFS_STATFS(mp, ...) is mostly called with &mp->mnt_stat, but a few cases doesn't. Most of the implementations have grown weeds for this so they copy some fields from mnt_stat if the passed argument isn't that. Fix this the cleaner way: Always call the implementation on mnt_stat and copy that in toto to the VFS_STATFS argument if different.	2004-12-05 22:41:02 +00:00
Poul-Henning Kamp	91e691c2d5	Remove embryonic rootfs mounting facility. In the near future rootfs mounting will not require special handling in the filesystems.	2004-12-04 09:57:38 +00:00
Poul-Henning Kamp	4b44037433	Remove the de_devvp and stop VREF'ing it for every vnode we create.	2004-12-02 10:09:33 +00:00
Poul-Henning Kamp	aec0fb7b40	Back when VOP_* was introduced, we did not have new-style struct initializations but we did have lofty goals and big ideals. Adjust to more contemporary circumstances and gain type checking. Replace the entire vop_t frobbing thing with properly typed structures. The only casualty is that we can not add a new VOP_ method with a loadable module. History has not given us reason to belive this would ever be feasible in the the first place. Eliminate in toto VOCALL(), vop_t, VNODEOP_SET() etc. Give coda correct prototypes and function definitions for all vop_()s. Generate a bit more data from the vnode_if.src file: a struct vop_vector and protype typedefs for all vop methods. Add a new vop_bypass() and make vop_default be a pointer to another struct vop_vector. Remove a lot of vfs_init since vop_vector is ready to use from the compiler. Cast various vop_mumble() to void * with uppercase name, for instance VOP_PANIC, VOP_NULL etc. Implement VCALL() by making vdesc_offset the offsetof() the relevant function pointer in vop_vector. This is disgusting but since the code is generated by a script comparatively safe. The alternative for nullfs etc. would be much worse. Fix up all vnode method vectors to remove casts so they become typesafe. (The bulk of this is generated by scripts)	2004-12-01 23:16:38 +00:00
Colin Percival	691b3b0df9	Fix unvalidated pointer dereference. This is FreeBSD-SA-04:17.procfs.	2004-12-01 21:33:02 +00:00
Poul-Henning Kamp	22408f729e	hpfs_lookup() should have a vop_cachedlookup_t prototype an corresponding argument.	2004-12-01 20:24:01 +00:00
Poul-Henning Kamp	0731e6dfb7	Correctly prototype union_write with vop_write_t, not vop_read_t.	2004-12-01 19:15:00 +00:00
Poul-Henning Kamp	6fde64c778	Mechanically change prototypes for vnode operations to use the new typedefs.	2004-12-01 12:24:41 +00:00
Poul-Henning Kamp	ce59d2149d	Ignore MNT_NODEV, it is implicit in choice of filesystem these days.	2004-11-26 07:37:42 +00:00
Poul-Henning Kamp	c96c1bebe3	Eliminate null_open() and use instead null_bypass(). Null_open() was only here to handle MNT_NODEV, but since that does not affect any filesystems anymore, it could only have any effect if you nullfs mounted a devfs but didn't want devices to show up. If you need that, there are easier ways.	2004-11-26 07:18:28 +00:00
Poul-Henning Kamp	964ebefd8d	Use system wide no-op vfs_start function.	2004-11-25 09:11:27 +00:00
Poul-Henning Kamp	75ad04b4f6	Add dropped implementation of ioctl for fifos.	2004-11-18 17:18:11 +00:00
Poul-Henning Kamp	003e18aef4	Make vnode bypass for fifos (read, write, poll) mandatory.	2004-11-17 07:30:02 +00:00
Poul-Henning Kamp	ea566ae2a5	Make vnode bypass for devices mandatory.	2004-11-17 07:18:49 +00:00
Poul-Henning Kamp	8352b1925d	Make vnode bypass the default for devices. Can be disabled in case of problems with vfs.devfs.fops=0 in loader.conf	2004-11-15 22:11:09 +00:00
Poul-Henning Kamp	d6d64f0f2c	Add file ops to fifofs so that we can bypass vnodes (and Giant) for the heavy-duty operations (read, write, poll/select, kqueue). Disabled for now, enable with "vfs.fifofs.fops=1" in loader.conf.	2004-11-15 14:51:44 +00:00
Poul-Henning Kamp	9c83534dd8	Make VOP_BMAP return a struct bufobj for the underlying storage device instead of a vnode for it. The vnode_pager does not and should not have any interest in what the filesystem uses for backend. (vfs_cluster doesn't use the backing store argument.)	2004-11-15 09:18:27 +00:00
Poul-Henning Kamp	49b7607eba	Integrate most of vop_revoke() into devfs_revoke() where it belongs.	2004-11-13 23:37:29 +00:00
Poul-Henning Kamp	aac5167c38	Add the devfs_fp_check() function which helps us get from a struct file to a cdev and a devsw, doing all the relevant checks along the way. Add the check to see if fp->f_vnode->v_rdev differs from our cached fp->f_data copy of our cdev. If it does the device was revoked and we return ENXIO.	2004-11-13 23:21:54 +00:00
Poul-Henning Kamp	ecbcedb99f	VOP_REVOKE() is only ever for VCHR vnodes, so unionfs does not need a vop_revoke() method.	2004-11-13 22:56:26 +00:00
Poul-Henning Kamp	1ecf144493	fifos doesn't need a vop_lookup, the default will do fine.	2004-11-13 18:51:13 +00:00
Poul-Henning Kamp	124e4c3be8	Introduce an alias for FILEDESC_{UN}LOCK() with the suffix _FAST. Use this in all the places where sleeping with the lock held is not an issue. The distinction will become significant once we finalize the exact lock-type to use for this kind of case.	2004-11-13 11:53:02 +00:00
Tom Rhodes	18192f69c7	Remove stale comment after previous commit. Noticed by: pjd	2004-11-09 23:19:21 +00:00
Poul-Henning Kamp	282d0382ac	Detect root mount attempts on the flag, not on the NULL path.	2004-11-09 22:21:52 +00:00
Poul-Henning Kamp	64042a76b6	Refuse attempts to mount root filesystem	2004-11-09 22:21:10 +00:00
Poul-Henning Kamp	b0aed5267e	Refuse attemps to mount root filesystem	2004-11-09 22:14:57 +00:00
Poul-Henning Kamp	56dd3a6182	Add optional device vnode bypass to DEVFS. The tunable vfs.devfs.fops controls this feature and defaults to off. When enabled (vfs.devfs.fops=1 in loader), device vnodes opened through a filedescriptor gets a special fops vector which instead of the detour through the vnode layer goes directly to DEVFS. Amongst other things this allows us to run Giant free read/write to device drivers which have been weaned off D_NEEDGIANT. Currently this means /dev/null, /dev/zero, disks, (and maybe the random stuff ?) On a 700MHz K7 machine this doubles the speed of dd if=/dev/zero of=/dev/null bs=1 count=1000000 This roughly translates to shaving 2usec of each read/write syscall. The poll/kqfilter paths need more work before they are giant free, this work is ongoing in p4::phk_bufwork Please test this and report any problems, LORs etc.	2004-11-08 10:46:47 +00:00
Poul-Henning Kamp	5349c79d75	Properly implement a default version of VOP_GETWRITEMOUNT. Remove improper access to vop_stdgetwritemount() which should and will instead rely on the VOP default path.	2004-11-06 11:41:22 +00:00
Poul-Henning Kamp	ecc14aae12	Add back securelevel check for disks. XXX: This should live in geom_dev.c but we don't have access to the cred there. XXX: XXX: This may not matter anymore since filesystems use geom_vfs.	2004-11-04 09:17:55 +00:00
Poul-Henning Kamp	c7aaa71ce3	s/ffs/ntfs/ Fix error handling to not use VOP_CLOSE() on the disk. Spotted by: tegge	2004-11-04 07:18:54 +00:00
Poul-Henning Kamp	e1c6cbef33	Make a more whole-hearted attempt at GEOM'ifying NTFS. I must have been sleepy when I did the first pass. Spotted by: tegge	2004-11-03 21:36:41 +00:00
Poul-Henning Kamp	4cea3289da	Don't give disks special treatment, they don't come this way anymore.	2004-10-29 11:10:55 +00:00
Poul-Henning Kamp	c108bb741c	Remove VOP_SPECSTRATEGY() from the system.	2004-10-29 10:59:28 +00:00
Poul-Henning Kamp	5cdfa40c6b	Move NTFS to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:43:45 +00:00
Poul-Henning Kamp	a96d2ea768	Move HPFS to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:43:07 +00:00
Poul-Henning Kamp	bf7e2ae1c4	Move CD9660 to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:41:44 +00:00
Poul-Henning Kamp	429c018a9f	Move UDF to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:40:58 +00:00
Poul-Henning Kamp	9a135592e2	Move MSDOSFS to GEOM backing instead of DEVFS. For details, please see src/sys/ufs/ffs/ffs_vfsops.c 1.250.	2004-10-29 10:40:14 +00:00
Poul-Henning Kamp	6afb3b1c37	Give dev_strategy() an explict cdev argument in preparation for removing buf->b-dev. Put a bio between the buf passed to dev_strategy() and the device driver strategy routine in order to not clobber fields in the buf. Assert copyright on vfs_bio.c and update copyright message to canonical text. There is no legal difference between John Dysons two-clause abbreviated BSD license and the canonical text.	2004-10-29 07:16:37 +00:00
Poul-Henning Kamp	f00f5d71c2	Reduce the locking activity by epsilon by checking VNON condition before releasing the mountlock.	2004-10-28 08:22:11 +00:00
Poul-Henning Kamp	45628dd373	What can I say: don't allow people to mount DEVFS with option "nodev".	2004-10-28 06:03:25 +00:00
Poul-Henning Kamp	d83b7498a4	Eliminate unnecessary KASSERTs. Don't use bp->b_vp in VOP_STRATEGY: the vnode is passed in as an argument.	2004-10-27 06:48:21 +00:00
Poul-Henning Kamp	5d9d81e7ea	Put the I/O block size in bufobj->bo_bsize. We keep si_bsize_phys around for now as that is the simplest way to pull the number out of disk device drivers in devfs_open(). The correct solution would be to do an ioctl(DIOCGSECTORSIZE), but the point is probably mooth when filesystems sit on GEOM, so don't bother for now.	2004-10-26 07:39:12 +00:00
Poul-Henning Kamp	156cb26583	Loose the v_dirty* and v_clean* alias macros. Check the count field where we just want to know the full/empty state, rather than using TAILQ_EMPTY() or TAILQ_FIRST().	2004-10-25 09:14:03 +00:00
Poul-Henning Kamp	ff7c5a4880	Alas, poor SPECFS! -- I knew him, Horatio; A filesystem of infinite jest, of most excellent fancy: he hath taught me lessons a thousand times; and now, how abhorred in my imagination it is! my gorge rises at it. Here were those hacks that I have curs'd I know not how oft. Where be your kludges now? your workarounds? your layering violations, that were wont to set the table on a roar? Move the skeleton of specfs into devfs where it now belongs and bury the rest.	2004-10-22 09:59:37 +00:00
John Baldwin	78c85e8dfc	Rework how we store process times in the kernel such that we always store the raw values including for child process statistics and only compute the system and user timevals on demand. - Fix the various kern_wait() syscall wrappers to only pass in a rusage pointer if they are going to use the result. - Add a kern_getrusage() function for the ABI syscalls to use so that they don't have to play stackgap games to call getrusage(). - Fix the svr4_sys_times() syscall to just call calcru() to calculate the times it needs rather than calling getrusage() twice with associated stackgap, etc. - Add a new rusage_ext structure to store raw time stats such as tick counts for user, system, and interrupt time as well as a bintime of the total runtime. A new p_rux field in struct proc replaces the same inline fields from struct proc (i.e. p_[isu]ticks, p_[isu]u, and p_runtime). A new p_crux field in struct proc contains the "raw" child time usage statistics. ruadd() has been changed to handle adding the associated rusage_ext structures as well as the values in rusage. Effectively, the values in rusage_ext replace the ru_utime and ru_stime values in struct rusage. These two fields in struct rusage are no longer used in the kernel. - calcru() has been split into a static worker function calcru1() that calculates appropriate timevals for user and system time as well as updating the rux_[isu]u fields of a passed in rusage_ext structure. calcru() uses a copy of the process' p_rux structure to compute the timevals after updating the runtime appropriately if any of the threads in that process are currently executing. It also now only locks sched_lock internally while doing the rux_runtime fixup. calcru() now only requires the caller to hold the proc lock and calcru1() only requires the proc lock internally. calcru() also no longer allows callers to ask for an interrupt timeval since none of them actually did. - calcru() now correctly handles threads executing on other CPUs. - A new calccru() function computes the child system and user timevals by calling calcru1() on p_crux. Note that this means that any code that wants child times must now call this function rather than reading from p_cru directly. This function also requires the proc lock. - This finishes the locking for rusage and friends so some of the Giant locks in exit1() and kern_wait() are now gone. - The locking in ttyinfo() has been tweaked so that a shared lock of the proctree lock is used to protect the process group rather than the process group lock. By holding this lock until the end of the function we now ensure that the process/thread that we pick to dump info about will no longer vanish while we are trying to output its info to the console. Submitted by: bde (mostly) MFC after: 1 month	2004-10-05 18:51:11 +00:00
Takanori Watanabe	6e4c3467ce	Minor Bug fix. Some file was not translated.	2004-10-05 16:53:37 +00:00
Takanori Watanabe	919f5630ec	Fix unionfs problems when a directory is mounted on other directory with different file systems. This may cause ill things with my previous fix. Now it translate fsid of direct child of mount point directory only. Pointed out by: Uwe Doering	2004-10-05 05:59:29 +00:00
Takanori Watanabe	d354520ebc	Fix a problem when you try to mount a directory on another directory belongs to the same filesystem. In this problem, getcwd(3) will fail. I found the problem two years ago and I have forgotten to merge. http://docs.FreeBSD.org/cgi/mid.cgi?200202251435.XAA91094	2004-10-02 17:17:04 +00:00
David Schultz	616b5f90d3	Don't PHOLD() the target process in procfs, since this is already done in pseudofs. Moreover, PHOLD() may block between the p_candebug() access check and the actual operation.	2004-10-01 05:01:17 +00:00
Poul-Henning Kamp	891822a853	XXX mark two places where we do not hold a threadcount on the dev when frobbing the cdevsw. In both cases we examine only the cdevsw and it is a good question if we weren't better off copying those properties into the cdev in the first place. This question will be revisited.	2004-09-24 08:32:36 +00:00
Poul-Henning Kamp	9bd188b936	Hold proper thread count while frobbing drivers ioctl.	2004-09-24 07:24:02 +00:00
Poul-Henning Kamp	bd8a0d70f4	Remove devsw() call missed in last commit.	2004-09-24 07:08:33 +00:00
Poul-Henning Kamp	5ef8cac184	Use def_re[fl]thread(). Retire various old compatibility helpers.	2004-09-24 05:58:06 +00:00
Poul-Henning Kamp	1a52a73d68	Eliminate DEV_STRATEGY() macro: call dev_strategy() directly. Make dev_strategy() handle errors and departing devices properly.	2004-09-23 14:45:04 +00:00
Poul-Henning Kamp	d0c90fe668	Do not use devsw() but si_devsw direction. This is still bogus but a fair bit less so.	2004-09-23 12:19:24 +00:00
Poul-Henning Kamp	a0e78d2eb0	Do not refcount the cdevsw, but rather maintain a cdev->si_threadcount of the number of threads which are inside whatever is behind the cdevsw for this particular cdev. Make the device mutex visible through dev_lock() and dev_unlock(). We may want finer granularity later. Replace spechash_mtx use with dev_lock()/dev_unlock().	2004-09-23 07:17:41 +00:00
Poul-Henning Kamp	bc710003ac	Pointy hat please! Refuse VCHR not VREG.	2004-09-22 18:18:26 +00:00
Poul-Henning Kamp	a367987828	De support opening device nodes on CD9660 filesystems. They are still visible, they can still be seen, but they cannot be opened. Use DEVFS for that.	2004-09-21 08:42:37 +00:00
Poul-Henning Kamp	d705e025d0	The getpages VOP was a good stab at getting scatter/gather I/O without too much kernel copying, but it is not the right way to do it, and it is in the way for straightening out the buffer cache. The right way is to pass the VM page array down through the struct bio to the disk device driver and DMA directly in to/out off the physical memory. Once the VM/buf thing is sorted out it is next on the list. Retire most of vnode method. ffs_getpages(). It is not clear if what is left shouldn't be in the default implementation which we now fall back to. Retire specfs_getpages() as well, as it has no users now.	2004-09-19 08:14:55 +00:00
Poul-Henning Kamp	08dbd671ff	Remove unused B_WRITEINPROG flag	2004-09-15 21:49:22 +00:00
Poul-Henning Kamp	883d3c0c07	Remove the buffercache/vnode side of BIO_DELETE processing in preparation for integration of p4::phk_bufwork. In the future, local filesystems will talk to GEOM directly and they will consequently be able to issue BIO_DELETE directly. Since the removal of the fla driver, BIO_DELETE has effectively been a no-op anyway.	2004-09-13 06:50:42 +00:00
Tim J. Robbins	d676af371d	Reduce the size of struct defid's defid_dirclust, defid_dirofs and (disabled) defid_gen members from u_long to u_int32_t so that alignment requirements don't cause the structure to become larger than struct fid on LP64 platforms. This fixes NFS exports of msdos filesystems on at least amd64. PR: 71173	2004-09-08 13:03:19 +00:00
Tim J. Robbins	6a5bf04a5b	Merge from NetBSD: Fix a problem in previous: we can't blindly assume that we have wincnt entries available at the offset the file has been found. If the dos directory entry is not preceded by appropriate number of long name entries (happens e.g. when the filesystem is corrupted, or when the filename complies to DOS rules and doesn't use any long name entry), we would overwrite random directory entries. There are still some problems, the whole thing has to be revisited and solved right. Submitted by: Xin LI	2004-09-08 11:25:41 +00:00
Tim J. Robbins	d23af19a71	Merge from NetBSD: Fix a panic that occurred when trying to traverse a corrupt msdosfs filesystem. With this particular corruption, the code in pcbmap() would compute an offset into an array that was way out of bounds, so check the bounds before trying to access and return an error if the offset would be out of bounds. Submitted by: Xin LI	2004-09-08 10:57:09 +00:00
Poul-Henning Kamp	1affa3adc8	Create simple function init_va_filerev() for initializing a va_filerev field. Replace three instances of longhaired initialization va_filerev fields. Added XXX comment wondering why we don't use random bits instead of uptime of the system for this purpose.	2004-09-07 09:17:05 +00:00
Poul-Henning Kamp	066a8fea81	Explicitly pass vnode to smbfs_doio() function.	2004-09-07 08:53:28 +00:00
Poul-Henning Kamp	7ee3985c57	Explicitly pass the vnode to the nw_doio() function.	2004-09-07 08:53:03 +00:00
Tim J. Robbins	82c0aec8de	Temporarily back out revision 1.77. This changed cd9660_getattr() and cd9660_readdir() to return the address of the file's first data block as the inode number instead of the address of the directory entry, but neglected to update cd9660_vget_internal() for the new inode numbering scheme. Since the NFS server calls VFS_VGET (cd9660_vget()) with inode numbers returned through VOP_READDIR (cd9660_readdir()) when servicing a READDIRPLUS request, these two interfaces must agree on the numbering scheme; failure to do so caused panics and/or bogus information about the entries to be returned to clients using READDIRPLUS (Solaris, FreeBSD w/ mount -o rdirplus). PR: 63446	2004-09-05 11:18:53 +00:00
Robert Watson	10b7196db4	Back out pseudo_vnops.c:1.45, which was a workaround for pfind() returning incompletely initialized processes. This problem was eliminated by kern_proc.c:1.215, which causes pfind() not to return processes in the PRS_NEW state.	2004-09-02 16:04:09 +00:00
Brooks Davis	b443062227	General modernization of coda: - Ditch NVCODA - Don't use a static major - Don't declare functions extern Reviewed by: peter	2004-09-01 01:19:52 +00:00
Peter Wemm	f37a929ca1	Kill count device support from config. I've changed the last few remaining consumers to have the count passed as an option. This is i4b, pc98/wdc, and coda. Bump configvers.h from 500013 to 600000. Remove heuristics that tried to parse "device ed5" as 5 units of the ed device. This broke things like the snd_emu10k1 device, which required quotes to make it parse right. The no-longer-needed quotes have been removed from NOTES, GENERIC etc. eg, I've removed the quotes from: device snd_maestro device "snd_maestro3" device snd_mss I believe everything will still compile and work after this.	2004-08-30 23:03:58 +00:00
Tim J. Robbins	db575a8507	Remove bogus vrele() call added in previous.	2004-08-27 11:24:31 +00:00

... 6 7 8 9 10 ...

2203 Commits