freebsd-skq

Author	SHA1	Message	Date
rwatson	be4f357149	Normalize a significant number of kernel malloc type names: - Prefer '_' to ' ', as it results in more easily parsed results in memory monitoring tools such as vmstat. - Remove punctuation that is incompatible with using memory type names as file names, such as '/' characters. - Disambiguate some collisions by adding subsystem prefixes to some memory types. - Generally prefer lower case to upper case. - If the same type is defined in multiple architecture directories, attempt to use the same name in additional cases. Not all instances were caught in this change, so more work is required to finish this conversion. Similar changes are required for UMA zone names.	2005-10-31 15:41:29 +00:00
phk	3ed5c9efd0	Use correct cirteria for determining which directory entries we can purge right away and which we merely can hide. Beaten into my skull by: kris	2005-10-18 20:21:25 +00:00
des	4426988f2c	Implement the full range of ISO9660 number conversion routines in iso.h. MFC after: 2 weeks	2005-10-18 13:35:08 +00:00
rodrigc	cb57dd08dd	Unconditionally mount a CD9660 filesystem as read-only, instead of returning EROFS if we forget to mount it as read-only.	2005-10-17 03:29:53 +00:00
rodrigc	49d8776e03	Use the actual sector size of the media instead of hard-coding it to 2048. This eliminates KASSERTs in GEOM if we accidentally mount an audio CD as a cd9660 filesystem.	2005-10-17 03:27:35 +00:00
rodrigc	adce9d7a14	Unconditionally mount a UDF filesystem as read-only, instead of returning an EROFS if we forget to mount it as read-only.	2005-10-17 03:07:36 +00:00
flz	a0cf3f9b58	- Fix typo. Approved by: ssouhlal MFC after: 1 week	2005-10-17 00:04:35 +00:00
truckman	321926b9ba	Update nwfs_lookup() to match the current cache_lookup() API. cache_lookup() has returned a ref'ed and locked vnode since vfs_cache.c:1.96, dated Tue Mar 29 12:59:06 2005 UTC. This change is similar to the change made to smbfs_lookup() in smbfs_vnops.c:1.58. Tested by: "Antony Mawer" ant AT mawer.org MFC after: 2 weeks	2005-10-16 21:54:35 +00:00
kris	fa8ac58228	Reflect mpsafety of the underlying filesystem in the nullfs image. I benchmarked this by simultaneously extracting 4 large tarballs (basically world images) on a 4-processor AMD64 system, in a malloc-backed md. With this patch, system time was reduced by 43%, and wall clock time by 33%. Submitted by: jeff MFC after: 1 week	2005-10-16 21:45:25 +00:00
truckman	80700e8efc	Apply the same fix to a potential race in the ISDOTDOT code in cd9660_lookup() that was used to fix an actual race in ufs_lookup.c:1.78. This is not currently a hazard, but the bug would be activated by marking cd9660 as MPSAFE. Requested by: bde	2005-10-16 21:41:54 +00:00
yar	924e74a759	In preparation for making the modules actually use opt_*.h files provided in the kernel build directory, fix modules that were failing to build this way due to not quite correct kernel option usage. In particular: ng_mppc.c uses two complementary options, both of which are listed in sys/conf/files. Ideally, there should be a separate option for including ng_mppc.c in kernel build, but now only NETGRAPH_MPPC_ENCRYPTION is usable anyway, the other one requires proprietary files. nwfs and smbfs were trying to ensure they were built with proper network components, but the check was rather questionable. Discussed with: ru	2005-10-14 23:17:45 +00:00
davidxu	3fbdb3c215	1. Change prototype of trapsignal and sendsig to use ksiginfo_t *, most changes in MD code are trivial, before this change, trapsignal and sendsig use discrete parameters, now they uses member fields of ksiginfo_t structure. For sendsig, this change allows us to pass POSIX realtime signal value to user code. 2. Remove cpu_thread_siginfo, it is no longer needed because we now always generate ksiginfo_t data and feed it to libpthread. 3. Add p_sigqueue to proc structure to hold shared signals which were blocked by all threads in the proc. 4. Add td_sigqueue to thread structure to hold all signals delivered to thread. 5. i386 and amd64 now return POSIX standard si_code, other arches will be fixed. 6. In this sigqueue implementation, pending signal set is kept as before, an extra siginfo list holds additional siginfo_t data for signals. kernel code uses psignal() still behavior as before, it won't be failed even under memory pressure, only exception is when deleting a signal, we should call sigqueue_delete to remove signal from sigqueue but not SIGDELSET. Current there is no kernel code will deliver a signal with additional data, so kernel should be as stable as before, a ksiginfo can carry more information, for example, allow signal to be delivered but throw away siginfo data if memory is not enough. SIGKILL and SIGSTOP have fast path in sigqueue_add, because they can not be caught or masked. The sigqueue() syscall allows user code to queue a signal to target process, if resource is unavailable, EAGAIN will be returned as specification said. Just before thread exits, signal queue memory will be freed by sigqueue_flush. Current, all signals are allowed to be queued, not only realtime signals. Earlier patch reviewed by: jhb, deischen Tested on: i386, amd64	2005-10-14 12:43:47 +00:00
rodrigc	5eb4cdb703	- Do not hardcode the bsize to a sectorsize of 2048, even though the UDF specification specifies a logical sectorsize of 2048. Instead, get it from GEOM. - When reading the UDF Anchor Volume Descriptor, use the logical sectorsize of 2048 when calculating the offset to read from, but use the actual sectorsize to determine how much to read. - works with reading a DVD disk and a DVD disk image file via mdconfig - correctly returns EINVAL if we try to mount_udf an audio CD, instead of panicking inside GEOM when INVARIANTS is set	2005-10-09 04:45:33 +00:00
pjd	5c01c35a0c	We don't need 'imp' here.	2005-10-07 10:30:47 +00:00
rwatson	5758dab896	Second attempt at a work-around for fifo-related socket panics during make -j with high levels of parallelism: acquire Giant in fifo I/O routines. Discussed with: ups MFC after: 3 days	2005-10-01 20:15:41 +00:00
phk	b23c35710b	The NWFS code in RELENG_6 is broken due to a typo in sys/fs/nwfs/nwfs_vfsop= s.c, introduced with the conversion to nmount with revision 1.38. This causes mount_nwfs to fail with the error message: mount_nwfs: mount error: /mnt/netware: syserr = No such file or directo= ry This is caused by a typo on line 178, which specifies "nwfw_args" rather than "nwfs_args". Submitted by: Antony Mawer <gnats@mawer.org> Fat fingers: phk PR: 86757 MFC: 3 days	2005-09-30 18:21:05 +00:00
peadar	e0565b5794	Remove checks for BOOTSIG[23] from FAT32 bootblocks. There seems to be very little documentary evidence outside this implementation to suggest a these checks are neccessary, and more than one camera-formatted flash disk fails the check, but mounts successfully on most other systems. Reviewed By: bde@	2005-09-29 14:09:46 +00:00
rwatson	332a994af0	Back out fifo_vnops.c:1.127, which introduced an sx lock around I/O on a fifo. While this did indeed close the race, confirming suspicions about the nature of the problem, it causes difficulties with blocking I/O on fifos. Discussed with: ups Also spotted by: Peter Holm <peter at holm dot cc>	2005-09-27 16:45:22 +00:00
rwatson	87ac2d2498	Assert v_fifoinfo is non-NULL in fifo_close() in order to catch non-conforming cases sooner. MFC after: 3 days Reported by: Peter Holm <peter at holm dot cc>	2005-09-26 08:17:03 +00:00
rwatson	c9044f078d	Lock the read socket receive buffer when frobbing the sb_state flag on that socket during open, not the write socket receive buffer. This might explain clearing of the sb_state SB_LOCK flag seen occasionally in soreceive() on fifos. MFC after: 3 days Spotted by: ups	2005-09-25 19:52:09 +00:00
phk	a951e327e6	Make rule zero really magical, that way we don't have to do anything when we mount and get zero cost if no rules are used in a mountpoint. Add code to deref rules on unmount. Switch from SLIST to TAILQ. Drop SYSINIT, use SX_SYSINIT and static initializer of TAILQ instead. Drop goto, a break will do. Reduce double pointers to single pointers. Combine reaping and destroying rulesets. Avoid memory leaks in a some error cases.	2005-09-24 07:03:09 +00:00
rwatson	27cf2ffed6	For reasons of consistency (and necessity), assert an exclusive vnode lock on the fifo vnode in fifo_open(): we rely on the vnode lock to serialize access to v_fifoinfo. MFC after: 3 days	2005-09-23 12:39:51 +00:00
rwatson	9c2c1cb9fb	Add fi_sx, an sx lock to serialize I/O operations on the socket pair underlying the POSIX fifo implementation. In 6.x/7.x, fifo access is moved from the VFS layer, where it was serialized using the vnode lock, to the file descriptor layer, where access is protected by a reference count but not serialized. This exposed socket buffer locking to high levels of parallelism in specific fifo workloads, such as make -j 32, which expose as yet unresolved socket buffer bugs. fi_sx re-adds serialization about the read and write routines, although not paths that simply test socket buffer mbuf queue state, such as the poll and kqueue methods. This restores the extra locking cost previously present in some cases, but is an effective workaround for the instability that has been experienced. This workaround should be removed once the bug in socket buffer handling has been fixed. Reported by: kris, jhb, Julien Gabel <jpeg at thilelli dot net>, Peter Holm <peter at holm dot cc>, others MFC after: 3 days	2005-09-22 10:51:12 +00:00
phk	6a408cbd71	Rewamp DEVFS internals pretty severely [1]. Give DEVFS a proper inode called struct cdev_priv. It is important to keep in mind that this "inode" is shared between all DEVFS mountpoints, therefore it is protected by the global device mutex. Link the cdev_priv's into a list, protected by the global device mutex. Keep track of each cdev_priv's state with a flag bit and of references from mountpoints with a dedicated usecount. Reap the benefits of much improved kernel memory allocator and the generally better defined device driver APIs to get rid of the tables of pointers + serial numbers, their overflow tables, the atomics to muck about in them and all the trouble that resulted in. This makes RAM the only limit on how many devices we can have. The cdev_priv is actually a super struct containing the normal cdev as the "public" part, and therefore allocation and freeing has moved to devfs_devs.c from kern_conf.c. The overall responsibility is (to be) split such that kern/kern_conf.c is the stuff that deals with drivers and struct cdev and fs/devfs handles filesystems and struct cdev_priv and their private liason exposed only in devfs_int.h. Move the inode number from cdev to cdev_priv and allocate inode numbers properly with unr. Local dirents in the mountpoints (directories, symlinks) allocate inodes from the same pool to guarantee against overlaps. Various other fields are going to migrate from cdev to cdev_priv in the future in order to hide them. A few fields may migrate from devfs_dirent to cdev_priv as well. Protect the DEVFS mountpoint with an sx lock instead of lockmgr, this lock also protects the directory tree of the mountpoint. Give each mountpoint a unique integer index, allocated with unr. Use it into an array of devfs_dirent pointers in each cdev_priv. Initially the array points to a single element also inside cdev_priv, but as more devfs instances are mounted, the array is extended with malloc(9) as necessary when the filesystem populates its directory tree. Retire the cdev alias lists, the cdev_priv now know about all the relevant devfs_dirents (and their vnodes) and devfs_revoke() will pick them up from there. We still spelunk into other mountpoints and fondle their data without 100% good locking. It may make better sense to vector the revoke event into the tty code and there do a destroy_dev/make_dev on the tty's devices, but that's for further study. Lots of shuffling of stuff and churn of bits for no good reason[2]. XXX: There is still nothing preventing the dev_clone EVENTHANDLER from being invoked at the same time in two devfs mountpoints. It is not obvious what the best course of action is here. XXX: comment out an if statement that lost its body, until I can find out what should go there so it doesn't do damage in the meantime. XXX: Leave in a few extra malloc types and KASSERTS to help track down any remaining issues. Much testing provided by: Kris Much confusion caused by (races in): md(4) [1] You are not supposed to understand anything past this point. [2] This line should simplify life for the peanut gallery.	2005-09-19 19:56:48 +00:00
rwatson	2d8b6f2e27	Assert that (vp) is locked in fifo_close(), since we rely on the exclusive vnode lock to synchronize the reference counts on struct fifoinfo. MFC after: 3 days	2005-09-18 10:44:50 +00:00
phk	2d4ad1cc44	Don't attempt to recurse lockmgr, it doesn't like it.	2005-09-15 21:16:43 +00:00
kan	b4bff5a977	Handle a race condition where NULLFS vnode can be cleaned while threads can still be asleep waiting for lowervp lock. Tested by: kkenn Discussed with: ssouhlal, jeffr	2005-09-15 19:21:26 +00:00
rwatson	7584b6b2c3	The socket pointers in fifoinfo are not permitted to be NULL, so don't check if they are, it just confuses the fifo code more. MFC after: 3 days	2005-09-15 15:45:34 +00:00
phk	3c4b94c1fe	Various minor polishing.	2005-09-15 10:28:19 +00:00
phk	eafa84f647	Protect the devfs rule internal global lists with a sx lock, the per mount locks are not enough. Finer granularity (x)locking could be implemented, but I prefer to keep it simple for now.	2005-09-15 08:50:16 +00:00
phk	1a7de63bbb	Absolve devfs_rule.c from locking responsibility and call it with all necessary locking held.	2005-09-15 08:36:37 +00:00
phk	a543c5b761	Close a race which could result in unwarranted "ruleset %d already running" panics. Previously, recursion through the "include" feature was prevented by marking each ruleset as "running" when applied. This doesn't work for the case where two DEVFS instances try to apply the same ruleset at the same time. Instead introduce the sysctl vfs.devfs.rule_depth (default == 1) which limits how many levels of "include" we will traverse. Be aware that traversal of "include" is recursive and kernel stack size is limited. MFC: after 3 days	2005-09-15 06:57:28 +00:00
rwatson	0e4f08263a	Trim down now (believed to be) unused fifo_ioctl() and fifo_kqfilter() VOP implementations, since they in theory are used only on open file descriptors, in which case the ioctls are via fifo_ioctl_f() and kqueue requests are via fifo_kqfilter_f(). Generate warnings if they are entered for now. These printf() calls should become panic() calls. Annotate and re-implement fifo_ioctl_f(): don't arbitrarily forward ioctls to the socket layer, only forward the ones we explicitly support for fifos. In the case of FIONREAD, don't forward the request to the write socket on a read-write fifo, or the read result is overwritten. Annotate a nasty case for the undefined POSIX O_RDWR on fifos, in which failure of the second ioctl will result in the socket pair being in an inconsistent state. Assert copyright as I find myself rewriting non-trivial parts of fifofs. MFC after: 3 days	2005-09-13 17:46:48 +00:00
rwatson	afc7b6e916	As a result of kqueue locking work, socket buffer locks will always be held when entering a kqueue filter for fifos via a socket buffer event: as such, assert the lock unconditionally rather than acquiring it conditionall. MFC after: 3 days	2005-09-13 10:39:24 +00:00
rwatson	2bda369cf8	Annotate two issues: 1) fifo_kqfilter() is not actually ever used, it likely should be GC'd. 2) fifo_kqfilter_f() doesn't implement EVFILT_VNODE, so detecting events on the underlying vnode for a fifo no longer works (it did in 4.x). Likely, fifo_kqfilter_f() should forward the request to the VFS using fp->f_vnode, which would work once fifo_kqfilter() was detached from the vnode operation vector (removing the fifo override). Discussed with: phk	2005-09-13 09:23:22 +00:00
rwatson	bc5e7eb1f3	Introduce no-op nosup fifo kqueue filter and detach routine, which are used when a read filter is requested on a write-only fifo descriptor, or a write filter is requested on a read-only fifo descriptor. This permits the filters to be registered, but never raises the event, which causes kqueue behavior for fifos to more closely match similar semantics for poll and select, which permit testing for the condition even though the condition will never be raised, and is consistent with POSIX's notion that a fifo has identical semantics to a one-way IPC channel created using pipe() on most operating systems. The fifo regression test suite can now run to completion on HEAD without errors. MFC after: 3 days	2005-09-12 19:59:12 +00:00
rwatson	491de3e2d2	When a request is made to register a filter on a fifo that doesn't apply to the fifo (i.e., not EVFILT_READ or EVFILT_WRITE), reject it as EINVAL, not by returning 1 (EPERM). MFC after: 3 days	2005-09-12 18:07:49 +00:00
rwatson	6b308e01a1	Remove DFLAG_SEEKABLE from fifo file descriptors: fifos are not seekable according to POSIX, not to mention the fact that it doesn't make sense (and hence isn't really implemented). This causes the fifo_misc regression test to succeed.	2005-09-12 12:15:12 +00:00
rwatson	1481446aae	Only poll the fifo for read events if the fifo is attached to a readable file descriptor. Otherwise, the read end of a fifo might return that it is writable (which it isn't). Only poll the fifo for write events if the fifo attached to a writable file descriptor. Otherwise, the write end of a fifo might return that it is readable (which it isn't). In the event that a file is FREAD\|FWRITE (which is allowed by POSIX, but has undefined behavior), we poll for both. MFC after: 3 days	2005-09-12 10:16:18 +00:00
rwatson	919d519cbb	After going to some trouble to identify only the write-related events to poll the write socket for, the fifo polling code proceeded to poll for the complete set of events. Use 'levents' instead of 'events' as the argument to poll, and only poll the write socket if there is interest in write events. MFC after: 3 days	2005-09-12 10:13:15 +00:00
rwatson	5be69d1d56	When a writer opens a fifo, wake up the read socket for read, not the write socket. MFC after: 3 days	2005-09-12 10:07:21 +00:00
rwatson	c9f007b159	Add an assertion that fifo_open() doesn't race against other threads while sleeping to allocate fifo state: due to using the vnode lock to serialize access to a fifo during open, it shouldn't happen (tm). MFC after: 3 days	2005-09-12 10:06:38 +00:00
rwatson	a079789c36	Rather than reaching into the internals of the UNIX domain socket code by calling uipc_connect2() to connect two socket endpoints to create a fifo, call soconnect2(). MFC after: 3 days	2005-09-12 10:05:08 +00:00
phk	aca041ee53	Clean up prototypes.	2005-09-12 08:03:15 +00:00
rodrigc	97c59ad2c7	Cast bf_sysid to const char * when passing it to strncmp(), because strncmp does not take an unsigned char *. Eliminates warning with GCC 4.0.	2005-09-11 16:02:14 +00:00
rodrigc	aeeba2bf5b	Do not declare M_NTFSMNT with extern linkage here, since it is defined with static linkage in ntfs_vfsops.c. Fixes compilation with GCC 4.0.	2005-09-11 15:57:07 +00:00
obrien	4b003b6283	Ensure the full value is written into inode variables. PR: 85503 Submitted by: Dmitry Pryanishnikov <dmitry@atlantis.dp.ua>	2005-09-07 10:32:58 +00:00
ssouhlal	5ea64800e1	Unbreak hpfs/ntfs/udf/ext2fs/reiserfs mounting. Another pointyhat to: ssouhlal	2005-09-03 20:23:41 +00:00
ssouhlal	ad48f84298	Unbreak the build. Pointyhat to: ssouhlal	2005-09-03 00:40:19 +00:00
ssouhlal	45954b5047	Use vput() instead of vrele() in null_reclaim() since the lower vnode is locked. MFC after: 3 days	2005-09-02 15:49:55 +00:00
ssouhlal	f8217f350b	*_mountfs() (if the filesystem mounts from a device) needs devvp to be locked, so lock it. Glanced at by: phk MFC after: 3 days	2005-09-02 15:27:23 +00:00
phk	a469be1ef3	Add a missing dev_relthread() call. Remove unused variable. Spotted by: Hans Petter Selasky <hselasky@c2i.net>	2005-08-29 11:14:18 +00:00
phk	fcf6768753	Handle device drivers with D_NEEDGIANT in a way which does not penalize the 'good' drivers: Allocate a shadow cdevsw and populate it with wrapper functions which grab Giant	2005-08-17 08:19:52 +00:00
phk	8a3fe94804	Collect the devfs related sysctls in one place	2005-08-16 19:25:02 +00:00
phk	e89ebd4119	Create a new internal .h file to communicate very private stuff from kern_conf.c to devfs. For now just two prototypes, more to come.	2005-08-16 19:08:01 +00:00
phk	4edc625526	Eliminate effectively unused dm_basedir field from devfs_mount.	2005-08-15 19:40:53 +00:00
grehan	ba88fd3c57	- restore the ability to mount cd9660 filesystems as root by inverting some of the options test, specifically the joliet and rockridge tests. Since the root mount callchain doesn't go through cd9660_cmount, the default mount options aren't set. Rather than having the main codepath assume the options are there, test for the absence of the inverted optioin e.g. instead of vfs_flagopt(.. "joliet" ..), test for !vfs_flagopt(.. "nojoliet" ..) This works for root mount, non-root mount and future nmount cases. - in cd9660_cmount, remove inadvertent setting of "gens" when "extatt" was set. Reported by: grehan, Dario Freni <saturnero at freesbie org> Tested by: Dario Freni Not objected to by: phk MFC after: 3 days	2005-08-14 04:19:36 +00:00
des	5610061cd3	Eliminate an unnecessary bcopy().	2005-08-12 12:22:05 +00:00
obrien	d5d343a0fd	Remove public declarations of variables that were forgotten when they were made static.	2005-08-10 07:10:02 +00:00
obrien	10886230c5	Remove the need to forward declare statics by moving them around.	2005-08-10 07:08:14 +00:00
rwatson	daa1c89f45	Merge the dev_clone and dev_clone_cred event handlers into a single event handler, dev_clone, which accepts a credential argument. Implementors of the event can ignore it if they're not interested, and most do. This avoids having multiple event handler types and fall-back/precedence logic in devfs. This changes the kernel API for /dev cloning, and may affect third party packages containg cloning kernel modules. Requested by: phk MFC after: 3 days	2005-08-08 19:55:32 +00:00
kris	509594693e	devfs is not yet fully MPSAFE - for example, multiple concurrent devfs(8) processes can cause a panic when operating on rulesets. Approved by: phk	2005-07-29 23:00:56 +00:00
simon	dd09386bed	Correct devfs ruleset bypass. Submitted by: csjp Reviewed by: phk Security: FreeBSD-SA-05:17.devfs Approved by: cperciva	2005-07-20 13:34:16 +00:00
imura	a519d1b3a0	[1] unix2doschr() If a character cannot be converted to DOS code page, unix2doschr() returned `0'. As a result, unix2dosfn() was forced to return `0', so we saw a file which was composed of these characters as `Invalid argument'. To correct this, if a character can be converted to Unicode, unix2doschr() now returns `1' which is a magic number to make unix2dosfn() know that the character must be converted to `_'. [2] unix2dosfn() The above-mentioned solution only works if a file has both of Unicode name and DOS code page name. Unicode name would not be recorded if file name can be settled within 11 bytes (DOS short name) and if no conversion from Unix charset to DOS code page has occurred. Thus, FreeBSD can create a file which has only short name, but there is no guarantee that the short name contains allways valid characters because we leave it to people by using mount_msdosfs(8) to select which conversion is used between DOS code page and unix charset. To avoid this, Unicode file name should be recorded unless a character is an ascii character. This is the way Windows XP do. PR: 77074 [1] MFC after: 1 week	2005-07-17 07:10:05 +00:00
rwatson	79690d711b	When devfs cloning takes place, provide access to the credential of the process that caused the clone event to take place for the device driver creating the device. This allows cloned device drivers to adapt the device node based on security aspects of the process, such as the uid, gid, and MAC label. - Add a cred reference to struct cdev, so that when a device node is instantiated as a vnode, the cloning credential can be exposed to MAC. - Add make_dev_cred(), a version of make_dev() that additionally accepts the credential to stick in the struct cdev. Implement it and make_dev() in terms of a back-end make_dev_credv(). - Add a new event handler, dev_clone_cred, which can be registered to receive the credential instead of dev_clone, if desired. - Modify the MAC entry point mac_create_devfs_device() to accept an optional credential pointer (may be NULL), so that MAC policies can inspect and act on the label or other elements of the credential when initializing the skeleton device protections. - Modify tty_pty.c to register clone_dev_cred and invoke make_dev_cred(), so that the pty clone credential is exposed to the MAC Framework. While currently primarily focussed on MAC policies, this change is also a prerequisite for changes to allow ptys to be instantiated with the UID of the process looking up the pty. This requires further changes to the pty driver -- in particular, to immediately recycle pty nodes on last close so that the credential-related state can be recreated on next lookup. Submitted by: Andrew Reisse <andrew.reisse@sparta.com> Obtained from: TrustedBSD Project Sponsored by: SPAWAR, SPARTA MFC after: 1 week MFC note: Merge to 6.x, but not 5.x for ABI reasons	2005-07-14 10:22:09 +00:00
tanimura	f03d3eb58c	Regrab dvp only when ISDOTDOT. Approved by: re (scottl)	2005-07-09 13:52:49 +00:00
jeff	0d69457df8	- Since we don't hold a usecount in pfs_exit we have to get a holdcnt prior to calling vgone() to prevent any races. Sponsored by: Isilon Systems, Inc. Approved by: re (vfs blanket)	2005-07-07 07:33:10 +00:00
peter	921b3c5ee4	Jumbo-commit to enhance 32 bit application support on 64 bit kernels. This is good enough to be able to run a RELENG_4 gdb binary against a RELENG_4 application, along with various other tools (eg: 4.x gcore). We use this at work. ia32_reg.[ch]: handle the 32 bit register file format, used by ptrace, procfs and core dumps. procfs_regs.c: vary the format of proc/XXX/regs depending on the client and target application. procfs_map.c: Don't print a 64 bit value to 32 bit consumers, or their sscanf fails. They expect an unsigned long. imgact_elf.c: produce a valid 32 bit coredump for 32 bit apps. sys_process.c: handle 32 bit consumers debugging 32 bit targets. Note that 64 bit consumers can still debug 32 bit targets. IA64 has got stubs for ia32_reg.c. Known limitations: a 5.x/6.x gdb uses get/setcontext(), which isn't implemented in the 32/64 wrapper yet. We also make a tiny patch to gdb pacify it over conflicting formats of ld-elf.so.1. Approved by: re	2005-06-30 07:49:22 +00:00
peter	2778435f72	Conditionally weaken sys_generic.c rev 1.136 to allow certain dubious ioctl numbers in backwards compatability mode. eg: an IOC_IN ioctl with a size of zero. Traditionally this was what you did before IOC_VOID existed, and we had some established users of this in the tree, namely procfs. Certain 3rd party drivers with binary userland components also have this too. This is necessary to have 4.x and 5.x binaries use these ioctl's. We found this at work when trying to run 4.x binaries. Approved by: re	2005-06-30 00:19:08 +00:00
imura	5809391d17	Avoid casting from (int ) to (size_t ) in order to fix udf_iconv on amd64. Reviewed by: scottl MFC after: 2 weeks	2005-06-05 02:09:48 +00:00
rodrigc	dba46a3ce7	Do not declare a struct as extern, and then implement it as static in the same file. This is not legal C, and GCC 4.0 will issue an error. Reviewed by: phk Approved by: das (mentor)	2005-05-31 14:50:49 +00:00
brueffer	7bc989441b	Fix three typos in comments. Two of them obtained from OpenBSD. MFC after: 3 days	2005-05-11 21:10:35 +00:00
kan	3f8ab6c93f	Do not dereference dvp pointer before doing a NULL check. Noticed by: Coverity Prevent analysis tool.	2005-05-11 19:08:38 +00:00
anholt	b6be180393	Staticize a symbol used only in this file. PR: kern/43613 Submitted by: Matt Emmerton, matt at gsicomp dot on dot ca	2005-05-06 20:47:09 +00:00
robert	38711d8dea	The printf(9) `%p' conversion specifier puts an "0x" in front of the pointer value. Therefore, remove the "0x" from the format string.	2005-05-06 00:15:57 +00:00
robert	46abb4815c	Fix our NTFS readdir function. To check a directory's in-use bitmap bit by bit, we use a pointer to an 8 bit wide unsigned value. The index used to dereference this pointer is calculated by shifting the bit index right 3 bits. Then we do a logical AND with the bit# represented by the lower 3 bits of the bit index. This is an idiomatic way of iterating through a bit map with simple bitwise operations. This commit fixes the bug that we only checked bits 3:0 of each 8 bit chunk, because we only used bits 1:0 of the bit index for the bit# in the current 8 bit value. This resulted in files not being returned by getdirentries(2). Change the type of the bit map pointer from `char ' to `u_int8_t '.	2005-05-06 00:06:06 +00:00
takawata	68a4e0f83e	Fix breakage on alpha. Pointed out by: hrs via IRC	2005-05-05 07:02:51 +00:00
takawata	a61ec3d816	Make smbfs capable to use 16bit char set in filenames. PR:78110	2005-05-04 15:05:46 +00:00
jeff	2b167167e2	- Set the v_object pointer after a successful VOP_OPEN(). This isn't a perfect solution as the lower vm object can change at unpredictable times if our lower vp happens to be on another unionfs, etc. Submitted by: Oleg Sharoiko <os@rsu.ru>	2005-05-03 11:05:33 +00:00
jeff	6c4a330d28	- In devfs_open() and devfs_close() grab Giant if the driver sets NEEDGIANT. We still have to DROP_GIANT and PICKUP_GIANT when NEEDGIANT is not set because vfs is still sometime entered with Giant held.	2005-05-01 00:56:34 +00:00
des	fe9d4ac270	Fix an old pasto.	2005-04-30 16:27:20 +00:00
jeff	53caed435d	- Mark devfs as MNTK_MPSAFE as I belive it does not require Giant. Sponsored by: Isilon Systems, Inc. Agreed in principle by: phk	2005-04-30 11:24:17 +00:00
jeff	5ae67dae9a	- Fix several locking problems in unionfs_mount so that it will come closer to passing DEBUG_VFS_LOCKS.	2005-04-27 09:07:13 +00:00
jeff	b6552bddeb	- Pass the ISOPEN flag down to our lower filesystems. - Remove an erroneous VOP lock assert.	2005-04-27 09:06:06 +00:00
jeff	cb40cf9c09	- As this is presently the one and only place where duplicate acquires of the vnode interlock are allowed mark it by passing MTX_DUPOK to this lock operation only. Sponsored by: Isilon Systems, Inc.	2005-04-22 22:42:44 +00:00
das	839fea181d	Disable negative name caching for msdosfs to work around a bug. Since the name cache is case-sensitive and msdosfs isn't, creating a file 'foo' won't invalidate a negative entry for 'FOO'. There are similar problems related to 8.3 filenames. A better solution is to override VOP_LOOKUP with a method that canonicalizes the name, then calls vfs_cache_lookup(). Unfortunately, it's not quite that simple because vfs_cache_lookup() will call msdosfs_lookup() on a cache miss, and msdosfs_lookup() needs a way to get at the original component name.	2005-04-16 23:47:19 +00:00
njl	9de8e0daf9	Fix mbnambuf support for multi-byte characters. If a substring is larger than WIN_CHARS bytes, we shift the suffix (previous substrings) upwards by the amount this substring exceeds its WIN_CHARS slot. Profiling shows this change is indistinguishable from the previous code at 95% confidence. This bug would result in attempts to access or create files or directories with multi-byte characters returning an error but no data loss. Reported and tested by: avatar MFC after: 3 days	2005-04-16 01:49:50 +00:00
brueffer	bee55215dc	Correct typo. Obtained from: OpenBSD	2005-04-14 14:40:09 +00:00
jeff	afab3762a0	- Change all filesystems and vfs_cache to relock the dvp once the child is locked in the ISDOTDOT case. Se vfs_lookup.c r1.79 for details. Sponsored by: Isilon Systems, Inc.	2005-04-13 10:59:09 +00:00
jeff	69e9f89f88	- Clear VI_OWEINACT before calling vget() with no lock type. We know the node is actually already locked, and VOP_INACTIVE is not desirable in this case.	2005-04-11 11:17:20 +00:00
jeff	9375f1d524	- Honor the flags argument passed to null_root(). The filesystem below us will decide whether or not to grab a real shared lock.	2005-04-11 11:16:29 +00:00
delphij	4c97b619e5	Initialize vp before using it. Failing to do this can cause instant panic when trying to access a file on mounted smbfs. Submitted by: takawata at jp freebsd org	2005-04-10 03:17:42 +00:00
phk	bae9f6cfa0	Give msdosfs a unique inode number which is really the byteoffset of the directory entry. This solves the corruption problem I belive. Regression test script by: silby	2005-04-07 07:55:37 +00:00
jeff	3ae5dd8f5a	- Fix union's assumptions about when the dvp is unlocked. It is only unlocked in the ISDOTDOT case now, not for all !ISLASTCN lookups.	2005-04-04 09:36:26 +00:00
phk	7af1e31761	Explicitly hold a reference to the cdev we have just cloned. This closes the race where the cdev was reclaimed before it ever made it back to devfs lookup.	2005-03-31 12:19:44 +00:00
phk	2379f61770	cdev (still) needs per instance uid/gid/mode Add unlocked version of dev_ref() Clean up various stuff in sys/conf.h	2005-03-31 10:29:57 +00:00
phk	b83adaf8e5	Rename dev_ref() to dev_refl()	2005-03-31 06:51:54 +00:00
jeff	902bc24bce	- LK_NOPAUSE is a nop now. Sponsored by: Isilon Systems, Inc.	2005-03-31 04:27:49 +00:00
jeff	ca1e4c2fe0	- Remove wantparent, it is no longer necessary. An assert in vfs_lookup.c prevents any callers from doing a modifying op without LOCKPARENT or WANTPARENT.	2005-03-29 13:09:42 +00:00
jeff	7d8081dca4	- Remove wantparent, it is no longer necessary. An assert in vfs_lookup.c prevents any callers from doing a DELETE or RENAME without locking the parent.	2005-03-29 13:04:00 +00:00
jeff	141aba2c7b	- cache_lookup() now locks the new vnode for us to prevent some races. Remove redundant code. Sponsored by: Isilon Systems, Inc.	2005-03-29 13:00:37 +00:00
jeff	4084503aa0	- Correct the dprintf format int the _lookup routine. Spotted by: pjd	2005-03-28 14:26:01 +00:00
jeff	efb09df0e7	- Garbage collect an unused variable.	2005-03-28 13:45:09 +00:00
jeff	d673a48266	- Don't panic if we can't lock a child in lookup, return an error instead. - Only unlock the directory if this is a DOTDOT lookup. Previously this code could have deadlocked if there was a DOTDOT lookup with LOCKPARENT set and another thread was locking the other way up the tree. Sponsored by: Isilon Systems, Inc.	2005-03-28 13:39:16 +00:00
jeff	a84b0d4580	- Remove unnecessary LOCKPARENT manipulation. Sponsored by: Isilon Systems, Inc.	2005-03-28 13:29:15 +00:00
jeff	2e5ff94ef5	- nwfs_lookup() is no longer responsible for unlocking the dvp, this is handled in vfs_lookup.c. This code was missing PDIRUNLOCK use prior to the removal of PDIRUNLOCK in rev 1.73 of vfs_lookup.c. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:46:33 +00:00
jeff	527fc2c9cc	- hpfs_lookup() is no longer responsible for unlocking the dvp, this is handled in vfs_lookup.c. This code was missing PDIRUNLOCK use prior to the removal of PDIRUNLOCK in rev 1.73 of vfs_lookup.c. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:40:59 +00:00
jeff	b136fd4eee	- We no longer have to bother with PDIRUNLOCK, lookup() handles it for us. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:34:36 +00:00
jeff	0afa18e58f	- We no longer have to bother with PDIRUNLOCK, lookup() handles it for us. - In the ISDOTDOT case we have to unlock the dvp before locking the child, if this fails we must relock dvp before returning an error. This was missing before. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:31:57 +00:00
jeff	5f8bc80203	- We no longer have to bother with PDIRUNLOCK, lookup() handles it for us. - Network filesystems are written with a special idiom that checks the cache first, and may even unlock dvp before discovering that a network round-trip is required to resolve the name. I believe dvp is prevented from being recycled even in the forced unmount case by the shared lock on the mount point. If not, this code should grow checks for VI_DOOMED after it relocks dvp or it will access NULL v_data fields. Sponsored by: Isilon Systems, Inc.	2005-03-28 09:29:58 +00:00
jeff	430e7e9a03	- Pass LK_EXCLUSIVE as the lock type to vget in vfs_hash_insert().	2005-03-25 10:51:55 +00:00
jeff	56f1fc7189	- Update vfs_root implementations to match the new prototype. None of these filesystems will support shared locks until they are explicitly modified to do so. Careful review must be done to ensure that this is safe for each individual filesystem. Sponsored by: Isilon Systems, Inc.	2005-03-24 07:39:03 +00:00
jeff	226bf6ead4	- Update vfs_root implementations to match the new prototype. None of these filesystems will support shared locks until they are explicitly modified to do so. Careful review must be done to ensure that this is safe for each individual filesystem. Sponsored by: Isilon Systems, Inc.	2005-03-24 07:36:16 +00:00
phk	3b151f1bf5	Use subr_unit	2005-03-19 08:22:36 +00:00
phk	444989f1a6	Also remember to set the fsid here.	2005-03-17 15:15:29 +00:00
phk	96d39aba6d	Forgot to replace code to set fsid in vop_getattr.	2005-03-17 14:43:40 +00:00
phk	cfa6bb09ea	Prepare for the final onslaught on devices: Move uid/gid/mode from cdev to cdevsw. Add kind field to use for devd(8) later. Bump both D_VERSION and __FreeBSD_version	2005-03-17 12:07:00 +00:00
jeff	91796cd6d7	- Lock the clearing of v_data so it is safe to inspect it with the interlock. Sponsored by: Isilon Systems, Inc.	2005-03-17 12:00:05 +00:00
phk	98f1c9b062	Add two arguments to the vfs_hash() KPI so that filesystems which do not have unique hashes (NFS) can also use it.	2005-03-16 11:20:51 +00:00
phk	6552218237	Remove unused file	2005-03-16 11:10:38 +00:00
phk	5443e9818b	Remove inode fields previously used for private inode hash tables.	2005-03-16 08:09:52 +00:00
phk	eeb2c527c0	XXX: unnecessary pointer in inode.	2005-03-16 07:21:38 +00:00
phk	9189809602	Don't store the disk cdev in all inodes.	2005-03-16 07:17:39 +00:00
phk	909be0f0c2	Don't hold a reference to the disk vnode for each inode. Eliminate cdev and vnode pointer to the disk from the inodes, the mount holds everything we need.	2005-03-15 21:09:52 +00:00
phk	c3c76f8185	Eliminate cdev pointer in inodes, they're not used or needed. The cdev could have been pulled out of the mountpoint cheaper back when it was used anyway.	2005-03-15 20:57:25 +00:00
phk	54d4b170ba	Don't hold a reference on the disk vnode for each inode.	2005-03-15 20:50:58 +00:00
phk	d043926750	Improve the vfs_hash() API: vput() the unneeded vnode centrally to avoid replicating the vput in all the filesystems.	2005-03-15 20:00:03 +00:00
jeff	57fd917aad	- Assume that all lower filesystems now support proper locking. Assert that they set v->v_vnlock. This is true for all filesystems in the tree. - Remove all uses of LK_THISLAYER. If the lower layer is locked, the null layer is locked. We only use vget() to get a reference now. null essentially does no locking. This fixes LOOKUP_SHARED with nullfs. - Remove the special LK_DRAIN considerations, I do not believe this is needed now as LK_DRAIN doesn't destroy the lower vnode's lock, and it's hardly used anymore. - Add one well commented hack to prevent the lowervp from going away while we're in it's VOP_LOCK routine. This can only happen if we're forcibly unmounted while some callers are waiting in the lock. In this case the lowervp could be recycled after we drop our last ref in null_reclaim(). Prevent this with a vhold().	2005-03-15 13:49:33 +00:00
phk	651dd9f4d4	Disable two users of findcdev. They do the wrong thing now and will need to be fixed. In both cases the API should be reengineered to do something (more) sensible.	2005-03-15 12:39:30 +00:00
jeff	b59222bfe5	- We have to transfer lockers after reseting our vnlock pointer. Sponsored by: Isilon Systems, Inc.	2005-03-15 11:28:45 +00:00
phk	3337fd988c	Don't export major,minor, instead export tty name.	2005-03-15 11:05:11 +00:00
phk	4799d2dacc	Print devtoname() instead of minor().	2005-03-15 10:01:31 +00:00
phk	8ea9004b75	Fix typo: pointers are not boolean in style(9).	2005-03-15 10:01:14 +00:00
phk	124bf5e823	Simplify the vfs_hash calling convention.	2005-03-15 08:07:07 +00:00
des	8bd55ce9cb	Hook pfs_lookup() up to vfs_cachedlookup_desc instead of vfs_lookup_desc, as suggested by Matt's comment. Also fix some style and paranoia issues. The entire function could benefit from review by a VFS guru. MFC after: 6 weeks	2005-03-14 16:24:50 +00:00
des	aa287f3e12	Fix two long-standing bugs in pfs_readdir(): Since we used an sbuf of size resid to accumulate dirents, we would end up returning one byte short when we had enough dirents to fill or exceed the size of the sbuf (the last byte being lost to bogus NUL termination) causing the next call to return EINVAL due to an unaligned offset. This went undetected for a long time because I did most of my testing in single-user mode, where there are rarely enough processes to fill the 4096-byte buffer ls(1) uses. The most common symptom of this bug is that tab completion of /proc or /compat/linux/proc does not work properly when many processes are running. Also, a check near the top would return EINVAL if resid was smaller than PFS_DELEN, even if it was 0, which is frequently the case and perfectly allowable. Change the test so that it returns 0 if resid is 0. MFC after: 2 weeks	2005-03-14 16:21:32 +00:00
des	892fcdb014	If PSEUDOFS_TRACE is defined, create a sysctl knob to enable / disable pseudofs call tracing.	2005-03-14 16:06:47 +00:00
des	cae8560557	fbsdidize.	2005-03-14 15:54:11 +00:00
phk	7112e51413	Use vfs_hash instead of home-rolled.	2005-03-14 14:41:37 +00:00
phk	d26364a9a9	Use vfs_hash instead of home-rolled.	2005-03-14 13:22:41 +00:00
phk	550b89f8bf	Use vfs_hash instead of home-rolled. Correct locking around g_vfs_close()	2005-03-14 12:29:39 +00:00
phk	5c46906245	Use vfs_hash instead of home-rolling.	2005-03-14 12:24:35 +00:00
jeff	9fe4365207	- VOP_INACTIVE should no longer drop the vnode lock. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:18:47 +00:00
jeff	0d9df2e12d	- The VI_DOOMED flag now signals the end of a vnode's relationship with the filesystem. Check that rather than VI_XLOCK. - VOP_INACTIVE should no longer drop the vnode lock. - The vnode lock is required around calls to vrecycle() and vgone(). Sponsored by: Isilon Systems, Inc.	2005-03-13 12:18:25 +00:00
jeff	5bd51ec6e6	- The VI_DOOMED flag now signals the end of a vnode's relationship with the filesystem. Check that rather than VI_XLOCK. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:14:56 +00:00
jeff	1c1edae55c	- The c_lock in the coda node does not offer any features over the standard vnode lock. Remove the c_lock and use the vn lock in its place. - Keep the coda lock functions so that the debugging information is preserved, but call directly to the vop_std*lock routines for the real functionality. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:09:34 +00:00
jeff	71f62892ab	- Deadfs may now use the standard vop lock, get rid of dead_lock(). - We no longer have to take the XLOCK state into consideration in any routines. Sponsored by: Isilon Systems, Inc.	2005-03-13 12:06:20 +00:00
obrien	6fe43669c4	Used unsigned version. Submitted by: jmallett	2005-03-12 06:06:04 +00:00
obrien	1f273a8d02	Fix kernel build on 64-bit machines.	2005-03-12 03:50:39 +00:00
njl	f5d07b2dc9	Correct a last-minute thinko. Instead of copying the nul with the string, nul-terminate the dp->d_name directly and only copy the string.	2005-03-11 23:35:23 +00:00

1 2 3 4 5 ...

1871 Commits