freebsd-skq

Author	SHA1	Message	Date
truckman	321926b9ba	Update nwfs_lookup() to match the current cache_lookup() API. cache_lookup() has returned a ref'ed and locked vnode since vfs_cache.c:1.96, dated Tue Mar 29 12:59:06 2005 UTC. This change is similar to the change made to smbfs_lookup() in smbfs_vnops.c:1.58. Tested by: "Antony Mawer" ant AT mawer.org MFC after: 2 weeks	2005-10-16 21:54:35 +00:00
kris	fa8ac58228	Reflect mpsafety of the underlying filesystem in the nullfs image. I benchmarked this by simultaneously extracting 4 large tarballs (basically world images) on a 4-processor AMD64 system, in a malloc-backed md. With this patch, system time was reduced by 43%, and wall clock time by 33%. Submitted by: jeff MFC after: 1 week	2005-10-16 21:45:25 +00:00
truckman	80700e8efc	Apply the same fix to a potential race in the ISDOTDOT code in cd9660_lookup() that was used to fix an actual race in ufs_lookup.c:1.78. This is not currently a hazard, but the bug would be activated by marking cd9660 as MPSAFE. Requested by: bde	2005-10-16 21:41:54 +00:00
yar	924e74a759	In preparation for making the modules actually use opt_*.h files provided in the kernel build directory, fix modules that were failing to build this way due to not quite correct kernel option usage. In particular: ng_mppc.c uses two complementary options, both of which are listed in sys/conf/files. Ideally, there should be a separate option for including ng_mppc.c in kernel build, but now only NETGRAPH_MPPC_ENCRYPTION is usable anyway, the other one requires proprietary files. nwfs and smbfs were trying to ensure they were built with proper network components, but the check was rather questionable. Discussed with: ru	2005-10-14 23:17:45 +00:00
davidxu	3fbdb3c215	1. Change prototype of trapsignal and sendsig to use ksiginfo_t *, most changes in MD code are trivial, before this change, trapsignal and sendsig use discrete parameters, now they uses member fields of ksiginfo_t structure. For sendsig, this change allows us to pass POSIX realtime signal value to user code. 2. Remove cpu_thread_siginfo, it is no longer needed because we now always generate ksiginfo_t data and feed it to libpthread. 3. Add p_sigqueue to proc structure to hold shared signals which were blocked by all threads in the proc. 4. Add td_sigqueue to thread structure to hold all signals delivered to thread. 5. i386 and amd64 now return POSIX standard si_code, other arches will be fixed. 6. In this sigqueue implementation, pending signal set is kept as before, an extra siginfo list holds additional siginfo_t data for signals. kernel code uses psignal() still behavior as before, it won't be failed even under memory pressure, only exception is when deleting a signal, we should call sigqueue_delete to remove signal from sigqueue but not SIGDELSET. Current there is no kernel code will deliver a signal with additional data, so kernel should be as stable as before, a ksiginfo can carry more information, for example, allow signal to be delivered but throw away siginfo data if memory is not enough. SIGKILL and SIGSTOP have fast path in sigqueue_add, because they can not be caught or masked. The sigqueue() syscall allows user code to queue a signal to target process, if resource is unavailable, EAGAIN will be returned as specification said. Just before thread exits, signal queue memory will be freed by sigqueue_flush. Current, all signals are allowed to be queued, not only realtime signals. Earlier patch reviewed by: jhb, deischen Tested on: i386, amd64	2005-10-14 12:43:47 +00:00
rodrigc	5eb4cdb703	- Do not hardcode the bsize to a sectorsize of 2048, even though the UDF specification specifies a logical sectorsize of 2048. Instead, get it from GEOM. - When reading the UDF Anchor Volume Descriptor, use the logical sectorsize of 2048 when calculating the offset to read from, but use the actual sectorsize to determine how much to read. - works with reading a DVD disk and a DVD disk image file via mdconfig - correctly returns EINVAL if we try to mount_udf an audio CD, instead of panicking inside GEOM when INVARIANTS is set	2005-10-09 04:45:33 +00:00
pjd	5c01c35a0c	We don't need 'imp' here.	2005-10-07 10:30:47 +00:00
rwatson	5758dab896	Second attempt at a work-around for fifo-related socket panics during make -j with high levels of parallelism: acquire Giant in fifo I/O routines. Discussed with: ups MFC after: 3 days	2005-10-01 20:15:41 +00:00
phk	b23c35710b	The NWFS code in RELENG_6 is broken due to a typo in sys/fs/nwfs/nwfs_vfsop= s.c, introduced with the conversion to nmount with revision 1.38. This causes mount_nwfs to fail with the error message: mount_nwfs: mount error: /mnt/netware: syserr = No such file or directo= ry This is caused by a typo on line 178, which specifies "nwfw_args" rather than "nwfs_args". Submitted by: Antony Mawer <gnats@mawer.org> Fat fingers: phk PR: 86757 MFC: 3 days	2005-09-30 18:21:05 +00:00
peadar	e0565b5794	Remove checks for BOOTSIG[23] from FAT32 bootblocks. There seems to be very little documentary evidence outside this implementation to suggest a these checks are neccessary, and more than one camera-formatted flash disk fails the check, but mounts successfully on most other systems. Reviewed By: bde@	2005-09-29 14:09:46 +00:00
rwatson	332a994af0	Back out fifo_vnops.c:1.127, which introduced an sx lock around I/O on a fifo. While this did indeed close the race, confirming suspicions about the nature of the problem, it causes difficulties with blocking I/O on fifos. Discussed with: ups Also spotted by: Peter Holm <peter at holm dot cc>	2005-09-27 16:45:22 +00:00
rwatson	87ac2d2498	Assert v_fifoinfo is non-NULL in fifo_close() in order to catch non-conforming cases sooner. MFC after: 3 days Reported by: Peter Holm <peter at holm dot cc>	2005-09-26 08:17:03 +00:00
rwatson	c9044f078d	Lock the read socket receive buffer when frobbing the sb_state flag on that socket during open, not the write socket receive buffer. This might explain clearing of the sb_state SB_LOCK flag seen occasionally in soreceive() on fifos. MFC after: 3 days Spotted by: ups	2005-09-25 19:52:09 +00:00
phk	a951e327e6	Make rule zero really magical, that way we don't have to do anything when we mount and get zero cost if no rules are used in a mountpoint. Add code to deref rules on unmount. Switch from SLIST to TAILQ. Drop SYSINIT, use SX_SYSINIT and static initializer of TAILQ instead. Drop goto, a break will do. Reduce double pointers to single pointers. Combine reaping and destroying rulesets. Avoid memory leaks in a some error cases.	2005-09-24 07:03:09 +00:00
rwatson	27cf2ffed6	For reasons of consistency (and necessity), assert an exclusive vnode lock on the fifo vnode in fifo_open(): we rely on the vnode lock to serialize access to v_fifoinfo. MFC after: 3 days	2005-09-23 12:39:51 +00:00
rwatson	9c2c1cb9fb	Add fi_sx, an sx lock to serialize I/O operations on the socket pair underlying the POSIX fifo implementation. In 6.x/7.x, fifo access is moved from the VFS layer, where it was serialized using the vnode lock, to the file descriptor layer, where access is protected by a reference count but not serialized. This exposed socket buffer locking to high levels of parallelism in specific fifo workloads, such as make -j 32, which expose as yet unresolved socket buffer bugs. fi_sx re-adds serialization about the read and write routines, although not paths that simply test socket buffer mbuf queue state, such as the poll and kqueue methods. This restores the extra locking cost previously present in some cases, but is an effective workaround for the instability that has been experienced. This workaround should be removed once the bug in socket buffer handling has been fixed. Reported by: kris, jhb, Julien Gabel <jpeg at thilelli dot net>, Peter Holm <peter at holm dot cc>, others MFC after: 3 days	2005-09-22 10:51:12 +00:00
phk	6a408cbd71	Rewamp DEVFS internals pretty severely [1]. Give DEVFS a proper inode called struct cdev_priv. It is important to keep in mind that this "inode" is shared between all DEVFS mountpoints, therefore it is protected by the global device mutex. Link the cdev_priv's into a list, protected by the global device mutex. Keep track of each cdev_priv's state with a flag bit and of references from mountpoints with a dedicated usecount. Reap the benefits of much improved kernel memory allocator and the generally better defined device driver APIs to get rid of the tables of pointers + serial numbers, their overflow tables, the atomics to muck about in them and all the trouble that resulted in. This makes RAM the only limit on how many devices we can have. The cdev_priv is actually a super struct containing the normal cdev as the "public" part, and therefore allocation and freeing has moved to devfs_devs.c from kern_conf.c. The overall responsibility is (to be) split such that kern/kern_conf.c is the stuff that deals with drivers and struct cdev and fs/devfs handles filesystems and struct cdev_priv and their private liason exposed only in devfs_int.h. Move the inode number from cdev to cdev_priv and allocate inode numbers properly with unr. Local dirents in the mountpoints (directories, symlinks) allocate inodes from the same pool to guarantee against overlaps. Various other fields are going to migrate from cdev to cdev_priv in the future in order to hide them. A few fields may migrate from devfs_dirent to cdev_priv as well. Protect the DEVFS mountpoint with an sx lock instead of lockmgr, this lock also protects the directory tree of the mountpoint. Give each mountpoint a unique integer index, allocated with unr. Use it into an array of devfs_dirent pointers in each cdev_priv. Initially the array points to a single element also inside cdev_priv, but as more devfs instances are mounted, the array is extended with malloc(9) as necessary when the filesystem populates its directory tree. Retire the cdev alias lists, the cdev_priv now know about all the relevant devfs_dirents (and their vnodes) and devfs_revoke() will pick them up from there. We still spelunk into other mountpoints and fondle their data without 100% good locking. It may make better sense to vector the revoke event into the tty code and there do a destroy_dev/make_dev on the tty's devices, but that's for further study. Lots of shuffling of stuff and churn of bits for no good reason[2]. XXX: There is still nothing preventing the dev_clone EVENTHANDLER from being invoked at the same time in two devfs mountpoints. It is not obvious what the best course of action is here. XXX: comment out an if statement that lost its body, until I can find out what should go there so it doesn't do damage in the meantime. XXX: Leave in a few extra malloc types and KASSERTS to help track down any remaining issues. Much testing provided by: Kris Much confusion caused by (races in): md(4) [1] You are not supposed to understand anything past this point. [2] This line should simplify life for the peanut gallery.	2005-09-19 19:56:48 +00:00
rwatson	2d8b6f2e27	Assert that (vp) is locked in fifo_close(), since we rely on the exclusive vnode lock to synchronize the reference counts on struct fifoinfo. MFC after: 3 days	2005-09-18 10:44:50 +00:00
phk	2d4ad1cc44	Don't attempt to recurse lockmgr, it doesn't like it.	2005-09-15 21:16:43 +00:00
kan	b4bff5a977	Handle a race condition where NULLFS vnode can be cleaned while threads can still be asleep waiting for lowervp lock. Tested by: kkenn Discussed with: ssouhlal, jeffr	2005-09-15 19:21:26 +00:00
rwatson	7584b6b2c3	The socket pointers in fifoinfo are not permitted to be NULL, so don't check if they are, it just confuses the fifo code more. MFC after: 3 days	2005-09-15 15:45:34 +00:00
phk	3c4b94c1fe	Various minor polishing.	2005-09-15 10:28:19 +00:00
phk	eafa84f647	Protect the devfs rule internal global lists with a sx lock, the per mount locks are not enough. Finer granularity (x)locking could be implemented, but I prefer to keep it simple for now.	2005-09-15 08:50:16 +00:00
phk	1a7de63bbb	Absolve devfs_rule.c from locking responsibility and call it with all necessary locking held.	2005-09-15 08:36:37 +00:00
phk	a543c5b761	Close a race which could result in unwarranted "ruleset %d already running" panics. Previously, recursion through the "include" feature was prevented by marking each ruleset as "running" when applied. This doesn't work for the case where two DEVFS instances try to apply the same ruleset at the same time. Instead introduce the sysctl vfs.devfs.rule_depth (default == 1) which limits how many levels of "include" we will traverse. Be aware that traversal of "include" is recursive and kernel stack size is limited. MFC: after 3 days	2005-09-15 06:57:28 +00:00
rwatson	0e4f08263a	Trim down now (believed to be) unused fifo_ioctl() and fifo_kqfilter() VOP implementations, since they in theory are used only on open file descriptors, in which case the ioctls are via fifo_ioctl_f() and kqueue requests are via fifo_kqfilter_f(). Generate warnings if they are entered for now. These printf() calls should become panic() calls. Annotate and re-implement fifo_ioctl_f(): don't arbitrarily forward ioctls to the socket layer, only forward the ones we explicitly support for fifos. In the case of FIONREAD, don't forward the request to the write socket on a read-write fifo, or the read result is overwritten. Annotate a nasty case for the undefined POSIX O_RDWR on fifos, in which failure of the second ioctl will result in the socket pair being in an inconsistent state. Assert copyright as I find myself rewriting non-trivial parts of fifofs. MFC after: 3 days	2005-09-13 17:46:48 +00:00
rwatson	afc7b6e916	As a result of kqueue locking work, socket buffer locks will always be held when entering a kqueue filter for fifos via a socket buffer event: as such, assert the lock unconditionally rather than acquiring it conditionall. MFC after: 3 days	2005-09-13 10:39:24 +00:00
rwatson	2bda369cf8	Annotate two issues: 1) fifo_kqfilter() is not actually ever used, it likely should be GC'd. 2) fifo_kqfilter_f() doesn't implement EVFILT_VNODE, so detecting events on the underlying vnode for a fifo no longer works (it did in 4.x). Likely, fifo_kqfilter_f() should forward the request to the VFS using fp->f_vnode, which would work once fifo_kqfilter() was detached from the vnode operation vector (removing the fifo override). Discussed with: phk	2005-09-13 09:23:22 +00:00
rwatson	bc5e7eb1f3	Introduce no-op nosup fifo kqueue filter and detach routine, which are used when a read filter is requested on a write-only fifo descriptor, or a write filter is requested on a read-only fifo descriptor. This permits the filters to be registered, but never raises the event, which causes kqueue behavior for fifos to more closely match similar semantics for poll and select, which permit testing for the condition even though the condition will never be raised, and is consistent with POSIX's notion that a fifo has identical semantics to a one-way IPC channel created using pipe() on most operating systems. The fifo regression test suite can now run to completion on HEAD without errors. MFC after: 3 days	2005-09-12 19:59:12 +00:00
rwatson	491de3e2d2	When a request is made to register a filter on a fifo that doesn't apply to the fifo (i.e., not EVFILT_READ or EVFILT_WRITE), reject it as EINVAL, not by returning 1 (EPERM). MFC after: 3 days	2005-09-12 18:07:49 +00:00
rwatson	6b308e01a1	Remove DFLAG_SEEKABLE from fifo file descriptors: fifos are not seekable according to POSIX, not to mention the fact that it doesn't make sense (and hence isn't really implemented). This causes the fifo_misc regression test to succeed.	2005-09-12 12:15:12 +00:00
rwatson	1481446aae	Only poll the fifo for read events if the fifo is attached to a readable file descriptor. Otherwise, the read end of a fifo might return that it is writable (which it isn't). Only poll the fifo for write events if the fifo attached to a writable file descriptor. Otherwise, the write end of a fifo might return that it is readable (which it isn't). In the event that a file is FREAD\|FWRITE (which is allowed by POSIX, but has undefined behavior), we poll for both. MFC after: 3 days	2005-09-12 10:16:18 +00:00
rwatson	919d519cbb	After going to some trouble to identify only the write-related events to poll the write socket for, the fifo polling code proceeded to poll for the complete set of events. Use 'levents' instead of 'events' as the argument to poll, and only poll the write socket if there is interest in write events. MFC after: 3 days	2005-09-12 10:13:15 +00:00
rwatson	5be69d1d56	When a writer opens a fifo, wake up the read socket for read, not the write socket. MFC after: 3 days	2005-09-12 10:07:21 +00:00
rwatson	c9f007b159	Add an assertion that fifo_open() doesn't race against other threads while sleeping to allocate fifo state: due to using the vnode lock to serialize access to a fifo during open, it shouldn't happen (tm). MFC after: 3 days	2005-09-12 10:06:38 +00:00
rwatson	a079789c36	Rather than reaching into the internals of the UNIX domain socket code by calling uipc_connect2() to connect two socket endpoints to create a fifo, call soconnect2(). MFC after: 3 days	2005-09-12 10:05:08 +00:00
phk	aca041ee53	Clean up prototypes.	2005-09-12 08:03:15 +00:00
rodrigc	97c59ad2c7	Cast bf_sysid to const char * when passing it to strncmp(), because strncmp does not take an unsigned char *. Eliminates warning with GCC 4.0.	2005-09-11 16:02:14 +00:00
rodrigc	aeeba2bf5b	Do not declare M_NTFSMNT with extern linkage here, since it is defined with static linkage in ntfs_vfsops.c. Fixes compilation with GCC 4.0.	2005-09-11 15:57:07 +00:00
obrien	4b003b6283	Ensure the full value is written into inode variables. PR: 85503 Submitted by: Dmitry Pryanishnikov <dmitry@atlantis.dp.ua>	2005-09-07 10:32:58 +00:00
ssouhlal	5ea64800e1	Unbreak hpfs/ntfs/udf/ext2fs/reiserfs mounting. Another pointyhat to: ssouhlal	2005-09-03 20:23:41 +00:00
ssouhlal	ad48f84298	Unbreak the build. Pointyhat to: ssouhlal	2005-09-03 00:40:19 +00:00
ssouhlal	45954b5047	Use vput() instead of vrele() in null_reclaim() since the lower vnode is locked. MFC after: 3 days	2005-09-02 15:49:55 +00:00
ssouhlal	f8217f350b	*_mountfs() (if the filesystem mounts from a device) needs devvp to be locked, so lock it. Glanced at by: phk MFC after: 3 days	2005-09-02 15:27:23 +00:00
phk	a469be1ef3	Add a missing dev_relthread() call. Remove unused variable. Spotted by: Hans Petter Selasky <hselasky@c2i.net>	2005-08-29 11:14:18 +00:00
phk	fcf6768753	Handle device drivers with D_NEEDGIANT in a way which does not penalize the 'good' drivers: Allocate a shadow cdevsw and populate it with wrapper functions which grab Giant	2005-08-17 08:19:52 +00:00
phk	8a3fe94804	Collect the devfs related sysctls in one place	2005-08-16 19:25:02 +00:00
phk	e89ebd4119	Create a new internal .h file to communicate very private stuff from kern_conf.c to devfs. For now just two prototypes, more to come.	2005-08-16 19:08:01 +00:00
phk	4edc625526	Eliminate effectively unused dm_basedir field from devfs_mount.	2005-08-15 19:40:53 +00:00
grehan	ba88fd3c57	- restore the ability to mount cd9660 filesystems as root by inverting some of the options test, specifically the joliet and rockridge tests. Since the root mount callchain doesn't go through cd9660_cmount, the default mount options aren't set. Rather than having the main codepath assume the options are there, test for the absence of the inverted optioin e.g. instead of vfs_flagopt(.. "joliet" ..), test for !vfs_flagopt(.. "nojoliet" ..) This works for root mount, non-root mount and future nmount cases. - in cd9660_cmount, remove inadvertent setting of "gens" when "extatt" was set. Reported by: grehan, Dario Freni <saturnero at freesbie org> Tested by: Dario Freni Not objected to by: phk MFC after: 3 days	2005-08-14 04:19:36 +00:00

1 2 3 4 5 ...

1764 Commits