freebsd-skq

Author	SHA1	Message	Date
Robert Watson	10b7196db4	Back out pseudo_vnops.c:1.45, which was a workaround for pfind() returning incompletely initialized processes. This problem was eliminated by kern_proc.c:1.215, which causes pfind() not to return processes in the PRS_NEW state.	2004-09-02 16:04:09 +00:00
Brooks Davis	b443062227	General modernization of coda: - Ditch NVCODA - Don't use a static major - Don't declare functions extern Reviewed by: peter	2004-09-01 01:19:52 +00:00
Peter Wemm	f37a929ca1	Kill count device support from config. I've changed the last few remaining consumers to have the count passed as an option. This is i4b, pc98/wdc, and coda. Bump configvers.h from 500013 to 600000. Remove heuristics that tried to parse "device ed5" as 5 units of the ed device. This broke things like the snd_emu10k1 device, which required quotes to make it parse right. The no-longer-needed quotes have been removed from NOTES, GENERIC etc. eg, I've removed the quotes from: device snd_maestro device "snd_maestro3" device snd_mss I believe everything will still compile and work after this.	2004-08-30 23:03:58 +00:00
Tim J. Robbins	db575a8507	Remove bogus vrele() call added in previous.	2004-08-27 11:24:31 +00:00
Tim J. Robbins	1a9415af17	Improve the robustness of MSDOSFSMNT_KICONV handling: - Use copyinstr() to read cs_win, cs_dos, cs_local strings from the mount argument structure instead of reading through user-space pointers(!). - When mounting a filesystem, or updating an existing mount, only try to update the iconv handles from the information in the mount argument structure if the structure itself has the MSDOSFSMNT_KICONV flag set. - Attempt to handle failure of update_mp() in the MNT_UPDATE case.	2004-08-26 13:16:44 +00:00
Dag-Erling Smørgrav	c9b9a82654	Release the vnode cache mutex when calling vgone(), since vgone() may sleep. This makes pfs_exit() even less efficient than before, but on the bright side, the vnode cache mutex no longer needs to be recursive.	2004-08-15 21:58:02 +00:00
John-Mark Gurney	ad3b9257c2	Add locking to the kqueue subsystem. This also makes the kqueue subsystem a more complete subsystem, and removes the knowlege of how things are implemented from the drivers. Include locking around filter ops, so a module like aio will know when not to be unloaded if there are outstanding knotes using it's filter ops. Currently, it uses the MTX_DUPOK even though it is not always safe to aquire duplicate locks. Witness currently doesn't support the ability to discover if a dup lock is ok (in some cases). Reviewed by: green, rwatson (both earlier versions)	2004-08-15 06:24:42 +00:00
Robert Watson	d990378077	Commit a work-around for a more general bug involving process state: check whether p_ucred is NULL or not in pfs_getattr() before dereferencing the credential, and return ENOENT if there wasn't one. This is a symptom of a larger problem, wherein pfind() can return references to incompletely initialized processes, and we instead ought to not return them, or check the process state before acting on the process. Reported by: kris Discussed with: tjr, others	2004-08-13 20:27:56 +00:00
Poul-Henning Kamp	7ac439fec4	use bufdone() not biodone().	2004-08-08 13:23:05 +00:00
Poul-Henning Kamp	a8687125e5	Use bufdone(), not biodone().	2004-08-08 13:20:43 +00:00
Poul-Henning Kamp	e83f142353	Push all changes to disk before downgrading a mount from rw to ro.	2004-08-07 22:05:12 +00:00
Poul-Henning Kamp	5e8c582ac2	Put a version element in the VFS filesystem configuration structure and refuse initializing filesystems with a wrong version. This will aid maintenance activites on the 5-stable branch. s/vfs_mount/vfs_omount/ s/vfs_nmount/vfs_mount/ Name our filesystems mount function consistently. Eliminate the namiedata argument to both vfs_mount and vfs_omount. It was originally there to save stack space. A few places abused it to get hold of some credentials to pass around. Effectively it is unused. Reorganize the root filesystem selection code.	2004-07-30 22:08:52 +00:00
Poul-Henning Kamp	d634f69316	Remove global variable rootdevs and rootvp, they are unused as such. Add local rootvp variables as needed. Remove checks for miniroot's in the swappartition. We never did that and most of the filesystems could never be used for that, but it had still been copy&pasted all over the place.	2004-07-28 20:21:04 +00:00
Alexander Kabaev	1f74490224	Avoid casts as lvalues.	2004-07-28 06:30:43 +00:00
Alexander Kabaev	fa2edf7bcd	Avoid casts as lvalues.	2004-07-28 06:05:41 +00:00
Colin Percival	56f21b9d74	Rename suser_cred()'s PRISON_ROOT flag to SUSER_ALLOWJAIL. This is somewhat clearer, but more importantly allows for a consistent naming scheme for suser_cred flags. The old name is still defined, but will be removed in a few days (unless I hear any complaints...) Discussed with: rwatson, scottl Requested by: jhb	2004-07-26 07:24:04 +00:00
Robert Watson	de592112e1	In devfs_allocv(), rather than assigning 'td = curthread', assert that the caller passes in a td that is curthread, and consistently pass 'td' into vget(). Remove some bogus logic that passed in td or curthread conditional on td being non-NULL, which seems redundant in the face of the earlier assignment of td to curthread if td is NULL. In devfs_symlink(), cache the passed thread in 'td' so we don't have to keep retrieving it from the 'ap' structure, and assert that td is curthread (since we dereference it to get thread-local td_ucred). Use 'td' in preference to curthread for later lockmgr calls, since they are equal.	2004-07-22 17:03:14 +00:00
Poul-Henning Kamp	3e019deaed	Do a pass over all modules in the kernel and make them return EOPNOTSUPP for unknown events. A number of modules return EINVAL in this instance, and I have left those alone for now and instead taught MOD_QUIESCE to accept this as "didn't do anything".	2004-07-15 08:26:07 +00:00
Poul-Henning Kamp	d521eae21d	Another LINT compilation fix	2004-07-13 09:47:27 +00:00
Poul-Henning Kamp	a546742dd7	Make LINT compile	2004-07-13 09:46:46 +00:00
Robert Watson	ee3c585ae5	Remove 'td = curthread' that shadows the arguments to coda_root(). Missed by: alfred	2004-07-12 14:11:26 +00:00
Alfred Perlstein	f257b7a54b	Make VFS_ROOT() and vflush() take a thread argument. This is to allow filesystems to decide based on the passed thread which vnode to return. Several filesystems used curthread, they now use the passed thread.	2004-07-12 08:14:09 +00:00
Marcel Moolenaar	e2fb7a7be0	Update for the KDB framework: o Call kdb_enter() instead of Debugger().	2004-07-10 21:21:13 +00:00
Marcel Moolenaar	4ea4f1f97e	Update for the KDB framework: o Call kdb_enter() instead of Debugger(). o Make debugging code conditional upon KDB instead of DDB.	2004-07-10 21:20:11 +00:00
Dag-Erling Smørgrav	195a6b21e4	Accumulate directory entries in a fixed-length sbuf, and uiomove them in one go before returning. This avoids calling uiomove() while holding allproc_lock. Don't adjust uio->uio_offset manually, uiomove() does that for us. Don't drop allproc_lock before calling panic(). Suggested by: alfred	2004-07-09 11:43:37 +00:00
Poul-Henning Kamp	e3c5a7a4dd	When we traverse the vnodes on a mountpoint we need to look out for our cached 'next vnode' being removed from this mountpoint. If we find that it was recycled, we restart our traversal from the start of the list. Code to do that is in all local disk filesystems (and a few other places) and looks roughly like this: MNT_ILOCK(mp); loop: for (vp = TAILQ_FIRST(&mp...); (vp = nvp) != NULL; nvp = TAILQ_NEXT(vp,...)) { if (vp->v_mount != mp) goto loop; MNT_IUNLOCK(mp); ... MNT_ILOCK(mp); } MNT_IUNLOCK(mp); The code which takes vnodes off a mountpoint looks like this: MNT_ILOCK(vp->v_mount); ... TAILQ_REMOVE(&vp->v_mount->mnt_nvnodelist, vp, v_nmntvnodes); ... MNT_IUNLOCK(vp->v_mount); ... vp->v_mount = something; (Take a moment and try to spot the locking error before you read on.) On a SMP system, one CPU could have removed nvp from our mountlist but not yet gotten to assign a new value to vp->v_mount while another CPU simultaneously get to the top of the traversal loop where it finds that (vp->v_mount != mp) is not true despite the fact that the vnode has indeed been removed from our mountpoint. Fix: Introduce the macro MNT_VNODE_FOREACH() to traverse the list of vnodes on a mountpoint while taking into account that vnodes may be removed from the list as we go. This saves approx 65 lines of duplicated code. Split the insmntque() which potentially moves a vnode from one mount point to another into delmntque() and insmntque() which does just what the names say. Fix delmntque() to set vp->v_mount to NULL while holding the mountpoint lock.	2004-07-04 08:52:35 +00:00
Poul-Henning Kamp	bffd1b7af4	Remove "register" keyword and trailing white space.	2004-07-03 16:56:45 +00:00
Tim J. Robbins	3bc482ec1c	By popular request, add a workaround that allows large (>128GB or so) FAT32 filesystems to be mounted, subject to some fairly serious limitations. This works by extending the internal pseudo-inode-numbers generated from the file's starting cluster number to 64-bits, then creating a table mapping these into arbitrary 32-bit inode numbers, which can fit in struct dirent's d_fileno and struct vattr's va_fileid fields. The mappings do not persist across unmounts or reboots, so it's not possible to export these filesystems through NFS. The mapping table may grow to be rather large, and may grow large enough to exhaust kernel memory on filesystems with millions of files. Don't enable this option unless you understand the consequences.	2004-07-03 13:22:38 +00:00
Robert Watson	1cf30f57e1	Remove spls from portal_open(). Acquire socket lock while sleeping waiting for the socket to connect and use msleep() on the socket mute rather than tsleep(). Acquire socket buffer mutexes around read-modify-write of socket buffer flags.	2004-06-24 00:47:23 +00:00
Scott Long	bf1c3ddd68	Make the udf_vnops side endian clean.	2004-06-23 21:49:03 +00:00
Scott Long	89ec2c3c42	First half of making UDF be endian-clean. This addresses the vfsops side.	2004-06-23 19:36:09 +00:00
Bruce Evans	587a4462c9	Include <sys/mutex.h> and its prerequisite <sys/lock.h> instead of depending on namespace pollution in <sys/vnode.h> for the definition of mutex interfaces used in SOCKBUF_*LOCK(). Sorted includes. Removed unused includes.	2004-06-23 06:47:49 +00:00
Robert Watson	7d84f9d293	Remove unlocked read annotation for sbspace(); the read is locked.	2004-06-23 00:35:50 +00:00
Poul-Henning Kamp	9d96090725	Reduce a fair bit of the atomics because we are now called with a lock from kern_conf.c and cdev's act a lot more like real objects these days.	2004-06-18 08:08:47 +00:00
Robert Watson	c012260726	Merge some additional leaf node socket buffer locking from rwatson_netperf: Introduce conditional locking of the socket buffer in fifofs kqueue filters; KNOTE() will be called holding the socket buffer locks in fifofs, but sometimes the kqueue() system call will poll using the same entry point without holding the socket buffer lock. Introduce conditional locking of the socket buffer in the socket kqueue filters; KNOTE() will be called holding the socket buffer locks in the socket code, but sometimes the kqueue() system call will poll using the same entry points without holding the socket buffer lock. Simplify the logic in sodisconnect() since we no longer need spls. NOTE: To remove conditional locking in the kqueue filters, it would make sense to use a separate kqueue API entry into the socket/fifo code when calling from the kqueue() system call.	2004-06-18 02:57:55 +00:00
Robert Watson	9535efc00d	Merge additional socket buffer locking from rwatson_netperf: - Lock down low hanging fruit use of sb_flags with socket buffer lock. - Lock down low hanging fruit use of so_state with socket lock. - Lock down low hanging fruit use of so_options. - Lock down low-hanging fruit use of sb_lowwat and sb_hiwat with socket buffer lock. - Annotate situations in which we unlock the socket lock and then grab the receive socket buffer lock, which are currently actually the same lock. Depending on how we want to play our cards, we may want to coallesce these lock uses to reduce overhead. - Convert a if()->panic() into a KASSERT relating to so_state in soaccept(). - Remove a number of splnet()/splx() references. More complex merging of socket and socket buffer locking to follow.	2004-06-17 22:48:11 +00:00
Poul-Henning Kamp	f3732fd15b	Second half of the dev_t cleanup. The big lines are: NODEV -> NULL NOUDEV -> NODEV udev_t -> dev_t udev2dev() -> findcdev() Various minor adjustments including handling of userland access to kernel space struct cdev etc.	2004-06-17 17:16:53 +00:00
Poul-Henning Kamp	89c9c53da0	Do the dreaded s/dev_t/struct cdev */ Bump __FreeBSD_version accordingly.	2004-06-16 09:47:26 +00:00
Julian Elischer	fa88511615	Nice, is a property of a process as a whole.. I mistakenly moved it to the ksegroup when breaking up the process structure. Put it back in the proc structure.	2004-06-16 00:26:31 +00:00
Robert Watson	7721f5d760	Grab the socket buffer send or receive mutex when performing a read-modify-write on the sb_state field. This commit catches only the "easy" ones where it doesn't interact with as yet unmerged locking.	2004-06-15 03:51:44 +00:00
Robert Watson	c0b99ffa02	The socket field so_state is used to hold a variety of socket related flags relating to several aspects of socket functionality. This change breaks out several bits relating to send and receive operation into a new per-socket buffer field, sb_state, in order to facilitate locking. This is required because, in order to provide more granular locking of sockets, different state fields have different locking properties. The following fields are moved to sb_state: SS_CANTRCVMORE (so_state) SS_CANTSENDMORE (so_state) SS_RCVATMARK (so_state) Rename respectively to: SBS_CANTRCVMORE (so_rcv.sb_state) SBS_CANTSENDMORE (so_snd.sb_state) SBS_RCVATMARK (so_rcv.sb_state) This facilitates locking by isolating fields to be located with other identically locked fields, and permits greater granularity in socket locking by avoiding storing fields with different locking semantics in the same short (avoiding locking conflicts). In the future, we may wish to coallesce sb_state and sb_flags; for the time being I leave them separate and there is no additional memory overhead due to the packing/alignment of shorts in the socket buffer structure.	2004-06-14 18:16:22 +00:00
Don Lewis	866046f5a6	Add MSG_NBIO flag option to soreceive() and sosend() that causes them to behave the same as if the SS_NBIO socket flag had been set for this call. The SS_NBIO flag for ordinary sockets is set by fcntl(fd, F_SETFL, O_NONBLOCK). Pass the MSG_NBIO flag to the soreceive() and sosend() calls in fifo_read() and fifo_write() instead of frobbing the SS_NBIO flag on the underlying socket for each I/O operation. The O_NONBLOCK flag is a property of the descriptor, and unlike ordinary sockets, fifos may be referenced by multiple descriptors.	2004-06-01 01:18:51 +00:00
Poul-Henning Kamp	5dba30f15a	add missing #include <sys/module.h>	2004-05-30 20:27:19 +00:00
Don Lewis	2526dc2b61	Switch from using the vnode interlock to a private mutex in fifo_open() to avoid lock order problems when manipulating the sockets associated with the fifo. Minor optimization of a couple of calls to fifo_cleanup() from fifo_open().	2004-05-17 20:16:40 +00:00
Alan Cox	5a32489377	Make vm_page's PG_ZERO flag immutable between the time of the page's allocation and deallocation. This flag's principal use is shortly after allocation. For such cases, clearing the flag is pointless. The only unusual use of PG_ZERO is in vfs_bio_clrbuf(). However, allocbuf() never requests a prezeroed page. So, vfs_bio_clrbuf() never sees a prezeroed page. Reviewed by: tegge@	2004-05-06 05:03:23 +00:00
Poul-Henning Kamp	bc20ced763	Do not drop Giant around the poll method yet, we're not ready for it.	2004-04-12 21:52:52 +00:00
Warner Losh	f36cfd49ad	Remove advertising clause from University of California Regent's license, per letter dated July 22, 1999 and email from Peter Wemm, Alan Cox and Robert Watson. Approved by: core, peter, alc, rwatson	2004-04-07 20:46:16 +00:00
Pawel Jakub Dawidek	c5b7c33bc8	Remove ps_argsopen from this check, because of two reasons: 1. This check if wrong, because it is true by default (kern.ps_argsopen is 1 by default) (p_cansee() is not even checked). 2. Sysctl kern.ps_argsopen is going away.	2004-04-01 00:04:23 +00:00
Robert Watson	db48c0d254	Export uipc_connect2() from uipc_usrreq.c instead of unp_connect2(), and consume that interface in portalfs and fifofs instead. In the new world order, unp_connect2() assumes that the unpcb mutex is held, whereas uipc_connect2() validates that the passed sockets are UNIX domain sockets, then grabs the mutex. NB: the portalfs and fifofs code gets down and dirty with UNIX domain sockets. Maybe this is a bad thing.	2004-03-31 01:41:30 +00:00
Scott Long	744bb56d95	Catch all cases where bread() returns an error and a valid bp, and release the bp. Obtained from: DragonFlyBSD	2004-03-30 05:01:48 +00:00
Peter Wemm	b21126c6b3	Clean up the stub fake vnode locking implemenations. The main reason this stuff was here (NFS) was fixed by Alfred in November. The only remaining consumer of the stub functions was umapfs, which is horribly horribly broken. It has missed out on about the last 5 years worth of maintenence that was done on nullfs (from which umapfs is derived). It needs major work to bring it up to date with the vnode locking protocol. umapfs really needs to find a caretaker to bring it into the 21st century. Functions GC'ed: vop_noislocked, vop_nolock, vop_nounlock, vop_sharedlock.	2004-03-29 22:41:21 +00:00
Robert Watson	0ff34b5e1c	Don't reject FAT file systems with a number of "Heads" greater than 255; USB keychains exist that use 256 as the number of heads. This check has also been removed in Darwin (along with most of the other head/sector sanity checks).	2004-03-14 23:28:12 +00:00
Brian Feldman	6fedf94775	When taking event callbacks (like process_exit) out from under Giant, those which do not lock Giant themselves will be exposed. Unbreak pfs_exit().	2004-03-14 15:57:45 +00:00
Poul-Henning Kamp	ceb58ca58f	When I was a kid my work table was one cluttered mess an cleaning it up were a rather overwhelming task. I soon learned that if you don't know where you're going to store something, at least try to pile it next to something slightly related in the hope that a pattern emerges. Apply the same principle to the ffs/snapshot/softupdates code which have leaked into specfs: Add yet a buf-quasi-method and call it from the only two places I can see it can make a difference and implement the magic in ffs_softdep.c where it belongs. It's not pretty, but at least it's one less layer violated.	2004-03-11 18:50:33 +00:00
Poul-Henning Kamp	651b11eaf2	Remove unused second arg to vfinddev(). Don't call addaliasu() on VBLK nodes.	2004-03-11 16:33:11 +00:00
Poul-Henning Kamp	39a78f8cf4	Don't call devsw() more than we need to, and in particular do not expose ourselves to device removal by not checking for it the second time. Use count_dev(dev) rather than vcount(vp)	2004-03-10 20:56:28 +00:00
Scott Long	c80a90c51f	Change __FUNCTION__ to __func__ Submitted by: Stefan Farfeleder	2004-03-03 07:43:34 +00:00
Robert Watson	746e5bf09b	Rename dup_sockaddr() to sodupsockaddr() for consistency with other functions in kern_socket.c. Rename the "canwait" field to "mflags" and pass M_WAITOK and M_NOWAIT in from the caller context rather than "1" or "0". Correct mflags pass into mac_init_socket() from previous commit to not include M_ZERO. Submitted by: sam	2004-03-01 03:14:23 +00:00
Poul-Henning Kamp	ad3917e8e6	Do not attempt to open NODEV	2004-02-24 09:59:35 +00:00
Tim J. Robbins	f71cc8681b	Fix comment containing vop_readdir_args contents: a_cookies is really u_long ** not u_long *.	2004-02-23 01:43:00 +00:00
Tim J. Robbins	224f219b31	cookies is an array of u_long, not u_int, so MALLOC() it accordingly. Allocating it with the wrong size could have caused corruption on 64-bit architectures.	2004-02-23 01:40:46 +00:00
Bruce Evans	0ef0dd6f85	Fixed a serious off by 1 error. The cluster-in-use bitmap was overrun by 1 u_int if the number of clusters was 1 more than a multiple of (8 * sizeof(u_int)). The bitmap is malloced and large (often huge), so fatal overrun probably only occurred if the number of clusters was 1 more than 1 multiple of PAGE_SIZE/8.	2004-02-21 22:47:19 +00:00
Poul-Henning Kamp	cd690b60de	Device megapatch 6/6: This is what we came here for: Hang dev_t's from their cdevsw, refcount cdevsw and dev_t and generally keep track of things a lot better than we used to: Hold a cdevsw reference around all entrances into the device driver, this will be necessary to safely determine when we can unload driver code. Hold a dev_t reference while the device is open. KASSERT that we do not enter the driver on a non-referenced dev_t. Remove old D_NAG code, anonymous dev_t's are not a problem now. When destroy_dev() is called on a referenced dev_t, move it to dead_cdevsw's list. When the refcount drops, free it. Check that cdevsw->d_version is correct. If not, set all methods to the dead_*() methods to prevent entrance into driver. Print warning on console to this effect. The device driver may still explode if it is also incompatible with newbus, but in that case we probably didn't get this far in the first place.	2004-02-21 21:57:26 +00:00
Poul-Henning Kamp	816d62bbb9	Device megapatch 5/6: Remove the unused second argument from udev2dev(). Convert all remaining users of makedev() to use udev2dev(). The semantic difference is that udev2dev() will only locate a pre-existing dev_t, it will not line makedev() create a new one. Apart from the tiny well controlled windown in D_PSEUDO drivers, there should no longer be any "anonymous" dev_t's in the system now, only dev_t's created with make_dev() and make_dev_alias()	2004-02-21 21:32:15 +00:00
Poul-Henning Kamp	dc08ffec87	Device megapatch 4/6: Introduce d_version field in struct cdevsw, this must always be initialized to D_VERSION. Flip sense of D_NOGIANT flag to D_NEEDGIANT, this involves removing four D_NOGIANT flags and adding 145 D_NEEDGIANT flags.	2004-02-21 21:10:55 +00:00
Poul-Henning Kamp	bc55355956	Report the correct length for symlink entries.	2004-02-19 19:09:52 +00:00
Tim J. Robbins	bddcdc5193	Use size_t or ssize_t wherever appropriate instead of casting from int * to size_t *, which is incorrect because they may have different widths. This caused some subtle forms of corruption, the mostly frequently reported one being that the last character of a filename was sometimes duplicated on amd64.	2004-02-19 09:56:58 +00:00
Tom Rhodes	01ba334c9a	Do not place dirmask in unnamed padding. Move it to the bottom of this list where it should have been added originally. Prodded by: bde	2004-02-17 16:53:41 +00:00
Tim J. Robbins	8bb386f217	If the "next free cluster" field of the FSInfo block is 0xFFFFFFFF, it means that the correct value is unknown. Since this value is just a hint to improve performance, initially assume that the first non-reserved cluster is free, then correct this assumption if necessary before writing the FSInfo block back to disk. PR: 62826 MFC after: 2 weeks	2004-02-17 08:51:49 +00:00
Poul-Henning Kamp	f82dfde7e5	White-space align a struct definition. Move a SYSINIT to the file where it belongs.	2004-02-15 21:43:08 +00:00
Bruce Evans	e9827c6d93	Fixed some style bugs: - don't unlock the vnode after vinvalbuf() only to have to relock it almost immediately. - don't refer to devices classified by vn_isdisk() as block devices.	2004-02-14 04:41:13 +00:00
Bruce Evans	67406320ff	MFffs (ffs_vfsops.c 1.227: clean up open mode bandaid). This reduces gratuitous differences with ffs a little.	2004-02-12 17:54:58 +00:00
Jacques Vidrine	a9c2bfa8e9	Fix a panic in pseudofs(9) that could occur when doing an I/O operation with a large request or large offset. Reported by: Joel Ray Holveck <joelh@piquan.org> Submitted by: des	2004-02-10 21:06:47 +00:00
Tim J. Robbins	b4484bf031	Fixes problems that occurred when a file was removed and a directory created with the same name, and vice versa: - Immediately recycle vnodes of files & directories that have been deleted or renamed. - When looking an entry in the VFS name cache or smbfs's private cache, make sure the vnode type is consistent with the type of file the server thinks it is, and re-create the vnode if it isn't. The alternative to this is to recycle vnodes unconditionally when their use count drops to 0, but this would make all the caching we do mostly useless. PR: 62342 MFC after: 2 weeks	2004-02-10 05:53:02 +00:00
John Baldwin	91d5354a2c	Locking for the per-process resource limits structure. - struct plimit includes a mutex to protect a reference count. The plimit structure is treated similarly to struct ucred in that is is always copy on write, so having a reference to a structure is sufficient to read from it without needing a further lock. - The proc lock protects the p_limit pointer and must be held while reading limits from a process to keep the limit structure from changing out from under you while reading from it. - Various global limits that are ints are not protected by a lock since int writes are atomic on all the archs we support and thus a lock wouldn't buy us anything. - All accesses to individual resource limits from a process are abstracted behind a simple lim_rlimit(), lim_max(), and lim_cur() API that return either an rlimit, or the current or max individual limit of the specified resource from a process. - dosetrlimit() was renamed to kern_setrlimit() to match existing style of other similar syscall helper functions. - The alpha OSF/1 compat layer no longer calls getrlimit() and setrlimit() (it didn't used the stackgap when it should have) but uses lim_rlimit() and kern_setrlimit() instead. - The svr4 compat no longer uses the stackgap for resource limits calls, but uses lim_rlimit() and kern_setrlimit() instead. - The ibcs2 compat no longer uses the stackgap for resource limits. It also no longer uses the stackgap for accessing sysctl's for the ibcs2_sysconf() syscall but uses kernel_sysctl() instead. As a result, ibcs2_sysconf() no longer needs Giant. - The p_rlimit macro no longer exists. Submitted by: mtm (mostly, I only did a few cleanups and catchups) Tested on: i386 Compiled on: alpha, amd64	2004-02-04 21:52:57 +00:00
Colin Percival	9d0be84912	Fix style(9) of my previous commit. Noticed by: nate Approved by: nate, rwatson (mentor)	2004-01-21 18:03:54 +00:00
Colin Percival	9f8ef8b8d1	Allow devfs path rules to work on directories. Without this fix, devfs rule add path fd unhide is a no-op, while it should unhide the fd subdirectory. Approved by: phk, rwatson (mentor) PR: kern/60897	2004-01-21 16:43:29 +00:00
Alexander Kabaev	6bd39fe978	Spell magic '16' number as IO_SEQSHIFT.	2004-01-19 20:03:43 +00:00
Brian Feldman	6d8e1f823b	Do not allow operations which cause known file-system corruption.	2004-01-16 22:55:25 +00:00
Brian Feldman	478382096e	Remove a warning.	2004-01-16 22:54:33 +00:00
Brian Feldman	016344807a	Fix an upper-vnode leak created in revision 1.52. When an upper-layer file has been removed, it should be purged from the cache, but it need not be removed from the directory stack causing corruption; instead, it will simply be removed once the last references and holds on it are dropped at the end of the unlink/rmdir system calls, and the normal !UN_CACHED VOP_INACTIVE() handler for unionfs finishes it off. This is easily reproduced by repeated "echo >file; rm file" on a unionfs mount. Strangely, "echo -n >file; rm file" didn't make it happen.	2004-01-16 16:31:01 +00:00
Tim J. Robbins	08fe4bfbda	Fix an inverted test for NOPEN in the unused function smb_smb_flush().	2004-01-12 14:43:49 +00:00
Don Lewis	b8b67f93a4	Don't try to unlock the directory vnode in null_lookup() if the lock is shared with the underlying file system and the lookup in the underlying file system did the unlock for us.	2004-01-11 23:44:32 +00:00
Tim J. Robbins	835fb61618	Restore closing of SMB find handle in smbfs_close().	2004-01-10 03:45:30 +00:00
Robert Watson	1f1ca35f69	Lock p->p_textvp before calling vn_fullpath() on it. Note the potential lock order concern due to the vnode lock held simultaneously by the caller into procfs. Reported by: kuriyama Approved by: des	2004-01-07 17:58:51 +00:00
Tim J. Robbins	208a7a97a3	In smbfs_inactive(), only invalidate the node's attribute cache if we had to send a file close request to the server.	2004-01-04 00:57:22 +00:00
Tim J. Robbins	00fff2c71a	Pass ACL, extended attribute and MAC vnode ops down the vnode stack.	2004-01-03 00:36:46 +00:00
Poul-Henning Kamp	49e9fc0a0d	Improve on POLA by populating DEVFS before doing devfs(8) rule ioctls. PR: 60687 Spotted by: Colin Percival <cperciva@daemonology.net>	2004-01-02 19:02:28 +00:00
Bruce Evans	392dbea3f6	Fixed some (most) style bugs in rev.1.33. Mainly 4-char indentation (msdosfs uses normal 8-char indentation almost everywhere else), too-long lines, and minor English usage errors. The verbose formal comment before the new function is still abnormal.	2003-12-29 11:59:05 +00:00
Bruce Evans	be039c568f	Fixed some minor style bugs in rev.1.144. All related to msdosfs_advlock() (mainly unsorting). There were no changes related to the dirty flag here. The reference NetBSD implementation put msdosfs_advlock() in a different place. This commit only moves its declarations and changes some of the function body to be like the NetBSD version.	2003-12-29 10:12:02 +00:00
Bruce Evans	a78c928753	Fixed style bugs in rev.1.112. The bugs started with obscure magic numbers in comments (Apple PR numbers?) and didn't improve.	2003-12-29 09:50:42 +00:00
Bruce Evans	9efe7d9d83	v_vxproc was a bogus name for a thread (pointer).	2003-12-28 09:12:56 +00:00
Tom Rhodes	cede1f563c	Make msdosfs support the dirty flag in FAT16 and FAT32. Enable lockf support. PR: 55861 Submitted by: Jun Su <junsu@m-net.arbornet.org> (original version) Reviewed by: make universe	2003-12-26 17:19:19 +00:00
Tim J. Robbins	07a65634d9	Make oldsize in smbfs_getattr() 64 bits wide instead of 32 to avoid truncation when files are larger than 4GB.	2003-12-22 12:33:31 +00:00
Tim J. Robbins	ce55c2612c	Avoid sign extension when casting signed characters to unsigned wide characters in ntfs_u28(). This fixes the conversion of filenames containing single-byte characters with the high bit set.	2003-12-16 01:52:54 +00:00
Max Khon	0f4e4130e1	Make msdosfs long filenames matching case insensitive again. PR: 59765 Submitted by: Ryuichiro Imura <imura@ryu16.org>	2003-12-08 08:32:20 +00:00
Dag-Erling Smørgrav	b331ec01c4	Constify, and add an API function to find a named node in a directory.	2003-12-07 17:41:19 +00:00
Dag-Erling Smørgrav	7caaf6c9c9	Minor whitespace and style issues.	2003-12-07 17:40:00 +00:00
Dag-Erling Smørgrav	e4b3851568	Remove useless SMP check code.	2003-12-07 17:37:44 +00:00
Scott Long	774114995e	Re-arrange and consolidate some random debugging stuff	2003-12-07 05:04:49 +00:00
Alexander Kabaev	c391349841	Fix vnode locking in fdesc_setattr. Lock vnode before invoking VOP_SETATTR on it. Approved by: re@ (rwatson)	2003-11-19 04:14:42 +00:00
Don Lewis	95c6cd2f4b	Use "fip->fi_readers == 0 && fip->fi_writers == 0" as the condition for disposing fifo resources in fifo_cleanup() instead using of "vp->v_usecount == 1". There may be other references to the vnode, for instance by nullfs, at the time fifo_open() or fifo_close() is called, which could cause a resource leak. Don't bother grabbing the vnode interlock in fifo_cleanup() since it no longer accesses v_usecount.	2003-11-16 01:11:11 +00:00
David Schultz	95eac68fea	- A sanity check in unionfs verifies that lookups of '.' return the vnode of the parent. However, this check should not be performed if the lookup failed. This change should fix "union_lookup returning . not same as startdir" panics people were seeing. The bug was introduced by an incomplete import of a NetBSD delta in rev 1.38. - Move the aforementioned check out from DIAGNOSTIC. Performance is the least of our unionfs worries. - Minor reorganization. PR: 53004 MFC after: 1 week	2003-11-14 08:23:13 +00:00
Poul-Henning Kamp	98d87dfecd	Initialize b_iooffset correctly.	2003-11-13 09:58:09 +00:00
Poul-Henning Kamp	2178f0e340	Don't mess around with spare fields of public structures.	2003-11-12 09:54:07 +00:00
Poul-Henning Kamp	1054ba63be	Don't mess about with spare fields in public structures.	2003-11-12 09:52:10 +00:00
Robert Watson	eca8a663d4	Modify the MAC Framework so that instead of embedding a (struct label) in various kernel objects to represent security data, we embed a (struct label *) pointer, which now references labels allocated using a UMA zone (mac_label.c). This allows the size and shape of struct label to be varied without changing the size and shape of these kernel objects, which become part of the frozen ABI with 5-STABLE. This opens the door for boot-time selection of the number of label slots, and hence changes to the bound on the number of simultaneous labeled policies at boot-time instead of compile-time. This also makes it easier to embed label references in new objects as required for locking/caching with fine-grained network stack locking, such as inpcb structures. This change also moves us further in the direction of hiding the structure of kernel objects from MAC policy modules, not to mention dramatically reducing the number of '&' symbols appearing in both the MAC Framework and MAC policy modules, and improving readability. While this results in minimal performance change with MAC enabled, it will observably shrink the size of a number of critical kernel data structures for the !MAC case, and should have a small (but measurable) performance benefit (i.e., struct vnode, struct socket) do to memory conservation and reduced cost of zeroing memory. NOTE: Users of MAC must recompile their kernel and all MAC modules as a result of this change. Because this is an API change, third party MAC modules will also need to be updated to make less use of the '&' symbol. Suggestions from: bmilekic Obtained from: TrustedBSD Project Sponsored by: DARPA, Network Associates Laboratories	2003-11-12 03:14:31 +00:00
Don Lewis	8d0c247562	If fifo_open() is interrupted, fifo_close() may not get called, causing a resource leak. Move the resource deallocation code from fifo_close() to a new function, fifo_cleanup(), and call fifo_cleanup() from fifo_close() and the appropriate places in fifo_open(). Tested by: Lukas Ertl Pointy hat to: truckman	2003-11-10 22:21:00 +00:00
Seigo Tanimura	512824f8f7	- Implement selwakeuppri() which allows raising the priority of a thread being waken up. The thread waken up can run at a priority as high as after tsleep(). - Replace selwakeup()s with selwakeuppri()s and pass appropriate priorities. - Add cv_broadcastpri() which raises the priority of the broadcast threads. Used by selwakeuppri() if collision occurs. Not objected in: -arch, -current	2003-11-09 09:17:26 +00:00
Scott Long	6565282c62	Add hooks for translating directories entries using the iconv methods. Submitted by: imura@ryu16.org	2003-11-05 06:56:08 +00:00
Scott Long	c5a1bf1b20	Add udf_UncompressUnicodeByte() for processing cs0 strings in a way that the iconv mehtods can handle Submitted by: imura@ryu16.org	2003-11-05 06:55:23 +00:00
Alexander Kabaev	ca430f2e92	Remove mntvnode_mtx and replace it with per-mountpoint mutex. Introduce two new macros MNT_ILOCK(mp)/MNT_IUNLOCK(mp) to operate on this mutex transparently. Eventually new mutex will be protecting more fields in struct mount, not only vnode list. Discussed with: jeff	2003-11-05 04:30:08 +00:00
Alexander Kabaev	cb9ddc80ae	Take care not to call vput if thread used in corresponding vget wasn't curthread, i.e. when we receive a thread pointer to use as a function argument. Use VOP_UNLOCK/vrele in these cases. The only case there td != curthread known at the moment is boot() calling sync with thread0 pointer. This fixes the panic on shutdown people have reported.	2003-11-02 04:52:53 +00:00
Alexander Kabaev	9185a9f5e3	Remove now unused variable.	2003-11-01 15:04:50 +00:00
Alexander Kabaev	492c1e68fb	Temporarily undo parts of the stuct mount locking commit by jeff. It is unsafe to hold a mutex across vput/vrele calls. This will be redone when a better locking strategy is agreed upon. Discussed with: jeff	2003-11-01 05:51:54 +00:00
Alexander Kabaev	b792e03079	Do not bother walking mount point vnode list just to calculate the number of vnodes. Use precomputed mp->mnt_nvnodelistsize value instead.	2003-11-01 04:36:50 +00:00
Poul-Henning Kamp	8b285b9088	Remember to check the DE_WHITEOUT flag in the case where a cloned device is hidden by a devfs(8) rule. Spotted by: Adam Nowacki <ptnowak@bsk.vectranet.pl>	2003-10-20 15:08:10 +00:00
Poul-Henning Kamp	7e8766a940	When a driver successfully created a device on demand, we can directly pick up the DEVFS inode number from the dev_t and find our directory entry from that, we don't need to scan the directory to find it. This also solves an issue with on-demand devices in subdirectories. Submitted by: cognet	2003-10-20 07:04:09 +00:00
Maxime Henrion	6fb826df1c	Remove debug printf().	2003-10-19 14:33:00 +00:00
Poul-Henning Kamp	01758670e9	Initialize b_iooffset before calling strategy	2003-10-18 19:48:21 +00:00
Poul-Henning Kamp	2c18019f14	DuH! bp->b_iooffset (the spot on the disk), not bp->b_offset (the offset in the file)	2003-10-18 14:10:28 +00:00
Poul-Henning Kamp	f14676dc6f	Initialize b_offset before calling VOP_SPECSTRATEGY()	2003-10-18 11:08:33 +00:00
Poul-Henning Kamp	c87b01a0fd	Initialize b_offset before calling VOP_STRATEGY/VOP_SPECSTRATEGY. Remove various comments of KASSERTS and comments about B_PHYS which does not apply anymore.	2003-10-18 11:06:15 +00:00
Poul-Henning Kamp	583b92e328	Convert some if(bla) panic("foo") to KASSERTS to improve grep-ability.	2003-10-18 09:32:39 +00:00
Poul-Henning Kamp	0023f61848	Introduce a new optional memberfunction for cdevsw, fdopen() which passes the fdidx from VOP_OPEN down. This is for all I know the final API for this functionality, but the locking semantics for messing with the filedescriptor from the device driver are not settled at this time.	2003-10-15 20:00:59 +00:00
Bruce Evans	201e0377ca	Include <sys/mutex.h>. Don't depend on namespace pollution in <sys/vnode.h>. Fixed a nearby style bug. The include of vcoda.h used angle brackets and was not used.	2003-10-05 07:44:45 +00:00
Jeff Roberson	4ab2c8bd52	- Check the XLOCK prior to inspecting v_data.	2003-10-05 06:44:53 +00:00
Jeff Roberson	055cfed702	- Check XLOCK prior to accessing v_data.	2003-10-05 06:43:30 +00:00
Jeff Roberson	7bfaa956e8	- Don't cache_purge() in cd9660_reclaim. vclean() does it for us so this is redundant.	2003-10-05 02:45:36 +00:00
Jeff Roberson	9c695a2697	- Don't cache_purge() in *_reclaim routines. vclean() does it for us so this is redundant.	2003-10-05 02:43:30 +00:00
Alan Cox	6caf7e9fa4	Synchronize access to a vm page's valid field using the containing vm object's lock.	2003-10-04 23:37:38 +00:00
Jeff Roberson	0c31ea3b2d	- Make proper use of the mntvnode_mtx. We do not need the loop label because we do not drop the mntvnode_mtx. If this code had ever executed and hit the loop condition it would have spun forever.	2003-10-04 13:16:54 +00:00
Jeff Roberson	c13c04a426	- Acquire the vnode interlock prior to droping the mntvnode_mtx. This does not eliminate races where the vnode could be reclaimed and end up with a NULL v_data pointer but Giant is protecting us from that at the moment.	2003-10-04 12:52:37 +00:00
Alan Cox	10e9e2d1b9	Synchronize access to a page's valid field by using the lock from its containing object.	2003-10-04 09:20:00 +00:00
Jeff Roberson	8b5905a47d	- Remove the backtrace() call from the *_vinvalbuf() functions. Thanks to a stack trace supplied by phk, I now understand what's going on here. The check for VI_XLOCK stops us from calling vinvalbuf once the vnode has been partially torn down in vclean(). It is not clear that this would cause a problem. Document this in nfs_bio.c, which is where the other two filesystems copied this code from.	2003-10-04 08:51:50 +00:00
Jacques Vidrine	8b7358ca43	Introduce a uiomove_frombuf helper routine that handles computing and validating the offset within a given memory buffer before handing the real work off to uiomove(9). Use uiomove_frombuf in procfs to correct several issues with integer arithmetic that could result in underflows/overflows. As a side-effect, the code is significantly simplified. Add additional sanity checks when computing a memory allocation size in pfs_read. Submitted by: rwatson (original uiomove_frombuf -- bugs are mine :-) Reported by: Joost Pol <joost@pine.nl> (integer underflows/overflows)	2003-10-02 15:00:55 +00:00
Robert Watson	309cd88432	Add a new column to the procfs map to hold the name of the mapped file for vnode mappings. Note that this uses vn_fullpath() and may be somewhat unreliable, although not too unreliable for shared libraries. For non-vnode mappings, just print "-" for the field. Obtained from: TrustedBSD Projects Sponsored by: DARPA, AFRL, Network Associates Laboratories	2003-09-29 20:53:19 +00:00
Poul-Henning Kamp	25d6da1b5e	forgot to remove static declaration of fdesc_poll()	2003-09-27 12:34:41 +00:00
Poul-Henning Kamp	f1cf3def78	fdesc_poll() called seltrue() to do the default thing, this is pointlessly wrong when we have a default in vop_nopoll() which does the right thing.	2003-09-27 12:24:42 +00:00
Bruce Evans	67425e4678	Fixed some style bugs in previous commit. Mainly, forward-declare struct msdosfsmount so that this file has the same prerequisites as it used to. The new prerequistite was a meta-style bug. It required many style bugs (unsorted includes ...) elsewhere. Formatted prototypes in KNF. Resisted urge to sort all the prototypes, to minimise differences with NetBSD. (NetBSD has reformatted the prototypes but has not sorted them and still uses __P(()).)	2003-09-27 01:18:27 +00:00
Max Khon	c4f02a891f	- Support for multibyte charsets in LIBICONV. - CD9660_ICONV, NTFS_ICONV and MSDOSFS_ICONV kernel options (with corresponding modules). - kiconv(3) for loadable charset conversion tables support. Submitted by: Ryuichiro Imura <imura@ryu16.org>	2003-09-26 20:26:25 +00:00
Tim J. Robbins	aa808a7fa6	Allow the [, ], and = characters in non-8.3 filenames since they are allowed by Windows (ref: MS KB article 120138). XXX From my reading of the CIFS specification, it's not clear that clients need to validate filenames at all. PR: 57123 Submitted by: Paul Coucher MFC after: 1 month	2003-09-26 12:11:08 +00:00
Jeff Roberson	ce1fb23146	- Remove interlock protection around VI_XLOCK. The interlock is not sufficient to guarantee that this race is not hit. The XLOCK will likely have to be redesigned due to the way reference counting and mutexes work in FreeBSD. We currently can not be guaranteed that xlock was not set and cleared while we were blocked on the interlock while waiting to check for XLOCK. This would lead us to reference a vnode which was not the vnode we requested. - Add a backtrace() call inside of INVARIANTS in the hopes of finding out if this condition is ever hit. It should not, since we should be retaining a reference to the vnode in these cases. The reference would be sufficient to block recycling.	2003-09-19 23:37:49 +00:00
Tim J. Robbins	5f39b226ce	Move an overly verbose message under #ifdef CODA_VERBOSE.	2003-09-13 01:13:56 +00:00
Tim J. Robbins	e1d237ec05	Move an annoying printf() call that gets triggered every time an operation is interrupted (with ^C or ^Z) under CODA_VERBOSE.	2003-09-10 01:41:15 +00:00
Tim J. Robbins	d14e51c95c	Add support for the Coda 6.x venus<->kernel interface. This extends FIDs to be 128-bits wide and adds support for realms. Add a new CODA_COMPAT_5 option, which requests support for the old Coda 5.x interface instead of the new one. Create a new coda5.ko module that supports the 5.x interface, and make the existing coda.ko module use the new 6.x interface. These modules cannot both be loaded at the same time. Obtained from: Jan Harkes & the coda-6.0.2 distribution, NetBSD (drochner) (CODA_COMPAT_5 option).	2003-09-07 07:43:10 +00:00
Marcel Moolenaar	fccf82902d	The valid field in struct vm_page can be of type unsigned long when 32K pages are selected. In spec_getpages() change the printf format specifier and add an explicit cast so that we always print the field as a long type.	2003-08-28 01:52:14 +00:00
Alan Cox	49dc7ac17d	Use the requested page's object field instead of the vnode's. In some cases, the vnode's object field is not initialized leading to a NULL pointer dereference when the object is locked. Tested by: rwatson	2003-08-22 17:50:32 +00:00
Dag-Erling Smørgrav	134ce0f9cc	Add pfs_visible() checks to pfs_getattr() and pfs_getextattr(). This also fixes pfs_access() since it relies on VOP_GETATTR() which will call pfs_getattr(). This prevents jailed processes from discovering the existence, start time and ownership of processes outside the jail. PR: kern/48156	2003-08-19 10:26:41 +00:00
John Baldwin	d49ebea58c	Spell the name of the lock right in addition to getting the type right. Submitted by: Kim Culhan <kimc@w8hd.org>	2003-08-18 19:23:01 +00:00

1 2 3 4 5 ...

1557 Commits