freebsd-nq

Author	SHA1	Message	Date
Jason Evans	dd85920a4f	Add the VFS_AIO config option and leave it off by default. Unless the VFS_AIO option is specified, all aio-related syscalls return ENOSYS. The aio code is very fragile right now, and is unsuitable for default inclusion in a production shell box. Approved by: jkh	2000-02-23 07:44:25 +00:00
Jason Evans	b7592c7bea	Back out the previous spl change, since it opens a race window. Reviewed by: alfred, dillon, peter	2000-01-20 08:15:13 +00:00
Jason Evans	60ffb01993	Don't tsleep() while at splbio(). Correctly return EINPROGRESS from aio_error() even when an aio request is still in the socket queue. Submitted by: Adrian Chadd <adrian@bofh.co.uk>	2000-01-20 01:59:58 +00:00
Brian Feldman	f582ac0630	Fix vn_isdisk() usage to make AIO work on non-disk-files again, rather than just return ENOTBLK. PR: 16163 Submitted by: Adrian Chadd <adrian@FreeBSD.org>	2000-01-17 21:18:39 +00:00
Jason Evans	bfbbc4aa44	Add aio_waitcomplete(). Make aio work correctly for socket descriptors. Make gratuitous style(9) fixes (me, not the submitter) to make the aio code more readable. PR: kern/12053 Submitted by: Chris Sedore <cmsedore@maxwell.syr.edu>	2000-01-14 02:53:29 +00:00
Poul-Henning Kamp	ba4ad1fcea	Give vn_isdisk() a second argument where it can return a suitable errno. Suggested by: bde	2000-01-10 12:04:27 +00:00
Poul-Henning Kamp	38224dcd59	Convert various pieces of code to use vn_isdisk() rather than checking for vp->v_type == VBLK. In ccd: we don't need to call VOP_GETATTR to find the type of a vnode. Reviewed by: sos	1999-11-22 10:33:55 +00:00
Poul-Henning Kamp	008626c39e	Simplify and de-bogotify check for raw disk.	1999-11-07 13:09:09 +00:00
Poul-Henning Kamp	02c58685a4	Change useracc() and kernacc() to use VM_PROT_{READ\|WRITE\|EXECUTE} for the "rw" argument, rather than hijacking B_{READ\|WRITE}. Fix two bugs (physio & cam) resulting by the confusion caused by this. Submitted by: Tor.Egge@fast.no Reviewed by: alc, ken (partly)	1999-10-30 06:32:05 +00:00
Peter Wemm	d1f088dab5	Trim unused options (or #ifdef for undoc options). Submitted by: phk	1999-10-11 15:19:12 +00:00
Brian Feldman	13ccadd4b0	This is what was "fdfix2.patch," a fix for fd sharing. It's pretty far-reaching in fd-land, so you'll want to consult the code for changes. The biggest change is that now, you don't use fp->f_ops->fo_foo(fp, bar) but instead fo_foo(fp, bar), which increments and decrements the fp refcount upon entry and exit. Two new calls, fhold() and fdrop(), are provided. Each does what it seems like it should, and if fdrop() brings the refcount to zero, the fd is freed as well. Thanks to peter ("to hell with it, it looks ok to me.") for his review. Thanks to msmith for keeping me from putting locks everywhere :) Reviewed by: peter	1999-09-19 17:00:25 +00:00
Peter Wemm	c3aac50f28	$Id$ -> $FreeBSD$	1999-08-28 01:08:13 +00:00
Poul-Henning Kamp	49ff4debd3	Spring cleaning around strategy and disklabels/slices: Introduce BUF_STRATEGY(struct buf *, int flag) macro, and use it throughout. please see comment in sys/conf.h about the flag argument. Remove strategy argument from all the diskslice/label/bad144 implementations, it should be found from the dev_t. Remove bogus and unused strategy1 routines. Remove open/close arguments from dssize(). Pick them up from dev_t. Remove unused and unfinished setgeom support from diskslice/label/bad144 code.	1999-08-14 11:40:51 +00:00
Poul-Henning Kamp	4d4f932326	s/v_specinfo/v_rdev/	1999-08-13 10:10:12 +00:00
Poul-Henning Kamp	0ef1c82630	Decommision miscfs/specfs/specdev.h. Most of it goes into <sys/conf.h>, a few lines into <sys/vnode.h>. Add a few fields to struct specinfo, paving the way for the fun part.	1999-08-08 18:43:05 +00:00
Peter Wemm	9c8b8baa38	Slight reorganization of kernel thread/process creation. Instead of using SYSINIT_KT() etc (which is a static, compile-time procedure), use a NetBSD-style kthread_create() interface. kproc_start is still available as a SYSINIT() hook. This allowed simplification of chunks of the sysinit code in the process. This kthread_create() is our old kproc_start internals, with the SYSINIT_KT fork hooks grafted in and tweaked to work the same as the NetBSD one. One thing I'd like to do shortly is get rid of nfsiod as a user initiated process. It makes sense for the nfs client code to create them on the fly as needed up to a user settable limit. This means that nfsiod doesn't need to be in /sbin and is always "available". This is a fair bit easier to do outside of the SYSINIT_KT() framework.	1999-07-01 13:21:46 +00:00
Peter Wemm	df8abd0bb9	Slight tweak to fork1() calling conventions. Add a third argument so the caller can easily find the child proc struct. fork(), rfork() etc syscalls set p->p_retval[] themselves. Simplify the SYSINIT_KT() code and other kernel thread creators to not need to use pfind() to find the child based on the pid. While here, partly tidy up some of the fork1() code for RF_SIGSHARE etc.	1999-06-30 15:33:41 +00:00
Kirk McKusick	67812eacd7	Convert buffer locking from using the B_BUSY and B_WANTED flags to using lockmgr locks. This commit should be functionally equivalent to the old semantics. That is, all buffer locking is done with LK_EXCLUSIVE requests. Changes to take advantage of LK_SHARED and LK_RECURSIVE will be done in future commits.	1999-06-26 02:47:16 +00:00
Poul-Henning Kamp	6fcd8a7c93	Introduce the makebdev() function, it does the same as the makedev() function for now, but that will change.	1999-06-01 18:56:26 +00:00
Poul-Henning Kamp	0a346dab99	major(something) can never become NODEV.	1999-05-09 13:13:52 +00:00
Poul-Henning Kamp	4be2eb8c49	I got tired of seeing all the cdevsw[major(foo)] all over the place. Made a new (inline) function devsw(dev_t dev) and substituted it. Changed to the BDEV variant to this format as well: bdevsw(dev_t dev) DEVFS will eventually benefit from this change too.	1999-05-08 06:40:31 +00:00
Poul-Henning Kamp	b0eeea2042	remove b_proc from struct buf, it's (now) unused. Reviewed by: dillon, bde	1999-05-06 20:00:34 +00:00
Peter Wemm	d5558c001a	Fix up a few easy 'assignment used as truth value' and 'suggest parens around && within \|\|' type warnings. I'm pretty sure I have not masked any problems here, I've committed real problem fixes seperately.	1999-05-06 18:44:42 +00:00
Luoqi Chen	5206bca10a	Enable vmspace sharing on SMP. Major changes are, - %fs register is added to trapframe and saved/restored upon kernel entry/exit. - Per-cpu pages are no longer mapped at the same virtual address. - Each cpu now has a separate gdt selector table. A new segment selector is added to point to per-cpu pages, per-cpu global variables are now accessed through this new selector (%fs). The selectors in gdt table are rearranged for cache line optimization. - fask_vfork is now on as default for both UP and SMP. - Some aio code cleanup. Reviewed by: Alan Cox <alc@cs.rice.edu> John Dyson <dyson@iquest.net> Julian Elischer <julian@whistel.com> Bruce Evans <bde@zeta.org.au> David Greenman <dg@root.com>	1999-04-28 01:04:33 +00:00
Dmitrij Tejblum	8fe387ab84	Add standard padding argument to pread and pwrite syscall. That should make them NetBSD compatible. Add parameter to fo_read and fo_write. (The only flag FOF_OFFSET mean that the offset is set in the struct uio). Factor out some common code from read/pread/write/pwrite syscalls.	1999-04-04 21:41:28 +00:00
Bruce Evans	a5c9bce777	Added a used #include (don't depend on "vnode_if.h" including <sys/buf.h>).	1999-02-25 15:54:06 +00:00
Luoqi Chen	b1028ad122	Hide access to vmspace:vm_pmap with inline function vmspace_pmap(). This is the preparation step for moving pmap storage out of vmspace proper. Reviewed by: Alan Cox <alc@cs.rice.edu> Matthew Dillion <dillon@apollo.backplane.com>	1999-02-19 14:25:37 +00:00
Matthew Dillon	bc81493155	More const fixes for -Wall, -Wcast-qual	1999-01-29 23:18:50 +00:00
Bruce Evans	9e26dd2a54	Removed bogus casts to c_caddr_t. This is part of terminating c_caddr_t with extreme prejudice. Here the original casts to caddr_t were to support K&R compilers (or missing prototypes), but the relevant source files require an ANSI compiler.	1999-01-29 08:29:05 +00:00
Matthew Dillon	697457a133	Fix warnings related to -Wall -Wcast-qual	1999-01-28 17:32:05 +00:00
Matthew Dillon	8aef171243	Fix warnings in preparation for adding -Wall -Wcast-qual to the kernel compile	1999-01-28 00:57:57 +00:00
Matthew Dillon	d254af07a1	Fix warnings in preparation for adding -Wall -Wcast-qual to the kernel compile	1999-01-27 21:50:00 +00:00
Matthew Dillon	1c7c3c6a86	This is a rather large commit that encompasses the new swapper, changes to the VM system to support the new swapper, VM bug fixes, several VM optimizations, and some additional revamping of the VM code. The specific bug fixes will be documented with additional forced commits. This commit is somewhat rough in regards to code cleanup issues. Reviewed by: "John S. Dyson" <root@dyson.iquest.net>, "David Greenman" <dg@root.com>	1999-01-21 08:29:12 +00:00
Dag-Erling Smørgrav	e3b3ba2d79	Wrap two macros into do { ... } while (0), and fix the way they're used in the kernel. Reviewed by: bde	1998-12-15 17:38:33 +00:00
Tor Egge	18830dba83	Don't forget to update the pmap associated with aio daemons when adding new page directory entries for a growing kernel virtual address space.	1998-11-27 01:14:21 +00:00
Poul-Henning Kamp	f5ef029e92	Nitpicking and dusting performed on a train. Removes trivial warnings about unused variables, labels and other lint.	1998-10-25 17:44:59 +00:00
Bruce Evans	2d2f8ae7ad	Fixed nonsense overflow checking (checking that a long variable is less than INT_MAX after it has possibly overflowed). Removed an unused variable and its associated 2 style bugs. Removed unused includes.	1998-08-17 17:28:10 +00:00
Bruce Evans	30166fabb6	Cast between longs and pointers via intptr_t. There shouldn't be nearly so many casts here. Casting an pointer that was an integer back to an integer just to compare it with -1 is bad, and casting it back just to compare it with NULL is just wrong.	1998-07-15 06:51:14 +00:00
Julian Elischer	596f8506ad	fix braino from yesterdays' megacommit Not sure of the result of it.. (may or may not effect anything) but it's fixed now. (found by: comparing what cvsup sent back to me with what I tested..)	1998-07-05 20:33:18 +00:00
Julian Elischer	f7ea2f55d1	There is no such thing any more as "struct bdevsw". There is only cdevsw (which should be renamed in a later edit to deventry or something). cdevsw contains the union of what were in both bdevsw an cdevsw entries. The bdevsw[] table stiff exists and is a second pointer to the cdevsw entry of the device. it's major is in d_bmaj rather than d_maj. some cleanup still to happen (e.g. dsopen now gets two pointers to the same cdevsw struct instead of one to a bdevsw and one to a cdevsw). rawread()/rawwrite() went away as part of this though it's not strictly the same patch, just that it involves all the same lines in the drivers. cdroms no longer have write() entries (they did have rawwrite (?)). tapes no longer have support for bdev operations. Reviewed by: Eivind Eklund and Mike Smith Changes suggested by eivind.	1998-07-04 22:30:26 +00:00
Doug Rabson	8c12612cf6	64bit fixes: don't cast pointers to int.	1998-06-10 10:31:08 +00:00
Dag-Erling Smørgrav	dc73342347	Seventy-odd "its" / "it's" typos in comments fixed as per kern/6108.	1998-04-17 22:37:19 +00:00
Poul-Henning Kamp	227ee8a188	Eradicate the variable "time" from the kernel, using various measures. "time" wasn't a atomic variable, so splfoo() protection were needed around any access to it, unless you just wanted the seconds part. Most uses of time.tv_sec now uses the new variable time_second instead. gettime() changed to getmicrotime(0. Remove a couple of unneeded splfoo() protections, the new getmicrotime() is atomic, (until Bruce sets a breakpoint in it). A couple of places needed random data, so use read_random() instead of mucking about with time which isn't random. Add a new nfs_curusec() function. Mark a couple of bogosities involving the now disappeard time variable. Update ffs_update() to avoid the weird "== &time" checks, by fixing the one remaining call that passwd &time as args. Change profiling in ncr.c to use ticks instead of time. Resolution is the same. Add new function "tvtohz()" to avoid the bogus "splfoo(), add time, call hzto() which subtracts time" sequences. Reviewed by: bde	1998-03-30 09:56:58 +00:00
Peter Dufault	8a6472b723	Finish _POSIX_PRIORITY_SCHEDULING. Needs P1003_1B and _KPOSIX_PRIORITY_SCHEDULING options to work. Changes: Change all "posix4" to "p1003_1b". Misnamed files are left as "posix4" until I'm told if I can simply delete them and add new ones; Add _POSIX_PRIORITY_SCHEDULING system calls for FreeBSD and Linux; Add man pages for _POSIX_PRIORITY_SCHEDULING system calls; Add options to LINT; Minor fixes to P1003_1B code during testing.	1998-03-28 11:51:01 +00:00
Bruce Evans	08637435f2	Moved some #includes from <sys/param.h> nearer to where they are actually used.	1998-03-28 10:33:27 +00:00
Bruce Evans	57518a4e83	Removed a stale comment and staler code.	1998-02-25 06:30:15 +00:00
Eivind Eklund	303b270b0a	Staticize.	1998-02-09 06:11:36 +00:00
Eivind Eklund	0b08f5f737	Back out DIAGNOSTIC changes.	1998-02-06 12:14:30 +00:00
Eivind Eklund	47cfdb166d	Turn DIAGNOSTIC into a new-style option.	1998-02-04 22:34:03 +00:00
John Dyson	648899413d	Quiet some lint.	1997-12-10 04:14:23 +00:00
John Dyson	78922e413c	Correct prototypes to match POSIX. Correct return code for aio_cancel. Submitted by: Alex Nash <nash@mcs.com>	1997-12-08 02:18:25 +00:00
John Dyson	e499ed6f86	Fix a problem when creating a new kernel thread. In some cases, aio_read or aio_write can return the pid of the new thread. This is due to the way that return values from system calls being passed by side-effect in the proc structure now. This commit fixes the problem with aio_read and aio_write.	1997-12-01 18:41:08 +00:00
John Dyson	11783b142b	Fix error handling for VCHR type I/O. Also, fix another spl problem, and remove alot of overly verbose debugging statements. ioproclist { int aioprocflags; /* AIO proc flags / TAILQ_ENTRY(aioproclist) list; / List of processes / struct proc aioproc; /* The AIO thread / TAILQ_HEAD (,aiocblist) jobtorun; / suggested job to run / }; / * data-structure for lio signal management / struct aio_liojob { int lioj_flags; int lioj_buffer_count; int lioj_buffer_finished_count; int lioj_queue_count; int lioj_queue_finished_count; struct sigevent lioj_signal; / signal on all I/O done / TAILQ_ENTRY (aio_liojob) lioj_list; struct kaioinfo lioj_ki; }; #define LIOJ_SIGNAL 0x1 /* signal on all done (lio) / #define LIOJ_SIGNAL_POSTED 0x2 / signal has been posted / / * per process aio data structure / struct kaioinfo { int kaio_flags; / per process kaio flags / int kaio_maxactive_count; / maximum number of AIOs / int kaio_active_count; / number of currently used AIOs / int kaio_qallowed_count; / maxiumu size of AIO queue / int kaio_queue_count; / size of AIO queue / int kaio_ballowed_count; / maximum number of buffers / int kaio_queue_finished_count; / number of daemon jobs finished / int kaio_buffer_count; / number of physio buffers / int kaio_buffer_finished_count; / count of I/O done / struct proc kaio_p; /* process that uses this kaio block / TAILQ_HEAD (,aio_liojob) kaio_liojoblist; / list of lio jobs / TAILQ_HEAD (,aiocblist) kaio_jobqueue; / job queue for process / TAILQ_HEAD (,aiocblist) kaio_jobdone; / done queue for process / TAILQ_HEAD (,aiocblist) kaio_bufqueue; / buffer job queue for process / TAILQ_HEAD (,aiocblist) kaio_bufdone; / buffer done queue for process / }; #define KAIO_RUNDOWN 0x1 / process is being run down / #define KAIO_WAKEUP 0x2 / wakeup process when there is a significant event / TAILQ_HEAD (,aioproclist) aio_freeproc, aio_activeproc; TAILQ_HEAD(,aiocblist) aio_jobs; / Async job list / TAILQ_HEAD(,aiocblist) aio_bufjobs; / Phys I/O job list / TAILQ_HEAD(,aiocblist) aio_freejobs; / Pool of free jobs / static void aio_init_aioinfo(struct proc p) ; static void aio_onceonly(void ) ; static int aio_free_entry(struct aiocblist aiocbe); static void aio_process(struct aiocblist aiocbe); static int aio_newproc(void) ; static int aio_aqueue(struct proc p, struct aiocb job, int type) ; static void aio_physwakeup(struct buf bp); static int aio_fphysio(struct proc p, struct aiocblist aiocbe, int type); static int aio_qphysio(struct proc p, struct aiocblist iocb); static void aio_daemon(void uproc); SYSINIT(aio, SI_SUB_VFS, SI_ORDER_ANY, aio_onceonly, NULL); static vm_zone_t kaio_zone=0, aiop_zone=0, aiocb_zone=0, aiol_zone=0, aiolio_zone=0; / * Single AIOD vmspace shared amongst all of them / static struct vmspace aiovmspace = NULL; /* * Startup initialization / void aio_onceonly(void na) { TAILQ_INIT(&aio_freeproc); TAILQ_INIT(&aio_activeproc); TAILQ_INIT(&aio_jobs); TAILQ_INIT(&aio_bufjobs); TAILQ_INIT(&aio_freejobs); kaio_zone = zinit("AIO", sizeof (struct kaioinfo), 0, 0, 1); aiop_zone = zinit("AIOP", sizeof (struct aioproclist), 0, 0, 1); aiocb_zone = zinit("AIOCB", sizeof (struct aiocblist), 0, 0, 1); aiol_zone = zinit("AIOL", AIO_LISTIO_MAX * sizeof (int), 0, 0, 1); aiolio_zone = zinit("AIOLIO", AIO_LISTIO_MAX * sizeof (struct aio_liojob), 0, 0, 1); aiod_timeout = AIOD_TIMEOUT_DEFAULT; aiod_lifetime = AIOD_LIFETIME_DEFAULT; jobrefid = 1; } /* * Init the per-process aioinfo structure. * The aioinfo limits are set per-process for user limit (resource) management. / void aio_init_aioinfo(struct proc p) { struct kaioinfo *ki; if (p->p_aioinfo == NULL) { ki = zalloc(kaio_zone); p->p_aioinfo = ki	1997-12-01 07:01:45 +00:00
John Dyson	f4f0ecefab	Correct a last minute code change. Would have been an infinite loop under certain error conditions. Submitted by: pst@shockwave.com	1997-11-30 23:21:08 +00:00
John Dyson	c5efdcbdec	Fix an spl nit.	1997-11-30 21:47:36 +00:00
John Dyson	84af4da65a	Finish up the vast majority of the AIO/LIO functionality. Proper signal support was missing in the previous version of the AIO code. More tunables added, and very efficient support for VCHR files has been added. Kernel threads are not used for VCHR files, all work for such files is done for the requesting process directly. Some attempt has been made to charge the requesting process for resource utilization, but more work is needed. aio_fsync is still missing (but the original fsync system call can be used for now.) aio_cancel is essentially a noop, but that is okay per POSIX. More aio_cancel functionality can be added later, if it is found to be needed. The functions implemented include: aio_read, aio_write, lio_listio, aio_error, aio_return, aio_cancel, aio_suspend. The code has been implemented to support the POSIX spec 1003.1b (formerly known as POSIX 1003.4 spec) features of the above. The async I/O features are truly async, with the VCHR mode of operation being essentially the same as physio (for appropriate files) for maximum efficiency. This code also supports the signal capability, is highly tunable, allowing management of resource usage, and has been written to allow a per process usage quota. Both the O'Reilly POSIX.4 book and the actual POSIX 1003.1b document were the reference specs used. Any filedescriptor can be used with these new system calls. I know of no exceptions where these system calls will not work. (TTY's will also probably work.)	1997-11-30 04:36:31 +00:00
John Dyson	f4feb04e1f	Disable the VCHR optimization for AIO until I have implemented it. Just in case anyone wants to play with the POSIX AIO/LIO stuff. (As it is, it should work with ANY vnode, on UP systems only, for now.)	1997-11-29 02:57:46 +00:00
John Dyson	fd3bf77574	Fix and complete the AIO syscalls. There are some performance enhancements coming up soon, but the code is functional. Docs will be forthcoming.	1997-11-29 01:33:10 +00:00
Bruce Evans	fdebd4f0fd	Get locking stuff by #including <sys/lock.h> instead of <sys/vnode.h>.	1997-11-18 10:02:40 +00:00
Poul-Henning Kamp	4a11ca4e29	Remove a bunch of variables which were unused both in GENERIC and LINT. Found by: -Wunused	1997-11-07 08:53:44 +00:00
Poul-Henning Kamp	cb226aaa62	Move the "retval" (3rd) parameter from all syscall functions and put it in struct proc instead. This fixes a boatload of compiler warning, and removes a lot of cruft from the sources. I have not removed the /ARGSUSED/, they will require some looking at. libkvm, ps and other userland struct proc frobbing programs will need recompiled.	1997-11-06 19:29:57 +00:00
Poul-Henning Kamp	a1c995b626	Last major round (Unless Bruce thinks of somthing :-) of malloc changes. Distribute all but the most fundamental malloc types. This time I also remembered the trick to making things static: Put "static" in front of them. A couple of finer points by: bde	1997-10-12 20:26:33 +00:00
Poul-Henning Kamp	55166637cd	Distribute and statizice a lot of the malloc M_* types. Substantial input from: bde	1997-10-11 18:31:40 +00:00
John Dyson	c486068695	Make the target for the number of AIO daemons work.	1997-10-11 01:07:03 +00:00
John Dyson	a624e84fff	Major cleanup and debugging of the new AIO/LIO code. The LIO code is now corrected. New tunables/instrumentation added. The code is now likely "good enough to use." I will add the userland support soon. The "high performance" mode for raw devices is still missing, and will be added next. POSIX system calls that now appear to work: aio_cancel, aio_error, aio_read, aio_return, aio_suspend, aio_write, lio_listio. Missing, but to be added soon: aio_fsync.	1997-10-09 04:14:41 +00:00
Bruce Evans	e4ba6a82b0	Removed unused #includes.	1997-09-02 20:06:59 +00:00
John Dyson	5aaef07c50	Clean up some lint associated with the AIO code.	1997-07-17 04:49:43 +00:00
John Dyson	2244ea07dc	This is an upgrade so that the kernel supports the AIO calls from POSIX.4. Additionally, there is some initial code that supports LIO. This code supports AIO/LIO for all types of file descriptors, with few if any restrictions. There will be a followup very soon that will support significantly more efficient operation for VCHR type files (raw.) This code is also dependent on some kernel features that don't work under SMP yet. After I commit the changes to the kernel to support proper address space sharing on SMP, this code will also work under SMP.	1997-07-06 02:40:43 +00:00
John Dyson	ee877a356c	Add initial AIO/LIO kernel thread support files. This is preliminary, and further features will be added.	1997-06-16 00:27:26 +00:00

1 2 3 4 5

219 Commits