freebsd-nq

Author	SHA1	Message	Date
Peter Wemm	c828c7b784	Fix warnings - make kevent args in comment match those in syscalls.master. Deal with consts.	2000-07-28 22:32:25 +00:00
Peter Wemm	b31ae1adc5	Fix a warning that has been annoying me for some time: "kern/sys_generic.c:358: warning: cast discards qualifiers from pointer target type" The idea for using the uintptr_t intermediate cast for de-constifying a pointer was hinted at by bde some time ago.	2000-07-28 22:17:42 +00:00
Robert Watson	fc3345a4a7	o Modify extattr_{set,get}() syscalls so that partial reads and writes with an error condition such as EINTR, EWOULDBLOCK, and ERESTART, are reported to the application, not silently concealed. This behavior was copied from the {read,write}v() syscalls, and is appropriate there but not here. o Correct a bug in extattr_delete() wherein the LOCKLEAF flag is passed to the wrong argument in namei(), resulting in some unexpected errors during name resolution, and passing in an unlocked vnode. Obtained from: TrustedBSD Project	2000-07-28 19:52:38 +00:00
Jonathan Lemon	ab2adc20f2	Have kevent() automatically restart if interrupted by a signal. If this is not desired, then the user can register an EV_SIGNAL filter to explicitly catch a signal event. Change requested by: jayanth, ps, peter "Why is kevent non-restartable after a signal?"	2000-07-27 23:06:14 +00:00
Brian Feldman	3c89e357f0	Distinguish between whether ktraceing was enabled before an IO operation or after it. If the ktrace operation was enabled while the process was blocked doing IO, the race would allow it to pass down invalid (uninitialized) data and panic later down the call stack.	2000-07-27 03:45:18 +00:00
Robert Watson	3ce7b7aa84	o Lock vnode before calling extattr_* VOP's, and modify vnode spec to allow for that. o Remember to call NDFREE() if exiting as a result of a failed vn_start_write() when snapshotting. Reviewed by: mckusick Obtained from: TrustedBSD Project	2000-07-26 20:29:20 +00:00
Kirk McKusick	54e53ebda7	Now that buffer locks can be recursive, we need to delete the panics that complain about them. Obtained from: Brian Fundakowski Feldman <green@FreeBSD.org>	2000-07-25 18:28:46 +00:00
Kirk McKusick	aec3bbe11c	Do not need vrele(nd.ni_vp) as that is done by NDFREE(&nd, 0); Submitted by: Peter Holm <pho@freebsd.org>	2000-07-25 05:38:54 +00:00
Robert Watson	e2e45aa8a0	o Add missing function return types from capability syscall call stubs, fix compiler warning. Submitted by: jake	2000-07-25 03:37:36 +00:00
Kirk McKusick	9b97113391	This patch corrects the first round of panics and hangs reported with the new snapshot code. Update addaliasu to correctly implement the semantics of the old checkalias function. When a device vnode first comes into existence, check to see if an anonymous vnode for the same device was created at boot time by bdevvp(). If so, adopt the bdevvp vnode rather than creating a new vnode for the device. This corrects a problem which caused the kernel to panic when taking a snapshot of the root filesystem. Change the calling convention of vn_write_suspend_wait() to be the same as vn_start_write(). Split out softdep_flushworklist() from softdep_flushfiles() so that it can be used to clear the work queue when suspending filesystem operations. Access to buffers becomes recursive so that snapshots can recursively traverse their indirect blocks using ffs_copyonwrite() when checking for the need for copy on write when flushing one of their own indirect blocks. This eliminates a deadlock between the syncer daemon and a process taking a snapshot. Ensure that softdep_process_worklist() can never block because of a snapshot being taken. This eliminates a problem with buffer starvation. Cleanup change in ffs_sync() which did not synchronously wait when MNT_WAIT was specified. The result was an unclean filesystem panic when doing forcible unmount with heavy filesystem I/O in progress. Return a zero'ed block when reading a block that was not in use at the time that a snapshot was taken. Normally, these blocks should never be read. However, the readahead code will occasionally read them which can cause unexpected behavior. Clean up the debugging code that ensures that no blocks be written on a filesystem while it is suspended. Snapshots must explicitly label the blocks that they are writing during the suspension so that they do not cause a `write on suspended filesystem' panic. Reorganize ffs_copyonwrite() to eliminate a deadlock and also to prevent a race condition that would permit the same block to be copied twice. This change eliminates an unexpected soft updates inconsistency in fsck caused by the double allocation. Use bqrelse rather than brelse for buffers that will be needed soon again by the snapshot code. This improves snapshot performance.	2000-07-24 05:28:33 +00:00
Brian Feldman	55af4c7d94	Using an atomic operation here won't help if nobody else uses them (for this). Use the simple_lock() on v_interlock like elsewhere.	2000-07-23 22:19:49 +00:00
Brian Feldman	25ead03462	Solve the problem where it is possible to get the kernel stuck in a loop down in pmap_init_pt(). A subtraction causes the number of pages to become negative, that was assigned to an unsigned variable, and there is a lot of iteration. The bug is due to the ELF image activator not properly checking for its files being the correct size as specified by the ELF header. The solution is to check that the header doesn't ask for part of a file when that part of the file doesn't exist. Make sure to set VEXEC at the proper times to make the executables immutable (remove race conditions). Also, the ELF format specifies header entries that allow embedding of other executables (hence how ld-elf.so.1 gets loaded, but not the same as loading shared libraries), so those executables need to be set VEXEC, too, so they're immutable. Reviewed by: peter	2000-07-23 06:49:46 +00:00
Alfred Perlstein	f408896444	only allow accept filter modifications on listening sockets Submitted by: ps	2000-07-20 12:17:17 +00:00
Alfred Perlstein	85f5e7f098	disallow unload until we do proper refcounting	2000-07-20 12:12:41 +00:00
Jonathan Lemon	2ba03123c5	Fix a bug which would cause some knotes to get lost when two kqueues were being used in a process at the same time. Test case provided by: Chris Peiffer <peifferc@CS.Stanford.EDU>	2000-07-18 21:41:47 +00:00
Jonathan Lemon	a8e65b915e	Simplify kqueue API slightly. Discussed on: -arch	2000-07-18 19:31:52 +00:00
Peter Wemm	f03c9f90d1	Patch up some bogons in the resource_find() vs resource_find_hard() interfaces. The original resource_find() returned a pointer to an internal resource table entry. resource_find_hard() dereferences the actual passed in value (oops!) - effectively trashing random memory due to the pointer being passed in with a random initial value. Submitted by: bde	2000-07-18 06:08:27 +00:00
Andrzej Bialecki	bd3cdc3105	These patches implement dynamic sysctls. It's possible now to add and remove sysctl oids at will during runtime - they don't rely on linker sets. Also, the node oids can be referenced by more than one kernel user, which means that it's possible to create partially overlapping trees. Add sysctl contexts to help programmers manage multiple dynamic oids in convenient way. Please see the manpages for detailed discussion, and example module for typical use. This work is based on ideas and code snippets coming from many people, among them: Arun Sharma, Jonathan Lemon, Doug Rabson, Brian Feldman, Kelly Yancey, Poul-Henning Kamp and others. I'd like to specially thank Brian Feldman for detailed review and style fixes. PR: kern/16928 Reviewed by: dfr, green, phk	2000-07-15 10:26:04 +00:00
Alfred Perlstein	af0e6bcdf0	Make mbstat.m_mtypes separate and viewable via sysctl, also expand the size from short to ulong Submitted by: Ian Dowse <iedowse@maths.tcd.ie> PR: kern/19809	2000-07-15 06:02:48 +00:00
Paul Saab	88f675ba30	Change the way NMI's are handled. Before, if DDB was enabled and a NMI occurred, you could type continue in DDB and the kernel would not attempt to detect what type of NMI was received. Now we check for the type of NMI first and then go to DDB if it is enabled. This will solve the problem with having DDB enabled and getting an NMI due to some possibly bad error and being able to continue the operation of the kernel when you really want to panic and know what happened. Submitted by: jhb	2000-07-14 11:49:44 +00:00
Robert Watson	e8483a05a6	o Commit two of two, introducing __cap_{get,set}_{fd,file} syscalls to modify capability sets on files. Obtained from: TrustedBSD Project	2000-07-13 20:38:52 +00:00
Robert Watson	92eebb8a9b	o Introduce syscall prototypes, stubs for __cap_{get,set}_{fd,file}, syscalls to manage capability sets on files. First of two commits. Obtained from: TrustedBSD Project	2000-07-13 20:31:24 +00:00
John Baldwin	9c386f6b7d	For infinite timeouts, set both the tv_sec and tv_usec fields to zero in poll() and select(). Noticed by: Wesley Morgan <morganw@chemicals.tacorp.com>	2000-07-13 02:12:25 +00:00
John Baldwin	4da144c091	Fix a very obscure bug in select() and poll() where the timeout would never expire if poll() or select() was called before the system had been in multiuser for 1 second. This was caused by only checking to see if tv_sec was zero rather than checking both tv_sec and tv_usec.	2000-07-12 22:46:40 +00:00
Jun-ichiro itojun Hagino	f38211642f	remove m_pulldown statistics, which is highly experimental and does not belong to *bsd-merged tree	2000-07-12 16:39:13 +00:00
Kirk McKusick	f2a2857bb3	Add snapshots to the fast filesystem. Most of the changes support the gating of system calls that cause modifications to the underlying filesystem. The gating can be enabled by any filesystem that needs to consistently suspend operations by adding the vop_stdgetwritemount to their set of vnops. Once gating is enabled, the function vfs_write_suspend stops all new write operations to a filesystem, allows any filesystem modifying system calls already in progress to complete, then sync's the filesystem to disk and returns. The function vfs_write_resume allows the suspended write operations to begin again. Gating is not added by default for all filesystems as for SMP systems it adds two extra locks to such critical kernel paths as the write system call. Thus, gating should only be added as needed. Details on the use and current status of snapshots in FFS can be found in /sys/ufs/ffs/README.snapshot so for brevity and timeliness is not included here. Unless and until you create a snapshot file, these changes should have no effect on your system (famous last words).	2000-07-11 22:07:57 +00:00
Boris Popov	2ff087318a	Correct SYSINIT execution order in the case when KLD contains more than one SYSINIT with the same 'subsystem' id and different 'order' id. Reviewed by: peter	2000-07-09 23:58:56 +00:00
Brian Feldman	7ceba2d755	Remove two micro-pessimizations I made. Bruce is teaching me well :) KTRPOINT(p, KTR_GENIO) is more uncommon than error == 0, so it should be first in the && statement.	2000-07-07 22:11:37 +00:00
Brian Feldman	9d1cfdce2a	Change that &@!$# UIO_READ to be UIO_WRITE. I tested the ktrace stuff, but somehow... pass the pointy hat, again!	2000-07-07 21:52:15 +00:00
Boris Popov	3660ebc2c0	Fix support for more than 256 simultaneous mounts. Theoretical limit is 2^16 mounts per fs type. Reported by: Troy Arie Cobb <tcobb@staff.circle.net> via phk Reviewed by: bde	2000-07-07 14:01:08 +00:00
John Baldwin	9701cd40b4	Support for unsigned integer and long sysctl variables. Update the SYSCTL_LONG macro to be consistent with other integer sysctl variables and require an initial value instead of assuming 0. Update several sysctl variables to use the unsigned types. PR: 15251 Submitted by: Kelly Yancey <kbyanc@posi.net>	2000-07-05 07:46:41 +00:00
Warner Losh	5d10777c46	End two weeks of on and off debugging. Fix the crash on the Nth insertion of a CF card, for random values of N > 1. With these fixes, I've been able to do 100 insert/remove of the cards w/o a crash with lots of system activity going on that in the past would help trigger the crash. The problem: FreeBSD creates dev_t's on the fly as they are needed and never destroys them. These dev_t's point to a struct disk that is used for housekeeping on the disk. When a device goes away, the struct disk pointer becomes a dangling pointer. Sometimes when the device comes back, the pointer will point to the new struct disk (in which case the insertion will work). Other times it won't (especially if any length of time has passed, since it is dependent on memory returned from malloc). The Fix: There is one of these dev_t's that is always correct. The device for the WHOLE_DISK_SLICE is always right. It gets set at create_disk() time. So, the fix is to spend a little CPU time and lookup the WHOLE_DISK_SLICE dev_t and use the si_disk from that in preference to the one that's in the device asking to do the I/O. In addition, we change the test of si_disk == NULL meaning that the dev needed to inherit properties from the pdev to dev->si_disk != pdev->si_disk. This test is a little stronger than the previous test, but can sometimes be fooled into not inheriting. However, the results of this fooling are that the old values will be used, which will generally always be the same as before. si_drv[12] are the only values that are copied that might pose a problem. They tend to change as the si_disk field would change, so it is a hole, but it is a small hole. One could correctly argue that one should replace much of this code with something much much better. I would be on the pro side of that argument. Reviewed by: phk (who also ported the original patch to current) Sponsored by: Timing Solutions	2000-07-05 06:01:33 +00:00
Jun-ichiro itojun Hagino	686cdd19b1	sync with kame tree as of july00. tons of bug fixes/improvements. API changes: - additional IPv6 ioctls - IPsec PF_KEY API was changed, it is mandatory to upgrade setkey(8). (also syntax change)	2000-07-04 16:35:15 +00:00
Poul-Henning Kamp	77978ab8bc	Previous commit changing SYSCTL_HANDLER_ARGS violated KNF. Pointed out by: bde	2000-07-04 11:25:35 +00:00
Kirk McKusick	c904bbbdd8	Simplify and rationalise the management of the vnode free list (preparing the code to add snapshots).	2000-07-04 04:32:40 +00:00
Kirk McKusick	e6796b67d9	Move the truncation code out of vn_open and into the open system call after the acquisition of any advisory locks. This fix corrects a case in which a process tries to open a file with a non-blocking exclusive lock. Even if it fails to get the lock it would still truncate the file even though its open failed. With this change, the truncation is done only after the lock is successfully acquired. Obtained from: BSD/OS	2000-07-04 03:34:11 +00:00
Kirk McKusick	3764219663	If a buffer flush fails when trying to reclaim a vnode, it is too late to save the vnode, so just toss any remaining unwritten buffers rather than leaving them lying around to make trouble in the future.	2000-07-04 03:23:29 +00:00
Kirk McKusick	bdbd3ff7cf	Update tags directive to reflect the new location of soft updates and the reorganization of the eisa directory.	2000-07-04 00:18:43 +00:00
Poul-Henning Kamp	3275cf7379	Make the two calls from kern/* into softupdates #ifdef SOFTUPDATES, that is way cleaner than using the softupdates_stub stunt, which should be killed when convenient. Discussed with: mckusick	2000-07-03 13:26:54 +00:00
Poul-Henning Kamp	9282307a5d	Add device_set_softc() which does the obvious. Not objected to by: dfr	2000-07-03 13:06:29 +00:00
Poul-Henning Kamp	82d9ae4e32	Style police catches up with rev 1.26 of src/sys/sys/sysctl.h: Sanitize SYSCTL_HANDLER_ARGS so that simplistic tools can grog our sources: -sysctl_vm_zone SYSCTL_HANDLER_ARGS +sysctl_vm_zone (SYSCTL_HANDLER_ARGS)	2000-07-03 09:35:31 +00:00
Chris Costello	d41c16130b	Instead of just blindly setting -rw-rw-rw-: o Set access mode to -r--r--r-- if SS_CANTRCVMORE is set and the receive buffer is empty. o Set access mode to --w--w--w- if SS_CANTSENDMORE is set. Discussed with: alfred	2000-07-02 23:56:45 +00:00
Chris Costello	417779230b	Report -rw-rw-rw file access modes in soo_stat. Reviewed by: alfred	2000-07-02 19:31:00 +00:00
Brian Feldman	42ebfbf227	Modify ktrace's general I/O tracing, ktrgenio(), to use a struct uio * instead of a struct iovec * array and int len. Get rid of stupidly trying to allocate all of the memory and copyin()ing the entire iovec[], and instead just do the proper VOP_WRITE() in ktrwrite() using a copy of the struct uio that the syscall originally used. This solves the DoS which could easily be performed; to work around the DoS, one could also remove "options KTRACE" from the kernel. This is a very strong MFC candidate for 4.1. Found by: art@OpenBSD.org	2000-07-02 08:08:09 +00:00
Brian S. Dean	c6d3f3bfc1	Fix my own style bugs (use of spaces instead of tabs for indentation). This is a style-only change.	2000-07-01 02:40:13 +00:00
Archie Cobbs	6c66bbed1a	Move the securelevel check before loading KLD's into linker_load_file(), instead of requiring every caller of linker_load_file() to perform the check itself. This avoids netgraph loading KLD's when securelevel > 0, not to mention any future code that may call linker_load_file(). Reviewed by: dfr	2000-06-29 17:57:04 +00:00
Boris Popov	5badeabaca	Move #ifdef to the right place.	2000-06-29 09:26:26 +00:00
Boris Popov	99063cf89e	If kernel compiled with INVARIANTS: On unload, remove references from freelist to memory type defined by module. Print a warning if module defines and allocate its own memory type, but didn't free it all on unload. Reviewed by: peter	2000-06-29 03:41:30 +00:00
Chris Costello	0e8363eca9	Report a file type (S_IFIFO) in kqueue_stat().	2000-06-28 19:16:27 +00:00
Alfred Perlstein	1a61fa5e0d	don't panic the system when fpathconf is called on an unsupported filetype.	2000-06-27 23:08:36 +00:00
Alfred Perlstein	35b1da8080	remove crufty exec stuff, perl is in the base system make it work with warnings on (there was some harmless use of uninitialized variables) make it work with 'use strict' Approved by: peter	2000-06-27 19:09:55 +00:00
Poul-Henning Kamp	a8b1f9d2c9	Move prtactive to vfs from ufs. It is used all over the place.	2000-06-27 07:46:22 +00:00
Neil Blakey-Milner	47fdd692c6	Add sysctl descriptions to a few sysctls. Simply "documentation". PR: kern/8015 Submitted by: Stefan Eggers <seggers@semyam.dinoco.de>	2000-06-26 13:52:31 +00:00
Peter Wemm	ce365ee318	Some changes and fixes from Bruce: Use strtoul(), not strtol() in the hints decoder so that 'flags 0xa0ffa0ff' is not truncated to 0x7fffffff. Use a stack buffer instead of a static 100 byte bss buffer. Use \0 for the NUL character. Remove some ``excessive'' parens.	2000-06-26 09:53:37 +00:00
Jonathan Lemon	cb5ad9d362	Fix stupid braino in last commit, initialize `vp' before we test vp->v_tag. Spotted by: dillon	2000-06-25 18:10:45 +00:00
Mark Murray	b6e67f5c7d	Remove no-longer-relevant comment.	2000-06-25 10:14:06 +00:00
Mark Murray	4eeb4f04c3	Forgot this earlier; delete the old /dev/random driver, bring in the header for the new. Reviewed by: dfr	2000-06-25 09:35:40 +00:00
Dima Ruban	1a432a2f54	Fix typo (inT -> int)	2000-06-23 07:10:34 +00:00
Alfred Perlstein	c636255150	fix races in the uidinfo subsystem, several problems existed: 1) while allocating a uidinfo struct malloc is called with M_WAITOK, it's possible that while asleep another process by the same user could have woken up earlier and inserted an entry into the uid hash table. Having redundant entries causes inconsistencies that we can't handle. fix: do a non-waiting malloc, and if that fails then do a blocking malloc, after waking up check that no one else has inserted an entry for us already. 2) Because many checks for sbsize were done as "test then set" in a non atomic manner it was possible to exceed the limits put up via races. fix: instead of querying the count then setting, we just attempt to set the count and leave it up to the function to return success or failure. 3) The uidinfo code was inlining and repeating, lookups and insertions and deletions needed to be in their own functions for clarity. Reviewed by: green	2000-06-22 22:27:16 +00:00
Jonathan Lemon	c8bea19ee3	Add a hack to fail registration of kq events on a non-ufs filesystem, as support for those is non-existent at the moment.	2000-06-22 18:41:07 +00:00
Jonathan Lemon	d2693dbbc4	Add code so that the udata field is preserved across a TRACK event. When re-adding an event, do not reset the event state. If the event was pending, it will remain pending. This allows the user to change the udata field after the event was registered, while not losing any events which have already occurred. Reported by: jmg	2000-06-22 18:39:31 +00:00
Neil Blakey-Milner	445572c1ed	Add 'kern.disks', a sysctl which returns the list of disks from disk_enumerate(), space delimited. This allows non-root users to get a list of disks and will simplify libdisk's Disk_Names(). Reviewed by: phk	2000-06-22 11:44:43 +00:00
Alfred Perlstein	a79b71281c	return of the accept filter part II accept filters are now loadable as well as able to be compiled into the kernel. two accept filters are provided, one that returns sockets when data arrives the other when an http request is completed (doesn't work with 0.9 requests) Reviewed by: jmg	2000-06-20 01:09:23 +00:00
Alfred Perlstein	a72fda7154	backout accept optimizations. Requested by: jmg, dcs, jdp, nate	2000-06-18 08:49:13 +00:00
Poul-Henning Kamp	7c50d77218	Revert part of my bioops change which implemented panic(8).	2000-06-16 14:32:13 +00:00
Poul-Henning Kamp	a2e7a027a7	Virtualizes & untangles the bioops operations vector. Ref: Message-ID: <18317.961014572@critter.freebsd.dk> To: current@	2000-06-16 08:48:51 +00:00
Robert Watson	625cc84808	Second of two commits adding capability manipulation syscalls for processes. Obtained from: TrustedBSD Project	2000-06-15 23:27:18 +00:00
Robert Watson	b09b66abf6	Introduce syscalls for process capability manipulation. Currently backs onto already committed stubs. Commit one of two. Reviewed by: Damned if I can remember. Many people. Obtained from: TrustedBSD Project	2000-06-15 23:08:17 +00:00
Poul-Henning Kamp	4bd02a5609	Add disk_enumerate() for finding names of disks. Vinum and libh will need this RSN. Remove a pointless warning in the root device locating code. Remove the "wd" compatibility name from the "ad" driver. WARNING: If you have not updated to use /dev/wd* in your /etc/fstab and modern bootblocks, it would be a very good idea to do so BEFORE you upgrade your kernel.	2000-06-15 20:30:53 +00:00
Alfred Perlstein	8f4e4aa5f1	add socketoptions DELAYACCEPT and HTTPACCEPT which will not allow an accept() until the incoming connection has either data waiting or what looks like a HTTP request header already in the socketbuffer. This ought to reduce the context switch time and overhead for processing requests. The initial idea and code for HTTPACCEPT came from Yahoo engineers and has been cleaned up and a more lightweight DELAYACCEPT for non-http servers has been added Reviewed by: silence on hackers.	2000-06-15 18:18:43 +00:00
Peter Wemm	7d02379e48	As a bit of a gross hack, allow earlier access to both the static and dynamic hints. This allows the resource_XXX_value() calls to work before malloc() has started. This gets the serial console working as well as a few other things.	2000-06-15 09:57:20 +00:00
Peter Wemm	690f8fc4c3	Fix a stray debug output. change if (1 || bootverbose) to if (bootverbose)	2000-06-15 04:12:17 +00:00
Bruce Evans	8e8cac5555	sys/malloc.h: Order the SYSINIT() for MALLOC_DEFINE() correctly so that malloc() doesn't have to waste time initializing itself. The (SI_SUB_KMEM, SI_ORDER_ANY) order was shared with syscons' SYSINIT() for scmeminit(), and scmeminit() calls malloc(), so malloc() initialization was not always complete on the first call to malloc(). kern/kern_malloc.c: - Removed self-initialization in malloc(). - Removed half-baked sanity check in free(). Trust MALLOC_DEFINE().	2000-06-14 18:31:42 +00:00
Peter Wemm	f71c01cc52	Borrow phk's axe and apply the next stage of config(8)'s evolution. Use Warner Losh's "hint" driver to decode ascii strings to fill the resource table at boot time. config(8) no longer generates an ioconf.c table - ie: the configuration no longer has to be compiled into the kernel. You can reconfigure your isa devices with the likes of this at loader(8) time: set hint.ed.0.port=0x320 userconfig will be rewritten to use this style interface one day and will move to /boot/userconfig.4th or something like that. It is still possible to statically compile in a set of hints into a kernel if you do not wish to use loader(8). See the "hints" directive in GENERIC as an example. All device wiring has been moved out of config(8). There is a set of helper scripts (see i386/conf/gethints.pl, and the same for alpha and pc98) that extract the 'at isa? port foo irq bar' from the old files and produces a hints file. If you install this file as /boot/device.hints (and update /boot/defaults/loader.conf - You can do a build/install in sys/boot) then loader will load it automatically for you. You can also compile in the hints directly with: hints "device.hints" as well. There are a few things that I'm not too happy with yet. Under this scheme, things like LINT would no longer be useful as "documentation" of settings. I have renamed this file to 'NOTES' and stored the example hints strings in it. However... this is not something that config(8) understands, so there is a script that extracts the build-specific data from the documentation file (NOTES) to produce a LINT that can be config'ed and built. A stack of man4 pages will need updating. :-/ Also, since there is no longer a difference between 'device' and 'pseudo-device' I collapsed the two together, and the resulting 'device' takes a 'number of units' for devices that still have it statically allocated. eg: 'device fe 4' will compile the fe driver with NFE set to 4. You can then set hints for 4 units (0 - 3). Also note that 'device fe0' will be interpreted as "zero units of 'fe'" which would be bad, so there is a config warning for this. This is only needed for old drivers that still have static limits on numbers of units. All the statically limited drivers that I could find were marked. Please exercise EXTREME CAUTION when transitioning! Moral support by: phk, msmith, dfr, asmodai, imp, and others	2000-06-13 22:28:50 +00:00
Jeroen Ruigrok van der Werven	3b43fd626a	Fix panic by moving the prp == 0 check up the order of sanity checks. Submitted by: Bart Thate <freebsd@1st.dudi.org> on -current Approved by: rwatson	2000-06-13 15:44:04 +00:00
Alfred Perlstein	8757e5bbc5	unstatic getfp() so that other subsystems can use it. make sendfile() use it. Approved by: dg	2000-06-12 18:06:12 +00:00
Bruce Evans	0477138dad	Fixed allocation of unit numbers. Allocate the amount of space actually required (rounded up a little) instead of twice the previous amount (or a fixed amount for the first allocation). The bug caused memory corruption when a new unit number for a devclass was more than about twice the previous maximum one (or more than 3 for the first one), so it corrupted memory (which happened to be the atkbdc port resource list) in the reporter's configuration with sio unit numbers { 0, 25, 1, 2, ... }. Reviewed by: dfr Reported by: Leonid Lukiyanets <stalwar78@hotmail.com>	2000-06-11 07:19:20 +00:00
Poul-Henning Kamp	c27f4d3c50	fix a typo	2000-06-10 19:21:20 +00:00
Peter Wemm	53cc6add2a	Unused include: #include "pty.h"	2000-06-10 07:12:40 +00:00
Jonathan Lemon	d36cb22369	malloc(..., M_WAITOK) will not return NULL, so remove the error handling for this case (which was slightly broken anyway) Fix up some whitespace problems while I'm here too. Submitted by: alfred (in a slightly different form)	2000-06-10 01:51:18 +00:00
Robert Watson	e812e4917d	Dammit. Trimmed an extra sysctl when I moved kern.suser_permitted from kern_mib.c to kern_prot.c. This commit should restore it, as well as fix the resulting build problems. Submitted by: asmodai	2000-06-07 18:54:41 +00:00
Robert Watson	a996141f6e	Introduce additional POSIX.1e-related stubs o options CAPABILITIES o kern/kern_cap.c -- syscall stubs returning ENOSYS syscalls.master changes to follow Obtained from: TrustedBSD Project	2000-06-07 04:53:49 +00:00
Robert Watson	579f4eb4cd	o bde suggested moving the SYSCTL from kern_mib to the more appropriate kern_prot, which cleans up some namespace issues o Don't need a special handler to limit un-setting, as suser is used to protect suser_permitted, making it one-way by definition. Suggested by: bde	2000-06-05 18:30:55 +00:00
Robert Watson	0309554711	o Introduce kern.suser_permitted, a sysctl that disables the suser_xxx() returning anything but EPERM. o suser is enabled by default; once disabled, cannot be reenabled o To be used in alternative security models where uid0 does not connote additional privileges o Should be noted that uid0 still has some additional powers as it owns many important files and executables, so suffers from the same fundamental security flaws as securelevels. This is fixed with MAC integrity protection code (in progress) o Not safe for consumption unless you are really sure you don't want things like shutdown to work, et al :-) Obtained from: TrustedBSD Project	2000-06-05 14:53:55 +00:00
Robert Watson	7cadc2663e	o Modify jail to limit creation of sockets to UNIX domain sockets, TCP/IP (v4) sockets, and routing sockets. Previously, interaction with IPv6 was not well-defined, and might be inappropriate for some environments. Similarly, sysctl MIB entries providing interface information also give out only addresses from those protocol domains. For the time being, this functionality is enabled by default, and toggleable using the sysctl variable jail.socket_unixiproute_only. In the future, protocol domains will be able to determine whether or not they are ``jail aware''. o Further limitations on process use of getpriority() and setpriority() by jailed processes. Addresses problem described in kern/17878. Reviewed by: phk, jmg	2000-06-04 04:28:31 +00:00
Bruce Evans	f47f0edde4	Use "nm | awk ..." instead of genassym(1) to generate symbol value headers. Symbol values are now represented using array sizes (4 arrays per symbol so that 16-bit machines can represent 64-bit values) instead of being raw binary values. Reviewed by: marcel	2000-06-02 09:27:48 +00:00
Mike Smith	c3c50c4e3a	Further fixes for multiple-IO-APIC systems from Tor Egge: Further experimentation showed that some Dell 2450 machines with the prevention kludge installed still got T_RESERVED traps. CPU interrupt vector 0x7A was observed to be triggered. This might have been the bitwise OR of two different vectors sent from each of the IOAPICs at the same time. IOAPIC #0: 0x68 --> irq 8: RTC timer interrupt IOAPIC #1: 0x32 --> irq 18: scsi host adapter or network interface ---- 0x7a --> T_RESERVED Both IOAPICs had ID 0. Appendix B.3 in the MP spec indicates that the operating system is responsible for assigning unique IDs to the IOAPICs. The enclosed patch programs the IOAPIC IDs according to the IOAPIC entries in the MP table. Submitted by: tegge	2000-05-31 21:37:28 +00:00
Matthew Dillon	8b03c8ed5e	This is a cleanup patch to Peter's new OBJT_PHYS VM object type and sysv shared memory support for it. It implements a new PG_UNMANAGED flag that has slightly different characteristics from PG_FICTICIOUS. A new sysctl, kern.ipc.shm_use_phys has been added to enable the use of physically-backed sysv shared memory rather then swap-backed. Physically backed shm segments are not tracked with PV entries, allowing programs which use a large shm segment as a rendezvous point to operate without eating an insane amount of KVM in the PV entry management. Read: Oracle. Peter's OBJT_PHYS object will also allow us to eventually implement page-table sharing and/or 4MB physical page support for such segments. We're half way there.	2000-05-29 22:40:54 +00:00
Doug Rabson	ca2e05343b	Add taskqueue system for easy-to-use SWIs among other things. Reviewed by: arch	2000-05-28 15:45:30 +00:00
Søren Schmidt	d5f65fcbd7	If devclass_alloc_unit() is called with a wired unit #, and this is busy, only search upwards for a free slot to use. This broke unit numbering on ATA systems where PCI attached controllers come before the mainboard ones... Reviewed by: dfr	2000-05-26 13:59:05 +00:00
Jake Burkholder	e39756439c	Back out the previous change to the queue(3) interface. It was not discussed and should probably not happen. Requested by: msmith and others	2000-05-26 02:09:24 +00:00
Jake Burkholder	740a1973a6	Change the way that the queue(3) structures are declared; don't assume that the type argument to _HEAD and _ENTRY is a struct. Suggested by: phk Reviewed by: phk Approved by: mdodd	2000-05-23 20:41:01 +00:00
Mike Smith	b38f58db69	Make a trip to Pointy-Hats-R-Us and actually include the header that defines ROOTDEVNAME. Submitted by: "Jeffrey S. Sharp" <jss@subatomix.com>	2000-05-22 17:25:47 +00:00
David E. O'Brien	d4af7a50dc	Sort the sys includes.	2000-05-22 17:09:13 +00:00
Brian Feldman	a274d19ba2	Back out NOTE_EXIT status reporting pending discussion.	2000-05-21 16:27:41 +00:00
Peter Wemm	24488c7498	Provide a temporary undocumented option: SHM_PHYS_BACKED. This will become sysctl and/or flags controlled later. It's mainly here for an easy place to test the physical memory backed objects.	2000-05-21 13:52:13 +00:00
Brian Feldman	a24b514d72	Put the wait(2) exit status in "data" for NOTE_EXIT kevents.	2000-05-17 01:16:11 +00:00
Jeroen Ruigrok van der Werven	01f76720fb	Fix the rootmount code for now. This function will probably be rewritten/renamed to devpp. Submitted by: Assar Westerlund <assar@sics.se> on -current Confirmed to work: Steinar Haug <sthaug@nethelp.no>, Manfred Antar <mantar@pacbell.net> Reviewed by: phk	2000-05-14 07:43:12 +00:00
Jeroen Ruigrok van der Werven	37d90a44af	Fix comment typo. Submitted by: nrahlstr	2000-05-12 16:06:49 +00:00
Chris Costello	040fac0bbd	Include the UID and GID values filled in by socreate() into socket->so_cred for stat() calls. Reviewed by: phk	2000-05-11 22:08:57 +00:00
Chris Costello	12861d58db	Include UID and GID information for stat() calls using the values filled into the file descriptor data by falloc(). Reviewed by: phk	2000-05-11 22:08:20 +00:00
Bruce Evans	9114579d7a	Regenerated (fixed the calculation of sy_nargs in sysent tables).	2000-05-09 21:52:02 +00:00
Bruce Evans	6b972e0bdd	Fixed the calculation of sy_nargs in sysent tables. We attempted to do this in awk using the hack of counting args of type off_t twice and args of all other types once. This is too simple to work. It gave benignly wrong results on alphas (off_t shouldn't be counted twice) and for svr4_sys_mmap64() on i386's (off64_t should be counted twice). It gave fatally wrong results for i386's with 64-bit longs (longs should be counted twice). The correct value for sy_nargs is easier to determine from the size of the args struct anyway, except for complications to make the generated code almost readable. Improved formatting of sysent tables by lining up the comments where possible.	2000-05-09 21:18:30 +00:00
Poul-Henning Kamp	192c06ea1b	Change the "bdev-whiner" to whine when open is attempted and extend the deadline a month.	2000-05-09 18:53:57 +00:00
Matthew Dillon	d2ba455c2c	Some ioctl routines assume that the ioctl buffer is aligned, but a char[] declaration makes no such guarantee. A union is used to force alignment of the char buffer.	2000-05-09 17:43:21 +00:00
Bruce Evans	4aee570d90	Regenerated (fixed the type of mmap()'s padding arg).	2000-05-09 08:35:51 +00:00
Bruce Evans	aa4b7eae22	Fixed the declaration of mmap(). The crufty padding arg had the wrong type. This gave an inconsistent amount of crufty padding on i386's with 64-bit longs (8 bytes instead of 4). On alphas it gives a consistent amount of crufty padding (8 bytes) in addition to the 4 bytes of normal padding caused by passing int args as register_t's. Fixed the args struct tag for the NOPROTO syscalls (netbsd_lchown() and netbsd_msync()). The tag is currently unused for NOPROTO syscalls, so the bug has no effect, but it will be used even in the NOPROTO case to calculate sy_nargs correctly.	2000-05-09 08:31:06 +00:00
Peter Wemm	0e59fec6d8	Make issetugid return correctly. It was returning -1 with errno == 1 if it was set?id! Submitted by: Valentin Nechayev <netch@segfault.kiev.ua>	2000-05-09 00:58:34 +00:00
Greg Lehey	72cc7e2dce	Correct a couple of typos.	2000-05-07 05:09:45 +00:00
Poul-Henning Kamp	ad7ba3d455	Remove devstat_end_transaction_buf() everybody uses devstat_end_transaction_bio() now.	2000-05-06 06:59:08 +00:00
Poul-Henning Kamp	9626b608de	Separate the struct bio related stuff out of <sys/buf.h> into <sys/bio.h>. <sys/bio.h> is now a prerequisite for <sys/buf.h> but it shall not be made a nested include according to bde's teachings on the subject of nested includes. Diskdrivers and similar stuff below specfs::strategy() should no longer need to include <sys/buf.h> unless they need caching of data. Still a few bogus uses of struct buf to track down. Repocopy by: peter	2000-05-05 09:59:14 +00:00
Jonathan Lemon	b4b03426ca	Fix one bug where the kn_head list could be manipulated without spl() protection in the case of a copyout error. Add missing spl calls around the initial activation call that is done when the kevent is added. Add two KASSERT macros to help catch errors in the future.	2000-05-04 20:19:17 +00:00
Paul Richards	8651b9ec1b	If BUS_DEBUG is defined then create a sysctl, debug.bus_debug, that is used to control whether the debug messages are output at runtime. It defaults to on so that if you define BUS_DEBUG in your kernel then you get all the debugging info when you boot. It's very useful for disabling all the debugging info when you're developing a loadable device driver and you're doing lots of loads and unloads but don't always want to see all the debugging info.	2000-05-03 17:45:04 +00:00
Paul Richards	c0151c49d2	Replace all the ifdef debugging spaghetti with a single ifdef and a macro so that it is easier to read the flow of the code.	2000-05-03 00:20:36 +00:00
Peter Wemm	365c5db0a7	Add $FreeBSD$	2000-05-01 20:32:07 +00:00
Poul-Henning Kamp	017ef345bc	Give struct bio its own call back mechanism.	2000-05-01 13:36:25 +00:00
Peter Wemm	ab063af911	Move the MSG* and SEM* options to opt_sysvipc.h Remove evil allocation macros from machdep.c (why was that there???) and use malloc() instead. Move parameters out of param.h and into the code itself. Move a bunch of internal definitions from public sys/*.h headers (without #ifdef _KERNEL even) into the code itself. I had hoped to make some of this more dynamic, but the cost of doing wakeups on all sleeping processes on old arrays was too frightening. The other possibility is to initialize on the first use, and allow dynamic sysctl changes to parameters right until that point. That would allow /etc/rc.sysctl to change SEM* and MSG* defaults as we presently do with SHM*, but without the nightmare of changing a running system.	2000-05-01 13:33:56 +00:00
Peter Wemm	2553c04ce2	Regenerate (removed semconfig)	2000-05-01 11:14:08 +00:00
Peter Wemm	b423446cc0	Remove the undocumented, flawed, broken-as-designed semconfig() syscall.	2000-05-01 11:13:41 +00:00
Peter Wemm	39e4c0c888	Remove undocumented broken-as-designed semconfig() syscall.	2000-05-01 11:11:44 +00:00
Andrey A. Chernov	051f60b976	Move t_timeout initializing to ttyregister Pointed-by: bde	2000-05-01 10:51:54 +00:00
Doug Rabson	4b4a49fda5	* Move the driver_t::refs field to kobj_t to replace kobj_t::instances. * Back out a couple of workarounds for the confusion between kobj_t::instances and driver_t::refs.	2000-05-01 10:45:15 +00:00
Andrey A. Chernov	ef4de1ad38	Since ptys are allocated dynamically, there is no need to keep their t_timeout across close, so move t_timeout initializing to ptcopen	2000-05-01 10:24:21 +00:00
Andrey A. Chernov	4eaed34ba0	Set t_timeout to its default sysctl value only once in ttyopen Initialize t_timeout to -1 for this reason Pointed-by: bde	2000-05-01 09:05:03 +00:00
Poul-Henning Kamp	2c9b67a8df	Remove unneeded #include <vm/vm_zone.h> Generated by: src/tools/tools/kerninclude	2000-04-30 18:52:11 +00:00
Brian Feldman	226f14bc83	Change the scheduler to actually respect the PUSER barrier. It's been wrong for many years that negative niceness would lower the priority of a process below PUSER, and once below PUSER, there were conditionals in the code that are required to test for whether a process was in the kernel which would break. The breakage could (and did) cause lock-ups, basically nothing else but the least nice program being able to run in some conditions. The algorithm which adjusts the priority now subtracts PRIO_MIN to do things properly, and the ESTCPULIM() algorithm was updated to use PRIO_TOTAL (PRIO_MAX - PRIO_MIN) to calculate the estcpu. NICE_WEIGHT is now 1 to accommodate the full range of priorities better (a -20 process with full CPU time has the priority of a +0 process with no CPU time). There are now 20 queues (exactly; 80 priorities) for use in user processes' scheduling, and PUSER has been lowered to 48 to accomplish this. This means, to the user, that things will be scheduled more correctly (noticeable), there is no lock-up anymore WRT a niced -20 process never releasing the CPU time for other processes. In this fair system, tsleep()ed < PUSER processes now will get the proper higher priority than priority >= PUSER user processes. The detective work of this was done by me, along with part of the solution. Luoqi Chen has provided most of the solution, and really helped me understand what was happening better, to boot :) Submitted by: luoqi Concept reviewed by: bde	2000-04-30 18:33:43 +00:00
Andrey A. Chernov	c1d0c3a89d	Add sysctl variable to set initial drainwait timeout on ttyopen, default to 5 minutes	2000-04-30 16:00:53 +00:00
Poul-Henning Kamp	95bdaa0ee8	Hmm, diff/patch still doesn't like me. Missed one s/biowait/bufwait/g	2000-04-30 06:16:03 +00:00
Poul-Henning Kamp	87150cb06d	s/biowait/bufwait/g Prodded by: several.	2000-04-29 16:25:22 +00:00
Poul-Henning Kamp	c1462ad325	Remove a leftover dysonism.	2000-04-29 16:14:10 +00:00
Poul-Henning Kamp	eb95c536ad	Remove unneeded #include <sys/kernel.h>	2000-04-29 15:36:14 +00:00
Peter Wemm	eb2d8c2e8a	The newer module dependency code exposes an apparent bug in the bus/driver/kobj system. I am not 100% sure that this is the correct fix, but it is harmless and does seem to solve the problem. At worst, it could cause a tiny memory leak at unload time - this is better than a free(NULL) and subsequent panic. I'm waiting for comments from Doug about this. This may yet be backed out and fixed differently. The change itself is to increment the reference count on drivers in one case where it appears to have been missed. When everything is unloaded, kobj_class_free() was being called twice in some cases, and panicking the second time.	2000-04-29 13:24:35 +00:00
Peter Wemm	54823af256	First round implementation of a fine grain enhanced module to module version dependency system. This isn't quite finished, but it is at a useful stage to do a functional checkpoint. Highlights: - version and dependency metadata is gathered via linker sets, so things are handled the same for static kernels and code built to live in a kld. - The dependencies are at module level (versus at file level). - Dependencies determine kld symbol search order - this means that you cannot link against symbols in another file unless you depend on it. This is so that you cannot accidentally unload the target out from underneath the ones referencing it. - It is flexible enough that we can put tags in #include files and macros so that we can get decent hooks for enforcing recompiles on incompatible ABI changes. eg: if we change struct proc, we could force a recompile for all kld's that reference the proc struct. - Tangled dependency references at boot time are sorted. Files are relocated once all their dependencies are already relocated. Caveats: - Loader support is incomplete, but has been worked on separately. - Actual enforcement of the version number tags is not active yet - just the module dependencies are live. The actual structure of versioning hasn't been agreed on yet. (eg: major.minor, or whatever) - There is some backwards compatibility for old modules without metadata but I'm not sure how good it is. This is based on work originally done by Boris Popov (bp@freebsd.org), but I'm not sure he'd recognize much of it now. Don't blame him. :-) Also, ideas have been borrowed from Mike Smith.	2000-04-29 13:19:31 +00:00
Peter Wemm	7c3fdf6bbc	Do not fault if curproc is null.	2000-04-29 11:32:15 +00:00
Peter Wemm	ef83592d2c	Do not use uprintf() for link time error messages. This has unpleasant consequences when it happens in the preload support, before curproc or the tty system exist.	2000-04-29 11:21:44 +00:00
David E. O'Brien	b870c55839	Hookup /dev/[u]random on the Alpha.	2000-04-28 17:18:48 +00:00
Andrey A. Chernov	2cddfc0992	Add default 5min timeout for output drain to stop hanging on exit or in other places when connection dropped	2000-04-27 20:14:21 +00:00
Matthew Dillon	d323ddf317	Fix #! script exec under linux emulation. If a script is exec'd from a program running under linux emulation, the script binary is checked for in /compat/linux first. Without this patch the wrong script binary (i.e. the FreeBSD binary) will be run instead of the linux binary. For example, #!/bin/sh, thus breaking out of linux compatibility mode. This solves a number of problems people have had installing linux software on FreeBSD boxes.	2000-04-26 20:58:40 +00:00
Brian Feldman	b7db19017b	Move procfs_fullpath() to vfs_cache.c, with a rename to textvp_fullpath(). There's no excuse to have code in synthetic filestores that allows direct references to the textvp anymore. Feature requested by: msmith Feature agreed to by: warner Move requested by: phk Move agreed to by: bde	2000-04-26 11:57:45 +00:00
Matt Jacob	94a0705727	Remove unused variable.	2000-04-26 00:20:01 +00:00
Poul-Henning Kamp	67f3c95cf9	Clone the {b|bio}_offset field, and make sure it is always initialized in struct bio. Eventually, bio_offset will probably obsolete the bio_blkno and bio_pblkno fields. Remove the special hack in atapi-cd.c to determine if bio_offset was valid.	2000-04-25 10:51:18 +00:00
David E. O'Brien	b0e56cde37	* Use sys/sys/random.h rather than a i386 specific one. * There was nothing that should be machine dependant about i386/isa/random_machdep.c, so it is now sys/kern/kern_random.c.	2000-04-24 17:30:08 +00:00
Doug Rabson	326e27d81f	* Rewrite to use kobj(9) instead of hard-coded function tables. * Report link errors to stdout with uprintf() so that the user can see what went wrong (PR kern/9214). * Add support code to allow module symbols to be loaded into GDB using the debugger's "sharedlibrary" command.	2000-04-24 17:08:04 +00:00
Garrett Wollman	4505fec89e	Add $FreeBSD$. Initialize the POSIX.1b sysconf information appropriately for non-optional kernel functions.	2000-04-22 15:13:06 +00:00
Doug Rabson	0d484d4793	Make sure the driver's ops table has been initialised before calling static methods.	2000-04-22 15:03:08 +00:00
Brian Feldman	8a2852b12f	Move the declaration of "struct namecache" to vnode.h, as it can be useful elsewhere. Note, of course, that in an ideal world nothing should need to see our VFS implementation :-/	2000-04-22 03:44:00 +00:00
Poul-Henning Kamp	3389ae9350	Remove ~25 unneeded #include <sys/conf.h> Remove ~60 unneeded #include <sys/malloc.h>	2000-04-19 14:58:28 +00:00
Poul-Henning Kamp	ed6aff7387	Remove unneeded <sys/buf.h> includes. Due to some interesting cpp tricks in lockmgr, the LINT kernel shrinks by 924 bytes.	2000-04-18 15:15:39 +00:00
Poul-Henning Kamp	11f8a0ca77	Retire bufqdisksort(), all drivers use bioqdisksort now.	2000-04-18 13:25:19 +00:00
Poul-Henning Kamp	19583a8007	Don't declare common variables in include files: move buftimelock til vfs_bio.c where it is initialized.	2000-04-18 11:21:28 +00:00
David E. O'Brien	c815a20cb2	Change our ELF binary branding to something more acceptable to the Binutils maintainers. After we established our branding method of writing upto 8 characters of the OS name into the ELF header in the padding; the Binutils maintainers and/or SCO (as USL) decided that instead the ELF header should grow two new fields -- EI_OSABI and EI_ABIVERSION. Each of these are an 8-bit unsigned integer. SCO has assigned official values for the EI_OSABI field. In addition to this, the Binutils maintainers and NetBSD decided that a better ELF branding method was to include ABI information in a ".note" ELF section. With this set of changes, we will now create ELF binaries branded using both "official" methods. Due to the complexity of adding a section to a binary, binaries branded with ``brandelf'' will only brand using the EI_OSABI method. Also due to the complexity of pulling a section out of an ELF file vs. poking around in the ELF header, our image activator only looks at the EI_OSABI header field. Note that a new kernel can still properly load old binaries except for Linux static binaries branded in our old method. * * For a short period of time, ``ld'' will also brand ELF binaries * using our old method. This is so people can still use kernel.old * with a new world. This support will be removed before 5.0-RELEASE, * and may not last anywhere upto the actual release. My expiration * time for this is about 6mo. *	2000-04-18 02:39:26 +00:00
Doug Rabson	8cb3dda2df	Fix LINT.	2000-04-17 08:09:43 +00:00
Warner Losh	d543f330aa	Issue a detached message after detaching the device. Not Objected to by: new-bus@	2000-04-17 04:30:48 +00:00
Jonathan Lemon	3ee12e4fe3	Add files that I forgot to `cvs add' on last commit.	2000-04-16 19:02:08 +00:00
Jonathan Lemon	cb679c385e	Introduce kqueue() and kevent(), a kernel event notification facility.	2000-04-16 18:53:38 +00:00
Poul-Henning Kamp	8177437d85	Complete the bio/buf divorce for all code below devfs::strategy Exceptions: Vinum untouched. This means that it cannot be compiled. Greg Lehey is on the case. CCD not converted yet, casts to struct buf (still safe) atapi-cd casts to struct buf to examine B_PHYS	2000-04-15 05:54:02 +00:00
Doug Rabson	f7b7769172	* Factor out the object system from new-bus so that it can be used by non-device code. * Re-implement the method dispatch to improve efficiency. The new system takes about 40ns for a method dispatch on a 300Mhz PII which is only 10ns slower than a direct function call on the same hardware. This changes the new-bus ABI slightly so make sure you re-compile any driver modules which you use.	2000-04-08 14:17:18 +00:00
Archie Cobbs	b76f24f759	Fix a bug where SIGIO was not being delivered to a process requesting async I/O when a tty device became writable. PR: kern/8324 Submitted by: Don Lewis <Don.Lewis@tsc.tdk.com>	2000-04-05 18:38:21 +00:00
Alfred Perlstein	6288517674	regenerate with MPSAFE from syscalls.master	2000-04-03 06:36:57 +00:00
Alfred Perlstein	c01df63183	Make makesyscalls.sh parse an optional field 'MPSAFE' that specifies that a syscall does not want the BGL to be grabbed automatically. Add the new MPSAFE flag to the syscalls that dillon has determined to be MPSAFE.	2000-04-03 06:36:14 +00:00
Poul-Henning Kamp	282ac69ede	Clone bio versions of certain bits of infrastructure: devstat_end_transaction_bio() bioq_* versions of bufq_* incl bioqdisksort() the corresponding "buf" versions will disappear when no longer used. Move b_offset, b_data and b_bcount to struct bio. Add BIO_FORMAT as a hack for fd.c etc. We are now largely ready to start converting drivers to use struct bio instead of struct buf.	2000-04-02 19:08:05 +00:00
Matthew Dillon	7c8fdcbd19	Make the sigprocmask() and geteuid() system calls MP SAFE. Expand commentary for copyin/copyout to indicate that they are MP SAFE as well. Reviewed by: msmith	2000-04-02 17:52:43 +00:00
Poul-Henning Kamp	c244d2de43	Move B_ERROR flag to b_ioflags and call it BIO_ERROR. (Much of this done by script) Move B_ORDERED flag to b_ioflags and call it BIO_ORDERED. Move b_pblkno and b_iodone_chain to struct bio while we transition, they will be obsoleted once bio structs chain/stack. Add bio_queue field for struct bio aware disksort. Address a lot of stylistic issues brought up by bde.	2000-04-02 15:24:56 +00:00
Poul-Henning Kamp	8c125869a9	Draw the outline of "struct bio". Struct bio is the future carrier of I/O requests for "struct buf".	2000-04-02 09:26:51 +00:00
Matthew Dillon	e4649cfac3	Change the write-behind code to take more care when starting async I/O's. The sequential read heuristic has been extended to cover writes as well. We continue to call cluster_write() normally, thus blocks in the file will still be reallocated for large (but still random) I/O's, but I/O will only be initiated for truly sequential writes. This solves a number of annoying situations, especially with DBM (hash method) writes, and also has the side effect of fixing a number of (stupid) benchmarks. Reviewed-by: mckusick	2000-04-02 00:55:28 +00:00
Brian Feldman	76e90dbcc9	Unstaticize this driver. You can have as many snoop devices as you can mknod :) Clean things up a lot while I'm here. A lot of KNF changes.	2000-04-02 00:35:37 +00:00
Warner Losh	ce73953a1e	device_set_unit() DO NOT USE THIS. This was approved before 4.0 release for inclusion into the release, but bde talked me out of committing the module that needs this until after the release. It is after the release now. :-)	2000-04-01 06:06:37 +00:00
Peter Wemm	a84e0a1cfe	Remove #ifdef for sem_wakeup() - we just use wakeup().	2000-03-30 11:35:25 +00:00
Peter Wemm	255108f385	Make sysv-style shared memory tuneable params fully runtime adjustable via sysctl. It's done pretty simply but it should be quite adequate. Also move SHMMAXPGS from $machine/include/vmparam.h as the comments that went with it were wrong... we don't allocate KVM space for the pages so that comment is bogus.. The only practical limit is how much physical ram you want to lock up as this stuff isn't paged out or swap backed.	2000-03-30 07:17:05 +00:00
Matthew Dillon	db6a426158	The SMP cleanup commit broke UP compiles. Make UP compiles work again.	2000-03-28 18:06:49 +00:00
Matthew Dillon	36e9f877df	Commit major SMP cleanups and move the BGL (big giant lock) in the syscall path inward. A system call may select whether it needs the MP lock or not (the default being that it does need it). A great deal of conditional SMP code for various deadended experiments has been removed. 'cil' and 'cml' have been removed entirely, and the locking around the cpl has been removed. The conditional separately-locked fast-interrupt code has been removed, meaning that interrupts must hold the CPL now (but they pretty much had to anyway). Another reason for doing this is that the original separate-lock for interrupts just doesn't apply to the interrupt thread mechanism being contemplated. Modifications to the cpl may now ONLY occur while holding the MP lock. For example, if an otherwise MP safe syscall needs to mess with the cpl, it must hold the MP lock for the duration and must (as usual) save/restore the cpl in a nested fashion. This is precursor work for the real meat coming later: avoiding having to hold the MP lock for common syscalls and I/O's and interrupt threads. It is expected that the spl mechanisms and new interrupt threading mechanisms will be able to run in tandem, allowing a slow piecemeal transition to occur. This patch should result in a moderate performance improvement due to the considerable amount of code that has been removed from the critical path, especially the simplification of the spl*() calls. The real performance gains will come later. Approved by: jkh Reviewed by: current, bde (exception.s) Some work taken from: luoqi's patch	2000-03-28 07:16:37 +00:00
Matthew Dillon	7c58e473f5	Commit the buffer cache cleanup patch to 4.x and 5.x. This patch fixes a fragmentation problem due to geteblk() reserving too much space for the buffer and imposes a larger granularity (16K) on KVA reservations for the buffer cache to avoid fragmentation issues. The buffer cache size calculations have been redone to simplify them (fewer defines, better comments, less chance of running out of KVA). The geteblk() fix solves a performance problem that DG was able to reproduce. This patch does not completely fix the KVA fragmentation problems, but it goes a long way. Mostly Reviewed by: bde and others Approved by: jkh	2000-03-27 21:29:33 +00:00
Kris Kennaway	8c6ac5e5a5	Reword warning to make it clearer (I read it as "remove block devices created before 2000-06-01" which is obviously not what was intended :-)	2000-03-25 21:10:20 +00:00
Matthew Dillon	f1924a54f8	Fix in-kernel infinite loop in pipe_write() when the reader goes away at just the wrong time.	2000-03-24 00:47:37 +00:00
Poul-Henning Kamp	e004067750	Whine at users who still have block devices in /dev, give them until june 1st to fix their system.	2000-03-21 19:25:56 +00:00
Paul Saab	e5a28db9f5	Add sysctl kern.coredump to enable/disable core dumps system wide.	2000-03-21 07:10:42 +00:00
Brian Feldman	16aae9cbc0	Split the logic of static int setrootbyname(char *name); out into dev_t getdiskbyname(char *name); This makes it easy to create a new DDB command, which is the big reason for the change. You can now do the following in DDB: Example rc.conf entry: dumpdev="/dev/ad0s1b" # Device name to crashdump to (if enabled). db> show disk/ad0s1b dev_t = 0xc0b7ea00 db> p *dumpdev c0b7ea00	2000-03-20 16:28:35 +00:00
Poul-Henning Kamp	91266b96c4	Isolate the Timecounter internals in their own two files. Make the public interface more systematically named. Remove the alternate method, it doesn't do any good, only ruins performance. Add counters to profile the usage of the 8 access functions. Apply the beer-ware to my code. The weird +/- counts are caused by two repocopies behind the scenes: kern/kern_clock.c -> kern/kern_tc.c sys/time.h -> sys/timetc.h (thanks peter!)	2000-03-20 14:09:06 +00:00
Poul-Henning Kamp	ce6acbb664	diff, patch and cvs didn't like these three last time around, try again.	2000-03-20 12:34:21 +00:00
Poul-Henning Kamp	b99c307a21	Rename the existing BUF_STRATEGY() to DEV_STRATEGY() substitute BUF_WRITE(foo) for VOP_BWRITE(foo->b_vp, foo) substitute BUF_STRATEGY(foo) for VOP_STRATEGY(foo->b_vp, foo) This patch is machine generated except for the ccd.c and buf.h parts.	2000-03-20 11:29:10 +00:00
Poul-Henning Kamp	21144e3bf1	Remove B_READ, B_WRITE and B_FREEBUF and replace them with a new field in struct buf: b_iocmd. The b_iocmd is enforced to have exactly one bit set. B_WRITE was bogusly defined as zero giving rise to obvious coding mistakes. Also eliminate the redundant struct buf flag B_CALL, it can just as efficiently be done by comparing b_iodone to NULL. Should you get a panic or drop into the debugger, complaining about "b_iocmd", don't continue. It is likely to write on your disk where it should have been reading. This change is a step in the direction towards a stackable BIO capability. A lot of this patch were machine generated (Thanks to style(9) compliance!) Vinum users: Greg has not had time to test this yet, be careful.	2000-03-20 10:44:49 +00:00
Bill Fenner	95b2b777b5	Make sure to free the socket in soabort() if the protocol couldn't free it (this could happen if the protocol already freed its part and we just kept the socket around to make sure accept(2) didn't block)	2000-03-18 08:56:56 +00:00
Chris Costello	b081a64afb	In vn_isdisk(), check whether vp->v_rdev is NULL. If it is, then return ENXIO (Device not configured). Without this, vn_isdisk() could (and did in the case of lstat() under fdesc) pass a NULL pointer to devsw(), which caused a page fault. Reviewed by: alfred	2000-03-18 01:27:44 +00:00
Nick Hibma	846664235c	Instead of using the next unit available, use the first unit available. This avoids the unit number from going up indefinitely when disconnecting and connecting 2 devices alternately. Noticed by: nsayer (quite a while ago) And stop calling DEVICE_NOMATCH at probe repeatedly. This stops the message on the PCI VGA board from being printed when loading a PCI driver.	2000-03-16 09:32:59 +00:00
Poul-Henning Kamp	db5f635acc	Eliminate the undocumented, experimental, non-delivering and highly dangerous MAX_PERF option.	2000-03-16 08:51:55 +00:00
Jun Kuriyama	dc76063419	Print "previous type" correctly when INVARIANTS is defined. Reviewed by: current@FreeBSD.org	2000-03-14 14:58:04 +00:00
Bruce Evans	05ecdd7037	Don't try so hard to make the lower 16 bits of fsids unique. It tended to recycle full fsids after only 16 mount/unmount's. This is probably too often for exported fsids. Now we recycle the full fsids only after 2^16 mount/ umount's and only ensure uniqueness in the lower 16 bits if there have been <= 256 calls to vfs_getnewfsid() since the system started.	2000-03-14 14:19:49 +00:00
Brian S. Dean	56fc73ff9b	In 'ipcperm()', only call 'suser()' if it is actually required. Previously, it was being called whether it was needed or not and the ASU flag was being set (as a side effect of calling 'suser()') in cases where superuser privileges were not actually needed. This was all pointed out to me by Bruce Evans. Reviewed by: bde	2000-03-13 23:00:08 +00:00
Poul-Henning Kamp	7de472559c	Remove unused 3rd argument from vsunlock() which abused B_WRITE.	2000-03-13 10:47:24 +00:00
Bruce Evans	61214975da	Try harder to make the lower 16 bits of fsids unique. The vfs type number was packed very wastefully, giving perfect non-uniqueness in the lower 16 bits of fsids for filesystems with the same vfs type. This made linux_stat() return perfectly non-unique (broken) 16-bit st_dev's for nfs mount points, and effectively reduced mntid_base to 8 bits so that the vfs_getnewfsid() looped endlessly when there are already 256 mounted filesystems with the required vfs type. Approved by: jkh	2000-03-12 14:23:21 +00:00
Alan Cox	af25d10c91	shmat: If VM_PROT_READ_IS_EXEC is defined and prot includes VM_PROT_READ, VM_PROT_EXECUTE must be added to prot before calling vm_map_find. Without this change, an mprotect on a shmat'ed region fails (when it shouldn't). This bug was reported Feb 28 by Brooks Davis <brooks@one-eyed-alien.net> on -hackers. Reviewed by: bde Approved by: jkh	2000-03-10 09:11:24 +00:00
Yoshinobu Inoue	8692c02553	Enable SCM_RIGHTS on alpha. Allocate necessary buffer as conversion between int and struct file *. Approved by: jkh Submitted by: brian Reviewed by: bde, brian, peter	2000-03-09 15:15:27 +00:00
Bruce Evans	a4fcac54a1	Fixed a null pointer panic for dumpon(8) on a nonexistent device whose driver uses the new disk layer. Reviewed by: phk Approved by: jkh	2000-03-09 12:40:41 +00:00
Yoshinobu Inoue	7d0d8dc306	CMSG_XXX macros alignment fixes to follow RFC2292. Approved by: jkh Submitted by: Partly from tech@openbsd Reviewed by: itojun	2000-03-03 11:13:12 +00:00
Peter Dufault	6d9a8d3e8f	I applied the wrong patch set. Back out anything associated with the known bogus currtpriority. This undoes the previous changes to sys/i386/i386/trap.c, sys/alpha/alpha/trap.c, sys/sys/systm.h Now we have the patch set approved by bde. Approved by: bde	2000-03-02 22:03:49 +00:00
Peter Dufault	383774c417	Patches that eliminate extra context switches in FIFO case. Fixes p1003_1b regression test in the simple case of no RR and FIFO processes competing. Reviewed by: jkh, bde	2000-03-02 16:20:07 +00:00
Brian S. Dean	e777d9c31a	Fix a superuser credential check. Reviewed by: phk Approved by: jkh	2000-02-29 22:58:59 +00:00
Doug Rabson	1d9a6ae08b	If a driver probe fails, unset it from the device. This fixes a problem with certain multiport cards. Approved by: jkh	2000-02-29 09:36:25 +00:00
Paul Saab	77ac690c97	Update a comment in elf_coredump to reflect that if you madvise with MADV_NOCORE, its address space is also excluded from a core file. Pointed out by: alc	2000-02-28 06:36:45 +00:00
Paul Saab	9730a5daab	Add MAP_NOCORE to mmap(2), and MADV_NOCORE and MADV_CORE to madvise(2). This feature allows you to specify if mmap'd data is included in an application's corefile. Change the type of eflags in struct vm_map_entry from u_char to vm_eflags_t (an unsigned int). Reviewed by: dillon,jdp,alfred Approved by: jkh	2000-02-28 04:10:35 +00:00


3199 Commits