freebsd-dev

Author	SHA1	Message	Date
Bruce Evans	f47f0edde4	Use "nm | awk ..." instead of genassym(1) to generate symbol value headers. Symbol values are now represented using array sizes (4 arrays per symbol so that 16-bit machines can represent 64-bit values) instead of being raw binary values. Reviewed by: marcel	2000-06-02 09:27:48 +00:00
Mike Smith	c3c50c4e3a	Further fixes for multiple-IO-APIC systems from Tor Egge: Further experimentation showed that some Dell 2450 machines with the prevention kludge installed still got T_RESERVED traps. CPU interrupt vector 0x7A was observed to be triggered. This might have been the bitwise OR of two different vectors sent from each of the IOAPICs at the same time. IOAPIC #0: 0x68 --> irq 8: RTC timer interrupt IOAPIC #1: 0x32 --> irq 18: scsi host adapter or network interface ---- 0x7a --> T_RESERVED Both IOAPICs had ID 0. Appendix B.3 in the MP spec indicates that the operating system is responsible for assigning unique IDs to the IOAPICs. The enclosed patch programs the IOAPIC IDs according to the IOAPIC entries in the MP table. Submitted by: tegge	2000-05-31 21:37:28 +00:00
Matthew Dillon	8b03c8ed5e	This is a cleanup patch to Peter's new OBJT_PHYS VM object type and sysv shared memory support for it. It implements a new PG_UNMANAGED flag that has slightly different characteristics from PG_FICTITIOUS. A new sysctl, kern.ipc.shm_use_phys has been added to enable the use of physically-backed sysv shared memory rather than swap-backed. Physically backed shm segments are not tracked with PV entries, allowing programs which use a large shm segment as a rendezvous point to operate without eating an insane amount of KVM in the PV entry management. Read: Oracle. Peter's OBJT_PHYS object will also allow us to eventually implement page-table sharing and/or 4MB physical page support for such segments. We're half way there.	2000-05-29 22:40:54 +00:00
Doug Rabson	ca2e05343b	Add taskqueue system for easy-to-use SWIs among other things. Reviewed by: arch	2000-05-28 15:45:30 +00:00
Søren Schmidt	d5f65fcbd7	If devclass_alloc_unit() is called with a wired unit #, and this is busy, only search upwards for a free slot to use. This broke unit numbering on ATA systems where PCI attached controllers come before the mainboard ones... Reviewed by: dfr	2000-05-26 13:59:05 +00:00
Jake Burkholder	e39756439c	Back out the previous change to the queue(3) interface. It was not discussed and should probably not happen. Requested by: msmith and others	2000-05-26 02:09:24 +00:00
Jake Burkholder	740a1973a6	Change the way that the queue(3) structures are declared; don't assume that the type argument to _HEAD and _ENTRY is a struct. Suggested by: phk Reviewed by: phk Approved by: mdodd	2000-05-23 20:41:01 +00:00
Mike Smith	b38f58db69	Make a trip to Pointy-Hats-R-Us and actually include the header that defines ROOTDEVNAME. Submitted by: "Jeffrey S. Sharp" <jss@subatomix.com>	2000-05-22 17:25:47 +00:00
David E. O'Brien	d4af7a50dc	Sort the sys includes.	2000-05-22 17:09:13 +00:00
Brian Feldman	a274d19ba2	Back out NOTE_EXIT status reporting pending discussion.	2000-05-21 16:27:41 +00:00
Peter Wemm	24488c7498	Provide a temporary undocumented option: SHM_PHYS_BACKED. This will become sysctl and/or flags controlled later. It's mainly here for an easy place to test the physical memory backed objects.	2000-05-21 13:52:13 +00:00
Brian Feldman	a24b514d72	Put the wait(2) exit status in "data" for NOTE_EXIT kevents.	2000-05-17 01:16:11 +00:00
Jeroen Ruigrok van der Werven	01f76720fb	Fix the rootmount code for now. This function will probably be rewritten/renamed to devpp. Submitted by: Assar Westerlund <assar@sics.se> on -current Confirmed to work: Steinar Haug <sthaug@nethelp.no>, Manfred Antar <mantar@pacbell.net> Reviewed by: phk	2000-05-14 07:43:12 +00:00
Jeroen Ruigrok van der Werven	37d90a44af	Fix comment typo. Submitted by: nrahlstr	2000-05-12 16:06:49 +00:00
Chris Costello	040fac0bbd	Include the UID and GID values filled in by socreate() into socket->so_cred for stat() calls. Reviewed by: phk	2000-05-11 22:08:57 +00:00
Chris Costello	12861d58db	Include UID and GID information for stat() calls using the values filled into the file descriptor data by falloc(). Reviewed by: phk	2000-05-11 22:08:20 +00:00
Bruce Evans	9114579d7a	Regenerated (fixed the calculation of sy_nargs in sysent tables).	2000-05-09 21:52:02 +00:00
Bruce Evans	6b972e0bdd	Fixed the calculation of sy_nargs in sysent tables. We attempted to do this in awk using the hack of counting args of type off_t twice and args of all other types once. This is too simple to work. It gave benignly wrong results on alphas (off_t shouldn't be counted twice) and for svr4_sys_mmap64() on i386's (off64_t should be counted twice). It gave fatally wrong results for i386's with 64-bit longs (longs should be counted twice). The correct value for sy_nargs is easier to determine from the size of the args struct anyway, except for complications to make the generated code almost readable. Improved formatting of sysent tables by lining up the comments where possible.	2000-05-09 21:18:30 +00:00
Poul-Henning Kamp	192c06ea1b	Change the "bdev-whiner" to whine when open is attempted and extend the deadline a month.	2000-05-09 18:53:57 +00:00
Matthew Dillon	d2ba455c2c	Some ioctl routines assume that the ioctl buffer is aligned, but a char[] declaration makes no such guarantee. A union is used to force alignment of the char buffer.	2000-05-09 17:43:21 +00:00
Bruce Evans	4aee570d90	Regenerated (fixed the type of mmap()'s padding arg).	2000-05-09 08:35:51 +00:00
Bruce Evans	aa4b7eae22	Fixed the declaration of mmap(). The crufty padding arg had the wrong type. This gave an inconsistent amount of crufty padding on i386's with 64-bit longs (8 bytes instead of 4). On alphas it gives a consistent amount of crufty padding (8 bytes) in addition to the 4 bytes of normal padding caused by passing int args as register_t's. Fixed the args struct tag for the NOPROTO syscalls (netbsd_lchown() and netbsd_msync()). The tag is currently unused for NOPROTO syscalls, so the bug has no effect, but it will be used even in the NOPROTO case to calculate sy_nargs correctly.	2000-05-09 08:31:06 +00:00
Peter Wemm	0e59fec6d8	Make issetugid return correctly. It was returning -1 with errno == 1 if it was set?id! Submitted by: Valentin Nechayev <netch@segfault.kiev.ua>	2000-05-09 00:58:34 +00:00
Greg Lehey	72cc7e2dce	Correct a couple of typos.	2000-05-07 05:09:45 +00:00
Poul-Henning Kamp	ad7ba3d455	Remove devstat_end_transaction_buf() everybody uses devstat_end_transaction_bio() now.	2000-05-06 06:59:08 +00:00
Poul-Henning Kamp	9626b608de	Separate the struct bio related stuff out of <sys/buf.h> into <sys/bio.h>. <sys/bio.h> is now a prerequisite for <sys/buf.h> but it shall not be made a nested include according to bde's teachings on the subject of nested includes. Disk drivers and similar stuff below specfs::strategy() should no longer need to include <sys/buf.h> unless they need caching of data. Still a few bogus uses of struct buf to track down. Repocopy by: peter	2000-05-05 09:59:14 +00:00
Jonathan Lemon	b4b03426ca	Fix one bug where the kn_head list could be manipulated without spl() protection in the case of a copyout error. Add missing spl calls around the initial activation call that is done when the kevent is added. Add two KASSERT macros to help catch errors in the future.	2000-05-04 20:19:17 +00:00
Paul Richards	8651b9ec1b	If BUS_DEBUG is defined then create a sysctl, debug.bus_debug, that is used to control whether the debug messages are output at runtime. It defaults to on so that if you define BUS_DEBUG in your kernel then you get all the debugging info when you boot. It's very useful for disabling all the debugging info when you're developing a loadable device driver and you're doing lots of loads and unloads but don't always want to see all the debugging info.	2000-05-03 17:45:04 +00:00
Paul Richards	c0151c49d2	Replace all the ifdef debugging spaghetti with a single ifdef and a macro so that it is easier to read the flow of the code.	2000-05-03 00:20:36 +00:00
Peter Wemm	365c5db0a7	Add $FreeBSD$	2000-05-01 20:32:07 +00:00
Poul-Henning Kamp	017ef345bc	Give struct bio its own call back mechanism.	2000-05-01 13:36:25 +00:00
Peter Wemm	ab063af911	Move the MSG* and SEM* options to opt_sysvipc.h Remove evil allocation macros from machdep.c (why was that there???) and use malloc() instead. Move parameters out of param.h and into the code itself. Move a bunch of internal definitions from public sys/*.h headers (without #ifdef _KERNEL even) into the code itself. I had hoped to make some of this more dynamic, but the cost of doing wakeups on all sleeping processes on old arrays was too frightening. The other possibility is to initialize on the first use, and allow dynamic sysctl changes to parameters right until that point. That would allow /etc/rc.sysctl to change SEM* and MSG* defaults as we presently do with SHM*, but without the nightmare of changing a running system.	2000-05-01 13:33:56 +00:00
Peter Wemm	2553c04ce2	Regenerate (removed semconfig)	2000-05-01 11:14:08 +00:00
Peter Wemm	b423446cc0	Remove the undocumented, flawed, broken-as-designed semconfig() syscall.	2000-05-01 11:13:41 +00:00
Peter Wemm	39e4c0c888	Remove undocumented broken-as-designed semconfig() syscall.	2000-05-01 11:11:44 +00:00
Andrey A. Chernov	051f60b976	Move t_timeout initializing to ttyregister Pointed-by: bde	2000-05-01 10:51:54 +00:00
Doug Rabson	4b4a49fda5	* Move the driver_t::refs field to kobj_t to replace kobj_t::instances. * Back out a couple of workarounds for the confusion between kobj_t::instances and driver_t::refs.	2000-05-01 10:45:15 +00:00
Andrey A. Chernov	ef4de1ad38	Since ptys are allocated dynamically, there is no need to keep their t_timeout across close, so move t_timeout initializing to ptcopen	2000-05-01 10:24:21 +00:00
Andrey A. Chernov	4eaed34ba0	Set t_timeout to its default sysctl value only once in ttyopen Initialize t_timeout to -1 for this reason Pointed-by: bde	2000-05-01 09:05:03 +00:00
Poul-Henning Kamp	2c9b67a8df	Remove unneeded #include <vm/vm_zone.h> Generated by: src/tools/tools/kerninclude	2000-04-30 18:52:11 +00:00
Brian Feldman	226f14bc83	Change the scheduler to actually respect the PUSER barrier. It's been wrong for many years that negative niceness would lower the priority of a process below PUSER, and once below PUSER, there were conditionals in the code that are required to test for whether a process was in the kernel which would break. The breakage could (and did) cause lock-ups, basically nothing else but the least nice program being able to run in some conditions. The algorithm which adjusts the priority now subtracts PRIO_MIN to do things properly, and the ESTCPULIM() algorithm was updated to use PRIO_TOTAL (PRIO_MAX - PRIO_MIN) to calculate the estcpu. NICE_WEIGHT is now 1 to accommodate the full range of priorities better (a -20 process with full CPU time has the priority of a +0 process with no CPU time). There are now 20 queues (exactly; 80 priorities) for use in user processes' scheduling, and PUSER has been lowered to 48 to accomplish this. This means, to the user, that things will be scheduled more correctly (noticeable), there is no lock-up anymore WRT a niced -20 process never releasing the CPU time for other processes. In this fair system, tsleep()ed < PUSER processes now will get the proper higher priority than priority >= PUSER user processes. The detective work of this was done by me, along with part of the solution. Luoqi Chen has provided most of the solution, and really helped me understand what was happening better, to boot :) Submitted by: luoqi Concept reviewed by: bde	2000-04-30 18:33:43 +00:00
Andrey A. Chernov	c1d0c3a89d	Add sysctl variable to set initial drainwait timeout on ttyopen, default to 5 minutes	2000-04-30 16:00:53 +00:00
Poul-Henning Kamp	95bdaa0ee8	Hmm, diff/patch still doesn't like me. Missed one s/biowait/bufwait/g	2000-04-30 06:16:03 +00:00
Poul-Henning Kamp	87150cb06d	s/biowait/bufwait/g Prodded by: several.	2000-04-29 16:25:22 +00:00
Poul-Henning Kamp	c1462ad325	Remove a leftover dysonism.	2000-04-29 16:14:10 +00:00
Poul-Henning Kamp	eb95c536ad	Remove unneeded #include <sys/kernel.h>	2000-04-29 15:36:14 +00:00
Peter Wemm	eb2d8c2e8a	The newer module dependency code exposes an apparent bug in the bus/driver/kobj system. I am not 100% sure that this is the correct fix, but it is harmless and does seem to solve the problem. At worst, it could cause a tiny memory leak at unload time - this is better than a free(NULL) and subsequent panic. I'm waiting for comments from Doug about this. This may yet be backed out and fixed differently. The change itself is to increment the reference count on drivers in one case where it appears to have been missed. When everything is unloaded, kobj_class_free() was being called twice in some cases, and panicking the second time.	2000-04-29 13:24:35 +00:00
Peter Wemm	54823af256	First round implementation of a fine grain enhanced module to module version dependency system. This isn't quite finished, but it is at a useful stage to do a functional checkpoint. Highlights: - version and dependency metadata is gathered via linker sets, so things are handled the same for static kernels and code built to live in a kld. - The dependencies are at module level (versus at file level). - Dependencies determine kld symbol search order - this means that you cannot link against symbols in another file unless you depend on it. This is so that you cannot accidentally unload the target out from underneath the ones referencing it. - It is flexible enough that we can put tags in #include files and macros so that we can get decent hooks for enforcing recompiles on incompatible ABI changes. eg: if we change struct proc, we could force a recompile for all kld's that reference the proc struct. - Tangled dependency references at boot time are sorted. Files are relocated once all their dependencies are already relocated. Caveats: - Loader support is incomplete, but has been worked on separately. - Actual enforcement of the version number tags is not active yet - just the module dependencies are live. The actual structure of versioning hasn't been agreed on yet. (eg: major.minor, or whatever) - There is some backwards compatibility for old modules without metadata but I'm not sure how good it is. This is based on work originally done by Boris Popov (bp@freebsd.org), but I'm not sure he'd recognize much of it now. Don't blame him. :-) Also, ideas have been borrowed from Mike Smith.	2000-04-29 13:19:31 +00:00
Peter Wemm	7c3fdf6bbc	Do not fault if curproc is null.	2000-04-29 11:32:15 +00:00
Peter Wemm	ef83592d2c	Do not use uprintf() for link time error messages. This has unpleasant consequences when it happens in the preload support, before curproc or the tty system exist.	2000-04-29 11:21:44 +00:00
David E. O'Brien	b870c55839	Hookup /dev/[u]random on the Alpha.	2000-04-28 17:18:48 +00:00
Andrey A. Chernov	2cddfc0992	Add default 5min timeout for output drain to stop hanging on exit or in other places when connection dropped	2000-04-27 20:14:21 +00:00
Matthew Dillon	d323ddf317	Fix #! script exec under linux emulation. If a script is exec'd from a program running under linux emulation, the script binary is checked for in /compat/linux first. Without this patch the wrong script binary (i.e. the FreeBSD binary) will be run instead of the linux binary. For example, #!/bin/sh, thus breaking out of linux compatibility mode. This solves a number of problems people have had installing linux software on FreeBSD boxes.	2000-04-26 20:58:40 +00:00
Brian Feldman	b7db19017b	Move procfs_fullpath() to vfs_cache.c, with a rename to textvp_fullpath(). There's no excuse to have code in synthetic filestores that allows direct references to the textvp anymore. Feature requested by: msmith Feature agreed to by: warner Move requested by: phk Move agreed to by: bde	2000-04-26 11:57:45 +00:00
Matt Jacob	94a0705727	Remove unused variable.	2000-04-26 00:20:01 +00:00
Poul-Henning Kamp	67f3c95cf9	Clone the {b|bio}_offset field, and make sure it is always initialized in struct bio. Eventually, bio_offset will probably obsolete the bio_blkno and bio_pblkno fields. Remove the special hack in atapi-cd.c to determine if bio_offset was valid.	2000-04-25 10:51:18 +00:00
David E. O'Brien	b0e56cde37	* Use sys/sys/random.h rather than a i386 specific one. * There was nothing that should be machine dependent about i386/isa/random_machdep.c, so it is now sys/kern/kern_random.c.	2000-04-24 17:30:08 +00:00
Doug Rabson	326e27d81f	* Rewrite to use kobj(9) instead of hard-coded function tables. * Report link errors to stdout with uprintf() so that the user can see what went wrong (PR kern/9214). * Add support code to allow module symbols to be loaded into GDB using the debugger's "sharedlibrary" command.	2000-04-24 17:08:04 +00:00
Garrett Wollman	4505fec89e	Add $FreeBSD$. Initialize the POSIX.1b sysconf information appropriately for non-optional kernel functions.	2000-04-22 15:13:06 +00:00
Doug Rabson	0d484d4793	Make sure the driver's ops table has been initialised before calling static methods.	2000-04-22 15:03:08 +00:00
Brian Feldman	8a2852b12f	Move the declaration of "struct namecache" to vnode.h, as it can be useful elsewhere. Note, of course, that in an ideal world nothing should need to see our VFS implementation :-/	2000-04-22 03:44:00 +00:00
Poul-Henning Kamp	3389ae9350	Remove ~25 unneeded #include <sys/conf.h> Remove ~60 unneeded #include <sys/malloc.h>	2000-04-19 14:58:28 +00:00
Poul-Henning Kamp	ed6aff7387	Remove unneeded <sys/buf.h> includes. Due to some interesting cpp tricks in lockmgr, the LINT kernel shrinks by 924 bytes.	2000-04-18 15:15:39 +00:00
Poul-Henning Kamp	11f8a0ca77	Retire bufqdisksort(), all drivers use bioqdisksort now.	2000-04-18 13:25:19 +00:00
Poul-Henning Kamp	19583a8007	Don't declare common variables in include files: move buftimelock til vfs_bio.c where it is initialized.	2000-04-18 11:21:28 +00:00
David E. O'Brien	c815a20cb2	Change our ELF binary branding to something more acceptable to the Binutils maintainers. After we established our branding method of writing up to 8 characters of the OS name into the ELF header in the padding; the Binutils maintainers and/or SCO (as USL) decided that instead the ELF header should grow two new fields -- EI_OSABI and EI_ABIVERSION. Each of these is an 8-bit unsigned integer. SCO has assigned official values for the EI_OSABI field. In addition to this, the Binutils maintainers and NetBSD decided that a better ELF branding method was to include ABI information in a ".note" ELF section. With this set of changes, we will now create ELF binaries branded using both "official" methods. Due to the complexity of adding a section to a binary, binaries branded with ``brandelf'' will only brand using the EI_OSABI method. Also due to the complexity of pulling a section out of an ELF file vs. poking around in the ELF header, our image activator only looks at the EI_OSABI header field. Note that a new kernel can still properly load old binaries except for Linux static binaries branded in our old method. For a short period of time, ``ld'' will also brand ELF binaries using our old method. This is so people can still use kernel.old with a new world. This support will be removed before 5.0-RELEASE, and may not last anywhere up to the actual release. My expiration time for this is about 6mo.	2000-04-18 02:39:26 +00:00
Doug Rabson	8cb3dda2df	Fix LINT.	2000-04-17 08:09:43 +00:00
Warner Losh	d543f330aa	Issue a detached message after detaching the device. Not Objected to by: new-bus@	2000-04-17 04:30:48 +00:00
Jonathan Lemon	3ee12e4fe3	Add files that I forgot to `cvs add' on last commit.	2000-04-16 19:02:08 +00:00
Jonathan Lemon	cb679c385e	Introduce kqueue() and kevent(), a kernel event notification facility.	2000-04-16 18:53:38 +00:00
Poul-Henning Kamp	8177437d85	Complete the bio/buf divorce for all code below devfs::strategy Exceptions: Vinum untouched. This means that it cannot be compiled. Greg Lehey is on the case. CCD not converted yet, casts to struct buf (still safe) atapi-cd casts to struct buf to examine B_PHYS	2000-04-15 05:54:02 +00:00
Doug Rabson	f7b7769172	* Factor out the object system from new-bus so that it can be used by non-device code. * Re-implement the method dispatch to improve efficiency. The new system takes about 40ns for a method dispatch on a 300MHz PII which is only 10ns slower than a direct function call on the same hardware. This changes the new-bus ABI slightly so make sure you re-compile any driver modules which you use.	2000-04-08 14:17:18 +00:00
Archie Cobbs	b76f24f759	Fix a bug where SIGIO was not being delivered to a process requesting async I/O when a tty device became writable. PR: kern/8324 Submitted by: Don Lewis <Don.Lewis@tsc.tdk.com>	2000-04-05 18:38:21 +00:00
Alfred Perlstein	6288517674	regenerate with MPSAFE from syscalls.master	2000-04-03 06:36:57 +00:00
Alfred Perlstein	c01df63183	Make makesyscalls.sh parse an optional field 'MPSAFE' that specifies that a syscall does not want the BGL to be grabbed automatically. Add the new MPSAFE flag to the syscalls that dillon has determined to be MPSAFE.	2000-04-03 06:36:14 +00:00
Poul-Henning Kamp	282ac69ede	Clone bio versions of certain bits of infrastructure: devstat_end_transaction_bio() bioq_* versions of bufq_* incl bioqdisksort() the corresponding "buf" versions will disappear when no longer used. Move b_offset, b_data and b_bcount to struct bio. Add BIO_FORMAT as a hack for fd.c etc. We are now largely ready to start converting drivers to use struct bio instead of struct buf.	2000-04-02 19:08:05 +00:00
Matthew Dillon	7c8fdcbd19	Make the sigprocmask() and geteuid() system calls MP SAFE. Expand commentary for copyin/copyout to indicate that they are MP SAFE as well. Reviewed by: msmith	2000-04-02 17:52:43 +00:00
Poul-Henning Kamp	c244d2de43	Move B_ERROR flag to b_ioflags and call it BIO_ERROR. (Much of this done by script) Move B_ORDERED flag to b_ioflags and call it BIO_ORDERED. Move b_pblkno and b_iodone_chain to struct bio while we transition, they will be obsoleted once bio structs chain/stack. Add bio_queue field for struct bio aware disksort. Address a lot of stylistic issues brought up by bde.	2000-04-02 15:24:56 +00:00
Poul-Henning Kamp	8c125869a9	Draw the outline of "struct bio". Struct bio is the future carrier of I/O requests for "struct buf".	2000-04-02 09:26:51 +00:00
Matthew Dillon	e4649cfac3	Change the write-behind code to take more care when starting async I/O's. The sequential read heuristic has been extended to cover writes as well. We continue to call cluster_write() normally, thus blocks in the file will still be reallocated for large (but still random) I/O's, but I/O will only be initiated for truly sequential writes. This solves a number of annoying situations, especially with DBM (hash method) writes, and also has the side effect of fixing a number of (stupid) benchmarks. Reviewed-by: mckusick	2000-04-02 00:55:28 +00:00
Brian Feldman	76e90dbcc9	Unstaticize this driver. You can have as many snoop devices as you can mknod :) Clean things up a lot while I'm here. A lot of KNF changes.	2000-04-02 00:35:37 +00:00
Warner Losh	ce73953a1e	device_set_unit() DO NOT USE THIS. This was approved before 4.0 release for inclusion into the release, but bde talked me out of committing the module that needs this until after the release. It is after the release now. :-)	2000-04-01 06:06:37 +00:00
Peter Wemm	a84e0a1cfe	Remove #ifdef for sem_wakeup() - we just use wakeup().	2000-03-30 11:35:25 +00:00
Peter Wemm	255108f385	Make sysv-style shared memory tuneable params fully runtime adjustable via sysctl. It's done pretty simply but it should be quite adequate. Also move SHMMAXPGS from $machine/include/vmparam.h as the comments that went with it were wrong... we don't allocate KVM space for the pages so that comment is bogus.. The only practical limit is how much physical ram you want to lock up as this stuff isn't paged out or swap backed.	2000-03-30 07:17:05 +00:00
Matthew Dillon	db6a426158	The SMP cleanup commit broke UP compiles. Make UP compiles work again.	2000-03-28 18:06:49 +00:00
Matthew Dillon	36e9f877df	Commit major SMP cleanups and move the BGL (big giant lock) in the syscall path inward. A system call may select whether it needs the MP lock or not (the default being that it does need it). A great deal of conditional SMP code for various deadended experiments has been removed. 'cil' and 'cml' have been removed entirely, and the locking around the cpl has been removed. The conditional separately-locked fast-interrupt code has been removed, meaning that interrupts must hold the CPL now (but they pretty much had to anyway). Another reason for doing this is that the original separate-lock for interrupts just doesn't apply to the interrupt thread mechanism being contemplated. Modifications to the cpl may now ONLY occur while holding the MP lock. For example, if an otherwise MP safe syscall needs to mess with the cpl, it must hold the MP lock for the duration and must (as usual) save/restore the cpl in a nested fashion. This is precursor work for the real meat coming later: avoiding having to hold the MP lock for common syscalls and I/O's and interrupt threads. It is expected that the spl mechanisms and new interrupt threading mechanisms will be able to run in tandem, allowing a slow piecemeal transition to occur. This patch should result in a moderate performance improvement due to the considerable amount of code that has been removed from the critical path, especially the simplification of the spl*() calls. The real performance gains will come later. Approved by: jkh Reviewed by: current, bde (exception.s) Some work taken from: luoqi's patch	2000-03-28 07:16:37 +00:00
Matthew Dillon	7c58e473f5	Commit the buffer cache cleanup patch to 4.x and 5.x. This patch fixes a fragmentation problem due to geteblk() reserving too much space for the buffer and imposes a larger granularity (16K) on KVA reservations for the buffer cache to avoid fragmentation issues. The buffer cache size calculations have been redone to simplify them (fewer defines, better comments, less chance of running out of KVA). The geteblk() fix solves a performance problem that DG was able to reproduce. This patch does not completely fix the KVA fragmentation problems, but it goes a long way. Mostly Reviewed by: bde and others Approved by: jkh	2000-03-27 21:29:33 +00:00
Kris Kennaway	8c6ac5e5a5	Reword warning to make it clearer (I read it as "remove block devices created before 2000-06-01" which is obviously not what was intended :-)	2000-03-25 21:10:20 +00:00
Matthew Dillon	f1924a54f8	Fix in-kernel infinite loop in pipe_write() when the reader goes away at just the wrong time.	2000-03-24 00:47:37 +00:00
Poul-Henning Kamp	e004067750	Whine at users who still have block devices in /dev, give them until June 1st to fix their system.	2000-03-21 19:25:56 +00:00
Paul Saab	e5a28db9f5	Add sysctl kern.coredump to enable/disable core dumps system wide.	2000-03-21 07:10:42 +00:00
Brian Feldman	16aae9cbc0	Split the logic of static int setrootbyname(char *name); out into dev_t getdiskbyname(char *name); This makes it easy to create a new DDB command, which is the big reason for the change. You can now do the following in DDB: Example rc.conf entry: dumpdev="/dev/ad0s1b" # Device name to crashdump to (if enabled). db> show disk/ad0s1b dev_t = 0xc0b7ea00 db> p *dumpdev c0b7ea00	2000-03-20 16:28:35 +00:00
Poul-Henning Kamp	91266b96c4	Isolate the Timecounter internals in their own two files. Make the public interface more systematically named. Remove the alternate method, it doesn't do any good, only ruins performance. Add counters to profile the usage of the 8 access functions. Apply the beer-ware to my code. The weird +/- counts are caused by two repocopies behind the scenes: kern/kern_clock.c -> kern/kern_tc.c sys/time.h -> sys/timetc.h (thanks peter!)	2000-03-20 14:09:06 +00:00
Poul-Henning Kamp	ce6acbb664	diff, patch and cvs didn't like these three last time around, try again.	2000-03-20 12:34:21 +00:00
Poul-Henning Kamp	b99c307a21	Rename the existing BUF_STRATEGY() to DEV_STRATEGY() substitute BUF_WRITE(foo) for VOP_BWRITE(foo->b_vp, foo) substitute BUF_STRATEGY(foo) for VOP_STRATEGY(foo->b_vp, foo) This patch is machine generated except for the ccd.c and buf.h parts.	2000-03-20 11:29:10 +00:00
Poul-Henning Kamp	21144e3bf1	Remove B_READ, B_WRITE and B_FREEBUF and replace them with a new field in struct buf: b_iocmd. The b_iocmd is enforced to have exactly one bit set. B_WRITE was bogusly defined as zero giving rise to obvious coding mistakes. Also eliminate the redundant struct buf flag B_CALL, it can just as efficiently be done by comparing b_iodone to NULL. Should you get a panic or drop into the debugger, complaining about "b_iocmd", don't continue. It is likely to write on your disk where it should have been reading. This change is a step in the direction towards a stackable BIO capability. A lot of this patch was machine generated (Thanks to style(9) compliance!) Vinum users: Greg has not had time to test this yet, be careful.	2000-03-20 10:44:49 +00:00
Bill Fenner	95b2b777b5	Make sure to free the socket in soabort() if the protocol couldn't free it (this could happen if the protocol already freed its part and we just kept the socket around to make sure accept(2) didn't block)	2000-03-18 08:56:56 +00:00
Chris Costello	b081a64afb	In vn_isdisk(), check whether vp->v_rdev is NULL. If it is, then return ENXIO (Device not configured). Without this, vn_isdisk() could (and did in the case of lstat() under fdesc) pass a NULL pointer to devsw(), which caused a page fault. Reviewed by: alfred	2000-03-18 01:27:44 +00:00
Nick Hibma	846664235c	Instead of using the next unit available, use the first unit available. This avoids the unit number from going up indefinitely when disconnecting and connecting 2 devices alternately. Noticed by: nsayer (quite a while ago) And stop calling DEVICE_NOMATCH at probe repeatedly. This stops the message on the PCI VGA board from being printed when loading a PCI driver.	2000-03-16 09:32:59 +00:00
Poul-Henning Kamp	db5f635acc	Eliminate the undocumented, experimental, non-delivering and highly dangerous MAX_PERF option.	2000-03-16 08:51:55 +00:00
Jun Kuriyama	dc76063419	Print "previous type" correctly when INVARIANTS is defined. Reviewed by: current@FreeBSD.org	2000-03-14 14:58:04 +00:00
Bruce Evans	05ecdd7037	Don't try so hard to make the lower 16 bits of fsids unique. It tended to recycle full fsids after only 16 mount/unmount's. This is probably too often for exported fsids. Now we recycle the full fsids only after 2^16 mount/ umount's and only ensure uniqueness in the lower 16 bits if there have been <= 256 calls to vfs_getnewfsid() since the system started.	2000-03-14 14:19:49 +00:00
Brian S. Dean	56fc73ff9b	In 'ipcperm()', only call 'suser()' if it is actually required. Previously, it was being called whether it was needed or not and the ASU flag was being set (as a side effect of calling 'suser()') in cases where superuser privileges were not actually needed. This was all pointed out to me by Bruce Evans. Reviewed by: bde	2000-03-13 23:00:08 +00:00
Poul-Henning Kamp	7de472559c	Remove unused 3rd argument from vsunlock() which abused B_WRITE.	2000-03-13 10:47:24 +00:00
Bruce Evans	61214975da	Try harder to make the lower 16 bits of fsids unique. The vfs type number was packed very wastefully, giving perfect non-uniqueness in the lower 16 bits of fsids for filesystems with the same vfs type. This made linux_stat() return perfectly non-unique (broken) 16-bit st_dev's for nfs mount points, and effectively reduced mntid_base to 8 bits so that the vfs_getnewfsid() looped endlessly when there are already 256 mounted filesystems with the required vfs type. Approved by: jkh	2000-03-12 14:23:21 +00:00
Alan Cox	af25d10c91	shmat: If VM_PROT_READ_IS_EXEC is defined and prot includes VM_PROT_READ, VM_PROT_EXECUTE must be added to prot before calling vm_map_find. Without this change, an mprotect on a shmat'ed region fails (when it shouldn't). This bug was reported Feb 28 by Brooks Davis <brooks@one-eyed-alien.net> on -hackers. Reviewed by: bde Approved by: jkh	2000-03-10 09:11:24 +00:00
Yoshinobu Inoue	8692c02553	Enable SCM_RIGHTS on alpha. Allocate necessary buffer as conversion between int and struct file *. Approved by: jkh Submitted by: brian Reviewed by: bde, brian, peter	2000-03-09 15:15:27 +00:00
Bruce Evans	a4fcac54a1	Fixed a null pointer panic for dumpon(8) on a nonexistent device whose driver uses the new disk layer. Reviewed by: phk Approved by: jkh	2000-03-09 12:40:41 +00:00
Yoshinobu Inoue	7d0d8dc306	CMSG_XXX macros alignment fixes to follow RFC2292. Approved by: jkh Submitted by: Partly from tech@openbsd Reviewed by: itojun	2000-03-03 11:13:12 +00:00
Peter Dufault	6d9a8d3e8f	I applied the wrong patch set. Back out anything associated with the known bogus currtpriority. This undoes the previous changes to sys/i386/i386/trap.c, sys/alpha/alpha/trap.c, sys/sys/systm.h Now we have the patch set approved by bde. Approved by: bde	2000-03-02 22:03:49 +00:00
Peter Dufault	383774c417	Patches that eliminate extra context switches in FIFO case. Fixes p1003_1b regression test in the simple case of no RR and FIFO processes competing. Reviewed by: jkh, bde	2000-03-02 16:20:07 +00:00
Brian S. Dean	e777d9c31a	Fix a superuser credential check. Reviewed by: phk Approved by: jkh	2000-02-29 22:58:59 +00:00
Doug Rabson	1d9a6ae08b	If a driver probe fails, unset it from the device. This fixes a problem with certain multiport cards. Approved by: jkh	2000-02-29 09:36:25 +00:00
Paul Saab	77ac690c97	Update a comment in elf_coredump to reflect that if you madvise with MADV_NOCORE, its address space is also excluded from a core file. Pointed out by: alc	2000-02-28 06:36:45 +00:00
Paul Saab	9730a5daab	Add MAP_NOCORE to mmap(2), and MADV_NOCORE and MADV_CORE to madvise(2). This feature allows you to specify if mmap'd data is included in an application's corefile. Change the type of eflags in struct vm_map_entry from u_char to vm_eflags_t (an unsigned int). Reviewed by: dillon,jdp,alfred Approved by: jkh	2000-02-28 04:10:35 +00:00
Jordan K. Hubbard	8d0bf3d6f8	Add new oid, debug.boothowto. This allows userland apps to see how the kernel was booted and perhaps do conditional things based upon it (sysinstall, for example, will now turn Debug mode on automatically if boot -v was done). Submitted by: msmith Suggested by: ulf	2000-02-25 11:43:08 +00:00
Yoshinobu Inoue	0b97e97cd2	Add length check to sbcreatecontrol(). Now this check is necessary because IPv6 source routing might use control data bigger than MLEN. (e.g. 16bytes IPv6 addr x 23 hops) Actually mbuf cluster should be used in uipc_socket.c:sbcreatecontrol() and uipc_syscalls.c:sockargs() when data size is bigger than MLEN, and such patches were already in KAME environment and have been confirmed to work well. I just forgot to merge them into 4.0, sorry. For safety, I'll postpone such patches until after 4.0 release. The effects of the postponement are as follows. -Ping6 source routing hops are limited to around 6 or so. -If some apps do setsockopt IPV6_RTHDR and try to receive incoming IPv6 source routing info, it can't receive more than 6 hops source routing info. (But currently, no apps seem to be doing it.) Approved by: jkh	2000-02-24 19:21:26 +00:00
Jason Evans	dd85920a4f	Add the VFS_AIO config option and leave it off by default. Unless the VFS_AIO option is specified, all aio-related syscalls return ENOSYS. The aio code is very fragile right now, and is unsuitable for default inclusion in a production shell box. Approved by: jkh	2000-02-23 07:44:25 +00:00
Brian S. Dean	de8050f9b8	Don't forget to reset the hardware debug registers when a process that was using them exits. Don't allow a user process to cause the kernel to take a TRCTRAP on a user space address. Reviewed by: jlemon, sef Approved by: jkh	2000-02-20 20:51:23 +00:00
Peter Wemm	f082218c18	Fix select(2) for the Alpha. (!!) It was never returning true for fd's in the range of 32-63, 96-127 etc. The first problem was the FD_*() macros were shifting a 32 bit integer "1" left by more than 32 bits. The same problem happened in selscan(). ffs() also takes an int argument and causes failure. For cases where int == long (ie: the usual case for x86, but not always as gcc can have long being a 64 bit quantity) ffs() could be used. Reported by: Marian Stagarescu <marian@bile.skycache.com> Reviewed by: dfr, gallatin (sys/types.h only) Approved by: jkh	2000-02-20 13:36:26 +00:00
Søren Schmidt	a6e23e28c1	Hide the "devclass_alloc_unit: %s%d already exists, using next available..." behind bootverbose Approved by: jkh	2000-02-20 10:07:28 +00:00
Søren Schmidt	47351d2774	Update the ata driver to take more advantage of newbus, this was needed to make attach/detach of devices work, which is needed for the PCCARD support. (PCCARD support is still not working though, more to come on that) Support the CMD646 chip which is used on many alphas, sadly only in WDMA2 mode, as the silicon is broken beyond belief for UDMA modes. Lots of cosmetic fixes here and there. Sorry for the size of this megapatchfromhell but it was not possible otherwise... newbus patches based on work from: dfr (Doug Rabson)	2000-02-18 20:57:33 +00:00
Mike Smith	bb328b87c1	Change the mountroot prompt to something that doesn't look at all like a firmware prompt. Several sleepy folk mistook the '>>>' for the SRM prompt, which was never the desired idea. Submitted by: Andrew Gallatin <gallatin@cs.duke.edu> Approved by: jkh	2000-02-17 23:32:08 +00:00
Matthew Dillon	1f6889a1eb	Fix null-pointer dereference crash when the system is intentionally run out of KVM through a mmap()/fork() bomb that allocates hundreds of thousands of vm_map_entry structures. Add panic to make null-pointer dereference crash a little more verbose. Add a new sysctl, vm.max_proc_mmap, which specifies the maximum number of mmap()'d spaces (discrete vm_map_entry's in the process). The value defaults to around 9000 for a 128MB machine. The test is scaled for the number of processes sharing a vmspace (aka linux threads). Setting the value to 0 disables the feature. PR: kern/16573 Approved by: jkh	2000-02-16 21:11:33 +00:00
Joerg Wunsch	2c6610f343	Hide the boring ``not probed (disabled)'' messages behind `bootverbose'. This unspams the boot messages, concentrating on the drivers that have actually been probed. This basically resurrects revision 1.106 from old /sys/i386/isa/isa.c. Reviewed by: jkh, dfr	2000-02-15 19:23:34 +00:00
Poul-Henning Kamp	693e27d473	Don't try to account for the partial quantum unless the process is curproc. This only makes any difference on SMP, where we used a (potentially very bogus) switchtime from our own CPU to calculate resource usage on another CPU. This should remove some if not all calcru() related warnings on SMP. Approved by: jkh	2000-02-15 09:02:07 +00:00
Martin Cracauer	30de91e8b8	Allow comments in interpreter specification lines as in #! /bin/sh # -*- perl -*- This is simply "delete everything after the next '#', not counting the first char in the line". No effort has been made to allow quoting, backslash escaping or '#' in interpreter names. This complies with POSIX 1003.2 in that Posix says the implementation is free to choose whatever it likes. PR: bin/16393	2000-02-15 08:49:57 +00:00
Peter Wemm	194a0b6c97	Avoid a panic in __getcwd(2) when combined with umount -f.	2000-02-14 06:09:01 +00:00
Poul-Henning Kamp	f1220da49c	Fix sign reversal in adjtime(2). Approved by: jkh	2000-02-13 10:56:32 +00:00
Robert Watson	83f1e257e0	Yet-another-update: rename ``kern.prison'' to a new sysctl root entry, ``jail'', and move the set_hostname_allowed sysctl there, as well as fixing a bug in the sysctl that resulted in jails being over-limited (preventing them from reading as well as writing the hostname). Also, correct some formatting issues, courtesy bde :-). Reviewed by: phk Approved by: jkh	2000-02-12 13:41:56 +00:00
Robert Watson	5bdee2c5d5	Fix sysctl namespace for jail: move the kern.jailcansethostname to kern.prison.set_hostname_allowed, off of the kern.prison node. Future jail twiddles should be placed in this namespace.	2000-02-10 18:51:58 +00:00
Robert Watson	6c144e7521	Introduce a new sysctl, kern.jailcansethostname, which determines whether or not a process in a jail, with privilege, may set the jail's hostname. Defaults to 1, which permits this. May be set to 0 by a process with appropriate privilege outside of jail. Preventing hostname renaming from within a jail is currently required to make jails manageable, as they are currently identifiable only by hostname using /proc, which may be modified without this sysctl being set to 0. This will be documented in upcoming man commits. Authorized by: jkh, the ever-patient	2000-02-10 05:32:03 +00:00
Robert Watson	35a0a88fda	Correct an oversight in jail() that allowed processes in jail to access ptys in ways that might be unethical, especially towards processes not in jail, or in other jails. Submitted by: phk Reviewed by: rwatson Approved by: jkh	2000-02-09 03:32:11 +00:00
Poul-Henning Kamp	9b6d9dba20	Also allow non-root processes to setproctitle() Submitted by: Paul Saab <paul@mu.org> Approved by: jkh	2000-02-08 19:54:15 +00:00
Søren Schmidt	e8359a57de	Do refcounting of open devices (more) correctly. count_dev function by phk.	2000-02-07 23:05:40 +00:00
Robert Watson	b7a5f3ca1b	Remove static qualifier from vgonel, as it is needed by the Arla folk outside of vfs_subr.c. Submitted by: Assar Westerlund <assar@sics.se> Reviewed by: rwatson Approved by: jkh	2000-02-02 07:07:17 +00:00
Peter Wemm	d2b4236a60	Don't refer to TABLDISC in the comments here. Submitted by: bde Approved by: jkh	2000-01-30 10:14:13 +00:00
Peter Wemm	f7b79efbc1	Remove sys/tablet.h and kern/tty_tb.c (the old RS232 CAD-style tablet support code). It hasn't worked since at least October 1995, and probably has never worked in the FreeBSD 2.0+ tree. Obviously it's not a priority to many folks. Reviewed by: phk, sos	2000-01-29 16:34:46 +00:00
Robert Watson	9a2b8fca80	This patch fixes a locking bug that can result in deadlock if the codepath is followed. From the PR: vclean calls vrele leading to deadlock (if usecount > 0) vclean() calls vrele() if v_usecount of the node was higher than one. But before calling it, it sets the VXLOCK flag, which will make vn_lock called from vrele dead-lock. PR: kern/15117 Submitted by: Assar Westerlund <assar@stacken.kth.se> Reviewed by: rwatson Obtained from: NetBSD	2000-01-29 15:22:58 +00:00
Poul-Henning Kamp	1edde29e97	rename disk_delete() to disk_destroy().	2000-01-28 20:49:43 +00:00
Brian Feldman	8950d24456	Fix a bug that could crash the system if you press ^T while a slower system is slowed down and in the right spot (a race condition in fork()). The "previous time" fields have moved from pstat to proc. Anything which uses KVM needs to be recompiled with a new libkvm/headers. A couple wacky u_quad_t's in struct proc are now u_int64_t (the same, but according to lack of 'quad's in proc.h and usage in kern_resource.c). This will have no effect on code. This has been make-world-and-installed-new-kernel-which-works-fine-tested. Reviewed by: bde (previous version)	2000-01-28 20:40:29 +00:00
Archie Cobbs	ee51b2c45f	Back out previous commit; it was premature.	2000-01-28 17:11:07 +00:00
Bruce Evans	b473d62c73	Fixed a memory leak for slices with an (unsupported) bad sector table. Broken in: rev.1.80.	2000-01-28 11:51:08 +00:00
Bruce Evans	7b17f8ffbd	Don't permit generation of non-physical disk addresses. subr_diskmbr.c: Don't "helpfully" enlarge our idea of the disk size to cover all the primary slices. Instead, truncate or discard slices that don't seem to be on the disk. The enlargement was a hack for disks that don't report their size (e.g., MFM disks). It is just wrong in general. wd.c: In CHS mode, limit the disk size so that cylinder numbers >= 65536 cannot occur. This normally only affects disks larger than 33.8GB. CHS mode accesses to addresses above the limit are now properly broken (an error is returned instead of garbage for reads and disk corruption for writes). PR: 15611 Reviewed by: readers of freebsd-bugs did not respond to a request for review	2000-01-28 10:22:07 +00:00
David Greenman	27b8623f21	Fixed sign and overflow bugs that caused the allocation size of the kernel malloc region (kmem_map) to be wrong and semi-random on systems with more than 1GB of RAM. This is not a complete fix, but is sufficient for machines with 4GB or less of memory. A complete fix will require some changes to the getenv stuff so that 64bit values can be passed around. NOT FIXED: machines with more than 4GB of RAM (e.g. some large Alphas) since we're still using ints to hold some of the values. Reviewed by: bde	2000-01-28 04:04:58 +00:00
Archie Cobbs	d6113ed044	When an attempt to install a line discipline fails, check for known KLD's that might support it, and load the KLD if found. Currently the list includes SLIPDISC, PPPDISC, and NETGRAPHDISC.	2000-01-28 02:22:22 +00:00
Bruce Evans	6bfb820292	Quick fix for stack overflow when there are more than about 25 slices. Using recursion to traverse the recursive data structure for extended partitions was never good, but when slice support was implemented in 1995, the recursion worked for the default maximum number of slices (32), and standard fdisk utilities didn't support creating more than the default number. Even then, corrupt extended partitions could cause endless recursion, because we attempt to check all slices, even ones which we don't turn into devices. The recursion has succumbed to creeping features. The stack requirements for each level had grown to 204 bytes on i386's. Most of the growth was caused by adding a 64-byte copy of the DOSpartition table to each frame. The kernel stack size has shrunk to about 5K on i386's. Most of the shrinkage was caused by the growth of `struct sigacts' by 2388 bytes to support 128 signals. Linux fdisk (a 1997 version at least) can now create 60 slices (4 standard ones, 56 for logical drives within extended partitions, and it seems to be leaving room to map the 4 BSD partitions on my test drive), and Linux (2.2.29 and 2.3.35 at least) now reports all these slices at boot time. The fix limits the recursion to 16 levels (4 + 16 slices) and recovers 32 bytes per level caused by gcc pessimizing for space. Switching to a static buffer doesn't cause any problems due to recursion, since the buffer is not passed down. Using a static buffer is wrong in general because it requires the giant lock to protect it. However, this problem is small compared with using a static buffer for dsname(). We sometimes neglect to copy the result of dsname() before sleeping. Also fixed slice names when we find more than MAX_SLICES (32) slices. The number of the last slice found was not passed recursively. The limit on the recursion now prevents finding more than 32 slices with a standard extended partition data structure anyway.	2000-01-27 05:11:29 +00:00
Kirk McKusick	7881bb5d5f	Add soft updates to the set of things being tagged. Syntax cleanup.	2000-01-27 01:22:06 +00:00
Bruce Evans	2f40e526a5	Improved English in the messages printed by diskerr(). Fixed some formatting bugs.	2000-01-26 10:28:23 +00:00
Bruce Evans	f4675a30ed	Don't follow null pointers if we somehow have a null devswitch entry despite having a non-null cn_tab entry. This case now works the same as if there is no physical console, except i/o at the kernel printf level may still work. This frees drivers of physical console drivers from the responsibility of attaching the device no matter what.	2000-01-25 09:20:08 +00:00
Bruce Evans	dd7f8ecff6	Fixed some style bugs (mainly ones associated with the bogus name condev_t for a non-typedef).	2000-01-24 11:48:11 +00:00
Boris Popov	8dc74b6288	Backout previous commit. It was a mistake.	2000-01-23 15:47:46 +00:00
Boris Popov	9e991dfa2f	Replace non obvious number with SPECNAMELEN constant. Reviewed by: phk	2000-01-23 14:58:53 +00:00
Poul-Henning Kamp	7fd299cb92	Add a couple of strategic sysctls for monitoring. In the rather obscure case of hardpps(), use a type-II PLL if the external signal is phase locked, but a FLL if it isn't.	2000-01-23 14:52:37 +00:00
Warner Losh	27e2c03a27	Fix the style bugs in the style bugs fix. The style bug fix made the new function inconsistent with the rest of this file. The spelling and grammar fixes were good and remain.	2000-01-21 06:57:52 +00:00
Brian Feldman	bd9079fa6c	Fix style bugs in the last commit.	2000-01-21 02:52:54 +00:00
Warner Losh	7001be49f8	bdeize last commit: o Remove opt_dontuse.h and ifdef PROCFS Submitted by: bde, peter	2000-01-20 17:03:53 +00:00
Jason Evans	b7592c7bea	Back out the previous spl change, since it opens a race window. Reviewed by: alfred, dillon, peter	2000-01-20 08:15:13 +00:00
Warner Losh	5e2664428c	When we are execing a setugid program, and we have a procfs filesystem file open in one of the special file descriptors (0, 1, or 2), close it before completing the exec. Submitted by: nergal@idea.avet.com.pl Constructive comments: deraadt@openbsd.org, sef, peter, jkh	2000-01-20 07:12:52 +00:00
Jason Evans	60ffb01993	Don't tsleep() while at splbio(). Correctly return EINPROGRESS from aio_error() even when an aio request is still in the socket queue. Submitted by: Adrian Chadd <adrian@bofh.co.uk>	2000-01-20 01:59:58 +00:00
Robert Watson	8f0738756c	Fix bde'isms in acl/extattr syscall interface, renaming syscalls to prettier (?) names, adding some const's around here, et al. Reviewed by: bde	2000-01-19 06:07:34 +00:00
Robert Watson	9b0be035b8	Fix bde'isms in acl/extattr syscall interface, renaming syscalls to prettier (?) names, adding some const's around here, et al. Commit 2 out of 3. Reviewed by: bde	2000-01-19 06:02:31 +00:00
Robert Watson	5134b3e92a	Fix bde'isms in acl/extattr syscall interface, renaming syscalls to prettier (?) names, adding some const's around here, et al. Commit 1 out of 3. Reviewed by: bde	2000-01-19 06:01:07 +00:00
Kirk McKusick	71c87cfd7e	Need to reset the buffer pointer to avoid reconsidering the same buffer again (without this the rollback analysis was being lost). Should reduce the write count for most workloads. Submitted by: Craig A Soules <soules+@andrew.cmu.edu>	2000-01-18 02:13:26 +00:00
Brian Feldman	f582ac0630	Fix vn_isdisk() usage to make AIO work on non-disk-files again, rather than just return ENOTBLK. PR: 16163 Submitted by: Adrian Chadd <adrian@FreeBSD.org>	2000-01-17 21:18:39 +00:00
Peter Wemm	8ccd633455	Implement setres[ug]id() and getres[ug]id(). This has been sitting in my tree for ages (~2 years) waiting for an excuse to commit it. Now Linux has implemented it and it seems that Staroffice (when using the linux_base6.1 port's libc) calls this in the linux emulator and dies in setup. The Linux emulator can call these now.	2000-01-16 16:34:26 +00:00
Poul-Henning Kamp	2a277567e2	Cleanup some more remaining bdev fluff.	2000-01-16 09:25:34 +00:00
Jason Evans	bfbbc4aa44	Add aio_waitcomplete(). Make aio work correctly for socket descriptors. Make gratuitous style(9) fixes (me, not the submitter) to make the aio code more readable. PR: kern/12053 Submitted by: Chris Sedore <cmsedore@maxwell.syr.edu>	2000-01-14 02:53:29 +00:00
Matthew N. Dodd	de9cfdd736	Allow SMP systems with an MCA bus to work properly. Reviewed by: peter	2000-01-13 09:09:02 +00:00
Luoqi Chen	49503b44fd	Seconds to ticks conversion was done at the wrong place.	2000-01-12 17:26:42 +00:00
Kazutaka YOKOTA	35e61cbd71	Add a new mechanism, cndbctl(), to tell the console driver that ddb is entered. Don't refer to `in_Debugger' to see if we are in the debugger. (The variable used to be static in Debugger() and wasn't updated if ddb is entered via traps and panic anyway.) - Don't refer to `in_Debugger'. - Add `db_active' to i386/i386/db_interface.d (as in alpha/alpha/db_interface.c). - Remove cnpollc() stub from ddb/db_input.c. - Add the dbctl function to syscons, pcvt, and sio. (The function for pcvt and sio is noop at the moment.) Jointly developed by: bde and me (The final version was tweaked by me and not reviewed by bde. Thus, if there is any error in this commit, that is entirely of mine, not his.) Some changes were obtained from: NetBSD	2000-01-11 14:54:01 +00:00
Poul-Henning Kamp	d685023e68	Also handle zero return from dscheck(). PR: 15956	2000-01-10 12:21:39 +00:00
Poul-Henning Kamp	ba4ad1fcea	Give vn_isdisk() a second argument where it can return a suitable errno. Suggested by: bde	2000-01-10 12:04:27 +00:00
Warner Losh	310086415a	Panic if proc0 hasn't been created and we try to call kthread_create. This prevents a more mysterious crash later. XXX The long term solution is defer creation of these things until XXX proc0 lives	2000-01-10 08:00:58 +00:00
Sean Eric Fagan	893618352c	Handle the case where we truss an SUGID program -- in particular, we need to wake up any processes waiting via PIOCWAIT on process exit, and truss needs to be more aware that a process may actually disappear while it's waiting. Reviewed by: Paul Saab <ps@yahoo-inc.com>	2000-01-10 04:09:05 +00:00
Kirk McKusick	cf60e8e4bf	Several performance improvements for soft updates have been added: 1) Fastpath deletions. When a file is being deleted, check to see if it was so recently created that its inode has not yet been written to disk. If so, the delete can proceed to immediately free the inode. 2) Background writes: No file or block allocations can be done while the bitmap is being written to disk. To avoid these stalls, the bitmap is copied to another buffer which is written thus leaving the original available for further allocations. 3) Link count tracking. Constantly track the difference in i_effnlink and i_nlink so that inodes that have had no change other than i_effnlink need not be written. 4) Identify buffers with rollback dependencies so that the buffer flushing daemon can choose to skip over them.	2000-01-10 00:24:24 +00:00
Kirk McKusick	bd5f5da94d	Add bwillwrite to all system calls that create things in the filesystem. Benchmarks that create huge trees of empty files overwhelm the buffer cache.	2000-01-10 00:08:53 +00:00
Kirk McKusick	411e1480fd	Remove the P_BUFEXHAUST flag from the syncer process (leaving it only on the buf_daemon process). The problem is that when the syncer process starts running the worklist, it wants to delete lots of files. It does this by VFS_VGET'ing the vnodes, clearing the blocks in them and bdwrite'ing the buffer. It can process close to a thousand files per second which generates a large number of dirty buffers. So, giving it special privilege at the buffer trough leads to trouble as the buf_daemon does occasionally need a free buffer to proceed and if the syncer has used every last one up, we are toast.	2000-01-10 00:07:24 +00:00
Eivind Eklund	e12d97d239	Change NDFREE() from a macro to a function for the time being; the macro version caused intolerable bloat (30k). I'm likely to revisit this with an attempt at a smarter macro. Bloat noticed by: bde	2000-01-08 16:20:06 +00:00
Luoqi Chen	5c8b298e0e	Allow SMP && NCPU == 1 to work. From now on, there's no restriction on the value of NCPU relative to the number of cpus physically present, the actual number of cpus utilized will be the smaller of the two.	2000-01-07 08:49:25 +00:00
Luoqi Chen	5e95083920	Introduce a mechanism to suspend/resume system processes. Suspend syncer and bufdaemon prior to disk sync during system shutdown.	2000-01-07 08:36:44 +00:00
Peter Wemm	8cb96f20f8	Export the nselcoll counter via the kern.nselcoll sysctl so we can see just how bad it gets in various situations. Reminded by: adrian	2000-01-05 19:40:17 +00:00
Matthew Dillon	c37c9620cd	Enhance reassignbuf(). When a buffer cannot be time-optimally inserted into vnode dirtyblkhd we append it to the list instead of prepending it to the list in order to maintain a 'forward' locality of reference, which is arguably better than 'reverse'. The original algorithm did things this way too, but at a huge time cost. Enhance the append interlock for NFS writes to handle intr/soft mounts better. Fix the hysteresis for NFS async daemon I/O requests to reduce the number of unnecessary context switches. Modify handling of NFS mount options. Any given user option that is too high now defaults to the kernel maximum for that option rather than the kernel default for that option. Reviewed by: Alfred Perlstein <bright@wintelcom.net>	2000-01-05 05:11:37 +00:00
Tor Egge	82916a1126	ISA device drivers use the ISA source interrupt number in locations where the low level interrupt handler number should be used. Change setup_apic_irq_mapping() to allocate low level interrupt handler X (Xintr${X}) for any ISA interrupt X mentioned in the MP table. Remove an assumption in the driver for the system clock (clock.c) that interrupts mentioned in the MP table as delivered to IOAPIC #0 intpin Y is handled by low level interrupt handler Y (Xintr${Y}) but don't assume that low level interrupt handler 0 (Xintr0) is used. Don't allocate two low level interrupt handlers for the system clock. Reviewed by: NOKUBI Hirotaka <hnokubi@yyy.or.jp>	2000-01-04 22:24:59 +00:00
Poul-Henning Kamp	fb01c24c11	Be more careful about NOUDEV and NODEV. Submitted by: bde	2000-01-04 12:51:50 +00:00
Poul-Henning Kamp	6a77f60d4a	Create a separate pps_offset variable to use for applying the hardpps() produced offset component. This is tested and behaved stable with frequency offsets from -338.05 to +499.91 PPM. Interestingly the machine I tested this on would fail if the clock were slower than 14.3132 MHz whereas it was perfectly happy to run at 16.384 MHz, in other words [-340PPM ... +14.4%] Make pps_shift tweakable with sysctl.	2000-01-04 12:04:39 +00:00
Poul-Henning Kamp	b9effc8901	truss /usr/bin/su, login (or not if root), then exit the shell: truss will get stuck in tsleep. I don't know if this is correct, but it fixes the problem, and according to the comments in pioctl.h, PF_ISUGID is set when we want to ignore UID changes. The code is checking for when PF_ISUGID is not set and since it never is set, we always ignore UID changes. Submitted by: Paul Saab <ps@yahoo-inc.com>	2000-01-03 14:26:47 +00:00
Poul-Henning Kamp	19c5221906	Don't use time_offset as a leaky bucket variable in hardpps(), this resulted in vastly optimistic offset values reported to userland (typically a factor 40+ too small). Apart from that, the code had two sign-bugs. Apply the hardpps() phase with the right sign with a simple scaling by integration interval. (This may be too stiff at long integration intervals, see below). Allow pps_shiftmax to be reduced again. Before this, the phase lock in hardpps() was broken, but due to two bugs mostly cancelling out, it would end up basically working with a large stochastic component. Now it behaves as one would expect: smooth and quiet. It seems that pps_shiftmax above 7..9 somewhere makes the phaselock too weak to hold onto random walk phase errors from a HP-105 OCXO, which basically means that it is too weak for real-life use with such integration times. This is yet to be resolved. Submitted to: Prof. Dave "NTP" Mills. Tested by: Terje Mathisen <Terje.Mathisen@hda.hydro.com>	1999-12-29 14:39:24 +00:00
Peter Wemm	8b70d192e3	Remove vnode_if.sh - it's a perl script. This stayed around for a while because bsd.kmod.mk is usually out of sync with kernel source. However bsd.kmod.mk has to be updated now because of the _KERNEL change so there is no need to keep this (pre-repo copy) version around.	1999-12-29 05:37:14 +00:00
Peter Wemm	c447342094	Change #ifdef KERNEL to #ifdef _KERNEL in the public headers. "KERNEL" is an application space macro and the applications are supposed to be free to use it as they please (but cannot). This is consistent with the other BSD's who made this change quite some time ago. More commits to come.	1999-12-29 05:07:58 +00:00
Mike Smith	736e4b67ae	Actively limit the allocation of mbufs to NMBUFS/nmbufs and mbuf clusters to NMBCLUSTERS/nmbclusters/kern.ipc.nmbclusters. Add a read-only sysctl kern.ipc.nmbufs matching kern.ipc.nmbclusters. Submitted by: Bosko Milekic <bmilekic@dsuper.net>	1999-12-28 06:35:57 +00:00
Bruce Evans	654f6be1c8	Changed the type used to represent the user stack pointer from `long *' to `register_t *'. This fixes bugs like misplacement of argc and argv on the user stack on i386's with 64-bit longs. We still use longs to represent "words" like argc and argv, and assume that they are on the stack (and that there is stack). The suword() and fuword() families should also use register_t.	1999-12-27 10:42:55 +00:00
Bruce Evans	9f79feec16	Fixed some type mismatches. p_retval[0] in struct proc has type register_t, so pointers to it must be passed around as `register_t *', not as `int *'. The type mismatches were non-benign on alphas, but the broken code is normally only configured by LINT.	1999-12-27 10:22:09 +00:00
Brian Feldman	c2696359ab	Correct an uninitialized variable use, which, unlike most times, is actually a bug this time. Submitted by: bde Reviewed by: bde	1999-12-27 06:31:53 +00:00
Bruce Evans	f85bdfcc66	Removed unused includes. Removed unused compatibility cruft for dup(). Using it today would just break dup() on fd's >= 64. Fixed some style bugs.	1999-12-26 14:07:43 +00:00
Bruce Evans	da211f5bf6	Use vfs_timestamp() instead of getnanotime() to set timestamps. This fixes incoherency of pipe timestamps relative to file timestamps in the usual case where getnanotime() is not used for the latter. (File and pipe timestamps are still incoherent relative to real time unless the vfs_timestamp_precision sysctl is set to 2 or 3).	1999-12-26 13:04:52 +00:00
Doug Rabson	54ac5b9b76	* Set the devclass of a device before calling the probe method. This allows device_printf() etc. to print something intelligible. * Allow device_set_devclass(dev, 0) for clearing the devclass.	1999-12-24 16:21:15 +00:00
Bruce Evans	586453fee2	Fixed a cast of a pointer to an integer of a possibly different size. Fixed casts of non-`void *' pointers to uintptr_t. Fixed related style bugs. This file uses perfectly non-KNF formatting for casts.	1999-12-24 15:33:36 +00:00
Kirk McKusick	02b0085406	Prettiness police: Identify flags in b_xflags with BX_ to distinguish them from flags in b_flags which are prefixed with B_	1999-12-22 03:11:04 +00:00
Alfred Perlstein	80ef02b65d	regenerate after making getfh a standard syscall.	1999-12-21 20:21:48 +00:00
Alfred Perlstein	20883b0f10	Make getfh a standard syscall instead of dependent on having NFSSERVER defined, useful for userland fileservers that want to use a filehandle type interface to the filesystem. Submitted by: Assar Westerlund assar@stacken.kth.se PR: kern/15452	1999-12-21 20:21:12 +00:00
Eivind Eklund	369dc8ceb8	Change incorrect NULLs to 0s	1999-12-21 11:14:12 +00:00
Matthew Dillon	8f95b97072	Reimplement buf_daemon / getnewbuf() interaction for dealing with stressful situations. buf_daemon now makes a distinction between being woken up and its sleep timing out, and as a consequence is now much better able to dynamically tune itself to its environment. Reviewed by: Alfred Perlstein <bright@wintelcom.net>	1999-12-20 20:28:40 +00:00
Eivind Eklund	6357e7b507	Make m_print const correct (avoids a warning)	1999-12-20 18:10:00 +00:00
Greg Lehey	580e7e5a0f	If we fail to find init, print out the search path used. This helps differentiate between one of three different scenarios: 1. No init. 2. Path to init munged by an incorrect loader configuration. 3. Root file system not mounted. Reviewed-by: billf	1999-12-20 02:50:49 +00:00
Poul-Henning Kamp	1b4ce5ce9b	Don't ignore return value from tsleep(). Spotted by: charnier	1999-12-19 12:36:41 +00:00
Robert Watson	91f37dcba1	Second pass commit to introduce new ACL and Extended Attribute system calls, vnops, vfsops, both in /kern, and to individual file systems that require a vfsop_ array entry. Reviewed by: eivind	1999-12-19 06:08:07 +00:00
Robert Watson	ef351daa32	First pass commit to introduce new ACL and Extended Attribute system calls. The second pass commit with all the supporting code will happen shortly afterwards. Reviewed by: eivind	1999-12-19 05:54:46 +00:00
Eivind Eklund	d776d82c4d	Since VOP_LOCK can be used to up and downgrade locks, it is not possible to say anything about the lockstate before and after it. Thus, change the lockspec from U L U to ? ? ?.	1999-12-18 23:01:52 +00:00
Brian Feldman	2cfa173c50	Woops, I'm so sorry I forgot this! From the last mbuf.h change: m_mballoc_wakeup() (inline) -> MMBWAKEUP() (macro) m_clalloc_wakeup() (inline) -> MCLWAKEUP() (macro) Noticed by: peter	1999-12-18 20:04:19 +00:00
Eivind Eklund	762e6b856c	Introduce NDFREE (and remove VOP_ABORTOP)	1999-12-15 23:02:35 +00:00
Brian Feldman	6c54410230	Bug fix: The variables "m_mclalloc_wid" and "m_mballoc_wid" were not in the proper place. They should have been in uipc_mbuf.c and have been global, not in mbuf.h and local per each file that uses mbuf.h. Sorta bug fix: In mbuf.h, the definitions of various things for KERNEL and not KERNEL cases were very screwy. This fixes all of that which I could find.	1999-12-14 02:23:14 +00:00
Tor Egge	f52523701c	Fix two problems with pipe_write(): 1. Data written beyond end of pipe buffer, causing kernel memory corruption. - Check that space is still valid after obtaining the pipe lock. - Defer the calculation of transfer size until the pipe lock has been obtained. - Update the pipe buffer pointers while holding the pipe lock. 2. Writes of size <= PIPE_BUF not always atomic. - Allow an internal write to span two contiguous segments, so writes of size <= PIPE_BUF can be kept atomic when wrapping around from the end to the start of the pipe buffer. PR: 15235 Reviewed by: Matt Dillon <dillon@FreeBSD.org>	1999-12-13 02:55:47 +00:00
Peter Wemm	6f77b2defc	Use a separate -c and -h mode. The vnode_if.c file is compiled only into the kernel while the vnode_if.h header is a bunch of inlines to call the code that is in the kernel. Generating the .h file on the fly is kinda bogus because it has to match the one compiled into the kernel. IMHO we should have kern/vnode_if.c and sys/vnode_if.h committed in the tree but that's another battle.	1999-12-12 16:43:05 +00:00
Peter Wemm	cfdd238335	Put on asbestos suit and put a splcam() around the 'Mounting root from..' message to stop it splitting. Every single scsi machine I've seen seems to reliably collide with this and it's rather annoying.	1999-12-12 16:34:43 +00:00
Peter Wemm	016fc47da3	The sysctl mod_xx hack is no longer required now that we have totally dynamic sysctl registration.	1999-12-12 16:30:34 +00:00
Brian Feldman	f48b807fc0	This is Bosko Milekic's mbuf allocation waiting code. Basically, this means that running out of mbuf space isn't a panic anymore, and code which runs out of network memory will sleep to wait for it. Submitted by: Bosko Milekic <bmilekic@dsuper.net> Reviewed by: green, wollman	1999-12-12 05:52:51 +00:00
Matthew Dillon	3854a87ef3	Remove accidental pollution unrelated to previous commit. The issue here is real but has not yet been discussed with Eivind.	1999-12-12 03:28:14 +00:00
Matthew Dillon	4f79d873c1	Add MAP_NOSYNC feature to mmap(), and MADV_NOSYNC and MADV_AUTOSYNC to madvise(). This feature prevents the update daemon from gratuitously flushing dirty pages associated with a mapped file-backed region of memory. The system pager will still page the memory as necessary and the VM system will still be fully coherent with the filesystem. Modifications made by other means to the same area of memory, for example by write(), are unaffected. The feature works on a page-granularity basis. MAP_NOSYNC allows one to use mmap() to share memory between processes without incurring any significant filesystem overhead, putting it in the same performance category as SysV Shared memory and anonymous memory. Reviewed by: julian, alc, dg	1999-12-12 03:19:33 +00:00
Eivind Eklund	6bdfe06ad9	Lock reporting and assertion changes. * lockstatus() and VOP_ISLOCKED() get a new process argument and a new return value: LK_EXCLOTHER, when the lock is held exclusively by another process. * The ASSERT_VOP_(UN)LOCKED family is extended to use what this gives them * Extend the vnode_if.src format to allow more exact specification than locked/unlocked. This commit should not do any semantic changes unless you are using DEBUG_VFS_LOCKS. Discussed with: grog, mch, peter, phk Reviewed by: peter	1999-12-11 16:13:02 +00:00
Peter Wemm	4537138981	Zap c_index() and c_rindex(). Bruce prefers these to implicitly convert a const into a non-const as they do in libc. I feel that defeating the type checking like that is quite evil, but that's the way it is.	1999-12-10 17:38:41 +00:00
Poul-Henning Kamp	2ac0d0a1f7	Make adjtime(2) adjust boottime so it doesn't cause non-monotonous uptime.	1999-12-08 10:02:12 +00:00
Poul-Henning Kamp	e8452d59e3	Scan cdevs for potential root devices, rather than bdevs.	1999-12-08 10:01:18 +00:00
Poul-Henning Kamp	7d5961670c	Remove BAD144 support, it has already been disabled for some time.	1999-12-08 09:33:00 +00:00
Mike Smith	9eec696993	Change the default poweroff delay from 0 to 5 seconds. This seems to be adequate for the IDE disks that I have available for testing. Most seem to wait between 1 and 3 seconds before flushing their caches. Add the ability to override the delay at compile time via the undocumented option POWEROFF_DELAY. The delay can still be set via sysctl as it was originally implemented.	1999-12-07 04:35:37 +00:00
Poul-Henning Kamp	72dfe7a3d7	I always forget to check before I reboot a system, and while it boots I try in vain to remember which month or even year this system was last booted in. Print out the uptime before rebooting, and give people like me less (or more as it may be) to think about while the system boots.	1999-12-06 22:35:51 +00:00
Peter Wemm	bb6a234e47	Put on my asbestos underwear and commit the patch that I posted to -arch some time ago that changes kern.randompid from a boolean to a randomness range for the next pid assignment. Too high causes a lot of extra work to scan for free pids, and too low merely wastes randomness entropy. It's still possible to select a completely random range by using PID_MAX (100k) or -1 as a shortcut to mean "the whole range". Also, don't waste randomness when doing a wraparound.	1999-12-06 11:13:50 +00:00
Luoqi Chen	91c28bfde0	User ldt sharing.	1999-12-06 04:53:08 +00:00
Matt Jacob	9efed284d5	correct incomplete last change	1999-12-03 09:10:04 +00:00
Matthew N. Dodd	fe0d408987	Remove the 'ivars' argument to device_add_child() and device_add_child_ordered(). 'ivars' may now be set using the device_set_ivars() function. This makes it easier for us to change how arbitrary data structures are associated with a device_t. Eventually we won't be modifying device_t to add additional pointers for ivars, softc data etc. Despite my best efforts I've probably forgotten something so let me know if this breaks anything. I've been running with this change for months and it's been quite involved actually isolating all the changes from the rest of the local changes in my tree. Reviewed by: peter, dfr	1999-12-03 08:41:24 +00:00
Nick Hibma	39e86a12aa	Remove check for attached state. sc = devclass_get_softc(devclass, unit); doesn't return NULL during attach anymore, and produces the sc, identical to (for devclass_get_unit(devclass, unit) != NULL that is): sc = device_get_softc(devclass_get_unit(devclass, unit)); Reviewed-by: dfr	1999-12-02 16:30:21 +00:00
Archie Cobbs	1c38f2ea70	The functions m_copym() and m_copypacket() return read-only copies, because in the case of mbuf clusters they only increment the reference count rather than actually copying the data. Add comments to this effect, and add a new routine called m_dup() that returns a real, writable copy of an mbuf chain. This is preliminary work required for implementing 'ipfw tee'. Reviewed by: julian	1999-12-01 22:31:32 +00:00
Brian Feldman	226420a464	Separate some common sysctl code into sysctl_find_oid() and calling thereof. Also, make the errno returns _correct_, and add a new one which is more appropriate.	1999-12-01 02:25:19 +00:00
Kirk McKusick	e9cc475851	Collect read and write counts for filesystems. This new code drops the counting in bwrite and puts it all in spec_strategy. I did some tests and verified that the counts collected for writes in spec_strategy are identical to the counts that we previously collected in bwrite. We now also get read counts (async reads come from requests for read-ahead blocks). Note that you need to compile a new version of mount to get the read counts printed out. The old mount binary is completely compatible, the only reason to install a new mount is to get the read counts printed. Submitted by: Craig A Soules <soules+@andrew.cmu.edu> Reviewed by: Kirk McKusick <mckusick@mckusick.com>	1999-12-01 02:09:30 +00:00
Peter Wemm	ebc49c5654	Don't make the ktrace hook in tsleep() deref a null curproc after a panic. PR: 15169 Submitted by: David Gilbert <dgilbert@velocet.ca>	1999-11-30 09:01:46 +00:00
Matthew N. Dodd	44a451ba24	Reduce code duplication. Hopefully this clears up some confusion about the nature of devclass_get_softc() vs. device_get_softc() as well. The check against DS_ATTACHED remains as this is not a change that modifies functionality. Reviewed by: Peter "in principle" Wemm	1999-11-30 07:06:03 +00:00
Matthew Dillon	245efbba4d	Remove vfs_getrootfsid() function (a temporary hack added a few months ago to make BOOTP work again). It is no longer required by BOOTP and no longer used.	1999-11-29 22:25:36 +00:00
Poul-Henning Kamp	c464420c89	Report swapdevices as cdevs rather than bdevs. Remove unused dev2budev() function.	1999-11-29 21:37:18 +00:00
Poul-Henning Kamp	38941f351c	Remove the now unused chrtoblk() function.	1999-11-29 20:50:58 +00:00
Matthew Dillon	99e659dcfa	Make BOOTP work again. Submitted by: Doug Ambrisko <ambrisko@whistle.com>	1999-11-29 18:51:04 +00:00
Poul-Henning Kamp	8f04f6c729	Add a bit of sanity checking and problem avoidance in case the timecounter hardware is bogus. This will produce a new warning "microuptime() went backwards" and try to not screw up the process resource accounting.	1999-11-29 11:29:04 +00:00
Mike Smith	c0da4cacd0	Use the correct mounted-from path when allocating the root mount, if we know what it is. Be more correct in unbusying the mountpoint (especially before freeing it). Remove support for mounting 'r' devices as root. You don't mount 'r' devices anywhere else, and they're going away anyway. Submitted by: bde	1999-11-28 22:20:18 +00:00
Dan Moschuk	ee3fd60126	Introduce OpenBSD-like Random PIDs. Controlled by a sysctl knob (kern.randompid), which is currently defaulted off. Use ARC4 (RC4) for our random number generation, which will not get me executed for violating crypto laws; a Good Thing(tm). Reviewed and Approved by: bde, imp	1999-11-28 17:51:09 +00:00
Poul-Henning Kamp	ee072c08d0	Convert dumpon to work on character devices instead of block devices. NB: You may need to change your /etc/rc.conf!	1999-11-28 16:25:17 +00:00
Bruce Evans	bdf423572e	Scheduler fixes equivalent to the ones logged in the following NetBSD commit to kern_synch.c: ---------------------------- revision 1.55 date: 1999/02/23 02:56:03; author: ross; state: Exp; lines: +39 -10 Scheduler bug fixes and reorganization * fix the ancient nice(1) bug, where nice +20 processes incorrectly steal 10 - 20% of the CPU, (or even more depending on load average) * provide a new schedclk() mechanism at a new clock at schedhz, so high platform hz values don't cause nice +0 processes to look like they are niced * change the algorithm slightly, and reorganize the code a lot * fix percent-CPU calculation bugs, and eliminate some no-op code === nice bug === Correctly divide the scheduler queues between niced and compute-bound processes. The current nice weight of two (sort of, see `algorithm change' below) neatly divides the USRPRI queues in half; this should have been used to clip p_estcpu, instead of UCHAR_MAX. Besides being the wrong amount, clipping an unsigned char to UCHAR_MAX is a no-op, and it was done after decay_cpu() which can only _reduce_ the value. It has to be kept <= NICE_WEIGHT * PRIO_MAX - PPQ or processes can scheduler-penalize themselves onto the same queue as nice +20 processes. (Or even a higher one.) === New schedclk() mechanism === Some platforms should be cutting down stathz before hitting the scheduler, since the scheduler algorithm only works right in the vicinity of 64 Hz. Rather than prescale hz, then scale back and forth by 4 every time p_estcpu is touched (each occurrence an abstraction violation), use p_estcpu without scaling and require schedhz to be generated directly at the right frequency. Use a default stathz (well, actually, profhz) / 4, so nothing changes unless a platform defines schedhz and a new clock. Define these for alpha, where hz==1024, and nice was totally broke.
=== Algorithm change === The nice value used to be added to the exponentially-decayed scheduler history value p_estcpu, in _addition_ to being incorporated directly (with greater weight) into the priority calculation. At first glance, it appears to be a pointless increase of 1/8 the nice effect (pri = p_estcpu/4 + nice*2), but it's actually at least 3x that because it will ramp up linearly but be decayed only exponentially, thus converging to an additional .75 nice for a loadaverage of one. I killed this, it makes the behavior hard to control, almost impossible to analyze, and the effect (~~nothing at all for the first second, then somewhat increased niceness after three seconds or more, depending on load average) pointless. === Other bugs === hz -> profhz in the p_pctcpu = f(p_cpticks) calculation. Collect scheduler functionality. Try to put each abstraction in just one place. ---------------------------- The details are a little different in FreeBSD: === nice bug === Fixing this is the main point of this commit. We use essentially the same clipping rule as NetBSD (our limit on p_estcpu differs by a scale factor). However, clipping at all is fundamentally bad. It gives free CPU to the hoggiest hogs once they reach the limit, and reaching the limit is normal for long-running hogs. This will be fixed later. === New schedclk() mechanism === We don't use the NetBSD schedclk() (now schedclock()) mechanism. We require (real)stathz to be about 128 and scale by an extra factor of 2 compared with NetBSD's statclock(). We scale p_estcpu instead of scaling the clock. This is more accurate and flexible. === Algorithm change === Same change. === Other bugs === The p_pctcpu bug was fixed long ago. We don't try as hard to abstract functionality yet. Related changes: the new limit on p_estcpu must be exported to kern_exit.c for clipping in wait1(). Agreed with by: dufault	1999-11-28 12:12:14 +00:00
Bruce Evans	f0ebe4973f	Scheduler fixes equivalent to the ones logged in the following NetBSD commit to kern_synch.c: ---------------------------- revision 1.55 date: 1999/02/23 02:56:03; author: ross; state: Exp; lines: +39 -10 Scheduler bug fixes and reorganization * fix the ancient nice(1) bug, where nice +20 processes incorrectly steal 10 - 20% of the CPU, (or even more depending on load average) * provide a new schedclk() mechanism at a new clock at schedhz, so high platform hz values don't cause nice +0 processes to look like they are niced * change the algorithm slightly, and reorganize the code a lot * fix percent-CPU calculation bugs, and eliminate some no-op code === nice bug === Correctly divide the scheduler queues between niced and compute-bound processes. The current nice weight of two (sort of, see `algorithm change' below) neatly divides the USRPRI queues in half; this should have been used to clip p_estcpu, instead of UCHAR_MAX. Besides being the wrong amount, clipping an unsigned char to UCHAR_MAX is a no-op, and it was done after decay_cpu() which can only _reduce_ the value. It has to be kept <= NICE_WEIGHT * PRIO_MAX - PPQ or processes can scheduler-penalize themselves onto the same queue as nice +20 processes. (Or even a higher one.) === New schedclk() mechanism === Some platforms should be cutting down stathz before hitting the scheduler, since the scheduler algorithm only works right in the vicinity of 64 Hz. Rather than prescale hz, then scale back and forth by 4 every time p_estcpu is touched (each occurrence an abstraction violation), use p_estcpu without scaling and require schedhz to be generated directly at the right frequency. Use a default stathz (well, actually, profhz) / 4, so nothing changes unless a platform defines schedhz and a new clock. Define these for alpha, where hz==1024, and nice was totally broke.
=== Algorithm change === The nice value used to be added to the exponentially-decayed scheduler history value p_estcpu, in _addition_ to being incorporated directly (with greater weight) into the priority calculation. At first glance, it appears to be a pointless increase of 1/8 the nice effect (pri = p_estcpu/4 + nice*2), but it's actually at least 3x that because it will ramp up linearly but be decayed only exponentially, thus converging to an additional .75 nice for a loadaverage of one. I killed this, it makes the behavior hard to control, almost impossible to analyze, and the effect (~~nothing at all for the first second, then somewhat increased niceness after three seconds or more, depending on load average) pointless. === Other bugs === hz -> profhz in the p_pctcpu = f(p_cpticks) calculation. Collect scheduler functionality. Try to put each abstraction in just one place. ---------------------------- The details are a little different in FreeBSD: === nice bug === Fixing this is the main point of this commit. We use essentially the same clipping rule as NetBSD (our limit on p_estcpu differs by a scale factor). However, clipping at all is fundamentally bad. It gives free CPU to the hoggiest hogs once they reach the limit, and reaching the limit is normal for long-running hogs. This will be fixed later. === New schedclk() mechanism === We don't use the NetBSD schedclk() (now schedclock()) mechanism. We require (real)stathz to be about 128 and scale by an extra factor of 2 compared with NetBSD's statclock(). We scale p_estcpu instead of scaling the clock. This is more accurate and flexible. === Algorithm change === Same change. === Other bugs === The p_pctcpu bug was fixed long ago. We don't try as hard to abstract functionality yet. Related changes: the new limit on p_estcpu must be exported to kern_exit.c for clipping in wait1(). Agreed with by: dufault	1999-11-28 12:12:13 +00:00
Peter Wemm	64f86df1ed	Take a shot at implementing the fix for PR 15014 for the a.out kernel linker as well. PR: 15014 Submitted by: Vladimir N. Silyaev <vns@delta.odessa.ua>	1999-11-28 12:06:29 +00:00
Peter Wemm	b5abfb708c	Fix an embarrassing mistake in the kld symbol lookup for DDB. It should now correctly do a traceback when crashing inside a KLD module. PR: 15014 Submitted by: Vladimir N. Silyaev <vns@delta.odessa.ua>	1999-11-28 11:59:18 +00:00
Bruce Evans	9bc8d885ed	Updated comments for the move in the previous commit.	1999-11-27 15:27:11 +00:00
Bruce Evans	71a62f8a05	Fixed some comments in statclock(). The previous commit made it clearer that one comment was attached to null code.	1999-11-27 14:37:34 +00:00
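
Note on f52523701c (pipe_write() fixes): the commit message says a write of size <= PIPE_BUF is kept atomic by letting one internal write span two contiguous segments when it wraps from the end of the pipe buffer to the start. The following is only a rough userland sketch of that idea, not the kernel code: struct ring, its fields, and ring_write_atomic() are hypothetical names invented for illustration, and locking is left out entirely (the commit does the space check and pointer updates while holding the pipe lock).

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical ring buffer; not the kernel's pipe buffer structure. */
struct ring {
	char	*buf;	/* backing storage */
	size_t	 size;	/* capacity in bytes */
	size_t	 in;	/* offset of the next byte to write */
	size_t	 out;	/* offset of the next byte to read */
	size_t	 cnt;	/* bytes currently stored */
};

/*
 * Write all len bytes or none of them.
 * Returns 1 on success, 0 if there is not enough free space.
 * A real implementation would re-check the space under a lock.
 */
static int
ring_write_atomic(struct ring *r, const char *src, size_t len)
{
	size_t first;

	if (len > r->size - r->cnt)
		return (0);

	/* First segment: from the write offset up to the end of the buffer. */
	first = r->size - r->in;
	if (first > len)
		first = len;
	memcpy(r->buf + r->in, src, first);

	/* Second segment: the remainder, wrapped to the start (may be empty). */
	memcpy(r->buf, src + first, len - first);

	/* Update the offsets only after the whole transfer is in place. */
	r->in = (r->in + len) % r->size;
	r->cnt += len;
	return (1);
}
```

With an 8-byte buffer and the write offset at 6, a 4-byte write puts two bytes at offsets 6-7 and two at offsets 0-1 in a single call, so a reader never sees half of it.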


3164 Commits