freebsd-nq

Author	SHA1	Message	Date
Peter Wemm	02fc72dbe5	make mount(2) automatically kldload modules if the requested filesystem isn't present.	1998-11-03 14:29:09 +00:00
Peter Wemm	7837745438	Have the in-kernel linker try a default extension of .ko. This means that "kldload nfs" works. We use the same default extension in the /boot/loader system.	1998-11-03 14:27:05 +00:00
Peter Wemm	e91a7f30cf	Initialize the a.out kld loader after elf, so that elf gets first shot at a kldload attempt.	1998-11-03 14:25:21 +00:00
Peter Wemm	b913711e0c	Use the kvm space pathname that we copied in, not the one in user space.	1998-11-03 13:09:31 +00:00
Peter Wemm	8c14bf40a1	Change the #ifdef UNION code into a callable hook. Arrange to have this set up when unionfs is present, either statically or as a kld module.	1998-11-03 08:01:48 +00:00
Peter Wemm	f3d6ee090e	Only do one VOP_ACCESS() per open() instead of two. This should reduce the NFSv3 ACCESS RPC problems a little for busy clients that do a lot of open/close. The nfs code could probably cache the results, but I'm not sure whether this would be legal or useful. The problem is that with a CPU farm, on each open there would be a lookup, getattr then access RPC then the read/write RPC activity. Caching the access results probably isn't going to help much if the clients access lots of files. Having the nfs_access() routine interpret the getattr results is a bit of a hack, but it's how NFSv2 is done and it might be OK for a mount attribute for v3.	1998-11-02 02:36:16 +00:00
Peter Wemm	40c8cfe552	Use TAILQ macros for clean/dirty block list processing. Set b_xflags rather than abusing the list next pointer with a magic number.	1998-10-31 15:31:29 +00:00
Peter Wemm	16e9e530cc	Convert lists for bufs attached to vnodes from a LIST to a TAILQ. - Use TAILQ_* macros extensively instead of internal names - use b_xflags instead of the NOLIST magic number hack in the next pointer - clean bufs are inserted at the tail rather than the head. - redo dirty buffer insert so that metadata (negative lbn) goes to the tail directly rather than at the HEAD. This makes a difference when inserting dirty data blocks in lbn sorted order since data block insertion will not have to bypass all the metadata cruft. data is lbn sorted since it makes sense for clustering and writeback ordering, while metadata sorting doesn't help much since the lbn's are meaningless when walking the list for writebacks. Small systems will not notice much (if any) benefit from this, but really busy systems with large dirty block lists should get a lot more. I've tested this with softdep, and it doesn't seem to mind the change of queueing of metadata. Reviewed (in princible) by: dg Obtained from: partly from John Dyson's work-in-progress patches in June.	1998-10-31 14:20:39 +00:00
Peter Wemm	b421db370b	The last argument to vm_object_page_clean() are now bit flags, rather than the old true/false. While here, have vfs_msync() only call vm_object_page_clean() with OBJPC_SYNC if called with MNT_WAIT flags. vfs_msync() is called at unmount time (with MNT_WAIT) and from the syncer process (formerly update). This should make dirty mmap writebacks a little less nasty. I have tested this a little with SOFTUPDATES enabled, but I don't normally use it since I've been badly burned too many times.	1998-10-31 07:42:04 +00:00
David Greenman	2a78b8d1f8	Unwire everything to the inactive queue in order to preserve LRU ordering.	1998-10-30 14:53:54 +00:00
Mike Smith	35d27a0f39	Add the ability to specify where on the at_shutdown queue a handler is installed. Remove cpu_power_down, and replace it with an entry at the end of the SHUTDOWN_FINAL queue in the only place it's used (APM). Submitted by: Some ideas from Bruce Walter <walter@fortean.com>	1998-10-30 05:41:15 +00:00
Bruce Evans	cbbbd4c330	Oops, rev.1.167 made the device number checking in bdevvp() too strict for mfs root mounts. Don't require major 255 to be in bdevsw[].	1998-10-29 11:50:32 +00:00
David Greenman	0d5a725446	Fixed editing error. Pointed out by bde.	1998-10-29 11:04:22 +00:00
Peter Wemm	20f02ef5e9	Remove the V_SAVEMETA flag, nothing uses it any more now that msdosfs and ext2fs call vtruncbuf() directly. This simplifies and cleans up vinvalbuf() a little.	1998-10-29 09:51:28 +00:00
Garrett Wollman	a64d49d14f	A Perl5 version of makedevops.sh. Fixes a bug in the sh/awk version which really annoyed me at the time but I don't think bites me any more.	1998-10-29 01:52:59 +00:00
Garrett Wollman	af2a5c769a	The new resource manager, hopefully in a reasonably stable form.	1998-10-29 01:48:36 +00:00
David Greenman	730075613a	Added a second argument, "activate" to the vm_page_unwire() call so that the caller can select either inactive or active queue to put the page on.	1998-10-28 13:37:02 +00:00
Doug Rabson	45a576b57a	Lots of debugging code and some bugfixes. Submitted by: Nick Hibma <nick.hibma@jrc.it>	1998-10-27 09:21:43 +00:00
Bruce Evans	885bf0b57a	Updated the major number check in vfs_object_create(). It's not clear if the check is necessary, but vfs_object_create() is called for all vnodes and it was silly to create objects for VBLK vnodes that don't even have a driver.	1998-10-26 08:07:00 +00:00
Bruce Evans	fffd686a31	Fixed breakage of the GPROF case of statclock() in the previous commit.	1998-10-26 06:13:18 +00:00
Bruce Evans	b64f28be2d	Oops, back out some local changes that shouldn't have been in the previous commit.	1998-10-25 20:11:36 +00:00
Bruce Evans	c8a1bed2ab	Fixed breakage of the !SMP case of roundrobin() in the previous commit.	1998-10-25 19:57:23 +00:00
Poul-Henning Kamp	f5ef029e92	Nitpicking and dusting performed on a train. Removes trivial warnings about unused variables, labels and other lint.	1998-10-25 17:44:59 +00:00
Bruce Evans	37906c686d	Fixed device number checking in bdevvp(): - dev != NODEV was checked for, but 0 was returned on failure. This was fixed in Lite2 (except the return code was still slightly wrong (ENODEV instead of ENXIO)) but the changes were not merged. This case probably doesn't actually occur under FreeBSD. - major(dev) was not checked to have a valid non-NULL bdevsw entry. This caused panics when the driver for the root device didn't exist. Fixed minor misformattings in bdevvp(). Rev.1.14 consisted mainly of gratuitous reformattings that seem to have caused many Lite2 merge errors. PR: 8417	1998-10-25 16:11:49 +00:00
Bruce Evans	a6cda5b68f	Fixed sysctl attachment for statically configured vfs's. Broken in: previous commit	1998-10-25 10:52:34 +00:00
Mike Smith	6fe8861e01	Don't put 0x in front of %p, it does it already. Submitted by: Brian Feldman <green@janus.syracuse.net>	1998-10-24 18:35:09 +00:00
Poul-Henning Kamp	3bac064ff7	Change the way we simulate stable storage for timecounters. If you have problems with the "calcru" messages and processes being killed for excessive cpu time, try to increase the NTIMECOUNTER #define and report your findings.	1998-10-23 10:44:52 +00:00
John Polstra	2c2a0cf191	Eliminate a superfluous comment.	1998-10-21 16:31:38 +00:00
Peter Wemm	52c24af701	Some cleanups and optimizations: - Use the system headers method for Elf32/Elf64 symbol compatability - get rid of the UPRINTF debugging. - check the ELF header for compatability much more completely - optimize the section mapper. Use the same direct VM interfaces that imgact_aout.c and kern_exec.c use. - Check the return codes from the vm_* functions better. Some return KERN_* results, not an errno. - prefault the page tables to reduce startup faults on page tables like a.out does. - reset the segment protection to zero for each loop, otherwise each segment could get progressively more privs. (eg: if the first was read/write/execute, and the second was meant to be read/execute, the bug would make the second r/w/x too. In practice this was not a problem because executables are normally laid out with text first.) - Don't impose arbitary limits. Use the limits on headers imposed by the need to fit them into one page. - Remove unused switch() cases now that the verbose debugging is gone. I've been using an earlier version of this for a month or so. This sped up ELF exec speed a bit for me but I found it hard to get consistant benchmarks when I tested it last (a few weeks ago). I'm still bothered by the page read out of order caused by the transition from data to bss. This which requires either part filling the transition page or clearing the remainder.	1998-10-18 15:55:12 +00:00
Bruce Evans	95242f5a02	Overload the correct errno for attempts to set an in-core label with a raw partition at a nonzero offset (EINVAL should have been EXDEV; DIOCSDINFO was broken, and DIOCWDINFO was broken because it depended on DIOCSDINFO). A zero offset for the raw partition should probably be enforced in setdisklabel(), and DIOCWDINFO should probably always be handled by first calling setdisklabel() so that writedisklabel() doesn't need to enforce it, but this has never been done; dsioctl() has a special check. Changes in this commit are limited to dsioctl() to preserve bug for bug compatibility in drivers that don't use the slice code (notably the ccd driver, which allows setting a bogus label in DIOCWDINFO and doesn't undo the setting when writedisklabel() fails).	1998-10-17 09:46:42 +00:00
Bruce Evans	e36b4f594a	Use only the correct raw partition for writing labels. Don't use the partition that the label ioctl is being done on just because it has offset 0, since there is no guarantee that such a partition is large enough to contain the label. Don't use the wrong raw partition (0 instead of RAW_PART). This fixes problems rewriting bizarre labels (with a nonzero offset for the 'a' partition) in newfs(8). Such labels shouldn't normally be used, but creating them was allowed if the ioctl was done on the raw partition, and sysinstall creates them if the root partition isn't allocated first. Note that allowing write access to a partition other than the one that has been checked for write access doesn't increase security holes significantly, since write access to any partition already allows changing the in-core label. This fix should be in 3.0R. Rev.1.26 of newfs/newfs.c shouldn't be in 3.0R.	1998-10-17 07:49:04 +00:00
Jordan K. Hubbard	908dcbd2a4	fixup for alpha.	1998-10-16 10:14:21 +00:00
Peter Wemm	aa855a598d	gulp. Jordan specifically OK'ed this.. This is the bulk of the support for doing kld modules. Two linker_sets were replaced by SYSINIT()'s. VFS's and exec handlers are self registered. kld is now a superset of lkm. I have converted most of them, they will follow as a seperate commit as samples. This all still works as a static a.out kernel using LKM's.	1998-10-16 03:55:01 +00:00
Peter Wemm	df481e793f	Fix some bugs in link_aout.c caused by using uninitialized malloc space. Pre-Approved by: jkh	1998-10-16 03:45:35 +00:00
Peter Wemm	434bf653d2	- bzero() after malloc() bug fix - look up relocation symbol names in correct table bugfix. - remove unused (initialized to 0) variable and conditional free() of it.	1998-10-15 17:16:24 +00:00
Peter Wemm	bd4e381b03	- bzero() after malloc(). This is especially obvious when kern_malloc is compiled with DIAGNOSTIC. - Don't break from the preload module processing loop prematurely.	1998-10-15 17:12:12 +00:00
Peter Wemm	ddd62546e1	Fix sysinit_add(). - Don't include multiple copies of the previous sysinit in the new one. - Leave space for and explicitly null terminate the new list.	1998-10-15 17:09:19 +00:00
Doug Rabson	216a0f2d7f	Don't frob the user stack directly, use suword instead. This fixes the elf_freebsd_fixup() panic which many people have noticed on the alpha.	1998-10-15 09:52:19 +00:00
Doug Rabson	67fc32a9ff	Use the interface name as well as the method name when creating the method description. Submitted by: Nick Hibma <nick.hibma@jrc.it>	1998-10-15 09:50:45 +00:00
Kenneth D. Merry	a937ccdc09	Disable the 'devstat_end_transaction' busy count printf until after 3.0 release goes out the door. We know there's a bug in the devstat implementation in the wd driver, but bde and msmith haven't been able to fix it yet. So, disable the printf to avoid confusing/worrying people. Suggested by: msmith	1998-10-14 20:44:05 +00:00
Dmitrij Tejblum	f74d75a2b6	Backed out rev. 1.164. It caused problems on SMP. PR: 8309	1998-10-14 15:05:52 +00:00
Peter Wemm	2a26e9ea83	Align to sizeof(long) rather than sizeof(int32_t). It needs to be long because this code is shared with the alpha. I hope the alpha can read 32 bit ints at 32 bit alignment (vs. 64 bit alignment).	1998-10-14 05:07:23 +00:00
Peter Wemm	ca65d5c714	Load the full symbol tables if they are present. This means that ddb and tracebacks have access to local symbols. This is particularly important for the Alpha.	1998-10-13 09:27:00 +00:00
David Greenman	6cde7a165f	Fixed two potentially serious classes of bugs: 1) The vnode pager wasn't properly tracking the file size due to "size" being page rounded in some cases and not in others. This sometimes resulted in corrupted files. First noticed by Terry Lambert. Fixed by changing the "size" pager_alloc parameter to be a 64bit byte value (as opposed to a 32bit page index) and changing the pagers and their callers to deal with this properly. 2) Fixed a bogus type cast in round_page() and trunc_page() that caused some 64bit offsets and sizes to be scrambled. Removing the cast required adding casts at a few dozen callers. There may be problems with other bogus casts in close-by macros. A quick check seemed to indicate that those were okay, however.	1998-10-13 08:24:45 +00:00
Dmitrij Tejblum	9bbd8a2498	UnVMIO vnodes of block devices when they are no longer in use. (Some things, like msdosfs, do not work (panic) on devices with VMIO enabled. FFS enable VMIO on mounted devices, and nothing previously disabled it, so, after you mounted FFS floppy, you could not mount msdosfs floppy anymore...) This is mostly a quick before-release fix. Reviewed by: bde	1998-10-12 20:14:09 +00:00
Peter Wemm	2d636ab077	Only print kernel entry point during load. Drastically quieten down the verbose load progress messages. They were more useful for debugging than anything, but are beyond a joke when loading a few dozen modules. Simplify the ELF extended symbol table load format. Just take the main symbol table and the string table that corresponds. This is what we will be getting local symbols from. (needed for the alpha stack tracebacks). Use the (optional) full symbol tables in lookups. This means we have to furhter distinguish between symbols that can come from the dynamic linking table and the complete table. The alpha boot code now needs to be adapted as ddb/db_elf.c cannot use the simpler format. I have not implemented loading the extended symbol tables from the syscall interface yet, just for preloaded modules. I am not sure about the symbol resolution. I think it's possible that a local symbol can be found in preference to a global, depending on the search sequence and dependency tree.	1998-10-12 09:13:50 +00:00
Peter Wemm	2da2eeacee	Relocate a few extra addresses that are stored as physical addresses via the boot loader.	1998-10-12 09:03:48 +00:00
John Polstra	d1dbc69449	If an ELF executable has a recognized brand, then believe it. Formerly, the heuristic involving the interpreter path took precedence. Also, print a better error message if the brand is missing or not recognized. If there is no brand at all, give the user a hint that "brandelf" needs to be run.	1998-10-11 19:22:07 +00:00
KATO Takenori	7344a7bb2e	mp_machdep.c: Set a vector to boot code (PC-98). locore.s: Tell the bios to warmboot next time (PC-98).	1998-10-10 13:37:16 +00:00
KATO Takenori	ed68ba2cd0	PC-98 doesn't have CMOS ram.	1998-10-10 09:38:02 +00:00
Peter Wemm	26deceba9c	Display module type as well as module name when we find one preloaded.	1998-10-10 02:29:07 +00:00
Peter Wemm	51f3fe7ab2	Use Mike Smith's linker module search path code. Implement preloading in a fairly MI way, assuming the information is prepared. DDB interface helpers.. Provide some support for db_kld.c so that we don't have to export too much detail. Debugging and cosmetic nits left in from development.. The other half of the containing file hack so modules can associate themselves with their "file".	1998-10-10 00:07:53 +00:00
Peter Wemm	f3b0d44290	Keep track of the containing file for modules. This is a bit of a hack, but I can't think of another (relatively) easy way of getting the info since the boot-time initialization is not done immediately after "loading". XXX module_register() gained an extra arg. This might break the alpha compile, if so, just add a zero to get the old behavior.	1998-10-10 00:03:07 +00:00
Peter Wemm	e4f1a52f7a	Updates for alignment rounding. XXX this is highly machine dependent and should probably be moved to i386/i386/link_machdep.c (and the same for the alpha). Implement "deleting" a preloaded module by destroying it's tags. This is a hack. We cannot reuse the data, it's been destroyed by relocation, statically initialized variables have been modified, etc. Note that to reclaim the load space is going to be more machine-dependent work. Implement a relocate hook for machdep.c to call so that the physical addresses get converted to the equivalent KVM addresses.	1998-10-09 23:59:01 +00:00
Peter Wemm	de78ca7e63	Fully implement KLD and preloading. - seperate unload for preloaded linker objects. - Don't build a kernel object if running as an a.out kernel. - extract the real kernel name rather than hardwiring "kernel" for kldstat. (sysctl kern.bootfile getst the full name via bootinfo) - use real addresses on the kernel "module" rather than fictitious ones. - preloaded module support - search module path for file modules. - symbols are checked to see if they are in the right containing file before using their indexes into string tables. This is to help ddb since it only supplies a pointer to an opaque symbol and there is no telling which file/object/module/whatever it came from.	1998-10-09 23:55:31 +00:00
Peter Wemm	adbb7dbee6	Updates for KLD backends. - symbol_values checks that the symbol is indeed belonging to the correct symbol and string table pairs before looking up. (since there could be many pairs, and KLD/DDB need to find out). - different ops for files versus preload modules - the unload mechanism is different. (a preloaded module has to be deleted on unload since the in-core image is tainted by relocation and variables used) - Do not build an a.out kernel module if we're running on an elf kernel. :-) Note that it should theoretically be possible to mix a.out and elf KLD modules providing -mno-underscores was used to compile it, or some other symbol conversion takes place. - Support preload modules (even though /boot/loader doesn't yet) - Search the module path when loading files.	1998-10-09 23:49:28 +00:00
Peter Wemm	94e9d7c12d	Implement merging SYSINIT's from preloaded KLD modules. This means we check off SYSINIT entries as they are run, and when more arrive, we re-sort and restart (skipping the already-run entries). This can only be done after KMEM (and malloc) is up and running - this is fine because KLD is the only consumer of this and it's done after that. The nice thing about this is that the SYSINIT's within preloaded KLD modules are executed in their natural order. It should be possible to register devices for the probes which follow, etc. (soon.. several key things prevent this, such as use of linker sets for things like pci devices).	1998-10-09 23:42:47 +00:00
Mike Smith	a8dad95401	Don't try to initialise the environment out of a sysinit, it's handled in MD code instead.	1998-10-09 21:21:34 +00:00
Jordan K. Hubbard	e4f570a215	Fix damaged comment.	1998-10-09 11:03:46 +00:00
Mike Smith	6ba9413b55	Kernel environment access, preloaded module lookup.	1998-10-09 00:31:29 +00:00
KATO Takenori	5baef6cd27	BIOS ROM base address is 0xe8000 on PC-98.	1998-10-08 16:15:22 +00:00
Alexander Langer	d611666328	Cast the return value of tvtohz() from a long to an int to satisfy the compiler that we know what we're doing (the value returned has already been restricted to int ranges). Reviewed by: bde	1998-10-06 23:17:44 +00:00
Doug Rabson	a20d77550a	Make sure that the argv pointers for init are aligned to the correct boundary on the alpha.	1998-10-06 11:55:40 +00:00
Kenneth D. Merry	a795f8bbaa	Make the printf when busy_time < 0 a little more descriptive. This may help track down bugs in the devstat implementation in various drivers. (i.e., any situation where the driver does not call the devstat routines once and only once for each transaction initiation and completion) Prompted by: msmith	1998-10-06 04:16:07 +00:00
David E. O'Brien	efcddb8e39	Taking the GENERIC kernel and commenting out MSDOSFS, CD9660{,_ROOT}, PROCFS, NFS_ROOT will produce kernel that cannot mount a UFS /. Vfs type numbers must be distinct from VFS_GENERIC (and VFS_VFSCONF, but that has the same value and should go away). The problem happens because NFS is the first vfs (in sys/conf order) so it gets type number 0 and conflicts harmfully with VFS_GENERIC which is also 0. The conflict is apparently harmless in the usual case when another vfs gets type number 0, because nfs is the only vfs that has sysctls. Inital fix by: Dima <dima@tejblum.dnttm.rssi.ru> Reason why it worked by: bde	1998-10-05 11:10:55 +00:00
Doug Rabson	780330ee00	Send a MOD_UNLOAD event before deleting the module. if the MOD_LOAD event failed.	1998-10-03 11:05:45 +00:00
Doug Rabson	84bc104b8e	Don't call DEVICE_DETACH directly, always go through the device_detach() wrapper which checks to make sure the device isn't currently open.	1998-10-03 08:55:29 +00:00
John Polstra	7b4c881cef	Fix a bug which caused the dynamic linker pathname in the PT_INTERP program header entry to be ignored if a recognized brand was found.	1998-10-03 04:12:09 +00:00
Doug Rabson	da653c6148	Start using the new SWI registration system instead of hardwiring everything.	1998-09-26 14:25:32 +00:00
Matthew Dillon	ff8fae607b	PR: kern/7418 Reviewed by: Luoqi Chen <luoqi@watermarkgroup.com> Fixed problem where write()s can get lost due to buffers flagged B_DELWRI being improperly released in brelse().	1998-09-26 00:12:35 +00:00
Peter Wemm	10baba4b95	Goodbye BOUNCE_BUFFERS, for a hack it has served us well. The last consumer of this code (the old SCSI system) has left us and the CAM code does it's own bouncing. The isa dma system has been doing it's own bouncing for a while too. Reviewed by: core	1998-09-25 17:34:49 +00:00
Luoqi Chen	e266594c25	Eliminate a race in VOP_FSYNC() when softupdates is enabled. Submitted by: Kirk McKusick <mckusick@McKusick.COM> Two minor changes are also included, 1. Remove gratuitious checks for error return from vn_lock with LK_RETRY set, vn_lock should always succeed in these cases. 2. Back out change rev. 1.36->1.37, which unnecessarily makes async mount a little more unstable. It also keeps us in sync with other BSDs. Suggested by: Bruce Evans <bde@zeta.org.au>	1998-09-24 15:02:46 +00:00
Dmitrij Tejblum	a511bf18fe	Fix precedence bug, so that kernel dump works.	1998-09-20 16:50:31 +00:00
Kenneth D. Merry	bcc6a3da92	Change the devstat generation number from an int to a long. The int-sized generation was causing unaligned access faults on the Alpha. I have incremented the devstat version number, since this is an interface change. You'll need to recompile libdevstat, systat, iostat, vmstat and rpc.rstatd along with your kernel. Partially Submitted by: Andrew Gallatin <gallatin@cs.duke.edu>	1998-09-20 00:11:23 +00:00
Doug Rabson	6e4fb91578	Document BUS_CREATE_INTR and BUS_CONNECT_INTR.	1998-09-16 08:25:56 +00:00
John Polstra	0ff27d31ea	Restore the core-dumping of all writable segments for ELF executables, minus the NULL pointer dereference in rev. 1.33. Also simplify things somewhat by eliminating one traversal of the VM map entries. Finally, eliminate calls to vm_map_{un,}lock_read() which aren't needed here. I originally took them from procfs_map.c, but here we know we are dealing only with the map of the current process.	1998-09-16 02:04:05 +00:00
John Polstra	dada02781d	Erk. Revert back to 1.31, dumping only data and stack to the core file, until I can solve a panic that has just cropped up.	1998-09-15 22:23:12 +00:00
John Polstra	6bb20c5063	When choosing segments to write to the core file, don't assume that writable implies readable.	1998-09-15 22:07:20 +00:00
John Polstra	8162da636b	Instead of just the data and stack segments, include all writable segments (except memory-mapped devices) in the ELF core file. This is really nice. You get access to the data areas of all shared libraries, and even to files that are mapped read-write. In the future, it might be good to add a new resource limit in the spirit of RLIMIT_CORE. It would specify the maximum sized writable segment to include in core dumps. Segments larger than that would be omitted. This would be useful for programs that map very large files read/write but that still would like to get usable core dumps.	1998-09-15 21:46:34 +00:00
Justin T. Gibbs	7ea97031d1	kern_clock.c: Remove old disk statistics variables. vfs_bio.c: Enable bowrite now that B_ORDERED works for all buffer devices.	1998-09-15 10:05:18 +00:00
Justin T. Gibbs	eda00cb5d2	When a buffer is removed from a buffer queue, remember it's block number and use it as "the currently active" buffer in doing disk sort calculations.	1998-09-15 08:55:03 +00:00
Justin T. Gibbs	2cfa0a0381	Add a new at_shutdown queue, SHUTDOWN_FINAL. This queue is run at splhigh() after any system dumps have completed. SHUTDOWN_POST_SYNC isn't quite late enough for disk controllers. Converted at_shutdown queues to use the queue(3) macros.	1998-09-15 08:49:52 +00:00
Justin T. Gibbs	7a59208d92	New Kernel device statistics code. Submitted by: "Kenneth D. Merry" <ken@plutotech.com>	1998-09-15 08:16:17 +00:00
John Polstra	c284427561	Remove includes that are no longer needed, now that the core dumping code has been moved into the respective imgact_xxx.c sources.	1998-09-14 23:25:18 +00:00
John Polstra	8c64af4f75	Viola! The kernel now generates standard ELF core dumps for ELF executables. Currently only data and stack are included in the core dumps. I am looking into adding the other (mmapped) writable segments as well.	1998-09-14 22:46:08 +00:00
Søren Schmidt	d024c95599	Remove the SLICE code. This clearly needs alot more thought, and we dont need this to hunt us down in 3.0-RELEASE.	1998-09-14 19:56:42 +00:00
John Polstra	22d4b0fb41	Add provisions for variant core dump file formats, depending on the object format of the executable being dumped. This is the first step toward producing ELF core dumps in the proper format. I will commit the code to generate the ELF core dumps Real Soon Now. In the meantime, ELF executables won't dump core at all. That is probably no less useful than dumping a.out-style core dumps as they have done until now. Submitted by: Alex <garbanzo@hooked.net> (with very minor changes by me)	1998-09-14 05:36:51 +00:00
Doug Rabson	fe3db7c7c7	Implement dynamic loading for ELF.	1998-09-11 08:46:15 +00:00
Doug Rabson	9386468ef8	Avoid a possible memory leak.	1998-09-11 08:45:32 +00:00
Tor Egge	a58915fc10	Don't keep the underlying directory locked while performing the file system specific VFS_MOUNT operation. PR: 1067	1998-09-10 02:27:52 +00:00
Bruce Evans	355a2610a7	Don't use CTL_VFS at the wrong level. This caused loops in the sysctl tree if CTL_VFS happened to get assigned as a type number to a vfs that has some vfs sysctls.	1998-09-09 07:41:41 +00:00
Bruce Evans	cae300be0f	Made unloading of the nfs LKM sort of work. This is mainly to test detachment of vfs sysctls. Unloading of vfs LKMs doesn't actually work for any vfs, since it leaves garbage pointers to memory allocation control structures.	1998-09-07 05:42:15 +00:00
Tor Egge	572d053e17	Maintain a mapping from irq number to (ioapic number, int pin) tuple, and use this when masking/unmasking interrupts. Maintain a mapping from (iopaic number, int pin) tuple to irq number, and use this when configuring devices and programming the ioapics. Previous code assumed that irq number was equal to int pin number, and that the ioapic number was 0. Don't let an AP enter _cpu_switch before all local apics are initialized.	1998-09-06 22:41:42 +00:00
Andrey A. Chernov	99237364cc	Store formatted panic string in static buffer to make it available later for savecore. Previous code give only panic format to savecore	1998-09-06 06:25:18 +00:00
Bruce Evans	e99ea9ec2b	Ignore the statically configured vfs type numbers and assign vfs type numbers in vfs attach order (modulo incomplete reuse of old numbers after vfs LKMs are unloaded). This requires reinitializing the sysctl tree (or at least the vfs subtree) for vfs's that support sysctls (currently only nfs). sysctl_order() already handled reinitialization reasonably except it checked for annulled self references in the wrong place. Fixed sysctls for vfs LKMs.	1998-09-05 17:13:28 +00:00
Bruce Evans	500b04a257	Instantiate `nfs_mount_type' in a standard file so that it is present when nfs is an LKM. Declare it in a header file. Don't forget to use it in non-Lite2 code. Initialize it to -1 instead of to 0, since 0 will soon be the mount type number for the first vfs loaded. NetBSD uses strcmp() to avoid this ugly global.	1998-09-05 15:17:34 +00:00
Bruce Evans	134e06fe71	Fixed bogotification of pseudocode for syscall args by rev.1.53 of syscalls.master.	1998-09-05 14:30:11 +00:00
Poul-Henning Kamp	0375c9f2b8	Add a new vnode op, VOP_FREEBLKS(), which filesystems can use to inform device drivers about sectors no longer in use. Device-drivers receive the call through d_strategy, if they have D_CANFREE in d_flags. This allows flash based devices to erase the sectors and avoid pointlessly carrying them around in compactions. Reviewed by: Kirk Mckusick, bde Sponsored by: M-Systems (www.m-sys.com)	1998-09-05 14:13:12 +00:00
Bruce Evans	a8b8bc0730	Fixed recently perpetrated printf format errors.	1998-09-05 13:24:39 +00:00
Andrey A. Chernov	253ab668e2	make sbflush panic messages more descriptive	1998-09-04 13:13:18 +00:00
Doug Rabson	e69763a315	Cosmetic changes to the PAGE_XXX macros to make them consistent with the other objects in vm.	1998-09-04 08:06:57 +00:00
Garrett Wollman	9898afa1f1	Bow to tradition and correctly implement the bogus-but-hallowed semantics of getsockopt never telling how much it might have copied if only the buffer were big enough.	1998-08-31 18:07:23 +00:00
Garrett Wollman	d224dbc106	Correctly set the return length regardless of the relative size of the user's buffer. Simplify the logic a bit. (Can we have a version of min() for size_t?)	1998-08-31 15:34:55 +00:00
KATO Takenori	582e52862a	- hw.machine_arch returns cpu architecture type. - moved definition of MACHINE_ARCH from cpu.h to parm.h as alpha. - Added definitions of _MACHINE and _MACHINE_ARCH. - Added hw.ispc98. The hw.ispc98 is 1 in PC98 kernel and is 0 in IBM-PC kernel. Discussed with: John Birrell <jb@FreeBSD.ORG>	1998-08-31 08:41:58 +00:00
Bruce Evans	f5ce675296	Oops, the previous revision unconfigured too much pre-Lite2 compatibilty cruft. At least lsvfs(1) was broken.	1998-08-29 13:13:10 +00:00
Luoqi Chen	ddae3cb9a0	Close a race window for getnewbuf() between shared lock holders of the vnode. Reviewed by: Mike Smith	1998-08-28 20:07:13 +00:00
Matthew Dillon	8e519d1f35	priority comparison in maybe_resched() didn't work properly if current and chk process were on different scheduler queues. Fixed.	1998-08-26 05:27:42 +00:00
Poul-Henning Kamp	12e14047a4	Fix DDBs printing of buf-flags after I changed them yesterday.	1998-08-25 14:41:42 +00:00
Poul-Henning Kamp	1d9b3ba13d	Remove the last remaining evidence of B_TAPE. Reclaim 3 unused bits in b_flags	1998-08-24 17:47:25 +00:00
Doug Rabson	069e9bc1b4	Change various syscalls to use size_t arguments instead of u_int. Add some overflow checks to read/write (from bde). Change all modifications to vm_page::flags, vm_page::busy, vm_object::flags and vm_object::paging_in_progress to use operations which are not interruptable. Reviewed by: Bruce Evans <bde@zeta.org.au>	1998-08-24 08:39:39 +00:00
Doug Rabson	c49265d091	Regnerate.	1998-08-24 08:32:19 +00:00
Doug Rabson	2e83b28161	Fix a few syscall arguments to use size_t instead of u_int.	1998-08-24 08:29:52 +00:00
Doug Rabson	a4f6773848	Add partial KLD support for ELF. The module loading is not written yet.	1998-08-24 08:25:26 +00:00
Bruce Evans	00671271c3	Fixed printf format errors. Only one left in LINT on i386's.	1998-08-24 02:28:16 +00:00
Poul-Henning Kamp	be18fc123b	remove bdevsw arg from dsopen(); Forgotten by: julian Reviewed by: bde	1998-08-23 20:16:35 +00:00
Dag-Erling Smørgrav	70d154a652	Don't check minor number of dump device at all. Discussed-with: Jörg Wunsch	1998-08-23 14:18:08 +00:00
Bruce Evans	1fcee46997	Fixed printf format errors.	1998-08-23 10:16:26 +00:00
Bruce Evans	cf8c7b0963	Added D_TTY to the cdevswitch flags for all tty drivers. This is required for the Lite2 fix for always returning EIO in dead_read(). Cleaned up the cdevswitch initializers for all tty drivers. Removed explicit calls to ttsetwater() from all (tty) drivers. ttsetwater() is now called centrally for opens, not just for parameter changes.	1998-08-23 08:26:42 +00:00
Garrett Wollman	cfe8b629f1	Yow! Completely change the way socket options are handled, eliminating another specialized mbuf type in the process. Also clean up some of the cruft surrounding IPFW, multicast routing, RSVP, and other ill-explored corners.	1998-08-23 03:07:17 +00:00
Bruce Evans	5879dcdb05	Moved `nx' functions to the one place where they are used (su.c). They shouldn't be used there either. They should have gone away about 3 years ago when the statically initialized devswitches went away, but su.c unfortunately still frobs the cdevswitch in the old way.	1998-08-20 06:10:42 +00:00
Dag-Erling Smørgrav	9103e8640c	Include opt_devfs.h which defines SLICE, to make previous commit meaningful. Pointed out by: Luoqi Chen	1998-08-19 20:20:52 +00:00
Søren Schmidt	e620a1cbed	Make struct buf->b_offset reflect the real byte offset which got in via the uio struct. This enables device drivers to use != DEV_BSIZE blocking on devices with wierd sector/block sizes (ie CDROM's).	1998-08-19 10:50:32 +00:00
Bruce Evans	5cf40a698b	A limit of 200000 for the output buffer high watermark was excessive, since (hardware) ttys have too low a bandwidth to benefit significantly from large buffers. Use twice the old limit for the new-default case and 8 times the old limit for the driver-specifies-watermark case. Nothing uses these cases yet. Removed related debugging code.	1998-08-19 04:01:00 +00:00
Mike Smith	287e61c39f	Presently there is only one `currentldt' variable for all cpus in a SMP system. Unexpected things could happen if each cpu has a different ldt setting and one cpu tries to use value of currentldt set by another cpu. The fix is to move currentldt to the per-cpu area. It includes patches I filed in PR i386/6219 which are also user ldt related. PR: i386/7591, i386/6219 Submitted by: Luoqi Chen <luoqi@watermarkgroup.com>	1998-08-18 07:47:12 +00:00
Bruce Evans	2d2f8ae7ad	Fixed nonsense overflow checking (checking that a long variable is less than INT_MAX after it has possibly overflowed). Removed an unused variable and its associated 2 style bugs. Removed unused includes.	1998-08-17 17:28:10 +00:00
Dag-Erling Smørgrav	d08b9c139f	Enable kernel dumps on SLICE systems.	1998-08-16 11:27:19 +00:00
John Polstra	317c91f4d4	Make ELF kernels build again.	1998-08-16 04:19:03 +00:00
Bruce Evans	86a14a7a0a	Use [u]intptr_t instead of [u_]long for casts between pointers and integers. Don't forget to cast to (void *) as well.	1998-08-16 01:21:52 +00:00
Bruce Evans	69ed480f48	pmap.c: Cast pointers to (vm_offset_t) instead of to (u_long) (as before) or to (uintptr_t)(void ) (as would be more correct). Don't cast vm_offset_t's to (u_long) just to do arithmetic on them. mp_machdep.c: Cast pointers to (uintptr_t) instead of to (u_long). Don't forget to cast pointers to (void ) first or to recover from integral possible integral promotions, although this is too much work for machine-dependent code. vm code generally avoids warnings for pointer vs long size mismatches by using vm_offset_t to represent pointers; pmap.c often uses plain `unsigned int' instead of vm_offset_t and didn't use u_long elsewhere, but this style was messed up by code apparently imported from mp_machdep.c.	1998-08-16 00:41:40 +00:00
Bruce Evans	160bd4c62f	Oops, the printf format error fixes confused curp->area with a pointer.	1998-08-15 22:42:20 +00:00
Doug Rabson	7032ad107e	Protect all modifications to v_numoutput with splbio().	1998-08-13 08:09:08 +00:00
Bruce Evans	13950bd2ed	Don't configure compatibility code for pre-Lite2 mount() calls by default. This code should go away soon.	1998-08-12 20:17:42 +00:00
Doug Rabson	a2c99e3e72	Modify the internal interfaces to the kernel linker to make it possible for DDB to use its symbol tables.	1998-08-12 08:44:21 +00:00
Bruce Evans	18c5a6c435	Implemented dynamic registration of software interrupt handlers. Not used yet. Use dummy SWI handlers to avoid some checks for null pointers.	1998-08-11 15:08:13 +00:00
Bruce Evans	c41141b002	Fixed the formatting of some tables (mainly the one produced by ps in ddb) which I broke by changing %8[l]x to %8p. Hacked the central printf routine to not add an "0x" prefix for %p formats if the field width is nonzero. The tables are still horribly misformatted on 64-bit machines. Use %p instead of %8p to print pointers when the field width isn't important.	1998-08-10 14:27:34 +00:00
Poul-Henning Kamp	22126f4208	The machine dependent disk slice manager does not recognize DOS partition type 15 (Extended DOS, LBA) as a container for DOS logical volumes, so the appropriate slices (e.g. sd1s5) are not initialized. PR: 7549 PR: 4120 Reviewed by: phk Submitted by: Jim Mattson <jmattson@sonic.net>	1998-08-10 07:22:14 +00:00
Doug Rabson	d474eaaa5f	Protect all modifications to paging_in_progress with splvm(). The i386 managed to avoid corruption of this variable by luck (the compiler used a memory read-modify-write instruction which wasn't interruptable) but other architectures cannot. With this change, I am now able to 'make buildworld' on the alpha (sfx: the crowd goes wild...)	1998-08-06 08:33:19 +00:00
Bruce Evans	6360628342	Removed unused function hzto().	1998-08-05 18:06:40 +00:00
David Greenman	760c5490ee	Move assignment of cur_rlp to after the acquisition of the list lock. PR: 7496 Submitted by: Stefan Eggers <seggers@semyam.dinoco.de>	1998-08-05 14:06:04 +00:00
Poul-Henning Kamp	205d5ed6ff	remove nonsense code. PR: 7482 Reviewed by: phk Submitted by: Stefan Eggers <seggers@semyam.dinoco.de>	1998-08-04 09:21:04 +00:00
Bruce Evans	34e9dea435	Added a flags arg to dsopen() and updated drivers. The DSO_ONESLICE and DSO_NOLABELS flags prevent searching for slices and labels respectively. Current drivers don't set these flags. When DSO_NOLABELS is set, the in-core label for the whole disk is cloned to create an in-core label for each slice. This gives the correct result (a good in-core label for the compatibility slice) if DSO_ONESLICE is set or only one slice is found, but usually gives broken labels otherwise, so DSO_ONESLICE should be set if DSO_NOLABELS is set.	1998-07-30 15:16:06 +00:00
Doug Rabson	8a8a13c8f0	Only access an int for READU/WRITEU since that is what ptrace is declared to return.	1998-07-29 18:41:30 +00:00
Doug Rabson	a9d81f7c5c	Default to FreeBSD if no brand detected. This makes life easier when bootstrapping from NetBSD/alpha.	1998-07-29 18:39:35 +00:00
Bruce Evans	d974cf4dda	Fixed printf format errors.	1998-07-29 17:38:14 +00:00
Bruce Evans	f9a9c96c25	Centralized and optimized handling of large sectors. Centralized checking of transfer sizes and alignments. Old version tested with 2K-sectors on od disks by: Shunsuke Akiyama <akiyama@kme.mei.co.jp>.	1998-07-29 11:15:54 +00:00
Bruce Evans	ea0823f2c9	Use the slice-relative blkno in all parts of the label write protection checks. Using the partition-relative blkno in some parts broke the write protection for partitions at unusual offsets (only for partitions at offset 1 on i386's).	1998-07-29 08:24:23 +00:00
Joerg Wunsch	57308494ec	Make the logging of abnormally exiting processes optional by a sysctl. PR: kern/1711 Submitted by: Nick Sayer <nsayer@kfu.com>	1998-07-28 22:34:12 +00:00
Bruce Evans	1733a6c1df	Set bp->b_resid for failed transfers in dscheck(). This is the best place to set it, and the wd and wfd strategy routines don't set it (for failed transfers) because they expect dscheck() to initialize everything necessary. dscheck() has always set B_ERROR, but this is not quite sufficient, because b_resid is used by physio() to decide how much of a B_ERROR'ed i/o was done.	1998-07-28 19:39:09 +00:00
Bruce Evans	aa6db4230d	Used daddr_t's, not ints, to store disk block numbers. Updated printf formats and args to match. Fixed old printf format errors (all related; most were hidden by calling printf indirectly). This change somehow avoids compiler bugs for 64-bit longs on i386's, although it increases the number of 64-bit calculations.	1998-07-28 18:25:51 +00:00
Bruce Evans	bc9e7c3b42	Fixed double counting of runtime after a process exits. The last timeslice of the exiting process was counted for both the exiting process and the next process to run if the next process runs immediately. Broken in: mostly in kern_clock.c rev.1.70 (1998/05/28)	1998-07-27 19:16:21 +00:00
David Greenman	01ddfa33e6	Only call m_reclaim() if M_WAIT since calling it from an interrupt can cause problems. PR: 7403	1998-07-27 03:59:48 +00:00
Bruce Evans	f69c53b019	Don't pass the label to diskerr(), since the label is being constructed and may be invalid. In particular, d_secpercyl may be 0, and diskerr() divides by it.	1998-07-25 16:35:06 +00:00
Doug Rabson	0bf030847d	Add some very simple support for a compiled in (from config(8)) resource database.	1998-07-22 08:35:52 +00:00
Bruce Evans	7cb743ae28	Initialize more defaults for the in-core label for the whole disk. Callers only need to initialize d_secperunit now, but should initialize d_type (to reduce the IDE/SCSI confusion), d_typename (put the disk model in it) and geometry info (if it isn't completely ficticious). Callers will soon need to initialize d_secsize.	1998-07-20 14:35:27 +00:00
Bruce Evans	f21b93a0f7	Cleaned up rev.1.39 - the shadowing variable should have just gone away.	1998-07-20 13:51:11 +00:00
Bruce Evans	92d1f65ed2	Moved allocation of the slices struct to the right place. Initialize everything in it (the devsw pointers were not initialized early or at all for the !DEVFS case, but this was harmless on i386's).	1998-07-20 13:39:45 +00:00
Bruce Evans	1e550e3809	Backed out rev.1.43 (removed nonsense SLICE ifdef). SLICE is normally only defined in opt_devfs.h, so testing it before including anything is normally a no-op. Undef'ing DEVFS before including opt_devfs.h is similarly useless. OTOH, DEVFS support for sliced but not SLICEd (despite defined(SLICE)) devices is either harmless (if there are no such devices, then nothing in this file is used) or necessary (otherwise). It even seems to work for sliced cd devices.	1998-07-20 12:37:59 +00:00
Bill Fenner	0c495036b4	Undo rev 1.41 until we get more details about why it makes some systems fail.	1998-07-18 18:48:45 +00:00
Bruce Evans	18da528d41	Changed %n to %r in devfs name format strings. %n has almost gone away.	1998-07-15 12:18:34 +00:00
Bruce Evans	30166fabb6	Cast between longs and pointers via intptr_t. There shouldn't be nearly so many casts here. Casting an pointer that was an integer back to an integer just to compare it with -1 is bad, and casting it back just to compare it with NULL is just wrong.	1998-07-15 06:51:14 +00:00
Bruce Evans	d4d88b1e4d	Cast between u_longs and object pointers via uintptr_t. Access the entry address as a uintfptr_t, not as a long, and not necessarily as what modload(8) passes (it takes a u_long from the exec header and passes a u_int).	1998-07-15 06:39:12 +00:00
Bruce Evans	aae0aa4593	Cast between longs and pointers via intptr_t. The results of fuword() should be checked before casting. The results of suword() should be checked.	1998-07-15 06:19:33 +00:00
Bruce Evans	1ede4662be	Cast longs to intptr_t before casting them to pointers. Fixed bitrot in pseudo-declaration of `struct fcntl_args'. fcntl() is now broken in some cases when ints are larger than longs.	1998-07-15 06:10:16 +00:00
Bruce Evans	c2da0fd903	Cast pointers to intptr_t instead of or before casting to long. Fixed bitrot in K&R support (suword() now takes a long word). Didn't fix corresponding bitrot in store.9 and fetch.9. The correct types for the store and fetch families are problematic. The `word' functions are unfortunately named and need to be split to handle ints/longs/object pointers/function pointers. Storing argv[] as longs is quite broken when longs are longer than pointers, but usually works because it clobbers variables that will soon be reinitialized.	1998-07-15 05:21:48 +00:00
Bruce Evans	7cd99438f8	Cast u_longs to uintptr_t before casting them to pointers. Don't attempt to even partially support systems with function pointers larger than object pointers.	1998-07-15 05:00:26 +00:00
Bruce Evans	6a206dd96a	Cast function pointers to uintfptr_t before casting them to u_long. Hopefully caddr_t is large enough to hold function pointers. Cast object pointers to uintptr_t before casting them to u_long. Types are wronger than usual for the PT_READ_U case. ptrace() can only return ints, but longs are accessed.	1998-07-15 04:43:49 +00:00
Bruce Evans	a23d65bfc8	Cast pointers to uintptr_t/intptr_t instead of to u_long/long, respectively. Most of the longs should probably have been u_longs, but this changes is just to prevent warnings about casts between pointers and integers of different sizes, not to fix poorly chosen types.	1998-07-15 02:32:35 +00:00
Bruce Evans	37889b394a	Changed to the C9x draft spelling of the (unsigned) integral type suitable for holding object pointers (ptrint_t -> uintptr_t). Added corresponding signed type (intptr_t). Changed/added corresponding non-C9x types for function pointers to match. Don't use nonstandard types to implement these types, and don't comment on them in <machine/types.h>.	1998-07-14 05:09:48 +00:00
Bruce Evans	9f14a215f4	Fixed printf format errors.	1998-07-13 07:05:55 +00:00
Doug Rabson	7a6c46b55a	Initialise all the fields separately in vattr_null since on the alpha they are not all the same width.	1998-07-12 16:45:39 +00:00
Doug Rabson	45c95fa1d6	Change interrupt api to be closer to intr_create/intr_connect.	1998-07-12 16:20:52 +00:00
Bruce Evans	bef7db2e66	Moved definition of fscale from param.c to kern_synch.c where it should always have been (it has no user-servicable parts even at compile time) and staticized it.	1998-07-11 13:06:41 +00:00
Bruce Evans	2f18a2801b	Fixed printf format errors.	1998-07-11 10:45:45 +00:00
Bruce Evans	ed62fb52ec	Fixed printf format errors.	1998-07-11 10:28:47 +00:00
Bruce Evans	ac1e407b32	Fixed printf format errors.	1998-07-11 07:46:16 +00:00
Bruce Evans	e0c38587af	Fixed (un)sign extension bugs in %+n format. -4 became (long)(u_long)(u_int)-4 = 0x00000000fffffffc on machines with 32-bit ints and 64-bit longs. Restored %z format for printing signed hex. %+x shouldn't have been used since it is an error in userland. Prepared to nuke %n format by cloning it to %r. %n shouldn't have been used because it means something completely different in userland. Now %+r is equivalent to ddb's original %r, and %r is equivalent to ddb's original %n. Ignore '+' flag in combination with unsigned formats %{o,p,u,x}.	1998-07-08 10:41:32 +00:00
Sean Eric Fagan	c5edb423c6	Add support for run-time configuration of core file names. In a nutshell, you can specify the corefile name by using: sysctl -w kern.corefile="format" where format is a pathname (relative or absolute -- default is "%N.core"), with "%N" (process name), "%P" (process ID), and "%U" (user ID) formats. Reviewed by: Mike Smith, with strong requests by Julian :)	1998-07-08 06:38:39 +00:00
Julian Elischer	6deaf84b1f	Catch a few corner cases where FreeBSD differs enough from BSD 4.4 to confuse Soft updates.. Should solve several "dangling deps" panics.	1998-07-08 01:04:33 +00:00
Bruce Evans	c4ebf24f6e	Don't depend on gcc's feature of casting lvalues.	1998-07-07 04:36:23 +00:00
Bill Fenner	dece5b6a43	Introduce (fairly hacky) workaround for odd TCP behavior with application writes of size (100,208]+N*MCLBYTES. The bug: sosend() hands each mbuf off to the protocol output routine as soon as it has copied it, in the hopes of increasing parallelism (see http://www.kohala.com/~rstevens/vanj.88jul20.txt ). This works well for TCP as long as the first mbuf handed off is at least the MSS. However, when doing small writes (between MHLEN and MINCLSIZE), the transaction is split into 2 small MBUF's and each is individually handed off to TCP. TCP assumes that the first small mbuf is the whole transaction, so sends a small packet. When the second small mbuf arrives, Nagle prevents TCP from sending it so it must wait for a (potentially delayed) ACK. This sends throughput down the toilet. The workaround: Set the "atomic" flag when we're doing small writes. The "atomic" flag has two meanings: 1. Copy all of the data into a chain of mbufs before handing off to the protocol. 2. Leave room for a datagram header in said mbuf chain. TCP wants the first but doesn't want the second. However, the second simply results in some memory wastage (but is why the workaround is a hack and not a fix). The real fix: The real fix for this problem is to introduce something like a "requested transfer size" variable in the socket->protocol interface. sosend() would then accumulate an mbuf chain until it exceeded the "requested transfer size". TCP could set it to the TCP MSS (note that the current interface causes strange TCP behaviors when the MSS > MCLBYTES; nobody notices because MCLBYTES > ethernet's MTU).	1998-07-06 19:27:14 +00:00
Julian Elischer	596f8506ad	fix braino from yesterdays' megacommit Not sure of the result of it.. (may or may not effect anything) but it's fixed now. (found by: comparing what cvsup sent back to me with what I tested..)	1998-07-05 20:33:18 +00:00
Julian Elischer	f7ea2f55d1	There is no such thing any more as "struct bdevsw". There is only cdevsw (which should be renamed in a later edit to deventry or something). cdevsw contains the union of what were in both bdevsw an cdevsw entries. The bdevsw[] table stiff exists and is a second pointer to the cdevsw entry of the device. it's major is in d_bmaj rather than d_maj. some cleanup still to happen (e.g. dsopen now gets two pointers to the same cdevsw struct instead of one to a bdevsw and one to a cdevsw). rawread()/rawwrite() went away as part of this though it's not strictly the same patch, just that it involves all the same lines in the drivers. cdroms no longer have write() entries (they did have rawwrite (?)). tapes no longer have support for bdev operations. Reviewed by: Eivind Eklund and Mike Smith Changes suggested by eivind.	1998-07-04 22:30:26 +00:00
Julian Elischer	fd5d1124e2	VOP_STRATEGY grows an (struct vnode *) argument as the value in b_vp is often not really what you want. (and needs to be frobbed). more cleanups will follow this. Reviewed by: Bruce Evans <bde@freebsd.org>	1998-07-04 20:45:42 +00:00
Poul-Henning Kamp	52f8e5d672	Hmm, braino in last commit.	1998-07-04 19:29:15 +00:00
Poul-Henning Kamp	0edd53d22a	Change the sign on a race-condition, so that instead of ending up several tens of milliseconds out in the future we end up the right place with a subweeniesecond error.	1998-07-04 19:12:21 +00:00
Poul-Henning Kamp	3e5e083cb7	Update M_EXT support in m_copypacket(). PR: 7122 Reviewed by: phk Submitted by: Castor Fu <castor@geocast.com> Originally forgotten by: julian	1998-07-03 08:36:48 +00:00
David Greenman	e25169f239	Reset MNT_ASYNC flag if needed if unmount() should fail. Submitted by: Paul Saab <paul@mu.org>	1998-07-03 03:47:24 +00:00
Poul-Henning Kamp	6ca4ca2476	When we transfer time from one timecounter to the next, use nanouptime(), not nanotime(); Otherwise we end up in 2026... Fix the arg to dummy_get_timecount()	1998-07-02 21:35:02 +00:00
Poul-Henning Kamp	8cb5266728	Add 3 sysctl variables for future use by ps)1_	1998-06-30 21:25:58 +00:00
Bruce Evans	673796a715	Nuked opt_defunct.h and kern_opt.c. config(8) now generates good enough warnings about all unknown options.	1998-06-30 14:43:04 +00:00
Poul-Henning Kamp	67f4e2ed05	Add trailing newline to sys/syscall.mk so that diff doesn't choke on it.	1998-06-28 10:01:52 +00:00
David Greenman	c87e2930e6	Added a sysctl variable kern.sugid_coredump for controlling coredump behavior of setuid/setgid binaries that defaults to 0 (coredump disabled).	1998-06-28 08:37:45 +00:00
Poul-Henning Kamp	c259b8dd2b	Report the mode as the result of the VOP_GETATTR rather than the vnodes type, they may not correspond.	1998-06-27 06:43:09 +00:00
Poul-Henning Kamp	7c281842e3	Remove isdisk() hacks.	1998-06-26 18:14:25 +00:00
Poul-Henning Kamp	b62591052c	Remove bdevsw_add(), change the only two users to use bdevsw_add_generic(). Extend cdevsw to be superset of bdevsw. Remove non-functional bdev lkm support. Teach wcd what the open() args mean.	1998-06-25 11:28:07 +00:00
Bruce Evans	be160d60ab	Removed unused includes.	1998-06-21 18:02:50 +00:00
Bruce Evans	e5b19842ef	Removed unused includes.	1998-06-21 14:53:44 +00:00
Bruce Evans	df471779ea	Round tickadj up. This prevents tickadj from being 0 when HZ > 500, which makes adjtime(2) useless and confuses xntpd(8) into refusing to start even when it would use the kernel PLL instead of adjtime(). The result is the same as recommended by tickadj(8), at least when HZ divides 10^6. Of course, you wouldn't want to actually use adjtime() when HZ is large. In the silly boundary case of HZ == 10^6, tickadj == tick == 1 so the clock stops while adjtime() is active.	1998-06-21 12:22:35 +00:00
Bruce Evans	316bbd5c6f	Converted add_interrupt_randomness() to take a `void *' arg. Rewrote mmioctl() to fix hundreds of style bugs and a few error handling bugs (don't check for superuser privilege for inappropriate ioctls, don't check the input arg for the output-only MEM_RETURNIRQ ioctl, and don't return EPERM for null changes).	1998-06-21 11:33:32 +00:00
Bruce Evans	9a2daf9190	Changed the type of an isa/general interrupt handler to take a `void *' arg. Fixed or hid most of the resulting type mismatches. Handlers can now be updated locally (except for reworking their global declarations in isa_device.h).	1998-06-18 15:32:09 +00:00
Bruce Evans	f95ac73519	Use copyout() instead of bcopy() to copy the image to user space. bcopy() caused panics under heavy paging (not quite as suspected - the kernel stack seemed to get corrupted). Fixed long lines. Reviewed by: phk	1998-06-16 14:36:40 +00:00
Doug Rabson	b1bf661000	[Add missing files from previous commit] Major changes to the generic device framework for FreeBSD/alpha: * Eliminate bus_t and make it possible for all devices to have attached children. * Support dynamically extendable interfaces for drivers to replace both the function pointers in driver_t and bus_ops_t (which has been removed entirely. Two system defined interfaces have been defined, 'device' which is mandatory for all devices and 'bus' which is recommended for all devices which support attached children. * In addition, the alpha port defines two simple interfaces 'clock' for attaching various real time clocks to the system and 'mcclock' for the many different variations of mc146818 clocks which can be attached to different alpha platforms. This eliminates two more function pointer tables in favour of the generic method dispatch system provided by the device framework. Future device interfaces may include: * cdev and bdev interfaces for devfs to use in replacement for specfs and the fixed interfaces bdevsw and cdevsw. * scsi interface to replace struct scsi_adapter (not sure how this works in CAM but I imagine there is something similar there). * various tailored interfaces for different bus types such as pci, isa, pccard etc.	1998-06-14 13:53:12 +00:00
Doug Rabson	99d11cde56	Major changes to the generic device framework for FreeBSD/alpha: * Eliminate bus_t and make it possible for all devices to have attached children. * Support dynamically extendable interfaces for drivers to replace both the function pointers in driver_t and bus_ops_t (which has been removed entirely. Two system defined interfaces have been defined, 'device' which is mandatory for all devices and 'bus' which is recommended for all devices which support attached children. * In addition, the alpha port defines two simple interfaces 'clock' for attaching various real time clocks to the system and 'mcclock' for the many different variations of mc146818 clocks which can be attached to different alpha platforms. This eliminates two more function pointer tables in favour of the generic method dispatch system provided by the device framework. Future device interfaces may include: * cdev and bdev interfaces for devfs to use in replacement for specfs and the fixed interfaces bdevsw and cdevsw. * scsi interface to replace struct scsi_adapter (not sure how this works in CAM but I imagine there is something similar there). * various tailored interfaces for different bus types such as pci, isa, pccard etc.	1998-06-14 13:46:10 +00:00
Poul-Henning Kamp	938ee3ce4d	Introduce std_pps_ioctl() to automagically DTRT. Add scaling capability to timex.offset, ntpd-4.0.73 will support this.	1998-06-13 09:30:26 +00:00
Doug Rabson	3900ddb2dc	Only build this on i386 for now. I may use it for the alpha later but currently it doesn't compile.	1998-06-11 07:23:59 +00:00
Julian Elischer	32f5d4d843	Replace 'sleep()' with 'tsleep()' Accidentally imported from Kirk's codebase. Pointed out by: various.	1998-06-10 22:02:14 +00:00
Julian Elischer	28913ebe4e	Submitted by: Kirk McKusick <mckusick@McKusick.COM> Fix for potential hang when trying to reboot the system or to forcibly unmount a soft update enabled filesystem. FreeBSD already handled the reboot case differently, this is however a better fix.	1998-06-10 18:13:19 +00:00
Doug Rabson	897cd717a5	Add initial support for the FreeBSD/alpha kernel. This is very much a work in progress and has never booted a real machine. Initial development and testing was done using SimOS (see http://simos.stanford.edu for details). On the SimOS simulator, this port successfully reaches single-user mode and has been tested with loads as high as one copy of /bin/ls :-). Obtained from: partly from NetBSD/alpha	1998-06-10 10:57:29 +00:00
Doug Rabson	8c12612cf6	64bit fixes: don't cast pointers to int.	1998-06-10 10:31:08 +00:00
Doug Rabson	2b605d0804	64bit fixes: don't cast p->p_retval to an int*.	1998-06-10 10:30:23 +00:00
Doug Rabson	831b9ef2be	64bit fixes: use u_long not int for ioctl command.	1998-06-10 10:29:31 +00:00
Doug Rabson	10d4743f6f	64bit fixes: use size_t not u_int for sizes.	1998-06-10 10:28:29 +00:00
Doug Rabson	2ef49ddfcb	64bit fixes: p->p_retval is a register_t[] not an int[].	1998-06-10 10:27:43 +00:00
Poul-Henning Kamp	a58f0f8e66	Add a tc_ prefix to struct timecounter members. Urged by: bde	1998-06-09 13:10:54 +00:00
Bruce Evans	1afde994e9	Pass lists of possible root devices and their names up to the machine-independent code and try mounting the devices in the lists instead of guessing alternative root devices in a machine- dependent way. autoconf.c: Reject preposterous slice numbers instead of silently converting them to COMPATIBILITY_SLICE. Don't forget to force slice = COMPATIBILITY_SLICE in the floppy device name. Eliminated most magic numbers and magic device names in setroot(). Fixed dozens of style bugs. vfs_conf.c: Put the actual root device name instead of "root_device" in the mount struct if the actual name is available. This is useful after booting with -s. If it were set in all cases then it could be used to do mount(8)'s ROOTSLICE_HUNT and fsck(8)'s hotroot guess better.	1998-06-09 12:52:35 +00:00
Bruce Evans	e7c1c309fa	Don't generate COMPAT_43 cruft if there are no COMPAT_43 syscalls. In particular, don't generate an include of "opt_compat.h" if it wouldn't affect anything we create. This will fix recent breakage of the ibcs2 LKM. The ibcs2 syscall files were not regenerated properly, so the LKM didn't break immediately when we started generating this extraneous include.	1998-06-09 03:32:05 +00:00
John Dyson	0d3dd8fbc5	Remove some junk left over from a previous commit. Submitted by: phk	1998-06-08 18:18:28 +00:00
Bruce Evans	414c93f3aa	Updated generated files.	1998-06-08 11:08:35 +00:00
Bruce Evans	bf0955a99d	Fixed some style bugs in output (missing tabs and unparenthesized macros). Fixed some style bugs in source (mostly, superfluous backslashes).	1998-06-08 11:02:00 +00:00
Doug Rabson	2e91d07af9	Fix a typo which prevented i386 elf from working at all (including Linux emulated elf binaries).	1998-06-08 09:19:35 +00:00
Poul-Henning Kamp	48115288df	Add a member function more to the timecounters, this one is for use with latch based PPS implementations. The client that uses it will be committed after more testing.	1998-06-07 20:36:55 +00:00
Doug Rabson	ecbb00a262	This commit fixes various 64bit portability problems required for FreeBSD/alpha. The most significant item is to change the command argument to ioctl functions from int to u_long. This change brings us inline with various other BSD versions. Driver writers may like to use (__FreeBSD_version == 300003) to detect this change. The prototype FreeBSD/alpha machdep will follow in a couple of days time.	1998-06-07 17:13:14 +00:00
Poul-Henning Kamp	dbb3475507	Add a "this" style argument and a "void *private" so timecounters can figure out which instance to wount with.	1998-06-07 08:40:53 +00:00
Bruce Evans	e3a03f0cfb	Don't attempt to copy the whole slices "struct" for DIOCGSLICEINFO. The slices "struct" isn't really a struct; we allocate only part of it in the fully dangerously dedicated case. Since the "struct" is malloced, the page beyond it may not be mapped, so attempts to copy it would crash. This problem became larger when the full struct was bloated from < 1K to > 3K by the addition of (mostly unused) DEVFS tokens some time before 2.2.0 was released.	1998-06-06 03:06:55 +00:00
David Greenman	b5afad7198	Moved limit frobbing (and the resulting limcopy()) that occurs for accounting to the accounting function so that this isn't needlessly done for some process exits. Reviewed by: bde,phk	1998-06-05 21:44:20 +00:00
David Greenman	9523f5c199	If we are out of mb_map space and we failed to m_reclaim() anything and the alloc is not M_DONTWAIT, then panic with "Out of mbuf clusters". Callers that specify M_WAIT can't deal with getting a NULL buffer, so this is a more graceful failure than randomly page faulting in the socket code or elsewhere.	1998-06-05 21:41:48 +00:00
John Dyson	e8f367853b	Correct sleep priority.	1998-06-02 05:39:13 +00:00
Peter Dufault	ce47711dee	Set PAGE_SIZE for _SC_PAGESIZE sysconf().	1998-06-01 21:54:43 +00:00
Peter Wemm	4dc75870b2	Have the wakeup routine do the upcall if needed. Obtained from: NetBSD	1998-05-31 18:38:43 +00:00
Poul-Henning Kamp	e796e00de3	Some cleanups related to timecounters and weird ifdefs in <sys/time.h>. Clean up (or if antipodic: down) some of the msgbuf stuff. Use an inline function rather than a macro for timecounter delta. Maintain process "on-cpu" time as 64 bits of microseconds to avoid needless second rollover overhead. Avoid calling microuptime the second time in mi_switch() if we do not pass through _idle in cpu_switch() This should reduce our context-switch overhead a bit, in particular on pre-P5 and SMP systems. WARNING: Programs which muck about with struct proc in userland will have to be fixed. Reviewed, but found imperfect by: bde	1998-05-28 09:30:28 +00:00
John Dyson	cf2819ccb8	Make flushing dirty pages work correctly on filesystems that unexpectedly do not complete writes even with sync I/O requests. This should help the behavior of mmaped files when using softupdates (and perhaps in other circumstances also.)	1998-05-21 07:47:58 +00:00
Peter Dufault	aebde78243	1. Add new defs for mins and maxs for the POSIX flavor priorities. They end up being the same, but it doesn't look like you're comparing apples and oranges. 2. Use need_resched instead of reset_priority. This isn't right either, since for example you'll round-robin against equal priority FIFO processes when lowering the priority of another process, but this works better and a real fix needs to be in kern_synch and not out here. 3. This is not a device driver: copyin/copyout the structure.	1998-05-19 21:11:53 +00:00
Poul-Henning Kamp	579f4456b9	Change a data type internal to the timecounters, and remove the "delta" function. Reviewed, but not entirely approved by: bde	1998-05-19 18:55:02 +00:00
Poul-Henning Kamp	58067a9909	Make the size of the msgbuf (dmesg) a "normal" option.	1998-05-19 08:58:53 +00:00
Tor Egge	afc6ea238f	Disallow reading the current kernel stack. Only the user structure and the current registers should be accessible. Reviewed by: David Greenman <dg@root.com>	1998-05-19 00:00:14 +00:00
Peter Dufault	2a61a11038	1. Don't use "nosys" and generate coredumps for unconfigured system calls - return ENOSYS per the spec. 2. Fix interface stub to set priority properly.	1998-05-18 12:53:45 +00:00
Tor Egge	2f1e70693d	Add forwarding of roundrobin to other cpus. This gives a more regular update of cpu usage as shown by top when one process is cpu bound (no system calls) while the system is otherwise idle (except for top). Don't attempt to switch to the BSP in boot(). If the system was idle when an interrupt caused a panic, this won't work. Instead, switch to the BSP in cpu_reset. Remove some spurious forward_statclock/forward_hardclock warnings.	1998-05-17 22:12:14 +00:00
Bruce Evans	ee002b68d1	Fixed interval calculation in realitimexpire() again. Obtained from: rev.1.9. Broken in: rev.1.50. Fixed a spelling error. Obtained from: Lite2.	1998-05-17 20:13:01 +00:00
Bruce Evans	c8b4782815	Fixed stale references to hzto() in comments.	1998-05-17 20:08:05 +00:00
Tor Egge	cb87a87c16	Supply the correct process argument to dounmount when possible.	1998-05-17 19:38:55 +00:00
Tor Egge	5931a9c24e	For SMP, use prv_PPAGE1/prv_PMAP1 instead of PADDR1/PMAP1. get_ptbase and pmap_pte_quick no longer generates IPIs. This should reduce the number of IPIs during heavy paging.	1998-05-17 18:53:19 +00:00
Poul-Henning Kamp	c21410e119	s/nanoruntime/nanouptime/g s/microruntime/microuptime/g Reviewed by: bde	1998-05-17 11:53:46 +00:00
Garrett Wollman	98271db4d5	Convert socket structures to be type-stable and add a version number. Define a parameter which indicates the maximum number of sockets in a system, and use this to size the zone allocators used for sockets and for certain PCBs. Convert PF_LOCAL PCB structures to be type-stable and add a version number. Define an external format for infomation about socket structures and use it in several places. Define a mechanism to get all PF_LOCAL and PF_INET PCB lists through sysctl(3) without blocking network interrupts for an unreasonable length of time. This probably still has some bugs and/or race conditions, but it seems to work well enough on my machines. It is now possible for `netstat' to get almost all of its information via the sysctl(3) interface rather than reading kmem (changes to follow).	1998-05-15 20:11:40 +00:00
Peter Wemm	9c4aed2ed7	Nuke signanosleep(). (I've left nanosleep1() seperate to nanosleep() as I don't want to mess with the multiple returns)	1998-05-14 11:31:08 +00:00
Peter Wemm	06b6493558	regen after signanosleep nuke	1998-05-14 11:29:06 +00:00
Peter Wemm	786cf38a29	deep-six signanosleep(). It sounded like a good idea at the time.	1998-05-14 11:28:11 +00:00
Peter Wemm	1973d51bfb	Commit an old change that has been sitting around for a long while. signanosleep() did not deal with signal masks properly. This change was based on a discussion with bde some time ago (at least 6 months or more). signanosleep() should probably go away since it was never really used for more than a few weeks and doesn't appear in released code. It should probably be killed before somebody uses it and it becomes a gratuitous nonstandard feature.	1998-05-14 10:38:52 +00:00
Bruce Evans	b322fb5d76	Backed out previous commit. It is invalid to call d_ioctl() on possibly non-open devices, and we don't want to restrict dumping to swap devices anwyay. It is especially invalid to call d_ioctl() in non-process context for panics. d_psize() can be called on non-open devices, at least on non-SLICED ones that support d_dump(), and setdumpdev() has depended on this for a long time although it is probably wrong, but even d_psize() can't be called in non-process context - that's why dumpsys() depends on previously computed values although these values may be stale. The historical restriction to devices with dkpart(dev) == SWAP_PART should go away.	1998-05-12 17:34:02 +00:00
John Dyson	1f56217280	Fix the futimes/undelete/utrace conflict with other BSD's. Note that the only common usage of utrace (the possible problem with this commit) is with malloc, so this should be a real problem. Add the various NetBSD syscalls that allow full emulation of their development environment.	1998-05-11 03:55:28 +00:00
John Dyson	f0175db1ee	Attempt to set write combining mode for graphics devices.	1998-05-11 01:06:08 +00:00
Mike Smith	7be2d30077	In the words of the submitter: --------- Make callers of namei() responsible for releasing references or locks instead of having the underlying filesystems do it. This eliminates redundancy in all terminal filesystems and makes it possible for stacked transport layers such as umapfs or nullfs to operate correctly. Quality testing was done with testvn, and lat_fs from the lmbench suite. Some NFS client testing courtesy of Patrik Kudo. vop_mknod and vop_symlink still release the returned vpp. vop_rename still releases 4 vnode arguments before it returns. These remaining cases will be corrected in the next set of patches. --------- Submitted by: Michael Hancock <michaelh@cet.co.jp>	1998-05-07 04:58:58 +00:00
Julian Elischer	7f2f1b784e	Add dump support to the DEVFS/slice code. now we can actually catch our crashes :-) Submitted by: Luoqi Chen <luoqi@chen.ml.org> (the man who's everywhere)	1998-05-06 22:14:48 +00:00
Mike Smith	79cc756d8b	As described by the submitter: Reverse the VFS_VRELE patch. Reference counting of vnodes does not need to be done per-fs. I noticed this while fixing vfs layering violations. Doing reference counting in generic code is also the preference cited by John Heidemann in recent discussions with him. The implementation of alternative vnode management per-fs is still a valid requirement for some filesystems but will be revisited sometime later, most likely using a different framework. Submitted by: Michael Hancock <michaelh@cet.co.jp>	1998-05-06 05:29:41 +00:00
John Dyson	96fb8cf258	Fix the shm panic. I mistakenly used the shadow_count to keep the object from being split, and instead added an OBJ_NOSPLIT.	1998-05-04 17:12:53 +00:00
John Dyson	cbd8ec0902	Work around some VM bugs, the worst being an overly aggressive swap space free calculation. More complete fixes will be forthcoming, in a week.	1998-05-04 03:01:44 +00:00
Bruce Evans	77849078bf	Oops, the previous commit should have changed `i386' to` __i386__', not `__i386'.	1998-05-01 16:40:21 +00:00
Bruce Evans	809e3a8464	Partially fixed write clustering for cases where cluster_wbuild() is called from vfs_bio_awrite() without going through cluster_write() or ufs_bmaparray(), in particular for all writes to block disk devices. Only ufs_bmaparray() sets vp->v_maxio in a correct way, and it doesn't seem to be called early enough even for regular files.	1998-05-01 16:29:27 +00:00
Peter Wemm	b1951f4028	vm_page_is_valid() wasn't expecting a large offset argument, it's expecting a sub-page offset. We were passing the file position, and vm_page_bits() could do some interesting things when base was larger PAGE_SIZE. if (size > PAGE_SIZE - base) size = PAGE_SIZE - base; is interesting when (PAGE_SIZE - base) is negative. I could imagine that this could have interesting consequences for memory page -> device block bit validation.	1998-05-01 15:10:59 +00:00
Peter Wemm	f806d5a257	Fix one problem with NFSv3 > 2GB file support. Submitted by: bde	1998-05-01 15:04:35 +00:00
Eivind Eklund	288078be0f	Translate T_PROTFLT to SIGSEGV instead of SIGBUS when running under Linux emulation. This make Allegro Common Lisp 4.3 work under FreeBSD! Submitted by: Fred Gilham <gilham@csl.sri.com> Commented on by: bde, dg, msmith, tg Hoping he got everything right: eivind	1998-04-28 18:15:08 +00:00
David E. O'Brien	cbcfa1ba6a	Discussed with: bde	1998-04-24 11:50:30 +00:00
David E. O'Brien	8f89f24fc3	Create virgin disklabels with 8 (MAXPARTITIONS) partitions rather than three (RAW_PART + 1); This makes ``disklabel -Brw sdN auto'' do the Right Thing.	1998-04-24 11:49:57 +00:00
David Greenman	9351a2295a	Added kern.ipc.nmbclusters	1998-04-24 04:15:52 +00:00
Julian Elischer	c0bab11dfe	Make the devfs SLICE option a standard type option. (hopefully it will go away eventually anyhow)	1998-04-20 03:57:41 +00:00
Julian Elischer	3e425b968d	Add changes and code to implement a functional DEVFS. This code will be turned on with the TWO options DEVFS and SLICE. (see LINT) Two labels PRE_DEVFS_SLICE and POST_DEVFS_SLICE will deliniate these changes. /dev will be automatically mounted by init (thanks phk) on bootup. See /sys/dev/slice/slice.4 for more info. All code should act the same without these options enabled. Mike Smith, Poul Henning Kamp, Soeren, and a few dozen others This code does not support the following: bad144 handling. Persistance. (My head is still hurting from the last time we discussed this) ATAPI flopies are not handled by the SLICE code yet. When this code is running, all major numbers are arbitrary and COULD be dynamically assigned. (this is not done, for POLA only) Minor numbers for disk slices ARE arbitray and dynamically assigned.	1998-04-19 23:32:49 +00:00
Dag-Erling Smørgrav	59bad7c53b	Backed out lseek changes.	1998-04-19 22:20:32 +00:00
Dag-Erling Smørgrav	25096724e8	Return EINVAL and do not change file pointer if resulting offset is negative. PR: kern/6184	1998-04-18 19:24:44 +00:00
Peter Wemm	37b8ccd37a	In vfs_msync(), test to see if the vnode being examined is "interesting" (ie: it has a vm_object attached and is marked as OBJ_MIGHTBEDIRTY) before attempting to lock it. This should reduce the cpu hit that is incurred when doing a sync(2) and when the syncer process is doing the 30-second writeback of dirty mmap() data to disk. Skip this speedup if we are doing an unmount() to be sure to get everything - we can afford to occasionally miss a msync while the system is running, but not at unmount. I'm not sure about the VXLOCK and MNT_WAIT case, it seems a bit odd to skip doing a page_clean at unmount time just because a vnode is VXLOCKed, but that's what was being done before...	1998-04-18 06:26:16 +00:00
Dag-Erling Smørgrav	dc73342347	Seventy-odd "its" / "it's" typos in comments fixed as per kern/6108.	1998-04-17 22:37:19 +00:00
Bruce Evans	ab36c3d3e7	Really finish supporting compiling with `gcc -ansi'.	1998-04-17 04:53:44 +00:00
Peter Wemm	efdc5523c0	When the softdep conversion took place, the periodic vfs_msync() from update got lost. This is responsible for ensuring that dirty mmap() pages get periodically written to disk. Without it, long time mmap's might not have their dirty pages written out at all of the system crashes or isn't cleanly shut down. This could be nasty if you've got a long-running writing via mmap(), dirty pages used to get written to disk within 30 seconds or so.	1998-04-16 03:31:26 +00:00
Tor Egge	71033a8c50	Unlock mountlist_slock if the mount point was busy (unmount in progress) during the attempt at lazy fsync.	1998-04-15 18:37:49 +00:00
Bruce Evans	c1087c1324	Support compiling with `gcc -ansi'.	1998-04-15 17:47:40 +00:00
Poul-Henning Kamp	115facb29d	Fix a minor mbuf leak created by the previous change. Reviewed by: phk Submitted by: pb@fasterix.freenix.org (Pierre Beyssac)	1998-04-14 06:24:43 +00:00
Poul-Henning Kamp	aba558930b	setsockopt() transports user option data in an mbuf. if the user data is greater than MLEN, setsockopt is unable to pass it onto the protocol handler. Allocate a cluster in such case. PR: 2575 Reviewed by: phk Submitted by: Julian Assange proff@iq.org	1998-04-11 20:31:46 +00:00
Poul-Henning Kamp	a2481bbe8e	When pmap_pinit0() allocates a page for proc0's page directory, kernal page table may need to be extended. But while growing the kernel page table (pmap_growkernel()), newly allocated kernel page table pages are entered into every process' page directory. For proc0, the page directory is not allocated yet, and results in a page fault. Eventually, the machine panics with "lockmgr: not holding exclusive lock". PR: 5458 Reviewed by: phk Submitted by: Luoqi Chen <luoqi@luoqi.watermarkgroup.com>	1998-04-11 17:24:06 +00:00
Alexander Langer	7c2e3d329a	Grammar police.	1998-04-10 00:09:04 +00:00
Wolfram Schneider	5ddc8ded1d	New mount option nosymfollow. If enabled, the kernel lookup() function will not follow symbolic links on the mounted file system and return EACCES (Permission denied).	1998-04-08 18:31:59 +00:00
Poul-Henning Kamp	5f88ec3625	Minor adjustments to the timecounting and proc0. Mostly Submitted by: bde	1998-04-08 09:01:53 +00:00
Peter Wemm	100ceca222	Today is not my lucky day. Fix missing brace and I got a request to use EMLINK instead.	1998-04-06 19:32:37 +00:00
Peter Wemm	193afe0189	Use a different errno (ELOOP (as sef mentioned) since the text that goes with the error sounds ok for the condition) if O_NOFOLLOW gets a link.	1998-04-06 18:43:28 +00:00
Peter Wemm	0fdc628b41	Rather than let users get fd's to symlink files, make O_NOFOLLOW cause an error if it gets a link (like it does if it gets a socket). The implications of letting users try and do file operations on symlinks themselves were too worrying.	1998-04-06 18:25:21 +00:00
Peter Wemm	7e3426aa1f	Implement a new open(2) flag: O_NOFOLLOW. This will instruct open to not follow symlinks, but to open a handle on the link itself(!). As strange as this might sound, it has several useful applications safe race-free ways of opening files in hostile areas (eg: /tmp, a mode 1777 /var/mail, etc). It also would allow things like fchown() to work on the link rather than having to implement a new syscall specifically for that task. Reviewed by: phk	1998-04-06 17:38:43 +00:00
Peter Wemm	aacdc613e5	curproc is initialized in locore at the same time for both SMP and UP now.	1998-04-06 15:51:22 +00:00
Peter Wemm	cf34ef61ee	Use real types for the SMP pages being allocated rather than arrays of ints. Remove some no longer needed casts. Initialize the per-cpu global data area using the structs rather than knowing too much about layout, alignment, etc.	1998-04-06 15:48:30 +00:00
Poul-Henning Kamp	2eeb0e2ea0	Make read_random() take a (void ) argument instead of (char )	1998-04-06 09:30:42 +00:00
Poul-Henning Kamp	4cf41af3d4	Make a kernel version of the timer* functions called timerval* to be more consistent. OK'ed by: bde	1998-04-06 08:26:08 +00:00
Poul-Henning Kamp	5704ba6a06	More fixes for the iterative case of nanosleep1 from bruce. I hate the 2-arg time{spec\|val}{add\|sub} functions!	1998-04-05 12:10:41 +00:00
Poul-Henning Kamp	bfe6c9fabf	Make the dummy timecounter run at 1 MHz rather than 100kHz (noticed by bde) fix the itimer(REAL) handling.	1998-04-05 11:49:36 +00:00
Peter Wemm	d59fbbf6c8	If there is no error code, don't copyout the remaining time. (As documented in the man page and the standards). (and besides, nanosleep1 isn't setting it in this case at present anyway, so we'd be copying junk).	1998-04-05 11:17:19 +00:00
Poul-Henning Kamp	338418263d	Fix nanosleep1 based on Bruces suggestion.	1998-04-05 10:28:01 +00:00
Andrey A. Chernov	80a39463c9	Remove unused atv.tv_usec = 0; from select/poll code	1998-04-05 10:03:52 +00:00
Peter Wemm	2257b488b9	tsleep() returns EWOULDBLOCK if the timeout expired. Don't return this to usermode, otherwise sleep(3) fails, cron doesn't work, etc etc etc.	1998-04-05 07:31:44 +00:00
Peter Wemm	b90dcc0c5d	Fix previous commit. Don't people read compiler messages or something??	1998-04-05 02:59:10 +00:00
Poul-Henning Kamp	91ad39c6b3	Handle double fraction overflow in nano & microtime functions (spotted by Bruce) Use tvtohz() a place where it fits.	1998-04-04 18:46:13 +00:00
Poul-Henning Kamp	00af9731c9	Time changes mark 2: * Figure out UTC relative to boottime. Four new functions provide time relative to boottime. * move "runtime" into struct proc. This helps fix the calcru() problem in SMP. * kill mono_time. * add timespec{add\|sub\|cmp} macros to time.h. (XXX: These may change!) * nanosleep, select & poll takes long sleeps one day at a time Reviewed by: bde Tested by: ache and others	1998-04-04 13:26:20 +00:00
John Dyson	aec0bcdf5b	Perhaps fix a problem that some drivers have that they don't properly initialize the b_kvasize element. This might fix some of the split I/O requests that some people have.	1998-04-04 05:55:05 +00:00
Poul-Henning Kamp	4ff16568be	Try to fix poll & select after I broke them.	1998-04-02 07:22:17 +00:00
Tor Egge	5758c2de94	Add two workarounds for broken MP tables: - Attempt to handle PCI devices where the interrupt is an ISA/EISA interrupt according to the mp table. - Attempt to handle multiple IO APIC pins connected to the same PCI or ISA/EISA interrupt source. Print a warning if this happens, since performance is suboptimal. This workaround is only used for PCI devices. With these two workarounds, the -SMP kernel is capable of running on my Asus P/I-P65UP5 motherboard when version 1.4 of the MP table is disabled.	1998-04-01 21:07:37 +00:00
Poul-Henning Kamp	460608e768	Fix an off by 1<<32 error.	1998-03-31 10:47:01 +00:00

... 4 5 6 7 8 ...

2258 Commits