plus the previous changes to use the zone allocator decrease the usage
of malloc by half. The zone allocator will be upgradeable to use
per-CPU pools, and makes more intelligent use of SPLs. Additionally,
it has reasonable stats gathering capabilities, while making most calls
inline.
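For illustration, a minimal usage sketch in the style of a zone interface;
the zinit()/zalloc()/zfree() names and arguments here are assumptions for
the sketch, not verbatim source:
    static vm_zone_t pv_zone;

    /* Set up a zone once: fixed-size items carved out of preallocated
     * pages, so the common alloc/free path is an inline free-list
     * pop/push rather than a malloc() call. */
    static void
    pv_zone_init(void)
    {
        pv_zone = zinit("PV ENTRY", sizeof(struct pv_entry), 0, 0, 5);
    }

    static struct pv_entry *
    pv_alloc(void)
    {
        return (zalloc(pv_zone));       /* constant time; stats kept */
    }

    static void
    pv_free(struct pv_entry *pv)
    {
        zfree(pv_zone, pv);
    }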
entry when handling a fault. This is set by procfs whenever it wants
to write to a page, as a means of overriding `r-x COW' entries, but
causes failures in the `rwx' case.
Submitted by: bde
Introduce VFREE which indicates that vnode is on freelist.
Rename vholdrele() to vdrop().
Create vfree() and vbusy() to add/delete vnode from freelist.
Add vfree()/vbusy() to keep (v_holdcnt != 0 || v_usecount != 0)
vnodes off the freelist.
Generalize vhold()/v_holdcnt to mean "do not recycle".
Fix reassignbuf()'s lack of use of vhold().
Use vhold() instead of checking v_cache_src list.
Remove vtouch(); the vnodes are always vget'ed soon enough
afterwards for it to have any measurable effect.
Add sysctl debug.freevnodes to keep track of things.
Move cache_purge() up in getnewvnode() to avoid race.
Decrement v_usecount after VOP_INACTIVE(), put a vhold() on
it during VOP_INACTIVE()
Unmacroize vhold()/vdrop()
Print out VDOOMED and VFREE flags (XXX: should use %b)
Reviewed by: dyson
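A minimal sketch of the invariant these changes establish, using the
function and field names above; the bodies are illustrative, not the
committed source:
    void
    vhold(struct vnode *vp)
    {
        if (vp->v_holdcnt == 0 && vp->v_usecount == 0)
            vbusy(vp);              /* take it off the freelist */
        vp->v_holdcnt++;            /* "do not recycle" */
    }

    void
    vdrop(struct vnode *vp)
    {
        vp->v_holdcnt--;
        if (vp->v_holdcnt == 0 && vp->v_usecount == 0)
            vfree(vp);              /* back on freelist, VFREE set */
    }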
allowed to choose the address, or that the MAP_FIXED address has the same
remainder modulo PAGE_SIZE as the file offset. Apparently this is
POSIX 1003.1b-specified behavior. SVR4 and the other *BSDs allow it too.
It costs us nothing to support and means we don't get EINVAL on some mmap
code that works perfectly elsewhere.
Obtained from: NetBSD
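A sketch of the relaxed check, assuming it sits in the mmap() argument
validation; variable names are illustrative:
    /* A MAP_FIXED address no longer needs a page-aligned file offset;
     * addr and the offset only have to agree modulo PAGE_SIZE so that
     * pages still land on natural boundaries. */
    if ((flags & MAP_FIXED) &&
        ((vm_offset_t)addr & PAGE_MASK) != (pos & PAGE_MASK))
        return (EINVAL);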
vm_inherit_t. These types are smaller than ints, so the prototypes
should have used the promoted type (int) to match the old-style function
definitions. Instead they use just vm_prot_t and/or vm_inherit_t, which
depends on gcc features to work. I fixed the definitions since this is easiest.
The correct fix may be to change the small types to u_int, to optimize
for time instead of space.
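A reduced illustration of the mismatch (not the kernel source): with an
old-style definition the parameter arrives default-promoted, so only a
prototype taking int is strictly compatible.
    typedef unsigned char vm_inherit_t;

    /* Prototype with the small type: */
    void vm_map_inherit(vm_inherit_t new_inheritance);

    /* Old-style definition: the caller default-promotes the argument
     * to int, so strict C deems this incompatible with the prototype
     * above; gcc happens to accept the pair.  Converting the
     * definition to new-style, as the fix did, removes the promotion
     * mismatch. */
    void
    vm_map_inherit(new_inheritance)
        vm_inherit_t new_inheritance;
    {
    }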
free list problem. Also, the vnode age flag is no longer used by the
vnode pager. (It is actually incorrect to use them.) Constructive
feedback welcome -- just be kind.
VM system's usage of the kernel lock (lockmgr) code. This is a first
pass implementation, and is expected to evolve as needed. The API
for the lock manager code has not changed, but the underlying implementation
has changed significantly. This change should not materially affect
our current SMP or UP code unless non-standard parameters are used.
the system is out of memory. The daemon does a minimal amount of work that
increases as the system becomes more likely to run out of memory and page in/out.
The default tuning is fairly low in background CPU usage, and sysctl variables
have been added to enable flexible operation. This is an experimental feature
that will likely be changed and improved over time.
of the kernel, and also most of the dynamic parts of the kernel. Additionally,
4MB pages will be allocated for display buffers as appropriate (only.)
The 4MB support for SMP isn't complete, but doesn't interfere with operation
either.
flag wasn't being respected during vref(), et al. Note that this
isn't the eventual fix for the locking problem. Fine grained SMP
in the VM and VFS code will require (lots) more work.
and b_validend. The changes to vfs_bio.c are a bit ugly but hopefully
can be tidied up later by a slight redesign.
PR: kern/2573, kern/2754, kern/3046 (possibly)
Reviewed by: dyson
the pageout daemon wasn't always being woken up appropriately when the
(cache + free) queues were depleted.
Submitted by: David S. Miller <davem@jenolan.rutgers.edu>
There are various options documented in i386/conf/LINT; there is more to
come over the next few days.
The kernel should run pretty much "as before" without the options to
activate SMP mode.
There are a handful of known "loose ends" that need to be fixed, but
have been put off since the SMP kernel is in a moderately good condition
at the moment.
This commit is the result of the tinkering and testing over the last 14
months by many people. A special thanks to Steve Passe for implementing
the APIC code!
fork. (On my machine, fork is about 240usecs, vfork is 78usecs.)
Implement rfork(!RFPROC !RFMEM), which allows a thread to divorce its memory
from the other threads of a group.
Implement rfork(!RFPROC RFCFDG), which closes all file descriptors, eliminating
possible existing shares with other threads/processes.
Implement rfork(!RFPROC RFFDG), which divorces the file descriptors for a
thread from the rest of the group.
Fix the case where a thread does an exec. It is almost nonsense for a thread
to modify the other threads' address space by an exec, so we
now automatically divorce the address space before modifying it.
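Hedged usage sketches of the three !RFPROC modes above, one call per mode
(flag spellings per <unistd.h>; error handling omitted):
    #include <unistd.h>

    static void
    examples(void)
    {
        rfork(0);       /* !RFPROC !RFMEM: no new process; divorce
                         * this thread's memory from the group */
        rfork(RFFDG);   /* !RFPROC RFFDG: take a private copy of the
                         * file descriptor table */
        rfork(RFCFDG);  /* !RFPROC RFCFDG: close all descriptors,
                         * dropping any existing shares */
    }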
space. (!)
Have each process use the kernel stack and pcb in the kvm space. Since
the stacks are at a different address, we cannot copy the stack at fork()
and allow the child to return up through the function call tree to return
to user mode - create a new execution context and have the new process
begin executing from cpu_switch() and go to user mode directly.
In theory this should speed up fork a bit.
Context switch the tss_esp0 pointer in the common tss. This is a lot
simpler than switching the gdt[GPROC0_SEL].sd.sd_base pointer to
each process's tss, since the esp0 pointer is a 32-bit pointer, while the
sd_base setting is split into three different bit sections at non-aligned
boundaries and requires a lot of twiddling to reset.
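For comparison, a rough sketch of the two update paths (descriptor base
layout per the i386 architecture; structure and variable names here are
illustrative):
    static void
    set_kstack(struct segment_descriptor *sd, u_int base, u_int kstack_top)
    {
        /* sd_base path: the 32-bit base is scattered across the
         * descriptor as bits 15:0, 23:16 and 31:24, so every process
         * switch re-packs bitfields. */
        sd->sd_lobase = base & 0xffffff;        /* bits 23:0 */
        sd->sd_hibase = (base >> 24) & 0xff;    /* bits 31:24 */

        /* tss_esp0 path: one aligned 32-bit store per switch. */
        common_tss.tss_esp0 = kstack_top;
    }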
The 8K of memory at the top of the process space is now empty, and unmapped
(and unmappable, it's higher than VM_MAXUSER_ADDRESS).
Simplify the pmap code to manage process contexts; we no longer have to
double map the UPAGES, which simplifies and should measurably speed up fork().
The following parts came from John Dyson:
Set PG_G on the UPAGES that are now in kernel context, and invalidate
them when swapping them out.
Move the upages object (upobj) from the vmspace to the proc structure.
Now that the UPAGES (pcb and kernel stack) are out of user space, make
rfork(..RFMEM..) do what was intended by sharing the vmspace
entirely via reference counting rather than simply inheriting the mappings.
The typo was detected once upon a time with the -Wunused compile option.
The result was that a block of code for implementing
madvise(.. MADV_SEQUENTIAL..) behavior was "dead" and unused, probably
negating the effect of activating the option.
Reviewed by: dyson
by Alan Cox <alc@cs.rice.edu>, and his description of the problem.
The bug was primarily in procfs_mem, but the mistake likely happened
due to the lack of vm system support for the operation. I added
better support for selective marking of page dirty flags so that
vm_map_pageable(wiring) will not cause this problem again.
The code in procfs_mem is now less bogus (but maybe still a little
so.)
centric rather than VM-centric to fix a problem with errors not being
detectable when the header is read.
Killed exech_map as a result of these changes.
There appears to be no performance difference with this change.
changes, so don't expect to be able to run the kernel as-is (very well)
without the appropriate Lite/2 userland changes.
The system boots and can mount UFS filesystems.
Untested: ext2fs, msdosfs, NFS
Known problems: Incorrect Berkeley ID strings in some files.
Mount_std mounts will not work until the getfsent
library routine is changed.
Reviewed by: various people
Submitted by: Jeffrey Hsu <hsu@freebsd.org>
the page to be unbusy, and it caused some algorithmic problems
as a result. There were some other problems with it also, so
this is a general cleanup of the code.
Submitted by: Douglas Crosher <dtc@scrooge.ee.swin.oz.au> and myself.
when allocating memory for network buffers at interrupt time. This is due
to inadequate checking for the new mcl_map. Fixed by merging mb_map and
mcl_map into a single mb_map.
Reviewed by: wollman
This will make a number of things easier in the future, as well as (finally!)
avoiding the Id-smashing problem which has plagued developers for so long.
Boy, I'm glad we're not using sup anymore. This update would have been
insane otherwise.
anymore with the "full" collapse fix that we added about 1yr ago!!! The
code has been removed by optioning it out for now, so we can put it back
in ASAP if any problems are found.
and objects. Previously, "fancy" memory management techniques
such as that used by the M3 RTS would have the tendency of chopping
up a process's allocated memory into lots of little objects. Alan
has come up with some improvements to mitigate the situation to
the point where even the M3 RTS only has one object for bss and
its managed memory (when running CVSUP.) (There are still cases where the
situation isn't improved when the system pages -- but this is much much
better for the vast majority of cases.) The system will now be able
to much more effectively merge map entries.
Submitted by: Alan Cox <alc@cs.rice.edu>
also implies VM_PROT_EXEC. We support it that way for now,
since the break system call by default gives VM_PROT_ALL. Now
we have a better chance of coalescing map entries when mixing
mmap/break type operations. This was contributing to excessive
numbers of map entries on the modula-3 runtime system. The
problem is still not "solved", but the situation makes more
sense.
Eventually, when we work on architectures where VM_PROT_READ
is orthogonal to VM_PROT_EXEC, we will have to visit this
issue carefully (esp. regarding security issues.)
maps. Additionally, eliminate the map->hint distortion
associated with useracc. That may or may not be the "right"
thing to do -- but time will tell.
Submitted by: Partially by Alan Cox <alc@cs.rice.edu>
vm_map_simplify and vm_map_simplify_entry. Make vm_map_simplify_entry
handle wired maps so that we can get rid of vm_map_simplify. Modify
the callers of vm_map_simplify to properly use vm_map_simplify_entry.
Submitted by: Alan Cox <alc@cs.rice.edu>
has the negative effect of disabling some map optimizations. This
patch defers the creation of the object until fault time, when it is needed.
Submitted by: Alan Cox <alc@cs.rice.edu>
that we do allow mlock to span unallocated regions (of course, not
mlocking them.) We also allow mlocking of RO regions (which the old
code couldn't.) The restriction there is that once an RO region is
wired (mlocked), it cannot be debugged (or EVER written to.)
Under normal usage, the new mlock code will be a significant improvement
over our old stuff.
that map entries are coalesced when appropriate. Also, conditionalize
some code that is currently not used in vm_map_insert. This mod
has been added to eliminate unnecessary map entries in buffer map.
Additionally, there were some cases where map coalescing could be done
when it shouldn't have been. That problem has been resolved.
scheme. Additionally, add the capability for checking for unexpected
kernel page faults. The maximum amount of kva space for buffers hasn't
been decreased from where it is, but it will now be possible to do so.
This scheme manages the kva space similar to the buffers themselves. If
there isn't enough kva space because of usage or fragmentation, buffers
will be reclaimed until a buffer allocation is successful. This scheme
should be very resistant to fragmentation problems until/if the LFS code
is fixed and uses the bogus buffer locking scheme -- but a 'fixed' LFS
is not likely to use such a scheme.
Now there should be NO problem allocating buffers up to MAXPHYS.
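A sketch of the allocate-or-reclaim loop described above, with an assumed
reclaim helper; the real logic lives in vfs_bio:
    /* Treat buffer kva like the buffers themselves: on failure
     * (exhaustion or fragmentation), reclaim an idle buffer's kva
     * and retry until the allocation succeeds. */
    while (vm_map_findspace(buffer_map, vm_map_min(buffer_map),
        maxsize, &addr) != 0) {
        if (!reclaim_one_buffer_kva())      /* assumed helper */
            tsleep(&needbuffer, PRIBIO, "bufkva", 0);
    }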
problem of allocating contiguous buffer memory in general, but
make it much more likely to work at boot-up time. The best
chance for an LKM-type load of a sound driver is immediately
after the mount of the root filesystem.
This appears to work for a 64K allocation on an 8MB system.
it breaks in the DEVFS_ROOT case. Replicate a bit too much of bdevvp()
in here to circumvent the problem. The real problem is the magic that
lives in bdevsw[1].
the one place that depended on it. wakeup() is now prototyped in
<sys/systm.h> so that it is normally visible.
Added nested include of <sys/queue.h> in <vm/vm_object.h>. The queue
macros are a more fundamental prerequisite for <vm/vm_object.h> than
the wakeup prototype and previously happened to be included by
namespace pollution from <sys/proc.h> or elsewhere.
64K. The change has essentially neutral effect on those machines with
little or no cache, and has a positive effect on "normal" machines
with 256K or more cache.
`show vmopag', `show page' and `show pageq'. Moved all vm ddb stuff
to the ends of the vm source files.
Changed printf() to db_printf(), `indent' to db_indent, and iprintf()
to db_iprintf() in ddb commands. Moved db_indent and db_iprintf()
from vm to ddb.
vm_page.c:
Don't use __pure. Staticized.
db_output.c:
Reduced page width from 80 to 79 to inhibit double spacing for long
lines (there are still some problems if words are printed across
column 79).
The default level works with minimal overhead, but one can also enable
full, efficient use of a 512K cache. (Parameters can be generated
to support arbitrary cache sizes also.)
The interface into the "VMIO" system has changed to be more consistent
and robust. Essentially, it is now no longer necessary to call vn_open
to get merged VM/Buffer cache operation, and exceptional conditions
such as merged operation of VBLK devices are simpler and more correct.
This code corrects a potentially large set of problems including the
problems with ktrace output and loaded systems, file create/deletes,
etc.
Most of the changes to NFS are cosmetic and name changes, eliminating
a layer of subroutine calls. The direct calls to vput/vrele have
been re-instituted for better cross platform compatibility.
Reviewed by: davidg
problem with the 'shell scripts' was found, but there was a 'strange'
problem found with a 486 laptop that we could not find. This commit
backs the code back to 25-jul, and will be re-entered after the snapshot
in smaller (more easily tested) chunks.
is little or no reason to create a swap pager for small mmap's. The
vm_map_insert code will automatically create a swap pager if the object
becomes too large. This fix is per a request from phk.
performance issues.
1) The pmap module has had too many inlines, and so the
object file is simply bigger than it needs to be.
Some common code is also merged into subroutines.
2) Removal of some *evil* PHYS_TO_VM_PAGE macro calls.
Unfortunately, a few have needed to be added also.
The removal caused the need for more vm_page_lookups.
I added lookup hints to minimize the need for the
page table lookup operations.
3) Removal of some bogus performance improvements, that
mostly made the code more complex (tracking individual
page table page updates unnecessarily). Those improvements
actually hurt 386 processors' performance (not that people who
worry about performance use 386 processors anymore :-)).
4) Changed pv queue manipulations/structures to be TAILQ's.
5) The pv queue code has had some performance problems since
day one. Some significant scalability issues are resolved
by threading the pv entries from the pmap AND the physical
address instead of just the physical address. This makes
certain pmap operations run much faster. This does
not affect most micro-benchmarks, but should help loaded system
performance *significantly*. DG helped and came up with most
of the solution for this one.
6) Most if not all pmap bit operations follow the pattern:
pmap_test_bit();
pmap_clear_bit();
That made for twice the necessary pv list traversal. The
pmap interface now supports only pmap_tc_bit type operations:
pmap_[test/clear]_modified, pmap_[test/clear]_referenced.
Additionally, the modified routine now takes a vm_page_t arg
instead of a phys address. This eliminates a PHYS_TO_VM_PAGE
operation.
7) Several rewrites of routines that contain redundant code to
use common routines, so that there is a greater likelihood of
keeping the cache footprint smaller.
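Item 6's before/after, sketched; the combined entry point name follows the
pmap_[test/clear] naming in the text, and the usage is illustrative:
    static int
    note_and_reset_modified(vm_page_t m)
    {
        /* Before: two full pv-list walks per page. */
        int was_mod = pmap_is_modified(VM_PAGE_TO_PHYS(m));
        pmap_clear_modify(VM_PAGE_TO_PHYS(m));

        /* After: one walk tests and clears together, and the
         * vm_page_t argument avoids a PHYS_TO_VM_PAGE lookup. */
        was_mod = pmap_tc_modified(m);
        return (was_mod);
    }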
performance to LRU or worse when RSS limiting takes effect. Also,
make an end condition in the active queue scan more efficient in the
case where pages are removed from the active queue as a side effect
of a pmap operation.
sleeps before being swapped out. The time is increased from 4 secs to
10 secs. Originally I had decreased it from 20 to 4, but that is a bit
severe. 20 is too long though.
Re-enables the RSS limiting, and the routine is now tail-recursive,
making it much more safe (eliminates the possibility of kernel stack
overflow.) Also, the RSS limiting is a little more intelligent about
finding the likely objects that are pushing the process over the limit.
Added some sysctls that help with VM system tuning.
New sysctl features:
1) Enable/disable lru pageout algorithm.
vm.pageout_algorithm = 0, default algorithm that works
well, especially using X windows and heavy
memory loading. Can have adverse effects,
sometimes slowing down program loading.
vm.pageout_algorithm = 1, close to true LRU. Works much
better than clock, etc. Does not work as well as
the default algorithm in general. Certain memory
"malloc" type benchmarks work a little better with
this setting.
Please give me feedback on the performance results
associated with these.
2) Enable/disable swapping.
vm.swapping_enabled = 1, default.
vm.swapping_enabled = 0, useful for cases where swapping
degrades performance.
The config option "NO_SWAPPING" is still operative, and
takes precedence over the sysctl. If "NO_SWAPPING" is
specified, the sysctl still exists, but "vm.swapping_enabled"
is hard-wired to "0".
Each of these can be changed "on the fly."
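From userland the knobs can be flipped on the fly; a sketch using
sysctlbyname(3), assuming it is available in the installed libc:
    #include <sys/types.h>
    #include <sys/sysctl.h>

    static int
    disable_swapping(void)
    {
        int off = 0;

        /* vm.swapping_enabled = 0: useful where swapping degrades
         * performance; under NO_SWAPPING it is hard-wired to 0. */
        return (sysctlbyname("vm.swapping_enabled",
            NULL, NULL, &off, sizeof(off)));
    }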
1) Make it much less likely to miss a wakeup in vm_page_free_wakeup
2) Create a new entry point into pmap: pmap_ts_referenced, eliminates
the need to scan the pv lists twice in many cases. Perhaps there
is a lot more to do here to work on minimizing pv list manipulation.
3) Minor improvements to vm_pageout including the use of pmap_ts_ref.
4) Major changes and code improvement to pmap. This code has had
several serious bugs in page table page manipulation. In order
to simplify the problem, and hopefully solve it once and for all,
page table pages are no longer "managed" with the pv list stuff.
Page table pages are only (mapped and held/wired) or
(free and unused) now. Page table pages are never inactive,
active or cached. These changes have probably fixed the
hold count problems, but if they haven't, then the code is
simpler anyway for future bugfixing.
5) The pmap code has been sorely in need of re-organization, and I
have taken a first (of probably many) steps. Please tell me
if you have any ideas.
1) Remove potential race conditions on waking up in vm_page_free_wakeup
by making sure that it is at splvm().
2) Fix another bug in vm_map_simplify_entry.
3) Be more complete about converting from default to swap pager
when an object grows to be large enough that there can be
a problem with data structure allocation under low memory
conditions.
4) Make some madvise code more efficient.
5) Added some comments.
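Item 1's fix follows the classic spl pattern; a sketch with an
illustrative body, not the committed source:
    static void
    vm_page_free_wakeup_sketch(void)
    {
        int s;

        s = splvm();    /* exclude interrupt-time frees and wakeups */
        if (vm_pages_needed &&
            cnt.v_free_count + cnt.v_cache_count >= cnt.v_free_reserved)
            wakeup(&cnt.v_free_count);  /* cannot be missed now */
        splx(s);
    }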
code improvements include significant simplification of the reservation
of the swap pager control blocks for reads. Add a panic for an inconsistent
swap pager control block count.
reserving "cached" pages before waking up the pageout daemon. This will reserve
the faulted page, and keep the system from thrashing itself to death given
this condition.
queue in vm_fault.
Move the PG_BUSY in vm_fault to the correct place.
Remove redundant/unnecessary code in pmap.c.
Properly block on rundown of page table pages, if they are busy.
I think that the VM system is in pretty good shape now, and the following
individuals (among others, in no particular order) have helped with this
recent bunch of bugs, thanks! If I left anyone out, I apologize!
Stephen McKay, Stephen Hocking, Eric J. Chet, Dan O'Brien, James Raynard,
Marc Fournier.
some problems with the page-table page management code, since it can't
deal with the notion of page-table pages being paged out or in transit.
Also, clean up some stylistic issues per some suggestions from
Stephen McKay.
operations don't work with FICTITIOUS pages.) Also, close a window
between PG_MANAGED and pmap_enter that can mess up the accounting of
the managed flag. This problem could likely cause a hold_count error
for page table pages.
queue corruption problems, and to apply Gary Palmer's code cleanups.
David Greenman helped with these problems also. There is still
a hang problem using X in small memory machines.
case where blocking can occur, thereby giving other processes a chance
to modify the queue where a page resides. This could cause numerous
process and system failures.
that the data structures needed to support the swap pager can take
enough space to fully deplete system memory, and cause a deadlock.
This change keeps large objects from being filled with dirty pages
without the appropriate swap pager data structures. Right now,
default objects greater than 1/4 the size of available system memory
are converted to swap objects, thereby eliminating the risk of deadlock.
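A sketch of the policy, with an assumed conversion helper; sizes here are
in pages:
    static void
    maybe_convert(vm_object_t object)
    {
        /* Convert big default (anonymous) objects to swap objects
         * early, while there is still memory to allocate the swap
         * pager's own data structures. */
        if (object->type == OBJT_DEFAULT &&
            object->size > cnt.v_page_count / 4)
            default_to_swap_pager(object);  /* assumed helper */
    }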
a condition when blocking can occur, and the daemon did not check properly
for a page remaining on the expected queue. Additionally, the inactive
target was being set much too large for small memory machines. It is now
being calculated based upon the amount of user memory available on every
pageout daemon run. Another problem was that if memory was very low, the
pageout daemon could fail repeatedly to traverse the inactive queue.
problem. BY MISTAKE, the vm_page_unqueue (or equiv) was removed from the
vm_fault code. Really bad things appear to happen if a page is on a queue
while it is being faulted.
contributions or ideas from Stephen McKay <syssgm@devetir.qld.gov.au>,
Alan Cox <alc@cs.rice.edu>, David Greenman <davidg@freebsd.org> and me:
More usage of the TAILQ macros. Additional minor fix to queue.h.
Performance enhancements to the pageout daemon.
Addition of a wait in the case that the pageout daemon
has to run immediately.
Slightly modify the pageout algorithm.
Significant revamp of the pmap/fork code:
1) PTE's and UPAGES's are NO LONGER in the process's map.
2) PTE's and UPAGES's reside in their own objects.
3) TOTAL elimination of recursive page table pagefaults.
4) The page directory now resides in the PTE object.
5) Implemented pmap_copy, thereby speeding up fork time.
6) Changed the pv entries so that the head is a pointer
and not an entire entry.
7) Significant cleanup of pmap_protect, and pmap_remove.
8) Removed significant amounts of machine dependent
fork code from vm_glue. Pushed much of that code into
the machine dependent pmap module.
9) Support more completely the reuse of already zeroed
pages (Page table pages and page directories) as being
already zeroed.
Performance and code cleanups in vm_map:
1) Improved and simplified allocation of map entries.
2) Improved vm_map_copy code.
3) Corrected some minor problems in the simplify code.
Implemented splvm (combo of splbio and splimp.) The VM code now
seldom uses splhigh.
Improved the speed of and simplified kmem_malloc.
Minor mod to vm_fault to avoid using pre-zeroed pages in the case
of objects with backing objects along with the already
existent condition of having a vnode. (If there is a backing
object, there will likely be a COW... With a COW, it isn't
necessary to start with a pre-zeroed page.)
Minor reorg of source to perhaps improve locality of ref.
is needed because of the vm_fault used to bring the page table page
for the kernel stack (UPAGES) back in. The consequence of the
previous incorrect change was a system hang.
in a suboptimal manner. I had also noticed some panics that appeared
to be at least superficially caused by this problem. Also, included
are some minor mods to support more general handling of page table page
faulting. More details in a future commit.
device have reference count problems. We mark the underlying object
non-persistent, and account for the reference count that the VM system
maintains for the special device close. This should fix the removable
device problem.
regarding the "real" problem with maps that we have been having
over the last few weeks. He noted that the first_free pointer was
left dangling in certain circumstances -- and he was right!!! This
should fix the map problems that we were having, and also give us the
advantage of being able to simplify maps more aggressively.
The pmap_remove in vm_map_clean incorrectly unmapped the entire
map entry.
The new vm_map_simplify_entry code had an error (the offset
of the combined map entry was not set correctly.)
Submitted by: Alan Cox <alc@cs.rice.edu>
were paged in under low swap space conditions to both lose their
backing store and their dirty bits. This would cause pages to
be demand zeroed under certain conditions in low VM space conditions
and consequential sig-11's or sig-10's. This situation was made
worse lately when the level for swap space reclaim threshold was
increased.
on in the FreeBSD development, I had made a global lock around the
rlist code. This was bogus, and now the lock is maintained on a
per resource list basis. This now allows the rlist code to be used for
almost any non-interrupt level application.
that Bruce asked for.
These still are not quite perfect, and in particular, it can get
upset on extreme boundary cases (addr = 0xfff, len = 0xffffffff,
which would end up mapping a single page rather than failing), but
this is better code than I committed before.
(note, the VM system does not (apparently) support single mmap segment
sizes above 0x80000000 anyway)
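The cited boundary case is unsigned wraparound; a sketch of the guard
(function shape illustrative):
    static int
    check_range(vm_offset_t addr, vm_size_t len)
    {
        /* addr = 0xfff, len = 0xffffffff: addr + len wraps to 0xffe,
         * so naive rounding maps a single page instead of failing. */
        if (addr + len < addr)
            return (EINVAL);
        return (0);
    }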
kern_fork.c: add the tiny bit of code for rfork operation.
kern/sysv_*: shmfork() takes one less arg, it was never used.
sys/shm.h: drop "isvfork" arg from shmfork() prototype
sys/param.h: declare rfork args.. (this is where OpenBSD put it..)
sys/filedesc.h: protos for fdshare/fdcopy.
vm/vm_mmap.c: add minherit code, add rounding to mmap() type args where
it makes sense.
vm/*: drop unused isvfork arg.
Note: this rfork() implementation copies the address space mappings,
it does not connect the mappings together. ie: once the two processes
have split, the pages may be shared, but the address space is not. If one
does a mmap() etc, it does not appear in the other. This makes it not
useful for pthreads, but it is useful in its own right for having
light-weight threads in a static shared address space.
Obtained from: Original by Ron Minnich, extended by OpenBSD
to help diagnose a problem on wcarchive (where the kernel stack was
sometimes not present), but is useful in its own right since swapping
actually reduces performance on some systems (such as wcarchive).
Note: swapping in this context means making the U pages pageable and has
nothing to do with generic VM paging, which is unaffected by this option.
Reviewed by: <dyson>
way to avoid crossing a 64K DMA boundary was to specify an alignment
greater than the size even when the alignment didn't matter, and for
sizes larger than a page, this reduced the chance of finding enough
contiguous pages. E.g., allocations of 8K not crossing a 64K boundary
previously had to be allocated on 8K boundaries; now they can be
allocated on any 4K boundary except (64 * n + 60)K.
Fixed bugs in vm_page_alloc_contig():
- the last page wasn't allocated for sizes smaller than a page.
- failures of kmem_alloc_pageable() weren't handled.
Mutated vm_page_alloc_contig() to create a more convenient interface
named contigmalloc(). This is the same as the one in 1.1.5 except
it has `low' and `high' args, and the `alignment' and `boundary'
args are multipliers instead of masks.
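A hedged usage sketch; the argument order (size, type, flags, low, high,
alignment, boundary) is taken from the description above, not checked
against the header:
    static void *
    alloc_dma_buf(void)
    {
        /* 8K of physically contiguous memory below 16MB for ISA DMA,
         * page-aligned, not crossing a 64K boundary: alignment and
         * boundary are multipliers now, not masks. */
        return (contigmalloc(8 * 1024, M_DEVBUF, M_NOWAIT,
            0, 16 * 1024 * 1024, PAGE_SIZE, 64 * 1024));
    }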
Speed up for vfs_bio -- addition of a routine bqrelse to greatly diminish
overhead for merged cache.
Efficiency improvement for vfs_cluster. It used to do a lot of redundant
calls to cluster_rbuild.
Correct the ordering for vrele of .text and release of credentials.
Use the selective tlb update for 486/586/P6.
Numerous fixes to the size of objects allocated for files. Additionally,
fixes in the various pagers.
Fixes for proper positioning of vnode_pager_setsize in msdosfs and ext2fs.
Fixes in the swap pager for exhausted resources. The pageout code
will not as readily thrash.
Change the page queue flags (PG_ACTIVE, PG_INACTIVE, PG_FREE, PG_CACHE) into
page queue indices (PQ_ACTIVE, PQ_INACTIVE, PQ_FREE, PQ_CACHE),
thereby improving efficiency of several routines.
Eliminate even more unnecessary vm_page_protect operations.
Significantly speed up process forks.
Make vm_object_page_clean more efficient, thereby eliminating the pause
that happens every 30 seconds.
Make sequential clustered writes B_ASYNC instead of B_DELWRI even in the
case of filesystems mounted async.
Fix a panic with busy pages when write clustering is done for non-VMIO
buffers.
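What the change from page queue flags to queue indices buys, sketched
(array and field names assumed, not the committed source):
    static void
    requeue_inactive(vm_page_t m)
    {
        /* Before: decode one of four PG_* flags, then branch per
         * queue.  After: m->queue indexes a table directly, so
         * requeueing a page is uniform and branch-free. */
        TAILQ_REMOVE(&vm_page_queues[m->queue], m, pageq);
        m->queue = PQ_INACTIVE;
        TAILQ_INSERT_TAIL(&vm_page_queues[m->queue], m, pageq);
    }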
redistribute a few last routines to better places and shoot the file.
I haven't actually 'deleted' the file yet, to give people time to
have done a config.. I.e. they are likely to have done one in a week or so,
so I'll remove it then..
it's now empty.
makes the question of a USL copyright rather moot.
seems to work here just fine, though I can't check every file
that changed due to limited h/w; however, I've checked enough to be pretty
happy with the code..
WARNING... struct lkm[mumble] has changed
so it might be an idea to recompile any lkm related programs
up. The effect of this was that msync with a size would generally sync
1 page less than it should. This problem was brought to my attention
by Darrel Herbst <dherbst@gradin.cis.upenn.edu> and Ron Minnich
<rminnich@sarnoff.com>.
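A sketch of the corrected rounding (macro names per the VM code; the
vm_map_clean() call shape is illustrative):
    static int
    msync_range(vm_map_t map, vm_offset_t addr, vm_size_t size)
    {
        /* Cover every page the range touches: truncate the start
         * down and round the end up, instead of rounding the size
         * alone and losing the final partial page. */
        vm_offset_t start = trunc_page(addr);
        vm_offset_t end = round_page(addr + size);

        return (vm_map_clean(map, start, end, TRUE, FALSE));
    }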
<sys/types.h> (if KERNEL is defined). This allows removing bogus
dependencies on vm stuff in several places (e.g., ddb) and stops
<vm_param.h> from depending on <vm_param.h>.
Added declaration of boolean_t to <vm/vm.h> (if KERNEL is not
defined). It never belonged in <vm/vm_param.h>. Unfortunately,
it is required for some vm headers that are included by applications.
Deleted declarations of TRUE and FALSE from <vm/vm_param.h>. They
are defined in <sys/param.h> if KERNEL is defined and we'll soon
find out if any applications depend on them being defined in a vm
header.
most devsw referenced functions are now static, as they are
in the same file as their devsw structure. I've also added DEVFS
support for nearly every device in the system, however
many of the devices have 'incorrect' names under DEVFS
because I couldn't quickly work out the correct naming conventions.
(but devfs won't be coming on line for a month or so anyhow so that doesn't
matter)
If you "OWN" a device which would normally have an entry in /dev
then search for the devfs_add_devsw() entries and munge to make them right..
check out similar devices to see what I might have done in them if you
can't see what's going on..
for a laugh compare conf.c conf.h before and after... :)
I have not done DEVFS entries for any DISKSLICE devices yet as that will be
a much more complicated job.. (pass 5 :)
pass 4 will be to make the devsw tables of type (cdevsw * )
rather than (cdevsw)
seems to work here..
complaints to the usual places.. :)
Move a lot of variables home to their own code (In good time before xmas :-)
Introduce the string description of format.
Add a couple more functions to poke into these marvels, while I try to
decide what the correct interface should look like.
Next is adding vars on the fly, and sysctl looking at them too.
Removed a tiny bit of defunct and #ifdefed unused code in swapgeneric.
Staticized some functions.
__purified some functions. Some functions were bogusly declared as
returning `const'. This hasn't done anything since gcc-2.5. For
later versions of gcc, the equivalent is __attribute__((const)) at
the end of function declarations.
That's EVERY SINGLE driver that has an entry in conf.c..
my next trick will be to define cdevsw[] and bdevsw[]
as empty arrays and remove all those DAMNED defines as well..
Each of these drivers has a SYSINIT linker set entry
that comes in very early.. and asks the driver to add its own
entry to the two devsw[] tables.
some slight reworking of the commits from yesterday (added the SYSINIT
stuff and some usually wrong but token DEVFS entries to all these
devices).
BTW does anyone know where the 'ata' entries in conf.c actually reside?
seems we don't actually have an ataopen() etc...
If you want to add a new device in conf.c
please make sure I know
so I can keep it up to date too..
as before, this is all dependent on #if defined(JREMOD)
(and #ifdef DEVFS in parts)
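The per-driver shape looks roughly like this (a sketch: the cdevsw_add()
and devfs_add_devsw() names come from this work, but the argument lists
and SYSINIT constants here are assumptions):
    static struct cdevsw foo_cdevsw =
        { fooopen, fooclose, fooread, foowrite /* ... */ };

    static void
    foo_drvinit(void *unused)
    {
        /* Called very early via the SYSINIT linker set: the driver
         * installs its own cdevsw[] entry rather than conf.c
         * hard-coding it. */
        cdevsw_add(&foo_cdevsw);            /* args assumed */
    #ifdef DEVFS
        /* devfs_add_devsw(...) for the /dev node goes here. */
    #endif
    }

    SYSINIT(foodev, SI_SUB_DRIVERS, SI_ORDER_MIDDLE, foo_drvinit, NULL)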
This closes a probably rare but nonetheless real window that would result
in a process hanging or the system panicing.
Reviewed by: dyson, davidg
Submitted by: kato@eclogite.eps.nagoya-u.ac.jp (KATO Takenori)
structs and prototypes for syscalls.
Ifdefed duplicated decentralized declarations of args structs. It's
convenient to have this visible but they are hard to maintain. Some
are already different from the central declarations. 4.4lite2 puts
them in comments in the function headers but I wanted to avoid the
large changes for that.
These functions went away:
enosys (hasn't been used for some time)
enxio
enodev
enoioctl (was used only once, actually for a vop)
if_tun.c:
Continued cleaning up...
conf.h:
Probably fixed the type of d_reset_t. It is hard to tell the correct
type because there are no non-dummy device reset functions.
Removed last vestige of ambiguous sleep message strings.
functions. The application header <sys/user.h> includes <vm/vm.h>
which includes <vm/lock.h>...
vm.h:
Don't include <machine/cpufunc.h>. It is already included by
<sys/systm.h> in the kernel and isn't designed to be included by
applications (the 2.1 version causes a syntax error in C++ and the
current version has initializers that are invalid in strict C++).
lock.h:
Only declare kernel functions if KERNEL is defined.
is swapped in. Also, remove unnecessary map locking/unlocking during
selection of processes to be swapped out.
This code might afford proper panics as opposed to spontaneous reboots
on certain systems. This should allow us to debug these problems better.
David Greenman, it has been determined that the more sophisticated code
only made a very minor difference in fault performance. Therefore, this
code eliminates some of the complication of the fault code, decreasing
the amount of CPU used to scan shadow chains.
UPAGES and associated page table page. Panic on error. This is less than
optimal and will be fixed in the future, but is better than the old
behavior of panicking with a "kernel page directory invalid" in pmap_enter.
Submitted by: terry (terry lambert)
This is a composite of 3 patch sets submitted by terry.
They are:
New low-level init code that supports loadable modules better
some cleanups in the namei code to help terry in 16-bit character support
some changes to the mount-root code to make it a little more
modular..
NOTE: mounting root off cdrom or NFS MIGHT be broken as I haven't been able
to test those cases..
certainly mounting root off disk still works just fine..
mfs should work but is untested. (tomorrows task)
The low level init stuff includes a total rewrite of init_main.c
to make it possible for new modules to have an init phase by simply
adding an entry to a TEXT_SET (or is it DATA_SET) list. Thus a new module can
be added to the kernel without editing any other files other than the
'files' file.
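The mechanism, sketched; the entry-point registration macro is the one
this change introduces, while the subsystem/order constants here are
illustrative:
    /* The linker set gathers one of these per module; init_main walks
     * the set in order at boot, so a new module needs no edits outside
     * its own file (plus the 'files' entry). */
    static void
    mymod_init(void *udata)
    {
        /* one-time setup for this module */
    }

    SYSINIT(mymod, SI_SUB_DRIVERS, SI_ORDER_ANY, mymod_init, NULL)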
NOTE: libkvm, w, ps, 'top', and any other utility which depends on struct
proc or any VM system structure will have to be rebuilt!!!
Much needed overhaul of the VM system. Included in this first round of
changes:
1) Improved pager interfaces: init, alloc, dealloc, getpages, putpages,
haspage, and sync operations are supported. The haspage interface now
provides information about clusterability. All pager routines now take
struct vm_object's instead of "pagers".
2) Improved data structures. In the previous paradigm, there was constant
confusion caused by pagers being both a data structure ("allocate a
pager") and a collection of routines. The idea of a pager structure has
essentially been eliminated. Objects now have types, and this type is
used to index the appropriate pager. In most cases, items in the pager
structure were duplicated in the object data structure and thus were
unnecessary. In the few cases that remained, a un_pager structure union
was created in the object to contain these items.
3) Because of the cleanup of #1 & #2, a lot of unnecessary layering can now
be removed. For instance, vm_object_enter(), vm_object_lookup(),
vm_object_remove(), and the associated object hash list were some of the
things that were removed.
4) simple_lock's removed. Discussion with several people reveals that the
SMP locking primitives used in the VM system aren't likely the mechanism
that we'll be adopting. Even if it were, the locking that was in the code
was very inadequate and would have to be mostly re-done anyway. The
locking in a uni-processor kernel was a no-op but went a long way toward
making the code difficult to read and debug.
5) Places that attempted to kludge-up the fact that we don't have kernel
thread support have been fixed to reflect the reality that we are really
dealing with processes, not threads. The VM system didn't have complete
thread support, so the comments and mis-named routines were just wrong.
We now use tsleep and wakeup directly in the lock routines, for instance.
6) Where appropriate, the pagers have been improved, especially in the
pager_alloc routines. Most of the pager_allocs have been rewritten and
are now faster and easier to maintain.
7) The pagedaemon pageout clustering algorithm has been rewritten and
now tries harder to output an even number of pages before and after
the requested page. This is sort of the reverse of the ideal pagein
algorithm and should provide better overall performance.
8) Unnecessary (incorrect) casts to caddr_t in calls to tsleep & wakeup
have been removed. Some other unnecessary casts have also been removed.
9) Some almost useless debugging code removed.
10) Terminology of shadow objects vs. backing objects straightened out.
The fact that the vm_object data structure essentially had this
backwards really confused things. The use of "shadow" and "backing
object" throughout the code is now internally consistent and correct
in the Mach terminology.
11) Several minor bug fixes, including one in the vm daemon that caused
0 RSS objects to not get purged as intended.
12) A "default pager" has now been created which cleans up the transition
of objects to the "swap" type. The previous checks throughout the code
for swp->pg_data != NULL were really ugly. This change also provides
the rudiments for future backing of "anonymous" memory by something
other than the swap pager (via the vnode pager, for example), and it
allows the decision about which of these pagers to use to be made
dynamically (although will need some additional decision code to do
this, of course).
13) (dyson) MAP_COPY has been deprecated and the corresponding "copy
object" code has been removed. MAP_COPY was undocumented and non-
standard. It was furthermore broken in several ways which caused its
behavior to degrade to MAP_PRIVATE. Binaries that use MAP_COPY will
continue to work correctly, but via the slightly different semantics
of MAP_PRIVATE.
14) (dyson) Sharing maps have been removed. Their marginal usefulness in a
threads design can be worked around in other ways. Both #12 and #13
were done to simplify the code and improve readability and maintain-
ability. (As were most all of these changes)
TODO:
1) Rewrite most of the vnode pager to use VOP_GETPAGES/PUTPAGES. Doing
this will reduce the vnode pager to a mere fraction of its current size.
2) Rewrite vm_fault and the swap/vnode pagers to use the clustering
information provided by the new haspage pager interface. This will
substantially reduce the overhead by eliminating a large number of
VOP_BMAP() calls. The VOP_BMAP() filesystem interface should be
improved to provide both a "behind" and "ahead" indication of
contiguousness.
3) Implement the extended features of pager_haspage in swap_pager_haspage().
It currently just says 0 pages ahead/behind.
4) Re-implement the swap device (swstrategy) in a more elegant way, perhaps
via a much more general mechanism that could also be used for disk
striping of regular filesystems.
5) Do something to improve the architecture of vm_object_collapse(). The
fact that it makes calls into the swap pager and knows too much about
how the swap pager operates really bothers me. It also doesn't allow
for collapsing of non-swap pager objects ("unnamed" objects backed by
other pagers).
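For items 1 and 2 above, the dispatch shape is roughly this (a sketch;
member and table names are assumptions consistent with the description):
    /* One ops vector per pager type; the object's type indexes the
     * table, replacing the old per-object "pager" structures. */
    struct pagerops {
        void        (*pgo_init)(void);
        vm_object_t (*pgo_alloc)(void *, vm_size_t, vm_prot_t, vm_offset_t);
        void        (*pgo_dealloc)(vm_object_t);
        int         (*pgo_getpages)(vm_object_t, vm_page_t *, int, int);
        int         (*pgo_putpages)(vm_object_t, vm_page_t *, int,
                        boolean_t, int *);
        boolean_t   (*pgo_haspage)(vm_object_t, vm_offset_t, int *, int *);
        void        (*pgo_sync)(void);
    };

    static struct pagerops *pagertab[] = {
        &defaultpagerops,       /* OBJT_DEFAULT */
        &swappagerops,          /* OBJT_SWAP */
        &vnodepagerops,         /* OBJT_VNODE */
        &devicepagerops,        /* OBJT_DEVICE */
    };

    /* haspage reports clusterability via the before/after counts. */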
that call vnode_pager_alloc() so that a failure return can be dealt with.
This fixes a panic seen on NFS clients when a file being opened is deleted
on the server before the open completes.
2) Removed unnecessary vm_object_lookup()/pager_cache(object, TRUE) pairs
after vnode_pager_alloc() calls - the object is already guaranteed to be
persistent.
3) Removed some gratuitous casts.
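Callers now look roughly like this (sketch; the argument list and error
choice are illustrative):
    static int
    open_backing_object(struct vnode *vp, vm_size_t size)
    {
        vm_object_t object;

        object = vnode_pager_alloc(vp, size, 0, 0);
        if (object == NULL)
            return (ENOENT);    /* e.g. the file was removed on the
                                 * server before the open finished */
        /* object is already persistent; no pager_cache() dance */
        return (0);
    }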
1) Files weren't properly synced on filesystems other than UFS. In some
cases, this led to lost data. Most likely would be noticed on NFS.
The fix is to make the VM page sync/object_clean general rather than
in each filesystem.
2) Mixing regular and mmaped file I/O on NFS was very broken. It caused
chunks of files to end up as zeroes rather than the intended contents.
The fix was to fix several race conditions and to kludge up the
"b_dirtyoff" and "b_dirtyend" that NFS relies upon - paying attention
to page modifications that occurred via the mmapping.
Reviewed by: David Greenman
Submitted by: John Dyson
These changes solve the problem in a general way by moving the
initialization out of the individual fs_mountroot's and into swaponvp().
Submitted by: Poul-Henning Kamp
changes. The check for nswap was bogus, but the code was so convoluted
that it was difficult to tell. It's better now. :-)
Reviewed by: David Greenman (extensively), and John Dyson
Submitted by: Poul-Henning Kamp, w/tweaks by me.
inconsistencies in the VM system that eventually lead to a panic. These
changes fix the behavior to conform to the behavior in SunOS, which is
to deny faults to pages beyond the EOF (returning SIGBUS). Internally,
this is implemented by requiring faults to be within the object size
boundaries. These changes exposed another bug, namely that passing in
an offset to mmap when trying to map an unnamed anonymous region also
results in internal inconsistencies. In this case, the offset is forced
to zero.
Reviewed by: John Dyson and others
require specific partitions be mentioned in the kernel config
file ("swap on foo" is now obsolete).
From Poul-Henning:
The visible effect is this:
As default, unless
options "NSWAPDEV=23"
is in your config, you will have four swap-devices.
You can swapon(2) any block device you feel like; it doesn't have
to be in the kernel config.
There is a performance/resource win available by getting the NSWAPDEV right
(but only if you have just one swap-device ??), but using that as default
would be too restrictive.
The invisible effect is that:
Swap-handling disappears from the $arch part of the kernel.
It gets a lot simpler (-145 lines) and cleaner.
Reviewed by: John Dyson, David Greenman
Submitted by: Poul-Henning Kamp, with minor changes by me.
with davidg about it, I hereby kill two undocumented misfeatures:
The code to skip a miniroot in the swapdev is not particularly useful, and
if we need it, we need it done properly, i.e. size the fs and skip all
of it, not some hardcoded size, and subtract what we skip from the length
in the first place.
The SEQSWAP dies too. It's not the way to do it, it doesn't work, and
nobody has expressed any great desire for it to work. The way to
implement it correctly would be a second argument to swapon(2) to give
priority/policy information. Low-priority swapdevs can be made so
by adding them at a far offset (0x80000000 kind of thing), with almost no
modification to the strategy routine (in particular an offset per swapdev).
But until the need is obvious, it will not be done.
to emit spurious "page outside of object"-type messages. It is not
a fatal condition anyway, so the message will be omitted for
release. Also, the code that "clips" the allocation size, associated
with the above problem, was fixed.
space for the hash list buckets and is a little faster. The features
of tailq aren't needed. Increased the size of the object hash table
to improve performance. In the future, this will be changed so that
the table is sized dynamically.
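The space reasoning, sketched with the <sys/queue.h> macros (type and
bucket count illustrative):
    #include <sys/queue.h>

    /* A TAILQ head holds two pointers, a LIST head one; hash buckets
     * never need tail insertion or reverse traversal, so LIST halves
     * the table's per-bucket overhead. */
    LIST_HEAD(objhashhead, vm_object);
    static struct objhashhead vm_object_hashtable[512];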