freebsd-skq

Author	SHA1	Message	Date
attilio	5725f63f57	MFC	2012-03-16 15:46:44 +00:00
attilio	f9319cf885	Fix the nodes allocator in architectures without direct-mapping: - Fix bugs in the free path where the pages were not unwired and relevant locking wasn't acquired. - Introduce the rnode_map, submap of kernel_map, where to allocate from. The reason is that, in architectures without direct-mapping, kmem_alloc*() will try to insert the newly created mapping while holding the vm_object lock introducing a LOR or lock recursion. rnode_map is however a leafly-used submap, thus there cannot be any deadlock. Notes: Size the submap in order to be, by default, around 64 MB and decrase the size of the nodes as the allocation will be much smaller (and when the compacting code in the vm_radix will be implemented this will aim for much less space to be used). However note that the size of the submap can be changed at boot time via the hw.rnode_map_scale scaling factor. - Use uma_zone_set_max() covering the size of the submap. Tested by: flo	2012-03-16 15:41:07 +00:00
jhb	1e515523f3	Pedantic nit: use vm_pindex_t instead of long for a count of pages.	2012-03-14 20:57:48 +00:00
flo	83ea608b00	IFC at r232948 Approved by: attilio	2012-03-14 00:41:37 +00:00
jhb	19feaba08b	Add KTR_VFS traces to track modifications to a vnode's writecount.	2012-03-08 20:27:20 +00:00
attilio	86fae10111	MFC	2012-03-07 11:18:38 +00:00
attilio	9e63566650	Fix a compile time bug by adding a check just after the struct definition	2012-03-06 23:37:53 +00:00
alc	9181f45b9f	Eliminate stale incorrect ARGSUSED comments. Submitted by: bde	2012-03-02 17:33:51 +00:00
attilio	df89a6a2db	- Exclude vm_radix_shrink() from the interface but retain the code still as it can be useful. - Make most of the interface private as it is unnecessary public right now. This will help in making nodes changing with arch and still avoid namespace pollution.	2012-03-01 00:54:08 +00:00
attilio	3c5fbc2c09	MFC	2012-03-01 00:27:51 +00:00
alc	54c1d2e89a	Simplify kmem_alloc() by eliminating code that existed on account of external pagers in Mach. FreeBSD doesn't implement external pagers. Moreover, it don't pageout the kernel object. So, the reasons for having code don't hold. Reviewed by: kib MFC after: 6 weeks	2012-02-29 05:41:29 +00:00
alc	867c58a8cb	Simplify vm_mmap()'s control flow. Add a comment describing what vm_mmap_to_errno() does. Reviewed by: kib MFC after: 3 weeks X-MFC after: r232071	2012-02-25 21:06:39 +00:00
attilio	d4c43cbb8b	MFC	2012-02-25 18:24:45 +00:00
alc	7d737c65b5	Simplify vmspace_fork()'s control flow by copying immutable data before the vm map locks are acquired. Also, eliminate redundant initialization of the new vm map's timestamp. Reviewed by: kib MFC after: 3 weeks	2012-02-25 17:49:59 +00:00
kib	8c39852ba5	Place the if() at the right location, to activate the v_writecount accounting for shared writeable mappings for all filesystems, not only for the bypass layers. Submitted by: alc Pointy hat to: kib MFC after: 20 days	2012-02-24 10:41:58 +00:00
kib	f315a59476	Account the writeable shared mappings backed by file in the vnode v_writecount. Keep the amount of the virtual address space used by the mappings in the new vm_object un_pager.vnp.writemappings counter. The vnode v_writecount is incremented when writemappings gets non-zero value, and decremented when writemappings is returned to zero. Writeable shared vnode-backed mappings are accounted for in vm_mmap(), and vm_map_insert() is instructed to set MAP_ENTRY_VN_WRITECNT flag on the created map entry. During deferred map entry deallocation, vm_map_process_deferred() checks for MAP_ENTRY_VN_WRITECOUNT and decrements writemappings for the vm object. Now, the writeable mount cannot be demoted to read-only while writeable shared mappings of the vnodes from the mount point exist. Also, execve(2) fails for such files with ETXTBUSY, as it should be. Noted by: tegge Reviewed by: tegge (long time ago, early version), alc Tested by: pho MFC after: 3 weeks	2012-02-23 21:07:16 +00:00
kib	ac9e4627bb	Remove wrong comment. Discussed with: alc MFC after: 3 days	2012-02-22 20:01:38 +00:00
alc	506ef17e7f	When vm_mmap() is used to map a vm object into a kernel vm_map, it makes no sense to check the size of the kernel vm_map against the user-level resource limits for the calling process. Reviewed by: kib	2012-02-16 06:45:51 +00:00
attilio	d58aaf5046	MFC	2012-02-14 19:58:00 +00:00
kib	dacbfe950a	Close a race due to dropping of the map lock between creating map entry for a shared mapping and marking the entry for inheritance. Other thread might execute vmspace_fork() in between (e.g. by fork(2)), resulting in the mapping becoming private. Noted and reviewed by: alc MFC after: 1 week	2012-02-11 17:29:07 +00:00
attilio	8fe59ec45c	MFC	2012-02-10 18:31:35 +00:00
ed	28b4a002d6	Remove direct access to si_name. Code should just use the devtoname() function to obtain the name of a character device. Also add const keywords to pieces of code that need it to build properly. MFC after: 2 weeks	2012-02-10 12:35:57 +00:00
flo	1e497814c3	fix KTR consistency I'm committing this on behalf of Attilio as he cannot access svn right now.	2012-02-05 18:55:20 +00:00
attilio	6587a6afdd	Remove the panic from vm_radix_insert() and propagate the error to the callers of vm_page_insert(). The default action for every caller is to unwind-back the operation besides vm_page_rename() where this has proven to be impossible to do. For that case, it just spins until the page is not available to be allocated. However, due to vm_page_rename() to be mostly rare (and having never hit this panic in the past) it is tought to be a very seldom thing and not a possible performance factor. The patch has been tested with an atomic counter returning NULL from the zone allocator every 1/100000 allocations. Per-printf, I've verified that a typical buildkernel could trigger this 30 times. The patch survived to 2 hours of repeated buildkernel/world. Several technical notes: - The vm_page_insert() is moved, in several callers, closer to failure points. This could be committed separately before vmcontention hits the tree just to verify -CURRENT is happy with it. - vm_page_rename() does not need to have the page lock in the callers as it hide that as an implementation detail. Do the locking internally. - now vm_page_insert() returns an int, with 0 meaning everything was ok, thus KPI is broken by this patch.	2012-02-05 17:37:26 +00:00
attilio	3454102b5b	MFC	2012-02-04 17:18:16 +00:00
mav	3cc9e27b24	Fix NULL dereference panic on attempt to turn off (on system shutdown) disconnected swap device. This is quick and imperfect solution, as swap device will still be opened and GEOM will not be able to destroy it. Proper solution would be to automatically turn off and close disconnected swap device, but with existing code it will cause panic if there is at least one page on device, even if it is unimportant page of the user-level process. It needs some work. Reviewed by: kib@ MFC after: 1 week	2012-02-01 20:12:44 +00:00
attilio	1b454e6b83	Fix a bug in vm_radix_leaf() where the shifting start address can wrap-up at some point. This bug is triggered very easilly by indirect blocks in UFS which grow negative resulting in very high counts. In collabouration with: flo	2012-01-29 16:44:21 +00:00
attilio	8bc5caadc8	Fix format string for the pindex members as they should be treated as uintmax_t for compatibility among 32/64 bits.	2012-01-29 16:29:06 +00:00
attilio	810afc9780	Make an assertion stronger and improve the printout for easier bug catching when it is not possible to dump	2012-01-29 16:11:25 +00:00
kmacy	84d434965a	exclude kmem_alloc'ed ARC data buffers from kernel minidumps on amd64 excluding other allocations including UMA now entails the addition of a single flag to kmem_alloc or uma zone create Reviewed by: alc, avg MFC after: 2 weeks	2012-01-27 20:18:31 +00:00
nwhitehorn	bf2ee27f25	Revert r212360 now that PowerPC can handle large sparse arguments to pmap_remove() (changed in r228412). MFC after: 2 weeks	2012-01-17 00:31:09 +00:00
kib	247e21eaf0	Change the type of the paging_in_progress refcounter from u_short to u_int. With the auto-sized buffer cache on the modern machines, UFS metadata can generate more the 65535 pages belonging to the buffers undergoing i/o, overflowing the counter. Reported and tested by: jimharris Reviewed by: alc MFC after: 1 week	2012-01-10 18:05:44 +00:00
attilio	abdebe75b2	MFC	2012-01-05 23:12:19 +00:00
kib	aec33a2b90	Do not restart the scan in vm_object_page_clean() on the object generation change if requested mode is async. The object generation is only changed when the object is marked as OBJ_MIGHTBEDIRTY. For async mode it is enough to write each dirty page, not to make a guarantee that all pages are cleared after the vm_object_page_clean() returned. Diagnosed by: truckman Tested by: flo Reviewed by: alc, truckman MFC after: 2 weeks	2012-01-04 16:04:20 +00:00
attilio	7ba6fbeca7	Fix a spot missed during the last merge.	2012-01-01 21:46:16 +00:00
attilio	a4ffaeb982	MFC	2012-01-01 20:18:40 +00:00
alc	7f817ed8c5	Optimize vm_object_split()'s handling of reservations.	2011-12-28 20:27:18 +00:00
kib	c2a7f10253	Optimize the common case of msyncing the whole file mapping with MS_SYNC flag. The system must guarantee that all writes are finished before syscalls returned. Schedule the writes in async mode, which is much faster and allows the clustering to occur. Wait for writes using VOP_FSYNC(), since we are syncing the whole file mapping. Potentially, the restriction to only apply the optimization can be relaxed by not requiring that the mapping cover whole file, as it is done by other OSes. Reported and tested by: az Reviewed by: alc MFC after: 2 weeks	2011-12-23 09:09:42 +00:00
kib	cfa70889cb	Move kstack_cache_entry into the private header, and make the stack cache list header accessible outside vm_glue.c. MFC after: 1 week	2011-12-16 10:56:16 +00:00
eadler	d29af4d1bc	- The previous commit (r228449) accidentally moved the vm.stats.vm.* sysctls to vm.stats.sys. Move them back. Noticed by: pho Reviewed by: bde (earlier version) Approved by: bz MFC after: 1 week Pointy hat to: me	2011-12-14 13:25:00 +00:00
eadler	3072a90209	Document a large number of currently undocumented sysctls. While here fix some style(9) issues and reduce redundancy. PR: kern/155491 PR: kern/155490 PR: kern/155489 Submitted by: Galimov Albert <wtfcrap@mail.ru> Approved by: bde Reviewed by: jhb MFC after: 1 week	2011-12-13 00:38:50 +00:00
kib	2893f40afd	Fix printf. Submitted by: az MFC after: 1 week	2011-12-12 10:04:04 +00:00
attilio	1f27e97ae5	Use atomics for rn_count on leaf node because RED operations happen without the VM_OBJECT_LOCK held, thus can be concurrent with BLACK ones. However, also use a write memory barrier in order to not reorder the operation of decrementing rn_count in respect fetching the pointer. Discussed with: jeff	2011-12-06 22:57:48 +00:00
attilio	2436e63a9c	- Make rn_count 32-bits as it will naturally pad for 32-bit arches - Avoid to use atomic to manipulate it at level0 because it seems unneeded and introduces a bug on big-endian architectures where only the top half (2 bits) of the double-words are written (as sparc64, for example, doesn't support atomics at 16-bits) heading to a wrong handling of rn_count. Reported by: flo, andreast Found by: marius No answer by: jeff	2011-12-06 19:04:45 +00:00
alc	a8855af4c0	Introduce vm_reserv_alloc_contig() and teach vm_page_alloc_contig() how to use superpage reservations. So, for the first time, kernel virtual memory that is allocated by contigmalloc(), kmem_alloc_attr(), and kmem_alloc_contig() can be promoted to superpages. In fact, even a series of small contigmalloc() allocations may collectively result in a promoted superpage. Eliminate some duplication of code in vm_reserv_alloc_page(). Change the type of vm_reserv_reclaim_contig()'s first parameter in order that it be consistent with other vm_*_contig() functions. Tested by: marius (sparc64)	2011-12-05 18:29:25 +00:00
andreast	8c385e6008	Fix compilation issue on 32-bit targets. Reviewed by: attilio	2011-12-05 16:06:12 +00:00
attilio	8fbad61ab4	Revert a change that sneaked in during the last MFC.	2011-12-02 23:21:59 +00:00
attilio	b2701fb716	MFC	2011-12-02 21:45:46 +00:00
kib	d326d5565d	Rename vm_page_set_valid() to vm_page_set_valid_range(). The vm_page_set_valid() is the most reasonable name for the m->valid accessor. Reviewed by: attilio, alc	2011-11-30 17:39:00 +00:00
kib	a441abaf37	Hide the internals of vm_page_lock(9) from the loadable modules. Since the address of vm_page lock mutex depends on the kernel options, it is easy for module to get out of sync with the kernel. No vm_page_lockptr() accessor is provided for modules. It can be added later if needed, unless proper KPI is developed to serve the needs. Reviewed by: attilio, alc MFC after: 3 weeks	2011-11-29 13:07:32 +00:00

1 2 3 4 5 ...

2936 Commits