freebsd-dev

History

Alan Cox 5268042bbd Revamp the default page clustering strategy that is used by the page fault handler. For roughly twenty years, the page fault handler has used the same basic strategy: Fetch a fixed number of non-resident pages both ahead and behind the virtual page that was faulted on. Over the years, alternative strategies have been implemented for optimizing the handling of random and sequential access patterns, but the only change to the default strategy has been to increase the number of pages read ahead to 7 and behind to 8. The problem with the default page clustering strategy becomes apparent when you look at how it behaves on the code section of an executable or shared library. (To simplify the following explanation, I'm going to ignore the read that is performed to obtain the header and assume that no pages are resident at the start of execution.) Suppose that we have a code section consisting of 32 pages. Further, suppose that we access pages 4, 28, and 16 in that order. Under the default page clustering strategy, we page fault three times and perform three I/O operations, because the first and second page faults only read a truncated cluster of 12 pages. In contrast, if we access pages 8, 24, and 16 in that order, we only fault twice and perform two I/O operations, because the first and second page faults read a full cluster of 16 pages. In general, truncated clusters are more common than full clusters. To address this problem, this revision changes the default page clustering strategy to align the start of the cluster to a page offset within the vm object that is a multiple of the cluster size. This results in many fewer truncated clusters. Returning to our example, if we now access pages 4, 28, and 16 in that order, the cluster that is read to satisfy the page fault on page 28 will now include page 16. So, the access to page 16 will no longer page fault and perform an I/O operation. Since the revised default page clustering strategy is typically reading more pages at a time, we are likely to read a few more pages that are never accessed. However, for the various programs that we looked at, including clang, emacs, firefox, and openjdk, the reduction in the number of page faults and I/O operations far outweighed the increase in the number of pages that are never accessed. Moreover, the extra resident pages allowed for many more superpage mappings. For example, if we look at the execution of clang during a buildworld, the number of (hard) page faults on the code section drops by 26%, the number of superpage mappings increases by about 29,000, but the number of never accessed pages only increases from 30.38% to 33.66%. Finally, this leads to a small but measureable reduction in execution time. In collaboration with: Emily Pettigrew <ejp1@rice.edu> Differential Revision: https://reviews.freebsd.org/D1500 Reviewed by: jhb, kib MFC after: 6 weeks		2015-01-16 18:17:09 +00:00
..
_vm_radix.h
default_pager.c
device_pager.c
memguard.c
memguard.h
phys_pager.c
pmap.h
redzone.c
redzone.h
sg_pager.c
swap_pager.c	\n at end of panicstr is redundant.	2014-11-23 18:32:21 +00:00
swap_pager.h
uma_core.c	Eliminate a stale debug message. The per-CPU cache locks were replaced	2014-12-31 17:44:57 +00:00
uma_dbg.c
uma_dbg.h
uma_int.h
uma.h
vm_extern.h
vm_fault.c	Revamp the default page clustering strategy that is used by the page fault	2015-01-16 18:17:09 +00:00
vm_glue.c
vm_init.c
vm_kern.c
vm_kern.h
vm_map.c
vm_map.h
vm_meter.c
vm_mmap.c	Always ignore the deprecated MAP_RENAME and MAP_NORESERVE flags to mmap().	2014-12-05 15:24:42 +00:00
vm_object.c	When the last reference on the vnode' vm object is dropped, read the	2014-12-05 15:02:30 +00:00
vm_object.h
vm_page.c	Add flag VM_ALLOC_NOWAIT for vm_page_grab() that prevents sleeping and	2014-12-22 09:02:21 +00:00
vm_page.h	Add flag VM_ALLOC_NOWAIT for vm_page_grab() that prevents sleeping and	2014-12-22 09:02:21 +00:00
vm_pageout.c
vm_pageout.h
vm_pager.c
vm_pager.h	Merge from projects/sendfile:	2014-11-23 12:01:52 +00:00
vm_param.h
vm_phys.c	The physical memory allocator supports the use of distinct free lists for	2014-12-31 00:54:38 +00:00
vm_phys.h	The physical memory allocator supports the use of distinct free lists for	2014-12-31 00:54:38 +00:00
vm_radix.c
vm_radix.h
vm_reserv.c	By the time that vm_reserv_init() runs, vm_phys_segs[] is initialized. Use	2014-11-22 17:46:30 +00:00
vm_reserv.h
vm_unix.c
vm_zeroidle.c
vm.h
vnode_pager.c	We already have "int i" in this scope.	2014-11-24 07:57:20 +00:00
vnode_pager.h	Merge from projects/sendfile:	2014-11-23 12:01:52 +00:00