Commit Graph

219 Commits

Author SHA1 Message Date
Alan Cox
b8e7fc24fe Add configuration knobs for the superpage reservation system. Initially,
the reservation will only be enabled on amd64.
2007-12-27 16:45:39 +00:00
Joseph Koshy
0da7aa7a7d Add stubs to unbreak LINT. 2007-12-07 13:45:47 +00:00
Robert Watson
3c90d1ea74 Break out stack(9) from ddb(4):
- Introduce per-architecture stack_machdep.c to hold stack_save(9).
- Introduce per-architecture machine/stack.h to capture any common
  definitions required between db_trace.c and stack_machdep.c.
- Add a new kernel option, "options STACK"; stack(9) is built in if it
  is defined, and also if "options DDB" is defined, to preserve
  compatibility with existing users of stack(9).

Add a new stack_save_td(9) function, which captures a stack trace of a
thread other than the current thread, to which the existing
stack_save(9) was limited.  The target thread must be neither swapped
out nor running; enforcing this is the consumer's responsibility.
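
A minimal sketch of a consumer, assuming the caller satisfies those
constraints under the thread lock (the locking discipline here is
illustrative, not prescribed by this commit):

    /* Sketch: capture another thread's stack, enforcing the "neither
     * running nor swapped out" requirement while holding the thread
     * lock. */
    struct stack st;

    stack_zero(&st);
    thread_lock(td);
    if (!TD_IS_RUNNING(td) && !TD_IS_SWAPPED(td))
            stack_save_td(&st, td);
    thread_unlock(td);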

Update stack(9) man page.

Build tested:	amd64, arm, i386, ia64, powerpc, sparc64, sun4v
Runtime tested:	amd64 (rwatson), arm (cognet), i386 (rwatson)
2007-12-02 20:40:35 +00:00
Olivier Houchard
b21a1da537 Close a race.
The RAS implementation would set the end address, then the start
address.  The kernel used these values to restart a RAS sequence if it
was interrupted.  When the thread-switching code ran, it would check
these values and, if the interrupted PC fell within the sequence, reset
the PC to the start address and clear both values.

However, there is a small flaw in this scheme.  Thread T1 sets the end
address and gets preempted.  Thread T2 runs and also performs a RAS
operation, which resets the end address to zero.  Thread T1 now runs
again, sets the start address, and begins the RAS sequence, but is
preempted before the sequence executes its last instruction.  The
kernel code that would ordinarily restart the RAS sequence does not,
because the PC is not between start and 0, so the PC is not reset to
the start of the sequence.  When T1 resumes, it is therefore at the
wrong location for RAS to produce the correct results, and the atomic
sequence yields the wrong result.

The window for the first race is 3 instructions.  The window for the
second race is 5-10 instructions depending on the atomic operation.
This makes this failure fairly rare and hard to reproduce.

Mutexes are implemented in libthr using atomic operations.  When the
above race occurred, a lock could get stuck locked, causing the many
downstream problems you might expect.

Also, reset the start and end addresses on syscall entry; otherwise a
malicious process could set them before making a syscall.
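
For reference, a sketch of the kernel-side restart check described
above (variable and field names are hypothetical, not the actual
implementation):

    /* At context-switch time: if the thread was preempted inside its
     * registered RAS window, rewind it to the start of the sequence
     * and clear the window.  Under the race above, end could be 0
     * while start was set, so this test never fired. */
    if (pc > td->td_ras_start && pc < td->td_ras_end) {
            tf->tf_pc = td->td_ras_start;
            td->td_ras_start = td->td_ras_end = 0;
    }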

Reviewed by: imp, ups (thanks guys)
Pointy hat to:	cognet
MFC After:	3 days
2007-12-02 12:49:28 +00:00
Olivier Houchard
9acb0e651b In atomic_fetchadd_32(), do not blindly increase the value of %3.
It should contain only the value we want to add, because if we are
interrupted between the add and the str, we restart from the
beginning.  Use a scratch register for the sum instead.
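
As a reference model only (the real ARM code uses a RAS, not a
builtin), the intended semantics of atomic_fetchadd_32() are:

    #include <stdint.h>

    /* Return the old value and add v to *p, atomically; a sketch of
     * the semantics, not the RAS-based implementation. */
    static inline uint32_t
    fetchadd_32_model(volatile uint32_t *p, uint32_t v)
    {
            return (__atomic_fetch_add(p, v, __ATOMIC_SEQ_CST));
    }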

MFC After:	1 week
2007-11-27 22:12:05 +00:00
Kevin Lo
92e7748daf __CPU_XSCALE_PXA2XX -> CPU_XSCALE_PXA2X0 2007-11-01 10:01:15 +00:00
Warner Losh
63b2597849 Merge support from p4 (originally from NetBSD) for the arm9e, arm10,
and arm11 cores.  Not yet connected to the build, but this reduces
diffs to the p4 repo.

Obtained from: NetBSD
2007-10-18 05:33:06 +00:00
Warner Losh
dfb7d4cdef Merge definitions for ARM9E, ARM10 and ARM11 processors from p4 (which
got them from NetBSD).
2007-10-18 05:06:58 +00:00
Olivier Houchard
258f866cbf Define _ARM_ARCH_5E too, so that we know if pld/strd/ldrd are available.
MFC After:	3 days
2007-10-13 12:04:10 +00:00
Alan Cox
7bfda801a8 Change the management of cached pages (PQ_CACHE) in two fundamental
ways:

(1) Cached pages are no longer kept in the object's resident page
splay tree and memq.  Instead, they are kept in a separate per-object
splay tree of cached pages.  However, access to this new per-object
splay tree is synchronized by the _free_ page queues lock, not to be
confused with the heavily contended page queues lock.  Consequently, a
cached page can be reclaimed by vm_page_alloc(9) without acquiring the
object's lock or the page queues lock.

This solves a problem independently reported by tegge@ and Isilon.
Specifically, they observed the page daemon consuming a great deal of
CPU time because of pages bouncing back and forth between the cache
queue (PQ_CACHE) and the inactive queue (PQ_INACTIVE).  The source of
this problem turned out to be a deadlock avoidance strategy employed
when selecting a cached page to reclaim in vm_page_select_cache().
However, the root cause was really that reclaiming a cached page
required the acquisition of an object lock while the page queues lock
was already held.  Thus, this change addresses the problem at its
root, by eliminating the need to acquire the object's lock.

Moreover, keeping cached pages in the object's primary splay tree and
memq was, in effect, optimizing for the uncommon case.  Cached pages
are reclaimed far, far more often than they are reactivated.  Instead,
this change makes reclamation cheaper, especially in terms of
synchronization overhead, and reactivation more expensive, because
reactivated pages will have to be reentered into the object's primary
splay tree and memq.

(2) Cached pages are now stored alongside free pages in the physical
memory allocator's buddy queues, increasing the likelihood that large
allocations of contiguous physical memory (i.e., superpages) will
succeed.

Finally, as a result of this change long-standing restrictions on when
and where a cached page can be reclaimed and returned by
vm_page_alloc(9) are eliminated.  Specifically, calls to
vm_page_alloc(9) specifying VM_ALLOC_INTERRUPT can now reclaim and
return a formerly cached page.  Consequently, a call to malloc(9)
specifying M_NOWAIT is less likely to fail.
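
One practical consequence, sketched with a hypothetical driver
snippet:

    /* M_NOWAIT allocations like this are less likely to fail now that
     * vm_page_alloc(9) can reclaim a cached page without taking the
     * object lock or the page queues lock. */
    buf = malloc(size, M_DEVBUF, M_NOWAIT);
    if (buf == NULL)
            return (ENOMEM);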

Discussed with: many over the course of the summer, including jeff@,
   Justin Husted @ Isilon, peter@, tegge@
Tested by: an earlier version by kris@
Approved by: re (kensmith)
2007-09-25 06:25:06 +00:00
Olivier Houchard
75f66155bf Twist the RAS logic a bit to avoid branching.
MFC After:	1 week
Approved by:	re (blanket)
2007-09-22 14:23:52 +00:00
Olivier Houchard
4168e66b1f In __bswap16_var(), make sure the upper 16 bits are cleared; while
optimizing, gcc4 does not always do so.
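
A minimal sketch of the pattern, with explicit masks so the result is
well-defined regardless of how the compiler widens the operands:

    #include <stdint.h>

    /* Sketch only: mask both byte lanes so the upper 16 bits of the
     * 32-bit intermediate can never leak into the result. */
    static inline uint16_t
    bswap16_sketch(uint32_t x)
    {
            return ((uint16_t)(((x & 0xff) << 8) | ((x >> 8) & 0xff)));
    }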

Reported by:	Nathan Whitehorn
Approved by:	re (blanket)
2007-09-09 11:58:38 +00:00
Olivier Houchard
5f78cb4a35 XScale core 3 definitions.
Approved by:	re (blanket)
2007-07-27 14:54:27 +00:00
Olivier Houchard
e905513c06 Fix the cache mode description.
Approved by:	re (blanket)
2007-07-27 14:45:33 +00:00
Olivier Houchard
b4db6fd942 Properly handle supersections.
Make sure we cache entries in the L2 cache.

Approved by:	re (blanket)
2007-07-27 14:45:04 +00:00
Olivier Houchard
425b5be335 Add a new set of functions to handle the L2 cache.  Make them no-ops
for every CPU except XScale core 3.
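
The dispatch can be sketched as function pointers that default to
no-ops and are only pointed at real routines on XScale core 3 (a
simplification of the cpufunc-style indirection; names are
illustrative):

    /* Default L2 hooks do nothing; XScale core 3 setup replaces them
     * with real write-back/invalidate routines. */
    static void
    l2cache_nop(vm_offset_t va, vm_size_t sz)
    {
    }

    void (*cpu_l2cache_wbinv_range)(vm_offset_t, vm_size_t) = l2cache_nop;
    void (*cpu_l2cache_inv_range)(vm_offset_t, vm_size_t) = l2cache_nop;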

Approved by:	re (blanket)
2007-07-27 14:39:41 +00:00
Olivier Houchard
d076bcf203 The iop34x has 128 interrupts. 2007-06-16 15:03:33 +00:00
Olivier Houchard
10d8c18005 Introduce pmap_kenter_supersection(), which maps 16MB supersections
into the kernel pmap.
Document the behavior of the XScale core 3 in a bit more detail.
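
A hypothetical use, assuming a (va, pa, flags)-style signature and
16MB alignment of both addresses:

    /* Sketch: cover a large, physically contiguous window with one
     * supersection entry per 16MB instead of thousands of 4KB PTEs. */
    for (off = 0; off < size; off += 0x1000000)
            pmap_kenter_supersection(va + off, pa + off, 0);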
2007-06-11 21:29:26 +00:00
Marcel Moolenaar
01bd17cc99 Add kdb_cpu_sync_icache(), intended to synchronize instruction
caches with data caches after writing to memory.  This is typically
required to make breakpoints work on ia64 and powerpc, so for those
architectures the function is implemented.
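
Sketch of the intended use when planting a breakpoint (BKPT_INSTR is a
hypothetical opcode macro; the synchronization call is the one added
here):

    /* Patch in the breakpoint instruction, then make sure the I-cache
     * observes the modified memory before execution resumes. */
    uint32_t instr = BKPT_INSTR;

    memcpy(addr, &instr, sizeof(instr));
    kdb_cpu_sync_icache(addr, sizeof(instr));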
2007-06-09 21:55:17 +00:00
Jeff Roberson
4736604759 - PCPU_ADD is no longer spelled with LAZY_ in the middle.
Submitted by:	attilio
2007-06-06 23:23:47 +00:00
Attilio Rao
6759608248 Rework the PCPU_* (MD) interface:
- Rename PCPU_LAZY_INC to PCPU_INC.
- Add the PCPU_ADD interface, which adds a given value to the
  specified pcpu member.

Note that for most architectures PCPU_INC and PCPU_ADD are not safe.
This is a point that needs discussion and work over the next days.
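
Hypothetical usage under the reworked names:

    /* Bump a per-CPU statistic by one, or by an arbitrary delta. */
    PCPU_INC(cnt.v_syscall);
    PCPU_ADD(cnt.v_intr, n);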

Reviewed by: alc, bde
Approved by: jeff (mentor)
2007-06-04 21:38:48 +00:00
Alan Cox
9211deca08 Add the machine-specific definitions for configuring the new physical
memory allocator.

Approved by:	re
2007-06-04 08:02:22 +00:00
Alan Cox
66ab556097 Eliminate some unused definitions that came from NetBSD. 2007-05-28 21:04:22 +00:00
Olivier Houchard
705fda849d Use __mcount() instead of _mcount() to reduce diffs with NetBSD. 2007-05-19 16:20:37 +00:00
Olivier Houchard
fe85f6cee8 Switch the kernel's pmap domain from 15 to 0.
This should be a no-op, and it is needed for XScale core 3
supersection support, as supersections are always part of domain 0.
2007-05-19 12:47:34 +00:00
Alan Cox
04a18977c8 Define every architecture as either VM_PHYSSEG_DENSE or
VM_PHYSSEG_SPARSE depending on whether the physical address space is
densely or sparsely populated with memory.  The effect of this
definition is to determine which of two implementations of
vm_page_array and PHYS_TO_VM_PAGE() is used.  The legacy
implementation is obtained by defining VM_PHYSSEG_DENSE, and a new
implementation that trades off time for space is obtained by defining
VM_PHYSSEG_SPARSE.  For now, all architectures except for ia64 and
sparc64 define VM_PHYSSEG_DENSE.  Defining VM_PHYSSEG_SPARSE on ia64
allows the entirety of my Itanium 2's memory to be used.  Previously,
only the first 1 GB could be used.  Defining VM_PHYSSEG_SPARSE on
sparc64 allows USIIIi-based systems to boot without crashing.

This change is a combination of Nathan Whitehorn's patch and my own
work in perforce.
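
Each architecture selects a model in its <machine/vmparam.h>; for
example (illustrative):

    /* Densely populated physical address space: use the legacy,
     * single-array vm_page_array and constant-time PHYS_TO_VM_PAGE(). */
    #define VM_PHYSSEG_DENSE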

Discussed with: kmacy, marius, Nathan Whitehorn
PR:		112194
2007-05-05 19:50:28 +00:00
Kevin Lo
4eaa43e6f4 Remove __P 2007-03-21 03:28:16 +00:00
Alan Cox
c640357f04 Push down the implementation of PCPU_LAZY_INC() into the machine-dependent
header file.  Reimplement PCPU_LAZY_INC() on amd64 and i386 making it
atomic with respect to interrupts.

Reviewed by: bde, jhb
2007-03-11 05:54:29 +00:00
Paolo Pisati
ef544f6312 o break newbus api: add a new argument of type driver_filter_t to
bus_setup_intr()

o add an int return code to all fast handlers

o retire INTR_FAST/IH_FAST

For more info: http://docs.freebsd.org/cgi/getmsg.cgi?fetch=465712+0+current/freebsd-current
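
A sketch of a driver adapting to the new signature (the device names
and register check are hypothetical; the filter/ithread split is the
point of the change):

    /* The filter runs in primary interrupt context and must not sleep. */
    static int
    mydev_filter(void *arg)
    {
            struct mydev_softc *sc = arg;

            if (!MYDEV_INTR_PENDING(sc))
                    return (FILTER_STRAY);
            return (FILTER_HANDLED);
    }

    /* In attach: pass the filter in the new argument slot; a NULL
     * ithread handler means all work happens in the filter. */
    error = bus_setup_intr(dev, sc->irq_res, INTR_TYPE_NET | INTR_MPSAFE,
        mydev_filter, NULL, sc, &sc->intrhand);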

Reviewed by: many
Approved by: re@
2007-02-23 12:19:07 +00:00
Olivier Houchard
47010239a8 - Add bounce pages for arm, largely based on the i386 implementation.
- Add a default parent DMA tag, similar to what has been done for sparc64.
- Before invalidating the dcache in POSTREAD, save the bytes that are in
the same cachelines as our buffers but not part of them, and restore
them after the invalidation.
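
The POSTREAD idea, sketched with hypothetical variables (buf/len is
the DMA buffer; line_start/line_end are the enclosing cacheline
boundaries; head/tail are small scratch buffers):

    /* Save the partial cachelines bordering the buffer, invalidate,
     * then restore the saved bytes. */
    memcpy(head, line_start, buf - line_start);
    memcpy(tail, buf + len, line_end - (buf + len));
    cpu_dcache_inv_range((vm_offset_t)line_start, line_end - line_start);
    memcpy(line_start, head, buf - line_start);
    memcpy(buf + len, tail, line_end - (buf + len));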
2007-01-17 00:53:05 +00:00
Bernd Walter
69b40f4db3 MFp4: Add missing atomic functions
Based on a patch by: des
2007-01-05 02:50:27 +00:00
Olivier Houchard
2feb83cec2 Introduce CPU_XSCALE_CORE3, as XScale core 3 is significantly different
from regular XScale (it has no mini data cache, has armv6-style 16MB
supersections, and can address 36 bits of physical memory).
Define it for i81342.
2006-11-30 23:30:40 +00:00
Sam Leffler
588a2322a9 correct bus space unmap prototype
Reviewed by:	cognet, imp
MFC after:	1 month
2006-11-19 23:46:50 +00:00
Ruslan Ermilov
26af9ac7d0 Fix a comment. 2006-11-13 06:26:57 +00:00
Alan Cox
cc0d48ffb6 Eliminate unused global variables. 2006-11-11 20:57:52 +00:00
Olivier Houchard
676b1fbdbf Identify the xscale 81342. 2006-11-07 22:36:57 +00:00
Olivier Houchard
2c7b82c9dd Add atomic_cmpset_acq_32. 2006-11-07 11:53:44 +00:00
John Birrell
6825d60738 Move the relocation definitions to the common elf header so that DTrace
can use them on one architecture targeted to a different one.

Add the additional ELF type definitions from Sun's "Linker and
Libraries" manual.
2006-10-04 21:37:10 +00:00
Poul-Henning Kamp
f645b0b51c First part of a little cleanup in the calendar/timezone/RTC handling.
Move relevant variables to <sys/clock.h> and fix #includes as necessary.

Use libkern's much more time- and space-efficient BCD routines.
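
For instance, an RTC driver can convert a BCD register value with the
libkern helper (the register read is hypothetical):

    /* bcd2bin() is the libkern routine; rtc_read() stands in for a
     * driver-specific register access. */
    sec = bcd2bin(rtc_read(RTC_SEC));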
2006-10-02 12:59:59 +00:00
Alexander Kabaev
d9cb97ff9d Use __builtin_va_start instead of __builtin_stdarg_start.  GCC4
obsoletes the latter, and __builtin_va_start has been present in all
GCC versions since 3.1.
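
After the change, the machine-dependent stdarg.h maps the standard
macro onto the builtin, e.g.:

    #define va_start(ap, last)  __builtin_va_start((ap), (last))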
2006-09-21 01:37:02 +00:00
Olivier Houchard
4731df1ee7 Remove dead code, already defined in sys/cdefs.h.
Spotted by:	bde
2006-08-30 11:45:07 +00:00
Alan Cox
b554f899bd Eliminate unused definitions. (They came from NetBSD.)
Discussed with: cognet, grehan, marcel
2006-08-25 23:51:11 +00:00
Olivier Houchard
11d1528ce0 Finally bring in support for the i80219 XScale processor.
Submitted by:	Max M. Boyarov <m.boyarov bsd by>
2006-08-24 23:51:28 +00:00
Olivier Houchard
ba282be9f3 Use ELFDATA2MSB if we're building big endian.
Noticed by:	Oleksandr Tymoshenko <gonzo freebsd org>
2006-08-24 23:00:03 +00:00
Olivier Houchard
49953e11d7 Rewrite ARM_USE_SMALL_ALLOC so that, instead of the current behavior,
it maps the whole of physical memory, cached, using 1MB section
mappings.  This reduces the address space available for user processes
a bit, but given the amount of memory a typical arm machine has, it is
not (yet) a big issue.
It then provides a uma_small_alloc() that works as it does on
architectures which have a direct mapping.
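
A simplified sketch of a direct-map uma_small_alloc(), assuming a
phys-to-virt helper for the section-mapped region (names and error
handling are illustrative):

    /* Allocate a page and return its address inside the 1MB
     * section-mapped window; no pmap_enter() is needed. */
    void *
    uma_small_alloc(uma_zone_t zone, int bytes, u_int8_t *flags, int wait)
    {
            vm_page_t m;

            *flags = UMA_SLAB_PRIV;
            m = vm_page_alloc(NULL, 0, VM_ALLOC_NOOBJ | VM_ALLOC_WIRED);
            if (m == NULL)
                    return (NULL);
            return ((void *)arm_ptovirt(VM_PAGE_TO_PHYS(m)));
    }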
2006-08-08 20:59:38 +00:00
Olivier Houchard
9c8cab3814 Define BYTE_MSF if we're compiling a big-endian kernel, so that DDB
can correctly disassemble instructions on big-endian systems.
2006-07-27 11:41:37 +00:00
Olivier Houchard
be050429a3 Add remote GDB bits for arm. 2006-07-14 00:50:51 +00:00
Alan Cox
ed48a217f6 Add partial pmap locking.
Eliminate the unused allpmaps list.

Tested by: cognet@
2006-06-06 04:32:20 +00:00
Olivier Houchard
b2adc703fd Don't #error if no CPU is defined but we're not compiling the kernel. 2006-06-02 09:39:06 +00:00
Olivier Houchard
27b45ae819 Don't enable the FIQ in enable_interrupts() if F32_bit is not specified.
This had been committed by mistake.

Reported by:	ssouhlal
2006-06-01 16:17:44 +00:00