freebsd-dev

Author	SHA1	Message	Date
Nathan Whitehorn	653a5825b8	Also align the 32-bit PowerPC stacks.	2013-09-05 23:28:50 +00:00
Nathan Whitehorn	a5715964b1	Align stacks of kernel threads correctly at 16-byte boundaries rather than making sure they are all misaligned at +8 bytes. This fixes clang builds of powerpc64 kernels (aside from a required increase in KSTACK_PAGES which will come later). This commit from FreeBSD/powerpc64 with a clang-built kernel. MFC after: 2 weeks	2013-09-05 23:00:24 +00:00
Warner Losh	ce7c952a8e	Newer versions of gcc define __INT64_C and __UINT64_C, so avoid redefining them if gcc provides them.	2013-09-03 22:04:55 +00:00
Justin Hibbits	b2fb58a11d	Refactor PowerPC hwpmc(4) driver into generic and specific. More refactoring will likely be done as more drivers are added, since AIM-compatible processors have similar PMC configuration logic.	2013-09-03 00:34:18 +00:00
Justin Hibbits	450f197050	Remove duplicate definition of SPR MMCR0. MFC after: 3 days	2013-08-03 18:05:12 +00:00
Andriy Gapon	a29cc9a34b	Revert r253748,253749 This WIP should not have been committed yet. Pointyhat to: avg	2013-07-28 18:44:17 +00:00
Andriy Gapon	366d8bfb7b	put contents of cpu.h under _KERNEL no userland-serviceable parts inside MFC after: 20 days	2013-07-28 18:32:27 +00:00
Konstantin Belousov	70a7dd5d5b	Fix issues with zeroing and fetching the counters, on x86 and ppc64. Issues were noted by Bruce Evans and are present on all architectures. On i386, a counter fetch should use atomic read of 64bit value, otherwise carry from the increment on other CPU could be lost for the given fetch, making error of 2^32. If 64bit read (cmpxchg8b) is not available on the machine, it cannot be SMP and it is enough to disable preemption around read to avoid the split read. On x86 the counter increment is not atomic on purpose, which makes it possible for the store of the incremented result to override just zeroed per-cpu slot. The effect would be a counter going off by arbitrary value after zeroing. Perform the counter zeroing on the same processor which does the increments, making the operations mutually exclusive. On i386, same as for the fetching, if the cmpxchg8b is not available, machine is not SMP and we disable preemption for zeroing. PowerPC64 is treated the same as amd64. For other architectures, the changes made to allow the compilation to succeed, without fixing the issues with zeroing or fetching. It should be possible to handle them by using the 64bit loads and stores atomic WRT preemption (assuming the architectures also converted from using critical sections to proper asm). If architecture does not provide the facility, using global (spin) mutex would be non-optimal but working solution. Noted by: bde Sponsored by: The FreeBSD Foundation	2013-07-01 02:48:27 +00:00
Justin Hibbits	05bd8e50ac	Pad the PCPU MD struct, to satisfy an assert added with the projects/counters branch import. PR: ports/179173,ports/179164	2013-06-04 00:40:26 +00:00
Marcel Moolenaar	972aa49664	Fix the PowerPC Book-E register definitions used by the remote GDB protocol. Obtained from: Juniper Networks, Inc.	2013-05-21 18:00:47 +00:00
Attilio Rao	941646f5ec	Rename VM_NDOMAIN into MAXMEMDOM and move it into machine/param.h in order to match the MAXCPU concept. The change should also be useful for consolidation and consistency. Sponsored by: EMC / Isilon storage division Obtained from: jeff Reviewed by: alc	2013-05-07 22:46:24 +00:00
Gleb Smirnoff	85d592adf9	Since UMA_ZONE_PCPU zones put a constraint on sizeof(struct pcpu), declared as CTASSERT in MI pcpu.h, stop including all possible mutually exclusive PCPU_MD_FIELDS fields into LINT kernels, due to brekaing aforementioned CTASSERT.	2013-04-10 16:09:45 +00:00
Konstantin Belousov	4f9d403c2e	Fix build for AIM 64bit.	2013-04-09 12:01:54 +00:00
Gleb Smirnoff	4e76af6a41	Merge from projects/counters: counter(9). Introduce counter(9) API, that implements fast and raceless counters, provided (but not limited to) for gathering of statistical data. See http://lists.freebsd.org/pipermail/freebsd-arch/2013-April/014204.html for more details. In collaboration with: kib Reviewed by: luigi Tested by: ae, ray Sponsored by: Nginx, Inc.	2013-04-08 19:40:53 +00:00
Gleb Smirnoff	17dece86fe	Merge from projects/counters: Pad struct pcpu so that its size is denominator of PAGE_SIZE. This is done to reduce memory waste in UMA_PCPU_ZONE zones. Sponsored by: Nginx, Inc.	2013-04-08 19:19:10 +00:00
Alan Cox	07ad25d488	Eliminate an unused #define.	2013-02-22 16:59:52 +00:00
Rui Paulo	eaba9848dd	Introduce PLATFORMMETHOD_END and use it.	2013-02-13 02:21:45 +00:00
Rui Paulo	c19603623d	Allow this file to be used in LOCORE sections of the kernel.	2012-11-12 06:15:54 +00:00
Nathan Whitehorn	2383d92ae8	Move the prototype for savectx from cpu.h to pcb.h, as it is on other platforms, as well as putting it in an #ifdef KERNEL block. MFC after: 2 weeks	2012-09-23 17:33:16 +00:00
Adrian Chadd	2467c62fc6	On Nintendo Wii CPUs, the mdp value will be garbage. Set it to NULL so as to not confuse things. Submitted by: Margarida Gouveia	2012-08-21 06:34:21 +00:00
Andrew Turner	74dc547e24	Make the wchar_t type machine dependent. This is required for ARM EABI. Section 7.1.1 of the Procedure Call for the ARM Architecture (AAPCS) defines wchar_t as either an unsigned int or an unsigned short with the former preferred. Because of this requirement we need to move the definition of __wchar_t to a machine dependent header. It also cleans up the macros defining the limits of wchar_t by defining __WCHAR_MIN and __WCHAR_MAX in the same machine dependent header then using them to define WCHAR_MIN and WCHAR_MAX respectively. Discussed with: bde	2012-06-24 04:15:58 +00:00
Konstantin Belousov	aea810386d	Implement mechanism to export some kernel timekeeping data to usermode, using shared page. The structures and functions have vdso prefix, to indicate the intended location of the code in some future. The versioned per-algorithm data is exported in the format of struct vdso_timehands, which mostly repeats the content of in-kernel struct timehands. Usermode reading of the structure can be lockless. Compatibility export for 32bit processes on 64bit host is also provided. Kernel also provides usermode with indication about currently used timecounter, so that libc can fall back to syscall if configured timecounter is unknown to usermode code. The shared data updates are initiated both from the tc_windup(), where a fast task is queued to do the update, and from sysctl handlers which change timecounter. A manual override switch kern.timecounter.fast_gettime allows to turn off the mechanism. Only x86 architectures export the real algorithm data, and there, only for tsc timecounter. HPET counters page could be exported as well, but I prefer to not further glue the kernel and libc ABI there until proper vdso-based solution is developed. Minimal stubs neccessary for non-x86 architectures to still compile are provided. Discussed with: bde Reviewed by: jhb Tested by: flo MFC after: 1 month	2012-06-22 07:06:40 +00:00
Konstantin Belousov	232aa31fb9	Reserve AT_TIMEKEEP auxv entry for providing usermode the pointer to timekeeping information. MFC after: 1 week	2012-06-22 06:38:31 +00:00
Alan Cox	6031c68de4	The page flag PGA_WRITEABLE is set and cleared exclusively by the pmap layer, but it is read directly by the MI VM layer. This change introduces pmap_page_is_write_mapped() in order to completely encapsulate all direct access to PGA_WRITEABLE in the pmap layer. Aesthetics aside, I am making this change because amd64 will likely begin using an alternative method to track write mappings, and having pmap_page_is_write_mapped() in place allows me to make such a change without further modification to the MI VM layer. As an added bonus, tidy up some nearby comments concerning page flags. Reviewed by: kib MFC after: 6 weeks	2012-06-16 18:56:19 +00:00
Rafal Jaworowski	aa6bc7dc29	Extract vendor specific Book-E pieces into separate files and have a common skeleton (maybe we should kobj-tize this one day). Note the PPC4xx bit is not connected to the build yet. Obtained from: AppliedMicro, Semihalf.	2012-05-30 17:34:40 +00:00
Rafal Jaworowski	17f4cae4a5	Let us manage differences of Book-E PowerPC variations i.e. vendor / implementation specific vs. the common architecture definition. Bring PPC4XX defines (PSL, SPR, TLB). Note the new definitions under BOOKE_PPC4XX are not used in the code yet. This change set is not supposed to affect existing E500 support, it's just another reorg step before bringing support for E500mc, E5500 and PPC465. Obtained from: AppliedMicro, Freescale, Semihalf	2012-05-27 10:25:20 +00:00
Rafal Jaworowski	4f0962fc83	Provide SPR definitions for newer Book-E (E500mc, E5500, PPC465). Obtained from: Freescale, Semihalf.	2012-05-26 12:39:23 +00:00
Rafal Jaworowski	2ae7b3e42d	Unify SPR defines formatting, no funtional changes.	2012-05-26 12:15:13 +00:00
Rafal Jaworowski	ec0453765b	Update HID defines for E500mc and E5500 CPU cores. Obtained from: Freescale, Semihalf	2012-05-25 21:12:24 +00:00
Bjoern A. Zeeb	08c5f3303d	Add a missing " to get closer to compiling.	2012-05-24 23:46:17 +00:00
Nathan Whitehorn	270dc329b7	Atomic operation acquire barriers also need to be isync on 64-bit systems.	2012-05-24 22:14:39 +00:00
Marcel Moolenaar	7097794901	Revert isync for ILP32 to sync as per my original change that I discussed with Nathan. Leave __ATOMIC_ACQ as an isync as per Nathan.	2012-05-24 22:06:00 +00:00
Bjoern A. Zeeb	920b965865	MFp4 bz_ipv6_fast: in_cksum.h required ip.h to be included for struct ip. To be able to use some general checksum functions like in_addword() in a non-IPv4 context, limit the (also exported to user space) IPv4 specific functions to the times, when the ip.h header is present and IPVERSION is defined (to 4). We should consider more general checksum (updating) functions to also allow easier incremental checksum updates in the L3/4 stack and firewalls, as well as ponder further requirements by certain NIC drivers needing slightly different pseudo values in offloading cases. Thinking in terms of a better "library". Sponsored by: The FreeBSD Foundation Sponsored by: iXsystems Reviewed by: gnn (as part of the whole) MFC After: 3 days	2012-05-24 22:00:48 +00:00
Rafal Jaworowski	20b7961267	Fix physical address type to vm_paddr_t.	2012-05-24 21:13:24 +00:00
Marcel Moolenaar	df0bef25eb	Fix the memory barriers for CPUs that do not like lwsync and wedge or cause exceptions early enough during boot that the kernel will do ithe same. Use lwsync only when compiling for LP64 and revert to the more proven isync when compiling for ILP32. Note that in the end (i.e. between revision 222198 and this change) ILP32 changed from using sync to using isync. As per Nathan the isync is needed to make sure I/O accesses are properly serialized with locks and isync tends to be more effecient than sync. While here, undefine __ATOMIC_ACQ and __ATOMIC_REL at the end of the file so as not to leak their definitions. Discussed with: nwhitehorn	2012-05-24 20:45:44 +00:00
Nathan Whitehorn	ccc4a5c761	Replace the list of PVOs owned by each PMAP with an RB tree. This simplifies range operations like pmap_remove() and pmap_protect() as well as allowing simple operations like pmap_extract() not to involve any global state. This substantially reduces lock coverages for the global table lock and improves concurrency.	2012-05-20 14:33:28 +00:00
Nathan Whitehorn	bc96dccc69	Fix final bugs in memory barriers on PowerPC: - Use isync/lwsync unconditionally for acquire/release. Use of isync guarantees a complete memory barrier, which is important for serialization of bus space accesses with mutexes on multi-processor systems. - Go back to using sync as the I/O memory barrier, which solves the same problem as above with respect to mutex release using lwsync, while not penalizing non-I/O operations like a return to sync on the atomic release operations would. - Place an acquisition barrier around thread lock acquisition in cpu_switchin().	2012-05-04 16:00:22 +00:00
Dimitry Andric	460378bf13	Add a convenience macro for the returns_twice attribute, and apply it to the prototypes of the appropriate functions (getcontext, savectx, setjmp, sigsetjmp and vfork). MFC after: 2 weeks	2012-04-29 11:04:31 +00:00
Nathan Whitehorn	de63b4d2d5	Switch the default I/O memory barrier to eieio, as it should be. This does not appear to cause any problems due to fixes elsewhere. MFC after: 2 months	2012-04-24 13:37:43 +00:00
Nathan Whitehorn	51a6f57e4a	Fix copy-and-paste error in r230400. MFC after: 3 days	2012-04-23 20:53:50 +00:00
Nathan Whitehorn	a4cbf436e7	Provide a clearer split between read/write and acquire/release barriers. This should really, actually be correct now.	2012-04-22 22:27:35 +00:00
Nathan Whitehorn	14758466eb	Correctly specify assembler constrains for synchronization instructions. MFC after: 3 days	2012-04-22 21:55:19 +00:00
Nathan Whitehorn	a6349a998d	Clarify what we are doing in r234583 a little better: eieio and isync do not provide general barriers, but only barriers in the context of the atomic sequences here. As such, make them private and keep the global *mb() routines using a variant of sync.	2012-04-22 21:11:01 +00:00
Nathan Whitehorn	83ae3d5531	On non-64-bit systems (which generally don't have lwsync), use eieio and isync to implement read and write barriers, following Appendix B.2 of Book II of the architecture manual. This provides a 25% speed increase to fork() on the PowerPC G4.	2012-04-22 20:23:34 +00:00
Nathan Whitehorn	6f26a88999	Use lwsync to provide memory barriers on systems that support it instead of sync (lwsync is an alternate encoding of sync on systems that do not support it, providing graceful fallback). This provides more than an order of magnitude reduction in the time required to acquire or release a mutex. MFC after: 2 months	2012-04-22 19:00:51 +00:00
Nathan Whitehorn	a1f8f44820	Remove dead code. The routines in atomic.S did not work properly anyway, and were everywhere unused. If we turn out to need them, they should be reimplemented. MFC after: 2 weeks	2012-04-22 18:56:56 +00:00
Nathan Whitehorn	13d47f302f	Replace eieio; sync for creating bus-space memory barriers with sync. sync performs a strict superset of the functions of eieio, so using both is redundant. While here, expand bus barriers to all bus_space operations, since many drivers do not correctly use bus_space_barrier(). In principle, we can also replace sync just with eieio, for a significant performance increase, but it remains to be seen whether any poorly-written drivers currently depend on the side effects of sync to properly function. MFC after: 1 week	2012-04-22 18:54:51 +00:00
Nathan Whitehorn	fbd21ea620	Organize some members of ucontext_t in the same order they are in the trap frame. These are usually not used, and so this changes very little. MFC after: 5 days	2012-04-21 14:39:47 +00:00
Nathan Whitehorn	e3c2930d36	We don't need kcopy() in any of the remaining places it is used, so remove it. MFC after: 2 weeks	2012-04-11 22:23:50 +00:00
Nathan Whitehorn	629e40e45e	Give the kernel pmap lock a different name than user pmap locks. It has (slightly) different semantics and renaming it prevents a (harmless) WITNESS warning during bootup for 32-bit kernels on 64-bit CPUs. MFC after: 5 days	2012-04-06 16:00:37 +00:00

1 2 3 4 5 ...

662 Commits