freebsd-skq

Author	SHA1	Message	Date
John Baldwin	2b8a339c7e	Add various constants for the PAT MSR and the PAT PTE and PDE flags. Initialize the PAT MSR during boot to map PAT type 2 to Write-Combining (WC) instead of Uncached (UC-). MFC after: 1 month	2006-05-01 22:07:00 +00:00
John Baldwin	4ac60df584	Add a new 'pmap_invalidate_cache()' to flush the CPU caches via the wbinvd() instruction. This includes a new IPI so that all CPU caches on all CPUs are flushed for the SMP case. MFC after: 1 month	2006-05-01 21:36:47 +00:00
Peter Wemm	c0345a84aa	Introduce minidumps. Full physical memory crash dumps are still available via the debug.minidump sysctl and tunable. Traditional dumps store all physical memory. This was once a good thing when machines had a maximum of 64M of ram and 1GB of kvm. These days, machines often have many gigabytes of ram and a smaller amount of kvm. libkvm+kgdb don't have a way to access physical ram that is not mapped into kvm at the time of the crash dump, so the extra ram being dumped is mostly wasted. Minidumps invert the process. Instead of dumping physical memory in in order to guarantee that all of kvm's backing is dumped, minidumps instead dump only memory that is actively mapped into kvm. amd64 has a direct map region that things like UMA use. Obviously we cannot dump all of the direct map region because that is effectively an old style all-physical-memory dump. Instead, introduce a bitmap and two helper routines (dump_add_page(pa) and dump_drop_page(pa)) that allow certain critical direct map pages to be included in the dump. uma_machdep.c's allocator is the intended consumer. Dumps are a custom format. At the very beginning of the file is a header, then a copy of the message buffer, then the bitmap of pages present in the dump, then the final level of the kvm page table trees (2MB mappings are expanded into a 4K page mappings), then the sparse physical pages according to the bitmap. libkvm can now conveniently access the kvm page table entries. Booting my test 8GB machine, forcing it into ddb and forcing a dump leads to a 48MB minidump. While this is a best case, I expect minidumps to be in the 100MB-500MB range. Obviously, never larger than physical memory of course. minidumps are on by default. It would want be necessary to turn them off if it was necessary to debug corrupt kernel page table management as that would mess up minidumps as well. Both minidumps and regular dumps are supported on the same machine.	2006-04-21 04:24:50 +00:00
Marcel Moolenaar	b1fb1bb19a	Sync with i386: Map exceptions to signals in gdb_cpu_signal() so that kgdb(1) gets a SIGTRAP when it needs to. Pointed out by: grehan@	2006-04-04 03:00:20 +00:00
Marcel Moolenaar	470d831703	The PC is register 16, not 18. Pointed out by: grehan@	2006-04-04 02:44:51 +00:00
Marcel Moolenaar	bfcdefd8aa	Eliminate HAVE_STOPPEDPCBS. On ia64 the PCPU holds a pointer to the PCB in which the context of stopped CPUs is stored. To access this PCB from KDB, we introduce a new define, called KDB_STOPPEDPCB. The definition, when present, lives in <machine/kdb.h> and abstracts where MD code saves the context. Define KDB_STOPPEDPCB on i386, amd64, alpha and sparc64 in accordance to previous code.	2006-04-03 22:51:47 +00:00
Peter Wemm	68ac481184	Shrink the amd64 pv entry from 48 bytes to about 24 bytes. On a machine with large mmap files mapped into many processes, this saves hundreds of megabytes of ram. pv entries were individually allocated and had two tailq entries and two pointers (or addresses). Each pv entry was linked to a vm_page_t and a process's address space (pmap). It had the virtual address and a pointer to the pmap. This change replaces the individual allocation with a per-process allocation system. A page ("pv chunk") is allocated and this provides 168 pv entries for that process. We can now eliminate one of the 16 byte tailq entries because we can simply iterate through the pv chunks to find all the pv entries for a process. We can eliminate one of the 8 byte pointers because the location of the pv entry implies the containing pv chunk, which has the pointer. After overheads from the pv chunk bitmap and tailq linkage, this works out that each pv entry has an effective size of 24.38 bytes. Future work still required, and other problems: * when running low on pv entries or system ram, we may need to defrag the chunk pages and free any spares. The stats (vm.pmap.) show that this doesn't seem to be that much of a problem, but it can be done if needed. running low on pv entries is now a much bigger problem. The old get_pv_entry() routine just needed to reclaim one other pv entry. Now, since they are per-process, we can only use pv entries that are assigned to our current process, or by stealing an entire page worth from another process. Under normal circumstances, the pmap_collect() code should be able to dislodge some pv entries from the current process. But if needed, it can still reclaim entire pv chunk pages from other processes. * This should port to i386 really easily, except there it would reduce pv entries from 24 bytes to about 12 bytes. (I have integrated Alan's recent changes.)	2006-04-03 21:36:01 +00:00
Peter Wemm	8d0593f54e	Merge/sync with i386: various cosmetic tweaks	2006-03-14 00:01:56 +00:00
Peter Wemm	cfa7ffb1d7	MFi386: The SIGFPE macros were moved to signal.h (FPE_INTOVF etc)	2006-03-14 00:01:22 +00:00
Sam Leffler	5225f08dc9	guard function decls with _KERNEL so user code can include this file	2006-03-01 05:59:56 +00:00
John Baldwin	215e7c161a	Rework how we wire up interrupt sources to CPUs: - Throw out all of the logical APIC ID stuff. The Intel docs are somewhat ambiguous, but it seems that the "flat" cluster model we are currently using is only supported on Pentium and P6 family CPUs. The other "hierarchy" cluster model that is supported on all Intel CPUs with local APICs is severely underdocumented. For example, it's not clear if the OS needs to glean the topology of the APIC hierarchy from somewhere (neither ACPI nor MP Table include it) and setup the logical clusters based on the physical hierarchy or not. Not only that, but on certain Intel chipsets, even though there were 4 CPUs in a logical cluster, all the interrupts were only sent to one CPU anyway. - We now bind interrupts to individual CPUs using physical addressing via the local APIC IDs. This code has also moved out of the ioapic PIC driver and into the common interrupt source code so that it can be shared with MSI interrupt sources since MSI is addressed to APICs the same way that I/O APIC pins are. - Interrupt source classes grow a new method pic_assign_cpu() to bind an interrupt source to a specific local APIC ID. - The SMP code now tells the interrupt code which CPUs are avaiable to handle interrupts in a simpler and more intuitive manner. For one thing, it means we could now choose to not route interrupts to HT cores if we wanted to (this code is currently in place in fact, but under an #if 0 for now). - For now we simply do static round-robin of IRQs to CPUs when the first interrupt handler just as before, with the change that IRQs are now bound to individual CPUs rather than groups of up to 4 CPUs. - Because the IRQ to CPU mapping has now been moved up a layer, it would be easier to manage this mapping from higher levels. For example, we could allow drivers to specify a CPU affinity map for their interrupts, or we could allow a userland tool to bind IRQs to specific CPUs. The MFC is tentative, but I want to see if this fixes problems some folks had with UP APIC kernels on 6.0 on SMP machines (an SMP kernel would work fine, but a UP APIC kernel (such as GENERIC in RELENG_6) would lose interrupts). MFC after: 1 week	2006-02-28 22:24:55 +00:00
Warner Losh	d5e61c97a6	By popular demand, move __HAVE_ACPI and __PCI_REROUTE_INTERRUPT into param.h. Per request, I've placed these just after the _NO_NAMESPACE_POLLUTION ifndef. I've not renamed anything yet, but may since we don't need the __. Submitted by: bde, jhb, scottl, many others.	2006-01-09 06:05:57 +00:00
Warner Losh	501755f4f6	Define __HAVE_ACPI and/or __PCI_REROUTE_INTERRUPT, as appropriate for each platform. These will be used in the pci code in preference to the complicated #ifdefs we have there now.	2006-01-01 20:59:28 +00:00
John Baldwin	b439e431bf	Tweak how the MD code calls the fooclock() methods some. Instead of passing a pointer to an opaque clockframe structure and requiring the MD code to supply CLKF_FOO() macros to extract needed values out of the opaque structure, just pass the needed values directly. In practice this means passing the pair (usermode, pc) to hardclock() and profclock() and passing the boolean (usermode) to hardclock_cpu() and hardclock_process(). Other details: - Axe clockframe and CLKF_FOO() macros on all architectures. Basically, all the archs were taking a trapframe and converting it into a clockframe one way or another. Now they can just extract the PC and usermode values directly out of the trapframe and pass it to fooclock(). - Renamed hardclock_process() to hardclock_cpu() as the latter is more accurate. - On Alpha, we now run profclock() at hz (profhz == hz) rather than at the slower stathz. - On Alpha, for the TurboLaser machines that don't have an 8254 timecounter, call hardclock() directly. This removes an extra conditional check from every clock interrupt on Alpha on the BSP. There is probably room for even further pruning here by changing Alpha to use the simplified timecounter we use on x86 with the lapic timer since we don't get interrupts from the 8254 on Alpha anyway. - On x86, clkintr() shouldn't ever be called now unless using_lapic_timer is false, so add a KASSERT() to that affect and remove a condition to slightly optimize the non-lapic case. - Change prototypeof arm_handler_execute() so that it's first arg is a trapframe pointer rather than a void pointer for clarity. - Use KCOUNT macro in profclock() to lookup the kernel profiling bucket. Tested on: alpha, amd64, arm, i386, ia64, sparc64 Reviewed by: bde (mostly)	2005-12-22 22:16:09 +00:00
John Baldwin	333b8de537	MFi386: - Move PUSH_FRAME and POP_FRAME to asmacros.h and use PUSH_FRAME in atpic entry points. - Move PCPU_* asm macros out of the middle of the asm profiling macros. - Pass IRQ vector argument as an int rather than void * to reduce diffs with i386. - EOI the lapic in C for the lapic timer handler. - GC unused Xcpuast function. - Split IPI_STOP handling code of ipi_nmi_handler() out into a cpustop_handler() function and call it from Xcpustop rather than duplicating all the logic in assembly. - Fixup the list of symbols with interrupt frames in ddb traces. Xatpic_fastintr* have never existed on amd64, and the lapic timer handler and various IPI handlers were missing. - Use trapframe instead of intrframe for interrupt entry points (on amd64 the interrupt vector was already a separate argument, so the two frames were already identical) and GC intrframe. Submitted by: peter (3)	2005-12-08 18:33:30 +00:00
John Baldwin	696effb697	- Cleanup whitespace and extra ()s in vtophys() macros. - Move vtophys() macros next to vtopte() where vtopte() exists to match comments above vtopte(). - Remove references to the alternate address space in the comment above vtopte(). amd64 never had the alternate address space, and i386 lost it prior to PAE support being added. - s/entires/entries/ in comments. Reviewed by: alc	2005-12-06 21:09:01 +00:00
Ruslan Ermilov	224d140293	Drop _MACHINE_ARCH and _MACHINE defines (not to be confused with MACHINE_ARCH and MACHINE). Their purpose was to be able to test in cpp(1), but cpp(1) only understands integer type expressions. Using such unsupported expressions introduced a number of subtle bugs, which were discovered by compiling with -Wundef.	2005-12-06 13:27:21 +00:00
John Baldwin	c7362ff7fb	Change the x86 code to allocate IDT vectors on-demand when an interrupt source is first enabled similar to how intr_event's now allocate ithreads on-demand. Previously, we would map IDT vectors 1:1 to IRQs. Since we only have 191 available IDT vectors for I/O interrupts, this limited us to only supporting IRQs 0-190 corresponding to the first 190 I/O APIC intpins. On many machines, however, each PCI-X bus has its own APIC even though it only has 1 or 2 devices, thus, we were reserving between 24 and 32 IRQs just for 1 or 2 devices and thus 24 or 32 IDT vectors. With this change, a machine with 100 IRQs but only 5 in use will only use up 5 IDT vectors. Also, this change provides an API (apic_alloc_vector() and apic_free_vector()) that will allow a future MSI interrupt source driver to request IDT vectors for use by MSI interrupts on x86 machines. Tested on: amd64, i386	2005-11-02 20:11:47 +00:00
John Baldwin	e0f66ef861	Reorganize the interrupt handling code a bit to make a few things cleaner and increase flexibility to allow various different approaches to be tried in the future. - Split struct ithd up into two pieces. struct intr_event holds the list of interrupt handlers associated with interrupt sources. struct intr_thread contains the data relative to an interrupt thread. Currently we still provide a 1:1 relationship of events to threads with the exception that events only have an associated thread if there is at least one threaded interrupt handler attached to the event. This means that on x86 we no longer have 4 bazillion interrupt threads with no handlers. It also means that interrupt events with only INTR_FAST handlers no longer have an associated thread either. - Renamed struct intrhand to struct intr_handler to follow the struct intr_foo naming convention. This did require renaming the powerpc MD struct intr_handler to struct ppc_intr_handler. - INTR_FAST no longer implies INTR_EXCL on all architectures except for powerpc. This means that multiple INTR_FAST handlers can attach to the same interrupt and that INTR_FAST and non-INTR_FAST handlers can attach to the same interrupt. Sharing INTR_FAST handlers may not always be desirable, but having sio(4) and uhci(4) fight over an IRQ isn't fun either. Drivers can always still use INTR_EXCL to ask for an interrupt exclusively. The way this sharing works is that when an interrupt comes in, all the INTR_FAST handlers are executed first, and if any threaded handlers exist, the interrupt thread is scheduled afterwards. This type of layout also makes it possible to investigate using interrupt filters ala OS X where the filter determines whether or not its companion threaded handler should run. - Aside from the INTR_FAST changes above, the impact on MD interrupt code is mostly just 's/ithread/intr_event/'. - A new MI ddb command 'show intrs' walks the list of interrupt events dumping their state. It also has a '/v' verbose switch which dumps info about all of the handlers attached to each event. - We currently don't destroy an interrupt thread when the last threaded handler is removed because it would suck for things like ppbus(8)'s braindead behavior. The code is present, though, it is just under #if 0 for now. - Move the code to actually execute the threaded handlers for an interrrupt event into a separate function so that ithread_loop() becomes more readable. Previously this code was all in the middle of ithread_loop() and indented halfway across the screen. - Made struct intr_thread private to kern_intr.c and replaced td_ithd with a thread private flag TDP_ITHREAD. - In statclock, check curthread against idlethread directly rather than curthread's proc against idlethread's proc. (Not really related to intr changes) Tested on: alpha, amd64, i386, sparc64 Tested on: arm, ia64 (older version of patch by cognet and marcel)	2005-10-25 19:48:48 +00:00
John Baldwin	58553b9925	Rename the KDB_STOP_NMI kernel option to STOP_NMI and make it apply to all IPI_STOP IPIs. - Change the i386 and amd64 MD IPI code to send an NMI if STOP_NMI is enabled if an attempt is made to send an IPI_STOP IPI. If the kernel option is enabled, there is also a sysctl to change the behavior at runtime (debug.stop_cpus_with_nmi which defaults to enabled). This includes removing stop_cpus_nmi() and making ipi_nmi_selected() a private function for i386 and amd64. - Fix ipi_all(), ipi_all_but_self(), and ipi_self() on i386 and amd64 to properly handle bitmapped IPIs as well as IPI_STOP IPIs when STOP_NMI is enabled. - Fix ipi_nmi_handler() to execute the restart function on the first CPU that is restarted making use of atomic_readandclear() rather than assuming that the BSP is always included in the set of restarted CPUs. Also, the NMI handler didn't clear the function pointer meaning that subsequent stop and restarts could execute the function again. - Define a new macro HAVE_STOPPEDPCBS on i386 and amd64 to control the use of stoppedpcbs[] and always enable it for i386 and amd64 instead of being dependent on KDB_STOP_NMI. It works fine in both the NMI and non-NMI cases.	2005-10-24 21:04:19 +00:00
Jung-uk Kim	25736eb670	Correct few MSR addresses. PR: amd64/85852 Submitted by: Nate Eldredge <nge at cs dot hmc dot edu>	2005-10-15 00:44:56 +00:00
Jung-uk Kim	9c3acb0bc1	- Print number of physical/logical cores and more CPUID info. - Add newer CPUID definitions for future use. Many thanks to Mike Tancsa <mike at sentex dot net> for providing test cases for Intel Pentium D and AMD Athlon 64 X2. Approved by: anholt (mentor)	2005-10-14 22:52:01 +00:00
Peter Wemm	d176c062c9	I believe the stack underflows during early development that caused me to add spare padding at the beginning of the pcb are long gone. Remove the padding fields.	2005-09-27 21:11:35 +00:00
Peter Wemm	1acc225f91	Kill pcb_rflags. It served no purpose. Reported by: bde	2005-09-27 21:10:10 +00:00
John Baldwin	3c2bc2bf26	Add a new atomic_fetchadd() primitive that atomically adds a value to a variable and returns the previous value of the variable. Tested on: i386, alpha, sparc64, arm (cognet) Reviewed by: arch@ Submitted by: cognet (arm) MFC after: 1 week	2005-09-27 17:39:11 +00:00
Warner Losh	62061bf002	MFi386: pci attribute allocation fixes.	2005-09-18 01:42:43 +00:00
John Baldwin	80d52f16da	Stop using the '+' constraint modifier with inline assembly. The '+' constraint is actually only allowed for register operands. Instead, use separate input and output memory constraints. Education from: alc Reviewed by: alc Tested on: i386, alpha MFC after: 1 week	2005-09-15 19:31:22 +00:00
Stefan Farfeleder	a1f85d7f83	Move MINSIGSTKSZ from <machine/signal.h> to <machine/_limits.h> and rename it to __MINSIGSTKSZ. Define MINSIGSTKSZ in <sys/signal.h>. This is done in order to use MINSIGSTKSZ for the macro PTHREAD_STACK_MIN in <pthread.h> (soon <limits.h>) without having to include the whole <sys/signal.h> header. Discussed with: bde	2005-08-20 16:44:41 +00:00
John Baldwin	5d2f4de5da	Add aliases for atomic operations on 64-bit integers just like other 64-bit platforms. MFC after: 1 week	2005-08-18 14:36:47 +00:00
David E. O'Brien	b8b77732ff	Fix $FreeBSD$.	2005-07-22 04:03:25 +00:00
Peter Wemm	9e76f9ad3f	Like on i386, bypass lock prefix for atomic ops on !SMP kernels.	2005-07-21 22:35:02 +00:00
Poul-Henning Kamp	636d90fc5c	Make the facility for recognizing BIOS-signatures more general and return a printable representation. This fixes recognition of the PC Engines WRAP and improves the recognition of the Soekris boards (Bios version can now be seen in the dmesg output for instance). Also, add watchdog support for PCM-582x platforms. Submitted by: Adrian Steinmann <ast@marabu.ch> Slightly changed by: phk PR: 81360	2005-07-21 09:48:37 +00:00
John Baldwin	122eceef61	Convert the atomic_ptr() operations over to operating on uintptr_t variables rather than void * variables. This makes it easier and simpler to get asm constraints and volatile keywords correct. MFC after: 3 days Tested on: i386, alpha, sparc64 Compiled on: ia64, powerpc, amd64 Kernel toolchain busted on: arm	2005-07-15 18:17:59 +00:00
John Baldwin	48281036d7	Some cleanups and tweaks to some of the atomic.h files in preparation for further changes and fixes in the future: - Use aliases via macros rather than duplicated inlines wherever possible. - Move all the aliases to the bottom of these files and the inline functions to the top. - Add various comments. - On alpha, drop atomic_{load_acq,store_rel}_{8,char,16,short}(). - On i386 and amd64, don't duplicate the extern declarations for functions in the two non-inline cases (KLD_MODULE and compiler doesn't do inlines), instead, consolidate those two cases. - Some whitespace fixes. Approved by: re (scottl)	2005-07-09 12:38:53 +00:00
Andrew Thompson	2fcb030ad5	Check the alignment of the IP header before passing the packet up to the packet filter. This would cause a panic on architectures that require strict alignment such as sparc64 (tier1) and ia64/ppc (tier2). This adds two new macros that check the alignment, these are compile time dependent on __NO_STRICT_ALIGNMENT which is set for i386 and amd64 where alignment isn't need so the cost is avoided. IP_HDR_ALIGNED_P() IP6_HDR_ALIGNED_P() Move bridge_ip_checkbasic()/bridge_ip6_checkbasic() up so that the alignment is checked for ipfw and dummynet too. PR: ia64/81284 Obtained from: NetBSD Approved by: re (dwhite), mlaier (mentor)	2005-07-02 23:13:31 +00:00
Peter Wemm	235a54de9d	Switch AMD64 and i386 platforms to using ELF as their kernel crash dump format. The key reason to do this is so that we can dump sparse address space. For example, we need to be able to skip the PCI hole just below the 4GB boundary. Trying to destructively dump MMIO device registers is Really Bad(TM). The frequent result of trying to do a crash dump on a machine with 4GB or more ram was ugly (lockup or reboot). This code has been taken directly from the IA64 dump_machdep.c code, with just a few (mostly minor) mods. Introduce a dump_avail[] array in the machdep.c code so that we have a source of truth for what memory is present in a machine that needs to be dumped. We can't use phys_avail[] because all sorts of things slice memory out of it that we really need to dump. eg: the vm page array and the dmesg buffer. dump_avail[] is pretty much an unmolested version of phys_avail[]. It does have Maxmem correction. Bump the i386 and amd64 dump format to version 2, but nothing actually uses this. amd64 was actually using the i386 dump version number. libkvm support to follow. Approved by: re	2005-06-29 22:28:46 +00:00
John Baldwin	014693eb89	Increase MAXCPU to 16 in SMP kernels so that APIC IDs from 0 to 15 are allowed for CPUs. Tested by: amd64 at cybernetwork dot org Approved by: re (scottl) MFC after: 1 week	2005-06-29 15:13:25 +00:00
Joseph Koshy	f263522a45	MFP4: - Implement sampling modes and logging support in hwpmc(4). - Separate MI and MD parts of hwpmc(4) and allow sharing of PMC implementations across different architectures. Add support for P4 (EMT64) style PMCs to the amd64 code. - New pmcstat(8) options: -E (exit time counts) -W (counts every context switch), -R (print log file). - pmc(3) API changes, improve our ability to keep ABI compatibility in the future. Add more 'alias' names for commonly used events. - bug fixes & documentation.	2005-06-09 19:45:09 +00:00
Stephan Uphoff	6097174e4d	Add IPI support for preempting a thread on another CPU. MFC after: 3 weeks	2005-06-09 18:23:54 +00:00
Yoshihiro Takahashi	d4fcf3cba5	Remove bus_{mem,p}io.h and related code for a micro-optimization on i386 and amd64. The optimization is a trivial on recent machines. Reviewed by: -arch (imp, marcel, dfr)	2005-05-29 04:42:30 +00:00
Yoshihiro Takahashi	f7965374d4	Change the spkr_set_pitch() function to a macro to fix low level profiling.	2005-05-28 13:40:27 +00:00
Peter Wemm	1eb6f02e7a	MFi386: remove comment	2005-05-22 16:31:32 +00:00
Yoshihiro Takahashi	24072ca35b	- Move timerreg.h to <arch>/include and split i8253 specific defines into i8253reg.h, and add some defines to control a speaker. - Move PPI related defines from i386/isa/spkr.c into ppireg.h and use them. - Move IO_{PPI,TIMER} defines into ppireg.h and timerreg.h respectively. - Use isa/isareg.h rather than <arch>/isa/isa.h. Tested on: i386, pc98	2005-05-14 09:10:02 +00:00
Jacques Vidrine	f6108b6158	Add a knob for disabling/enabling HTT, "machdep.hyperthreading_allowed". Default off due to information disclosure on multi-user systems. Submitted by: cperciva Reviewed by: jhb	2005-05-13 00:10:56 +00:00
Doug White	fdc9713bf7	Implement an alternate method to stop CPUs when entering DDB. Normally we use a regular IPI vector, but this vector is blocked when interrupts are disabled. With "options KDB_STOP_NMI" and debug.kdb.stop_cpus_with_nmi set, KDB will send an NMI to each CPU instead. The code also has a context-stuffing feature which helps ddb extract the state of processes running on the stopped CPUs. KDB_STOP_NMI is only useful with SMP and complains if SMP is not defined. This feature only applies to i386 and amd64 at the moment, but could be used on other architectures with the appropriate MD bits. Submitted by: ups	2005-04-30 20:01:00 +00:00
Marcel Moolenaar	76b6d954f0	o Reverse the inclusion chain from MD->MI to MI->MD by removing the inclusion of <sys/pmc.h> and depending on being included from that header file. o Include any MD specific header files that otherwise need to be included from MI files. Ok'd: jkoshy@	2005-04-20 20:22:33 +00:00
Joseph Koshy	ebccf1e3a6	Bring a working snapshot of hwpmc(4), its associated libraries, userland utilities and documentation into -CURRENT. Bump FreeBSD_version. Reviewed by: alc, jhb (kernel changes)	2005-04-19 04:01:25 +00:00
Warner Losh	06db52b609	Break out the definition of bus_space_{tag,handle}_t and a few other types into _bus.h to help with name space polution from including all of bus.h. In a few days, I'll commit changes to the MI code to take advantage of thse sepration (after I've made sure that these changes don't break anything in the main tree, I've tested in my trees, but you never know...). Suggested by: bde (in 2002 or 2003 I think) Reviewed in principle by: jhb	2005-04-18 21:45:34 +00:00
Peter Wemm	ba5f6b61da	MFi386: use the lapic timer for UP systems that are using the apic so that IRQ0 and mixed mode isn't a problem anymore. This removes mixed mode support because nothing is left that uses it.	2005-04-15 18:44:53 +00:00
Peter Wemm	0501844603	MFi386: use c99 types	2005-04-15 18:41:32 +00:00
Peter Wemm	7234adbe8e	Show that I can actually count.	2005-04-15 18:39:31 +00:00
Peter Wemm	2fc8e0f037	MFi386: track bus.h changes (unsplit bus_${machine}.h)	2005-04-15 18:38:59 +00:00
Peter Wemm	fe8b8bf778	Implement 32-bit compatable fsbase/gsbase methods so that we can run (newer) unmodified static i386 binaries again.	2005-04-14 16:57:58 +00:00
John Baldwin	181897f05f	The memory operands to fldcw and ldmxcsr are inputs, not outputs.	2005-04-12 23:12:00 +00:00
Alan Cox	16f571bd18	Align the entry point to assembly language functions to a 16-byte boundary. (The Opteron's instruction fetcher reads instructions from the L1 cache in 16-byte, aligned packets.)	2005-04-10 20:49:21 +00:00
Colin Percival	d0b183c937	Fully initialize the required TSS fields so that the io permission bitmap is set correctly. Patch from: peter Security: FreeBSD-SA-05:03.amd64	2005-04-06 01:05:51 +00:00
John Baldwin	c6a37e8413	Divorce critical sections from spinlocks. Critical sections as denoted by critical_enter() and critical_exit() are now solely a mechanism for deferring kernel preemptions. They no longer have any affect on interrupts. This means that standalone critical sections are now very cheap as they are simply unlocked integer increments and decrements for the common case. Spin mutexes now use a separate KPI implemented in MD code: spinlock_enter() and spinlock_exit(). This KPI is responsible for providing whatever MD guarantees are needed to ensure that a thread holding a spin lock won't be preempted by any other code that will try to lock the same lock. For now all archs continue to block interrupts in a "spinlock section" as they did formerly in all critical sections. Note that I've also taken this opportunity to push a few things into MD code rather than MI. For example, critical_fork_exit() no longer exists. Instead, MD code ensures that new threads have the correct state when they are created. Also, we no longer try to fixup the idlethreads for APs in MI code. Instead, each arch sets the initial curthread and adjusts the state of the idle thread it borrows in order to perform the initial context switch. This change is largely a big NOP, but the cleaner separation it provides will allow for more efficient alternative locking schemes in other parts of the kernel (bare critical sections rather than per-CPU spin mutexes for per-CPU data for example). Reviewed by: grehan, cognet, arch@, others Tested on: i386, alpha, sparc64, powerpc, arm, possibly more	2005-04-04 21:53:56 +00:00
Alexander Leidinger	3df129097b	The file machine/ieeefp.h needs sys/cdefs.h on amd64 and i386 after my compiler features tests. This is ok, since machine/ieeefp.h is an internal interface. But floatingpoint.h is a public interface and some ports use it, so include sys/cdefs.h in the amd64 and i386 version of floatingpoint.h. Note: some architectures don't provide recursive inclusion protection in floatingpoint.h, namely alpha and ia64. Except for this part and now the include of sys/cdefs.h, all those files are equal (from a compiler POV), so they could be moved to only one version in src/include/. Approved by: joerg	2005-04-02 17:31:42 +00:00
David Schultz	7b74e4a759	Remove fpsetsticky(). This was added for SysV compatibility, but due to mistakes from day 1, it has always had semantics inconsistent with SVR4 and its successors. In particular, given argument M: - On Solaris and FreeBSD/{alpha,sparc64}, it clobbers the old flags and sets the new flag word to M. (NetBSD, too?) - On FreeBSD/{amd64,i386}, it clears the flags that are specified in M and leaves the remaining flags unchanged (modulo a small bug on amd64.) - On FreeBSD/ia64, it is not implemented. There is no way to fix fpsetsticky() to DTRT for both old FreeBSD apps and apps ported from other operating systems, so the best approach seems to be to kill the function and fix any apps that break. I couldn't find any ports that use it, and any such ports would already be broken on FreeBSD/ia64 and Linux anyway. By the way, the routine has always been undocumented in FreeBSD, except for an MLINK to a manpage that doesn't describe it. This manpage has stated since 5.3-RELEASE that the functions it describes are deprecated, so that must mean that functions that it is supposed to describe but doesn't are even more deprecated. ;-) Note that fpresetsticky() has been retained on FreeBSD/i386. As far as I can tell, no other operating systems or ports of FreeBSD implement it, so there's nothing for it to be inconsistent with. PR: 75862 Suggested by: bde	2005-03-15 15:53:39 +00:00
Scott Long	5974e5c71c	Refactor the bus_dma header files so that the interface is described in sys/bus_dma.h instead of being copied in every single arch. This slightly reorders a flag that was specific to AXP and thus changes the ABI there. The interface still relies on bus_space definitions found in <machine/bus.h> so it cannot be included on its own yet, but that will be fixed at a later date. Add an MD <machine/bus_dma.h> for ever arch for consistency and to allow for future MD augmentation of the API. sparc64 makes heavy use of this right now due to its different bus_dma implemenation.	2005-03-14 16:46:28 +00:00
Peter Wemm	cf4e1c4613	Remove diffs to i386 version that came in via the compiler support ifdefs. This changes things like whitespace, inconsistent use of #ifndef vs #if !defined(), different macro argument orders, mismatched comments, etc.	2005-03-11 22:16:09 +00:00
Peter Wemm	639ac97a88	Match i386 rev 1.38 with __cplusplus support	2005-03-11 21:46:01 +00:00
Joerg Wunsch	a5f50ef9e4	netchild's mega-patch to isolate compiler dependencies into a central place. This moves the dependency on GCC's and other compiler's features into the central sys/cdefs.h file, while the individual source files can then refer to #ifdef __COMPILER_FEATURE_FOO where they by now used to refer to #if __GNUC__ > 3.1415 && __BARC__ <= 42. By now, GCC and ICC (the Intel compiler) have been actively tested on IA32 platforms by netchild. Extension to other compilers is supposed to be possible, of course. Submitted by: netchild Reviewed by: various developers on arch@, some time ago	2005-03-02 21:33:29 +00:00
Peter Wemm	e73976812c	MFi386: Update alc's copyright notice	2005-02-28 23:38:15 +00:00
Peter Wemm	c29f1e2b3b	MFi386: Bring over John's local apic timer code	2005-02-28 23:37:35 +00:00
Ruslan Ermilov	3971d2cf5e	Use a common multi-inclusion protection, and add such a protection to alpha/include/exec.h.	2005-02-19 21:16:48 +00:00
Peter Wemm	b6e89c6d47	JumboMFi386: use bitmapped IPI handler. Update elcr and default mptable config handler. Tidy up various local apic initialization.	2005-01-21 06:01:20 +00:00
Peter Wemm	ba2426ff44	MFi386: whitespace, copyright header, etc updates	2005-01-21 05:56:41 +00:00
Scott Long	e015dfcfd1	Introduce bus_dmamap_load_mbuf_sg(). Instead of taking a callback arg, this cuts to the chase and fills in a provided s/g list. This is meant to optimize out the cost of the callback since the callback doesn't serve much purpose for mbufs since mbuf loads will never be deferred. This is just for amd64 and i386 at the moment, other arches will be coming shortly.	2005-01-07 07:57:18 +00:00
Warner Losh	46280ae719	Begin all license/copyright comments with /*-	2005-01-05 20:17:21 +00:00
Warner Losh	17d5b792e5	PC98 will never be defined for amd64	2005-01-05 20:11:13 +00:00
Marcel Moolenaar	bcc5241c43	Change gdb_cpu_setreg() to not take the value to which to set the specified register, but a pointer to the in-memory representation of that value. The reason for this is twofold: 1. Not all registers can be represented by a register_t. In particular FP registers fall in that category. Passing the new register value by reference instead of by value makes this point moot. 2. When we receive a G or P packet, both are for writing a register, the packet will have the register value in target-byte order and in the memory representation (modulo the fact that bytes are sent as 2 printable hexadecimal numbers of course). We only need to decode the packet to have a pointer to the register value. This change fixes the bug of extracting the register value of the P packet as a hexadecimal number instead of as a bit array. The quick (and dirty) fix to bswap the register value in gdb_cpu_setreg() as it has been added on i386 and amd64 can therefore be removed and has in fact been that. Tested on: alpha, amd64, i386, ia64, sparc64	2004-12-01 06:40:35 +00:00
David Schultz	ab44ebf537	Remove UAREA_PAGES. Reviewed by: arch@	2004-11-20 02:29:50 +00:00
Peter Wemm	3904b13fab	Raise MAXDSIZ from 8G to 32G. The old limit was just an arbitary choice that was greater than 4G. I originally used the same values as i386 in order to save opening a new PML4 page slot, but in the day of gigabytes of memory, worrying about a 4K page seems futile. Moving from 8 to 32G moves the page to a different index, it doesn't increase the number of pages used.	2004-10-27 17:21:15 +00:00
Nate Lawson	31ad3b8802	Move the code for halting the CPU (acpi_cpu_c1) into machdep files. This removes the last MD portion of acpi_cpu.c. MFC after: 2 weeks	2004-10-11 05:39:15 +00:00
Alan Cox	aced26ce6e	Make pte_load_store() an atomic operation in all cases, not just i386 PAE. Restructure pmap_enter() to prevent the loss of a page modified (PG_M) bit in a race between processors. (This restructuring assumes the newly atomic pte_load_store() for correct operation.) Reviewed by: tegge@ PR: i386/61852	2004-10-08 08:23:43 +00:00
Alan Cox	0a752e9843	Prevent the unexpected deallocation of a page table page while performing pmap_copy(). This entails additional locking in pmap_copy() and the addition of a "flags" parameter to the page table page allocator for specifying whether it may sleep when memory is unavailable. (Already, pmap_copy() checks the availability of memory, aborting if it is scarce. In theory, another CPU could, however, allocate memory between pmap_copy()'s check and the call to the page table page allocator, causing the current thread to release its locks and sleep. This change makes this scenario impossible.) Reviewed by: tegge@	2004-09-29 19:20:40 +00:00
Peter Wemm	c3277f936c	Severely strip down the repocopied i386/bios.c and bios.h files. It turns out that bios_sigsearch() etc is useful for finding tables in roms.	2004-09-24 00:42:36 +00:00
Peter Wemm	7789933b6a	MFi386: adapt rev 1.19 (debugger fixes)	2004-09-22 01:27:06 +00:00
Scott Long	9e0c3bdf64	Double the number of kernel page tables for amd64 and for i386/PAE. The old value was only enough for 8GB of RAM, the new value can do 16GB. This still isn't optimal since it doesn't scale. Fixing this for amd64 looks to be fairly easy, but for i386 will be quite difficult. Reviewed by: peter	2004-09-11 01:31:26 +00:00
Scott Long	9923b511ed	Turn PREEMPTION into a kernel option. Make sure that it's defined if FULL_PREEMPTION is defined. Add a runtime warning to ULE if PREEMPTION is enabled (code inspired by the PREEMPTION warning in kern_switch.c). This is a possible MT5 candidate.	2004-09-02 18:59:15 +00:00
Marcel Moolenaar	0f2fe153bc	Move the kernel-specific logic to adjust frompc from MI to MD. For these two reasons: 1. On ia64 a function pointer does not hold the address of the first instruction of a functions implementation. It holds the address of a function descriptor. Hence the user(), btrap(), eintr() and bintr() prototypes are wrong for getting the actual code address. 2. The logic forces interrupt, trap and exception entry points to be layed-out contiguously. This can not be achieved on ia64 and is generally just bad programming. The MCOUNT_FROMPC_USER macro is used to set the frompc argument to some kernel address which represents any frompc that falls outside the kernel text range. The macro can expand to ~0U to bail out in that case. The MCOUNT_FROMPC_INTR macro is used to set the frompc argument to some kernel address to represent a call to a trap or interrupt handler. This to avoid that the trap or interrupt handler appear to be called from everywhere in the call graph. The macro can expand to ~0U to prevent adjusting frompc. Note that the argument is selfpc, not frompc. This commit defines the macros on all architectures equivalently to the original code in sys/libkern/mcount.c. People can take it from here... Compile-tested on: alpha, amd64, i386, ia64 and sparc64 Boot-tested on: i386	2004-08-27 19:42:35 +00:00
Peter Wemm	1c0dea0f6e	Sync with i386 - Optimize intr_execute_handlers a bit etc.	2004-08-16 23:12:30 +00:00
Robert Watson	a632deec30	Add an "options MP_WATCHDOG" to i386. This option allows one of the logical CPUs on a system to be used as a dedicated watchdog to cause a drop to the debugger and/or generate an NMI to the boot processor if the kernel ceases to respond. A sysctl enables the watchdog running out of the processor's idle thread; a callout is launched to reset a timer in the watchdog. If the callout fails to reset the timer for ten seconds, the watchdog will fire. The sysctl allows you to select which CPU will run the watchdog. A sample "debug.leak_schedlock" is included, which causes a sysctl to spin holding sched_lock in order to trigger the watchdog. On my Xeons, the watchdog is able to detect this failure mode and break into the debugger, which cannot otherwise be done without an NMI button. This option does not currently work with sched_ule due to ule's push notion of scheduling, similar to machdep.hlt_logical_cpus failing to work with that scheduler. On face value, this might seem somewhat inefficient, but there are a lot of dual-processor Xeons with HTT around, so using one as a watchdog for testing is not as inefficient as one might fear.	2004-08-15 18:02:09 +00:00
Maxime Henrion	9f1b87f106	Instead of calling ia32_pause() conditionally on __i386__ or __amd64__ being defined, define and use a new MD macro, cpu_spinwait(). It only expands to something on i386 and amd64, so the compiled code should be identical. Name of the macro found by: jhb Reviewed by: jhb	2004-08-03 18:44:27 +00:00
Doug Rabson	595a88c6af	Add style(9) foolishness.	2004-08-03 08:21:48 +00:00
Doug Rabson	4d84a58d1d	Add definitions for TLS relocations.	2004-08-02 19:12:17 +00:00
Scott Long	9352fe30a0	Turn off PREEMPTION by default while it gets debugged. It's been causing 4 weeks of problems including deadlocks and instant panics. Note that the real bugs are likely in the scheduler.	2004-08-01 14:31:45 +00:00
Mark Murray	8ab2f5ecc5	Break out the MI part of the /dev/[k]mem and /dev/io drivers into their own directory and module, leaving the MD parts in the MD area (the MD parts _are_ part of the modules). /dev/mem and /dev/io are now loadable modules, thus taking us one step further towards a kernel created entirely out of modules. Of course, there is nothing preventing the kernel from having these statically compiled.	2004-08-01 11:40:54 +00:00
Paul Saab	bc35f5dc9e	MFia64: Fix -O builds with gcc 3.4 by defining ffs as __builtin_ffs instead of creating an inline function that just calls __builtin_ffs.	2004-07-30 16:44:29 +00:00
Alexander Kabaev	8b5ae4db0d	Use newly added __used attribute to keep static function symbol from being eliminated.	2004-07-29 18:02:28 +00:00
Robert Watson	1a8cfbc450	Pass a thread argument into cpu_critical_{enter,exit}() rather than dereference curthread. It is called only from critical_{enter,exit}(), which already dereferences curthread. This doesn't seem to affect SMP performance in my benchmarks, but improves MySQL transaction throughput by about 1% on UP on my Xeon. Head nodding: jhb, bmilekic	2004-07-27 16:41:01 +00:00
Alan Cox	fa543780cc	Remove the allpmaps list. It's unused. Reviewed by: peter@	2004-07-20 02:40:56 +00:00
David Schultz	479f8d2214	Make FLT_ROUNDS correctly reflect the dynamic rounding mode.	2004-07-19 08:17:25 +00:00
Peter Wemm	6897c4aef7	Like on i386, eliminate pv_ptem (which was suggested by alc). This reduces the size of the pv_entry structure a small but significant amount. This is implemented a little differently because it isn't so cheap to get the physical address of the page tabke page on amd64.. instead of it being directly accessible from the top level page directory, it is now two additional tree levels down. However.. In almost all cases, we recently had the physical address if the page table page a short while before we needed it, but it slipped through our fingers. This patch saves it for when we do need it. Also, for the one case where we do not have the ptp paddr, we are always running in curproc context and so we can do a vtopte-like trick. I've implemented vtopde() for this purpose. There is still a CYA entry in pmap_unuse_pt() that needs to be removed. I think it can be removed now but I forgot to test with it gone.	2004-07-14 07:13:35 +00:00
Marcel Moolenaar	37224cd3fc	Mega update for the KDB framework: turn DDB into a KDB backend. Most of the changes are a direct result of adding thread awareness. Typically, DDB_REGS is gone. All registers are taken from the trapframe and backtraces use the PCB based contexts. DDB_REGS was defined to be a trapframe on all platforms anyway. Thread awareness introduces the following new commands: thread X switch to thread X (where X is the TID), show threads list all threads. The backtrace code has been made more flexible so that one can create backtraces for any thread by giving the thread ID as an argument to trace. With this change, ia64 has support for breakpoints.	2004-07-10 23:47:20 +00:00
Marcel Moolenaar	f9c8fc6017	Remove obsolete prototype of kdb_trap().	2004-07-10 22:39:56 +00:00
Marcel Moolenaar	5a39cbaf69	Implement makectx(). The makectx() function is used by KDB to create a PCB from a trapframe for purposes of unwinding the stack. The PCB is used as the thread context and all but the thread that entered the debugger has a valid PCB. This function can also be used to create a context for the threads running on the CPUs that have been stopped when the debugger got entered. This however is not done at the time of this commit.	2004-07-10 19:56:00 +00:00
Marcel Moolenaar	cbc174356c	Introduce the KDB debugger frontend. The frontend provides a framework in which multiple (presumably different) debugger backends can be configured and which provides basic services to those backends. Besides providing services to backends, it also serves as the single point of contact for any and all code that wants to make use of the debugger functions, such as entering the debugger or handling of the alternate break sequence. For this purpose, the frontend has been made non-optional. All debugger requests are forwarded or handed over to the current backend, if applicable. Selection of the current backend is done by the debug.kdb.current sysctl. A list of configured backends can be obtained with the debug.kdb.available sysctl. One can enter the debugger by writing to the debug.kdb.enter sysctl.	2004-07-10 18:40:12 +00:00
Marcel Moolenaar	72d44f31a6	Introduce the GDB debugger backend for the new KDB framework. The backend improves over the old GDB support in the following ways: o Unified implementation with minimal MD code. o A simple interface for devices to register themselves as debug ports, ala consoles. o Compression by using run-length encoding. o Implements GDB threading support.	2004-07-10 17:47:22 +00:00
Peter Wemm	d8ad50b704	MFi386: various io apic cleanups	2004-07-08 01:42:49 +00:00
Peter Wemm	2e37b53aba	MFi386: whitespace nit fix (spare blank line)	2004-07-08 01:32:25 +00:00
John Baldwin	0c0b25ae91	Implement preemption of kernel threads natively in the scheduler rather than as one-off hacks in various other parts of the kernel: - Add a function maybe_preempt() that is called from sched_add() to determine if a thread about to be added to a run queue should be preempted to directly. If it is not safe to preempt or if the new thread does not have a high enough priority, then the function returns false and sched_add() adds the thread to the run queue. If the thread should be preempted to but the current thread is in a nested critical section, then the flag TDF_OWEPREEMPT is set and the thread is added to the run queue. Otherwise, mi_switch() is called immediately and the thread is never added to the run queue since it is switch to directly. When exiting an outermost critical section, if TDF_OWEPREEMPT is set, then clear it and call mi_switch() to perform the deferred preemption. - Remove explicit preemption from ithread_schedule() as calling setrunqueue() now does all the correct work. This also removes the do_switch argument from ithread_schedule(). - Do not use the manual preemption code in mtx_unlock if the architecture supports native preemption. - Don't call mi_switch() in a loop during shutdown to give ithreads a chance to run if the architecture supports native preemption since the ithreads will just preempt DELAY(). - Don't call mi_switch() from the page zeroing idle thread for architectures that support native preemption as it is unnecessary. - Native preemption is enabled on the same archs that supported ithread preemption, namely alpha, i386, and amd64. This change should largely be a NOP for the default case as committed except that we will do fewer context switches in a few cases and will avoid the run queues completely when preempting. Approved by: scottl (with his re@ hat)	2004-07-02 20:21:44 +00:00
Bruce Evans	4c5f10a672	Backed out previous commit. Blind substitution of dev_t by `struct cdev *' was just wrong here because the dev_t's are user dev_t's.	2004-06-20 03:52:50 +00:00
Bruce Evans	7a637a637e	Include <sys/_lock.h>'s prerequisite <sys/queue.h> before including the former, not after. Don't hide this bug by including <sys/queue.h> in <sys/_lock.h>.	2004-06-19 14:58:35 +00:00
Poul-Henning Kamp	89c9c53da0	Do the dreaded s/dev_t/struct cdev */ Bump __FreeBSD_version accordingly.	2004-06-16 09:47:26 +00:00
Alan Cox	50f91a9445	Introduce pmap locking to many of the pmap functions. There is more to come later.	2004-06-14 01:17:50 +00:00
Alan Cox	8559e0a291	- Remove an unused declaration. - Move a definition inside the scope of a #ifdef _KERNEL.	2004-06-13 03:44:11 +00:00
Peter Wemm	a520047095	Argh. Add the mini-stack-frame back in for mcount's benefit for syscall stubs.	2004-06-10 22:02:26 +00:00
Peter Wemm	6d05d7c75a	Make profiling work for varargs functions.. %al is an additional argument which indicates the number of xmm registers used in the varargs. This stops the explosion that happened when profiling printf() etc.	2004-06-10 22:00:58 +00:00
John Baldwin	bad4ce7d91	- Use the correct devclass name ("acpi" vs "ACPI") to detect if acpi0 is present and thus that the PnPBIOS probe should be skipped instead of having ACPI zero out the PnPBIOStable pointer. - Make the PnPBIOStable pointer static to i386/i386/bios.c now that that is the only place it is used.	2004-06-10 20:43:04 +00:00
John Baldwin	092a5c4530	Remove atdevbase and replace it's remaining uses with direct references to KERNBASE instead.	2004-06-10 20:31:00 +00:00
Peter Wemm	430e272c7e	Initial PG_NX support (no-execute page bit) - export the rest of the cpu features (and amd's features). - turn on EFER_NXE, depending on the NX amd feature bit - reorg the identcpu stuff a bit in order to stop treating the amd features as second class features (since it is now a primary feature bit set) and make it easier to export.	2004-06-08 01:02:52 +00:00
Peter Wemm	7d95d34bb7	Mask pte's with PG_FRAME before passing it to PHYS_TO_VM_PAGE().. PG_NX lives in the top 12 'available' bits. atop() in the PHYS_TO_VM_PAGE() macro only masks off the lower bits (by accident) and the upper bits in the 64 bit ptes turn into "interesting" index values.	2004-06-08 00:29:42 +00:00
Peter Wemm	b8168edefc	Be a little more consistent in the naming of the PML4 defines.	2004-06-07 23:47:59 +00:00
Poul-Henning Kamp	79005bbdbe	Add new bios_string() which will hunt for a string inside a given range of the BIOS. This can be used for finding arbitrary magic in the BIOS in order to recognize particular platforms.	2004-06-03 22:36:24 +00:00
Peter Wemm	9248fc7bc0	MFi386: move cpu_nameclass struct next to its only consumer	2004-06-03 20:18:15 +00:00
Bruce Evans	5a8f125ad9	MFi386 (1.37: GUPROF calibration macros; only routine adjustments needed).	2004-05-20 16:22:57 +00:00
Bruce Evans	8693960479	Fixed the type of fptrdiff_t. It needs to be 64 bits in theory, and in practice too since kernel addresses are almost 2^64 higher than most user addresses.	2004-05-19 16:19:11 +00:00
Bruce Evans	19b5915afa	Fixed some style bugs (mainly misalignment of backslashes).	2004-05-19 16:04:26 +00:00
Bruce Evans	b2321e7cdb	Moved most of the "MI" definitions and declarations from <machine/profile.h> to <sys/gmon.h>. Cleaned them up a little by not attempting to ifdef for incomplete and out of date support for GUPROF in userland, as in the sparc64 version.	2004-05-19 15:41:26 +00:00
Peter Wemm	2079cde964	The 'call mcount' hooks that gcc inserts when profiling are in a place that cannot handle the scratch registers being trashed. So we have to preserve them ourselves.	2004-05-18 22:52:32 +00:00
Stefan Farfeleder	b1aa0ba527	<stdint.h> should define WINT_M{AX,IN} independent from whether WCHAR_MIN is defined. Otherwise first including <wchar.h> and then <stdint.h> leads to no WINT_M{AX,IN} at all. PR: 64956 Approved by: das (mentor)	2004-05-18 16:04:57 +00:00
Peter Wemm	df4fd27737	Checkpoint some of what I was starting to tinker with for having some different context support for 32 vs 64 bit processes. This simply omits the save/restore of the segment selector registers for non 32 bit processes. This avoids the rdmsr/rwmsr juggling when restoring %gs clobbers the kernel msr that holds the gsbase. However, I suspect it might be better to conditionally do this at user<->kernel transition where we wouldn't need to do the juggling in the first place. Or have per-thread extended context save/restore hooks.	2004-05-16 22:43:57 +00:00
Peter Wemm	12c1418ccf	Kill the LAZYPMAP ifdefs. While they worked, they didn't do anything to help the AMD cpus (which have a hardware tlb flush filter). I held off to see what the 64 bit Intel cpus did, but it doesn't seem to help much there either. Oh well, store it in the Attic.	2004-05-16 22:11:50 +00:00
Peter Wemm	792e29ba26	For consistency with i386, have pmap_kenter_temporary() take a vm_paddr_t argument. It is actually the same type on amd64 (vm_paddr_t = vm_offset_t) but this reduces the i386<->amd64 diffs a little.	2004-05-16 20:44:41 +00:00
Peter Wemm	463e5aa66e	MFi386: numerous interrupt and acpi updates	2004-05-16 20:30:47 +00:00
Nate Lawson	65a7c90189	Add an MI implementation of the ACPI global lock routines and retire the individual asm versions. The global lock is shared between the BIOS and OS and thus cannot use our mutexes. It is defined in section 5.2.9.1 of the ACPI specification. Reviewed by: marcel, bde, jhb	2004-05-05 20:04:14 +00:00
David Schultz	be3930682a	Hide FLT_EVAL_METHOD and DECIMAL_DIG in pre-C99 compilation environments. PR: 63935 Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at>	2004-04-25 02:36:29 +00:00
Alan Cox	ec8544f744	Simplify the sf_buf implementation. In short, make it a trivial veneer over the direct virtual-to-physical mapping.	2004-04-18 05:36:37 +00:00
Alan Cox	2c38d78e41	- is_physical_memory()'s parameter, which is a physical address, should be a vm_paddr_t not a vm_offset_t.	2004-04-11 04:26:58 +00:00
Alan Cox	b14d6acced	- pmap_kenter_temporary() is unused by machine-independent code. Therefore, move its declaration to the machine-dependent header file on those machines that use it. In principle, only i386 should have it. Alpha and AMD64 should use their direct virtual-to-physical mapping. - Remove pmap_kenter_temporary() from ia64. It is unused. Approved by: marcel@	2004-04-10 22:41:46 +00:00
Peter Wemm	cd0149e39b	MFi386: move rss() from db_interface.c to cpufunc.h	2004-04-07 00:41:05 +00:00
Warner Losh	9a80fddc71	Remove advertising clause from University of California Regent's license, per letter dated July 22, 1999 and email from Peter Wemm. Approved by: core, peter	2004-04-05 23:55:14 +00:00
Warner Losh	29ae923f44	Remove advertising clause from University of California Regent's license, per letter dated July 22, 1999. Approved by: core	2004-04-05 21:29:41 +00:00
Alan Cox	803e5b9ccc	Remove ptmmap and ptvmmap. They are unused on amd64.	2004-04-03 20:53:50 +00:00
Alan Cox	c64b70130e	- Add an optimized page copy function for use by pmap_copy_page(). It is roughly four times faster than bcopy() for uncached pages. - Sort the function prototypes in md_var.h.	2004-03-31 02:03:49 +00:00
Marcel Moolenaar	a36bdc0606	Introduce the cpumask_t type. The purpose of the type is to create a level of abstraction for any and all CPU mask and CPU bitmap variables so that platforms have the ability to break free from the hard limit of 32 CPUs, simply because we don't have more bits in an u_int. Note that the type is not supposed to solve massive parallelism, where the number of CPUs can be larger than the width of the widest integral type. As such, cpumask_t is not supposed to be a compound type. If such would be necessary in the future, we can deal with the issues then and there. For now, it can be assumed that the type is integral and unsigned. With this commit, all MD definitions start off as u_int. This allows us to phase-in cpumask_t at our leasure without breaking anything. Once cpumask_t is used consistently, platforms can switch to wider (or smaller) types if such would be beneficial (or not; whatever :-) Compile-tested on: i386	2004-03-20 20:41:40 +00:00
Peter Wemm	f8325b428c	Re-kill ispcvt on amd64 - rc.d/syscons was fixed ages ago.	2004-03-13 22:18:34 +00:00
Peter Wemm	73f3495386	Move the non-MD machine/dvcfg.h and machine/physio_proc.h to a common MI area before they proliferate more.	2004-03-13 19:46:27 +00:00
Peter Wemm	10884719f8	MFi386: nuke pci_cfgintr	2004-03-13 19:19:13 +00:00
Bruce Evans	a67ef0a77a	Don't implement anything in the ffs family in <machine/cpufunc.h> in the non-_KERNEL case. This "fixes" applications that include this "kernel-only" header and also include <strings.h> (or get <strings.h> via the default _BSD_VISIBLE pollution in <string.h>. In C++ there was a fatal error: the declaration specifies C linkage but the implementation gives C++ linkage. In C there was only a static/extern mismatch if the headers were included in a certain order order, and a partially redundant declaration for all include orders; gcc emits incomplete or wrong diagnostics for these, but only for compiling with -Wsystem-headers and certain other warning options, so the problem was usually not seen for C. Ports breakage reported by: kris	2004-03-11 13:38:54 +00:00
Peter Wemm	4f6c19e593	MFi386: re-sort non-gcc function prototypes, trim includes	2004-03-08 00:24:15 +00:00
Alan Cox	925d2fedf5	Remove unused declarations. (Some time ago, these variables became fields of vm/vm.h's struct kva_md_info.)	2004-03-07 07:13:15 +00:00
Lukas Ertl	1bcf24ee9d	Fix syntax errors and wrong function prototypes in several MD header files when using non-GNUC compilers. PR: kern/58515 Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at> Approved by: grog (mentor), obrien	2004-03-05 09:19:59 +00:00
David E. O'Brien	e81bb67370	AMD64 versions.	2004-03-03 07:00:48 +00:00
Peter Wemm	170a05510d	Re-add user_dbreg_trap() for debug register support	2004-01-29 00:05:03 +00:00
Peter Wemm	1c89210c83	We can simplify a lot of things now that we don't have to worry about hardware bugs on external 386 cpus and now that we can depend on SSE.	2004-01-28 23:55:58 +00:00
Peter Wemm	d957532a87	Add dbreg struct definitions for /proc/*/dbregs and a place to store the registers in the pcb	2004-01-28 23:54:31 +00:00
Peter Wemm	1182b177ff	Re-add debug register functions	2004-01-28 23:53:04 +00:00
Peter Wemm	b29fd7c4db	MFi386: mp_topology().	2004-01-28 23:51:16 +00:00
Peter Wemm	5e465ab919	MFi386: add THERMTRIP msr values	2004-01-28 23:47:22 +00:00
Poul-Henning Kamp	37f0ad870c	remove elan_mmcr, I'm not sure I understand what it did here in the first place.	2004-01-17 13:13:48 +00:00
Jacques Vidrine	e4dc8baa84	Provide sysarch(2) prototypes in the MD sysarch.h headers. While I'm at it, use the ANSI C generic pointer type for the second argument, thus matching the documentation. Remove the now extraneous (and now conflicting) function declarations in various libc sources. Remove now unnecessary casts. Reviewed by: bde	2004-01-09 16:52:09 +00:00
Jacques Vidrine	3f6f39ff54	Remove `static' prototype from header file.	2004-01-06 20:36:21 +00:00
Peter Wemm	c15e347e22	GC the unused <machine/kse.h> file.	2003-12-24 00:51:30 +00:00
Alan Cox	35833d6a20	Since we have additional kernel virtual address space, allow the buffer cache to grow to 400M bytes.	2003-12-20 20:03:10 +00:00
Peter Wemm	d50c65bfce	MFi386: remove APIC_IRQ* defines that are no longer used.	2003-12-11 02:48:25 +00:00
David E. O'Brien	70005bdb38	Add just enough of i386/include/pcvt_ioctl.h to amd64/include/pcvt_ioctl.h such that 'ispcvt' can build. Unforunately 'ispcvt' is needed in order for /etc/rc.d/syscons to run. This fixes the bug where I could not get my keymap effective at boot.	2003-12-10 02:49:17 +00:00
Alan Cox	4d4a286cba	Increase VM_KMEM_SIZE_MAX from 200MB to 400MB. Discussed with: peter	2003-12-07 04:51:04 +00:00
Peter Wemm	3db6450423	Reconfigure the runq macros to use the 64 bit ffs/bsf routines instead of doing a loop and taking two 32 bit passes at the runqueue bits. All the 64 bit platforms should probably do this since there are 64 run queues. Approved by: re (scottl)	2003-12-06 23:24:18 +00:00
Peter Wemm	176ce2b1f4	Add 64 bit bsf/ffs routines. Have the ffs() inline use gcc's builtin because it uses the better cmove instructions to avoid branches.	2003-12-06 23:22:43 +00:00
Peter Wemm	62b689edc9	Kill the ASM versions of the mtx_lock_spin and friends. They were never used on amd64, and were actually totally broken. They had the wrong calling conventions. I believe the i386 versions are going away too. Approved by: re (scottl)	2003-12-06 23:17:18 +00:00
Peter Wemm	42f5377536	Revert some amd64 changes that cached curthread and converge back to the i386 version. The curthread special case in pcpu.h solves my complaint about the verbose macro expansion in this case. Note that the i386 version still has some OBE comments, I didn't re-add them back again. Approved by: re (scottl)	2003-12-06 23:13:22 +00:00
Peter Wemm	cda078658e	Cosmetic and/or trivial sync up with i386. Approved by: re (rwatson)	2003-11-21 03:02:00 +00:00
Peter Wemm	b4bae2b0c5	Argh! Followup to previous commit. I checked in the patch with an unintended local change. Change Xurthread back to curthread.	2003-11-20 23:49:28 +00:00
Peter Wemm	5dd883833c	Provide a streamlined '#define curthread __curthread()' for amd64 to avoid the compiler having to parse and optimize the PCPU_GET(curthread) so often. __curthread() is an inline optimized version of PCPU_GET(curthread) that knows that pc_curthread is at offset zero in the pcpu struct. Add a CTASSERT() to catch any possible changes to this. This accounts for just over a 1% wall clock speedup for total kernel compile/link time, and 20% compile time speedup on some specific files depending on which compile options are used. Approved by: re (jhb)	2003-11-20 22:54:44 +00:00
Peter Wemm	0d2a298904	Initial landing of SMP support for FreeBSD/amd64. - This is heavily derived from John Baldwin's apic/pci cleanup on i386. - I have completely rewritten or drastically cleaned up some other parts. (in particular, bootstrap) - This is still a WIP. It seems that there are some highly bogus bioses on nVidia nForce3-150 boards. I can't stress how broken these boards are. I have a workaround in mind, but right now the Asus SK8N is broken. The Gigabyte K8NPro (nVidia based) is also mind-numbingly hosed. - Most of my testing has been with SCHED_ULE. SCHED_4BSD works. - the apic and acpi components are 'standard'. - If you have an nVidia nForce3-150 board, you are stuck with 'device atpic' in addition, because they somehow managed to forget to connect the 8254 timer to the apic, even though its in the same silicon! ARGH! This directly violates the ACPI spec.	2003-11-17 08:58:16 +00:00
Bruce Evans	6a0d1abc5d	Fixed pedantic warnings for statement-expressions using __extension__ and by not using a statement-expression for the non-expression __PCPU_SET().	2003-11-17 04:40:58 +00:00
Bruce Evans	81bbee5996	Fixed a pedantic syntax error (a stray semicolon at the end of PCPU_MD_FIELDS).	2003-11-17 03:40:41 +00:00
Alan Cox	e45db9b837	- Modify alpha's sf_buf implementation to use the direct virtual-to- physical mapping. - Move the sf_buf API to its own header file; make struct sf_buf's definition machine dependent. In this commit, we remove an unnecessary field from struct sf_buf on the alpha, amd64, and ia64. Ultimately, we may eliminate struct sf_buf on those architecures except as an opaque pointer that references a vm page.	2003-11-16 06:11:26 +00:00
Nate Lawson	b72e9cf526	Add the pc_acpi_id PCPU member. The new acpi_cpu driver uses this to dereference the softc.	2003-11-15 18:58:29 +00:00
Peter Wemm	1f6c75db0b	Preemptively burn a bridges. The isa timer code is likely to be replaced by the HPET timer at some point, so dont even make a release with the aquire/release_timer0 functions.	2003-11-14 22:34:43 +00:00
Peter Wemm	7b66b81ee4	Convert a couple of pointers to integers for source compatability with amd64.	2003-11-14 22:23:30 +00:00
Peter Wemm	40e3826a9f	Whitespace nit (sorry, couldn't help it)	2003-11-14 22:21:30 +00:00
John Baldwin	3ab2ba59f4	Shuffle the APIC interrupt vectors around a bit: - Move the IPI and local APIC interrupt vectors up into the 0xf0 - 0xff range. The pmap lazyfix IPI was reordered down next to the TLB shootdowns to avoid conflicting with the spurious interrupt vector. - Move the base of APIC interrupts up 16 so that the first 16 APIC interrupts do not overlap the vectors used by the ATPIC. - Remove bogus interrupt vector reservations for LINT[01]. - Now that 0xc0 - 0xef are available, use them for device interrupts. This increases the number of APIC device interrupts to 191. - Increase the system-wide number of global interrupts to 191 to catch up to more APIC interrupts. Requested by: peter (2)	2003-11-14 19:10:13 +00:00
John Baldwin	69487322d8	Fix a typo.	2003-11-13 16:41:07 +00:00
John Baldwin	bd9cd7e3f7	- Move manipulation of td_intr_nesting_level out of assembly interrupt vector stubs and into the C functions they call. - Move disabling and EOIing of interrupt sources out of PIC driver entry points and into intr_execute_handlers(). Intr_execute_handlers() only disables a source for an interrupt if it is a stray interrupt or has threaded handlers. Sources with fast handlers no longer disable (mask) the source while executing the handlers. - Move the setting of clkintr_pending into intr_execute_handlers() and set the variable for any interrupt source with a vector of 0. (Should only be true for IRQ 0.) This fixes clkintr_pending in the NO_MIXED_MODE case. - Implement lapic_eoi() and use it to implement ioapic_eoi_source(). - Rename atpic_sched_ithd() to atpic_handle_intr() since it is used to handle all atpic interrupts and not just threaded ones. Inspired by: peter's changes to amd64 in p4 (1) Requested by: bde (2)	2003-11-12 18:13:57 +00:00
Peter Wemm	fcfe57d640	Update the graffiti.	2003-11-08 04:39:22 +00:00
Peter Wemm	398dbb11d8	Switch from having a fpu "device" to something that is more like the integrated part of the cpu core that it is.	2003-11-08 04:37:54 +00:00
Peter Wemm	bf2f09ee97	The great s/npx/fpu/gi	2003-11-08 03:33:38 +00:00
Peter Wemm	8b2454d833	Rename npx* to fpu*. I haven't done the flags/function names yet.	2003-11-08 02:39:46 +00:00
Scott Long	eb3b7bf69f	Document the lockfunc and lockfuncarg arguments to bus_dma_tag_create() in the busdma headers.	2003-11-07 23:29:42 +00:00
John Baldwin	8f8914ad98	Add the MP Table APIC enumerator. This code uses the BIOS MP Table to enumerate I/O APICs as well as local APICs. It also provides Host-PCI and PCI-PCI bridge drivers to use the MP Table to route PCI interrupts.	2003-11-03 22:12:37 +00:00
John Baldwin	6f92bdd0c1	New APIC support code: - The apic interrupt entry points have been rewritten so that each entry point can serve 32 different vectors. When the entry is executed, it uses one of the 32-bit ISR registers to determine which vector in its assigned range was triggered. Thus, the apic code can support 159 different interrupt vectors with only 5 entry points. - We now always to disable the local APIC to work around an errata in certain PPros and then re-enable it again if we decide to use the APICs to route interrupts. - We no longer map IO APICs or local APICs using special page table entries. Instead, we just use pmap_mapdev(). We also no longer export the virtual address of the local APIC as a global symbol to the rest of the system, but only in local_apic.c. To aid this, the APIC ID of each CPU is exported as a per-CPU variable. - Interrupt sources are provided for each intpin on each IO APIC. Currently, each source is given a unique interrupt vector meaning that PCI interrupts are not shared on most machines with an I/O APIC. That mapping for interrupt sources to interrupt vectors is up to the APIC enumerator driver however. - We no longer probe to see if we need to use mixed mode to route IRQ 0, instead we always use mixed mode to route IRQ 0 for now. This can be disabled via the 'NO_MIXED_MODE' kernel option. - The npx(4) driver now always probes to see if a built-in FPU is present since this test can now be performed with the new APIC code. However, an SMP kernel will panic if there is more than one CPU and a built-in FPU is not found. - PCI interrupts are now properly routed when using APICs to route interrupts, so remove the hack to psuedo-route interrupts when the intpin register was read. - The apic.h header was moved to apicreg.h and a new apicvar.h header that declares the APIs used by the new APIC code was added.	2003-11-03 21:53:38 +00:00
John Baldwin	ecee5704ed	New device interrupt code. This defines an interrupt source abstraction that provides methods via a PIC driver to do things like mask a source, unmask a source, enable it when the first interrupt handler is added, etc. The interrupt code provides a table of interrupt sources indexed by IRQ numbers, or vectors. These vectors are what new-bus uses for its IRQ resources and for bus_setup_intr()/bus_teardown_intr(). The interrupt code then maps that vector a given interrupt source object. When an interrupt comes in, the low-level interrupt code looks up the interrupt source for the source that triggered the interrupt and hands it off to this code to execute the appropriate handlers. By having an interrupt source abstraction, this allows us to have different types of interrupt source providers within the shared IRQ address space. For example, IRQ 0 may map to pin 0 of the master 8259A PIC, IRQs 1 through 60 may map to pins on various I/O APICs, and IRQs 120 through 128 may map to MSI interrupts for various PCI devices.	2003-11-03 21:25:52 +00:00
John Baldwin	63239aa581	Always export r_gdt and r_idt and give them extern declarations in machine/segments.h.	2003-10-30 21:42:17 +00:00
John Baldwin	07930cce05	A few whitespace and comment tweaks.	2003-10-24 21:02:26 +00:00
Peter Wemm	cedb3695c1	Add __va_copy and make it always visible, in spite of the __ISO_C_VISIBLE setting. Make va_copy be an alias if __ISO_C_VISIBLE >= 1999. Why? more than a few ports have an autoconf that looks for __va_copy because it is available on glibc. It is critical that we use it if at all possible on amd64. It generally isn't a problem for i386 and its ilk because autoconf driven code tends to fall back to an assignment.	2003-10-24 02:50:39 +00:00
Peter Wemm	63f2bb5ff1	Use a more robust API altogether for the amd64_get_fsbase() etc functions.	2003-10-23 06:06:14 +00:00
Peter Wemm	c0432d033e	Renumber the sysarch vectors for amd64 specific syscalls so that I can implement i386 compat numbers where it makes sense. This would save a syscall translation layer. Yes, this breaks the abi slightly again, but fortunately its just a recompile rather than tweaking the source. I will be fixing the libc stubs while I'm here.	2003-10-23 05:31:23 +00:00
Peter Wemm	19acc770c2	Pull the tier-2 card one last time and break the get/setcontext and sigreturn() ABI and the signal context on the stack. Make the trapframe (and its shadows in the ucontext and sigframe etc) 8 bytes larger in order to preserve 16 byte stack alignment for the following C code calls. I could have done some padding after the trapframe was saved, but some of the C code still expects an argument of 'struct trapframe'. Anyway, this gives me a spare field that can be used to store things like 'partial trapframe' status or something else in the future. The runtime impact is fairly small, except for threaded apps and things that decode contexts and the signal stack (eg: cvsup binary). Signal delivery isn't too badly affected because the kernel generates the sigframe that sigreturn uses after the handler has been called. The size of mcontext_t and struct sigframe hasn't changed. Only the last few fields (sc_eip etc) got moved a little and I eliminated a spare field. mc_len/sc_len did change location though so the sanity checks there will still trap it.	2003-10-15 02:04:52 +00:00
Bruce M Simpson	2bc7dd5661	Move pmap_resident_count() from the MD pmap.h to the MI pmap.h. Add a definition of pmap_wired_count(). Add a definition of vmspace_wired_count(). Reviewed by: truckman Discussed with: peter	2003-10-06 01:47:12 +00:00
Peter Wemm	6ccf265bb0	Commit Bosko's patch to clean up the PSE/PG_G initialization to and avoid problems with some Pentium 4 cpus and some older PPro/Pentium2 cpus. There are several problems, some documented in Intel errata. This patch: 1) moves the kernel to the second page in the PSE case. There is an errata that says that you Must Not point a 4MB page at physical address zero on older cpus. We avoided bugs here due to sheer luck. 2) sets up PSE page tables right from the start in locore, rather than trying to switch from 4K to 4M (or 2M) pages part way through the boot sequence at the same time that we're messing with PG_G. For some reason, the pmap work over the last 18 months seems to tickle the problems, and the PAE infrastructure changes disturb the cpu bugs even more. A couple of people have reported a problem with APM bios calls during boot. I'll work with people to get this resolved. Obtained from: bmilekic	2003-10-01 23:46:08 +00:00
Peter Wemm	a93020d7a1	Use __register_t instead of register_t, otherwise <sys/types.h> is a prerequisite for <ucontext.h> on amd64. Oops.	2003-10-01 01:08:04 +00:00
Peter Wemm	ec548f97fc	MFi386: BURN_BRIDGES around timer0 functions	2003-09-30 06:38:11 +00:00
Alan Cox	9060731130	Eliminate the pte object.	2003-09-27 20:53:01 +00:00
Peter Wemm	cc3112f108	Re-raise the default datasize and stacksize now that the 32 bit exec support can clip it to sensible values.	2003-09-25 01:11:17 +00:00
Peter Wemm	c460ac3a00	Add sysentvec->sv_fixlimits() hook so that we can catch cases on 64 bit systems where the data/stack/etc limits are too big for a 32 bit process. Move the 5 or so identical instances of ELF_RTLD_ADDR() into imgact_elf.c. Supply an ia32_fixlimits function. Export the clip/default values to sysctl under the compat.ia32 heirarchy. Have mmap(0, ...) respect the current p->p_limits[RLIMIT_DATA].rlim_max value rather than the sysctl tweakable variable. This allows mmap to place mappings at sensible locations when limits have been reduced. Have the imgact_elf.c ld-elf.so.1 placement algorithm use the same method as mmap(0, ...) now does. Note that we cannot remove all references to the sysctl tweakable maxdsiz etc variables because /etc/login.conf specifies a datasize of 'unlimited'. And that causes exec etc to fail since it can no longer find space to mmap things.	2003-09-25 01:10:26 +00:00
Yoshihiro Takahashi	33e38a2cc8	Implement the bus_space_map() function to allocate resources and initialize a bus_handle, but currently it does only initializing a bus_handle.	2003-09-23 08:22:34 +00:00
Peter Wemm	725bc17312	Oops. back out last commit. The data and stack limits are used by the 32 bit binary stuff. 32 bit binaries do not like it much when the kernel tries hard to put things above the 8GB mark. I have a work-in-progress to fix this properly, but I didn't want to burn anybody with this yet.	2003-09-23 03:20:34 +00:00
Peter Wemm	cd3402fa66	Sync with i386 version. The quality initialization was missing and some other junk.	2003-09-23 00:18:45 +00:00
Peter Wemm	4295ddf26f	MFi386 pci_bus.c 1.102 legacyvar.h 1.4: rename nexus_pcib to legacy_pcib However, leave legacy_pcib_route_interrupt() since there is no pcibios to call.	2003-09-23 00:03:44 +00:00
Peter Wemm	da87d7e10d	Move basemem variable into global scope so that the MP startup code can refer to it for looking for tables.	2003-09-22 23:33:29 +00:00
Peter Wemm	24789c549a	Increase the default data size limit from 512MB to 8GB. Increase default stack limit from 64MB to 512MB.	2003-09-22 23:21:39 +00:00
Peter Wemm	d79ddbf5de	MFi386 machdep.c rev 1.201, clock.c 1.201, clock.h 1.45 by phk: Dont initialize a TSC timecounter until we know if it is broke or not. XXX I think there is a bug in the i386 code here. init_TSC_tc() comes after: if (statclock_disable) return; ie: if you turn off the statclock interrupt, you dont get the TSC either.	2003-09-22 23:02:24 +00:00
Peter Wemm	13a27f2962	MFi386 rev 1.25 by jhb: add new MSR's and some missing older ones and APICBASE MSR constants.	2003-09-22 22:51:46 +00:00
Peter Wemm	d10e66f073	MFi386 rev 1.37: constant-friendly bswap macros	2003-09-22 22:37:49 +00:00
Peter Wemm	5bc82d1ce1	MFi386: pci_cfgreg.h rev 1.10 by jhb/des/njl. Fix CONF1_ENABLE_MSK.	2003-09-22 22:21:21 +00:00
Peter Wemm	8848ad863b	MFi386 by jhb: add acpi_SetDefaultIntrModel();	2003-09-22 22:12:46 +00:00
Peter Wemm	76caec589f	MFi386 by jhb: use symbolic constants for the IDT entries.	2003-09-22 22:09:02 +00:00
John Baldwin	a547af297d	Remove an XXX comment by using the per CPU mask added after this comment was added.	2003-09-10 01:36:48 +00:00
John Baldwin	f03cb48d41	Fix a typo.	2003-09-10 01:11:58 +00:00
Peter Wemm	292bbfd103	Argh. This file was completely out of sync with mcontext/trapframe.	2003-09-08 18:31:48 +00:00
Peter Wemm	7fe089a006	Hmm. Two copies of the mcontext...	2003-09-08 18:28:41 +00:00
Peter Wemm	c896a8adbf	Oops. sizeof(long) = 8, not 4. Get the fxsave buffer inside mcontext the right size. I'm planning on possibly stealing the two 'spare' variables on either side for botched alignment correction.	2003-09-05 20:47:27 +00:00
Alexander Kabaev	1d49585050	Standardize idempotentcy ifdefs. Consistently use _MACHINE_VARARGS_H_ symbol.	2003-09-01 03:01:45 +00:00
David E. O'Brien	a7b60ab26e	Fix copyright comment & FBSDID style nits. Requested by: bde	2003-08-25 09:48:48 +00:00
Marcel Moolenaar	26502503e5	Further cleanup <machine/cpu.h> and <machine/md_var.h>: move the MI prototypes of cpu_halt(), cpu_reset() and swi_vm() from md_var.h to cpu.h. This affects db_command.c and kern_shutdown.c. ia64: move all MD prototypes from cpu.h to md_var.h. This affects madt.c, interrupt.c and mp_machdep.c. Remove is_physical_memory(). It's not used (vm_machdep.c). alpha: the MD prototypes have been left in cpu.h with a comment that they should be there. Moving them is left for later. It was expected that the impact would be significant enough to be done in a seperate commit. powerpc: MD prototypes left in cpu.h. Comment added. Suggested by: bde Tested with: make universe (pc98 incomplete)	2003-08-16 16:57:57 +00:00
John Baldwin	594dfbc391	- Fix a duplicated typo. - Add a macro for the logical shift needed to extract an APIC ID from either from the local APIC ICR Hi register or the APIC ID registers of the local and IO APICs.	2003-08-15 15:23:13 +00:00
Warner Losh	06b4bf3e55	Expand inline the relevant parts of src/COPYRIGHT for Matt Dillon's copyrighted files. Approved by: Matt Dillon	2003-08-12 23:24:05 +00:00
Paul Saab	77c39e17fa	Halted CPU's should not accumulate time. Reviewed by: jhb	2003-08-12 17:01:10 +00:00
John Baldwin	3bdbd658f1	- Since td_critnest is now initialized in MI code, it doesn't have to be set in cpu_critical_fork_exit() anymore. - As far as I can tell, cpu_thread_link() has never been used, not even when it was originally added, so remove it.	2003-08-04 20:32:45 +00:00
Peter Wemm	59cc2230c6	Fix a dumbass mistake. I had the 'set' and 'get' reversed in the fpsetround/fpgetround macro pairs.	2003-08-02 00:26:30 +00:00
Peter Wemm	3950c40739	KSTACK_PAGES is a global option.	2003-07-31 01:27:18 +00:00
Maxime Henrion	d5afecd068	- Introduce a new busdma flag BUS_DMA_ZERO to request for zero'ed memory in bus_dmamem_alloc(). This is possible now that contigmalloc() supports the M_ZERO flag. - Remove the locking of Giant around calls to contigmalloc() since contigmalloc() now grabs Giant itself.	2003-07-27 13:52:10 +00:00
John Baldwin	e47d4f0fc2	Use macros from apic.h to when writing to the ICR to send IPIs to startup APs rather than magic numbers. Tested by: scottl	2003-07-23 19:04:28 +00:00
John Baldwin	55fb372edd	Add a new macro APIC_ICRLO_RESV_MASK that contains all of the reserved fields in the low 32 bits of the local APIC ICR register. Use this macro in place of APIC_RESV2_MASK when masking off existing bits from the ICR when writing to it to send an IPI. Tested by: scottl	2003-07-23 18:59:38 +00:00
Peter Wemm	5b9f8ddbbd	Go back to 64 bit precision for fadd/fsub/fsqrt etc. This is because on AMD64, gcc (and the ABI) expects the x87 unit to be running in 80/64 mode (not 64/53) so that it can use it for 'long double' operations. It takes the expected precision differences into account when generating code.	2003-07-22 06:50:34 +00:00
Peter Wemm	76537e43f5	Extend the machine/ieeefp.h that was inherited from i386 to support the SSE mxcsr register as well. Since gcc will intermix SSE2 and x87 FP code, the fpsetround() etc mode had better be the same. There are hooks to enable these inlines to be instantiated inside libc for non-gcc or C++ callers. (g++ doesn't like the inlines that tried to extract an integer and convert it to an enum).	2003-07-22 06:44:54 +00:00
Mark Murray	c7b132c974	Protect lint(1) from a #error.	2003-07-10 18:05:02 +00:00
Peter Wemm	e95babf3a8	unifdef -DLAZY_SWITCH and start to tidy up the associated glue.	2003-07-10 01:02:59 +00:00
Peter Wemm	bf8ca114e2	Fix the VADDR() macros to use either KVADDR() or UVADDR(), depending on the implied sign extension. The single unified VADDR() macro was not able to avoid sign extending the VM_MAXUSER_ADDRESS/USRSTACK values. Be explicit about UVADDR() (positive address space) and KVADDR() (kernel negative address space) to make mistakes show up more spectacularly. Increase user VM space from 1/2TB (512GB) to 128TB.	2003-07-09 23:04:23 +00:00
Scott Long	f6b1c44d1f	Mega busdma API commit. Add two new arguments to bus_dma_tag_create(): lockfunc and lockfuncarg. Lockfunc allows a driver to provide a function for managing its locking semantics while using busdma. At the moment, this is used for the asynchronous busdma_swi and callback mechanism. Two lockfunc implementations are provided: busdma_lock_mutex() performs standard mutex operations on the mutex that is specified from lockfuncarg. dftl_lock() is a panic implementation and is defaulted to when NULL, NULL are passed to bus_dma_tag_create(). The only time that NULL, NULL should ever be used is when the driver ensures that bus_dmamap_load() will not be deferred. Drivers that do not provide their own locking can pass busdma_lock_mutex,&Giant args in order to preserve the former behaviour. sparc64 and powerpc do not provide real busdma_swi functions, so this is largely a noop on those platforms. The busdma_swi on is64 is not properly locked yet, so warnings will be emitted on this platform when busdma callback deferrals happen. If anyone gets panics or warnings from dflt_lock() being called, please let me know right away. Reviewed by: tmm, gibbs	2003-07-01 15:52:06 +00:00
Jeff Roberson	ab875ef896	- Construct a cpu topology map for Hyper Threading systems so that ULE may take advantage of them.	2003-06-28 22:07:42 +00:00
Hidetoshi Shimokawa	e07324646e	Move KERNBASE to -2GB. Currently, we cannot increase KVA more than 2GB.	2003-06-22 13:02:45 +00:00
Alan Cox	49a2507bd1	Migrate the thread stack management functions from the machine-dependent to the machine-independent parts of the VM. At the same time, this introduces vm object locking for the non-i386 platforms. Two details: 1. KSTACK_GUARD has been removed in favor of KSTACK_GUARD_PAGES. The different machine-dependent implementations used various combinations of KSTACK_GUARD and KSTACK_GUARD_PAGES. To disable guard page, set KSTACK_GUARD_PAGES to 0. 2. Remove the (unnecessary) clearing of PG_ZERO in vm_thread_new. In 5.x, (but not 4.x,) PG_ZERO can only be set if VM_ALLOC_ZERO is passed to vm_page_alloc() or vm_page_grab().	2003-06-14 23:23:55 +00:00
John Baldwin	e59ae32f18	- Document the thermal and performance counter LVT entries in the local APIC. - Add a lvt_thermal member to the LAPIC struct. - Add constants for the SMI and INIT LVT delivery modes.	2003-06-06 17:22:15 +00:00
Peter Wemm	7fc03ef474	Fix ALIGNED_POINTER(). sizeof((u_int32_t)) is not legal C.	2003-06-04 02:15:13 +00:00
Peter Wemm	c35518b4ed	Make this more compatable with libc_r. Make the internal types for storing registers an array of longs rather than int.	2003-06-02 21:49:35 +00:00
David E. O'Brien	9676a785e7	Use __FBSDID().	2003-06-02 06:43:15 +00:00
Peter Wemm	193b147c05	MFi386: i386/include/asm.h rev 1.11: Do not abuse ##.	2003-06-02 05:59:35 +00:00
David E. O'Brien	69bb404192	Use C99 compatable asm statements.	2003-06-02 00:29:35 +00:00
Peter Wemm	cc71eb5e10	With the help of jhb, fix the ACPI_ACQUIRE_GLOBAL_LOCK() macros and port to amd64 after repocopy. Approved by: re (amd64/*)	2003-05-31 06:43:55 +00:00
Hiten Pandya	b77c32a07e	Rename BUS_DMAMEM_NOSYNC to BUS_DMA_COHERENT. The current name is confusing, because it indicates to the client that a bus_dmamap_sync() operation is not necessary when the flag is specified, which is wrong. The main purpose of this flag is to hint the underlying architecture that DMA memory should be mapped in a coherent way, but the architecture can ignore it. But if the architecture does supports coherent mapping of memory, then it makes bus_dmamap_sync() calls cheap. This flag is the same as the one in NetBSD's Bus DMA. Reviewed by: gibbs, scottl, des (implicitly) Approved by: re@ (jhb)	2003-05-30 20:40:33 +00:00
Peter Wemm	5feb2148ba	Initial port to amd64 after repocopy from i386. Note that the disassembler has not been updated yet, and will do some very strange things. It does tracebacks (without function arguments due to regparm calling conventions) if -fno-omit-frame-pointer is used (to come later). This achieves basic functionality. Approved by: re (amd64/* blanket)	2003-05-30 01:01:07 +00:00
Scott Long	7e71df9339	Bring back bus_dmasync_op_t. It is now a typedef to an int, though the BUS_DMASYNC_ definitions remain as before. The does not change the ABI, and reverts the API to be a bit more compatible and flexible. This has survived a full 'make universe'. Approved by: re (bmah)	2003-05-27 04:59:59 +00:00
Scott Long	c87d464f28	De-orbit bus_dmamem_alloc_size(). It's a hack and was never used anyways. No need for it to pollute the 5.x API any further. Approved by: re (bmah)	2003-05-26 04:00:52 +00:00
Peter Wemm	3ebd9b48ce	Stop profiled libc from exploding, matching gcc's generated code. Approved by: re (amd64/* blanket)	2003-05-24 18:24:03 +00:00
Peter Wemm	d9cd1af4aa	Typo fix. oops. Submitted by: jmallett Approved by: re (blanket amd64/*)	2003-05-23 06:36:46 +00:00
Peter Wemm	cbd667fa2f	Update comments. Note that the kernel is at -1GB, not -2GB as erroniously implied by the previous commit. KVM is still only 1GB until pmap_growkernel() learns about the extra page table level. Approved by: re (blanket)	2003-05-23 06:35:45 +00:00
Peter Wemm	f229f5cf85	As suggested by the gdb folks, pad the 'struct fpreg' to a full 512 bytes to match the native fxsave/fxrstor object size since thats apparently what the Linux/NetBSD folks do.	2003-05-23 06:31:56 +00:00
Peter Wemm	3c9a3c9ca3	Major pmap rework to take advantage of the larger address space on amd64 systems. Of note: - Implement a direct mapped region using 2MB pages. This eliminates the need for temporary mappings when getting ptes. This supports up to 512GB of physical memory for now. This should be enough for a while. - Implement a 4-tier page table system. Most of the infrastructure is there for 128TB of userland virtual address space, but only 512GB is presently enabled due to a mystery bug somewhere. The design of this was heavily inspired by the alpha pmap.c. - The kernel is moved into the negative address space(!). - The kernel has 2GB of KVM available. - Provide a uma memory allocator to use the direct map region to take advantage of the 2MB TLBs. - Fixed some assumptions in the bus_space macros about the ability to fit virtual addresses in an 'int'. Notable missing things: - pmap_growkernel() should be able to grow to 512GB of KVM by expanding downwards below kernbase. The kernel must be at the top 2GB of the negative address space because of gcc code generation strategies. - need to fix the >512GB user vm code. Approved by: re (blanket)	2003-05-23 05:04:54 +00:00
Alexander Kabaev	980ded9a7d	sys/sys/limits.h: - Fix visibilty test for LONG_BIT and WORD_BIT. `#if defined(__FOO_VISIBLE)' is alays wrong because __FOO_VISIBLE is always defined (to 0 for invisibility). sys/<arch>/include/limits.h sys/<arch>/include/_limits.h: - Style fixes. Submitted by: bde Reviewed by: bsdmike Approved by: re (scottl)	2003-05-19 20:29:07 +00:00
Alan Cox	4a0d6dfd2c	Initialize logical_cpus_mask when the logical CPUs are enumerated in the mptable. (Previously, logical_cpus_mask was only initialized if the hyperthreading fixup was executed.) Approved by: re (jhb) Reviewed by: ps	2003-05-15 05:12:24 +00:00
Peter Wemm	c0a54ff621	Collect the nastiness for preserving the kernel MSR_GSBASE around the load_gs() calls into a single place that is less likely to go wrong. Eliminate the per-process context switching of MSR_GSBASE, because it should be constant for a single cpu. Instead, save/restore it during the loading of the new %gs selector for the new process. Approved by: re (amd64/* blanket)	2003-05-15 00:23:40 +00:00
Peter Wemm	be52ef1399	Use compile time constants for things like PTmap[] etc because they're about to move outside of the +/- 2GB range Suggested by: jake Approved by: re (amd64/* blanket)	2003-05-15 00:20:17 +00:00
Peter Wemm	d85631c4ac	Add BASIC i386 binary support for the amd64 kernel. This is largely stolen from the ia64/ia32 code (indeed there was a repocopy), but I've redone the MD parts and added and fixed a few essential syscalls. It is sufficient to run i386 binaries like /bin/ls, /usr/bin/id (dynamic) and p4. The ia64 code has not implemented signal delivery, so I had to do that. Before you say it, yes, this does need to go in a common place. But we're in a freeze at the moment and I didn't want to risk breaking ia64. I will sort this out after the freeze so that the common code is in a common place. On the AMD64 side, this required adding segment selector context switch support and some other support infrastructure. The %fs/%gs etc code is hairy because loading %gs will clobber the kernel's current MSR_GSBASE setting. The segment selectors are not used by the kernel, so they're only changed at context switch time or when changing modes. This still needs to be optimized. Approved by: re (amd64/* blanket)	2003-05-14 04:10:49 +00:00
Peter Wemm	0fe93e7480	For the page fault handler, save %cr2 in the outer trap handler so that we do not have to run so long with interrupts disabled. This involved creating tf_addr in the trapframe. Reorganize the trap stubs so that they consistently reserve the stack space and initialize any missing bits. Approved by: re (amd64 stuff)	2003-05-12 18:33:19 +00:00
Peter Wemm	0f6241620b	Sync ucontext with reality. The struct trapframe changes need to be reflected here. Approved by: re (blanket amd64/*)	2003-05-12 18:23:04 +00:00
Peter Wemm	e9b193dc33	AMD64 physical space is much larger than i386, de-i386 the bus_space and bus_dma MD code for AMD64. (And a trivial ifdef update in dev/kbd because of this). More updates are needed here to take advantage of the 64 bit instructions. Approved by: re (blanket amd64/*)	2003-05-12 02:44:37 +00:00
Peter Wemm	bf1e897425	Give a %fs and %gs to userland. Use swapgs to obtain the kernel %GS.base value on entry and exit. This isn't as easy as it sounds because when we recursively trap or interrupt, we have to avoid duplicating the swapgs instruction or we end up back with the userland %gs. I implemented this by testing TF_CS to see if we're coming from supervisor mode already, and check for returning to supervisor. To avoid a race with interrupts in the brief period after beginning executing the handler and before the swapgs, convert all trap gates to interrupt gates, and reenable interrupts immediately after the swapgs. I am not happy with this. There are other possible ways to do this that should be investigated. (eg: storing the GS.base MSR value in the trapframe) Add some sysarch functions to let the userland code get to this. Approved by: re (blanket amd64/*)	2003-05-12 02:37:29 +00:00
Peter Wemm	eeee69d45c	Make atdevbase long for the KERNBASE > 4GB case Approved by: re (amd64/* blanket)	2003-05-11 22:53:43 +00:00
Peter Wemm	0fe0f2515b	Provide a fake varargs implementation for lint's benefit. This way it can see the intent of the va_* macros, even though it cannot work. Approved by: re (blanket amd64/*)	2003-05-10 00:55:15 +00:00
Peter Wemm	e1ef71de2b	Remove _ARCH_INDIRECT ifdefs. They existed for lib/msun/* on i386, which could use different versions of the math code depending on whether there was real floating point hardware or math emulation. Since the fpu is part of the core specification on amd64, there is no need for this here. Approved by: re (blanket amd64/*)	2003-05-10 00:53:34 +00:00
Peter Wemm	2e4f687a1d	bcopyb() isn't used on amd64 kernel (it only exists for i386/pcvt) Approved by: re (blanket amd64/*)	2003-05-10 00:51:29 +00:00
Peter Wemm	395e65aa29	Include the MXCSR initial values, based on the AMD docs. This file should really be renamed to fpu.h and npx.c to fpu.c since its part of the core architecture on amd64 systems, not an isa 'numeric processor extension'.	2003-05-09 18:28:05 +00:00
Alexander Kabaev	0eda4c08a5	Style fixes. Remove DBL_DIG, DBL_MIN, DBL_MAX and their FLT_ counterparts, they were marked for deprecation ever since SUSv1 at least. Only define ULLONG_MIN/MAX and LLONG_MAX if long long type is supported. Restore a lost comment in MI _limits.h file and remove it from sys/limits.h where it does not belong.	2003-05-04 22:13:04 +00:00
Peter Wemm	7f47668191	Slight reorg and added AMD64 support. A couple of the MODINFOMD_* values that were added to sparc64 and later powerpc, really should have been in the MI area. But changing that now with insufficient preperation will just cause too much pain. Move MD_FETCH() to the MI sys/linker.h file to avoid another two copies of it.	2003-05-01 03:31:18 +00:00
Peter Wemm	afa8862328	Commit MD parts of a loosely functional AMD64 port. This is based on a heavily stripped down FreeBSD/i386 (brutally stripped down actually) to attempt to get a stable base to start from. There is a lot missing still. Worth noting: - The kernel runs at 1GB in order to cheat with the pmap code. pmap uses a variation of the PAE code in order to avoid having to worry about 4 levels of page tables yet. - It boots in 64 bit "long mode" with a tiny trampoline embedded in the i386 loader. This simplifies locore.s greatly. - There are still quite a few fragments of i386-specific code that have not been translated yet, and some that I cheated and wrote dumb C versions of (bcopy etc). - It has both int 0x80 for syscalls (but using registers for argument passing, as is native on the amd64 ABI), and the 'syscall' instruction for syscalls. int 0x80 preserves all registers, 'syscall' does not. - I have tried to minimize looking at the NetBSD code, except in a couple of places (eg: to find which register they use to replace the trashed %rcx register in the syscall instruction). As a result, there is not a lot of similarity. I did look at NetBSD a few times while debugging to get some ideas about what I might have done wrong in my first attempt.	2003-05-01 01:05:25 +00:00
Peter Wemm	1e57e9eba3	Repocopy from x86_64/... to amd64/... Rename visible x86_64 references to amd64. Kill MID_MACHINE, its a.out specific, the only platform that supports it is i386. All of the other platforms should remove it too.	2003-04-30 22:51:59 +00:00
Alexander Kabaev	104a9b7e3e	Deprecate machine/limits.h in favor of new sys/limits.h. Change all in-tree consumers to include <sys/limits.h> Discussed on: standards@ Partially submitted by: Craig Rodrigues <rodrigc@attbi.com>	2003-04-29 13:36:06 +00:00
Jake Burkholder	14ce5bd49b	Use inlines for loading and storing page table entries. Use cmpxchg8b for the PAE case to ensure idempotent 64 bit loads and stores. Sponsored by: DARPA, Network Associates Laboratories	2003-04-28 20:35:36 +00:00
Alexander Kabaev	6fd839f9c7	Add a new sys/limits.h file which in turn depends on machine/_limits.h to get actual constant values. This is in preparation for machine/limits.h retirement. Discussed on: standards@ Submitted by: Craig Rodrigues <rodrigc@attbi.com> (*) Modified by: kan	2003-04-23 21:41:59 +00:00
David Xu	d1fc2022c3	Backout my last commit. Requested by: bde	2003-04-20 01:35:21 +00:00
David Xu	2bdf11638e	Don't return garbage in high 16 bits.	2003-04-19 02:40:39 +00:00
Maxime Henrion	141bacb048	Change the operation parameter of bus_dmamap_sync() from an enum to an int and redefine the BUS_DMASYNC_* constants as flags. This allows us to specify several operations in one call to bus_dmamap_sync() as in NetBSD.	2003-04-10 23:03:33 +00:00
Jake Burkholder	ac00210525	Remove invalid cast to vm_offset_t to avoid truncating a physical address when doing pmap_kextract on a 2MB page. Spotted by: peter Sponsored by: DARPA, Network Associates Laboratories	2003-04-08 18:22:41 +00:00
Jake Burkholder	46ea68dd10	Better fix for previous previous which still allows the 4megs of kva at the top of the address space to be reclaimed. The problem is that with the APTD gone the mapable kernel address space runs right to the end of the 32 bit address space. As a max this is 0x100000000, which can't be represented in 32 bits, so we have to use ptd entry n-1 and pte offset n-1, instead of ptd entry n and pte offset 0. There's still 1 page we can't use, but we gain just under 4 megs of kva (8 megs with PAE). Sponsored by: DARPA, Network Associates Laboratories	2003-04-07 14:27:19 +00:00
Dag-Erling Smørgrav	9f45b2da8f	Define ovbcopy() as a macro which expands to the equivalent bcopy() call, to take care of the KAME IPv6 code which needs ovbcopy() because NetBSD's bcopy() doesn't handle overlap like ours. Remove all implementations of ovbcopy(). Previously, bzero was a function pointer on i386, to save a jmp to bzero_vector. Get rid of this microoptimization as it only confuses things, adds machine-dependent code to an MD header, and doesn't really save all that much. This commit does not add my pagezero() / pagecopy() code.	2003-04-04 17:29:55 +00:00
Jake Burkholder	d1d03c2b72	Bandaid fix for previous commit while I figure out why it broke. This caused crashes early in boot on i386 UP machines. Reported by: phk Pointy hat to: jake	2003-04-04 10:09:44 +00:00
Jake Burkholder	163529c2b3	- Removed APTD and associated macros, it is no longer used. BANG BANG BANG etc. Sponsored by: DARPA, Network Associates Laboratories	2003-04-03 23:44:35 +00:00
Peter Wemm	cc66ebe2a9	Commit a partial lazy thread switch mechanism for i386. it isn't as lazy as it could be and can do with some more cleanup. Currently its under options LAZY_SWITCH. What this does is avoid %cr3 reloads for short context switches that do not involve another user process. ie: we can take an interrupt, switch to a kthread and return to the user without explicitly flushing the tlb. However, this isn't as exciting as it could be, the interrupt overhead is still high and too much blocks on Giant still. There are some debug sysctls, for stats and for an on/off switch. The main problem with doing this has been "what if the process that you're running on exits while we're borrowing its address space?" - in this case we use an IPI to give it a kick when we're about to reclaim the pmap. Its not compiled in unless you add the LAZY_SWITCH option. I want to fix a few more things and get some more feedback before turning it on by default. This is NOT a replacement for Bosko's lazy interrupt stuff. This was more meant for the kthread case, while his was for interrupts. Mine helps a little for interrupts, but his helps a lot more. The stats are enabled with options SWTCH_OPTIM_STATS - this has been a pseudo-option for years, I just added a bunch of stuff to it. One non-trivial change was to select a new thread before calling cpu_switch() in the first place. This allows us to catch the silly case of doing a cpu_switch() to the current process. This happens uncomfortably often. This simplifies a bit of the asm code in cpu_switch (no longer have to call choosethread() in the middle). This has been implemented on i386 and (thanks to jake) sparc64. The others will come soon. This is actually seperate to the lazy switch stuff. Glanced at by: jake, jhb	2003-04-02 23:53:30 +00:00
Jake Burkholder	7ab9b220d9	- Add support for PAE and more than 4 gigs of ram on x86, dependent on the kernel opition 'options PAE'. This will only work with device drivers which either use busdma, or are able to handle 64 bit physical addresses. Thanks to Lanny Baron from FreeBSD Systems for the loan of a test machine with 6 gigs of ram. Sponsored by: DARPA, Network Associates Laboratories, FreeBSD Systems	2003-03-30 05:24:52 +00:00
Jake Burkholder	de54353fb8	- Remove invalid casts. Sponsored by: DARPA, Network Associates Laboratories	2003-03-30 01:44:16 +00:00
Jake Burkholder	aea57872f0	- Convert all uses of pmap_pte and get_ptbase to pmap_pte_quick. When accessing an alternate address space this causes 1 page table page at a time to be mapped in, rather than using the recursive mapping technique to map in an entire alternate address space. The recursive mapping technique changes large portions of the address space and requires global tlb flushes, which seem to cause problems when PAE is enabled. This will also allow IPIs to be avoided when mapping in new page table pages using the same technique as is used for pmap_copy_page and pmap_zero_page. Sponsored by: DARPA, Network Associates Laboratories	2003-03-30 01:16:19 +00:00
Paul Saab	87437b0b89	Nuke options HTT infavor of machdep.hlt_logical_cpus tunable/sysctl. This keeps the logical cpu's halted in the idle loop. By default the logical cpu's are halted at startup. It is also possible to halt any cpu in the idle loop now using machdep.hlt_cpus. Examples of how to use this: machdep.hlt_cpus=1 halt cpu0 machdep.hlt_cpus=2 halt cpu1 machdep.hlt_cpus=4 halt cpu2 machdep.hlt_cpus=3 halt cpu0,cpu1 Reviewed by: jhb, peter	2003-03-26 19:49:34 +00:00
Jake Burkholder	227f9a1c58	- Add vm_paddr_t, a physical address type. This is required for systems where physical addresses larger than virtual addresses, such as i386s with PAE. - Use this to represent physical addresses in the MI vm system and in the i386 pmap code. This also changes the paddr parameter to d_mmap_t. - Fix printf formats to handle physical addresses >4G in the i386 memory detection code, and due to kvtop returning vm_paddr_t instead of u_long. Note that this is a name change only; vm_paddr_t is still the same as vm_offset_t on all currently supported platforms. Sponsored by: DARPA, Network Associates Laboratories Discussed with: re, phk (cdevsw change)	2003-03-25 00:07:06 +00:00
John Baldwin	a480c73063	Expand the APIC ID mask field of the ICR register to 8 bits intead of just 4 bits. This reportedly fixes booting on the SW7500CW2. Much thanks to the submitter for tracking this down! Submitted by: Brian Buchanan <brian@ncircle.com> Reviewed by: peter MFC after: 3 days	2003-03-17 19:14:13 +00:00
Jake Burkholder	5501d40bb9	Made the prototypes for pmap_kenter and pmap_kremove MD. These functions are machine dependent because they are not required to update the tlb when mappings are added or removed, and doing so is machine dependent. In addition, an implementation may require that pages mapped with pmap_kenter have a backing vm_page_t, which is not necessarily true of all physical pages, and so may choose to pass the vm_page_t to pmap_kenter instead of the physical address in order to make this requirement clear.	2003-03-16 04:16:03 +00:00
Jake Burkholder	2089c5c95e	Use bus_space_handle_t to represent host port and virtual addresses; bus_addr_t may not be appropriate. Sponsored by: DARPA, Network Associates Laboratories	2003-03-11 19:43:38 +00:00
John Baldwin	d7a715dc64	Wrap the hyperthreading support code with the HTT kernel option. Hyperthreading support is now off unless the HTT option is added. MFC-after: 3 days	2003-03-04 20:24:53 +00:00
Alan Cox	8480cd45a5	Remove some long unused declarations. (For example, the PV flags have not been used since revision 1.8, roughly nine years ago.)	2003-02-27 20:13:20 +00:00
Maxime Henrion	f6c912dd0c	Correctly set BUS_SPACE_MAXSIZE in all the busdma backends. It was bogusly set to 64 * 1024 or 128 * 1024 because it was bogusly reused in the BUS_DMAMAP_NSEGS definition.	2003-02-26 02:16:06 +00:00
Jake Burkholder	0f1a7e05a2	- Added inlines pmap_is_current, pmap_is_alternate and pmap_set_alternate for testing and setting the current and alternate address spaces. - Changed PTDpde and APTDpde to arrays to support multiple page directory pages. ponsored by: DARPA, Network Associates Laboratories	2003-02-25 19:40:21 +00:00
Jake Burkholder	5cd612b27e	- Removed UMAXPTDI and UMAXPTEOFF. - Changed VM_MAXUSER_ADDRESS to be defined in terms of PTDPTDI. In order for assumptions about the recursive page table map to work it must be the base of the recursive map. Any pte offset that's not NPTEPG will break these assumptions. Sponsored by: DARPA, Network Associates Laboratories	2003-02-24 20:29:52 +00:00
Yoshihiro Takahashi	2302407b21	The mpbiosreason variable does not used for pc98.	2003-02-24 14:36:03 +00:00
Tor Egge	07d2563244	Allow machines with one CPU and a valid mp table to boot an SMP kernel.	2003-02-23 23:49:57 +00:00
Jake Burkholder	ef49a94104	Previous commit missed a 1 that should be NGPTD, and an NPDEPG that should be NPDEPTD. Grumble. Sponsored by: DARPA, Network Associates Laboratories	2003-02-23 22:12:08 +00:00
Jake Burkholder	910548dea7	- Added macros NPGPTD, NBPTD, and NPDEPTD, for dealing with the size of the page directory. - Use these instead of the magic constants 1 or PAGE_SIZE where appropriate. There are still numerous assumptions that the page directory is exactly 1 page. Sponsored by: DARPA, Network Associates Laboratories	2003-02-23 21:20:00 +00:00

... 4 5 6 7 8 ...

1450 Commits