freebsd-dev

Author	SHA1	Message	Date
Bruce Evans	81bbee5996	Fixed a pedantic syntax error (a stray semicolon at the end of PCPU_MD_FIELDS).	2003-11-17 03:40:41 +00:00
Alan Cox	0ec3db3072	- Remove unnecessary synchronization from sf_buf_init(). (There is only one active CPU when sf_buf_init() is performed.)	2003-11-16 23:40:06 +00:00
Alan Cox	e45db9b837	- Modify alpha's sf_buf implementation to use the direct virtual-to- physical mapping. - Move the sf_buf API to its own header file; make struct sf_buf's definition machine dependent. In this commit, we remove an unnecessary field from struct sf_buf on the alpha, amd64, and ia64. Ultimately, we may eliminate struct sf_buf on those architecures except as an opaque pointer that references a vm page.	2003-11-16 06:11:26 +00:00
Bruce Evans	416ab90e6b	Localized the cy driver's locking.	2003-11-16 00:55:54 +00:00
Nate Lawson	b72e9cf526	Add the pc_acpi_id PCPU member. The new acpi_cpu driver uses this to dereference the softc.	2003-11-15 18:58:29 +00:00
Peter Wemm	1f6c75db0b	Preemptively burn a bridges. The isa timer code is likely to be replaced by the HPET timer at some point, so dont even make a release with the aquire/release_timer0 functions.	2003-11-14 22:34:43 +00:00
Peter Wemm	5333eee414	Minor source sync with amd64. Use int as the type for the width field of %.*s rather than size_t.	2003-11-14 22:29:21 +00:00
Peter Wemm	ad641f0fd5	Minor source sync with amd64. For %.s printf formats, pass in an int rather than a size_t. cast the ioapicaddress variable via uintptr_t before going to void .	2003-11-14 22:26:29 +00:00
Peter Wemm	7b66b81ee4	Convert a couple of pointers to integers for source compatability with amd64.	2003-11-14 22:23:30 +00:00
Peter Wemm	40e3826a9f	Whitespace nit (sorry, couldn't help it)	2003-11-14 22:21:30 +00:00
John Baldwin	64bb257f0b	Always install IDT entries for ATPIC interrupt sources. The APIC no longer uses these interrupt vectors for its ISA interrupt pins, so these entries will not be overwritten. If we get a spurious interrupt from the ATPIC when using the APIC, it will be treated as a stray interrupt instead of causing a panic.	2003-11-14 21:02:49 +00:00
John Baldwin	43d63d12fa	If an interrupt source doesn't have an ithread, treat it as a stray interrupt. This can only happen if an unregistered interrupt source triggers an interrupt.	2003-11-14 21:00:32 +00:00
Peter Wemm	db049820e1	basemem is in K, not bytes. I think I tricked jhb into making the same mistake I did and then committing it to cvs.	2003-11-14 20:51:07 +00:00
Peter Wemm	2edfe38b10	"opt_auto_eoi.h" is not used here anymore. See atpic.c.	2003-11-14 20:06:24 +00:00
John Baldwin	f082493f10	Replace magic numbers with macros for i8259A register constants. Still need the ICW4 bits for PC98 though.	2003-11-14 19:13:06 +00:00
John Baldwin	3ab2ba59f4	Shuffle the APIC interrupt vectors around a bit: - Move the IPI and local APIC interrupt vectors up into the 0xf0 - 0xff range. The pmap lazyfix IPI was reordered down next to the TLB shootdowns to avoid conflicting with the spurious interrupt vector. - Move the base of APIC interrupts up 16 so that the first 16 APIC interrupts do not overlap the vectors used by the ATPIC. - Remove bogus interrupt vector reservations for LINT[01]. - Now that 0xc0 - 0xef are available, use them for device interrupts. This increases the number of APIC device interrupts to 191. - Increase the system-wide number of global interrupts to 191 to catch up to more APIC interrupts. Requested by: peter (2)	2003-11-14 19:10:13 +00:00
Peter Wemm	3ef6155b2e	Fix up the control word 3 bits. jhb discovered how much I screwed this up. :-]	2003-11-14 18:20:20 +00:00
John Baldwin	dced4b0c31	Whitespace.	2003-11-13 18:16:37 +00:00
John Baldwin	69487322d8	Fix a typo.	2003-11-13 16:41:07 +00:00
Peter Wemm	e46fe241d3	Stop pretending to support kernel profiling. The FAKE_MCOUNT() etc calls are just gradually getting more and more stale. At this point it would be better to start from scratch once prof_machdep.c is adapted.	2003-11-13 02:38:33 +00:00
John Baldwin	bd9cd7e3f7	- Move manipulation of td_intr_nesting_level out of assembly interrupt vector stubs and into the C functions they call. - Move disabling and EOIing of interrupt sources out of PIC driver entry points and into intr_execute_handlers(). Intr_execute_handlers() only disables a source for an interrupt if it is a stray interrupt or has threaded handlers. Sources with fast handlers no longer disable (mask) the source while executing the handlers. - Move the setting of clkintr_pending into intr_execute_handlers() and set the variable for any interrupt source with a vector of 0. (Should only be true for IRQ 0.) This fixes clkintr_pending in the NO_MIXED_MODE case. - Implement lapic_eoi() and use it to implement ioapic_eoi_source(). - Rename atpic_sched_ithd() to atpic_handle_intr() since it is used to handle all atpic interrupts and not just threaded ones. Inspired by: peter's changes to amd64 in p4 (1) Requested by: bde (2)	2003-11-12 18:13:57 +00:00
Peter Wemm	b79c9c6c58	Cosmetic sync with i386	2003-11-12 01:49:49 +00:00
John Baldwin	fd7d14d30b	Don't probe busses in the MP Table for the MP Table PCI bridge drivers if the bus number doesn't correspond to a PCI bus in the MP Table. Reported by: jhay	2003-11-11 21:19:43 +00:00
John Baldwin	d3c01334ce	Some motherboards like to remap the SCI (normally IRQ 9) up to a PCI interrupt such as IRQ 22 or 19. However, the ACPI BIOS still routes interrupts from some PCI devices to the same intpin calling the pin IRQ 22. Thus, ACPI expects to address a single interrupt source via two different names. To work around this, if the SCI is remapped to a non-ISA interrupt (i.e., greater than 15), then we use acpi_OverrideInterruptLevel() function to tell ACPI to use IRQ 22 or 19 rather than IRQ 9 for the SCI. Previously we would change IRQ 22 or 19's name to IRQ 9 when we encountered such an Interrupt Source Override entry in the MADT which routed the SCI properly but left PCI devices mapped to IRQ 22 or 19 w/o a routable interrupt. Tested by: sos	2003-11-11 18:20:10 +00:00
John Baldwin	da17811e64	Enable HTT CPUs by default instead of halting them by default. Users should now only have HTT CPUs if they have explicitly asked for them either by enabling HyperThreading in the BIOS or by using the MPTABLE_FORCE_HTT kernel option.	2003-11-11 17:16:15 +00:00
John Baldwin	f9dbba5c4e	Disable probing of HTT CPUs by default for the MP Table case. HTT CPUs should only be used if they are enabled in the BIOS. Now that we support enumerating CPUs using the ACPI MADT, any HTT machine using ACPI should respect the BIOS setting. For HTT machines with ACPI disabled in the kernel, the MPTABLE_FORCE_HTT kernel option can be used to try to probe HTT CPUs like have done in the past for the MP Table case. This option should only be enabled if HTT is enabled in the BIOS.	2003-11-11 17:14:26 +00:00
John Baldwin	4e478d4521	MFamd64 (via P4, not in CVS yet): - Use the static boot_address variable directly rather than passing it around to several functions. - Clean up a couple of magic numbers.	2003-11-10 21:24:34 +00:00
John Baldwin	95020215db	Bump APIC ID limits up to 32 since a machine with 16 CPUs will have APIC IDs for the I/O APICs that are greater than 16. Reported by: John Cagle <john.cagle@hp.com>	2003-11-10 19:52:58 +00:00
Marcel Moolenaar	fcaa2925a9	Change the clear_ret argument of get_mcontext() to be a flags argument. Since all callers either passed 0 or 1 for clear_ret, define bit 0 in the flags for use as clear_ret. Reserve bits 1, 2 and 3 for use by MI code for possible (but unlikely) future use. The remaining bits are for use by MD code. This change is triggered by a need on ia64 to have another knob for get_mcontext().	2003-11-09 20:31:04 +00:00
Peter Wemm	4cd2d525e3	Move a MD 32 bit binary support routine into the MD areas. exec_setregs is highly MD in an emulation environment since it operates on the host environment. Although the setregs functions are really for exec support rather than signals, they deal with the same sorts of context and include files. So I put it there rather than create yet another file.	2003-11-08 07:43:44 +00:00
Peter Wemm	fcfe57d640	Update the graffiti.	2003-11-08 04:39:22 +00:00
Peter Wemm	398dbb11d8	Switch from having a fpu "device" to something that is more like the integrated part of the cpu core that it is.	2003-11-08 04:37:54 +00:00
Peter Wemm	bf2f09ee97	The great s/npx/fpu/gi	2003-11-08 03:33:38 +00:00
Peter Wemm	a4d60a7fb3	Converge with i386/GENERIC	2003-11-08 03:17:36 +00:00
Peter Wemm	8b2454d833	Rename npx* to fpu*. I haven't done the flags/function names yet.	2003-11-08 02:39:46 +00:00
Peter Wemm	7538a488f5	There isn't much point printing 'npx0: INT 16 interface' because that is the only way it works here.	2003-11-08 00:13:43 +00:00
John Baldwin	88861af1fb	Dump the trigger and polarity of each intpin's default setting in the bootverbose output.	2003-11-07 23:44:35 +00:00
Scott Long	eb3b7bf69f	Document the lockfunc and lockfuncarg arguments to bus_dma_tag_create() in the busdma headers.	2003-11-07 23:29:42 +00:00
John Baldwin	8dec768242	Only disable the old pin when doing a remap if it's current vector is still the old vector. Reported by: sam	2003-11-06 14:47:53 +00:00
Peter Wemm	6350f49c4a	OK, this might be a bit silly, but add another popcnt() candidate.	2003-11-06 01:24:25 +00:00
John Baldwin	f84d8b318a	When remapping an ISA interrupt from one intpin to another, disable the pin that is used by the default identity mapping if it still maps to the old vector. The ACPI case might need some tweaking for the SCI interrupt case since ACPI likes to address the intpin using both the IRQ remapped to it as well as the previous existing PCI IRQ mapped to it. Reported by: kan	2003-11-05 23:15:52 +00:00
John Baldwin	240cfc80b3	Two style nits.	2003-11-05 23:07:39 +00:00
John Baldwin	be11140dfb	- Adjust some of the bitfields in the ioapic_intsrc struct to be unsigned rather than signed. This fixes some cosmetics such as verbose printf's for IRQs greater than 127. - The calculation for next_ioapic_base was also adjusted so that it will only complain once for each hole in the IRQs provided by ACPI for IO APICs. Reported by: Michal Mertl <mime@traveller.cz>	2003-11-05 16:18:06 +00:00
John Baldwin	fc0d431d4b	Add a workaround for MP Tables that list the same PCI IRQ twice with the same APIC / pin destination in both cases. Reported by: Pawel Jakub Dawidek <nick@garage.freebsd.pl>	2003-11-05 16:14:10 +00:00
John Baldwin	7542a92afa	Tweak the version string output for ioapic devices.	2003-11-04 19:22:20 +00:00
Yoshihiro Takahashi	0ca1bf3907	Fix to support pc98.	2003-11-04 13:13:04 +00:00
Yoshihiro Takahashi	95755cc99b	Split pc98 support into pc98/pc98/nmi.c.	2003-11-04 13:01:41 +00:00
Peter Wemm	93c3f67fe7	Make this compile with PAE.	2003-11-04 01:07:04 +00:00
John Baldwin	147ad8d5ad	New i386 SMP code: - The MP code no longer knows anything specific about an MP Table. Instead, the local APIC code adds CPUs via the cpu_add() function when a local APIC is enumerated by an APIC enumerator. - Don't divide the argument to mp_bootaddress() by 1024 just so that we can turn around and mulitply it by 1024 again. - We no longer panic if SMP is enabled but we are booted on a UP machine. - init_secondary(), the asm code between init_secondary() and ap_init() in mpboot.s and ap_init() have all been merged together in C into init_secondary(). - We now use the cpuid feature bits to determine if we should enable PSE, PGE, or VME on each AP. - Due to the change in the implementation of critical sections, acquire the SMP TLB mutex around a slightly larger chunk of code for TLB shootdowns. - Remove some of the debug code from the original SMP implementation that is no longer used or no longer applies to the new APIC code. - Use a temporary hack to disable the ACPI module until the SMP code has been further reorganized to allow ACPI to work as a module again. - Add a DDB command to dump the interesting contents of the IDT.	2003-11-03 22:32:04 +00:00
John Baldwin	9738024229	Don't probe PnP BIOS devices for PICs for now to avoid problems with those devices claiming resources that they don't actually use. The PIC drivers only register valid interrupt sources, so we don't need to rely on these drivers to claim invalid IRQs to prevent their use by other drivers.	2003-11-03 22:22:04 +00:00
John Baldwin	ab089945d3	Add the ACPI MADT table APIC enumerator. This code uses the ACPI Multiple APIC Descriptor Table to enumerate both I/O APICs and local APICs. ACPI does not embed PCI interrupt routing information in the MADT like the MP Table does. Instead, ACPI stores the PCI interrupt routing information in the _PRT object under each PCI bus device. The MADT table simply provides hints about which interrupt vectors map to which I/O APICs. Thus when using ACPI, the existing ACPI PCI bridge drivers are sufficient to route PCI interrupts.	2003-11-03 22:17:44 +00:00
John Baldwin	8f8914ad98	Add the MP Table APIC enumerator. This code uses the BIOS MP Table to enumerate I/O APICs as well as local APICs. It also provides Host-PCI and PCI-PCI bridge drivers to use the MP Table to route PCI interrupts.	2003-11-03 22:12:37 +00:00
John Baldwin	6f92bdd0c1	New APIC support code: - The apic interrupt entry points have been rewritten so that each entry point can serve 32 different vectors. When the entry is executed, it uses one of the 32-bit ISR registers to determine which vector in its assigned range was triggered. Thus, the apic code can support 159 different interrupt vectors with only 5 entry points. - We now always to disable the local APIC to work around an errata in certain PPros and then re-enable it again if we decide to use the APICs to route interrupts. - We no longer map IO APICs or local APICs using special page table entries. Instead, we just use pmap_mapdev(). We also no longer export the virtual address of the local APIC as a global symbol to the rest of the system, but only in local_apic.c. To aid this, the APIC ID of each CPU is exported as a per-CPU variable. - Interrupt sources are provided for each intpin on each IO APIC. Currently, each source is given a unique interrupt vector meaning that PCI interrupts are not shared on most machines with an I/O APIC. That mapping for interrupt sources to interrupt vectors is up to the APIC enumerator driver however. - We no longer probe to see if we need to use mixed mode to route IRQ 0, instead we always use mixed mode to route IRQ 0 for now. This can be disabled via the 'NO_MIXED_MODE' kernel option. - The npx(4) driver now always probes to see if a built-in FPU is present since this test can now be performed with the new APIC code. However, an SMP kernel will panic if there is more than one CPU and a built-in FPU is not found. - PCI interrupts are now properly routed when using APICs to route interrupts, so remove the hack to psuedo-route interrupts when the intpin register was read. - The apic.h header was moved to apicreg.h and a new apicvar.h header that declares the APIs used by the new APIC code was added.	2003-11-03 21:53:38 +00:00
John Baldwin	223e573bbd	Add the new atpic(4) driver for the 8259A master and slave PICs. By default we provide 16 interrupt sources for IRQs 0 through 15. However, if the I/O APIC driver has already registered sources for any of those IRQs then we will silently fail to register our own source for that IRQ. Note that i386/isa/icu.h is now specific to the 8259A and no longer contains any info relevant to APICs. Also note that fast interrupts no longer use a separate entry point. Instead, both fast and threaded interrupts share the same entry point which merely looks up the appropriate source and passes control to intr_execute_handlers().	2003-11-03 21:34:45 +00:00
John Baldwin	ecee5704ed	New device interrupt code. This defines an interrupt source abstraction that provides methods via a PIC driver to do things like mask a source, unmask a source, enable it when the first interrupt handler is added, etc. The interrupt code provides a table of interrupt sources indexed by IRQ numbers, or vectors. These vectors are what new-bus uses for its IRQ resources and for bus_setup_intr()/bus_teardown_intr(). The interrupt code then maps that vector a given interrupt source object. When an interrupt comes in, the low-level interrupt code looks up the interrupt source for the source that triggered the interrupt and hands it off to this code to execute the appropriate handlers. By having an interrupt source abstraction, this allows us to have different types of interrupt source providers within the shared IRQ address space. For example, IRQ 0 may map to pin 0 of the master 8259A PIC, IRQs 1 through 60 may map to pins on various I/O APICs, and IRQs 120 through 128 may map to MSI interrupts for various PCI devices.	2003-11-03 21:25:52 +00:00
John Baldwin	e14243fac7	Move the NMI handling code out to its own file.	2003-11-03 21:10:17 +00:00
John Baldwin	1ab9ea3059	Include "opt_pmap.h" so that the DISABLE_P* options are honored.	2003-10-30 21:42:44 +00:00
John Baldwin	63239aa581	Always export r_gdt and r_idt and give them extern declarations in machine/segments.h.	2003-10-30 21:42:17 +00:00
Peter Wemm	fbd00896e2	MFi386: thread specific fpu state optimizations	2003-10-30 19:04:58 +00:00
Peter Wemm	3f378ea44a	MFi386: rev 1.451 (jhb): call pmap_kremove() rather than duplicate it	2003-10-30 04:08:22 +00:00
Peter Wemm	10d9b64384	MFi386: trap.c rev 1.259: fetch thread mailbox address in page fault trap	2003-10-30 04:06:28 +00:00
Peter Wemm	57e1fa205b	Oops. Remove some rather noisy debug printfs that slipped in there somehow.	2003-10-28 01:06:37 +00:00
John Baldwin	07930cce05	A few whitespace and comment tweaks.	2003-10-24 21:02:26 +00:00
Peter Wemm	cedb3695c1	Add __va_copy and make it always visible, in spite of the __ISO_C_VISIBLE setting. Make va_copy be an alias if __ISO_C_VISIBLE >= 1999. Why? more than a few ports have an autoconf that looks for __va_copy because it is available on glibc. It is critical that we use it if at all possible on amd64. It generally isn't a problem for i386 and its ilk because autoconf driven code tends to fall back to an assignment.	2003-10-24 02:50:39 +00:00
Peter Wemm	63f2bb5ff1	Use a more robust API altogether for the amd64_get_fsbase() etc functions.	2003-10-23 06:06:14 +00:00
Peter Wemm	c0432d033e	Renumber the sysarch vectors for amd64 specific syscalls so that I can implement i386 compat numbers where it makes sense. This would save a syscall translation layer. Yes, this breaks the abi slightly again, but fortunately its just a recompile rather than tweaking the source. I will be fixing the libc stubs while I'm here.	2003-10-23 05:31:23 +00:00
Mike Silbersack	184dcdc7c8	Change all SYSCTLS which are readonly and have a related TUNABLE from CTLFLAG_RD to CTLFLAG_RDTUN so that sysctl(8) can provide more useful error messages.	2003-10-21 18:28:36 +00:00
Nate Lawson	4c3655b418	Add the cpu_idle_hook() function pointer so that other idlers can be hooked at runtime. Make C1 sleep (e.g., HLT) be the default. This prepares the way for further ACPI sleep states.	2003-10-18 22:25:07 +00:00
Bruce Evans	ed86674a3d	Don't forget to load %es with the kernel data segment selector in Xcpustop(). %es is used in at least the call to savectx() when savectx() calls bcopy(), so not loading it was fatal if a stop IPI interrupts user mode. This reduces bugs starting and stopping CPUs for debuggers. CPUs are stopped mainly in kdb_trap() and cpu_reset(). At reset time there is a good chance that all the CPUs are in the kernel, so the bug was probably harmless then.	2003-10-16 10:44:24 +00:00
Peter Wemm	19acc770c2	Pull the tier-2 card one last time and break the get/setcontext and sigreturn() ABI and the signal context on the stack. Make the trapframe (and its shadows in the ucontext and sigframe etc) 8 bytes larger in order to preserve 16 byte stack alignment for the following C code calls. I could have done some padding after the trapframe was saved, but some of the C code still expects an argument of 'struct trapframe'. Anyway, this gives me a spare field that can be used to store things like 'partial trapframe' status or something else in the future. The runtime impact is fairly small, except for threaded apps and things that decode contexts and the signal stack (eg: cvsup binary). Signal delivery isn't too badly affected because the kernel generates the sigframe that sigreturn uses after the handler has been called. The size of mcontext_t and struct sigframe hasn't changed. Only the last few fields (sc_eip etc) got moved a little and I eliminated a spare field. mc_len/sc_len did change location though so the sanity checks there will still trap it.	2003-10-15 02:04:52 +00:00
Alan Cox	7fb578933f	MFia64 Move uma_small_alloc() and uma_small_free() to uma_machdep.c.	2003-10-14 05:51:31 +00:00
Robert Drehmel	ea924c4cd3	Implement preliminary support for the PT_SYSCALL command to ptrace(2).	2003-10-09 10:17:16 +00:00
Bruce M Simpson	2bc7dd5661	Move pmap_resident_count() from the MD pmap.h to the MI pmap.h. Add a definition of pmap_wired_count(). Add a definition of vmspace_wired_count(). Reviewed by: truckman Discussed with: peter	2003-10-06 01:47:12 +00:00
Alan Cox	ab87e2fb83	Don't bother setting a page table page's valid field. It is unused and not setting it is consistent with other uses of VM_ALLOC_NOOBJ pages.	2003-10-05 00:12:16 +00:00
Alan Cox	566526a957	Migrate pmap_prefault() into the machine-independent virtual memory layer. A small helper function pmap_is_prefaultable() is added. This function encapsulate the few lines of pmap_prefault() that actually vary from machine to machine. Note: pmap_is_prefaultable() and pmap_mincore() have much in common. Going forward, it's worth considering their merger.	2003-10-03 22:46:53 +00:00
Alan Cox	81b460c5eb	Reimplement pagezero() using "movnti".	2003-10-02 05:08:13 +00:00
Peter Wemm	6ccf265bb0	Commit Bosko's patch to clean up the PSE/PG_G initialization to and avoid problems with some Pentium 4 cpus and some older PPro/Pentium2 cpus. There are several problems, some documented in Intel errata. This patch: 1) moves the kernel to the second page in the PSE case. There is an errata that says that you Must Not point a 4MB page at physical address zero on older cpus. We avoided bugs here due to sheer luck. 2) sets up PSE page tables right from the start in locore, rather than trying to switch from 4K to 4M (or 2M) pages part way through the boot sequence at the same time that we're messing with PG_G. For some reason, the pmap work over the last 18 months seems to tickle the problems, and the PAE infrastructure changes disturb the cpu bugs even more. A couple of people have reported a problem with APM bios calls during boot. I'll work with people to get this resolved. Obtained from: bmilekic	2003-10-01 23:46:08 +00:00
Peter Wemm	a93020d7a1	Use __register_t instead of register_t, otherwise <sys/types.h> is a prerequisite for <ucontext.h> on amd64. Oops.	2003-10-01 01:08:04 +00:00
Peter Wemm	ba5a51ea04	MFi386: Do not depend on LEAPYEAR() macro boolean values being 0 or 1. MFi386: Add quality field for timer0	2003-09-30 06:42:47 +00:00
Peter Wemm	ec548f97fc	MFi386: BURN_BRIDGES around timer0 functions	2003-09-30 06:38:11 +00:00
Jeff Roberson	3c4d5e1546	- Remove the definition for TD_SWITCHIN as it is not used. Approved by: peter	2003-09-30 04:52:24 +00:00
Alan Cox	9060731130	Eliminate the pte object.	2003-09-27 20:53:01 +00:00
Alan Cox	d2a81cdbed	MFi386 Allocate the page table directory page as "no object" pages.	2003-09-26 04:12:41 +00:00
Alan Cox	d91440cd46	MFi386 Reimplement pmap_release() such that it uses the page table rather than the pte object to locate the page table directory pages. (Temporarily, retain an assertion on the emptiness of the pte object.)	2003-09-25 05:38:18 +00:00
Peter Wemm	cc3112f108	Re-raise the default datasize and stacksize now that the 32 bit exec support can clip it to sensible values.	2003-09-25 01:11:17 +00:00
Peter Wemm	c460ac3a00	Add sysentvec->sv_fixlimits() hook so that we can catch cases on 64 bit systems where the data/stack/etc limits are too big for a 32 bit process. Move the 5 or so identical instances of ELF_RTLD_ADDR() into imgact_elf.c. Supply an ia32_fixlimits function. Export the clip/default values to sysctl under the compat.ia32 heirarchy. Have mmap(0, ...) respect the current p->p_limits[RLIMIT_DATA].rlim_max value rather than the sysctl tweakable variable. This allows mmap to place mappings at sensible locations when limits have been reduced. Have the imgact_elf.c ld-elf.so.1 placement algorithm use the same method as mmap(0, ...) now does. Note that we cannot remove all references to the sysctl tweakable maxdsiz etc variables because /etc/login.conf specifies a datasize of 'unlimited'. And that causes exec etc to fail since it can no longer find space to mmap things.	2003-09-25 01:10:26 +00:00
Yoshihiro Takahashi	33e38a2cc8	Implement the bus_space_map() function to allocate resources and initialize a bus_handle, but currently it does only initializing a bus_handle.	2003-09-23 08:22:34 +00:00
Peter Wemm	725bc17312	Oops. back out last commit. The data and stack limits are used by the 32 bit binary stuff. 32 bit binaries do not like it much when the kernel tries hard to put things above the 8GB mark. I have a work-in-progress to fix this properly, but I didn't want to burn anybody with this yet.	2003-09-23 03:20:34 +00:00
Peter Wemm	705c67adc2	Fix patch transcription typo. s/IDT_BPT/IDT_BP/	2003-09-23 00:45:55 +00:00
Peter Wemm	cd3402fa66	Sync with i386 version. The quality initialization was missing and some other junk.	2003-09-23 00:18:45 +00:00
Peter Wemm	ee3ce1c29c	GC unused child variable	2003-09-23 00:04:28 +00:00
Peter Wemm	4295ddf26f	MFi386 pci_bus.c 1.102 legacyvar.h 1.4: rename nexus_pcib to legacy_pcib However, leave legacy_pcib_route_interrupt() since there is no pcibios to call.	2003-09-23 00:03:44 +00:00
Peter Wemm	da87d7e10d	Move basemem variable into global scope so that the MP startup code can refer to it for looking for tables.	2003-09-22 23:33:29 +00:00
Peter Wemm	24789c549a	Increase the default data size limit from 512MB to 8GB. Increase default stack limit from 64MB to 512MB.	2003-09-22 23:21:39 +00:00
Peter Wemm	848947c793	MFi386 rev 1.51 by scottl: make dflt_lock() always panic.	2003-09-22 23:11:42 +00:00
Peter Wemm	951b3d46b6	MFi386 rev 1.53 by scottl: Allocate the S/G list in the tag, not on the stack. This means that s/g lists can be arbitrarily long.	2003-09-22 23:10:24 +00:00
Peter Wemm	d79ddbf5de	MFi386 machdep.c rev 1.201, clock.c 1.201, clock.h 1.45 by phk: Dont initialize a TSC timecounter until we know if it is broke or not. XXX I think there is a bug in the i386 code here. init_TSC_tc() comes after: if (statclock_disable) return; ie: if you turn off the statclock interrupt, you dont get the TSC either.	2003-09-22 23:02:24 +00:00
Peter Wemm	e63f19e150	MFi386 rev 1.105 by jhb: fix comment typo	2003-09-22 22:54:14 +00:00
Peter Wemm	74a99ec4fe	MFi386 rev 1.256 by jhb: remove redundant #include <sys/sysctl.h>	2003-09-22 22:52:46 +00:00
Peter Wemm	13a27f2962	MFi386 rev 1.25 by jhb: add new MSR's and some missing older ones and APICBASE MSR constants.	2003-09-22 22:51:46 +00:00
Peter Wemm	f0c4b48689	MFi386 rev 1.55 by sam: remove unused #define BUS_DMAMAP_NSEGS	2003-09-22 22:43:21 +00:00
Peter Wemm	d10e66f073	MFi386 rev 1.37: constant-friendly bswap macros	2003-09-22 22:37:49 +00:00
Peter Wemm	5bc82d1ce1	MFi386: pci_cfgreg.h rev 1.10 by jhb/des/njl. Fix CONF1_ENABLE_MSK.	2003-09-22 22:21:21 +00:00
Peter Wemm	20e220ac68	MFCi386: trap.c rev 1.257 by bde. Don't forget to reenable interrupts for breakpoint and trace traps from usermode. Although all the setidt entries are interrupt gates on amd64, all but the trace and bpt trap entry handlers reenable interrupts after the swapgs instruction in order to simulate the trap/interrupt gate distinction. In other words, the amd64 code behaves the same way that i386 does here.	2003-09-22 22:19:59 +00:00
Peter Wemm	8848ad863b	MFi386 by jhb: add acpi_SetDefaultIntrModel();	2003-09-22 22:12:46 +00:00
Peter Wemm	76caec589f	MFi386 by jhb: use symbolic constants for the IDT entries.	2003-09-22 22:09:02 +00:00
Peter Wemm	882554f111	MFi386: machdep.c:1.570 clock.c:1.204 by bde: Quick fix for calling DELAY for ddb input in some atkbd-based console drivers. ddb must not use any normal locks but DELAY() normally calls getit() which needs clock_lock. This also removes the need for recursion on clock_lock.	2003-09-22 21:56:48 +00:00
Joerg Wunsch	9678710b1f	Mention the puc(4) glue driver in a commented-out example so the user of "dumb" PCI-based serial/parallel boards get a hint how to enable them. I wasn't sure about the ia64, pc98, powerpc, and sparc64 archs whether they'd support puc(4) or not.	2003-09-19 20:04:55 +00:00
David E. O'Brien	67193a54f0	Statically compile in sound as we don't have modules yet.	2003-09-15 22:40:00 +00:00
Alan Cox	6d66d714c7	Simplify (and micro-optimize) pmap_unuse_pt(): Only one caller, pmap_remove_pte(), passed NULL instead of the required page table page to pmap_unuse_pt(). Compute the necessary page table page in pmap_remove_pte(). Also, remove some unreachable code from pmap_remove_pte().	2003-09-13 21:57:38 +00:00
Alan Cox	b9850eb224	Add a new parameter to pmap_extract_and_hold() that is needed to eliminate Giant from vmapbuf(). Idea from: tegge	2003-09-12 07:07:49 +00:00
David E. O'Brien	3fc40c2484	Sort 'bge' correctly.	2003-09-10 18:54:59 +00:00
John Baldwin	a547af297d	Remove an XXX comment by using the per CPU mask added after this comment was added.	2003-09-10 01:36:48 +00:00
John Baldwin	f03cb48d41	Fix a typo.	2003-09-10 01:11:58 +00:00
Peter Wemm	2c25d12414	Clean up get/set_mcontext() and get/set_fpcontext(). These are operated on data structures on the kernel stack which are guaranteed to be 16 byte aligned by gcc, the amd64 ABI and __aligned(16). Ensire the tss_rsp0 initial stack pointer is 16 byte aligned in case sizeof(pcb) becomes odd at some point. This is convenient for the interrupt handler case because the ring crossing pushes cause the required odd alignment before the call to the C code. Have fast_syscall add an additional 8 bytes to ensure that the trapframe has the correct odd alignment for the call to C code. Note that there are no checks to make sure that the trapframe size is appropriate for this. This makes get/setfpcontext work properly (finally). You get a GPF in kernel mode if any of this is botched without the alignment fixup code that is apparently needed on i386.	2003-09-09 19:32:09 +00:00
Peter Wemm	df6ece387b	Turn aac back on now that its been cleaned up for 64 bit compilation	2003-09-08 20:00:55 +00:00
Peter Wemm	292bbfd103	Argh. This file was completely out of sync with mcontext/trapframe.	2003-09-08 18:31:48 +00:00
Peter Wemm	7fe089a006	Hmm. Two copies of the mcontext...	2003-09-08 18:28:41 +00:00
Alan Cox	ba2157f218	Introduce a new pmap function, pmap_extract_and_hold(). This function atomically extracts and holds the physical page that is associated with the given pmap and virtual address. Such a function is needed to make the memory mapping optimizations used by, for example, pipes and raw disk I/O MP-safe. Reviewed by: tegge	2003-09-08 02:45:03 +00:00
Bill Paul	a94100fa9b	Take the support for the 8139C+/8169/8169S/8110S chips out of the rl(4) driver and put it in a new re(4) driver. The re(4) driver shares the if_rlreg.h file with rl(4) but is a separate module. (Ultimately I may change this. For now, it's convenient.) rl(4) has been modified so that it will never attach to an 8139C+ chip, leaving it to re(4) instead. Only re(4) has the PCI IDs to match the 8169/8169S/8110S gigE chips. if_re.c contains the same basic code that was originally bolted onto if_rl.c, with the following updates: - Added support for jumbo frames. Currently, there seems to be a limit of approximately 6200 bytes for jumbo frames on transmit. (This was determined via experimentation.) The 8169S/8110S chips apparently are limited to 7.5K frames on transmit. This may require some more work, though the framework to handle jumbo frames on RX is in place: the re_rxeof() routine will gather up frames than span multiple 2K clusters into a single mbuf list. - Fixed bug in re_txeof(): if we reap some of the TX buffers, but there are still some pending, re-arm the timer before exiting re_txeof() so that another timeout interrupt will be generated, just in case re_start() doesn't do it for us. - Handle the 'link state changed' interrupt - Fix a detach bug. If re(4) is loaded as a module, and you do tcpdump -i re0, then you do 'kldunload if_re,' the system will panic after a few seconds. This happens because ether_ifdetach() ends up calling the BPF detach code, which notices the interface is in promiscuous mode and tries to switch promisc mode off while detaching the BPF listner. This ultimately results in a call to re_ioctl() (due to SIOCSIFFLAGS), which in turn calls re_init() to handle the IFF_PROMISC flag change. Unfortunately, calling re_init() here turns the chip back on and restarts the 1-second timeout loop that drives re_tick(). By the time the timeout fires, if_re.ko has been unloaded, which results in a call to invalid code and blows up the system. To fix this, I cleared the IFF_UP flag before calling ether_ifdetach(), which stops the ioctl routine from trying to reset the chip. - Modified comments in re_rxeof() relating to the difference in RX descriptor status bit layout between the 8139C+ and the gigE chips. The layout is different because the frame length field was expanded from 12 bits to 13, and they got rid of one of the status bits to make room. - Add diagnostic code (re_diag()) to test for the case where a user has installed a broken 32-bit 8169 PCI NIC in a 64-bit slot. Some NICs have the REQ64# and ACK64# lines connected even though the board is 32-bit only (in this case, they should be pulled high). This fools the chip into doing 64-bit DMA transfers even though there is no 64-bit data path. To detect this, re_diag() puts the chip into digital loopback mode and sets the receiver to promiscuous mode, then initiates a single 64-byte packet transmission. The frame is echoed back to the host, and if the frame contents are intact, we know DMA is working correctly, otherwise we complain loudly on the console and abort the device attach. (At the moment, I don't know of any way to work around the problem other than physically modifying the board, so until/unless I can think of a software workaround, this will have do to.) - Created re(4) man page - Modified rlphy.c to allow re(4) to attach as well as rl(4). Note that this code works for the sample 8169/Marvell 88E1000 NIC that I have, but probably won't work for the 8169S/8110S chips. RealTek has sent me some sample NICs, but they haven't arrived yet. I will probably need to add an rlgphy driver to handle the on-board PHY in the 8169S/8110S (it needs special DSP initialization).	2003-09-08 02:11:25 +00:00
Peter Wemm	c896a8adbf	Oops. sizeof(long) = 8, not 4. Get the fxsave buffer inside mcontext the right size. I'm planning on possibly stealing the two 'spare' variables on either side for botched alignment correction.	2003-09-05 20:47:27 +00:00
David E. O'Brien	be8d2cbf2c	MFi386: add device ataraid, this is now seperate and not pulled in by atadisk.	2003-09-03 01:24:47 +00:00
Alexander Kabaev	1d49585050	Standardize idempotentcy ifdefs. Consistently use _MACHINE_VARARGS_H_ symbol.	2003-09-01 03:01:45 +00:00
Alan Cox	411d10a600	Migrate the sf_buf allocator that is used by sendfile(2) and zero-copy sockets into machine-dependent files. The rationale for this migration is illustrated by the modified amd64 allocator. It uses the amd64's direct map to avoid emphemeral mappings in the kernel's address space. On an SMP, the emphemeral mappings result in an IPI for TLB shootdown for each transmitted page. Yuck. Maintainers of other 64-bit platforms with direct maps should be able to use the amd64 allocator as a reference implementation.	2003-08-29 20:04:10 +00:00
John Baldwin	729d7ffbcf	- Rename PCIx_HEADERTYPE* to PCIx_HDRTYPE* so the constants aren't so long. - Add a new PCIM_HDRTYPE constant for the field in PCIR_HDRTYPE that holds the header type. - Replace several magic numbers with appropriate constants for the header type register and a couple of PCI_FUNCMAX. - Merge to amd64 the fix to the i386 bridge code to skip devices with unknown header types. Requested by: imp (1, 2)	2003-08-28 21:22:25 +00:00
Nate Lawson	5a4d072c93	Minor style cleanups.	2003-08-28 16:30:31 +00:00
David E. O'Brien	a7b60ab26e	Fix copyright comment & FBSDID style nits. Requested by: bde	2003-08-25 09:48:48 +00:00
Alan Cox	d08ffe8451	Eliminate the last (direct) uses of vm_page_lookup() on the pte object.	2003-08-24 08:07:06 +00:00
Peter Wemm	0dda1d3887	AMD64 mtrr driver.	2003-08-23 00:27:58 +00:00
Peter Wemm	46159d1fd6	Switch to using the emulator in the common compat area. Still work-in-progress.	2003-08-23 00:04:53 +00:00
Peter Wemm	c639ca93f4	Initial sweep at dividing up the generic 32bit-on-64bit kernel support from the ia32 specific stuff. Some of this still needs to move to the MI freebsd32 area, and some needs to move to the MD area. This is still work-in-progress.	2003-08-22 23:19:02 +00:00
Warner Losh	d2c5276d96	Prefer new location of pci include files (which have only been in the tree for two or more years now), except in a few places where there's code to be compatible with older versions of FreeBSD.	2003-08-22 07:39:05 +00:00
Peter Wemm	82914097e5	Regen	2003-08-21 03:48:50 +00:00
Peter Wemm	6b59055cb8	This is too funny for words. Swap syscalls 416 and 417 around. It works better that way when sigaction() and sigreturn() do the right thing.	2003-08-21 03:48:05 +00:00
Alan Cox	2b12cfb461	- Lock the pte object when performing vm_page_grab(). - Insure that the page table page is zero filled before adding it to the page table.	2003-08-20 05:09:55 +00:00
Gordon Tetlow	df3d69c217	Fixup the ELF branding information to point to the new home of rtld.	2003-08-17 08:08:38 +00:00
Alan Cox	365b27ea29	In pmap_copy(), since we have the page table page's physical address in hand, use PHYS_TO_VM_PAGE() rather than vm_page_lookup().	2003-08-17 04:48:21 +00:00
Marcel Moolenaar	710338e94f	In vm_thread_swap{in\|out}(), remove the alpha specific conditional compilation and replace it with a call to cpu_thread_swap{in\|out}(). This allows us to add similar code on ia64 without cluttering the code even more.	2003-08-16 23:15:15 +00:00
Marcel Moolenaar	26502503e5	Further cleanup <machine/cpu.h> and <machine/md_var.h>: move the MI prototypes of cpu_halt(), cpu_reset() and swi_vm() from md_var.h to cpu.h. This affects db_command.c and kern_shutdown.c. ia64: move all MD prototypes from cpu.h to md_var.h. This affects madt.c, interrupt.c and mp_machdep.c. Remove is_physical_memory(). It's not used (vm_machdep.c). alpha: the MD prototypes have been left in cpu.h with a comment that they should be there. Moving them is left for later. It was expected that the impact would be significant enough to be done in a seperate commit. powerpc: MD prototypes left in cpu.h. Comment added. Suggested by: bde Tested with: make universe (pc98 incomplete)	2003-08-16 16:57:57 +00:00
Alan Cox	6700fc865c	Eliminate pmap_page_lookup() and its uses. Instead, use PHYS_TO_VM_PAGE() to convert the pte's physical address into a vm page. Reviewed by: peter	2003-08-16 03:11:33 +00:00
John Baldwin	594dfbc391	- Fix a duplicated typo. - Add a macro for the logical shift needed to extract an APIC ID from either from the local APIC ICR Hi register or the APIC ID registers of the local and IO APICs.	2003-08-15 15:23:13 +00:00
Warner Losh	06b4bf3e55	Expand inline the relevant parts of src/COPYRIGHT for Matt Dillon's copyrighted files. Approved by: Matt Dillon	2003-08-12 23:24:05 +00:00
Paul Saab	77c39e17fa	Halted CPU's should not accumulate time. Reviewed by: jhb	2003-08-12 17:01:10 +00:00
Alan Cox	ba97fd8a78	Rename pmap_changebit() to pmap_clear_ptes() and remove the last parameter. The new name better reflects what the function does and how it is used. The last parameter was always FALSE. Note: In theory, gcc would perform constant propagation and dead code elimination to achieve the same effect as removing the last parameter, which is always FALSE. In practice, recent versions do not. So, there is little point in letting unused code pessimize execution.	2003-08-10 21:53:55 +00:00
Alan Cox	7fbff95c04	MFi386 1.422 & 1.423: lock page queues in pmap_insert_entry().	2003-08-08 01:52:03 +00:00
Scott Long	477327b5c5	In _bus_dmamap_load_buffer(), only count the number of bounce pages needed if they haven't been counted before. This test was ommitted when bus_dmamap_load() was merged into this function, and results in the pagesneeded field growing without bounds when multiple deferrals happen. Thanks to Paul Saab for beating his head against this for a few hours =-)	2003-08-04 23:40:35 +00:00
John Baldwin	3bdbd658f1	- Since td_critnest is now initialized in MI code, it doesn't have to be set in cpu_critical_fork_exit() anymore. - As far as I can tell, cpu_thread_link() has never been used, not even when it was originally added, so remove it.	2003-08-04 20:32:45 +00:00
Alan Cox	e53f32ace5	Use kmem_alloc_nofault() rather than kmem_alloc_pageable() in pmap_mapdev(). See revision 1.140 of kern/sys_pipe.c for a detailed rationale. Submitted by: tegge	2003-08-02 19:26:09 +00:00
Peter Wemm	59cc2230c6	Fix a dumbass mistake. I had the 'set' and 'get' reversed in the fpsetround/fpgetround macro pairs.	2003-08-02 00:26:30 +00:00
Bosko Milekic	b053bc8407	Make sure that when the PV ENTRY zone is created in pmap, that it's created not only with UMA_ZONE_VM but also with UMA_ZONE_NOFREE. In the i386 case in particular, the pmap code would hook a special page allocation routine that allocated from kernel_map and not kmem_map, and so when/if the pageout daemon drained the zones, it could actually push out slabs from the PV ENTRY zone but call UMA's default page_free, which resulted in pages allocated from kernel_map being freed to kmem_map; bad. kmem_free() ignores the return value of the vm_map_delete and just returns. I'm not sure what the exact repercussions could be, but it doesn't look good. In the PAE case on i386, we also set-up a zone in pmap, so be conservative for now and make that zone also ZONE_NOFREE and ZONE_VM. Do this for the pmap zones for the other archs too, although in some cases it may not be entirely necessarily. We'd rather be safe than sorry at this point. Perhaps all UMA_ZONE_VM zones should by default be also UMA_ZONE_NOFREE? May fix some of silby's crashes on the PV ENTRY zone.	2003-07-31 03:39:51 +00:00
Peter Wemm	3950c40739	KSTACK_PAGES is a global option.	2003-07-31 01:27:18 +00:00
Peter Wemm	9fb1db7bc8	Cosmetic: fix disorder of opt_kstack_pages.h include.	2003-07-31 01:26:40 +00:00
David Xu	5a92cbc206	Use PSL_KERNEL as upcall thread's initial rflags, don't use scratch user rflags.	2003-07-29 12:44:16 +00:00
Maxime Henrion	d5afecd068	- Introduce a new busdma flag BUS_DMA_ZERO to request for zero'ed memory in bus_dmamem_alloc(). This is possible now that contigmalloc() supports the M_ZERO flag. - Remove the locking of Giant around calls to contigmalloc() since contigmalloc() now grabs Giant itself.	2003-07-27 13:52:10 +00:00
David E. O'Brien	56ae44c5df	Use __FBSDID(). Brought to you by: a boring talk at Ottawa Linux Symposium	2003-07-25 21:19:19 +00:00
David E. O'Brien	12ea2cfe2e	Use __FBSDID(). Brought to you by: a boring talk at OLS	2003-07-25 21:10:19 +00:00
Alan Cox	059358675e	MFi386 revision 1.416 Add vm object locking to pmap_prefault(). Note: powerpc and sparc64 do not implement this function.	2003-07-25 18:58:39 +00:00
David Xu	74bbb26b51	Align upcall stack top to odd times of 8. GCC accounts return address in callee function for stack alignment.	2003-07-25 00:21:37 +00:00
David Xu	c3f8e34d6b	Implement cpu_set_upcall and cpu_set_upcall_kse. Reviewed by: peter	2003-07-24 08:52:44 +00:00
David Xu	81ebc68226	Set fault address to si_addr. Reviewed by: peter	2003-07-24 08:51:22 +00:00
Peter Wemm	9e9e575b6a	Make the breakpoint instruction trap gate available to users. ptrace() needs this. Submitted by: Mark Kettenis <kettenis@chello.nl>	2003-07-23 23:20:20 +00:00
Peter Wemm	8b48b40d5e	Set the %gs base to pcb_gsbase, not pcb_fsbase. Oops. Discovered by: davidxu	2003-07-23 23:17:15 +00:00
Alan Cox	3462150083	Annotate pmap_changebit() as __always_inline. This function was written as a template that when inlined is specialized for the caller through constant value propagation and dead code elimination. Thus, the specialized code that is generated for pmap_clear_reference() et al. avoids several conditional branches inside of a loop.	2003-07-23 19:49:32 +00:00
John Baldwin	e47d4f0fc2	Use macros from apic.h to when writing to the ICR to send IPIs to startup APs rather than magic numbers. Tested by: scottl	2003-07-23 19:04:28 +00:00
John Baldwin	55fb372edd	Add a new macro APIC_ICRLO_RESV_MASK that contains all of the reserved fields in the low 32 bits of the local APIC ICR register. Use this macro in place of APIC_RESV2_MASK when masking off existing bits from the ICR when writing to it to send an IPI. Tested by: scottl	2003-07-23 18:59:38 +00:00
Peter Wemm	5b9f8ddbbd	Go back to 64 bit precision for fadd/fsub/fsqrt etc. This is because on AMD64, gcc (and the ABI) expects the x87 unit to be running in 80/64 mode (not 64/53) so that it can use it for 'long double' operations. It takes the expected precision differences into account when generating code.	2003-07-22 06:50:34 +00:00
Peter Wemm	76537e43f5	Extend the machine/ieeefp.h that was inherited from i386 to support the SSE mxcsr register as well. Since gcc will intermix SSE2 and x87 FP code, the fpsetround() etc mode had better be the same. There are hooks to enable these inlines to be instantiated inside libc for non-gcc or C++ callers. (g++ doesn't like the inlines that tried to extract an integer and convert it to an enum).	2003-07-22 06:44:54 +00:00
David Xu	20a2d71332	Rename thread_siginfo to cpu_thread_siginfo. Suggested by: jhb	2003-07-15 00:11:04 +00:00
Mark Murray	c7b132c974	Protect lint(1) from a #error.	2003-07-10 18:05:02 +00:00
Peter Wemm	e95babf3a8	unifdef -DLAZY_SWITCH and start to tidy up the associated glue.	2003-07-10 01:02:59 +00:00
Peter Wemm	bf8ca114e2	Fix the VADDR() macros to use either KVADDR() or UVADDR(), depending on the implied sign extension. The single unified VADDR() macro was not able to avoid sign extending the VM_MAXUSER_ADDRESS/USRSTACK values. Be explicit about UVADDR() (positive address space) and KVADDR() (kernel negative address space) to make mistakes show up more spectacularly. Increase user VM space from 1/2TB (512GB) to 128TB.	2003-07-09 23:04:23 +00:00
Peter Wemm	6486c09935	Fix up bogus index/offset/mask calculations in the allocpte and the corresponding release code. This was preventing the use of more than 1/2TB of user VM. I also spent a week staring at this code only to eventually find that I'd mistakenly typed a P as an R.	2003-07-09 22:59:45 +00:00
Peter Wemm	4afd44c16a	Turn the 2MB page mappings that cover the kernel text+data+bss area back on now that pmap_pte() can handle it. I never actually ran into anything that broke that I know of, but this was turned off as a precaution.	2003-07-09 22:55:00 +00:00
Peter Wemm	436e1f203f	Have pmap_pte() on a 2MB mapped address return the 2MB pde itself rather than a non-existing pte. There is code elsewhere in i386/amd64 pmap that neglects to handle the large page cases because it knows that it will see PG_PS in the returned "pte".	2003-07-09 22:53:45 +00:00
Alan Cox	90a7c7b671	In pmap_object_init_pt(), the pmap_invalidate_all() should be performed on the caller-provided pmap, not the kernel_pmap. Using the kernel_pmap results in an unnecessary IPI for TLB shootdown on SMPs. Reviewed by: jake, peter	2003-07-08 19:40:35 +00:00
Alan Cox	1f78f902a8	Background: pmap_object_init_pt() premaps the pages of a object in order to avoid the overhead of later page faults. In general, it implements two cases: one for vnode-backed objects and one for device-backed objects. Only the device-backed case is really machine-dependent, belonging in the pmap. This commit moves the vnode-backed case into the (relatively) new function vm_map_pmap_enter(). On amd64 and i386, this commit only amounts to code rearrangement. On alpha and ia64, the new machine independent (MI) implementation of the vnode case is smaller and more efficient than their pmap-based implementations. (The MI implementation takes advantage of the fact that objects in -CURRENT are ordered collections of pages.) On sparc64, pmap_object_init_pt() hadn't (yet) been implemented.	2003-07-03 20:18:02 +00:00
Maxime Henrion	331e012396	Sync more things with other backends.	2003-07-01 19:16:48 +00:00
Maxime Henrion	4813f72a9b	Honor the boundary of the busdma tag when allocating bounce pages. This was fixed in revision 1.5 of alpha/alpha/busdma_machdep.c and was never fixed in other busdma backends using bounce pages.	2003-07-01 16:54:54 +00:00
Scott Long	f6b1c44d1f	Mega busdma API commit. Add two new arguments to bus_dma_tag_create(): lockfunc and lockfuncarg. Lockfunc allows a driver to provide a function for managing its locking semantics while using busdma. At the moment, this is used for the asynchronous busdma_swi and callback mechanism. Two lockfunc implementations are provided: busdma_lock_mutex() performs standard mutex operations on the mutex that is specified from lockfuncarg. dftl_lock() is a panic implementation and is defaulted to when NULL, NULL are passed to bus_dma_tag_create(). The only time that NULL, NULL should ever be used is when the driver ensures that bus_dmamap_load() will not be deferred. Drivers that do not provide their own locking can pass busdma_lock_mutex,&Giant args in order to preserve the former behaviour. sparc64 and powerpc do not provide real busdma_swi functions, so this is largely a noop on those platforms. The busdma_swi on is64 is not properly locked yet, so warnings will be emitted on this platform when busdma callback deferrals happen. If anyone gets panics or warnings from dflt_lock() being called, please let me know right away. Reviewed by: tmm, gibbs	2003-07-01 15:52:06 +00:00
Alan Cox	dca96f1adc	- Export pmap_enter_quick() to the MI VM. This will permit the implementation of a largely MI pmap_object_init_pt() for vnode-backed objects. pmap_enter_quick() is implemented via pmap_enter() on sparc64 and powerpc. - Correct a mismatch between pmap_object_init_pt()'s prototype and its various implementations. (I plan to keep pmap_object_init_pt() as the MD hook for device-backed objects on i386 and amd64.) - Correct an error in ia64's pmap_enter_quick() and adjust its interface to match the other versions. Discussed with: marcel	2003-06-29 21:20:04 +00:00
Jeff Roberson	ab875ef896	- Construct a cpu topology map for Hyper Threading systems so that ULE may take advantage of them.	2003-06-28 22:07:42 +00:00
David Xu	b8f480ab94	Add a machine depended function thread_siginfo, SA signal code will use the function to construct a siginfo structure and use the result to export to userland. Reviewed by: julian	2003-06-28 06:34:08 +00:00
Scott Long	7f95801188	Catch amd64 up with the pending busdma async callback locking. Though this mechanism might change in the near future, it's best to keep everything in sync right now. Reminded by: peter	2003-06-28 06:07:06 +00:00
Peter Wemm	b6a5f89b4d	Turn ips back on.	2003-06-27 23:11:22 +00:00
Peter Wemm	1e5d8b3b66	Oops, I only added a comment about why ips doesn't compile. Actually comment it out for real.	2003-06-26 04:01:59 +00:00
Peter Wemm	ba1cabf4b9	Sync with i386 - add everything that compiles. There are a few drivers that are trivially easy to fix (eg: ips) that I've not committed fixes for.	2003-06-26 03:49:54 +00:00
Peter Wemm	2d29639ebb	Add back in the ability for pmap_mapdev() to use KVM if the region being requested is outside of the range of the direct map region. eg: for pci windows. While here, increase the minimum size of the direct map region to be 4GB instead of 1GB.	2003-06-26 01:04:31 +00:00
Alan Cox	0183359659	MFi386 Add vm object locking to pmap_object_init_pt().	2003-06-23 06:10:52 +00:00
Hidetoshi Shimokawa	e07324646e	Move KERNBASE to -2GB. Currently, we cannot increase KVA more than 2GB.	2003-06-22 13:02:45 +00:00
Hidetoshi Shimokawa	bfcd2ec739	- Allow access to direct mapped region via /dev/kmem. This makes 'netstat -r' work. - Use direct map for /dev/mem.	2003-06-22 12:59:43 +00:00
Hidetoshi Shimokawa	c1c1cc9c19	- Allocate a new PD Table if kernel grows beyond 1GB boundary. Reviewed by: peter - Use direct map in pmap_mapdev().	2003-06-22 12:55:20 +00:00
Hidetoshi Shimokawa	e14720d614	Use direct map in pmap_map(). This saves much KVA for vm_pages and you don't need to increase NKPT for large physical memory anymore. Suggested by: dfr	2003-06-20 14:09:33 +00:00
Hidetoshi Shimokawa	d25ac2fa68	Fix direct map page table for 2GB+ physical memory. You may still need to increase NKPT for larger memory. I have successfully booted 8GB system with NKPT=256.	2003-06-19 12:14:37 +00:00
Alan Cox	40ebf3e43a	Fix a performance bug in all of the various implementations of uma_small_alloc(): They always zeroed the page regardless of what the caller requested.	2003-06-18 02:57:38 +00:00
David Xu	0e2a4d3aeb	Rename P_THREADED to P_SA. P_SA means a process is using scheduler activations.	2003-06-15 00:31:24 +00:00
Alan Cox	49a2507bd1	Migrate the thread stack management functions from the machine-dependent to the machine-independent parts of the VM. At the same time, this introduces vm object locking for the non-i386 platforms. Two details: 1. KSTACK_GUARD has been removed in favor of KSTACK_GUARD_PAGES. The different machine-dependent implementations used various combinations of KSTACK_GUARD and KSTACK_GUARD_PAGES. To disable guard page, set KSTACK_GUARD_PAGES to 0. 2. Remove the (unnecessary) clearing of PG_ZERO in vm_thread_new. In 5.x, (but not 4.x,) PG_ZERO can only be set if VM_ALLOC_ZERO is passed to vm_page_alloc() or vm_page_grab().	2003-06-14 23:23:55 +00:00
Alan Cox	89f4fca265	Move the _new_altkstack() and _dispose_altkstack() functions out of the various pmap implementations into the machine-independent vm. They were all identical.	2003-06-14 06:20:25 +00:00
Peter Wemm	77e2a274d0	GC unused cpu_wait() function	2003-06-11 05:20:33 +00:00
John Baldwin	6b9691f103	- Use IDTVEC() to declare IPI handlers since they are also IDT vectors. - Make handlers for IPI's used by SMP kernels #ifdef SMP.	2003-06-06 17:45:25 +00:00
John Baldwin	e59ae32f18	- Document the thermal and performance counter LVT entries in the local APIC. - Add a lvt_thermal member to the LAPIC struct. - Add constants for the SMI and INIT LVT delivery modes.	2003-06-06 17:22:15 +00:00
Marcel Moolenaar	c2e4eb969f	Change the second (and last) argument of cpu_set_upcall(). Previously we were passing in a void* representing the PCB of the parent thread. Now we pass a pointer to the parent thread itself. The prime reason for this change is to allow cpu_set_upcall() to copy (parts of) the trapframe instead of having it done in MI code in each caller of cpu_set_upcall(). Copying the trapframe cannot always be done with a simply bcopy() or may not always be optimal that way. On ia64 specifically the trapframe contains information that is specific to an entry into the kernel and can only be used by the corresponding exit from the kernel. A trapframe copied verbatim from another frame is in most cases useless without some additional normalization. Note that this change removes the assignment to td->td_frame in some implementations of cpu_set_upcall(). The assignment is redundant. A previous call to cpu_thread_setup() already did the exact same assignment. An added benefit of removing the redundant assignment is that we can now change td_pcb without nasty side-effects. This change officially marks the ability on ia64 for 1:1 threading. Not tested on: amd64, powerpc Compile & boot tested on: alpha, sparc64 Functionally tested on: i386, ia64	2003-06-04 22:46:27 +00:00
Peter Wemm	7fc03ef474	Fix ALIGNED_POINTER(). sizeof((u_int32_t)) is not legal C.	2003-06-04 02:15:13 +00:00
Peter Wemm	babc58fd74	Fix restarted syscalls. When we rewind %rip, we also need to restore all the argument registers etc since we have almost certainly have trashed them by now. Take particular car of %r10 since it held the original value of %rcx (which we saved in tf_rcx on entry and doreti doesn't know this).	2003-06-02 21:56:08 +00:00
Peter Wemm	c35518b4ed	Make this more compatable with libc_r. Make the internal types for storing registers an array of longs rather than int.	2003-06-02 21:49:35 +00:00
David E. O'Brien	006124d811	Use __FBSDID().	2003-06-02 16:32:55 +00:00
David E. O'Brien	9676a785e7	Use __FBSDID().	2003-06-02 06:43:15 +00:00
Peter Wemm	193b147c05	MFi386: i386/include/asm.h rev 1.11: Do not abuse ##.	2003-06-02 05:59:35 +00:00
David E. O'Brien	69bb404192	Use C99 compatable asm statements.	2003-06-02 00:29:35 +00:00
David E. O'Brien	713c939103	Sync with i386/GENERIC ordering.	2003-06-01 20:26:38 +00:00
Peter Wemm	395aac85f8	MFi386: rev 1.56: remove break after return	2003-05-31 22:02:11 +00:00
Peter Wemm	0c5b3efcb0	MFi386: rev 1.23: use gdb_strlen()/gdb_strcpy() directly.	2003-05-31 22:00:57 +00:00
Peter Wemm	fbbfc4c335	MFi386: rev 1.50: remove unused variable	2003-05-31 21:58:55 +00:00
Poul-Henning Kamp	618b80ddcf	Avoid unbalancing the { } count in the source file with #ifdef by putting the opening { after the #ifdef ... #endif sequence. Found by: FlexeLint	2003-05-31 20:25:53 +00:00
Peter Wemm	4af5a3de60	Add acpi to the build. Remove the hack from machdep.c that lies to the loader to shut it up.	2003-05-31 07:00:08 +00:00
Peter Wemm	b043c80645	Have hammer_time() return the proc0 stack location, and have locore switch to it before calling mi_startup(). The bootstack is WAY too small for running acpica during probe/attach. While here, pass modulep/physfree to the startup routine, rather than writing to the global variables in locore.S. Approved by: re (amd64/*)	2003-05-31 06:54:29 +00:00
Peter Wemm	5681a6f60d	Regenerate.	2003-05-31 06:51:04 +00:00
Peter Wemm	1f5b79bc16	Make this compile with WITNESS enabled. It wants the syscall names.	2003-05-31 06:49:53 +00:00
Peter Wemm	ff7bf2f72e	Port acpica to amd64. Approved by: re (amd64/* blanket)	2003-05-31 06:47:05 +00:00
Peter Wemm	cc71eb5e10	With the help of jhb, fix the ACPI_ACQUIRE_GLOBAL_LOCK() macros and port to amd64 after repocopy. Approved by: re (amd64/*)	2003-05-31 06:43:55 +00:00
Hiten Pandya	b77c32a07e	Rename BUS_DMAMEM_NOSYNC to BUS_DMA_COHERENT. The current name is confusing, because it indicates to the client that a bus_dmamap_sync() operation is not necessary when the flag is specified, which is wrong. The main purpose of this flag is to hint the underlying architecture that DMA memory should be mapped in a coherent way, but the architecture can ignore it. But if the architecture does supports coherent mapping of memory, then it makes bus_dmamap_sync() calls cheap. This flag is the same as the one in NetBSD's Bus DMA. Reviewed by: gibbs, scottl, des (implicitly) Approved by: re@ (jhb)	2003-05-30 20:40:33 +00:00
Peter Wemm	5c980babcd	Nasty 'make it compile' port to amd64. Note that it needs some other wire protocol for the extra registers. I should probably just remove it from here for now since its quite useless. Approved by: re (amd64/* blanket)	2003-05-30 01:02:52 +00:00
Peter Wemm	5feb2148ba	Initial port to amd64 after repocopy from i386. Note that the disassembler has not been updated yet, and will do some very strange things. It does tracebacks (without function arguments due to regparm calling conventions) if -fno-omit-frame-pointer is used (to come later). This achieves basic functionality. Approved by: re (amd64/* blanket)	2003-05-30 01:01:07 +00:00
Peter Wemm	0afbc83dfd	Add setjmp/longjmp for ddb	2003-05-30 00:58:48 +00:00
Peter Wemm	5e1b7df5cf	Update AMD Features vector to include NX (page table entry no-execute bit) and LM (long mode) etc.	2003-05-27 21:59:56 +00:00
Scott Long	7e71df9339	Bring back bus_dmasync_op_t. It is now a typedef to an int, though the BUS_DMASYNC_ definitions remain as before. The does not change the ABI, and reverts the API to be a bit more compatible and flexible. This has survived a full 'make universe'. Approved by: re (bmah)	2003-05-27 04:59:59 +00:00
Scott Long	c87d464f28	De-orbit bus_dmamem_alloc_size(). It's a hack and was never used anyways. No need for it to pollute the 5.x API any further. Approved by: re (bmah)	2003-05-26 04:00:52 +00:00
Peter Wemm	3ebd9b48ce	Stop profiled libc from exploding, matching gcc's generated code. Approved by: re (amd64/* blanket)	2003-05-24 18:24:03 +00:00
Peter Wemm	d9cd1af4aa	Typo fix. oops. Submitted by: jmallett Approved by: re (blanket amd64/*)	2003-05-23 06:36:46 +00:00
Peter Wemm	cbd667fa2f	Update comments. Note that the kernel is at -1GB, not -2GB as erroniously implied by the previous commit. KVM is still only 1GB until pmap_growkernel() learns about the extra page table level. Approved by: re (blanket)	2003-05-23 06:35:45 +00:00
Peter Wemm	f229f5cf85	As suggested by the gdb folks, pad the 'struct fpreg' to a full 512 bytes to match the native fxsave/fxrstor object size since thats apparently what the Linux/NetBSD folks do.	2003-05-23 06:31:56 +00:00
Peter Wemm	9f0c4ab393	Deal with the user VM space expanding. 32 bit applications do not like having their stack at the 512GB mark. Give 4GB of user VM space for 32 bit apps. Note that this is significantly more than on i386 which gives only about 2.9GB of user VM to a process (1GB for kernel, plus page table pages which eat user VM space). Approved by: re (blanket)	2003-05-23 05:07:33 +00:00
Peter Wemm	3c9a3c9ca3	Major pmap rework to take advantage of the larger address space on amd64 systems. Of note: - Implement a direct mapped region using 2MB pages. This eliminates the need for temporary mappings when getting ptes. This supports up to 512GB of physical memory for now. This should be enough for a while. - Implement a 4-tier page table system. Most of the infrastructure is there for 128TB of userland virtual address space, but only 512GB is presently enabled due to a mystery bug somewhere. The design of this was heavily inspired by the alpha pmap.c. - The kernel is moved into the negative address space(!). - The kernel has 2GB of KVM available. - Provide a uma memory allocator to use the direct map region to take advantage of the 2MB TLBs. - Fixed some assumptions in the bus_space macros about the ability to fit virtual addresses in an 'int'. Notable missing things: - pmap_growkernel() should be able to grow to 512GB of KVM by expanding downwards below kernbase. The kernel must be at the top 2GB of the negative address space because of gcc code generation strategies. - need to fix the >512GB user vm code. Approved by: re (blanket)	2003-05-23 05:04:54 +00:00
Peter Wemm	997f3bfc2a	Merge from i386/trap.c rev 1.252. Use td_critnest instead of the spinlocks count for explicitly enabling interrupts. Approved by: re (blanket)	2003-05-22 20:09:50 +00:00
Alexander Kabaev	980ded9a7d	sys/sys/limits.h: - Fix visibilty test for LONG_BIT and WORD_BIT. `#if defined(__FOO_VISIBLE)' is alays wrong because __FOO_VISIBLE is always defined (to 0 for invisibility). sys/<arch>/include/limits.h sys/<arch>/include/_limits.h: - Style fixes. Submitted by: bde Reviewed by: bsdmike Approved by: re (scottl)	2003-05-19 20:29:07 +00:00
Peter Wemm	5c0fe26236	Actually get all the bits for sd_hibase.. it was 16 bits short. oops. Approved by: re (amd64/* blanket)	2003-05-17 02:05:10 +00:00
Alan Cox	4a0d6dfd2c	Initialize logical_cpus_mask when the logical CPUs are enumerated in the mptable. (Previously, logical_cpus_mask was only initialized if the hyperthreading fixup was executed.) Approved by: re (jhb) Reviewed by: ps	2003-05-15 05:12:24 +00:00
Peter Wemm	c0a54ff621	Collect the nastiness for preserving the kernel MSR_GSBASE around the load_gs() calls into a single place that is less likely to go wrong. Eliminate the per-process context switching of MSR_GSBASE, because it should be constant for a single cpu. Instead, save/restore it during the loading of the new %gs selector for the new process. Approved by: re (amd64/* blanket)	2003-05-15 00:23:40 +00:00
Peter Wemm	be52ef1399	Use compile time constants for things like PTmap[] etc because they're about to move outside of the +/- 2GB range Suggested by: jake Approved by: re (amd64/* blanket)	2003-05-15 00:20:17 +00:00
Peter Wemm	e14528b349	Regen Approved by: re (amd64 blanket)	2003-05-14 04:11:25 +00:00
Peter Wemm	d85631c4ac	Add BASIC i386 binary support for the amd64 kernel. This is largely stolen from the ia64/ia32 code (indeed there was a repocopy), but I've redone the MD parts and added and fixed a few essential syscalls. It is sufficient to run i386 binaries like /bin/ls, /usr/bin/id (dynamic) and p4. The ia64 code has not implemented signal delivery, so I had to do that. Before you say it, yes, this does need to go in a common place. But we're in a freeze at the moment and I didn't want to risk breaking ia64. I will sort this out after the freeze so that the common code is in a common place. On the AMD64 side, this required adding segment selector context switch support and some other support infrastructure. The %fs/%gs etc code is hairy because loading %gs will clobber the kernel's current MSR_GSBASE setting. The segment selectors are not used by the kernel, so they're only changed at context switch time or when changing modes. This still needs to be optimized. Approved by: re (amd64/* blanket)	2003-05-14 04:10:49 +00:00
Peter Wemm	5d5ca6d75e	Fix some misunderstandings about 64 bit extension. Fix fuword/suword - they're supposed to be 'long' - ie: point them at fuword64/suword64 instead of the incorrect 32 bit versions.	2003-05-14 03:38:13 +00:00
John Baldwin	90af4afacb	- Merge struct procsig with struct sigacts. - Move struct sigacts out of the u-area and malloc() it using the M_SUBPROC malloc bucket. - Add a small sigacts_*() API for managing sigacts structures: sigacts_alloc(), sigacts_free(), sigacts_copy(), sigacts_share(), and sigacts_shared(). - Remove the p_sigignore, p_sigacts, and p_sigcatch macros. - Add a mutex to struct sigacts that protects all the members of the struct. - Add sigacts locking. - Remove Giant from nosys(), kill(), killpg(), and kern_sigaction() now that sigacts is locked. - Several in-kernel functions such as psignal(), tdsignal(), trapsignal(), and thread_stopped() are now MP safe. Reviewed by: arch@ Approved by: re (rwatson)	2003-05-13 20:36:02 +00:00
Peter Wemm	8a6d52c3f8	Really stop the loader from trying to load the acpi module by lying and pretending that it is already here. Approved by: re (amd64/* stuff)	2003-05-12 18:37:56 +00:00
Peter Wemm	0fe93e7480	For the page fault handler, save %cr2 in the outer trap handler so that we do not have to run so long with interrupts disabled. This involved creating tf_addr in the trapframe. Reorganize the trap stubs so that they consistently reserve the stack space and initialize any missing bits. Approved by: re (amd64 stuff)	2003-05-12 18:33:19 +00:00
Peter Wemm	0f6241620b	Sync ucontext with reality. The struct trapframe changes need to be reflected here. Approved by: re (blanket amd64/*)	2003-05-12 18:23:04 +00:00
Peter Wemm	e9b193dc33	AMD64 physical space is much larger than i386, de-i386 the bus_space and bus_dma MD code for AMD64. (And a trivial ifdef update in dev/kbd because of this). More updates are needed here to take advantage of the 64 bit instructions. Approved by: re (blanket amd64/*)	2003-05-12 02:44:37 +00:00
Peter Wemm	bf1e897425	Give a %fs and %gs to userland. Use swapgs to obtain the kernel %GS.base value on entry and exit. This isn't as easy as it sounds because when we recursively trap or interrupt, we have to avoid duplicating the swapgs instruction or we end up back with the userland %gs. I implemented this by testing TF_CS to see if we're coming from supervisor mode already, and check for returning to supervisor. To avoid a race with interrupts in the brief period after beginning executing the handler and before the swapgs, convert all trap gates to interrupt gates, and reenable interrupts immediately after the swapgs. I am not happy with this. There are other possible ways to do this that should be investigated. (eg: storing the GS.base MSR value in the trapframe) Add some sysarch functions to let the userland code get to this. Approved by: re (blanket amd64/*)	2003-05-12 02:37:29 +00:00
Peter Wemm	85983c59cd	Call it an AMD64 Processor, not a Hammer. Also, it seems that the cpuid model numbers are wider than I first thought. Approved by: re (blanket amd64/*)	2003-05-11 23:01:04 +00:00
Peter Wemm	f75b005a99	I missed another printf format error while extracting the patch. Approved by: re (blanket amd64/*)	2003-05-11 22:55:40 +00:00
Peter Wemm	eeee69d45c	Make atdevbase long for the KERNBASE > 4GB case Approved by: re (amd64/* blanket)	2003-05-11 22:53:43 +00:00

... 3 4 5 6 7 ...

4050 Commits