freebsd-skq

Author	SHA1	Message	Date
Peter Wemm	93c3f67fe7	Make this compile with PAE.	2003-11-04 01:07:04 +00:00
John Baldwin	147ad8d5ad	New i386 SMP code: - The MP code no longer knows anything specific about an MP Table. Instead, the local APIC code adds CPUs via the cpu_add() function when a local APIC is enumerated by an APIC enumerator. - Don't divide the argument to mp_bootaddress() by 1024 just so that we can turn around and multiply it by 1024 again. - We no longer panic if SMP is enabled but we are booted on a UP machine. - init_secondary(), the asm code between init_secondary() and ap_init() in mpboot.s and ap_init() have all been merged together in C into init_secondary(). - We now use the cpuid feature bits to determine if we should enable PSE, PGE, or VME on each AP. - Due to the change in the implementation of critical sections, acquire the SMP TLB mutex around a slightly larger chunk of code for TLB shootdowns. - Remove some of the debug code from the original SMP implementation that is no longer used or no longer applies to the new APIC code. - Use a temporary hack to disable the ACPI module until the SMP code has been further reorganized to allow ACPI to work as a module again. - Add a DDB command to dump the interesting contents of the IDT.	2003-11-03 22:32:04 +00:00
John Baldwin	9738024229	Don't probe PnP BIOS devices for PICs for now to avoid problems with those devices claiming resources that they don't actually use. The PIC drivers only register valid interrupt sources, so we don't need to rely on these drivers to claim invalid IRQs to prevent their use by other drivers.	2003-11-03 22:22:04 +00:00
John Baldwin	ab089945d3	Add the ACPI MADT table APIC enumerator. This code uses the ACPI Multiple APIC Descriptor Table to enumerate both I/O APICs and local APICs. ACPI does not embed PCI interrupt routing information in the MADT like the MP Table does. Instead, ACPI stores the PCI interrupt routing information in the _PRT object under each PCI bus device. The MADT table simply provides hints about which interrupt vectors map to which I/O APICs. Thus when using ACPI, the existing ACPI PCI bridge drivers are sufficient to route PCI interrupts.	2003-11-03 22:17:44 +00:00
John Baldwin	8f8914ad98	Add the MP Table APIC enumerator. This code uses the BIOS MP Table to enumerate I/O APICs as well as local APICs. It also provides Host-PCI and PCI-PCI bridge drivers to use the MP Table to route PCI interrupts.	2003-11-03 22:12:37 +00:00
John Baldwin	6f92bdd0c1	New APIC support code: - The apic interrupt entry points have been rewritten so that each entry point can serve 32 different vectors. When the entry is executed, it uses one of the 32-bit ISR registers to determine which vector in its assigned range was triggered. Thus, the apic code can support 159 different interrupt vectors with only 5 entry points. - We now always disable the local APIC to work around an erratum in certain PPros and then re-enable it again if we decide to use the APICs to route interrupts. - We no longer map IO APICs or local APICs using special page table entries. Instead, we just use pmap_mapdev(). We also no longer export the virtual address of the local APIC as a global symbol to the rest of the system, but only in local_apic.c. To aid this, the APIC ID of each CPU is exported as a per-CPU variable. - Interrupt sources are provided for each intpin on each IO APIC. Currently, each source is given a unique interrupt vector meaning that PCI interrupts are not shared on most machines with an I/O APIC. That mapping for interrupt sources to interrupt vectors is up to the APIC enumerator driver however. - We no longer probe to see if we need to use mixed mode to route IRQ 0, instead we always use mixed mode to route IRQ 0 for now. This can be disabled via the 'NO_MIXED_MODE' kernel option. - The npx(4) driver now always probes to see if a built-in FPU is present since this test can now be performed with the new APIC code. However, an SMP kernel will panic if there is more than one CPU and a built-in FPU is not found. - PCI interrupts are now properly routed when using APICs to route interrupts, so remove the hack to pseudo-route interrupts when the intpin register was read. - The apic.h header was moved to apicreg.h and a new apicvar.h header that declares the APIs used by the new APIC code was added.	2003-11-03 21:53:38 +00:00
John Baldwin	223e573bbd	Add the new atpic(4) driver for the 8259A master and slave PICs. By default we provide 16 interrupt sources for IRQs 0 through 15. However, if the I/O APIC driver has already registered sources for any of those IRQs then we will silently fail to register our own source for that IRQ. Note that i386/isa/icu.h is now specific to the 8259A and no longer contains any info relevant to APICs. Also note that fast interrupts no longer use a separate entry point. Instead, both fast and threaded interrupts share the same entry point which merely looks up the appropriate source and passes control to intr_execute_handlers().	2003-11-03 21:34:45 +00:00
John Baldwin	ecee5704ed	New device interrupt code. This defines an interrupt source abstraction that provides methods via a PIC driver to do things like mask a source, unmask a source, enable it when the first interrupt handler is added, etc. The interrupt code provides a table of interrupt sources indexed by IRQ numbers, or vectors. These vectors are what new-bus uses for its IRQ resources and for bus_setup_intr()/bus_teardown_intr(). The interrupt code then maps that vector to a given interrupt source object. When an interrupt comes in, the low-level interrupt code looks up the interrupt source for the source that triggered the interrupt and hands it off to this code to execute the appropriate handlers. By having an interrupt source abstraction, this allows us to have different types of interrupt source providers within the shared IRQ address space. For example, IRQ 0 may map to pin 0 of the master 8259A PIC, IRQs 1 through 60 may map to pins on various I/O APICs, and IRQs 120 through 128 may map to MSI interrupts for various PCI devices.	2003-11-03 21:25:52 +00:00
John Baldwin	e14243fac7	Move the NMI handling code out to its own file.	2003-11-03 21:10:17 +00:00
John Baldwin	1ab9ea3059	Include "opt_pmap.h" so that the DISABLE_P* options are honored.	2003-10-30 21:42:44 +00:00
John Baldwin	63239aa581	Always export r_gdt and r_idt and give them extern declarations in machine/segments.h.	2003-10-30 21:42:17 +00:00
Peter Wemm	fbd00896e2	MFi386: thread specific fpu state optimizations	2003-10-30 19:04:58 +00:00
Peter Wemm	3f378ea44a	MFi386: rev 1.451 (jhb): call pmap_kremove() rather than duplicate it	2003-10-30 04:08:22 +00:00
Peter Wemm	10d9b64384	MFi386: trap.c rev 1.259: fetch thread mailbox address in page fault trap	2003-10-30 04:06:28 +00:00
Peter Wemm	57e1fa205b	Oops. Remove some rather noisy debug printfs that slipped in there somehow.	2003-10-28 01:06:37 +00:00
John Baldwin	07930cce05	A few whitespace and comment tweaks.	2003-10-24 21:02:26 +00:00
Peter Wemm	cedb3695c1	Add __va_copy and make it always visible, in spite of the __ISO_C_VISIBLE setting. Make va_copy be an alias if __ISO_C_VISIBLE >= 1999. Why? more than a few ports have an autoconf that looks for __va_copy because it is available on glibc. It is critical that we use it if at all possible on amd64. It generally isn't a problem for i386 and its ilk because autoconf driven code tends to fall back to an assignment.	2003-10-24 02:50:39 +00:00
Peter Wemm	63f2bb5ff1	Use a more robust API altogether for the amd64_get_fsbase() etc functions.	2003-10-23 06:06:14 +00:00
Peter Wemm	c0432d033e	Renumber the sysarch vectors for amd64 specific syscalls so that I can implement i386 compat numbers where it makes sense. This would save a syscall translation layer. Yes, this breaks the abi slightly again, but fortunately it's just a recompile rather than tweaking the source. I will be fixing the libc stubs while I'm here.	2003-10-23 05:31:23 +00:00
Mike Silbersack	184dcdc7c8	Change all SYSCTLS which are readonly and have a related TUNABLE from CTLFLAG_RD to CTLFLAG_RDTUN so that sysctl(8) can provide more useful error messages.	2003-10-21 18:28:36 +00:00
Nate Lawson	4c3655b418	Add the cpu_idle_hook() function pointer so that other idlers can be hooked at runtime. Make C1 sleep (e.g., HLT) be the default. This prepares the way for further ACPI sleep states.	2003-10-18 22:25:07 +00:00
Bruce Evans	ed86674a3d	Don't forget to load %es with the kernel data segment selector in Xcpustop(). %es is used in at least the call to savectx() when savectx() calls bcopy(), so not loading it was fatal if a stop IPI interrupts user mode. This reduces bugs starting and stopping CPUs for debuggers. CPUs are stopped mainly in kdb_trap() and cpu_reset(). At reset time there is a good chance that all the CPUs are in the kernel, so the bug was probably harmless then.	2003-10-16 10:44:24 +00:00
Peter Wemm	19acc770c2	Pull the tier-2 card one last time and break the get/setcontext and sigreturn() ABI and the signal context on the stack. Make the trapframe (and its shadows in the ucontext and sigframe etc) 8 bytes larger in order to preserve 16 byte stack alignment for the following C code calls. I could have done some padding after the trapframe was saved, but some of the C code still expects an argument of 'struct trapframe'. Anyway, this gives me a spare field that can be used to store things like 'partial trapframe' status or something else in the future. The runtime impact is fairly small, except for threaded apps and things that decode contexts and the signal stack (eg: cvsup binary). Signal delivery isn't too badly affected because the kernel generates the sigframe that sigreturn uses after the handler has been called. The size of mcontext_t and struct sigframe hasn't changed. Only the last few fields (sc_eip etc) got moved a little and I eliminated a spare field. mc_len/sc_len did change location though so the sanity checks there will still trap it.	2003-10-15 02:04:52 +00:00
Alan Cox	7fb578933f	MFia64 Move uma_small_alloc() and uma_small_free() to uma_machdep.c.	2003-10-14 05:51:31 +00:00
Robert Drehmel	ea924c4cd3	Implement preliminary support for the PT_SYSCALL command to ptrace(2).	2003-10-09 10:17:16 +00:00
Bruce M Simpson	2bc7dd5661	Move pmap_resident_count() from the MD pmap.h to the MI pmap.h. Add a definition of pmap_wired_count(). Add a definition of vmspace_wired_count(). Reviewed by: truckman Discussed with: peter	2003-10-06 01:47:12 +00:00
Alan Cox	ab87e2fb83	Don't bother setting a page table page's valid field. It is unused and not setting it is consistent with other uses of VM_ALLOC_NOOBJ pages.	2003-10-05 00:12:16 +00:00
Alan Cox	566526a957	Migrate pmap_prefault() into the machine-independent virtual memory layer. A small helper function pmap_is_prefaultable() is added. This function encapsulates the few lines of pmap_prefault() that actually vary from machine to machine. Note: pmap_is_prefaultable() and pmap_mincore() have much in common. Going forward, it's worth considering their merger.	2003-10-03 22:46:53 +00:00
Alan Cox	81b460c5eb	Reimplement pagezero() using "movnti".	2003-10-02 05:08:13 +00:00
Peter Wemm	6ccf265bb0	Commit Bosko's patch to clean up the PSE/PG_G initialization and avoid problems with some Pentium 4 cpus and some older PPro/Pentium2 cpus. There are several problems, some documented in Intel errata. This patch: 1) moves the kernel to the second page in the PSE case. There is an erratum that says that you Must Not point a 4MB page at physical address zero on older cpus. We avoided bugs here due to sheer luck. 2) sets up PSE page tables right from the start in locore, rather than trying to switch from 4K to 4M (or 2M) pages part way through the boot sequence at the same time that we're messing with PG_G. For some reason, the pmap work over the last 18 months seems to tickle the problems, and the PAE infrastructure changes disturb the cpu bugs even more. A couple of people have reported a problem with APM bios calls during boot. I'll work with people to get this resolved. Obtained from: bmilekic	2003-10-01 23:46:08 +00:00
Peter Wemm	a93020d7a1	Use __register_t instead of register_t, otherwise <sys/types.h> is a prerequisite for <ucontext.h> on amd64. Oops.	2003-10-01 01:08:04 +00:00
Peter Wemm	ba5a51ea04	MFi386: Do not depend on LEAPYEAR() macro boolean values being 0 or 1. MFi386: Add quality field for timer0	2003-09-30 06:42:47 +00:00
Peter Wemm	ec548f97fc	MFi386: BURN_BRIDGES around timer0 functions	2003-09-30 06:38:11 +00:00
Jeff Roberson	3c4d5e1546	- Remove the definition for TD_SWITCHIN as it is not used. Approved by: peter	2003-09-30 04:52:24 +00:00
Alan Cox	9060731130	Eliminate the pte object.	2003-09-27 20:53:01 +00:00
Alan Cox	d2a81cdbed	MFi386 Allocate the page table directory page as "no object" pages.	2003-09-26 04:12:41 +00:00
Alan Cox	d91440cd46	MFi386 Reimplement pmap_release() such that it uses the page table rather than the pte object to locate the page table directory pages. (Temporarily, retain an assertion on the emptiness of the pte object.)	2003-09-25 05:38:18 +00:00
Peter Wemm	cc3112f108	Re-raise the default datasize and stacksize now that the 32 bit exec support can clip it to sensible values.	2003-09-25 01:11:17 +00:00
Peter Wemm	c460ac3a00	Add sysentvec->sv_fixlimits() hook so that we can catch cases on 64 bit systems where the data/stack/etc limits are too big for a 32 bit process. Move the 5 or so identical instances of ELF_RTLD_ADDR() into imgact_elf.c. Supply an ia32_fixlimits function. Export the clip/default values to sysctl under the compat.ia32 hierarchy. Have mmap(0, ...) respect the current p->p_limits[RLIMIT_DATA].rlim_max value rather than the sysctl tweakable variable. This allows mmap to place mappings at sensible locations when limits have been reduced. Have the imgact_elf.c ld-elf.so.1 placement algorithm use the same method as mmap(0, ...) now does. Note that we cannot remove all references to the sysctl tweakable maxdsiz etc variables because /etc/login.conf specifies a datasize of 'unlimited'. And that causes exec etc to fail since it can no longer find space to mmap things.	2003-09-25 01:10:26 +00:00
Yoshihiro Takahashi	33e38a2cc8	Implement the bus_space_map() function to allocate resources and initialize a bus_handle, but currently it only initializes a bus_handle.	2003-09-23 08:22:34 +00:00
Peter Wemm	725bc17312	Oops. back out last commit. The data and stack limits are used by the 32 bit binary stuff. 32 bit binaries do not like it much when the kernel tries hard to put things above the 8GB mark. I have a work-in-progress to fix this properly, but I didn't want to burn anybody with this yet.	2003-09-23 03:20:34 +00:00
Peter Wemm	705c67adc2	Fix patch transcription typo. s/IDT_BPT/IDT_BP/	2003-09-23 00:45:55 +00:00
Peter Wemm	cd3402fa66	Sync with i386 version. The quality initialization was missing and some other junk.	2003-09-23 00:18:45 +00:00
Peter Wemm	ee3ce1c29c	GC unused child variable	2003-09-23 00:04:28 +00:00
Peter Wemm	4295ddf26f	MFi386 pci_bus.c 1.102 legacyvar.h 1.4: rename nexus_pcib to legacy_pcib However, leave legacy_pcib_route_interrupt() since there is no pcibios to call.	2003-09-23 00:03:44 +00:00
Peter Wemm	da87d7e10d	Move basemem variable into global scope so that the MP startup code can refer to it for looking for tables.	2003-09-22 23:33:29 +00:00
Peter Wemm	24789c549a	Increase the default data size limit from 512MB to 8GB. Increase default stack limit from 64MB to 512MB.	2003-09-22 23:21:39 +00:00
Peter Wemm	848947c793	MFi386 rev 1.51 by scottl: make dflt_lock() always panic.	2003-09-22 23:11:42 +00:00
Peter Wemm	951b3d46b6	MFi386 rev 1.53 by scottl: Allocate the S/G list in the tag, not on the stack. This means that s/g lists can be arbitrarily long.	2003-09-22 23:10:24 +00:00
Peter Wemm	d79ddbf5de	MFi386 machdep.c rev 1.201, clock.c 1.201, clock.h 1.45 by phk: Don't initialize a TSC timecounter until we know if it is broken or not. XXX I think there is a bug in the i386 code here. init_TSC_tc() comes after: if (statclock_disable) return; ie: if you turn off the statclock interrupt, you don't get the TSC either.	2003-09-22 23:02:24 +00:00
Peter Wemm	e63f19e150	MFi386 rev 1.105 by jhb: fix comment typo	2003-09-22 22:54:14 +00:00
Peter Wemm	74a99ec4fe	MFi386 rev 1.256 by jhb: remove redundant #include <sys/sysctl.h>	2003-09-22 22:52:46 +00:00
Peter Wemm	13a27f2962	MFi386 rev 1.25 by jhb: add new MSR's and some missing older ones and APICBASE MSR constants.	2003-09-22 22:51:46 +00:00
Peter Wemm	f0c4b48689	MFi386 rev 1.55 by sam: remove unused #define BUS_DMAMAP_NSEGS	2003-09-22 22:43:21 +00:00
Peter Wemm	d10e66f073	MFi386 rev 1.37: constant-friendly bswap macros	2003-09-22 22:37:49 +00:00
Peter Wemm	5bc82d1ce1	MFi386: pci_cfgreg.h rev 1.10 by jhb/des/njl. Fix CONF1_ENABLE_MSK.	2003-09-22 22:21:21 +00:00
Peter Wemm	20e220ac68	MFCi386: trap.c rev 1.257 by bde. Don't forget to reenable interrupts for breakpoint and trace traps from usermode. Although all the setidt entries are interrupt gates on amd64, all but the trace and bpt trap entry handlers reenable interrupts after the swapgs instruction in order to simulate the trap/interrupt gate distinction. In other words, the amd64 code behaves the same way that i386 does here.	2003-09-22 22:19:59 +00:00
Peter Wemm	8848ad863b	MFi386 by jhb: add acpi_SetDefaultIntrModel();	2003-09-22 22:12:46 +00:00
Peter Wemm	76caec589f	MFi386 by jhb: use symbolic constants for the IDT entries.	2003-09-22 22:09:02 +00:00
Peter Wemm	882554f111	MFi386: machdep.c:1.570 clock.c:1.204 by bde: Quick fix for calling DELAY for ddb input in some atkbd-based console drivers. ddb must not use any normal locks but DELAY() normally calls getit() which needs clock_lock. This also removes the need for recursion on clock_lock.	2003-09-22 21:56:48 +00:00
Joerg Wunsch	9678710b1f	Mention the puc(4) glue driver in a commented-out example so the user of "dumb" PCI-based serial/parallel boards get a hint how to enable them. I wasn't sure about the ia64, pc98, powerpc, and sparc64 archs whether they'd support puc(4) or not.	2003-09-19 20:04:55 +00:00
David E. O'Brien	67193a54f0	Statically compile in sound as we don't have modules yet.	2003-09-15 22:40:00 +00:00
Alan Cox	6d66d714c7	Simplify (and micro-optimize) pmap_unuse_pt(): Only one caller, pmap_remove_pte(), passed NULL instead of the required page table page to pmap_unuse_pt(). Compute the necessary page table page in pmap_remove_pte(). Also, remove some unreachable code from pmap_remove_pte().	2003-09-13 21:57:38 +00:00
Alan Cox	b9850eb224	Add a new parameter to pmap_extract_and_hold() that is needed to eliminate Giant from vmapbuf(). Idea from: tegge	2003-09-12 07:07:49 +00:00
David E. O'Brien	3fc40c2484	Sort 'bge' correctly.	2003-09-10 18:54:59 +00:00
John Baldwin	a547af297d	Remove an XXX comment by using the per CPU mask added after this comment was added.	2003-09-10 01:36:48 +00:00
John Baldwin	f03cb48d41	Fix a typo.	2003-09-10 01:11:58 +00:00
Peter Wemm	2c25d12414	Clean up get/set_mcontext() and get/set_fpcontext(). These are operated on data structures on the kernel stack which are guaranteed to be 16 byte aligned by gcc, the amd64 ABI and __aligned(16). Ensure the tss_rsp0 initial stack pointer is 16 byte aligned in case sizeof(pcb) becomes odd at some point. This is convenient for the interrupt handler case because the ring crossing pushes cause the required odd alignment before the call to the C code. Have fast_syscall add an additional 8 bytes to ensure that the trapframe has the correct odd alignment for the call to C code. Note that there are no checks to make sure that the trapframe size is appropriate for this. This makes get/setfpcontext work properly (finally). You get a GPF in kernel mode if any of this is botched without the alignment fixup code that is apparently needed on i386.	2003-09-09 19:32:09 +00:00
Peter Wemm	df6ece387b	Turn aac back on now that it's been cleaned up for 64 bit compilation	2003-09-08 20:00:55 +00:00
Peter Wemm	292bbfd103	Argh. This file was completely out of sync with mcontext/trapframe.	2003-09-08 18:31:48 +00:00
Peter Wemm	7fe089a006	Hmm. Two copies of the mcontext...	2003-09-08 18:28:41 +00:00
Alan Cox	ba2157f218	Introduce a new pmap function, pmap_extract_and_hold(). This function atomically extracts and holds the physical page that is associated with the given pmap and virtual address. Such a function is needed to make the memory mapping optimizations used by, for example, pipes and raw disk I/O MP-safe. Reviewed by: tegge	2003-09-08 02:45:03 +00:00
Bill Paul	a94100fa9b	Take the support for the 8139C+/8169/8169S/8110S chips out of the rl(4) driver and put it in a new re(4) driver. The re(4) driver shares the if_rlreg.h file with rl(4) but is a separate module. (Ultimately I may change this. For now, it's convenient.) rl(4) has been modified so that it will never attach to an 8139C+ chip, leaving it to re(4) instead. Only re(4) has the PCI IDs to match the 8169/8169S/8110S gigE chips. if_re.c contains the same basic code that was originally bolted onto if_rl.c, with the following updates: - Added support for jumbo frames. Currently, there seems to be a limit of approximately 6200 bytes for jumbo frames on transmit. (This was determined via experimentation.) The 8169S/8110S chips apparently are limited to 7.5K frames on transmit. This may require some more work, though the framework to handle jumbo frames on RX is in place: the re_rxeof() routine will gather up frames that span multiple 2K clusters into a single mbuf list. - Fixed bug in re_txeof(): if we reap some of the TX buffers, but there are still some pending, re-arm the timer before exiting re_txeof() so that another timeout interrupt will be generated, just in case re_start() doesn't do it for us. - Handle the 'link state changed' interrupt - Fix a detach bug. If re(4) is loaded as a module, and you do tcpdump -i re0, then you do 'kldunload if_re,' the system will panic after a few seconds. This happens because ether_ifdetach() ends up calling the BPF detach code, which notices the interface is in promiscuous mode and tries to switch promisc mode off while detaching the BPF listener. This ultimately results in a call to re_ioctl() (due to SIOCSIFFLAGS), which in turn calls re_init() to handle the IFF_PROMISC flag change. Unfortunately, calling re_init() here turns the chip back on and restarts the 1-second timeout loop that drives re_tick(). By the time the timeout fires, if_re.ko has been unloaded, which results in a call to invalid code and blows up the system. To fix this, I cleared the IFF_UP flag before calling ether_ifdetach(), which stops the ioctl routine from trying to reset the chip. - Modified comments in re_rxeof() relating to the difference in RX descriptor status bit layout between the 8139C+ and the gigE chips. The layout is different because the frame length field was expanded from 12 bits to 13, and they got rid of one of the status bits to make room. - Add diagnostic code (re_diag()) to test for the case where a user has installed a broken 32-bit 8169 PCI NIC in a 64-bit slot. Some NICs have the REQ64# and ACK64# lines connected even though the board is 32-bit only (in this case, they should be pulled high). This fools the chip into doing 64-bit DMA transfers even though there is no 64-bit data path. To detect this, re_diag() puts the chip into digital loopback mode and sets the receiver to promiscuous mode, then initiates a single 64-byte packet transmission. The frame is echoed back to the host, and if the frame contents are intact, we know DMA is working correctly, otherwise we complain loudly on the console and abort the device attach. (At the moment, I don't know of any way to work around the problem other than physically modifying the board, so until/unless I can think of a software workaround, this will have to do.) - Created re(4) man page - Modified rlphy.c to allow re(4) to attach as well as rl(4). Note that this code works for the sample 8169/Marvell 88E1000 NIC that I have, but probably won't work for the 8169S/8110S chips. RealTek has sent me some sample NICs, but they haven't arrived yet. I will probably need to add an rlgphy driver to handle the on-board PHY in the 8169S/8110S (it needs special DSP initialization).	2003-09-08 02:11:25 +00:00
Peter Wemm	c896a8adbf	Oops. sizeof(long) = 8, not 4. Get the fxsave buffer inside mcontext the right size. I'm planning on possibly stealing the two 'spare' variables on either side for botched alignment correction.	2003-09-05 20:47:27 +00:00
David E. O'Brien	be8d2cbf2c	MFi386: add device ataraid, this is now separate and not pulled in by atadisk.	2003-09-03 01:24:47 +00:00
Alexander Kabaev	1d49585050	Standardize idempotency ifdefs. Consistently use _MACHINE_VARARGS_H_ symbol.	2003-09-01 03:01:45 +00:00
Alan Cox	411d10a600	Migrate the sf_buf allocator that is used by sendfile(2) and zero-copy sockets into machine-dependent files. The rationale for this migration is illustrated by the modified amd64 allocator. It uses the amd64's direct map to avoid ephemeral mappings in the kernel's address space. On an SMP, the ephemeral mappings result in an IPI for TLB shootdown for each transmitted page. Yuck. Maintainers of other 64-bit platforms with direct maps should be able to use the amd64 allocator as a reference implementation.	2003-08-29 20:04:10 +00:00
John Baldwin	729d7ffbcf	- Rename PCIx_HEADERTYPE* to PCIx_HDRTYPE* so the constants aren't so long. - Add a new PCIM_HDRTYPE constant for the field in PCIR_HDRTYPE that holds the header type. - Replace several magic numbers with appropriate constants for the header type register and a couple of PCI_FUNCMAX. - Merge to amd64 the fix to the i386 bridge code to skip devices with unknown header types. Requested by: imp (1, 2)	2003-08-28 21:22:25 +00:00
Nate Lawson	5a4d072c93	Minor style cleanups.	2003-08-28 16:30:31 +00:00
David E. O'Brien	a7b60ab26e	Fix copyright comment & FBSDID style nits. Requested by: bde	2003-08-25 09:48:48 +00:00
Alan Cox	d08ffe8451	Eliminate the last (direct) uses of vm_page_lookup() on the pte object.	2003-08-24 08:07:06 +00:00
Peter Wemm	0dda1d3887	AMD64 mtrr driver.	2003-08-23 00:27:58 +00:00
Peter Wemm	46159d1fd6	Switch to using the emulator in the common compat area. Still work-in-progress.	2003-08-23 00:04:53 +00:00
Peter Wemm	c639ca93f4	Initial sweep at dividing up the generic 32bit-on-64bit kernel support from the ia32 specific stuff. Some of this still needs to move to the MI freebsd32 area, and some needs to move to the MD area. This is still work-in-progress.	2003-08-22 23:19:02 +00:00
Warner Losh	d2c5276d96	Prefer new location of pci include files (which have only been in the tree for two or more years now), except in a few places where there's code to be compatible with older versions of FreeBSD.	2003-08-22 07:39:05 +00:00
Peter Wemm	82914097e5	Regen	2003-08-21 03:48:50 +00:00
Peter Wemm	6b59055cb8	This is too funny for words. Swap syscalls 416 and 417 around. It works better that way when sigaction() and sigreturn() do the right thing.	2003-08-21 03:48:05 +00:00
Alan Cox	2b12cfb461	- Lock the pte object when performing vm_page_grab(). - Ensure that the page table page is zero filled before adding it to the page table.	2003-08-20 05:09:55 +00:00
Gordon Tetlow	df3d69c217	Fixup the ELF branding information to point to the new home of rtld.	2003-08-17 08:08:38 +00:00
Alan Cox	365b27ea29	In pmap_copy(), since we have the page table page's physical address in hand, use PHYS_TO_VM_PAGE() rather than vm_page_lookup().	2003-08-17 04:48:21 +00:00
Marcel Moolenaar	710338e94f	In vm_thread_swap{in|out}(), remove the alpha specific conditional compilation and replace it with a call to cpu_thread_swap{in|out}(). This allows us to add similar code on ia64 without cluttering the code even more.	2003-08-16 23:15:15 +00:00
Marcel Moolenaar	26502503e5	Further cleanup <machine/cpu.h> and <machine/md_var.h>: move the MI prototypes of cpu_halt(), cpu_reset() and swi_vm() from md_var.h to cpu.h. This affects db_command.c and kern_shutdown.c. ia64: move all MD prototypes from cpu.h to md_var.h. This affects madt.c, interrupt.c and mp_machdep.c. Remove is_physical_memory(). It's not used (vm_machdep.c). alpha: the MD prototypes have been left in cpu.h with a comment that they should be there. Moving them is left for later. It was expected that the impact would be significant enough to be done in a separate commit. powerpc: MD prototypes left in cpu.h. Comment added. Suggested by: bde Tested with: make universe (pc98 incomplete)	2003-08-16 16:57:57 +00:00
Alan Cox	6700fc865c	Eliminate pmap_page_lookup() and its uses. Instead, use PHYS_TO_VM_PAGE() to convert the pte's physical address into a vm page. Reviewed by: peter	2003-08-16 03:11:33 +00:00
John Baldwin	594dfbc391	- Fix a duplicated typo. - Add a macro for the logical shift needed to extract an APIC ID from either the local APIC ICR Hi register or the APIC ID registers of the local and IO APICs.	2003-08-15 15:23:13 +00:00
Warner Losh	06b4bf3e55	Expand inline the relevant parts of src/COPYRIGHT for Matt Dillon's copyrighted files. Approved by: Matt Dillon	2003-08-12 23:24:05 +00:00
Paul Saab	77c39e17fa	Halted CPU's should not accumulate time. Reviewed by: jhb	2003-08-12 17:01:10 +00:00
Alan Cox	ba97fd8a78	Rename pmap_changebit() to pmap_clear_ptes() and remove the last parameter. The new name better reflects what the function does and how it is used. The last parameter was always FALSE. Note: In theory, gcc would perform constant propagation and dead code elimination to achieve the same effect as removing the last parameter, which is always FALSE. In practice, recent versions do not. So, there is little point in letting unused code pessimize execution.	2003-08-10 21:53:55 +00:00
Alan Cox	7fbff95c04	MFi386 1.422 & 1.423: lock page queues in pmap_insert_entry().	2003-08-08 01:52:03 +00:00
Scott Long	477327b5c5	In _bus_dmamap_load_buffer(), only count the number of bounce pages needed if they haven't been counted before. This test was omitted when bus_dmamap_load() was merged into this function, and results in the pagesneeded field growing without bounds when multiple deferrals happen. Thanks to Paul Saab for beating his head against this for a few hours =-)	2003-08-04 23:40:35 +00:00
John Baldwin	3bdbd658f1	- Since td_critnest is now initialized in MI code, it doesn't have to be set in cpu_critical_fork_exit() anymore. - As far as I can tell, cpu_thread_link() has never been used, not even when it was originally added, so remove it.	2003-08-04 20:32:45 +00:00
Alan Cox	e53f32ace5	Use kmem_alloc_nofault() rather than kmem_alloc_pageable() in pmap_mapdev(). See revision 1.140 of kern/sys_pipe.c for a detailed rationale. Submitted by: tegge	2003-08-02 19:26:09 +00:00
Peter Wemm	59cc2230c6	Fix a dumbass mistake. I had the 'set' and 'get' reversed in the fpsetround/fpgetround macro pairs.	2003-08-02 00:26:30 +00:00
Bosko Milekic	b053bc8407	Make sure that when the PV ENTRY zone is created in pmap, that it's created not only with UMA_ZONE_VM but also with UMA_ZONE_NOFREE. In the i386 case in particular, the pmap code would hook a special page allocation routine that allocated from kernel_map and not kmem_map, and so when/if the pageout daemon drained the zones, it could actually push out slabs from the PV ENTRY zone but call UMA's default page_free, which resulted in pages allocated from kernel_map being freed to kmem_map; bad. kmem_free() ignores the return value of the vm_map_delete and just returns. I'm not sure what the exact repercussions could be, but it doesn't look good. In the PAE case on i386, we also set-up a zone in pmap, so be conservative for now and make that zone also ZONE_NOFREE and ZONE_VM. Do this for the pmap zones for the other archs too, although in some cases it may not be entirely necessary. We'd rather be safe than sorry at this point. Perhaps all UMA_ZONE_VM zones should by default be also UMA_ZONE_NOFREE? May fix some of silby's crashes on the PV ENTRY zone.	2003-07-31 03:39:51 +00:00
Peter Wemm	3950c40739	KSTACK_PAGES is a global option.	2003-07-31 01:27:18 +00:00
Peter Wemm	9fb1db7bc8	Cosmetic: fix disorder of opt_kstack_pages.h include.	2003-07-31 01:26:40 +00:00
David Xu	5a92cbc206	Use PSL_KERNEL as upcall thread's initial rflags, don't use scratch user rflags.	2003-07-29 12:44:16 +00:00
Maxime Henrion	d5afecd068	- Introduce a new busdma flag BUS_DMA_ZERO to request for zero'ed memory in bus_dmamem_alloc(). This is possible now that contigmalloc() supports the M_ZERO flag. - Remove the locking of Giant around calls to contigmalloc() since contigmalloc() now grabs Giant itself.	2003-07-27 13:52:10 +00:00
David E. O'Brien	56ae44c5df	Use __FBSDID(). Brought to you by: a boring talk at Ottawa Linux Symposium	2003-07-25 21:19:19 +00:00
David E. O'Brien	12ea2cfe2e	Use __FBSDID(). Brought to you by: a boring talk at OLS	2003-07-25 21:10:19 +00:00
Alan Cox	059358675e	MFi386 revision 1.416 Add vm object locking to pmap_prefault(). Note: powerpc and sparc64 do not implement this function.	2003-07-25 18:58:39 +00:00
David Xu	74bbb26b51	Align upcall stack top to an odd multiple of 8. GCC accounts for the return address in the callee function for stack alignment.	2003-07-25 00:21:37 +00:00
David Xu	c3f8e34d6b	Implement cpu_set_upcall and cpu_set_upcall_kse. Reviewed by: peter	2003-07-24 08:52:44 +00:00
David Xu	81ebc68226	Set fault address to si_addr. Reviewed by: peter	2003-07-24 08:51:22 +00:00
Peter Wemm	9e9e575b6a	Make the breakpoint instruction trap gate available to users. ptrace() needs this. Submitted by: Mark Kettenis <kettenis@chello.nl>	2003-07-23 23:20:20 +00:00
Peter Wemm	8b48b40d5e	Set the %gs base to pcb_gsbase, not pcb_fsbase. Oops. Discovered by: davidxu	2003-07-23 23:17:15 +00:00
Alan Cox	3462150083	Annotate pmap_changebit() as __always_inline. This function was written as a template that when inlined is specialized for the caller through constant value propagation and dead code elimination. Thus, the specialized code that is generated for pmap_clear_reference() et al. avoids several conditional branches inside of a loop.	2003-07-23 19:49:32 +00:00
John Baldwin	e47d4f0fc2	Use macros from apic.h when writing to the ICR to send IPIs to startup APs rather than magic numbers. Tested by: scottl	2003-07-23 19:04:28 +00:00
John Baldwin	55fb372edd	Add a new macro APIC_ICRLO_RESV_MASK that contains all of the reserved fields in the low 32 bits of the local APIC ICR register. Use this macro in place of APIC_RESV2_MASK when masking off existing bits from the ICR when writing to it to send an IPI. Tested by: scottl	2003-07-23 18:59:38 +00:00
Peter Wemm	5b9f8ddbbd	Go back to 64 bit precision for fadd/fsub/fsqrt etc. This is because on AMD64, gcc (and the ABI) expects the x87 unit to be running in 80/64 mode (not 64/53) so that it can use it for 'long double' operations. It takes the expected precision differences into account when generating code.	2003-07-22 06:50:34 +00:00
Peter Wemm	76537e43f5	Extend the machine/ieeefp.h that was inherited from i386 to support the SSE mxcsr register as well. Since gcc will intermix SSE2 and x87 FP code, the fpsetround() etc mode had better be the same. There are hooks to enable these inlines to be instantiated inside libc for non-gcc or C++ callers. (g++ doesn't like the inlines that tried to extract an integer and convert it to an enum).	2003-07-22 06:44:54 +00:00
David Xu	20a2d71332	Rename thread_siginfo to cpu_thread_siginfo. Suggested by: jhb	2003-07-15 00:11:04 +00:00
Mark Murray	c7b132c974	Protect lint(1) from a #error.	2003-07-10 18:05:02 +00:00
Peter Wemm	e95babf3a8	unifdef -DLAZY_SWITCH and start to tidy up the associated glue.	2003-07-10 01:02:59 +00:00
Peter Wemm	bf8ca114e2	Fix the VADDR() macros to use either KVADDR() or UVADDR(), depending on the implied sign extension. The single unified VADDR() macro was not able to avoid sign extending the VM_MAXUSER_ADDRESS/USRSTACK values. Be explicit about UVADDR() (positive address space) and KVADDR() (kernel negative address space) to make mistakes show up more spectacularly. Increase user VM space from 1/2TB (512GB) to 128TB.	2003-07-09 23:04:23 +00:00
Peter Wemm	6486c09935	Fix up bogus index/offset/mask calculations in the allocpte and the corresponding release code. This was preventing the use of more than 1/2TB of user VM. I also spent a week staring at this code only to eventually find that I'd mistakenly typed a P as an R.	2003-07-09 22:59:45 +00:00
Peter Wemm	4afd44c16a	Turn the 2MB page mappings that cover the kernel text+data+bss area back on now that pmap_pte() can handle it. I never actually ran into anything that broke that I know of, but this was turned off as a precaution.	2003-07-09 22:55:00 +00:00
Peter Wemm	436e1f203f	Have pmap_pte() on a 2MB mapped address return the 2MB pde itself rather than a non-existing pte. There is code elsewhere in i386/amd64 pmap that neglects to handle the large page cases because it knows that it will see PG_PS in the returned "pte".	2003-07-09 22:53:45 +00:00
Alan Cox	90a7c7b671	In pmap_object_init_pt(), the pmap_invalidate_all() should be performed on the caller-provided pmap, not the kernel_pmap. Using the kernel_pmap results in an unnecessary IPI for TLB shootdown on SMPs. Reviewed by: jake, peter	2003-07-08 19:40:35 +00:00
Alan Cox	1f78f902a8	Background: pmap_object_init_pt() premaps the pages of an object in order to avoid the overhead of later page faults. In general, it implements two cases: one for vnode-backed objects and one for device-backed objects. Only the device-backed case is really machine-dependent, belonging in the pmap. This commit moves the vnode-backed case into the (relatively) new function vm_map_pmap_enter(). On amd64 and i386, this commit only amounts to code rearrangement. On alpha and ia64, the new machine independent (MI) implementation of the vnode case is smaller and more efficient than their pmap-based implementations. (The MI implementation takes advantage of the fact that objects in -CURRENT are ordered collections of pages.) On sparc64, pmap_object_init_pt() hadn't (yet) been implemented.	2003-07-03 20:18:02 +00:00
Maxime Henrion	331e012396	Sync more things with other backends.	2003-07-01 19:16:48 +00:00
Maxime Henrion	4813f72a9b	Honor the boundary of the busdma tag when allocating bounce pages. This was fixed in revision 1.5 of alpha/alpha/busdma_machdep.c and was never fixed in other busdma backends using bounce pages.	2003-07-01 16:54:54 +00:00
Scott Long	f6b1c44d1f	Mega busdma API commit. Add two new arguments to bus_dma_tag_create(): lockfunc and lockfuncarg. Lockfunc allows a driver to provide a function for managing its locking semantics while using busdma. At the moment, this is used for the asynchronous busdma_swi and callback mechanism. Two lockfunc implementations are provided: busdma_lock_mutex() performs standard mutex operations on the mutex that is specified from lockfuncarg. dflt_lock() is a panic implementation and is defaulted to when NULL, NULL are passed to bus_dma_tag_create(). The only time that NULL, NULL should ever be used is when the driver ensures that bus_dmamap_load() will not be deferred. Drivers that do not provide their own locking can pass busdma_lock_mutex,&Giant args in order to preserve the former behaviour. sparc64 and powerpc do not provide real busdma_swi functions, so this is largely a noop on those platforms. The busdma_swi on ia64 is not properly locked yet, so warnings will be emitted on this platform when busdma callback deferrals happen. If anyone gets panics or warnings from dflt_lock() being called, please let me know right away. Reviewed by: tmm, gibbs	2003-07-01 15:52:06 +00:00
Alan Cox	dca96f1adc	- Export pmap_enter_quick() to the MI VM. This will permit the implementation of a largely MI pmap_object_init_pt() for vnode-backed objects. pmap_enter_quick() is implemented via pmap_enter() on sparc64 and powerpc. - Correct a mismatch between pmap_object_init_pt()'s prototype and its various implementations. (I plan to keep pmap_object_init_pt() as the MD hook for device-backed objects on i386 and amd64.) - Correct an error in ia64's pmap_enter_quick() and adjust its interface to match the other versions. Discussed with: marcel	2003-06-29 21:20:04 +00:00
Jeff Roberson	ab875ef896	- Construct a cpu topology map for Hyper Threading systems so that ULE may take advantage of them.	2003-06-28 22:07:42 +00:00
David Xu	b8f480ab94	Add a machine-dependent function thread_siginfo. SA signal code will use the function to construct a siginfo structure and use the result to export to userland. Reviewed by: julian	2003-06-28 06:34:08 +00:00
Scott Long	7f95801188	Catch amd64 up with the pending busdma async callback locking. Though this mechanism might change in the near future, it's best to keep everything in sync right now. Reminded by: peter	2003-06-28 06:07:06 +00:00
Peter Wemm	b6a5f89b4d	Turn ips back on.	2003-06-27 23:11:22 +00:00
Peter Wemm	1e5d8b3b66	Oops, I only added a comment about why ips doesn't compile. Actually comment it out for real.	2003-06-26 04:01:59 +00:00
Peter Wemm	ba1cabf4b9	Sync with i386 - add everything that compiles. There are a few drivers that are trivially easy to fix (eg: ips) that I've not committed fixes for.	2003-06-26 03:49:54 +00:00
Peter Wemm	2d29639ebb	Add back in the ability for pmap_mapdev() to use KVM if the region being requested is outside of the range of the direct map region. eg: for pci windows. While here, increase the minimum size of the direct map region to be 4GB instead of 1GB.	2003-06-26 01:04:31 +00:00
Alan Cox	0183359659	MFi386 Add vm object locking to pmap_object_init_pt().	2003-06-23 06:10:52 +00:00
Hidetoshi Shimokawa	e07324646e	Move KERNBASE to -2GB. Currently, we cannot increase KVA more than 2GB.	2003-06-22 13:02:45 +00:00
Hidetoshi Shimokawa	bfcd2ec739	- Allow access to direct mapped region via /dev/kmem. This makes 'netstat -r' work. - Use direct map for /dev/mem.	2003-06-22 12:59:43 +00:00
Hidetoshi Shimokawa	c1c1cc9c19	- Allocate a new PD Table if kernel grows beyond 1GB boundary. Reviewed by: peter - Use direct map in pmap_mapdev().	2003-06-22 12:55:20 +00:00
Hidetoshi Shimokawa	e14720d614	Use direct map in pmap_map(). This saves much KVA for vm_pages and you don't need to increase NKPT for large physical memory anymore. Suggested by: dfr	2003-06-20 14:09:33 +00:00
Hidetoshi Shimokawa	d25ac2fa68	Fix direct map page table for 2GB+ physical memory. You may still need to increase NKPT for larger memory. I have successfully booted 8GB system with NKPT=256.	2003-06-19 12:14:37 +00:00
Alan Cox	40ebf3e43a	Fix a performance bug in all of the various implementations of uma_small_alloc(): They always zeroed the page regardless of what the caller requested.	2003-06-18 02:57:38 +00:00
David Xu	0e2a4d3aeb	Rename P_THREADED to P_SA. P_SA means a process is using scheduler activations.	2003-06-15 00:31:24 +00:00
Alan Cox	49a2507bd1	Migrate the thread stack management functions from the machine-dependent to the machine-independent parts of the VM. At the same time, this introduces vm object locking for the non-i386 platforms. Two details: 1. KSTACK_GUARD has been removed in favor of KSTACK_GUARD_PAGES. The different machine-dependent implementations used various combinations of KSTACK_GUARD and KSTACK_GUARD_PAGES. To disable guard page, set KSTACK_GUARD_PAGES to 0. 2. Remove the (unnecessary) clearing of PG_ZERO in vm_thread_new. In 5.x, (but not 4.x,) PG_ZERO can only be set if VM_ALLOC_ZERO is passed to vm_page_alloc() or vm_page_grab().	2003-06-14 23:23:55 +00:00
Alan Cox	89f4fca265	Move the _new_altkstack() and _dispose_altkstack() functions out of the various pmap implementations into the machine-independent vm. They were all identical.	2003-06-14 06:20:25 +00:00

3903 Commits