freebsd-dev

Author	SHA1	Message	Date
Peter Wemm	19acc770c2	Pull the tier-2 card one last time and break the get/setcontext and sigreturn() ABI and the signal context on the stack. Make the trapframe (and its shadows in the ucontext and sigframe etc) 8 bytes larger in order to preserve 16 byte stack alignment for the following C code calls. I could have done some padding after the trapframe was saved, but some of the C code still expects an argument of 'struct trapframe'. Anyway, this gives me a spare field that can be used to store things like 'partial trapframe' status or something else in the future. The runtime impact is fairly small, except for threaded apps and things that decode contexts and the signal stack (eg: cvsup binary). Signal delivery isn't too badly affected because the kernel generates the sigframe that sigreturn uses after the handler has been called. The size of mcontext_t and struct sigframe hasn't changed. Only the last few fields (sc_eip etc) got moved a little and I eliminated a spare field. mc_len/sc_len did change location though so the sanity checks there will still trap it.	2003-10-15 02:04:52 +00:00
Alan Cox	7fb578933f	MFia64 Move uma_small_alloc() and uma_small_free() to uma_machdep.c.	2003-10-14 05:51:31 +00:00
Robert Drehmel	ea924c4cd3	Implement preliminary support for the PT_SYSCALL command to ptrace(2).	2003-10-09 10:17:16 +00:00
Bruce M Simpson	2bc7dd5661	Move pmap_resident_count() from the MD pmap.h to the MI pmap.h. Add a definition of pmap_wired_count(). Add a definition of vmspace_wired_count(). Reviewed by: truckman Discussed with: peter	2003-10-06 01:47:12 +00:00
Alan Cox	ab87e2fb83	Don't bother setting a page table page's valid field. It is unused and not setting it is consistent with other uses of VM_ALLOC_NOOBJ pages.	2003-10-05 00:12:16 +00:00
Alan Cox	566526a957	Migrate pmap_prefault() into the machine-independent virtual memory layer. A small helper function pmap_is_prefaultable() is added. This function encapsulates the few lines of pmap_prefault() that actually vary from machine to machine. Note: pmap_is_prefaultable() and pmap_mincore() have much in common. Going forward, it's worth considering their merger.	2003-10-03 22:46:53 +00:00
Alan Cox	81b460c5eb	Reimplement pagezero() using "movnti".	2003-10-02 05:08:13 +00:00
Peter Wemm	6ccf265bb0	Commit Bosko's patch to clean up the PSE/PG_G initialization and avoid problems with some Pentium 4 cpus and some older PPro/Pentium2 cpus. There are several problems, some documented in Intel errata. This patch: 1) moves the kernel to the second page in the PSE case. There is an erratum that says that you Must Not point a 4MB page at physical address zero on older cpus. We avoided bugs here due to sheer luck. 2) sets up PSE page tables right from the start in locore, rather than trying to switch from 4K to 4M (or 2M) pages part way through the boot sequence at the same time that we're messing with PG_G. For some reason, the pmap work over the last 18 months seems to tickle the problems, and the PAE infrastructure changes disturb the cpu bugs even more. A couple of people have reported a problem with APM bios calls during boot. I'll work with people to get this resolved. Obtained from: bmilekic	2003-10-01 23:46:08 +00:00
Peter Wemm	a93020d7a1	Use __register_t instead of register_t, otherwise <sys/types.h> is a prerequisite for <ucontext.h> on amd64. Oops.	2003-10-01 01:08:04 +00:00
Peter Wemm	ba5a51ea04	MFi386: Do not depend on LEAPYEAR() macro boolean values being 0 or 1. MFi386: Add quality field for timer0	2003-09-30 06:42:47 +00:00
Peter Wemm	ec548f97fc	MFi386: BURN_BRIDGES around timer0 functions	2003-09-30 06:38:11 +00:00
Jeff Roberson	3c4d5e1546	- Remove the definition for TD_SWITCHIN as it is not used. Approved by: peter	2003-09-30 04:52:24 +00:00
Alan Cox	9060731130	Eliminate the pte object.	2003-09-27 20:53:01 +00:00
Alan Cox	d2a81cdbed	MFi386 Allocate the page table directory page as "no object" pages.	2003-09-26 04:12:41 +00:00
Alan Cox	d91440cd46	MFi386 Reimplement pmap_release() such that it uses the page table rather than the pte object to locate the page table directory pages. (Temporarily, retain an assertion on the emptiness of the pte object.)	2003-09-25 05:38:18 +00:00
Peter Wemm	cc3112f108	Re-raise the default datasize and stacksize now that the 32 bit exec support can clip it to sensible values.	2003-09-25 01:11:17 +00:00
Peter Wemm	c460ac3a00	Add sysentvec->sv_fixlimits() hook so that we can catch cases on 64 bit systems where the data/stack/etc limits are too big for a 32 bit process. Move the 5 or so identical instances of ELF_RTLD_ADDR() into imgact_elf.c. Supply an ia32_fixlimits function. Export the clip/default values to sysctl under the compat.ia32 hierarchy. Have mmap(0, ...) respect the current p->p_limits[RLIMIT_DATA].rlim_max value rather than the sysctl tweakable variable. This allows mmap to place mappings at sensible locations when limits have been reduced. Have the imgact_elf.c ld-elf.so.1 placement algorithm use the same method as mmap(0, ...) now does. Note that we cannot remove all references to the sysctl tweakable maxdsiz etc variables because /etc/login.conf specifies a datasize of 'unlimited'. And that causes exec etc to fail since it can no longer find space to mmap things.	2003-09-25 01:10:26 +00:00
Yoshihiro Takahashi	33e38a2cc8	Implement the bus_space_map() function to allocate resources and initialize a bus_handle, but currently it only initializes a bus_handle.	2003-09-23 08:22:34 +00:00
Peter Wemm	725bc17312	Oops. back out last commit. The data and stack limits are used by the 32 bit binary stuff. 32 bit binaries do not like it much when the kernel tries hard to put things above the 8GB mark. I have a work-in-progress to fix this properly, but I didn't want to burn anybody with this yet.	2003-09-23 03:20:34 +00:00
Peter Wemm	705c67adc2	Fix patch transcription typo. s/IDT_BPT/IDT_BP/	2003-09-23 00:45:55 +00:00
Peter Wemm	cd3402fa66	Sync with i386 version. The quality initialization was missing and some other junk.	2003-09-23 00:18:45 +00:00
Peter Wemm	ee3ce1c29c	GC unused child variable	2003-09-23 00:04:28 +00:00
Peter Wemm	4295ddf26f	MFi386 pci_bus.c 1.102 legacyvar.h 1.4: rename nexus_pcib to legacy_pcib However, leave legacy_pcib_route_interrupt() since there is no pcibios to call.	2003-09-23 00:03:44 +00:00
Peter Wemm	da87d7e10d	Move basemem variable into global scope so that the MP startup code can refer to it for looking for tables.	2003-09-22 23:33:29 +00:00
Peter Wemm	24789c549a	Increase the default data size limit from 512MB to 8GB. Increase default stack limit from 64MB to 512MB.	2003-09-22 23:21:39 +00:00
Peter Wemm	848947c793	MFi386 rev 1.51 by scottl: make dflt_lock() always panic.	2003-09-22 23:11:42 +00:00
Peter Wemm	951b3d46b6	MFi386 rev 1.53 by scottl: Allocate the S/G list in the tag, not on the stack. This means that s/g lists can be arbitrarily long.	2003-09-22 23:10:24 +00:00
Peter Wemm	d79ddbf5de	MFi386 machdep.c rev 1.201, clock.c 1.201, clock.h 1.45 by phk: Don't initialize a TSC timecounter until we know if it is broke or not. XXX I think there is a bug in the i386 code here. init_TSC_tc() comes after: if (statclock_disable) return; ie: if you turn off the statclock interrupt, you don't get the TSC either.	2003-09-22 23:02:24 +00:00
Peter Wemm	e63f19e150	MFi386 rev 1.105 by jhb: fix comment typo	2003-09-22 22:54:14 +00:00
Peter Wemm	74a99ec4fe	MFi386 rev 1.256 by jhb: remove redundant #include <sys/sysctl.h>	2003-09-22 22:52:46 +00:00
Peter Wemm	13a27f2962	MFi386 rev 1.25 by jhb: add new MSR's and some missing older ones and APICBASE MSR constants.	2003-09-22 22:51:46 +00:00
Peter Wemm	f0c4b48689	MFi386 rev 1.55 by sam: remove unused #define BUS_DMAMAP_NSEGS	2003-09-22 22:43:21 +00:00
Peter Wemm	d10e66f073	MFi386 rev 1.37: constant-friendly bswap macros	2003-09-22 22:37:49 +00:00
Peter Wemm	5bc82d1ce1	MFi386: pci_cfgreg.h rev 1.10 by jhb/des/njl. Fix CONF1_ENABLE_MSK.	2003-09-22 22:21:21 +00:00
Peter Wemm	20e220ac68	MFCi386: trap.c rev 1.257 by bde. Don't forget to reenable interrupts for breakpoint and trace traps from usermode. Although all the setidt entries are interrupt gates on amd64, all but the trace and bpt trap entry handlers reenable interrupts after the swapgs instruction in order to simulate the trap/interrupt gate distinction. In other words, the amd64 code behaves the same way that i386 does here.	2003-09-22 22:19:59 +00:00
Peter Wemm	8848ad863b	MFi386 by jhb: add acpi_SetDefaultIntrModel();	2003-09-22 22:12:46 +00:00
Peter Wemm	76caec589f	MFi386 by jhb: use symbolic constants for the IDT entries.	2003-09-22 22:09:02 +00:00
Peter Wemm	882554f111	MFi386: machdep.c:1.570 clock.c:1.204 by bde: Quick fix for calling DELAY for ddb input in some atkbd-based console drivers. ddb must not use any normal locks but DELAY() normally calls getit() which needs clock_lock. This also removes the need for recursion on clock_lock.	2003-09-22 21:56:48 +00:00
Joerg Wunsch	9678710b1f	Mention the puc(4) glue driver in a commented-out example so the user of "dumb" PCI-based serial/parallel boards get a hint how to enable them. I wasn't sure about the ia64, pc98, powerpc, and sparc64 archs whether they'd support puc(4) or not.	2003-09-19 20:04:55 +00:00
David E. O'Brien	67193a54f0	Statically compile in sound as we don't have modules yet.	2003-09-15 22:40:00 +00:00
Alan Cox	6d66d714c7	Simplify (and micro-optimize) pmap_unuse_pt(): Only one caller, pmap_remove_pte(), passed NULL instead of the required page table page to pmap_unuse_pt(). Compute the necessary page table page in pmap_remove_pte(). Also, remove some unreachable code from pmap_remove_pte().	2003-09-13 21:57:38 +00:00
Alan Cox	b9850eb224	Add a new parameter to pmap_extract_and_hold() that is needed to eliminate Giant from vmapbuf(). Idea from: tegge	2003-09-12 07:07:49 +00:00
David E. O'Brien	3fc40c2484	Sort 'bge' correctly.	2003-09-10 18:54:59 +00:00
John Baldwin	a547af297d	Remove an XXX comment by using the per CPU mask added after this comment was added.	2003-09-10 01:36:48 +00:00
John Baldwin	f03cb48d41	Fix a typo.	2003-09-10 01:11:58 +00:00
Peter Wemm	2c25d12414	Clean up get/set_mcontext() and get/set_fpcontext(). These are operated on data structures on the kernel stack which are guaranteed to be 16 byte aligned by gcc, the amd64 ABI and __aligned(16). Ensure the tss_rsp0 initial stack pointer is 16 byte aligned in case sizeof(pcb) becomes odd at some point. This is convenient for the interrupt handler case because the ring crossing pushes cause the required odd alignment before the call to the C code. Have fast_syscall add an additional 8 bytes to ensure that the trapframe has the correct odd alignment for the call to C code. Note that there are no checks to make sure that the trapframe size is appropriate for this. This makes get/setfpcontext work properly (finally). You get a GPF in kernel mode if any of this is botched without the alignment fixup code that is apparently needed on i386.	2003-09-09 19:32:09 +00:00
Peter Wemm	df6ece387b	Turn aac back on now that it's been cleaned up for 64 bit compilation	2003-09-08 20:00:55 +00:00
Peter Wemm	292bbfd103	Argh. This file was completely out of sync with mcontext/trapframe.	2003-09-08 18:31:48 +00:00
Peter Wemm	7fe089a006	Hmm. Two copies of the mcontext...	2003-09-08 18:28:41 +00:00
Alan Cox	ba2157f218	Introduce a new pmap function, pmap_extract_and_hold(). This function atomically extracts and holds the physical page that is associated with the given pmap and virtual address. Such a function is needed to make the memory mapping optimizations used by, for example, pipes and raw disk I/O MP-safe. Reviewed by: tegge	2003-09-08 02:45:03 +00:00
Bill Paul	a94100fa9b	Take the support for the 8139C+/8169/8169S/8110S chips out of the rl(4) driver and put it in a new re(4) driver. The re(4) driver shares the if_rlreg.h file with rl(4) but is a separate module. (Ultimately I may change this. For now, it's convenient.) rl(4) has been modified so that it will never attach to an 8139C+ chip, leaving it to re(4) instead. Only re(4) has the PCI IDs to match the 8169/8169S/8110S gigE chips. if_re.c contains the same basic code that was originally bolted onto if_rl.c, with the following updates: - Added support for jumbo frames. Currently, there seems to be a limit of approximately 6200 bytes for jumbo frames on transmit. (This was determined via experimentation.) The 8169S/8110S chips apparently are limited to 7.5K frames on transmit. This may require some more work, though the framework to handle jumbo frames on RX is in place: the re_rxeof() routine will gather up frames that span multiple 2K clusters into a single mbuf list. - Fixed bug in re_txeof(): if we reap some of the TX buffers, but there are still some pending, re-arm the timer before exiting re_txeof() so that another timeout interrupt will be generated, just in case re_start() doesn't do it for us. - Handle the 'link state changed' interrupt - Fix a detach bug. If re(4) is loaded as a module, and you do tcpdump -i re0, then you do 'kldunload if_re,' the system will panic after a few seconds. This happens because ether_ifdetach() ends up calling the BPF detach code, which notices the interface is in promiscuous mode and tries to switch promisc mode off while detaching the BPF listener. This ultimately results in a call to re_ioctl() (due to SIOCSIFFLAGS), which in turn calls re_init() to handle the IFF_PROMISC flag change. Unfortunately, calling re_init() here turns the chip back on and restarts the 1-second timeout loop that drives re_tick(). By the time the timeout fires, if_re.ko has been unloaded, which results in a call to invalid code and blows up the system. To fix this, I cleared the IFF_UP flag before calling ether_ifdetach(), which stops the ioctl routine from trying to reset the chip. - Modified comments in re_rxeof() relating to the difference in RX descriptor status bit layout between the 8139C+ and the gigE chips. The layout is different because the frame length field was expanded from 12 bits to 13, and they got rid of one of the status bits to make room. - Add diagnostic code (re_diag()) to test for the case where a user has installed a broken 32-bit 8169 PCI NIC in a 64-bit slot. Some NICs have the REQ64# and ACK64# lines connected even though the board is 32-bit only (in this case, they should be pulled high). This fools the chip into doing 64-bit DMA transfers even though there is no 64-bit data path. To detect this, re_diag() puts the chip into digital loopback mode and sets the receiver to promiscuous mode, then initiates a single 64-byte packet transmission. The frame is echoed back to the host, and if the frame contents are intact, we know DMA is working correctly, otherwise we complain loudly on the console and abort the device attach. (At the moment, I don't know of any way to work around the problem other than physically modifying the board, so until/unless I can think of a software workaround, this will have to do.) - Created re(4) man page - Modified rlphy.c to allow re(4) to attach as well as rl(4). Note that this code works for the sample 8169/Marvell 88E1000 NIC that I have, but probably won't work for the 8169S/8110S chips. RealTek has sent me some sample NICs, but they haven't arrived yet. I will probably need to add an rlgphy driver to handle the on-board PHY in the 8169S/8110S (it needs special DSP initialization).	2003-09-08 02:11:25 +00:00
Peter Wemm	c896a8adbf	Oops. sizeof(long) = 8, not 4. Get the fxsave buffer inside mcontext the right size. I'm planning on possibly stealing the two 'spare' variables on either side for botched alignment correction.	2003-09-05 20:47:27 +00:00
David E. O'Brien	be8d2cbf2c	MFi386: add device ataraid, this is now separate and not pulled in by atadisk.	2003-09-03 01:24:47 +00:00
Alexander Kabaev	1d49585050	Standardize idempotency ifdefs. Consistently use _MACHINE_VARARGS_H_ symbol.	2003-09-01 03:01:45 +00:00
Alan Cox	411d10a600	Migrate the sf_buf allocator that is used by sendfile(2) and zero-copy sockets into machine-dependent files. The rationale for this migration is illustrated by the modified amd64 allocator. It uses the amd64's direct map to avoid ephemeral mappings in the kernel's address space. On an SMP, the ephemeral mappings result in an IPI for TLB shootdown for each transmitted page. Yuck. Maintainers of other 64-bit platforms with direct maps should be able to use the amd64 allocator as a reference implementation.	2003-08-29 20:04:10 +00:00
John Baldwin	729d7ffbcf	- Rename PCIx_HEADERTYPE* to PCIx_HDRTYPE* so the constants aren't so long. - Add a new PCIM_HDRTYPE constant for the field in PCIR_HDRTYPE that holds the header type. - Replace several magic numbers with appropriate constants for the header type register and a couple of PCI_FUNCMAX. - Merge to amd64 the fix to the i386 bridge code to skip devices with unknown header types. Requested by: imp (1, 2)	2003-08-28 21:22:25 +00:00
Nate Lawson	5a4d072c93	Minor style cleanups.	2003-08-28 16:30:31 +00:00
David E. O'Brien	a7b60ab26e	Fix copyright comment & FBSDID style nits. Requested by: bde	2003-08-25 09:48:48 +00:00
Alan Cox	d08ffe8451	Eliminate the last (direct) uses of vm_page_lookup() on the pte object.	2003-08-24 08:07:06 +00:00
Peter Wemm	0dda1d3887	AMD64 mtrr driver.	2003-08-23 00:27:58 +00:00
Peter Wemm	46159d1fd6	Switch to using the emulator in the common compat area. Still work-in-progress.	2003-08-23 00:04:53 +00:00
Peter Wemm	c639ca93f4	Initial sweep at dividing up the generic 32bit-on-64bit kernel support from the ia32 specific stuff. Some of this still needs to move to the MI freebsd32 area, and some needs to move to the MD area. This is still work-in-progress.	2003-08-22 23:19:02 +00:00
Warner Losh	d2c5276d96	Prefer new location of pci include files (which have only been in the tree for two or more years now), except in a few places where there's code to be compatible with older versions of FreeBSD.	2003-08-22 07:39:05 +00:00
Peter Wemm	82914097e5	Regen	2003-08-21 03:48:50 +00:00
Peter Wemm	6b59055cb8	This is too funny for words. Swap syscalls 416 and 417 around. It works better that way when sigaction() and sigreturn() do the right thing.	2003-08-21 03:48:05 +00:00
Alan Cox	2b12cfb461	- Lock the pte object when performing vm_page_grab(). - Ensure that the page table page is zero filled before adding it to the page table.	2003-08-20 05:09:55 +00:00
Gordon Tetlow	df3d69c217	Fixup the ELF branding information to point to the new home of rtld.	2003-08-17 08:08:38 +00:00
Alan Cox	365b27ea29	In pmap_copy(), since we have the page table page's physical address in hand, use PHYS_TO_VM_PAGE() rather than vm_page_lookup().	2003-08-17 04:48:21 +00:00
Marcel Moolenaar	710338e94f	In vm_thread_swap{in|out}(), remove the alpha specific conditional compilation and replace it with a call to cpu_thread_swap{in|out}(). This allows us to add similar code on ia64 without cluttering the code even more.	2003-08-16 23:15:15 +00:00
Marcel Moolenaar	26502503e5	Further cleanup <machine/cpu.h> and <machine/md_var.h>: move the MI prototypes of cpu_halt(), cpu_reset() and swi_vm() from md_var.h to cpu.h. This affects db_command.c and kern_shutdown.c. ia64: move all MD prototypes from cpu.h to md_var.h. This affects madt.c, interrupt.c and mp_machdep.c. Remove is_physical_memory(). It's not used (vm_machdep.c). alpha: the MD prototypes have been left in cpu.h with a comment that they should be there. Moving them is left for later. It was expected that the impact would be significant enough to be done in a separate commit. powerpc: MD prototypes left in cpu.h. Comment added. Suggested by: bde Tested with: make universe (pc98 incomplete)	2003-08-16 16:57:57 +00:00
Alan Cox	6700fc865c	Eliminate pmap_page_lookup() and its uses. Instead, use PHYS_TO_VM_PAGE() to convert the pte's physical address into a vm page. Reviewed by: peter	2003-08-16 03:11:33 +00:00
John Baldwin	594dfbc391	- Fix a duplicated typo. - Add a macro for the logical shift needed to extract an APIC ID from either the local APIC ICR Hi register or the APIC ID registers of the local and IO APICs.	2003-08-15 15:23:13 +00:00
Warner Losh	06b4bf3e55	Expand inline the relevant parts of src/COPYRIGHT for Matt Dillon's copyrighted files. Approved by: Matt Dillon	2003-08-12 23:24:05 +00:00
Paul Saab	77c39e17fa	Halted CPU's should not accumulate time. Reviewed by: jhb	2003-08-12 17:01:10 +00:00
Alan Cox	ba97fd8a78	Rename pmap_changebit() to pmap_clear_ptes() and remove the last parameter. The new name better reflects what the function does and how it is used. The last parameter was always FALSE. Note: In theory, gcc would perform constant propagation and dead code elimination to achieve the same effect as removing the last parameter, which is always FALSE. In practice, recent versions do not. So, there is little point in letting unused code pessimize execution.	2003-08-10 21:53:55 +00:00
Alan Cox	7fbff95c04	MFi386 1.422 & 1.423: lock page queues in pmap_insert_entry().	2003-08-08 01:52:03 +00:00
Scott Long	477327b5c5	In _bus_dmamap_load_buffer(), only count the number of bounce pages needed if they haven't been counted before. This test was omitted when bus_dmamap_load() was merged into this function, and results in the pagesneeded field growing without bounds when multiple deferrals happen. Thanks to Paul Saab for beating his head against this for a few hours =-)	2003-08-04 23:40:35 +00:00
John Baldwin	3bdbd658f1	- Since td_critnest is now initialized in MI code, it doesn't have to be set in cpu_critical_fork_exit() anymore. - As far as I can tell, cpu_thread_link() has never been used, not even when it was originally added, so remove it.	2003-08-04 20:32:45 +00:00
Alan Cox	e53f32ace5	Use kmem_alloc_nofault() rather than kmem_alloc_pageable() in pmap_mapdev(). See revision 1.140 of kern/sys_pipe.c for a detailed rationale. Submitted by: tegge	2003-08-02 19:26:09 +00:00
Peter Wemm	59cc2230c6	Fix a dumbass mistake. I had the 'set' and 'get' reversed in the fpsetround/fpgetround macro pairs.	2003-08-02 00:26:30 +00:00
Bosko Milekic	b053bc8407	Make sure that when the PV ENTRY zone is created in pmap, that it's created not only with UMA_ZONE_VM but also with UMA_ZONE_NOFREE. In the i386 case in particular, the pmap code would hook a special page allocation routine that allocated from kernel_map and not kmem_map, and so when/if the pageout daemon drained the zones, it could actually push out slabs from the PV ENTRY zone but call UMA's default page_free, which resulted in pages allocated from kernel_map being freed to kmem_map; bad. kmem_free() ignores the return value of the vm_map_delete and just returns. I'm not sure what the exact repercussions could be, but it doesn't look good. In the PAE case on i386, we also set-up a zone in pmap, so be conservative for now and make that zone also ZONE_NOFREE and ZONE_VM. Do this for the pmap zones for the other archs too, although in some cases it may not be entirely necessary. We'd rather be safe than sorry at this point. Perhaps all UMA_ZONE_VM zones should by default be also UMA_ZONE_NOFREE? May fix some of silby's crashes on the PV ENTRY zone.	2003-07-31 03:39:51 +00:00
Peter Wemm	3950c40739	KSTACK_PAGES is a global option.	2003-07-31 01:27:18 +00:00
Peter Wemm	9fb1db7bc8	Cosmetic: fix disorder of opt_kstack_pages.h include.	2003-07-31 01:26:40 +00:00
David Xu	5a92cbc206	Use PSL_KERNEL as upcall thread's initial rflags, don't use scratch user rflags.	2003-07-29 12:44:16 +00:00
Maxime Henrion	d5afecd068	- Introduce a new busdma flag BUS_DMA_ZERO to request zero'ed memory in bus_dmamem_alloc(). This is possible now that contigmalloc() supports the M_ZERO flag. - Remove the locking of Giant around calls to contigmalloc() since contigmalloc() now grabs Giant itself.	2003-07-27 13:52:10 +00:00
David E. O'Brien	56ae44c5df	Use __FBSDID(). Brought to you by: a boring talk at Ottawa Linux Symposium	2003-07-25 21:19:19 +00:00
David E. O'Brien	12ea2cfe2e	Use __FBSDID(). Brought to you by: a boring talk at OLS	2003-07-25 21:10:19 +00:00
Alan Cox	059358675e	MFi386 revision 1.416 Add vm object locking to pmap_prefault(). Note: powerpc and sparc64 do not implement this function.	2003-07-25 18:58:39 +00:00
David Xu	74bbb26b51	Align upcall stack top to an odd multiple of 8. GCC accounts for the return address in the callee function for stack alignment.	2003-07-25 00:21:37 +00:00
David Xu	c3f8e34d6b	Implement cpu_set_upcall and cpu_set_upcall_kse. Reviewed by: peter	2003-07-24 08:52:44 +00:00
David Xu	81ebc68226	Set fault address to si_addr. Reviewed by: peter	2003-07-24 08:51:22 +00:00
Peter Wemm	9e9e575b6a	Make the breakpoint instruction trap gate available to users. ptrace() needs this. Submitted by: Mark Kettenis <kettenis@chello.nl>	2003-07-23 23:20:20 +00:00
Peter Wemm	8b48b40d5e	Set the %gs base to pcb_gsbase, not pcb_fsbase. Oops. Discovered by: davidxu	2003-07-23 23:17:15 +00:00
Alan Cox	3462150083	Annotate pmap_changebit() as __always_inline. This function was written as a template that when inlined is specialized for the caller through constant value propagation and dead code elimination. Thus, the specialized code that is generated for pmap_clear_reference() et al. avoids several conditional branches inside of a loop.	2003-07-23 19:49:32 +00:00
John Baldwin	e47d4f0fc2	Use macros from apic.h when writing to the ICR to send IPIs to startup APs rather than magic numbers. Tested by: scottl	2003-07-23 19:04:28 +00:00
John Baldwin	55fb372edd	Add a new macro APIC_ICRLO_RESV_MASK that contains all of the reserved fields in the low 32 bits of the local APIC ICR register. Use this macro in place of APIC_RESV2_MASK when masking off existing bits from the ICR when writing to it to send an IPI. Tested by: scottl	2003-07-23 18:59:38 +00:00
Peter Wemm	5b9f8ddbbd	Go back to 64 bit precision for fadd/fsub/fsqrt etc. This is because on AMD64, gcc (and the ABI) expects the x87 unit to be running in 80/64 mode (not 64/53) so that it can use it for 'long double' operations. It takes the expected precision differences into account when generating code.	2003-07-22 06:50:34 +00:00
Peter Wemm	76537e43f5	Extend the machine/ieeefp.h that was inherited from i386 to support the SSE mxcsr register as well. Since gcc will intermix SSE2 and x87 FP code, the fpsetround() etc mode had better be the same. There are hooks to enable these inlines to be instantiated inside libc for non-gcc or C++ callers. (g++ doesn't like the inlines that tried to extract an integer and convert it to an enum).	2003-07-22 06:44:54 +00:00
David Xu	20a2d71332	Rename thread_siginfo to cpu_thread_siginfo. Suggested by: jhb	2003-07-15 00:11:04 +00:00
Mark Murray	c7b132c974	Protect lint(1) from a #error.	2003-07-10 18:05:02 +00:00
Peter Wemm	e95babf3a8	unifdef -DLAZY_SWITCH and start to tidy up the associated glue.	2003-07-10 01:02:59 +00:00
Peter Wemm	bf8ca114e2	Fix the VADDR() macros to use either KVADDR() or UVADDR(), depending on the implied sign extension. The single unified VADDR() macro was not able to avoid sign extending the VM_MAXUSER_ADDRESS/USRSTACK values. Be explicit about UVADDR() (positive address space) and KVADDR() (kernel negative address space) to make mistakes show up more spectacularly. Increase user VM space from 1/2TB (512GB) to 128TB.	2003-07-09 23:04:23 +00:00
Peter Wemm	6486c09935	Fix up bogus index/offset/mask calculations in the allocpte and the corresponding release code. This was preventing the use of more than 1/2TB of user VM. I also spent a week staring at this code only to eventually find that I'd mistakenly typed a P as an R.	2003-07-09 22:59:45 +00:00
Peter Wemm	4afd44c16a	Turn the 2MB page mappings that cover the kernel text+data+bss area back on now that pmap_pte() can handle it. I never actually ran into anything that broke that I know of, but this was turned off as a precaution.	2003-07-09 22:55:00 +00:00
Peter Wemm	436e1f203f	Have pmap_pte() on a 2MB mapped address return the 2MB pde itself rather than a non-existing pte. There is code elsewhere in i386/amd64 pmap that neglects to handle the large page cases because it knows that it will see PG_PS in the returned "pte".	2003-07-09 22:53:45 +00:00
Alan Cox	90a7c7b671	In pmap_object_init_pt(), the pmap_invalidate_all() should be performed on the caller-provided pmap, not the kernel_pmap. Using the kernel_pmap results in an unnecessary IPI for TLB shootdown on SMPs. Reviewed by: jake, peter	2003-07-08 19:40:35 +00:00
Alan Cox	1f78f902a8	Background: pmap_object_init_pt() premaps the pages of an object in order to avoid the overhead of later page faults. In general, it implements two cases: one for vnode-backed objects and one for device-backed objects. Only the device-backed case is really machine-dependent, belonging in the pmap. This commit moves the vnode-backed case into the (relatively) new function vm_map_pmap_enter(). On amd64 and i386, this commit only amounts to code rearrangement. On alpha and ia64, the new machine independent (MI) implementation of the vnode case is smaller and more efficient than their pmap-based implementations. (The MI implementation takes advantage of the fact that objects in -CURRENT are ordered collections of pages.) On sparc64, pmap_object_init_pt() hadn't (yet) been implemented.	2003-07-03 20:18:02 +00:00
Maxime Henrion	331e012396	Sync more things with other backends.	2003-07-01 19:16:48 +00:00
Maxime Henrion	4813f72a9b	Honor the boundary of the busdma tag when allocating bounce pages. This was fixed in revision 1.5 of alpha/alpha/busdma_machdep.c and was never fixed in other busdma backends using bounce pages.	2003-07-01 16:54:54 +00:00
Scott Long	f6b1c44d1f	Mega busdma API commit. Add two new arguments to bus_dma_tag_create(): lockfunc and lockfuncarg. Lockfunc allows a driver to provide a function for managing its locking semantics while using busdma. At the moment, this is used for the asynchronous busdma_swi and callback mechanism. Two lockfunc implementations are provided: busdma_lock_mutex() performs standard mutex operations on the mutex that is specified from lockfuncarg. dflt_lock() is a panic implementation and is defaulted to when NULL, NULL are passed to bus_dma_tag_create(). The only time that NULL, NULL should ever be used is when the driver ensures that bus_dmamap_load() will not be deferred. Drivers that do not provide their own locking can pass busdma_lock_mutex,&Giant args in order to preserve the former behaviour. sparc64 and powerpc do not provide real busdma_swi functions, so this is largely a noop on those platforms. The busdma_swi on ia64 is not properly locked yet, so warnings will be emitted on this platform when busdma callback deferrals happen. If anyone gets panics or warnings from dflt_lock() being called, please let me know right away. Reviewed by: tmm, gibbs	2003-07-01 15:52:06 +00:00
Alan Cox	dca96f1adc	- Export pmap_enter_quick() to the MI VM. This will permit the implementation of a largely MI pmap_object_init_pt() for vnode-backed objects. pmap_enter_quick() is implemented via pmap_enter() on sparc64 and powerpc. - Correct a mismatch between pmap_object_init_pt()'s prototype and its various implementations. (I plan to keep pmap_object_init_pt() as the MD hook for device-backed objects on i386 and amd64.) - Correct an error in ia64's pmap_enter_quick() and adjust its interface to match the other versions. Discussed with: marcel	2003-06-29 21:20:04 +00:00
Jeff Roberson	ab875ef896	- Construct a cpu topology map for Hyper Threading systems so that ULE may take advantage of them.	2003-06-28 22:07:42 +00:00
David Xu	b8f480ab94	Add a machine-dependent function thread_siginfo, SA signal code will use the function to construct a siginfo structure and use the result to export to userland. Reviewed by: julian	2003-06-28 06:34:08 +00:00
Scott Long	7f95801188	Catch amd64 up with the pending busdma async callback locking. Though this mechanism might change in the near future, it's best to keep everything in sync right now. Reminded by: peter	2003-06-28 06:07:06 +00:00
Peter Wemm	b6a5f89b4d	Turn ips back on.	2003-06-27 23:11:22 +00:00
Peter Wemm	1e5d8b3b66	Oops, I only added a comment about why ips doesn't compile. Actually comment it out for real.	2003-06-26 04:01:59 +00:00
Peter Wemm	ba1cabf4b9	Sync with i386 - add everything that compiles. There are a few drivers that are trivially easy to fix (eg: ips) that I've not committed fixes for.	2003-06-26 03:49:54 +00:00
Peter Wemm	2d29639ebb	Add back in the ability for pmap_mapdev() to use KVM if the region being requested is outside of the range of the direct map region. eg: for pci windows. While here, increase the minimum size of the direct map region to be 4GB instead of 1GB.	2003-06-26 01:04:31 +00:00
Alan Cox	0183359659	MFi386 Add vm object locking to pmap_object_init_pt().	2003-06-23 06:10:52 +00:00
Hidetoshi Shimokawa	e07324646e	Move KERNBASE to -2GB. Currently, we cannot increase KVA more than 2GB.	2003-06-22 13:02:45 +00:00
Hidetoshi Shimokawa	bfcd2ec739	- Allow access to direct mapped region via /dev/kmem. This makes 'netstat -r' work. - Use direct map for /dev/mem.	2003-06-22 12:59:43 +00:00
Hidetoshi Shimokawa	c1c1cc9c19	- Allocate a new PD Table if kernel grows beyond 1GB boundary. Reviewed by: peter - Use direct map in pmap_mapdev().	2003-06-22 12:55:20 +00:00
Hidetoshi Shimokawa	e14720d614	Use direct map in pmap_map(). This saves much KVA for vm_pages and you don't need to increase NKPT for large physical memory anymore. Suggested by: dfr	2003-06-20 14:09:33 +00:00
Hidetoshi Shimokawa	d25ac2fa68	Fix direct map page table for 2GB+ physical memory. You may still need to increase NKPT for larger memory. I have successfully booted 8GB system with NKPT=256.	2003-06-19 12:14:37 +00:00
Alan Cox	40ebf3e43a	Fix a performance bug in all of the various implementations of uma_small_alloc(): They always zeroed the page regardless of what the caller requested.	2003-06-18 02:57:38 +00:00
David Xu	0e2a4d3aeb	Rename P_THREADED to P_SA. P_SA means a process is using scheduler activations.	2003-06-15 00:31:24 +00:00
Alan Cox	49a2507bd1	Migrate the thread stack management functions from the machine-dependent to the machine-independent parts of the VM. At the same time, this introduces vm object locking for the non-i386 platforms. Two details: 1. KSTACK_GUARD has been removed in favor of KSTACK_GUARD_PAGES. The different machine-dependent implementations used various combinations of KSTACK_GUARD and KSTACK_GUARD_PAGES. To disable guard page, set KSTACK_GUARD_PAGES to 0. 2. Remove the (unnecessary) clearing of PG_ZERO in vm_thread_new. In 5.x, (but not 4.x,) PG_ZERO can only be set if VM_ALLOC_ZERO is passed to vm_page_alloc() or vm_page_grab().	2003-06-14 23:23:55 +00:00
Alan Cox	89f4fca265	Move the _new_altkstack() and _dispose_altkstack() functions out of the various pmap implementations into the machine-independent vm. They were all identical.	2003-06-14 06:20:25 +00:00
Peter Wemm	77e2a274d0	GC unused cpu_wait() function	2003-06-11 05:20:33 +00:00
John Baldwin	6b9691f103	- Use IDTVEC() to declare IPI handlers since they are also IDT vectors. - Make handlers for IPI's used by SMP kernels #ifdef SMP.	2003-06-06 17:45:25 +00:00
John Baldwin	e59ae32f18	- Document the thermal and performance counter LVT entries in the local APIC. - Add a lvt_thermal member to the LAPIC struct. - Add constants for the SMI and INIT LVT delivery modes.	2003-06-06 17:22:15 +00:00
Marcel Moolenaar	c2e4eb969f	Change the second (and last) argument of cpu_set_upcall(). Previously we were passing in a void* representing the PCB of the parent thread. Now we pass a pointer to the parent thread itself. The prime reason for this change is to allow cpu_set_upcall() to copy (parts of) the trapframe instead of having it done in MI code in each caller of cpu_set_upcall(). Copying the trapframe cannot always be done with a simple bcopy() or may not always be optimal that way. On ia64 specifically the trapframe contains information that is specific to an entry into the kernel and can only be used by the corresponding exit from the kernel. A trapframe copied verbatim from another frame is in most cases useless without some additional normalization. Note that this change removes the assignment to td->td_frame in some implementations of cpu_set_upcall(). The assignment is redundant. A previous call to cpu_thread_setup() already did the exact same assignment. An added benefit of removing the redundant assignment is that we can now change td_pcb without nasty side-effects. This change officially marks the ability on ia64 for 1:1 threading. Not tested on: amd64, powerpc Compile & boot tested on: alpha, sparc64 Functionally tested on: i386, ia64	2003-06-04 22:46:27 +00:00
Peter Wemm	7fc03ef474	Fix ALIGNED_POINTER(). sizeof((u_int32_t)) is not legal C.	2003-06-04 02:15:13 +00:00
Peter Wemm	babc58fd74	Fix restarted syscalls. When we rewind %rip, we also need to restore all the argument registers etc since we have almost certainly trashed them by now. Take particular care of %r10 since it held the original value of %rcx (which we saved in tf_rcx on entry and doreti doesn't know this).	2003-06-02 21:56:08 +00:00
Peter Wemm	c35518b4ed	Make this more compatible with libc_r. Make the internal types for storing registers an array of longs rather than int.	2003-06-02 21:49:35 +00:00
David E. O'Brien	006124d811	Use __FBSDID().	2003-06-02 16:32:55 +00:00
David E. O'Brien	9676a785e7	Use __FBSDID().	2003-06-02 06:43:15 +00:00
Peter Wemm	193b147c05	MFi386: i386/include/asm.h rev 1.11: Do not abuse ##.	2003-06-02 05:59:35 +00:00
David E. O'Brien	69bb404192	Use C99 compatible asm statements.	2003-06-02 00:29:35 +00:00
David E. O'Brien	713c939103	Sync with i386/GENERIC ordering.	2003-06-01 20:26:38 +00:00
Peter Wemm	395aac85f8	MFi386: rev 1.56: remove break after return	2003-05-31 22:02:11 +00:00
Peter Wemm	0c5b3efcb0	MFi386: rev 1.23: use gdb_strlen()/gdb_strcpy() directly.	2003-05-31 22:00:57 +00:00
Peter Wemm	fbbfc4c335	MFi386: rev 1.50: remove unused variable	2003-05-31 21:58:55 +00:00
Poul-Henning Kamp	618b80ddcf	Avoid unbalancing the { } count in the source file with #ifdef by putting the opening { after the #ifdef ... #endif sequence. Found by: FlexeLint	2003-05-31 20:25:53 +00:00
Peter Wemm	4af5a3de60	Add acpi to the build. Remove the hack from machdep.c that lies to the loader to shut it up.	2003-05-31 07:00:08 +00:00
Peter Wemm	b043c80645	Have hammer_time() return the proc0 stack location, and have locore switch to it before calling mi_startup(). The bootstack is WAY too small for running acpica during probe/attach. While here, pass modulep/physfree to the startup routine, rather than writing to the global variables in locore.S. Approved by: re (amd64/*)	2003-05-31 06:54:29 +00:00
Peter Wemm	5681a6f60d	Regenerate.	2003-05-31 06:51:04 +00:00
Peter Wemm	1f5b79bc16	Make this compile with WITNESS enabled. It wants the syscall names.	2003-05-31 06:49:53 +00:00
Peter Wemm	ff7bf2f72e	Port acpica to amd64. Approved by: re (amd64/* blanket)	2003-05-31 06:47:05 +00:00
Peter Wemm	cc71eb5e10	With the help of jhb, fix the ACPI_ACQUIRE_GLOBAL_LOCK() macros and port to amd64 after repocopy. Approved by: re (amd64/*)	2003-05-31 06:43:55 +00:00
Hiten Pandya	b77c32a07e	Rename BUS_DMAMEM_NOSYNC to BUS_DMA_COHERENT. The current name is confusing, because it indicates to the client that a bus_dmamap_sync() operation is not necessary when the flag is specified, which is wrong. The main purpose of this flag is to hint the underlying architecture that DMA memory should be mapped in a coherent way, but the architecture can ignore it. But if the architecture does support coherent mapping of memory, then it makes bus_dmamap_sync() calls cheap. This flag is the same as the one in NetBSD's Bus DMA. Reviewed by: gibbs, scottl, des (implicitly) Approved by: re@ (jhb)	2003-05-30 20:40:33 +00:00
Peter Wemm	5c980babcd	Nasty 'make it compile' port to amd64. Note that it needs some other wire protocol for the extra registers. I should probably just remove it from here for now since it's quite useless. Approved by: re (amd64/* blanket)	2003-05-30 01:02:52 +00:00
Peter Wemm	5feb2148ba	Initial port to amd64 after repocopy from i386. Note that the disassembler has not been updated yet, and will do some very strange things. It does tracebacks (without function arguments due to regparm calling conventions) if -fno-omit-frame-pointer is used (to come later). This achieves basic functionality. Approved by: re (amd64/* blanket)	2003-05-30 01:01:07 +00:00
Peter Wemm	0afbc83dfd	Add setjmp/longjmp for ddb	2003-05-30 00:58:48 +00:00
Peter Wemm	5e1b7df5cf	Update AMD Features vector to include NX (page table entry no-execute bit) and LM (long mode) etc.	2003-05-27 21:59:56 +00:00
Scott Long	7e71df9339	Bring back bus_dmasync_op_t. It is now a typedef to an int, though the BUS_DMASYNC_ definitions remain as before. This does not change the ABI, and reverts the API to be a bit more compatible and flexible. This has survived a full 'make universe'. Approved by: re (bmah)	2003-05-27 04:59:59 +00:00
Scott Long	c87d464f28	De-orbit bus_dmamem_alloc_size(). It's a hack and was never used anyways. No need for it to pollute the 5.x API any further. Approved by: re (bmah)	2003-05-26 04:00:52 +00:00
Peter Wemm	3ebd9b48ce	Stop profiled libc from exploding, matching gcc's generated code. Approved by: re (amd64/* blanket)	2003-05-24 18:24:03 +00:00
Peter Wemm	d9cd1af4aa	Typo fix. oops. Submitted by: jmallett Approved by: re (blanket amd64/*)	2003-05-23 06:36:46 +00:00
Peter Wemm	cbd667fa2f	Update comments. Note that the kernel is at -1GB, not -2GB as erroneously implied by the previous commit. KVM is still only 1GB until pmap_growkernel() learns about the extra page table level. Approved by: re (blanket)	2003-05-23 06:35:45 +00:00
Peter Wemm	f229f5cf85	As suggested by the gdb folks, pad the 'struct fpreg' to a full 512 bytes to match the native fxsave/fxrstor object size since that's apparently what the Linux/NetBSD folks do.	2003-05-23 06:31:56 +00:00
Peter Wemm	9f0c4ab393	Deal with the user VM space expanding. 32 bit applications do not like having their stack at the 512GB mark. Give 4GB of user VM space for 32 bit apps. Note that this is significantly more than on i386 which gives only about 2.9GB of user VM to a process (1GB for kernel, plus page table pages which eat user VM space). Approved by: re (blanket)	2003-05-23 05:07:33 +00:00
Peter Wemm	3c9a3c9ca3	Major pmap rework to take advantage of the larger address space on amd64 systems. Of note: - Implement a direct mapped region using 2MB pages. This eliminates the need for temporary mappings when getting ptes. This supports up to 512GB of physical memory for now. This should be enough for a while. - Implement a 4-tier page table system. Most of the infrastructure is there for 128TB of userland virtual address space, but only 512GB is presently enabled due to a mystery bug somewhere. The design of this was heavily inspired by the alpha pmap.c. - The kernel is moved into the negative address space(!). - The kernel has 2GB of KVM available. - Provide a uma memory allocator to use the direct map region to take advantage of the 2MB TLBs. - Fixed some assumptions in the bus_space macros about the ability to fit virtual addresses in an 'int'. Notable missing things: - pmap_growkernel() should be able to grow to 512GB of KVM by expanding downwards below kernbase. The kernel must be at the top 2GB of the negative address space because of gcc code generation strategies. - need to fix the >512GB user vm code. Approved by: re (blanket)	2003-05-23 05:04:54 +00:00
Peter Wemm	997f3bfc2a	Merge from i386/trap.c rev 1.252. Use td_critnest instead of the spinlocks count for explicitly enabling interrupts. Approved by: re (blanket)	2003-05-22 20:09:50 +00:00
Alexander Kabaev	980ded9a7d	sys/sys/limits.h: - Fix visibility test for LONG_BIT and WORD_BIT. `#if defined(__FOO_VISIBLE)' is always wrong because __FOO_VISIBLE is always defined (to 0 for invisibility). sys/<arch>/include/limits.h sys/<arch>/include/_limits.h: - Style fixes. Submitted by: bde Reviewed by: bsdmike Approved by: re (scottl)	2003-05-19 20:29:07 +00:00
Peter Wemm	5c0fe26236	Actually get all the bits for sd_hibase.. it was 16 bits short. oops. Approved by: re (amd64/* blanket)	2003-05-17 02:05:10 +00:00
Alan Cox	4a0d6dfd2c	Initialize logical_cpus_mask when the logical CPUs are enumerated in the mptable. (Previously, logical_cpus_mask was only initialized if the hyperthreading fixup was executed.) Approved by: re (jhb) Reviewed by: ps	2003-05-15 05:12:24 +00:00
Peter Wemm	c0a54ff621	Collect the nastiness for preserving the kernel MSR_GSBASE around the load_gs() calls into a single place that is less likely to go wrong. Eliminate the per-process context switching of MSR_GSBASE, because it should be constant for a single cpu. Instead, save/restore it during the loading of the new %gs selector for the new process. Approved by: re (amd64/* blanket)	2003-05-15 00:23:40 +00:00
Peter Wemm	be52ef1399	Use compile time constants for things like PTmap[] etc because they're about to move outside of the +/- 2GB range Suggested by: jake Approved by: re (amd64/* blanket)	2003-05-15 00:20:17 +00:00
Peter Wemm	e14528b349	Regen Approved by: re (amd64 blanket)	2003-05-14 04:11:25 +00:00
Peter Wemm	d85631c4ac	Add BASIC i386 binary support for the amd64 kernel. This is largely stolen from the ia64/ia32 code (indeed there was a repocopy), but I've redone the MD parts and added and fixed a few essential syscalls. It is sufficient to run i386 binaries like /bin/ls, /usr/bin/id (dynamic) and p4. The ia64 code has not implemented signal delivery, so I had to do that. Before you say it, yes, this does need to go in a common place. But we're in a freeze at the moment and I didn't want to risk breaking ia64. I will sort this out after the freeze so that the common code is in a common place. On the AMD64 side, this required adding segment selector context switch support and some other support infrastructure. The %fs/%gs etc code is hairy because loading %gs will clobber the kernel's current MSR_GSBASE setting. The segment selectors are not used by the kernel, so they're only changed at context switch time or when changing modes. This still needs to be optimized. Approved by: re (amd64/* blanket)	2003-05-14 04:10:49 +00:00
Peter Wemm	5d5ca6d75e	Fix some misunderstandings about 64 bit extension. Fix fuword/suword - they're supposed to be 'long' - ie: point them at fuword64/suword64 instead of the incorrect 32 bit versions.	2003-05-14 03:38:13 +00:00
John Baldwin	90af4afacb	- Merge struct procsig with struct sigacts. - Move struct sigacts out of the u-area and malloc() it using the M_SUBPROC malloc bucket. - Add a small sigacts_*() API for managing sigacts structures: sigacts_alloc(), sigacts_free(), sigacts_copy(), sigacts_share(), and sigacts_shared(). - Remove the p_sigignore, p_sigacts, and p_sigcatch macros. - Add a mutex to struct sigacts that protects all the members of the struct. - Add sigacts locking. - Remove Giant from nosys(), kill(), killpg(), and kern_sigaction() now that sigacts is locked. - Several in-kernel functions such as psignal(), tdsignal(), trapsignal(), and thread_stopped() are now MP safe. Reviewed by: arch@ Approved by: re (rwatson)	2003-05-13 20:36:02 +00:00
Peter Wemm	8a6d52c3f8	Really stop the loader from trying to load the acpi module by lying and pretending that it is already here. Approved by: re (amd64/* stuff)	2003-05-12 18:37:56 +00:00
Peter Wemm	0fe93e7480	For the page fault handler, save %cr2 in the outer trap handler so that we do not have to run so long with interrupts disabled. This involved creating tf_addr in the trapframe. Reorganize the trap stubs so that they consistently reserve the stack space and initialize any missing bits. Approved by: re (amd64 stuff)	2003-05-12 18:33:19 +00:00
Peter Wemm	0f6241620b	Sync ucontext with reality. The struct trapframe changes need to be reflected here. Approved by: re (blanket amd64/*)	2003-05-12 18:23:04 +00:00
Peter Wemm	e9b193dc33	AMD64 physical space is much larger than i386, de-i386 the bus_space and bus_dma MD code for AMD64. (And a trivial ifdef update in dev/kbd because of this). More updates are needed here to take advantage of the 64 bit instructions. Approved by: re (blanket amd64/*)	2003-05-12 02:44:37 +00:00
Peter Wemm	bf1e897425	Give a %fs and %gs to userland. Use swapgs to obtain the kernel %GS.base value on entry and exit. This isn't as easy as it sounds because when we recursively trap or interrupt, we have to avoid duplicating the swapgs instruction or we end up back with the userland %gs. I implemented this by testing TF_CS to see if we're coming from supervisor mode already, and check for returning to supervisor. To avoid a race with interrupts in the brief period after beginning executing the handler and before the swapgs, convert all trap gates to interrupt gates, and reenable interrupts immediately after the swapgs. I am not happy with this. There are other possible ways to do this that should be investigated. (eg: storing the GS.base MSR value in the trapframe) Add some sysarch functions to let the userland code get to this. Approved by: re (blanket amd64/*)	2003-05-12 02:37:29 +00:00
Peter Wemm	85983c59cd	Call it an AMD64 Processor, not a Hammer. Also, it seems that the cpuid model numbers are wider than I first thought. Approved by: re (blanket amd64/*)	2003-05-11 23:01:04 +00:00
Peter Wemm	f75b005a99	I missed another printf format error while extracting the patch. Approved by: re (blanket amd64/*)	2003-05-11 22:55:40 +00:00
Peter Wemm	eeee69d45c	Make atdevbase long for the KERNBASE > 4GB case Approved by: re (amd64/* blanket)	2003-05-11 22:53:43 +00:00
Peter Wemm	5a337b2589	Fix printf format errors that were undetected due to using the standard FSF compiler during early development.	2003-05-11 22:40:25 +00:00
Peter Wemm	5048926df9	Export PML4SHIFT and PDPSHIFT Approved by: re (blanket amd64/*)	2003-05-11 22:39:40 +00:00
Peter Wemm	4ce3e250ce	Since compiling natively, the compile environment has been less forgiving about silly typos. Use the correct comment sequences.	2003-05-11 22:38:54 +00:00
Peter Wemm	0fe0f2515b	Provide a fake varargs implementation for lint's benefit. This way it can see the intent of the va_* macros, even though it cannot work. Approved by: re (blanket amd64/*)	2003-05-10 00:55:15 +00:00
Peter Wemm	e1ef71de2b	Remove _ARCH_INDIRECT ifdefs. They existed for lib/msun/* on i386, which could use different versions of the math code depending on whether there was real floating point hardware or math emulation. Since the fpu is part of the core specification on amd64, there is no need for this here. Approved by: re (blanket amd64/*)	2003-05-10 00:53:34 +00:00
Peter Wemm	2e4f687a1d	bcopyb() isn't used on amd64 kernel (it only exists for i386/pcvt) Approved by: re (blanket amd64/*)	2003-05-10 00:51:29 +00:00
Peter Wemm	5826a47e9b	Finish translating i386/support.s into amd64 asm - replace bcopy etc with asm versions. This yields about a 5% kernel compile time speedup.	2003-05-10 00:49:56 +00:00
Peter Wemm	395e65aa29	Include the MXCSR initial values, based on the AMD docs. This file should really be renamed to fpu.h and npx.c to fpu.c since it's part of the core architecture on amd64 systems, not an isa 'numeric processor extension'.	2003-05-09 18:28:05 +00:00
Peter Wemm	14426b9c3b	Turn syscons on now that it works, so that anybody trying to run this can see something. Probing for keyboard still works for auto serial console mode.	2003-05-09 18:26:06 +00:00
Peter Wemm	b3f7680e49	Oops. Turn T_PAGEFLT back into an interrupt gate. It is critical that interrupts be disabled and remain disabled until %cr2 is read. Otherwise we can preempt and another process can fault, and by the time we read %cr2, we see a different process's fault address. This Greatly Confuses vm_fault() (to say the least). The i386 port has got this marked as a bug workaround for a Cyrix CPU, which is what led me astray. It's actually necessary for preemption, regardless of whether Cyrix cpus had a bug or not.	2003-05-08 08:25:51 +00:00
Peter Wemm	2dbe628162	Leave space for the 128 byte red-zone on the stack.	2003-05-08 00:13:24 +00:00
Peter Wemm	f3b234157e	#include <machine/metadata.h> was missing; add it	2003-05-08 00:12:37 +00:00
Peter Wemm	9c43b77ff5	Fix a preemption race. I was reenabling interrupts in the fast system call handler before it was safe. It was possible to lose context and for something else to clobber the PCPU scratch variable. This moves the interrupt enable way too late, but it's better safe than sorry for the moment.	2003-05-08 00:05:00 +00:00
John Baldwin	ace85d0a3c	Style nits. Approved by: re (bmah)	2003-05-07 17:21:38 +00:00
Alexander Kabaev	0eda4c08a5	Style fixes. Remove DBL_DIG, DBL_MIN, DBL_MAX and their FLT_ counterparts, they were marked for deprecation ever since SUSv1 at least. Only define ULLONG_MIN/MAX and LLONG_MAX if long long type is supported. Restore a lost comment in MI _limits.h file and remove it from sys/limits.h where it does not belong.	2003-05-04 22:13:04 +00:00
Peter Wemm	5b27e93419	Repocopy .s to .S	2003-05-03 00:21:43 +00:00
Peter Wemm	abf50ec921	I changed the numbering of the MODINFOMD_SMAP during the commit, so recognize the old number for my development boxes so I can use old loader/pxeboot for a while if I need to.	2003-05-01 04:18:02 +00:00
Peter Wemm	7f47668191	Slight reorg and added AMD64 support. A couple of the MODINFOMD_* values that were added to sparc64 and later powerpc, really should have been in the MI area. But changing that now with insufficient preparation will just cause too much pain. Move MD_FETCH() to the MI sys/linker.h file to avoid another two copies of it.	2003-05-01 03:31:18 +00:00
Peter Wemm	afa8862328	Commit MD parts of a loosely functional AMD64 port. This is based on a heavily stripped down FreeBSD/i386 (brutally stripped down actually) to attempt to get a stable base to start from. There is a lot missing still. Worth noting: - The kernel runs at 1GB in order to cheat with the pmap code. pmap uses a variation of the PAE code in order to avoid having to worry about 4 levels of page tables yet. - It boots in 64 bit "long mode" with a tiny trampoline embedded in the i386 loader. This simplifies locore.s greatly. - There are still quite a few fragments of i386-specific code that have not been translated yet, and some that I cheated and wrote dumb C versions of (bcopy etc). - It has both int 0x80 for syscalls (but using registers for argument passing, as is native on the amd64 ABI), and the 'syscall' instruction for syscalls. int 0x80 preserves all registers, 'syscall' does not. - I have tried to minimize looking at the NetBSD code, except in a couple of places (eg: to find which register they use to replace the trashed %rcx register in the syscall instruction). As a result, there is not a lot of similarity. I did look at NetBSD a few times while debugging to get some ideas about what I might have done wrong in my first attempt.	2003-05-01 01:05:25 +00:00
Peter Wemm	1e57e9eba3	Repocopy from x86_64/... to amd64/... Rename visible x86_64 references to amd64. Kill MID_MACHINE, it's a.out specific, the only platform that supports it is i386. All of the other platforms should remove it too.	2003-04-30 22:51:59 +00:00
John Baldwin	d90e753aa8	Range check the syscall number before looking it up in the syscallnames[] array. Submitted by: pho	2003-04-30 17:59:27 +00:00
Mark Murray	51da11a27a	Fix some easy, global, lint warnings. In most cases, this means making some local variables static. In a couple of cases, this means removing an unused variable.	2003-04-30 12:57:40 +00:00
Mark Murray	f17615daca	Warns fixing. Protect against inappropriate linting, and mark GCC-specific assembly code as such (in #ifdefs). Fix an easy static variable warning while I'm here.	2003-04-30 12:23:58 +00:00
Alexander Kabaev	104a9b7e3e	Deprecate machine/limits.h in favor of new sys/limits.h. Change all in-tree consumers to include <sys/limits.h> Discussed on: standards@ Partially submitted by: Craig Rodrigues <rodrigc@attbi.com>	2003-04-29 13:36:06 +00:00
Jake Burkholder	14ce5bd49b	Use inlines for loading and storing page table entries. Use cmpxchg8b for the PAE case to ensure idempotent 64 bit loads and stores. Sponsored by: DARPA, Network Associates Laboratories	2003-04-28 20:35:36 +00:00
John Baldwin	7ff022c485	- Push down Giant into the sysarch() calls that still need Giant. - Standardize on EINVAL rather than EOPNOTSUPP if the sysarch op value is invalid.	2003-04-25 20:04:02 +00:00
John Baldwin	d8ca78d02f	Regen.	2003-04-25 15:59:44 +00:00
John Baldwin	9fb3809a3a	Oops, the thr_* and jail_attach() syscall entries should be NOPROTO rather than STD.	2003-04-25 15:59:18 +00:00
Jake Burkholder	ffad008fcd	Remove harmless invalid cast. Sponsored by: DARPA, Network Associates Laboratories	2003-04-25 15:07:58 +00:00
Daniel Eischen	1328e1c4be	Add an argument to get_mcontext() which specifies whether the syscall return values should be cleared. The system calls getcontext() and swapcontext() want to return 0 on success but these contexts can be switched to at a later time so the return values need to be cleared in the saved register sets. Other callers of get_mcontext() would normally want the context without clearing the return values. Remove the i386-specific context saving from the KSE code. get_mcontext() is not i386-specific any more. Fix a bad pointer in the alpha get_mcontext() code. The context was being bcopy()'d from &td->tf_frame, but tf_frame is itself a pointer, so the thread was being copied instead. Spotted by jake. Glanced at by: jake Reviewed by: bde (months ago)	2003-04-25 01:50:30 +00:00
John Baldwin	9bc65d35f2	Regen.	2003-04-24 20:50:57 +00:00
John Baldwin	d46b3412dc	Fix the thr_create() entry by adding a trailing \. Also, sync up the MP safe flag for thr_* with the main table.	2003-04-24 20:49:46 +00:00
David Xu	e63c419732	Don't print anything for a fault at cpu_switch_load_gs. Like the other code in doreti that recovers from faults caused by invalid segment registers, silently push the error to userland.	2003-04-24 01:48:59 +00:00
Alexander Kabaev	6fd839f9c7	Add a new sys/limits.h file which in turn depends on machine/_limits.h to get actual constant values. This is in preparation for machine/limits.h retirement. Discussed on: standards@ Submitted by: Craig Rodrigues <rodrigc@attbi.com> (*) Modified by: kan	2003-04-23 21:41:59 +00:00
John Baldwin	fe8cdcae87	- Replace inline implementations of sigprocmask() with calls to kern_sigprocmask() in the various binary compatibility emulators. - Replace calls to sigsuspend(), sigaltstack(), sigaction(), and sigprocmask() that used the stackgap with calls to the corresponding kern_sig*() functions instead without using the stackgap.	2003-04-22 18:23:49 +00:00
David Xu	97637bcfb2	Move the interrupt level testing code down a bit; a cpu_switch_load_gs fault can occur while interrupts are nested.	2003-04-22 08:12:03 +00:00
David Xu	5515888875	Fix some problems with cpu_switch_load_gs. When the fault address is at cpu_switch_load_gs, the CPU is in a context switch, so don't enable interrupts. Because it is in a context switch, sched_lock is expected to be held, so don't PROC_LOCK(p) and psignal; that is a LOR. We could probably set a P_XSIGBUS-like flag in p_sflags and set TDF_ASTPENDING in td_flags, then post a SIGBUS to the process in ast() if P_XSIGBUS was set.	2003-04-22 07:45:47 +00:00
David Xu	5b70587b8a	Remove the single-threading detection code. This code really should be replaced by thread_user_enter(), but currently we don't want to enable this in trap.	2003-04-22 03:17:41 +00:00
Hidetoshi Shimokawa	092cd06fcd	Add FireWire drivers to GENERIC.	2003-04-21 16:44:05 +00:00
David Xu	6625036082	Reset pcb_gs and %gs before possibly invalidating it.	2003-04-21 15:05:05 +00:00
Bill Paul	87b4a25958	Add device driver support for the ASIX Electronics AX88172 USB 2.0 ethernet controller. The driver has been tested with the LinkSys USB200M adapter. I know for a fact that there are other devices out there with this chip but don't have all the USB vendor/device IDs. Note: I'm not sure if this will force the driver to end up in the install kernel image or not. Special magic needs to be done to exclude it to keep the boot floppies from bloating again, someone please advise.	2003-04-20 19:05:33 +00:00
David Xu	d1fc2022c3	Backout my last commit. Requested by: bde	2003-04-20 01:35:21 +00:00
David Xu	2bdf11638e	Don't return garbage in high 16 bits.	2003-04-19 02:40:39 +00:00
John Baldwin	889a6b5845	Use the proc lock to protect p_singlethread and a P_WEXIT test. This fixes a couple of potential KSE panics on non-i386 arch's that weren't holding the proc lock when calling thread_exit().	2003-04-18 20:20:00 +00:00
John Baldwin	ee6c1d2ed2	Hold the proc lock for curproc around sigonstack().	2003-04-18 20:09:04 +00:00
John Baldwin	8365f5bf7c	Remove a couple of unused symbols.	2003-04-17 22:17:28 +00:00
Maxime Henrion	0cb112309a	style(9)	2003-04-15 03:11:03 +00:00
Hidetoshi Shimokawa	c8990f0d8e	Restore delayed load support for the resource shortage case. It was missed in the previous change. Now, _bus_dmamap_load_buffer() accepts BUS_DMA_WAITOK/BUS_DMA_NOWAIT flags. Original idea from: jake	2003-04-14 13:21:40 +00:00
Hidetoshi Shimokawa	f5270431be	* Use _bus_dmamap_load_buffer() and respect maxsegsz in bus_dmamap_load(). Ignoring maxsegsz may lead to fatal data corruption for some devices. ex. SBP-2/FireWire We should apply this change to other platforms except for sparc64. MFC after: 1 week	2003-04-14 04:19:42 +00:00
David Xu	2257a44ffd	Copy %gs from current CPU not from a stale PCB backup.	2003-04-11 14:47:34 +00:00
David Xu	d8c586e73a	set_user_ldt_rv() should check for the same proc, not the same thread. This commit fixes a user LDT SMP rendezvous bug.	2003-04-11 14:45:07 +00:00
Dag-Erling Smørgrav	0da46d776b	Convert the SMP_TSC kernel option into a loader tunable. Also enable the TSC timecounter on single-CPU systems even when they are running an SMP kernel.	2003-04-10 23:07:24 +00:00
Maxime Henrion	141bacb048	Change the operation parameter of bus_dmamap_sync() from an enum to an int and redefine the BUS_DMASYNC_* constants as flags. This allows us to specify several operations in one call to bus_dmamap_sync() as in NetBSD.	2003-04-10 23:03:33 +00:00
Julian Elischer	060563ec50	Move the _oncpu entry from the KSE to the thread. The entry in the KSE still exists but its purpose will change a bit when we add the ability to lock a KSE to a cpu.	2003-04-10 17:35:44 +00:00
Wes Peters	c2ff1e1682	Add a sysctl that records and reports the CPU clock rate calculated at boot. Funny how often this trivial piece of information crops up in embedded boxen. Sponsored by: St. Bernard Software	2003-04-10 07:05:24 +00:00
Mike Barcroft	fd7a8150fb	o In struct prison, add an allprison linked list of prisons (protected by allprison_mtx), a unique prison/jail identifier field, two path fields (pr_path for reporting and pr_root vnode instance) to store the chroot() point of each jail. o Add jail_attach(2) to allow a process to bind to an existing jail. o Add change_root() to perform the chroot operation on a specified vnode. o Generalize change_dir() to accept a vnode, and move namei() calls to callers of change_dir(). o Add a new sysctl (security.jail.list) which is a group of struct xprison instances that represent a snapshot of active jails. Reviewed by: rwatson, tjr	2003-04-09 02:55:18 +00:00
Jake Burkholder	ac00210525	Remove invalid cast to vm_offset_t to avoid truncating a physical address when doing pmap_kextract on a 2MB page. Spotted by: peter Sponsored by: DARPA, Network Associates Laboratories	2003-04-08 18:22:41 +00:00
Jake Burkholder	fce2287796	Add support for bounce buffers to _bus_dmamap_load_buffer, which is the backend for bus_dmamap_load_mbuf and bus_dmamap_load_uio. - Increase MAX_BPAGES to 512. Less than this causes fxp to quickly run out of bounce pages. - Add an argument to reserve_bounce_pages indicating whether this operation should fail or be queued for later processing if we run out of memory. The EINPROGRESS return value is not handled properly by consumers of bus_dmamap_load_mbuf. - If bounce buffers are required allocate minimum 1 bounce page at map creation time. If maxsize was small previously this could get truncated to 0 and the drivers would quickly run out of bounce pages. - Fix a bug handling the return value of alloc_bounce_pages at map creation time. It returns the number of pages allocated, not 0 on success. - Use bus_addr_t for physical addresses to avoid truncation. - Assert that the map is non-null and not the no bounce map in add_bounce_pages. Sponsored by: DARPA, Network Associates Laboratories	2003-04-07 16:08:32 +00:00
Jake Burkholder	46ea68dd10	Better fix for previous previous which still allows the 4megs of kva at the top of the address space to be reclaimed. The problem is that with the APTD gone the mappable kernel address space runs right to the end of the 32 bit address space. As a max this is 0x100000000, which can't be represented in 32 bits, so we have to use ptd entry n-1 and pte offset n-1, instead of ptd entry n and pte offset 0. There's still 1 page we can't use, but we gain just under 4 megs of kva (8 megs with PAE). Sponsored by: DARPA, Network Associates Laboratories	2003-04-07 14:27:19 +00:00
Peter Wemm	c81e825f6c	Unbreak the !LAZY_SWITCH case. I #ifdef'ed too much when I added the ifdefs prior to commit and killed the same-address-space test. Submitted by: bde	2003-04-05 22:18:14 +00:00
Tor Egge	fd6d48b8e8	Add SMP_TSC option, which can be used on SMP systems where the TSCs are synchronized to reduce context switch cost.	2003-04-04 23:54:46 +00:00
Dag-Erling Smørgrav	9f45b2da8f	Define ovbcopy() as a macro which expands to the equivalent bcopy() call, to take care of the KAME IPv6 code which needs ovbcopy() because NetBSD's bcopy() doesn't handle overlap like ours. Remove all implementations of ovbcopy(). Previously, bzero was a function pointer on i386, to save a jmp to bzero_vector. Get rid of this microoptimization as it only confuses things, adds machine-dependent code to an MD header, and doesn't really save all that much. This commit does not add my pagezero() / pagecopy() code.	2003-04-04 17:29:55 +00:00
Jake Burkholder	d1d03c2b72	Bandaid fix for previous commit while I figure out why it broke. This caused crashes early in boot on i386 UP machines. Reported by: phk Pointy hat to: jake	2003-04-04 10:09:44 +00:00
Jake Burkholder	163529c2b3	- Removed APTD and associated macros, it is no longer used. BANG BANG BANG etc. Sponsored by: DARPA, Network Associates Laboratories	2003-04-03 23:44:35 +00:00
Peter Wemm	cc66ebe2a9	Commit a partial lazy thread switch mechanism for i386. It isn't as lazy as it could be and can do with some more cleanup. Currently it's under options LAZY_SWITCH. What this does is avoid %cr3 reloads for short context switches that do not involve another user process. i.e.: we can take an interrupt, switch to a kthread and return to the user without explicitly flushing the tlb. However, this isn't as exciting as it could be, the interrupt overhead is still high and too much blocks on Giant still. There are some debug sysctls, for stats and for an on/off switch. The main problem with doing this has been "what if the process that you're running on exits while we're borrowing its address space?" - in this case we use an IPI to give it a kick when we're about to reclaim the pmap. It's not compiled in unless you add the LAZY_SWITCH option. I want to fix a few more things and get some more feedback before turning it on by default. This is NOT a replacement for Bosko's lazy interrupt stuff. This was more meant for the kthread case, while his was for interrupts. Mine helps a little for interrupts, but his helps a lot more. The stats are enabled with options SWTCH_OPTIM_STATS - this has been a pseudo-option for years, I just added a bunch of stuff to it. One non-trivial change was to select a new thread before calling cpu_switch() in the first place. This allows us to catch the silly case of doing a cpu_switch() to the current process. This happens uncomfortably often. This simplifies a bit of the asm code in cpu_switch (no longer have to call choosethread() in the middle). This has been implemented on i386 and (thanks to jake) sparc64. The others will come soon. This is actually separate from the lazy switch stuff. Glanced at by: jake, jhb	2003-04-02 23:53:30 +00:00
Jake Burkholder	cef57e7624	- Make casuptr return the old value of the location we're trying to update, and change the umtx code to expect this. Reviewed by: jeff	2003-04-02 08:02:27 +00:00
Jeff Roberson	a0704f9de9	- Add thr and umtx system calls.	2003-04-01 01:15:56 +00:00
Jeff Roberson	b8db34d280	- Define a new md function 'casuptr'. This atomically compares and sets a pointer that is in user space. It will be used as the basic primitive for a kernel supported user space lock implementation. - Implement this function in x86's support.s - Provide stubs that return -1 in all other architectures. Implementations will follow along shortly. Reviewed by: jake	2003-04-01 00:18:55 +00:00
Jeff Roberson	fb8aaa76c7	- In npxgetregs() use the td argument to save the fpu state from and not curthread. Nothing currently depends on this behavior. - Clean up an extra newline. Obtained from: bde	2003-04-01 00:16:32 +00:00

3981 Commits