freebsd-skq

Author	SHA1	Message	Date
obrien	02a4f42b9a	Use __FBSDID(). Brought to you by: a boring talk at Ottawa Linux Symposium	2003-07-25 21:19:19 +00:00
obrien	cdd3abf3e4	Use __FBSDID(). Brought to you by: a boring talk at OLS	2003-07-25 21:10:19 +00:00
alc	bc3b454720	MFi386 revision 1.416 Add vm object locking to pmap_prefault(). Note: powerpc and sparc64 do not implement this function.	2003-07-25 18:58:39 +00:00
davidxu	1654706adb	Align upcall stack top to odd times of 8. GCC accounts return address in callee function for stack alignment.	2003-07-25 00:21:37 +00:00
davidxu	28420f22f7	Implement cpu_set_upcall and cpu_set_upcall_kse. Reviewed by: peter	2003-07-24 08:52:44 +00:00
davidxu	181093ade7	Set fault address to si_addr. Reviewed by: peter	2003-07-24 08:51:22 +00:00
peter	c43dfc354a	Make the breakpoint instruction trap gate available to users. ptrace() needs this. Submitted by: Mark Kettenis <kettenis@chello.nl>	2003-07-23 23:20:20 +00:00
peter	65f0b759da	Set the %gs base to pcb_gsbase, not pcb_fsbase. Oops. Discovered by: davidxu	2003-07-23 23:17:15 +00:00
alc	2e85aa2ad2	Annotate pmap_changebit() as __always_inline. This function was written as a template that when inlined is specialized for the caller through constant value propagation and dead code elimination. Thus, the specialized code that is generated for pmap_clear_reference() et al. avoids several conditional branches inside of a loop.	2003-07-23 19:49:32 +00:00
jhb	318967d8e2	Use macros from apic.h to when writing to the ICR to send IPIs to startup APs rather than magic numbers. Tested by: scottl	2003-07-23 19:04:28 +00:00
jhb	1ab48551b3	Add a new macro APIC_ICRLO_RESV_MASK that contains all of the reserved fields in the low 32 bits of the local APIC ICR register. Use this macro in place of APIC_RESV2_MASK when masking off existing bits from the ICR when writing to it to send an IPI. Tested by: scottl	2003-07-23 18:59:38 +00:00
peter	d7566f1c0d	Go back to 64 bit precision for fadd/fsub/fsqrt etc. This is because on AMD64, gcc (and the ABI) expects the x87 unit to be running in 80/64 mode (not 64/53) so that it can use it for 'long double' operations. It takes the expected precision differences into account when generating code.	2003-07-22 06:50:34 +00:00
peter	576883ab15	Extend the machine/ieeefp.h that was inherited from i386 to support the SSE mxcsr register as well. Since gcc will intermix SSE2 and x87 FP code, the fpsetround() etc mode had better be the same. There are hooks to enable these inlines to be instantiated inside libc for non-gcc or C++ callers. (g++ doesn't like the inlines that tried to extract an integer and convert it to an enum).	2003-07-22 06:44:54 +00:00
davidxu	720b97177f	Rename thread_siginfo to cpu_thread_siginfo. Suggested by: jhb	2003-07-15 00:11:04 +00:00
markm	2184143037	Protect lint(1) from a #error.	2003-07-10 18:05:02 +00:00
peter	fb79192cce	unifdef -DLAZY_SWITCH and start to tidy up the associated glue.	2003-07-10 01:02:59 +00:00
peter	9fc6da3f4c	Fix the VADDR() macros to use either KVADDR() or UVADDR(), depending on the implied sign extension. The single unified VADDR() macro was not able to avoid sign extending the VM_MAXUSER_ADDRESS/USRSTACK values. Be explicit about UVADDR() (positive address space) and KVADDR() (kernel negative address space) to make mistakes show up more spectacularly. Increase user VM space from 1/2TB (512GB) to 128TB.	2003-07-09 23:04:23 +00:00
peter	b1f1716f2a	Fix up bogus index/offset/mask calculations in the allocpte and the corresponding release code. This was preventing the use of more than 1/2TB of user VM. I also spent a week staring at this code only to eventually find that I'd mistakenly typed a P as an R.	2003-07-09 22:59:45 +00:00
peter	5ca42e0d4a	Turn the 2MB page mappings that cover the kernel text+data+bss area back on now that pmap_pte() can handle it. I never actually ran into anything that broke that I know of, but this was turned off as a precaution.	2003-07-09 22:55:00 +00:00
peter	26770348b8	Have pmap_pte() on a 2MB mapped address return the 2MB pde itself rather than a non-existing pte. There is code elsewhere in i386/amd64 pmap that neglects to handle the large page cases because it knows that it will see PG_PS in the returned "pte".	2003-07-09 22:53:45 +00:00
alc	3d2e5159d9	In pmap_object_init_pt(), the pmap_invalidate_all() should be performed on the caller-provided pmap, not the kernel_pmap. Using the kernel_pmap results in an unnecessary IPI for TLB shootdown on SMPs. Reviewed by: jake, peter	2003-07-08 19:40:35 +00:00
alc	0699f7e17f	Background: pmap_object_init_pt() premaps the pages of a object in order to avoid the overhead of later page faults. In general, it implements two cases: one for vnode-backed objects and one for device-backed objects. Only the device-backed case is really machine-dependent, belonging in the pmap. This commit moves the vnode-backed case into the (relatively) new function vm_map_pmap_enter(). On amd64 and i386, this commit only amounts to code rearrangement. On alpha and ia64, the new machine independent (MI) implementation of the vnode case is smaller and more efficient than their pmap-based implementations. (The MI implementation takes advantage of the fact that objects in -CURRENT are ordered collections of pages.) On sparc64, pmap_object_init_pt() hadn't (yet) been implemented.	2003-07-03 20:18:02 +00:00
mux	e95ec1f864	Sync more things with other backends.	2003-07-01 19:16:48 +00:00
mux	152160211a	Honor the boundary of the busdma tag when allocating bounce pages. This was fixed in revision 1.5 of alpha/alpha/busdma_machdep.c and was never fixed in other busdma backends using bounce pages.	2003-07-01 16:54:54 +00:00
scottl	4d495abb9d	Mega busdma API commit. Add two new arguments to bus_dma_tag_create(): lockfunc and lockfuncarg. Lockfunc allows a driver to provide a function for managing its locking semantics while using busdma. At the moment, this is used for the asynchronous busdma_swi and callback mechanism. Two lockfunc implementations are provided: busdma_lock_mutex() performs standard mutex operations on the mutex that is specified from lockfuncarg. dftl_lock() is a panic implementation and is defaulted to when NULL, NULL are passed to bus_dma_tag_create(). The only time that NULL, NULL should ever be used is when the driver ensures that bus_dmamap_load() will not be deferred. Drivers that do not provide their own locking can pass busdma_lock_mutex,&Giant args in order to preserve the former behaviour. sparc64 and powerpc do not provide real busdma_swi functions, so this is largely a noop on those platforms. The busdma_swi on is64 is not properly locked yet, so warnings will be emitted on this platform when busdma callback deferrals happen. If anyone gets panics or warnings from dflt_lock() being called, please let me know right away. Reviewed by: tmm, gibbs	2003-07-01 15:52:06 +00:00
alc	44509f207f	- Export pmap_enter_quick() to the MI VM. This will permit the implementation of a largely MI pmap_object_init_pt() for vnode-backed objects. pmap_enter_quick() is implemented via pmap_enter() on sparc64 and powerpc. - Correct a mismatch between pmap_object_init_pt()'s prototype and its various implementations. (I plan to keep pmap_object_init_pt() as the MD hook for device-backed objects on i386 and amd64.) - Correct an error in ia64's pmap_enter_quick() and adjust its interface to match the other versions. Discussed with: marcel	2003-06-29 21:20:04 +00:00
jeff	a92e8f57c5	- Construct a cpu topology map for Hyper Threading systems so that ULE may take advantage of them.	2003-06-28 22:07:42 +00:00
davidxu	bb3ae5a363	Add a machine depended function thread_siginfo, SA signal code will use the function to construct a siginfo structure and use the result to export to userland. Reviewed by: julian	2003-06-28 06:34:08 +00:00
scottl	6a4473efe5	Catch amd64 up with the pending busdma async callback locking. Though this mechanism might change in the near future, it's best to keep everything in sync right now. Reminded by: peter	2003-06-28 06:07:06 +00:00
peter	b851e36619	Turn ips back on.	2003-06-27 23:11:22 +00:00
peter	b29d321359	Oops, I only added a comment about why ips doesn't compile. Actually comment it out for real.	2003-06-26 04:01:59 +00:00
peter	5d98ddee25	Sync with i386 - add everything that compiles. There are a few drivers that are trivially easy to fix (eg: ips) that I've not committed fixes for.	2003-06-26 03:49:54 +00:00
peter	8a3793da9b	Add back in the ability for pmap_mapdev() to use KVM if the region being requested is outside of the range of the direct map region. eg: for pci windows. While here, increase the minimum size of the direct map region to be 4GB instead of 1GB.	2003-06-26 01:04:31 +00:00
alc	918432075e	MFi386 Add vm object locking to pmap_object_init_pt().	2003-06-23 06:10:52 +00:00
simokawa	ab870d6327	Move KERNBASE to -2GB. Currently, we cannot increase KVA more than 2GB.	2003-06-22 13:02:45 +00:00
simokawa	4287eff159	- Allow access to direct mapped region via /dev/kmem. This makes 'netstat -r' work. - Use direct map for /dev/mem.	2003-06-22 12:59:43 +00:00
simokawa	cc83270ff2	- Allocate a new PD Table if kernel grows beyond 1GB boundary. Reviewed by: peter - Use direct map in pmap_mapdev().	2003-06-22 12:55:20 +00:00
simokawa	97c9bf2fb9	Use direct map in pmap_map(). This saves much KVA for vm_pages and you don't need to increase NKPT for large physical memory anymore. Suggested by: dfr	2003-06-20 14:09:33 +00:00
simokawa	8974dd1d7d	Fix direct map page table for 2GB+ physical memory. You may still need to increase NKPT for larger memory. I have successfully booted 8GB system with NKPT=256.	2003-06-19 12:14:37 +00:00
alc	9dcd110789	Fix a performance bug in all of the various implementations of uma_small_alloc(): They always zeroed the page regardless of what the caller requested.	2003-06-18 02:57:38 +00:00
davidxu	abb4420bbe	Rename P_THREADED to P_SA. P_SA means a process is using scheduler activations.	2003-06-15 00:31:24 +00:00
alc	83f108b04d	Migrate the thread stack management functions from the machine-dependent to the machine-independent parts of the VM. At the same time, this introduces vm object locking for the non-i386 platforms. Two details: 1. KSTACK_GUARD has been removed in favor of KSTACK_GUARD_PAGES. The different machine-dependent implementations used various combinations of KSTACK_GUARD and KSTACK_GUARD_PAGES. To disable guard page, set KSTACK_GUARD_PAGES to 0. 2. Remove the (unnecessary) clearing of PG_ZERO in vm_thread_new. In 5.x, (but not 4.x,) PG_ZERO can only be set if VM_ALLOC_ZERO is passed to vm_page_alloc() or vm_page_grab().	2003-06-14 23:23:55 +00:00
alc	d20c30720b	Move the _new_altkstack() and _dispose_altkstack() functions out of the various pmap implementations into the machine-independent vm. They were all identical.	2003-06-14 06:20:25 +00:00
peter	fda03b7cfc	GC unused cpu_wait() function	2003-06-11 05:20:33 +00:00
jhb	95a58df398	- Use IDTVEC() to declare IPI handlers since they are also IDT vectors. - Make handlers for IPI's used by SMP kernels #ifdef SMP.	2003-06-06 17:45:25 +00:00
jhb	55d014b2d9	- Document the thermal and performance counter LVT entries in the local APIC. - Add a lvt_thermal member to the LAPIC struct. - Add constants for the SMI and INIT LVT delivery modes.	2003-06-06 17:22:15 +00:00
marcel	14594eae88	Change the second (and last) argument of cpu_set_upcall(). Previously we were passing in a void* representing the PCB of the parent thread. Now we pass a pointer to the parent thread itself. The prime reason for this change is to allow cpu_set_upcall() to copy (parts of) the trapframe instead of having it done in MI code in each caller of cpu_set_upcall(). Copying the trapframe cannot always be done with a simply bcopy() or may not always be optimal that way. On ia64 specifically the trapframe contains information that is specific to an entry into the kernel and can only be used by the corresponding exit from the kernel. A trapframe copied verbatim from another frame is in most cases useless without some additional normalization. Note that this change removes the assignment to td->td_frame in some implementations of cpu_set_upcall(). The assignment is redundant. A previous call to cpu_thread_setup() already did the exact same assignment. An added benefit of removing the redundant assignment is that we can now change td_pcb without nasty side-effects. This change officially marks the ability on ia64 for 1:1 threading. Not tested on: amd64, powerpc Compile & boot tested on: alpha, sparc64 Functionally tested on: i386, ia64	2003-06-04 22:46:27 +00:00
peter	f3ad59aebd	Fix ALIGNED_POINTER(). sizeof((u_int32_t)) is not legal C.	2003-06-04 02:15:13 +00:00
peter	54294f1739	Fix restarted syscalls. When we rewind %rip, we also need to restore all the argument registers etc since we have almost certainly have trashed them by now. Take particular car of %r10 since it held the original value of %rcx (which we saved in tf_rcx on entry and doreti doesn't know this).	2003-06-02 21:56:08 +00:00
peter	458ad34341	Make this more compatable with libc_r. Make the internal types for storing registers an array of longs rather than int.	2003-06-02 21:49:35 +00:00
obrien	999f42eba3	Use __FBSDID().	2003-06-02 16:32:55 +00:00
obrien	e52c8a45ff	Use __FBSDID().	2003-06-02 06:43:15 +00:00
peter	9d14df17cd	MFi386: i386/include/asm.h rev 1.11: Do not abuse ##.	2003-06-02 05:59:35 +00:00
obrien	f00e6d17da	Use C99 compatable asm statements.	2003-06-02 00:29:35 +00:00
obrien	1814d2a2a4	Sync with i386/GENERIC ordering.	2003-06-01 20:26:38 +00:00
peter	acd4004bf3	MFi386: rev 1.56: remove break after return	2003-05-31 22:02:11 +00:00
peter	1b3d5aa600	MFi386: rev 1.23: use gdb_strlen()/gdb_strcpy() directly.	2003-05-31 22:00:57 +00:00
peter	73a59225a5	MFi386: rev 1.50: remove unused variable	2003-05-31 21:58:55 +00:00
phk	7722d85e80	Avoid unbalancing the { } count in the source file with #ifdef by putting the opening { after the #ifdef ... #endif sequence. Found by: FlexeLint	2003-05-31 20:25:53 +00:00
peter	0cb8f83a58	Add acpi to the build. Remove the hack from machdep.c that lies to the loader to shut it up.	2003-05-31 07:00:08 +00:00
peter	d5cee515b4	Have hammer_time() return the proc0 stack location, and have locore switch to it before calling mi_startup(). The bootstack is WAY too small for running acpica during probe/attach. While here, pass modulep/physfree to the startup routine, rather than writing to the global variables in locore.S. Approved by: re (amd64/*)	2003-05-31 06:54:29 +00:00
peter	a32db9797c	Regenerate.	2003-05-31 06:51:04 +00:00
peter	3afe1a4377	Make this compile with WITNESS enabled. It wants the syscall names.	2003-05-31 06:49:53 +00:00
peter	776ff76012	Port acpica to amd64. Approved by: re (amd64/* blanket)	2003-05-31 06:47:05 +00:00
peter	711d1a9b94	With the help of jhb, fix the ACPI_ACQUIRE_GLOBAL_LOCK() macros and port to amd64 after repocopy. Approved by: re (amd64/*)	2003-05-31 06:43:55 +00:00
hmp	d48f3818ad	Rename BUS_DMAMEM_NOSYNC to BUS_DMA_COHERENT. The current name is confusing, because it indicates to the client that a bus_dmamap_sync() operation is not necessary when the flag is specified, which is wrong. The main purpose of this flag is to hint the underlying architecture that DMA memory should be mapped in a coherent way, but the architecture can ignore it. But if the architecture does supports coherent mapping of memory, then it makes bus_dmamap_sync() calls cheap. This flag is the same as the one in NetBSD's Bus DMA. Reviewed by: gibbs, scottl, des (implicitly) Approved by: re@ (jhb)	2003-05-30 20:40:33 +00:00
peter	daf35023f1	Nasty 'make it compile' port to amd64. Note that it needs some other wire protocol for the extra registers. I should probably just remove it from here for now since its quite useless. Approved by: re (amd64/* blanket)	2003-05-30 01:02:52 +00:00
peter	089f04e92b	Initial port to amd64 after repocopy from i386. Note that the disassembler has not been updated yet, and will do some very strange things. It does tracebacks (without function arguments due to regparm calling conventions) if -fno-omit-frame-pointer is used (to come later). This achieves basic functionality. Approved by: re (amd64/* blanket)	2003-05-30 01:01:07 +00:00
peter	992d725c87	Add setjmp/longjmp for ddb	2003-05-30 00:58:48 +00:00
peter	02dfc0d238	Update AMD Features vector to include NX (page table entry no-execute bit) and LM (long mode) etc.	2003-05-27 21:59:56 +00:00
scottl	f26aca7b71	Bring back bus_dmasync_op_t. It is now a typedef to an int, though the BUS_DMASYNC_ definitions remain as before. The does not change the ABI, and reverts the API to be a bit more compatible and flexible. This has survived a full 'make universe'. Approved by: re (bmah)	2003-05-27 04:59:59 +00:00
scottl	5f2aec7948	De-orbit bus_dmamem_alloc_size(). It's a hack and was never used anyways. No need for it to pollute the 5.x API any further. Approved by: re (bmah)	2003-05-26 04:00:52 +00:00
peter	a30b99096a	Stop profiled libc from exploding, matching gcc's generated code. Approved by: re (amd64/* blanket)	2003-05-24 18:24:03 +00:00
peter	8fccddb300	Typo fix. oops. Submitted by: jmallett Approved by: re (blanket amd64/*)	2003-05-23 06:36:46 +00:00
peter	0bccc0e2f6	Update comments. Note that the kernel is at -1GB, not -2GB as erroniously implied by the previous commit. KVM is still only 1GB until pmap_growkernel() learns about the extra page table level. Approved by: re (blanket)	2003-05-23 06:35:45 +00:00
peter	d7d93178f5	As suggested by the gdb folks, pad the 'struct fpreg' to a full 512 bytes to match the native fxsave/fxrstor object size since thats apparently what the Linux/NetBSD folks do.	2003-05-23 06:31:56 +00:00
peter	99d1672b3d	Deal with the user VM space expanding. 32 bit applications do not like having their stack at the 512GB mark. Give 4GB of user VM space for 32 bit apps. Note that this is significantly more than on i386 which gives only about 2.9GB of user VM to a process (1GB for kernel, plus page table pages which eat user VM space). Approved by: re (blanket)	2003-05-23 05:07:33 +00:00
peter	eea63ec45a	Major pmap rework to take advantage of the larger address space on amd64 systems. Of note: - Implement a direct mapped region using 2MB pages. This eliminates the need for temporary mappings when getting ptes. This supports up to 512GB of physical memory for now. This should be enough for a while. - Implement a 4-tier page table system. Most of the infrastructure is there for 128TB of userland virtual address space, but only 512GB is presently enabled due to a mystery bug somewhere. The design of this was heavily inspired by the alpha pmap.c. - The kernel is moved into the negative address space(!). - The kernel has 2GB of KVM available. - Provide a uma memory allocator to use the direct map region to take advantage of the 2MB TLBs. - Fixed some assumptions in the bus_space macros about the ability to fit virtual addresses in an 'int'. Notable missing things: - pmap_growkernel() should be able to grow to 512GB of KVM by expanding downwards below kernbase. The kernel must be at the top 2GB of the negative address space because of gcc code generation strategies. - need to fix the >512GB user vm code. Approved by: re (blanket)	2003-05-23 05:04:54 +00:00
peter	eb87db7a61	Merge from i386/trap.c rev 1.252. Use td_critnest instead of the spinlocks count for explicitly enabling interrupts. Approved by: re (blanket)	2003-05-22 20:09:50 +00:00
kan	f35a6040c1	sys/sys/limits.h: - Fix visibilty test for LONG_BIT and WORD_BIT. `#if defined(__FOO_VISIBLE)' is alays wrong because __FOO_VISIBLE is always defined (to 0 for invisibility). sys/<arch>/include/limits.h sys/<arch>/include/_limits.h: - Style fixes. Submitted by: bde Reviewed by: bsdmike Approved by: re (scottl)	2003-05-19 20:29:07 +00:00
peter	2d59e009c8	Actually get all the bits for sd_hibase.. it was 16 bits short. oops. Approved by: re (amd64/* blanket)	2003-05-17 02:05:10 +00:00
alc	efcc32885e	Initialize logical_cpus_mask when the logical CPUs are enumerated in the mptable. (Previously, logical_cpus_mask was only initialized if the hyperthreading fixup was executed.) Approved by: re (jhb) Reviewed by: ps	2003-05-15 05:12:24 +00:00
peter	12d7e4bee6	Collect the nastiness for preserving the kernel MSR_GSBASE around the load_gs() calls into a single place that is less likely to go wrong. Eliminate the per-process context switching of MSR_GSBASE, because it should be constant for a single cpu. Instead, save/restore it during the loading of the new %gs selector for the new process. Approved by: re (amd64/* blanket)	2003-05-15 00:23:40 +00:00
peter	7208ad8cbb	Use compile time constants for things like PTmap[] etc because they're about to move outside of the +/- 2GB range Suggested by: jake Approved by: re (amd64/* blanket)	2003-05-15 00:20:17 +00:00
peter	c177f59bbf	Regen Approved by: re (amd64 blanket)	2003-05-14 04:11:25 +00:00
peter	770abdbb9c	Add BASIC i386 binary support for the amd64 kernel. This is largely stolen from the ia64/ia32 code (indeed there was a repocopy), but I've redone the MD parts and added and fixed a few essential syscalls. It is sufficient to run i386 binaries like /bin/ls, /usr/bin/id (dynamic) and p4. The ia64 code has not implemented signal delivery, so I had to do that. Before you say it, yes, this does need to go in a common place. But we're in a freeze at the moment and I didn't want to risk breaking ia64. I will sort this out after the freeze so that the common code is in a common place. On the AMD64 side, this required adding segment selector context switch support and some other support infrastructure. The %fs/%gs etc code is hairy because loading %gs will clobber the kernel's current MSR_GSBASE setting. The segment selectors are not used by the kernel, so they're only changed at context switch time or when changing modes. This still needs to be optimized. Approved by: re (amd64/* blanket)	2003-05-14 04:10:49 +00:00
peter	94122e1008	Fix some misunderstandings about 64 bit extension. Fix fuword/suword - they're supposed to be 'long' - ie: point them at fuword64/suword64 instead of the incorrect 32 bit versions.	2003-05-14 03:38:13 +00:00
jhb	89a4eb17de	- Merge struct procsig with struct sigacts. - Move struct sigacts out of the u-area and malloc() it using the M_SUBPROC malloc bucket. - Add a small sigacts_*() API for managing sigacts structures: sigacts_alloc(), sigacts_free(), sigacts_copy(), sigacts_share(), and sigacts_shared(). - Remove the p_sigignore, p_sigacts, and p_sigcatch macros. - Add a mutex to struct sigacts that protects all the members of the struct. - Add sigacts locking. - Remove Giant from nosys(), kill(), killpg(), and kern_sigaction() now that sigacts is locked. - Several in-kernel functions such as psignal(), tdsignal(), trapsignal(), and thread_stopped() are now MP safe. Reviewed by: arch@ Approved by: re (rwatson)	2003-05-13 20:36:02 +00:00
peter	fbf228660c	Really stop the loader from trying to load the acpi module by lying and pretending that it is already here. Approved by: re (amd64/* stuff)	2003-05-12 18:37:56 +00:00
peter	9ff3e48a71	For the page fault handler, save %cr2 in the outer trap handler so that we do not have to run so long with interrupts disabled. This involved creating tf_addr in the trapframe. Reorganize the trap stubs so that they consistently reserve the stack space and initialize any missing bits. Approved by: re (amd64 stuff)	2003-05-12 18:33:19 +00:00
peter	2d77a1cdad	Sync ucontext with reality. The struct trapframe changes need to be reflected here. Approved by: re (blanket amd64/*)	2003-05-12 18:23:04 +00:00
peter	0cfa424a1b	AMD64 physical space is much larger than i386, de-i386 the bus_space and bus_dma MD code for AMD64. (And a trivial ifdef update in dev/kbd because of this). More updates are needed here to take advantage of the 64 bit instructions. Approved by: re (blanket amd64/*)	2003-05-12 02:44:37 +00:00
peter	c688fcc3ca	Give a %fs and %gs to userland. Use swapgs to obtain the kernel %GS.base value on entry and exit. This isn't as easy as it sounds because when we recursively trap or interrupt, we have to avoid duplicating the swapgs instruction or we end up back with the userland %gs. I implemented this by testing TF_CS to see if we're coming from supervisor mode already, and check for returning to supervisor. To avoid a race with interrupts in the brief period after beginning executing the handler and before the swapgs, convert all trap gates to interrupt gates, and reenable interrupts immediately after the swapgs. I am not happy with this. There are other possible ways to do this that should be investigated. (eg: storing the GS.base MSR value in the trapframe) Add some sysarch functions to let the userland code get to this. Approved by: re (blanket amd64/*)	2003-05-12 02:37:29 +00:00
peter	581fa87ddd	Call it an AMD64 Processor, not a Hammer. Also, it seems that the cpuid model numbers are wider than I first thought. Approved by: re (blanket amd64/*)	2003-05-11 23:01:04 +00:00
peter	756ca1f04c	I missed another printf format error while extracting the patch. Approved by: re (blanket amd64/*)	2003-05-11 22:55:40 +00:00
peter	7ba8edf901	Make atdevbase long for the KERNBASE > 4GB case Approved by: re (amd64/* blanket)	2003-05-11 22:53:43 +00:00
peter	f6ba7180dc	Fix printf format errors that were undetected due to using the standard FSF compiler during early development.	2003-05-11 22:40:25 +00:00
peter	89e93481ff	Export PML4SHIFT and PDPSHIFT Approved by: re (blanket amd64/*)	2003-05-11 22:39:40 +00:00
peter	d8c525a6c6	Since compiling natively, the compile environment has been less forgiving about silly typos. Use the correct comment sequences.	2003-05-11 22:38:54 +00:00
peter	952d1b68b2	Provide a fake varargs implementation for lint's benefit. This way it can see the intent of the va_* macros, even though it cannot work. Approved by: re (blanket amd64/*)	2003-05-10 00:55:15 +00:00
peter	4d4f676db4	Remove _ARCH_INDIRECT ifdefs. They existed for lib/msun/* on i386, which could use different versions of the math code depending on whether there was real floating point hardware or math emulation. Since the fpu is part of the core specification on amd64, there is no need for this here. Approved by: re (blanket amd64/*)	2003-05-10 00:53:34 +00:00
peter	26c4873253	bcopyb() isn't used on amd64 kernel (it only exists for i386/pcvt) Approved by: re (blanket amd64/*)	2003-05-10 00:51:29 +00:00
peter	63c2624c81	Finish translating i386/support.s into amd64 asm - replace bcopy etc with asm versions. This yields about a 5% kernel compile time speedup.	2003-05-10 00:49:56 +00:00
peter	31356c64a8	Include the MXCSR initial values, based on the AMD docs. This file should really be renamed to fpu.h and npx.c to fpu.c since its part of the core architecture on amd64 systems, not an isa 'numeric processor extension'.	2003-05-09 18:28:05 +00:00
peter	9bc9519226	Turn syscons on now that it works, so that anybody trying to run this can see something. Probing for keyboard still works for auto serial console mode.	2003-05-09 18:26:06 +00:00
peter	5596d2cb7e	Oops. Turn T_PAGEFLT back into an interrupt gate. It is critical that interrupts be disabled and remain disabled until %cr2 is read. Otherwise we can preempt and another process can fault, and by the time we read %cr2, we see a different processes fault address. This Greatly Confuses vm_fault() (to say the least). The i386 port has got this marked as a bug workaround for a Cyrix CPU, which is what lead me astray. Its actually necessary for preemption, regardless of whether Cyrix cpus had a bug or not.	2003-05-08 08:25:51 +00:00
peter	83575bc69c	Leave space for the 128 byte red-zone on the stack.	2003-05-08 00:13:24 +00:00
peter	aa8e46304f	#include <machine/metadata.h> was missing; add it	2003-05-08 00:12:37 +00:00
peter	068f75971f	Fix a preemption race. I was reenabling interrupts in the fast system call handler before it was safe. It was possible for to lose context and for something else to clobber the PCPU scratch variable. This moves the interrupt enable way too late, but its better safe than sorry for the moment.	2003-05-08 00:05:00 +00:00
jhb	905e807e48	Style nits. Approved by: re (bmah)	2003-05-07 17:21:38 +00:00
kan	9328ad6bf8	Style fixes. Remove DBL_DIG, DBL_MIN, DBL_MAX and their FLT_ counterparts, they were marked for deprecation ever since SUSv1 at least. Only define ULLONG_MIN/MAX and LLONG_MAX if long long type is supported. Restore a lost comment in MI _limits.h file and remove it from sys/limits.h where it does not belong.	2003-05-04 22:13:04 +00:00
peter	8fc30582f4	Repocopy .s to .S	2003-05-03 00:21:43 +00:00
peter	d082f140fb	I changed the numbering of the MODINFOMD_SMAP during the commit, so recognize the old number for my development boxes so I can use old loader/pxeboot for a while if I need to.	2003-05-01 04:18:02 +00:00
peter	6a018fff12	Slight reorg and added AMD64 support. A couple of the MODINFOMD_* values that were added to sparc64 and later powerpc, really should have been in the MI area. But changing that now with insufficient preperation will just cause too much pain. Move MD_FETCH() to the MI sys/linker.h file to avoid another two copies of it.	2003-05-01 03:31:18 +00:00
peter	45949ccde1	Commit MD parts of a loosely functional AMD64 port. This is based on a heavily stripped down FreeBSD/i386 (brutally stripped down actually) to attempt to get a stable base to start from. There is a lot missing still. Worth noting: - The kernel runs at 1GB in order to cheat with the pmap code. pmap uses a variation of the PAE code in order to avoid having to worry about 4 levels of page tables yet. - It boots in 64 bit "long mode" with a tiny trampoline embedded in the i386 loader. This simplifies locore.s greatly. - There are still quite a few fragments of i386-specific code that have not been translated yet, and some that I cheated and wrote dumb C versions of (bcopy etc). - It has both int 0x80 for syscalls (but using registers for argument passing, as is native on the amd64 ABI), and the 'syscall' instruction for syscalls. int 0x80 preserves all registers, 'syscall' does not. - I have tried to minimize looking at the NetBSD code, except in a couple of places (eg: to find which register they use to replace the trashed %rcx register in the syscall instruction). As a result, there is not a lot of similarity. I did look at NetBSD a few times while debugging to get some ideas about what I might have done wrong in my first attempt.	2003-05-01 01:05:25 +00:00
peter	007a27a9b4	Repocopy from x86_64/... to amd64/... Rename visible x86_64 references to amd64. Kill MID_MACHINE, its a.out specific, the only platform that supports it is i386. All of the other platforms should remove it too.	2003-04-30 22:51:59 +00:00
jhb	09adcd8b3e	Range check the syscall number before looking it up in the syscallnames[] array. Submitted by: pho	2003-04-30 17:59:27 +00:00
markm	6cc289554b	Fix some easy, global, lint warnings. In most cases, this means making some local variables static. In a couple of cases, this means removing an unused variable.	2003-04-30 12:57:40 +00:00
markm	5b09b29a7f	Warns fixing. Protect against inappropriate linting, and mark GCC-specific assemble code as such (in #ifdefs). Fix an easy static variable warning while I'm here.	2003-04-30 12:23:58 +00:00
kan	9468fdaf14	Deprecate machine/limits.h in favor of new sys/limits.h. Change all in-tree consumers to include <sys/limits.h> Discussed on: standards@ Partially submitted by: Craig Rodrigues <rodrigc@attbi.com>	2003-04-29 13:36:06 +00:00
jake	d33371fd63	Use inlines for loading and storing page table entries. Use cmpxchg8b for the PAE case to ensure idempotent 64 bit loads and stores. Sponsored by: DARPA, Network Associates Laboratories	2003-04-28 20:35:36 +00:00
jhb	ec7071fcb8	- Push down Giant into the sysarch() calls that still need Giant. - Standardize on EINVAL rather than EOPNOTSUPP if the sysarch op value is invalid.	2003-04-25 20:04:02 +00:00
jhb	dcf45ed625	Regen.	2003-04-25 15:59:44 +00:00
jhb	011ef0f3d8	Oops, the thr_* and jail_attach() syscall entries should be NOPROTO rather than STD.	2003-04-25 15:59:18 +00:00
jake	ef38f814c4	Remove harmless invalid cast. Sponsored by: DARPA, Network Associates Laboratories	2003-04-25 15:07:58 +00:00
deischen	3d51b3a280	Add an argument to get_mcontext() which specified whether the syscall return values should be cleared. The system calls getcontext() and swapcontext() want to return 0 on success but these contexts can be switched to at a later time so the return values need to be cleared in the saved register sets. Other callers of get_mcontext() would normally want the context without clearing the return values. Remove the i386-specific context saving from the KSE code. get_mcontext() is not i386-specific any more. Fix a bad pointer in the alpha get_mcontext() code. The context was being bcopy()'d from &td->tf_frame, but tf_frame is itself a pointer, so the thread was being copied instead. Spotted by jake. Glanced at by: jake Reviewed by: bde (months ago)	2003-04-25 01:50:30 +00:00
jhb	968ad4dbc6	Regen.	2003-04-24 20:50:57 +00:00
jhb	4d35246c8d	Fix the thr_create() entry by adding a trailing \. Also, sync up the MP safe flag for thr_* with the main table.	2003-04-24 20:49:46 +00:00
davidxu	7dbf40d20d	Don't print anything for fault at cpu_switch_load_gs, just like other code to recover fault in doreti because of invalid segment registers, silently push error to userland.	2003-04-24 01:48:59 +00:00
kan	b86b779077	Add a new sys/limits.h file which in turn depends on machine/_limits.h to get actual constant values. This is in preparation for machine/limits.h retirement. Discussed on: standards@ Submitted by: Craig Rodrigues <rodrigc@attbi.com> (*) Modified by: kan	2003-04-23 21:41:59 +00:00
jhb	146e8aecec	- Replace inline implementations of sigprocmask() with calls to kern_sigprocmask() in the various binary compatibility emulators. - Replace calls to sigsuspend(), sigaltstack(), sigaction(), and sigprocmask() that used the stackgap with calls to the corresponding kern_sig*() functions instead without using the stackgap.	2003-04-22 18:23:49 +00:00
davidxu	0a01dd83c5	Move down intr level testing code a bit, cpu_switch_load_gs fault can be at interrupt nested time.	2003-04-22 08:12:03 +00:00
davidxu	d0165cba28	Fix some problems for cpu_switch_load_gs. when fault address is at cpu_switch_load_gs, cpu is in context switch, so don't enable interrupt. because it is in context switch, it is expected sched_lock was held, so don't PROC_LOCK(p) and psignal, it is LOR, probably we can set a P_XSIGBUS like flag in p_sflags, and set TDF_ASTPENDING in td_flags, in ast(), post a SIGBUS to process if P_XSIGBUS was set.	2003-04-22 07:45:47 +00:00
davidxu	4f1ed41d01	Remove single threading detecting code, these code really should be replaced by thread_user_enter(), but current we don't want to enable this in trap.	2003-04-22 03:17:41 +00:00
simokawa	9f7fbe4b69	Add FireWire drivers to GENERIC.	2003-04-21 16:44:05 +00:00
davidxu	d06885c524	Reset pcb_gs and %gs before possibly invalidating it.	2003-04-21 15:05:05 +00:00
wpaul	e41f6225fa	Add device driver support for the ASIX Electronics AX88172 USB 2.0 ethernet controller. The driver has been tested with the LinkSys USB200M adapter. I know for a fact that there are other devices out there with this chip but don't have all the USB vendor/device IDs. Note: I'm not sure if this will force the driver to end up in the install kernel image or not. Special magic needs to be done to exclude it to keep the boot floppies from bloating again, someone please advise.	2003-04-20 19:05:33 +00:00
davidxu	f781b4eab2	Backout my last commit. Requested by: bde	2003-04-20 01:35:21 +00:00
davidxu	6223a95348	Don't return garbage in high 16 bits.	2003-04-19 02:40:39 +00:00
jhb	8b7a3b47d1	Use the proc lock to protect p_singlethread and a P_WEXIT test. This fixes a couple of potential KSE panics on non-i386 arch's that weren't holding the proc lock when calling thread_exit().	2003-04-18 20:20:00 +00:00
jhb	5bc80dc230	Hold the proc lock for curproc around sigonstack().	2003-04-18 20:09:04 +00:00
jhb	3d97448a8a	Remove a couple of unused symbols.	2003-04-17 22:17:28 +00:00
mux	c2d5e1feb9	style(9)	2003-04-15 03:11:03 +00:00
simokawa	bdfca1b9c1	Restore delayed load support for the resource shortage case. It was missed in the previous change. Now, _bus_dmamap_load_buffer() accepts BUS_DMA_WAITOK/BUS_DMA_NOWAIT flags. Original idea from: jake	2003-04-14 13:21:40 +00:00
simokawa	a350d6ddb9	* Use _bus_dmamap_load_buffer() and respect maxsegsz in bus_dmamap_load(). Ignoring maxsegsz may lead to fatal data corruption for some devices. ex. SBP-2/FireWire We should apply this change to other platforms except for sparc64. MFC after: 1 week	2003-04-14 04:19:42 +00:00
davidxu	ad8b73e2eb	Copy %gs from current CPU not from a stale PCB backup.	2003-04-11 14:47:34 +00:00
davidxu	ce0741a1c6	set_user_ldt_rv() should check same proc not thread, this commit fixes an user LDT smp rendezvous bug.	2003-04-11 14:45:07 +00:00
des	6366f8a796	Convert the SMP_TSC kernel option into a loader tunable. Also enable the TSC timecounter on single-CPU systems even when they are running an SMP kernel.	2003-04-10 23:07:24 +00:00
mux	ea793948f7	Change the operation parameter of bus_dmamap_sync() from an enum to an int and redefine the BUS_DMASYNC_* constants as flags. This allows us to specify several operations in one call to bus_dmamap_sync() as in NetBSD.	2003-04-10 23:03:33 +00:00
julian	6f175a0e20	Move the _oncpu entry from the KSE to the thread. The entry in the KSE still exists but it's purpose will change a bit when we add the ability to lock a KSE to a cpu.	2003-04-10 17:35:44 +00:00
wes	e35ae2d86e	Add a sysctl that records and reports the CPU clock rate calculated at boot. Funny how often this trivial piece of information crops up in embedded boxen. Sponsored by: St. Bernard Software	2003-04-10 07:05:24 +00:00
mike	75859ca578	o In struct prison, add an allprison linked list of prisons (protected by allprison_mtx), a unique prison/jail identifier field, two path fields (pr_path for reporting and pr_root vnode instance) to store the chroot() point of each jail. o Add jail_attach(2) to allow a process to bind to an existing jail. o Add change_root() to perform the chroot operation on a specified vnode. o Generalize change_dir() to accept a vnode, and move namei() calls to callers of change_dir(). o Add a new sysctl (security.jail.list) which is a group of struct xprison instances that represent a snapshot of active jails. Reviewed by: rwatson, tjr	2003-04-09 02:55:18 +00:00
jake	dcb4f8c15b	Remove invalid cast to vm_offset_t to avoid truncating a physical address when doing pmap_kextract on a 2MB page. Spotted by: peter Sponsored by: DARPA, Network Associates Laboratories	2003-04-08 18:22:41 +00:00
jake	eff9d3d98d	Add support for bounce buffers to _bus_dmamap_load_buffer, which is the backend for bus_dmamap_load_mbuf and bus_dmamap_load_uio. - Increaes MAX_BPAGES to 512. Less than this causes fxp to quickly runs out of bounce pages. - Add an argument to reserve_bounce_pages indicating wether this operation should fail or be queued for later processing if we run out of memory. The EINPROGRESS return value is not handled properly by consumers of bus_dmamap_load_mbuf. - If bounce buffers are required allocate minimum 1 bounce page at map creation time. If maxsize was small previously this could get truncated to 0 and the drivers would quickly run out of bounce pages. - Fix a bug handling the return value of alloc_bounce_pages at map creation time. It returns the number of pages allocated, not 0 on success. - Use bus_addr_t for physical addresses to avoid truncation. - Assert that the map is non-null and not the no bounce map in add_bounce_pages. Sponsored by: DARPA, Network Associates Laboratories	2003-04-07 16:08:32 +00:00
jake	204303829b	Better fix for previous previous which still allows the 4megs of kva at the top of the address space to be reclaimed. The problem is that with the APTD gone the mapable kernel address space runs right to the end of the 32 bit address space. As a max this is 0x100000000, which can't be represented in 32 bits, so we have to use ptd entry n-1 and pte offset n-1, instead of ptd entry n and pte offset 0. There's still 1 page we can't use, but we gain just under 4 megs of kva (8 megs with PAE). Sponsored by: DARPA, Network Associates Laboratories	2003-04-07 14:27:19 +00:00
peter	620ab7fc9b	Unbreak the !LAZY_SWITCH case. I #ifdef'ed too much when I added the ifdefs prior to commit and killed the same-address-space test. Submitted by: bde	2003-04-05 22:18:14 +00:00
tegge	766eadf040	Add SMP_TSC option, which can be used on SMP systems where the TSCs are synchronized to reduce context switch cost.	2003-04-04 23:54:46 +00:00
des	5468286a89	Define ovbcopy() as a macro which expands to the equivalent bcopy() call, to take care of the KAME IPv6 code which needs ovbcopy() because NetBSD's bcopy() doesn't handle overlap like ours. Remove all implementations of ovbcopy(). Previously, bzero was a function pointer on i386, to save a jmp to bzero_vector. Get rid of this microoptimization as it only confuses things, adds machine-dependent code to an MD header, and doesn't really save all that much. This commit does not add my pagezero() / pagecopy() code.	2003-04-04 17:29:55 +00:00
jake	098b68fc78	Bandaid fix for previous commit while I figure out why it broke. This caused crashes early in boot on i386 UP machines. Reported by: phk Pointy hat to: jake	2003-04-04 10:09:44 +00:00
jake	68ea55c962	- Removed APTD and associated macros, it is no longer used. BANG BANG BANG etc. Sponsored by: DARPA, Network Associates Laboratories	2003-04-03 23:44:35 +00:00
peter	46969da5f8	Commit a partial lazy thread switch mechanism for i386. it isn't as lazy as it could be and can do with some more cleanup. Currently its under options LAZY_SWITCH. What this does is avoid %cr3 reloads for short context switches that do not involve another user process. ie: we can take an interrupt, switch to a kthread and return to the user without explicitly flushing the tlb. However, this isn't as exciting as it could be, the interrupt overhead is still high and too much blocks on Giant still. There are some debug sysctls, for stats and for an on/off switch. The main problem with doing this has been "what if the process that you're running on exits while we're borrowing its address space?" - in this case we use an IPI to give it a kick when we're about to reclaim the pmap. Its not compiled in unless you add the LAZY_SWITCH option. I want to fix a few more things and get some more feedback before turning it on by default. This is NOT a replacement for Bosko's lazy interrupt stuff. This was more meant for the kthread case, while his was for interrupts. Mine helps a little for interrupts, but his helps a lot more. The stats are enabled with options SWTCH_OPTIM_STATS - this has been a pseudo-option for years, I just added a bunch of stuff to it. One non-trivial change was to select a new thread before calling cpu_switch() in the first place. This allows us to catch the silly case of doing a cpu_switch() to the current process. This happens uncomfortably often. This simplifies a bit of the asm code in cpu_switch (no longer have to call choosethread() in the middle). This has been implemented on i386 and (thanks to jake) sparc64. The others will come soon. This is actually seperate to the lazy switch stuff. Glanced at by: jake, jhb	2003-04-02 23:53:30 +00:00
jake	02364d4f5d	- Make casuptr return the old value of the location we're trying to update, and change the umtx code to expect this. Reviewed by: jeff	2003-04-02 08:02:27 +00:00
jeff	5f8f1497c8	- Add thr and umtx system calls.	2003-04-01 01:15:56 +00:00
jeff	420a77ecd5	- Define a new md function 'casuptr'. This atomically compares and sets a pointer that is in user space. It will be used as the basic primitive for a kernel supported user space lock implementation. - Implement this function in x86's support.s - Provide stubs that return -1 in all other architectures. Implementations will follow along shortly. Reviewed by: jake	2003-04-01 00:18:55 +00:00
jeff	56865cc549	- In npxgetregs() use the td argument to save the fpu state from and not curthread. Nothing currently depends on this behavior. - Clean up an extra newline. Obtained from: bde	2003-04-01 00:16:32 +00:00
jeff	23844ff023	- Add a placeholder for sigwait	2003-03-31 23:36:40 +00:00
jeff	46e6ba39f1	- Move p->p_sigmask to td->td_sigmask. Signal masks will be per thread with a follow on commit to kern_sig.c - signotify() now operates on a thread since unmasked pending signals are stored in the thread. - PS_NEEDSIGCHK moves to TDF_NEEDSIGCHK.	2003-03-31 22:49:17 +00:00
jeff	554fb31557	- Fix two calls to trapsignal() that were still passing in 'struct proc'. These were missed in my last commit.	2003-03-31 22:41:32 +00:00
jeff	4a3718fb25	- Change trapsignal() to accept a thread and not a proc. - Change all consumers to pass in a thread. Right now this does not cause any functional changes but it will be important later when signals can be delivered to specific threads.	2003-03-31 22:02:38 +00:00
jeff	3a712624d5	- In npxsetregs don't set the floating point if td == fpcurthread not if curthread == fpcurthread. This is important when we're saving the fp state for a thread other than curthread as in from set_mcontext.	2003-03-31 00:32:43 +00:00
jake	abd082c833	- Add support for PAE and more than 4 gigs of ram on x86, dependent on the kernel opition 'options PAE'. This will only work with device drivers which either use busdma, or are able to handle 64 bit physical addresses. Thanks to Lanny Baron from FreeBSD Systems for the loan of a test machine with 6 gigs of ram. Sponsored by: DARPA, Network Associates Laboratories, FreeBSD Systems	2003-03-30 05:24:52 +00:00
jake	a5a502558e	- Remove invalid casts. Sponsored by: DARPA, Network Associates Laboratories	2003-03-30 01:44:16 +00:00
jake	b88859c2da	- Convert all uses of pmap_pte and get_ptbase to pmap_pte_quick. When accessing an alternate address space this causes 1 page table page at a time to be mapped in, rather than using the recursive mapping technique to map in an entire alternate address space. The recursive mapping technique changes large portions of the address space and requires global tlb flushes, which seem to cause problems when PAE is enabled. This will also allow IPIs to be avoided when mapping in new page table pages using the same technique as is used for pmap_copy_page and pmap_zero_page. Sponsored by: DARPA, Network Associates Laboratories	2003-03-30 01:16:19 +00:00
mdodd	ededebc1a4	- Move driver to newbus. - Provide identify methods for EtherExpress and 3c507 cards; this means these cards no longer need wired configs. - Provide a detach method.	2003-03-29 13:36:41 +00:00
ps	1fc4964a6e	Nuke HTT from here too. Spotted by: jhb	2003-03-26 19:55:03 +00:00
ps	26fe456f21	Nuke options HTT infavor of machdep.hlt_logical_cpus tunable/sysctl. This keeps the logical cpu's halted in the idle loop. By default the logical cpu's are halted at startup. It is also possible to halt any cpu in the idle loop now using machdep.hlt_cpus. Examples of how to use this: machdep.hlt_cpus=1 halt cpu0 machdep.hlt_cpus=2 halt cpu1 machdep.hlt_cpus=4 halt cpu2 machdep.hlt_cpus=3 halt cpu0,cpu1 Reviewed by: jhb, peter	2003-03-26 19:49:34 +00:00
peter	0511e210cb	Halt the cpus in the idle loop for SMP as well for several reasons: 1) Its critical for HTT. There's less foot-shooting opportunity. 2) I've seen significant improvements in interactive response to commands over ssh sessions. I assume this is less lock contention. 3) As incentive to finish the idle cpu IPI wakeup stuff. 4) The machine on my desk was blowing hot air in my general direction because somebody forgot to turn the hlt on, and it saves 50 watts per cpu.. The machdep.cpu_idle_hlt sysctl is still available, but now the default is the same as on UP kernels.	2003-03-26 19:40:29 +00:00
jhb	77e623bc9a	Add an options entry for HTT in SMP and GENERIC similar to the SMP and APIC_IO options. Requested by: John Cagle <john.cagle@hp.com>	2003-03-25 23:31:14 +00:00
jake	783ae539c3	- Add vm_paddr_t, a physical address type. This is required for systems where physical addresses larger than virtual addresses, such as i386s with PAE. - Use this to represent physical addresses in the MI vm system and in the i386 pmap code. This also changes the paddr parameter to d_mmap_t. - Fix printf formats to handle physical addresses >4G in the i386 memory detection code, and due to kvtop returning vm_paddr_t instead of u_long. Note that this is a name change only; vm_paddr_t is still the same as vm_offset_t on all currently supported platforms. Sponsored by: DARPA, Network Associates Laboratories Discussed with: re, phk (cdevsw change)	2003-03-25 00:07:06 +00:00
mdodd	803a8a66ce	Use repo-copied files in sys/i386/bios.	2003-03-24 19:14:46 +00:00
bde	79afd37295	Disable interrupts while in kdb_trap() to handle cases where the caller doesn't do it. This fixes all known causes of "Context switches not allowed in the debugger" in mi_switch(). The main cause was trap_fatal() calling kdb_trap() with interrupts enabled. Switching to ithreads for interrupt handling then made fatal traps more fatal and harder to debug. The problem was limited in -current because most interrupt handlers are blocked by Giant, but it occurred almost deterministically for me because my clock interrupt handlers are non-fast and not blocked by Giant.	2003-03-24 10:17:14 +00:00
ru	3e93151335	Remove bitrot associated with `maxusers'. Submitted by: bde	2003-03-22 14:18:23 +00:00
dwmalone	ca680978de	Extend CPU_ATHLON_SSE_HACK to cover a few more revisions of Athlon CPUs. Submitted by: Jon Kuster <kwsn@earthlink.net> MFC after: 2 weeks	2003-03-20 20:50:22 +00:00
mux	0977536812	Use atomic operations to increment and decrement the refcount in busdma tags. There are currently no tags shared accross different drivers so this isn't needed at the moment, but it will be required when we'll have a proper newbus method to get the parent busdma tag.	2003-03-20 19:45:26 +00:00
phk	e059b79437	Including <sys/stdint.h> is (almost?) universally only to be able to use %j in printfs, so put a newsted include in <sys/systm.h> where the printf prototype lives and save everybody else the trouble.	2003-03-18 08:45:25 +00:00
jhb	c76c140d9b	Expand the APIC ID mask field of the ICR register to 8 bits intead of just 4 bits. This reportedly fixes booting on the SW7500CW2. Much thanks to the submitter for tracking this down! Submitted by: Brian Buchanan <brian@ncircle.com> Reviewed by: peter MFC after: 3 days	2003-03-17 19:14:13 +00:00
mux	279c48578e	- Lock down the bounce pages structures. We use the same locking scheme as with the alpha backend because both implementations of bounce pages are identical. - Remove useless splhigh()/splx() calls.	2003-03-17 18:34:34 +00:00
jake	c1838df603	Made the prototypes for pmap_kenter and pmap_kremove MD. These functions are machine dependent because they are not required to update the tlb when mappings are added or removed, and doing so is machine dependent. In addition, an implementation may require that pages mapped with pmap_kenter have a backing vm_page_t, which is not necessarily true of all physical pages, and so may choose to pass the vm_page_t to pmap_kenter instead of the physical address in order to make this requirement clear.	2003-03-16 04:16:03 +00:00
mux	15b2d31e35	Grab Giant around calls to contigmalloc() and contigfree() so that drivers converted to be MP safe don't have to deal with it.	2003-03-13 17:18:48 +00:00
jake	0d29b7995c	- Added support for multiple page directory pages to pmap_pinit and pmap_release. - Merged pmap_release and pmap_release_free_page. When pmap_release is called only the page directory page(s) can be left in the pmap pte object, since all page table pages will have been freed by pmap_remove_pages and pmap_remove. In addition, there can only be one reference to the pmap and the page directory is wired, so the page(s) can never be busy. So all there is to do is clear the magic mappings from the page directory and free the page(s). Sponsored by: DARPA, Network Associates Laboratories	2003-03-12 07:38:37 +00:00
jake	8e6ed1abea	Use bus_space_handle_t to represent host port and virtual addresses; bus_addr_t may not be appropriate. Sponsored by: DARPA, Network Associates Laboratories	2003-03-11 19:43:38 +00:00
davidxu	27d6312bf7	Initialize eflags in fake frame to default value rather than random one. The random value sometimes causes macro CLKF_USERMODE to return true because PSL_VM bit is set and really shoudn't be, this causes statclock() to execute in wrong path, and further breaks KSE code and kernel crashes when executing threaded program.	2003-03-08 03:58:50 +00:00
rwatson	7974609efe	Instrument sysarch() MD privileged I/O access interfaces with a MAC check, mac_check_sysarch_ioperm(), permitting MAC security policy modules to control access to these interfaces. Currently, they protect access to IOPL on i386, and setting HAE on Alpha. Additional checks might be required on other platforms to prevent bypass of kernel security protections by unauthorized processes. Obtained from: TrustedBSD Project Sponsored by: DARPA, Network Associates Laboratories	2003-03-06 04:47:47 +00:00
jhb	e4bcd25517	Replace calls to WITNESS_SLEEP() and witness_list() with equivalent calls to WITNESS_WARN().	2003-03-04 21:03:05 +00:00
jhb	1ccbcbaf68	Wrap the hyperthreading support code with the HTT kernel option. Hyperthreading support is now off unless the HTT option is added. MFC-after: 3 days	2003-03-04 20:24:53 +00:00
phk	0ae911eb0e	Gigacommit to improve device-driver source compatibility between branches: Initialize struct cdevsw using C99 sparse initializtion and remove all initializations to default values. This patch is automatically generated and has been tested by compiling LINT with all the fields in struct cdevsw in reverse order on alpha, sparc64 and i386. Approved by: re(scottl)	2003-03-03 12:15:54 +00:00
jhb	cf1a19f656	Expand some #ifdef's to fix I386_CPU compile. Reported by: Andy Farkas <andyf@speednet.com.au>	2003-02-27 20:38:48 +00:00
alc	89bdff5b12	Remove some long unused declarations. (For example, the PV flags have not been used since revision 1.8, roughly nine years ago.)	2003-02-27 20:13:20 +00:00
julian	3fc9836d46	Change the process flags P_KSES to be P_THREADED. This is just a cosmetic change but I've been meaning to do it for about a year.	2003-02-27 02:05:19 +00:00
ru	c51d104769	Implemented "nooption" and "nomakeoption" config(8) tokens. Fixed memory leak in the "nodevice" option implementation. Use these instead of sed(1) in MD NOTES. Use a single makefile (sys/conf/makeLINT.mk) to generate LINT for all architectures. (Previous versions missed the LINT dependency on Makefile, and i386 version also missed the dependency on ${NOTES}.) Fixed bugs in the previous NOTES conversion using the "nodevice" token and sed(1): - i386 LINT lost "device pst". - pc98 LINT lost SC_, MAXCONS and KBD_DISABLE_KEYMAP_LOAD options, and got needless DPT_ options. - Added nooptions PPC_DEBUG, PPC_PROBE_CHIPSET, KBD_INSTALL_CDEV to sparc64 LINT so that it has a chance to config(8). This basically returns us to where we were before.	2003-02-26 23:36:59 +00:00
davidxu	0f0fab8ca6	Better to not know anything about KSE.	2003-02-26 05:47:25 +00:00
mux	186c547c81	Correctly set BUS_SPACE_MAXSIZE in all the busdma backends. It was bogusly set to 64 * 1024 or 128 * 1024 because it was bogusly reused in the BUS_DMAMAP_NSEGS definition.	2003-02-26 02:16:06 +00:00
obrien	d42e7b5cee	Move most everything back to a MI NOTES, and use "nodevice" in MD NOTES Where needed. Use 'sed' for now in place of "nooptions". Add a sparc64 MD NOTES. Reviewed by: arch@	2003-02-25 20:59:23 +00:00
jake	8c5e3715d7	- Added inlines pmap_is_current, pmap_is_alternate and pmap_set_alternate for testing and setting the current and alternate address spaces. - Changed PTDpde and APTDpde to arrays to support multiple page directory pages. ponsored by: DARPA, Network Associates Laboratories	2003-02-25 19:40:21 +00:00
davidxu	996c44cea2	Remove an unsafe KASSERT.	2003-02-25 11:23:17 +00:00
mux	541937cf73	Cleanup of the d_mmap_t interface. - Get rid of the useless atop() / pmap_phys_address() detour. The device mmap handlers must now give back the physical address without atop()'ing it. - Don't borrow the physical address of the mapping in the returned int. Now we properly pass a vm_offset_t * and expect it to be filled by the mmap handler when the mapping was successful. The mmap handler must now return 0 when successful, any other value is considered as an error. Previously, returning -1 was the only way to fail. This change thus accidentally fixes some devices which were bogusly returning errno constants which would have been considered as addresses by the device pager. - Garbage collect the poorly named pmap_phys_address() now that it's no longer used. - Convert all the d_mmap_t consumers to the new API. I'm still not sure wheter we need a __FreeBSD_version bump for this, since and we didn't guarantee API/ABI stability until 5.1-RELEASE. Discussed with: alc, phk, jake Reviewed by: peter Compile-tested on: LINT (i386), GENERIC (alpha and sparc64) Runtime-tested on: i386	2003-02-25 03:21:22 +00:00
jake	dd594d3b57	- Removed UMAXPTDI and UMAXPTEOFF. - Changed VM_MAXUSER_ADDRESS to be defined in terms of PTDPTDI. In order for assumptions about the recursive page table map to work it must be the base of the recursive map. Any pte offset that's not NPTEPG will break these assumptions. Sponsored by: DARPA, Network Associates Laboratories	2003-02-24 20:29:52 +00:00
nyan	0c2cfddf38	The mpbiosreason variable does not used for pc98.	2003-02-24 14:36:03 +00:00
jake	5c99d4e544	Use the direct mapping of IdlePTD setup in locore for proc0's page directory, instead of allocating another page of kva and mapping it in again. This was likely an oversight in revision 1.174 (cut and paste from pmap_pinit). Discussed with: peter, tegge Sponsored by: DARPA, Network Associates Laboratories	2003-02-24 00:39:50 +00:00
tegge	762ff6ef1c	Allow machines with one CPU and a valid mp table to boot an SMP kernel.	2003-02-23 23:49:57 +00:00
jake	e12ad10b6b	Previous commit missed a 1 that should be NGPTD, and an NPDEPG that should be NPDEPTD. Grumble. Sponsored by: DARPA, Network Associates Laboratories	2003-02-23 22:12:08 +00:00
jake	c614cef2af	- Added macros NPGPTD, NBPTD, and NPDEPTD, for dealing with the size of the page directory. - Use these instead of the magic constants 1 or PAGE_SIZE where appropriate. There are still numerous assumptions that the page directory is exactly 1 page. Sponsored by: DARPA, Network Associates Laboratories	2003-02-23 21:20:00 +00:00
jake	8fafb8765a	- Added macros PDESHIFT and PTESHIFT, use these instead of magic constants in locore. - Removed the macros PTESIZE and PDESIZE, use sizeof instead in C. Sponsored by: DARPA, Network Associates Laboratories	2003-02-23 09:45:50 +00:00
alc	34f029068a	The root of the splay tree maintained within the pm_pteobj always refers to the last accessed pte page. Thus, the pm_ptphint is redundant and can be removed.	2003-02-22 23:43:08 +00:00
jake	e57fb3b9bb	unsigned -> pt_entry_t. Sponsored by: DARPA, Network Associates Laboratories	2003-02-22 23:41:27 +00:00
peter	d8b908a683	Fix fumble in rev 1.525. pmap_kenter()'s second argument is a physical address, not a page index. Laughed at by: jake	2003-02-20 05:35:52 +00:00
imp	cf874b345d	Back out M_* changes, per decision of the TRB. Approved by: trb	2003-02-19 05:47:46 +00:00
peter	c8ccde8063	Initiate de-orbit burn for USE_PCI_BIOS_FOR_READ_WRITE. This has been #if'ed out for a while. Complete the deed and tidy up some other bits. We need to be able to call this stuff from outer edges of interrupt handlers for devices that have the ISR bits in pci config space. Making the bios code mpsafe was just too hairy. We had also stubbed it out some time ago due to there simply being too much brokenness in too many systems. This adds a leaf lock so that it is safe to use pci_read_config() and pci_write_config() from interrupt handlers. We still will use pcibios to do interrupt routing if there is no acpi.. [yes, I tested this] Briefly glanced at by: imp	2003-02-18 03:36:49 +00:00
julian	af55753a06	Move a bunch of flags from the KSE to the thread. I was in two minds as to where to put them in the first case.. I should have listenned to the other mind. Submitted by: parts by davidxu@ Reviewed by: jeff@ mini@	2003-02-17 09:55:10 +00:00
jeff	590a39e29b	- Split the struct kse into struct upcall and struct kse. struct kse will soon be visible only to schedulers. This greatly simplifies much the KSE code. Submitted by: davidxu	2003-02-17 05:14:26 +00:00
jeff	aa384c931f	- Move ke_sticks, ke_iticks, ke_uticks, ke_uu, ke_su, and ke_iu back into the proc. These counters are only examined through calcru. Submitted by: davidxu Tested on: x86, alpha, UP/SMP	2003-02-17 02:19:58 +00:00
phk	5d80f8f84b	Change "dev_t gdbdev" to "void *gdb_arg", some possible paths for GDB will not have a dev_t.	2003-02-16 19:22:21 +00:00
phk	4bfb37f22e	Remove #include <sys/dkstat.h>	2003-02-16 14:13:23 +00:00
alc	4045291582	Assert that the kernel map's system mutex is held in pmap_growkernel().	2003-02-15 19:23:37 +00:00
alc	de76e76787	- Add a mutex for synchronizing the use of CMAP/CADDR 1 and 2. - Eliminate small style differences between pmap_zero_page(), pmap_copy_page(), etc.	2003-02-14 07:34:28 +00:00
obrien	85bd6322e6	Fix the style of the SCHED_4BSD commit.	2003-02-13 22:24:44 +00:00
peter	dad13b35df	Oops. I mis-remembered about the P4 problems. It was 5.0-DP2 that was shipped with DISABLE_PG_G and DISABLE_PSE, not 5.0-REL. blush Disable the code - but still leave it there in case its still lurking.	2003-02-13 02:42:06 +00:00
peter	cd0627ca28	Turn of PG_PS and PG_G for Pentium-4 cpus at boot time. This is so that we can stop turning off PG_G and PG_PS globally for releases.	2003-02-13 01:52:44 +00:00
alc	20b766196e	Remove kptobj. Instead, use VM_ALLOC_NOOBJ.	2003-02-12 04:35:37 +00:00
phk	2a777ce2e1	Switch to using the TSC code in i386/i386/tsc.c.	2003-02-11 11:43:25 +00:00
mike	b4e3f2f94a	Implement fpclassify(): o Add a MD header private to libc called _fpmath.h; this header contains bitfield layouts of MD floating-point types. o Add a MI header private to libc called fpmath.h; this header contains bitfield layouts of MI floating-point types. o Add private libc variables to lib/libc/$arch/gen/infinity.c for storing NaN values. o Add __double_t and __float_t to <machine/_types.h>, and provide double_t and float_t typedefs in <math.h>. o Add some C99 manifest constants (FP_ILOGB0, FP_ILOGBNAN, HUGE_VALF, HUGE_VALL, INFINITY, NAN, and return values for fpclassify()) to <math.h> and others (FLT_EVAL_METHOD, DECIMAL_DIG) to <float.h> via <machine/float.h>. o Add C99 macro fpclassify() which calls __fpclassify{d,f,l}() based on the size of its argument. __fpclassifyl() is never called on alpha because (sizeof(long double) == sizeof(double)), which is good since __fpclassifyl() can't deal with such a small `long double'. This was developed by David Schultz and myself with input from bde and fenner. PR: 23103 Submitted by: David Schultz <dschultz@uclink.Berkeley.EDU> (significant portions) Reviewed by: bde, fenner (earlier versions)	2003-02-08 20:37:55 +00:00
alc	387116adf4	MF alpha - Synchronize access to the allpmaps list with a mutex.	2003-02-08 05:41:41 +00:00
peter	9f68b995ed	Commit some cosmetic changes I had laying around and almost included with another commit. Unwrap a line. Unexpand a pmap_kenter().	2003-02-07 01:52:06 +00:00
phk	cb1b00f178	This file has no longer any content from the original Berkeley file so replace the UCB copyright with a FreeBSD 2 clause thing. Remove some no longer relevant comments.	2003-02-05 11:11:39 +00:00
phk	545eeb1024	i386/i386/tsc.c was repo-copied from i386/isa/clock.c. Remove all the stuff that does not relate to the TSC. Change the calibration to use DELAY(1000000) rather than trying to check it against the CMOS RTC, this drastically increases precision: Using 25 samples on a Athlon 700MHz UP machine I find: stddev min max average CMOS 22200 Hz -74980 Hz 34301 Hz 704928721 Hz DELAY 1805 Hz -1984 Hz 2678 Hz 704937583 Hz (The difference between the two averages is not statistically significant.) expressed in PPM of the frequency: stddev min max CMOS 31.49 PPM -106.37 PPM 48.66 PPM DELAY 2.56 PPM 2.81 PPM 3.80 PPM This code will not be used until a followup commit to sys/isa/clock.c and sys/pc98/pc98/clock.c which will only happen after some field testing.	2003-02-05 09:20:40 +00:00
phk	9a0e9d814a	Make get_cyclecount() use binuptime() when no tsc is available: it is cheaper.	2003-02-05 08:55:10 +00:00
harti	570156f24c	Fix a problem in bus_dmamap_load_{mbuf,uio} when the first mbuf or the first uio segment is empty. In this case no dma segment is create by bus_dmamap_load_buffer, but the calling routine clears the first flag. Under certain combinations of addresses of the first and second mbuf/uio buffer this leads to corrupted DMA segment descriptors. This was already fixed by tmm in sparc64/sparc64/iommu.c. PR: kern/47733 Reviewed by: sam Approved by: jake (mentor)	2003-02-04 16:30:27 +00:00
phk	3692879cc8	Split the global timezone structure into two integer fields to prevent the compiler from optimizing assignments into byte-copy operations which might make access to the individual fields non-atomic. Use the individual fields throughout, and don't bother locking them with Giant: it is no longer needed. Inspired by: tjr	2003-02-03 19:49:35 +00:00
jake	6b3763a173	Split statclock into statclock and profclock, and made the method for driving statclock based on profhz when profiling is enabled MD, since most platforms don't use this anyway. This removes the need for statclock_process, whose only purpose was to subdivide profhz, and gets the profiling clock running outside of sched_lock on platforms that implement suswintr. Also changed the interface for starting and stopping the profiling clock to do just that, instead of changing the rate of statclock, since they can now be separate. Reviewed by: jhb, tmm Tested on: i386, sparc64	2003-02-03 17:53:15 +00:00
alc	c41203831b	- Make allpmaps static. - Use atomic subtract to update the global wired pages count. (See also vm/vm_page.c revision 1.233.) - Assert that the page queue lock is held in pmap_remove_entry().	2003-02-03 00:05:11 +00:00
alfred	b5c0015ac9	Consolidate MIN/MAX macros into one place (param.h). Submitted by: Hiten Pandya <hiten@unixdaemons.com>	2003-02-02 13:17:30 +00:00
joe	457d099669	Put replace spaces with tabs in keeping with the rest of the file.	2003-02-01 18:45:18 +00:00
julian	e8efa7328e	Reversion of commit by Davidxu plus fixes since applied. I'm not convinced there is anything major wrong with the patch but them's the rules.. I am using my "David's mentor" hat to revert this as he's offline for a while.	2003-02-01 12:17:09 +00:00
phk	36fe9fb493	Make tsc_freq a 64bit quantity. Inspired by: http://www.theinquirer.net/?article=7481	2003-01-29 11:36:39 +00:00
scottl	27cbbd88a2	Implement bus_dmamem_alloc_size() and bus_dmamem_free_size() as counterparts to bus_dmamem_alloc() and bus_dmamem_free(). This allows the caller to specify the size of the allocation instead of it defaulting to the max_size field of the busdma tag. This is intended to aid in converting drivers to busdma. Lots of hardware cannot understand scatter/gather lists, which forces the driver to copy the i/o buffers to a single contiguous region before sending it to the hardware. Without these new methods, this would require a new busdma tag for each operation, or a complex internal allocator/cache for each driver. Allocations greater than PAGE_SIZE are rounded up to the next PAGE_SIZE by contigmalloc(), so this is not suitable for multiple static allocations that would be better served by a single fixed-length subdivided allocation. Reviewed by: jake (sparc64)	2003-01-29 07:25:27 +00:00
jake	ae6dbfd8a7	Remove BDE_DEBUGGER. Discussed with: bde	2003-01-28 19:05:44 +00:00
alc	6661ebbc1b	Merge pmap_testbit() and pmap_is_modified(). The latter is the only caller of the former.	2003-01-28 03:01:35 +00:00
julian	3f4ea4a408	Fix KSE related patch. Make it compile for the SMP case.. statclock_process() has changed prototypes.	2003-01-26 21:32:08 +00:00
davidxu	4b9b549ca2	Move UPCALL related data structure out of kse, introduce a new data structure called kse_upcall to manage UPCALL. All KSE binding and loaning code are gone. A thread owns an upcall can collect all completed syscall contexts in its ksegrp, turn itself into UPCALL mode, and takes those contexts back to userland. Any thread without upcall structure has to export their contexts and exit at user boundary. Any thread running in user mode owns an upcall structure, when it enters kernel, if the kse mailbox's current thread pointer is not NULL, then when the thread is blocked in kernel, a new UPCALL thread is created and the upcall structure is transfered to the new UPCALL thread. if the kse mailbox's current thread pointer is NULL, then when a thread is blocked in kernel, no UPCALL thread will be created. Each upcall always has an owner thread. Userland can remove an upcall by calling kse_exit, when all upcalls in ksegrp are removed, the group is atomatically shutdown. An upcall owner thread also exits when process is in exiting state. when an owner thread exits, the upcall it owns is also removed. KSE is a pure scheduler entity. it represents a virtual cpu. when a thread is running, it always has a KSE associated with it. scheduler is free to assign a KSE to thread according thread priority, if thread priority is changed, KSE can be moved from one thread to another. When a ksegrp is created, there is always N KSEs created in the group. the N is the number of physical cpu in the current system. This makes it is possible that even an userland UTS is single CPU safe, threads in kernel still can execute on different cpu in parallel. Userland calls kse_create to add more upcall structures into ksegrp to increase concurrent in userland itself, kernel is not restricted by number of upcalls userland provides. The code hasn't been tested under SMP by author due to lack of hardware. Reviewed by: julian	2003-01-26 11:41:35 +00:00
jeff	e7887da8bd	- Remove a redundant scheduler option. Pointy hat to: jeff Spotted by: dillon	2003-01-26 06:37:43 +00:00

... 3 4 5 6 7 ...

3896 Commits