hold the kernel text, data and loader metadata by not using a fixed slot
to store the TSB page(s) into. Enter fake 8k page entries into the kernel
TSB that cover the 4M kernel page(s), so that pmap_kenter() will work
without having to treat these pages as a special case.
Problem reported by: mjacob, obrien
Problem spotted and 4M page handling proposed by: jake
and cpu_critical_exit() and moves associated critical prototypes into their
own header file, <arch>/<arch>/critical.h, which is only included by the
three MI source files that need it.
Backout and re-apply improperly committed syntactical cleanups made to files
that were still under active development. Backout improperly committed program
structure changes that moved localized declarations to the top of two
procedures. Partially re-apply one of the program structure changes to
move 'mask' into an intermediate block rather than into three separate
sub-blocks to make the code more readable. Re-integrate bug fixes that Jake
made to the sparc64 code.
Note: In general, developers should not gratuitously move declarations out
of sub-blocks. They are where they are for reasons of structure, grouping,
readability, compiler-localizability, and to avoid developer-introduced bugs
similar to several found in recent years in the VFS and VM code.
Reviewed by: jake
code can use it. This takes a single constant argument and fails to compile
if it is 0 (false). The main application of this is to make assertions about
structure sizes at compile time, in order to validate assumptions made in
other code. Examples:
CTASSERT(sizeof(struct foo) == FOO_SIZEOF);
CTASSERT(sizeof(struct foo) == (1 << FOO_SHIFT));
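A minimal sketch of how such a macro can be implemented (the committed
definition in <sys/systm.h> may differ in detail): declare an array type
whose size is negative when the condition is false, so the compiler
rejects the translation unit; the extra macro layer is needed so that
__LINE__ expands before token pasting.
	#define CTASSERT(x)		_CTASSERT(x, __LINE__)
	#define _CTASSERT(x, y)		__CTASSERT(x, y)
	#define __CTASSERT(x, y)	typedef char __assert ## y [(x) ? 1 : -1]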
Requested by: jhb, phk
disablement assumptions in kern_fork.c by adding another API call,
cpu_critical_fork_exit(). Clean up the td_savecrit field by moving it
from MI to MD. Temporarily move cpu_critical*() from <arch>/include/cpufunc.h
to <arch>/<arch>/critical.c (stage-2 will clean this up).
Implement interrupt deferral for i386 that allows interrupts to remain
enabled inside critical sections. This also fixes an IPI interlock bug,
and requires uses of icu_lock to be enclosed in a true interrupt disablement.
This is the stage-1 commit. Stage-2 will occur after stage-1 has stabilized,
and will move cpu_critical*() into its own header file(s) + other things.
This commit may break non-i386 architectures in trivial ways. This should
be temporary.
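A rough sketch of the deferral idea (names and layout here are
illustrative, not the actual i386 code): interrupts stay enabled while
the nesting count is non-zero, a handler that fires in that window only
marks itself pending, and the pending work is run when the outermost
critical section is exited.
	static int crit_nest;		/* critical section nesting count */
	static int int_pending;		/* interrupts deferred while nested */

	static void
	critical_enter(void)
	{
		crit_nest++;		/* note: interrupts stay enabled */
	}

	static void
	critical_exit(void)
	{
		if (--crit_nest == 0 && int_pending) {
			int_pending = 0;
			/* replay the interrupts deferred while nested */
		}
	}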
Reviewed by: core
Approved by: core
- change the IOMMU support code so that it supports overcommitting the
available DVMA memory, while still allocating as lazily as possible.
This is achieved by limiting the preallocation, and deferring the
allocation to map load time when it fails. In the latter case, the
DVMA memory reserved for unloaded maps can be stolen to free up enough
memory for loading a map.
- allow NULL settings in the method tables, and search the parent tags
until an appropriate implementation is found. This allows some kluges in
the old implementation to be removed (see the sketch below).
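A sketch of the parent-tag search, using hypothetical structure and
method names rather than the real bus DMA interface:
	#include <stddef.h>

	struct dma_tag {
		struct dma_tag	*dt_parent;		/* NULL at the root tag */
		int		(*dt_map_load)(void);	/* NULL means inherit */
	};

	/* Return the first non-NULL implementation found walking up the chain. */
	static int (*dmamap_load_method(struct dma_tag *dt))(void)
	{
		for (; dt != NULL; dt = dt->dt_parent)
			if (dt->dt_map_load != NULL)
				return (dt->dt_map_load);
		return (NULL);
	}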
the bus-dependent code and to be able to support more systems. The core
of the new code is mostly obtained from NetBSD.
Kluge the interrupt routing methods of the psycho and apb drivers so
that an intline of 0 can be handled for now; real routing is still not
possible (all intline registers are preinitialized instead); this will
require a sparc64-specific adaptation of the driver for generic PCI-PCI
bridges with a custom routing method to work right.
not blocked by raising the pil, a receiver may be interrupted while holding
a spinlock. If the sender does not defer interrupts throughout the entire
operation it may be interrupted and try to acquire a spinlock held by a
receiver, leading to a deadlock due to the synchronization used by the
ipi handlers themselves.
Submitted by: tmm
wait for those cpus, instead of all of them, by using a count. Oops.
Make the pointer to the mask that the primary cpu spins on volatile, so
gcc doesn't optimize out an important load. Oops again.
Activate tlb shootdown ipi synchronization now that it works. We have
all involved cpus wait until all the others are done. This may not be
necessary; it is mostly for sanity.
Make the trigger level interrupt ipi handler work.
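The volatile qualifier matters because the wait loop otherwise gives gcc
license to hoist the load out of the loop; a stripped-down illustration
(not the actual code):
	static volatile unsigned int *ipi_wait_mask; /* targets clear their bits */

	static void
	ipi_wait(unsigned int cpus)
	{
		/* Without volatile, gcc may read *ipi_wait_mask once and
		 * spin forever on the cached value. */
		while ((*ipi_wait_mask & cpus) != 0)
			;
	}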
Submitted by: tmm
the number of physical pages per KVA page allocated scales properly with
memory size. This fixes problems with kmem_map being too small.
Noticed by: mike, wollman
Submitted by: tmm
o In i386's <machine/endian.h>, macros have some advantages over
inlines, so change some inlines to macros.
o In i386's <machine/endian.h>, ungarbage-collect word_swap_int()
(previously __uint16_swap_uint32), since it still has some uses on
i386 systems with PDP endianness.
Submitted by: bde
o Move a comment up in <machine/endian.h> that was accidentally moved
down a few revisions ago.
o Reenable userland's use of optimized inline-asm versions of
byteorder(3) functions.
o Fix ordering of prototypes vs. redefinition of byteorder(3)
functions, so that the non-GCC (libc asm) case has proper
prototypes.
o Add proper prototypes for byteorder(3) functions in <sys/param.h>.
o Prevent redundant duplicate prototypes by making use of the
_BYTEORDER_PROTOTYPED define (sketched below).
o Move the bswap16(), bswap32(), bswap64() C functions into MD space
for platforms in which asm versions don't exist. This significantly
reduces the complexity of some things at the cost of duplicate code.
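The prototype guard works roughly as sketched here; this is illustrative,
and the real headers spell the argument types with the underlying
compiler-specific integer types rather than <stdint.h> names:
	#ifndef _BYTEORDER_PROTOTYPED
	#define _BYTEORDER_PROTOTYPED
	__BEGIN_DECLS
	uint32_t	htonl(uint32_t);
	uint16_t	htons(uint16_t);
	uint32_t	ntohl(uint32_t);
	uint16_t	ntohs(uint16_t);
	__END_DECLS
	#endif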
Reviewed by: bde
than the other implementations; we have complete control over the tlb, so we
only demap specific pages. We take advantage of the ranged tlb flush api
to send one ipi for a range of pages, and due to the pm_active optimization
we rarely send ipis for demaps from user pmaps.
Remove now unused routines to load the tlb; this is only done once outside
of the tlb fault handlers.
Minor cleanups to the smp startup code.
This boots multi user with both cpus active on a dual ultra 60 and on a
dual ultra 2.
Due to allocating tlb contexts on the fly, we only ever need to demap the
primary context, non-primary contexts have already been implicitly flushed
by context switching. All we really need to know is whether it's a kernel
demap or not, and it's easier just to compare against the kernel_pmap, which is a
constant.
on the loader to do it. Improve smp startup code to be less racy and to
defer certain things until the right time. This almost boots single user
on my dual ultra 60, but it is still very fragile:
SMP: AP CPU #1 Launched!
Enter full pathname of shell or RETURN for /bin/sh:
# ls
Debugger("trapsig")
Stopped at Debugger+0x1c: ta %xcc, 1
db> heh
No such command
db>
with pmaps. When the context numbers wrap around we flush all user mappings
from the tlb. This makes use of the array indexed by cpuid to allow a pmap
to have a different context number on a different cpu. If the context numbers
are then divided evenly among cpus such that none are shared, we can avoid
sending tlb shootdown ipis in an smp system for non-shared pmaps. This also
removes a limit of 8192 processes (pmaps) that could be active at any given
time due to running out of tlb contexts.
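A rough sketch of the per-cpu allocation and wraparound; the constants,
field names, and flush routine here are made up for illustration:
	#define	MAXCPU		16	/* illustrative */
	#define	CTX_FIRST	1	/* context 0 is reserved for the kernel */
	#define	CTX_MAX		8192	/* hardware contexts per tlb */

	struct pmap {
		unsigned int	pm_context[MAXCPU];	/* indexed by cpuid */
	};

	static unsigned int next_context[MAXCPU];	/* set to CTX_FIRST at boot */

	extern void tlb_flush_user(void);		/* hypothetical flush routine */

	static unsigned int
	ctx_alloc(struct pmap *pm, int cpuid)
	{
		if (next_context[cpuid] == CTX_MAX) {
			/* Wrapped around: flush all user mappings from this tlb. */
			tlb_flush_user();
			next_context[cpuid] = CTX_FIRST;
		}
		pm->pm_context[cpuid] = next_context[cpuid]++;
		return (pm->pm_context[cpuid]);
	}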
Inspired by: the brown book
Crucial bugfix from: tmm
device drivers for bus systems with an endianness other than the CPU's (using
interfaces compatible with NetBSD):
- bswap16() and bswap32(). These have optimized implementations on some
architectures; for those that don't, there exist generic implementations.
- macros to convert from a certain byte order to host byte order and vice
versa, using a naming scheme like le16toh(), htole16().
These are implemented using the bswap functions.
- stream bus space access functions, which do not perform a byte order
conversion (while the normal access functions would if the bus endianness
differs from the CPU endianness).
htons(), htonl(), ntohs() and ntohl() are implemented using the new
functions above for kernel usage. None of the above interfaces is currently
exported to user land.
Make use of the new functions in a few places where local implementations
of the same functionality existed.
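For platforms with no assembler version, the generic C fallbacks look
roughly like this (a sketch; the committed versions may differ in detail),
and on a little-endian CPU the conversion macros reduce to either a no-op
or a swap:
	#include <stdint.h>

	static inline uint16_t
	bswap16(uint16_t x)
	{
		return ((uint16_t)((x << 8) | (x >> 8)));
	}

	static inline uint32_t
	bswap32(uint32_t x)
	{
		return ((x << 24) | ((x << 8) & 0x00ff0000) |
		    ((x >> 8) & 0x0000ff00) | (x >> 24));
	}

	/* As they come out on a little-endian CPU; big endian is the mirror image. */
	#define	htole16(x)	((uint16_t)(x))		/* no conversion needed */
	#define	le16toh(x)	((uint16_t)(x))
	#define	htobe16(x)	bswap16(x)
	#define	be16toh(x)	bswap16(x)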
Reviewed by: mike, bde
Tested on alpha by: mike
workloads. It tapers off after that, as gcc's working set generally just fits.
compiling bin/csh:
TSB_PAGES = 2
213.33 real 77.59 user 110.01 sys
TSB_PAGES = 4
116.43 real 75.78 user 19.16 sys
TSB_PAGES = 8
119.27 real 76.38 user 18.12 sys
Testing by: tmm
until we do some testing to see what's best. This gives a massive reduction
in system time for processes with a relatively large working set. The size
of the tsb directly affects the rss size that a user process can keep mapped.
When it starts to get full, replacements occur and the process takes a lot of
soft vm faults. Increasing the default from 1 page to 2 gives the following
before and after numbers for compiling vfs_bio.c:
before:
14.27 real 6.56 user 5.69 sys
after:
8.57 real 6.11 user 1.62 sys
This should make self hosted builds more tolerable.
window to the user stack while in a nested kernel trap. We do this for
entry to the kernel from user mode, but if we get an interrupt in kernel
mode while there are still user windows in the cpu, and we attempt to spill
to the user stack, we may take too many nested traps and overflow the trap
stack, causing a red state exception. This is needed by upcoming changes
to allow the user tsb to not be locked in the tlb.
Reviewed by: tmm
virtual page number in a much more convenient way: all in one piece. This
greatly simplifies the comparison for a matching tte, and allows the fault
handlers to be much simpler due to not having to load weird masks.
Rewrite the tlb fault handlers to account for the new format. These are also
written to allow faults on the user tsb inside of the fault handlers; the
kernel fault handler must be aware of this and not clobber the other's
registers. The faults do not yet occur due to other support that is needed
(and still under my desk).
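With the whole virtual page number in one field, the match test collapses
to a single comparison plus a valid-bit check, roughly as below; the field
and macro names are approximations, and a 64-bit long is assumed:
	#define	TD_V		(1UL << 63)	/* valid bit; illustrative position */
	#define	TV_VPN(va)	((va) >> 13)	/* 8K page shift; illustrative */

	struct tte {
		unsigned long	tte_vpn;	/* virtual page number, in one piece */
		unsigned long	tte_data;	/* valid bit, pa, protection bits */
	};

	static int
	tte_match(struct tte *tp, unsigned long va)
	{
		return ((tp->tte_data & TD_V) != 0 && tp->tte_vpn == TV_VPN(va));
	}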
Bug fixes from: tmm
will be used to reduce the number of tlb shootdown ipis in an smp system
by sending one ipi for a whole range of pages, instead of one per page.
Munge the context demap operations slightly to support demapping a non-primary
context.