freebsd-skq

Author	SHA1	Message	Date
peter	b9ca1b31c7	Move vm.pmap.pv_entry_count out from the PV_STATS ifdefs. It is always available and is a real counter, not a statistic.	2006-04-26 21:34:07 +00:00
jkim	18e73c2320	Check if reported HTT cores are physical cores. This commit does not affect AMD CPUs at all because HTT bit is disabled earlier. Intel multicore CPUs and ULE scheduler may be affected.	2006-04-25 00:06:37 +00:00
jkim	eefd58df92	Add another Intel CPU feature flag, xTPR (Send Task Priority Messages).	2006-04-24 22:56:57 +00:00
jkim	6b218fc19f	Check if deterministic cache parameters leaf is valid before use.	2006-04-24 22:23:52 +00:00
cperciva	900c118819	Adjust dangerous-shared-cache-detection logic from "all shared data caches are dangerous" to "a shared L1 data cache is dangerous". This is a compromise between paranoia and performance: Unlike the L1 cache, nobody has publicly demonstrated a cryptographic side channel which exploits the L2 cache -- this is harder due to the larger size, lower bandwidth, and greater associativity -- and prohibiting shared L2 caches turns Intel Core Duo processors into Intel Core Solo processors. As before, the 'machdep.hyperthreading_allowed' sysctl will allow even the L1 data cache to be shared. Discussed with: jhb, scottl Security: See FreeBSD-SA-05:09.htt for background material.	2006-04-24 21:17:01 +00:00
delphij	da32f1fb9a	Move AHC_REG_PRETTY_PRINT and AHD_REG_PRETTY_PRINT below their corresponding devices.	2006-04-24 08:44:34 +00:00
peter	0e7c77416b	Oops. Minidumps were developed on 6.x, in without the small pv entry code. Add some strategic dump_add_page()/dump_drop_page() lines to include pv chunks in the minidumps - these operate in the direct map region like UMA.	2006-04-21 04:50:18 +00:00
peter	dbae6322e8	Introduce minidumps. Full physical memory crash dumps are still available via the debug.minidump sysctl and tunable. Traditional dumps store all physical memory. This was once a good thing when machines had a maximum of 64M of ram and 1GB of kvm. These days, machines often have many gigabytes of ram and a smaller amount of kvm. libkvm+kgdb don't have a way to access physical ram that is not mapped into kvm at the time of the crash dump, so the extra ram being dumped is mostly wasted. Minidumps invert the process. Instead of dumping physical memory in in order to guarantee that all of kvm's backing is dumped, minidumps instead dump only memory that is actively mapped into kvm. amd64 has a direct map region that things like UMA use. Obviously we cannot dump all of the direct map region because that is effectively an old style all-physical-memory dump. Instead, introduce a bitmap and two helper routines (dump_add_page(pa) and dump_drop_page(pa)) that allow certain critical direct map pages to be included in the dump. uma_machdep.c's allocator is the intended consumer. Dumps are a custom format. At the very beginning of the file is a header, then a copy of the message buffer, then the bitmap of pages present in the dump, then the final level of the kvm page table trees (2MB mappings are expanded into a 4K page mappings), then the sparse physical pages according to the bitmap. libkvm can now conveniently access the kvm page table entries. Booting my test 8GB machine, forcing it into ddb and forcing a dump leads to a 48MB minidump. While this is a best case, I expect minidumps to be in the 100MB-500MB range. Obviously, never larger than physical memory of course. minidumps are on by default. It would want be necessary to turn them off if it was necessary to debug corrupt kernel page table management as that would mess up minidumps as well. Both minidumps and regular dumps are supported on the same machine.	2006-04-21 04:24:50 +00:00
imp	0e9911a7c4	Set the rid for a resoruce allocated with rman_reserve_resource.	2006-04-20 04:16:34 +00:00
cperciva	51d1ca0f6e	Correct a local information leakage bug affecting AMD FPUs. Security: FreeBSD-SA-06:14.fpu	2006-04-19 07:00:19 +00:00
peter	6030c4e1a5	If we're doing a try-alloc of a pv entry and give up early, do not forget to reduce the pv_entry_count counter. This was found by Tor Egge. In the same email, Tor also pointed out the pv_stats problem in the previous commit, but I'd forgotten about it until I went looking for this email about this allocation problem.	2006-04-18 20:17:32 +00:00
peter	be61087902	pv_entry_count is more than a statistic. It is used for resource limiting. Do not compile out its counter updates if pv entry stats are turned off.	2006-04-18 20:11:00 +00:00
alc	aac2697d98	Include opt_pmap.h for PMAP_SHPGPERPROC. PR: 94509	2006-04-13 03:31:48 +00:00
alc	a7e3d6f83b	Retire pmap_track_modified(). We no longer need it because we do not create managed mappings within the clean submap. To prevent regressions, add assertions blocking the creation of managed mappings within the clean submap. Reviewed by: tegge	2006-04-12 04:22:52 +00:00
ps	cc2c59e66f	Hook bce up to the build	2006-04-10 20:04:22 +00:00
jhb	1dfdfa5677	Cache the value of the lower half of each I/O APIC redirection table entry so that we only have to do an ioapic_write() instead of an ioapic_read() followed by an ioapic_write() every time we mask and unmask level triggered interrupts. This cuts the execution time for these operations roughly in half. Profiled by: Paolo Pisati <p.pisati@oltrelinux.com> MFC after: 1 week	2006-04-05 20:43:19 +00:00
peter	bfd11ed701	Convert pv_entry_frees and pv_entry_allocs stats counters from int to long, they wrap way too quickly.	2006-04-04 20:17:35 +00:00
marcel	78f0584b0b	Sync with i386: Map exceptions to signals in gdb_cpu_signal() so that kgdb(1) gets a SIGTRAP when it needs to. Pointed out by: grehan@	2006-04-04 03:00:20 +00:00
marcel	dc8b7dcaa1	The PC is register 16, not 18. Pointed out by: grehan@	2006-04-04 02:44:51 +00:00
marcel	8278e2d5fb	Eliminate HAVE_STOPPEDPCBS. On ia64 the PCPU holds a pointer to the PCB in which the context of stopped CPUs is stored. To access this PCB from KDB, we introduce a new define, called KDB_STOPPEDPCB. The definition, when present, lives in <machine/kdb.h> and abstracts where MD code saves the context. Define KDB_STOPPEDPCB on i386, amd64, alpha and sparc64 in accordance to previous code.	2006-04-03 22:51:47 +00:00
peter	3a90816456	Shrink the amd64 pv entry from 48 bytes to about 24 bytes. On a machine with large mmap files mapped into many processes, this saves hundreds of megabytes of ram. pv entries were individually allocated and had two tailq entries and two pointers (or addresses). Each pv entry was linked to a vm_page_t and a process's address space (pmap). It had the virtual address and a pointer to the pmap. This change replaces the individual allocation with a per-process allocation system. A page ("pv chunk") is allocated and this provides 168 pv entries for that process. We can now eliminate one of the 16 byte tailq entries because we can simply iterate through the pv chunks to find all the pv entries for a process. We can eliminate one of the 8 byte pointers because the location of the pv entry implies the containing pv chunk, which has the pointer. After overheads from the pv chunk bitmap and tailq linkage, this works out that each pv entry has an effective size of 24.38 bytes. Future work still required, and other problems: * when running low on pv entries or system ram, we may need to defrag the chunk pages and free any spares. The stats (vm.pmap.) show that this doesn't seem to be that much of a problem, but it can be done if needed. running low on pv entries is now a much bigger problem. The old get_pv_entry() routine just needed to reclaim one other pv entry. Now, since they are per-process, we can only use pv entries that are assigned to our current process, or by stealing an entire page worth from another process. Under normal circumstances, the pmap_collect() code should be able to dislodge some pv entries from the current process. But if needed, it can still reclaim entire pv chunk pages from other processes. * This should port to i386 really easily, except there it would reduce pv entries from 24 bytes to about 12 bytes. (I have integrated Alan's recent changes.)	2006-04-03 21:36:01 +00:00
peter	0f363b7d24	Remove the unused sva and eva arguments from pmap_remove_pages().	2006-04-03 21:16:10 +00:00
alc	af01e3f809	Introduce pmap_try_insert_pv_entry(), a function that conditionally creates a pv entry if the number of entries is below the high water mark for pv entries. Use pmap_try_insert_pv_entry() in pmap_copy() instead of pmap_insert_entry(). This avoids possible recursion on a pmap lock in get_pv_entry(). Eliminate the explicit low-memory checks in pmap_copy(). The check that the number of pv entries was below the high water mark was largely ineffective because it was located in the outer loop rather than the inner loop where pv entries were allocated. Instead of checking, we attempt the allocation and handle the failure. Reviewed by: tegge Reported by: kris MFC after: 5 days	2006-04-02 05:45:05 +00:00
emax	f245d640bb	Add kbdmux(4) to GENERIC on amd64 Requested by: scottl Tested by: scottl	2006-03-31 23:04:48 +00:00
scottl	725c458dc3	Hook the MFI driver up to the build.	2006-03-29 09:57:22 +00:00
jhb	3718b3713e	If the XSDT address in the RSDP for an ACPI 2.0 machine is NULL, then fall back to using the RSDT instead. ACPI-CA already follows this same strategy as a workaround for yet another instance of brain-damaged BIOS writers. PR: i386/93963 Submitted by: Masayuki FUKUI <fukui.FreeBSD@fanet.net>	2006-03-27 15:59:48 +00:00
alc	108c9331c3	Eliminate unnecessary invalidations of the entire TLB by pmap_remove(). Specifically, on mappings with PG_G set pmap_remove() not only performs the necessary per-page invlpg invalidations but also performs an unnecessary invalidation of the entire set of non-PG_G entries. Reviewed by: tegge	2006-03-21 18:07:42 +00:00
davidxu	9f834e1bd5	Remove stale KSE code. Reviewed by: alc	2006-03-21 06:46:27 +00:00
jhb	e11865e4b1	Drop some unneeded casts since we program the kernel in C rather than C++.	2006-03-20 19:39:08 +00:00
netchild	0c0762be80	regen: fix of linuxolator with testing in a cross-build	2006-03-20 18:54:29 +00:00
netchild	868fb79586	Fix the linuxolator on amd64 (cross-build).	2006-03-20 18:53:26 +00:00
ru	324d968ad4	Regen.	2006-03-19 11:12:41 +00:00
ru	106242f7bb	Unbreak COMPAT_LINUX32 option support on amd64. Broken by: netchild	2006-03-19 11:10:33 +00:00
netchild	39276e2b1e	regen	2006-03-18 20:49:01 +00:00
ups	1a84cbb2c6	Enable global pages TLB extension on Application Processors. MFC after: 3 days	2006-03-18 19:32:46 +00:00
netchild	8fd6664412	regen after COMPAT_43 removal	2006-03-18 18:24:38 +00:00
netchild	c1829f604c	Get rid of the need of COMPAT_43 in the linuxolator. Submitted by: Divacky Roman <xdivac02@stud.fit.vutbr.cz> Obtained from: DragonFly (some parts)	2006-03-18 18:20:17 +00:00
jhb	aaa33da2ed	Don't allow userland to set hardware watch points on kernel memory at all. Previously, we tried to allow this only for root. However, we were calling suser() on the target process rather than the current process. This means that if you can ptrace() a process running as root you can set a hardware watch point in the kernel. In practice I think you probably have to be root in order to pass the p_candebug() checks in ptrace() to attach to a process running as root anyway. Rather than fix the suser(), I just axed the entire idea, as I can't think of any good reason _at all_ for userland to set hardware watch points for KVM. MFC after: 3 days Also thinks hardware watch points on KVM from userland are bad: bde, rwatson	2006-03-14 16:13:55 +00:00
peter	ef2d575a8e	Merge/sync with i386: various cosmetic tweaks	2006-03-14 00:01:56 +00:00
peter	3a37e4a2b0	MFi386: The SIGFPE macros were moved to signal.h (FPE_INTOVF etc)	2006-03-14 00:01:22 +00:00
peter	079fb5c5f1	MFi386: rename pcib_devclass to hostb_devclass (cosmetic here)	2006-03-13 23:58:40 +00:00
peter	02d811ad5e	MFi386: add a TRAP_INTERRUPT case	2006-03-13 23:56:44 +00:00
peter	635b89f062	Cosmetic sync with i386	2006-03-13 23:55:31 +00:00
ps	fb5edd1dcd	Fix the format/display descriptor of vm.kmem_size and vm.kmem_free to be 'long' instead of 'int' so that sysctl(8) correctly displays the 8 returned bytes as a single 'long' instead of two 'int' values. Submitted by: peter	2006-03-13 08:13:37 +00:00
jhb	329536bd48	Flip the switch and don't route interrupts to hyperthreads in a HT system. In at least one benchmark this showed around a 20% performance increase. If other workloads do benefit from having hyperthreads service interrupts, we can always make this a loader tunable. MFC after: 3 days Tested by: ps	2006-03-09 16:38:52 +00:00
ups	5ad34fd1d6	Fix exec_map resource leaks. Tested by: kris@	2006-03-08 20:21:54 +00:00
yar	da15630ac2	MFi386 revision 1.1220: options TDFX_LINUX --> device tdfx_linux	2006-03-06 15:29:28 +00:00
sam	2c5d022749	guard function decls with _KERNEL so user code can include this file	2006-03-01 05:59:56 +00:00
jhb	3478c467ee	Rework how we wire up interrupt sources to CPUs: - Throw out all of the logical APIC ID stuff. The Intel docs are somewhat ambiguous, but it seems that the "flat" cluster model we are currently using is only supported on Pentium and P6 family CPUs. The other "hierarchy" cluster model that is supported on all Intel CPUs with local APICs is severely underdocumented. For example, it's not clear if the OS needs to glean the topology of the APIC hierarchy from somewhere (neither ACPI nor MP Table include it) and setup the logical clusters based on the physical hierarchy or not. Not only that, but on certain Intel chipsets, even though there were 4 CPUs in a logical cluster, all the interrupts were only sent to one CPU anyway. - We now bind interrupts to individual CPUs using physical addressing via the local APIC IDs. This code has also moved out of the ioapic PIC driver and into the common interrupt source code so that it can be shared with MSI interrupt sources since MSI is addressed to APICs the same way that I/O APIC pins are. - Interrupt source classes grow a new method pic_assign_cpu() to bind an interrupt source to a specific local APIC ID. - The SMP code now tells the interrupt code which CPUs are avaiable to handle interrupts in a simpler and more intuitive manner. For one thing, it means we could now choose to not route interrupts to HT cores if we wanted to (this code is currently in place in fact, but under an #if 0 for now). - For now we simply do static round-robin of IRQs to CPUs when the first interrupt handler just as before, with the change that IRQs are now bound to individual CPUs rather than groups of up to 4 CPUs. - Because the IRQ to CPU mapping has now been moved up a layer, it would be easier to manage this mapping from higher levels. For example, we could allow drivers to specify a CPU affinity map for their interrupts, or we could allow a userland tool to bind IRQs to specific CPUs. The MFC is tentative, but I want to see if this fixes problems some folks had with UP APIC kernels on 6.0 on SMP machines (an SMP kernel would work fine, but a UP APIC kernel (such as GENERIC in RELENG_6) would lose interrupts). MFC after: 1 week	2006-02-28 22:24:55 +00:00
dwmalone	fab7fda621	It seems bit 5 of cpu_feature2 is the VMX (Virtual Machine Extensions) bit. While I'm here, delete a comment that was cut and past from the cpu_features code that doesn't belong here.	2006-02-15 14:48:59 +00:00

1 2 3 4 5 ...

4584 Commits