freebsd-dev

Author	SHA1	Message	Date
Yaroslav Tykhiy	042bbfae5a	The i386 "call" instruction works as follows: it pushes the return address on the stack and only then "dereferences" %pc. Therefore, in the case of a call to an invalid address, we arrive to the trap handler with the invalid value in tf_eip. This used to prevent db_backtrace() from assigning the most recent and interesting frame on the stack to the right spot in the right function, from which the invalid call was attempted. Try to detect and work around that by recovering the return address from the stack. The work-around requires the fault address be passed to db_backtrace(). Smuggle it as tf_err. MFC after: 1 month Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-18 12:07:00 +00:00
Matt Jacob	375e362989	Unbreak tinderbox- fix device_printf arg to accomodate different sizes of vm_paddr_t in different contexts (e.g., PAE vs. non PAE).	2006-06-16 14:04:21 +00:00
Yaroslav Tykhiy	a436dbf123	Return -1 from db_numargs() if number of args couldn't be guessed. Use this later to indicate in backtrace output that args shown are uncertain. Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-16 11:49:37 +00:00
Yaroslav Tykhiy	70b906ae82	Guess the number of arguments to a function somewhat better. Now GCC likes to stick a "mov %eax, %FOO" instruction before "addl $BAR, %esp" if the function just called returns an int, which is a very common case in the kernel. Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-16 11:14:54 +00:00
Alexander Leidinger	28a3ae7f88	Remove COMPAT_43 from GENERIC (and other kernel configs). For amd64 there's an explicit comment that it's needed for the linuxolator. This is not the case anymore. For all other architectures there was only a "KEEP THIS". I'm (and other people too) running a COMPAT_43-less kernel since it's not necessary anymore for the linuxolator. Roman is running such a kernel for a for longer time. No problems so far. And I doubt other (newer than ia32 or alpha) architectures really depend on it. This may result in a small performance increase for some workloads. If the removal of COMPAT_43 results in a not working program, please recompile it and all dependencies and try again before reporting a problem. The only place where COMPAT_43 is needed (as in: does not compile without it) is in the (outdated/not usable since too old) svr4 code. Note: this does not remove the COMPAT_43TTY option. Nagging by: rdivacky	2006-06-15 19:58:53 +00:00
Stephan Uphoff	2053c12705	Remove mpte optimization from pmap_enter_quick(). There is a race with the current locking scheme and removing it should have no measurable performance impact. This fixes page faults leading to panics in pmap_enter_quick_locked() on amd64/i386. Reviewed by: alc,jhb,peter,ps	2006-06-15 01:01:06 +00:00
Alexander Leidinger	4946fe7c4d	regen after MFP4 (soc2006/rdivacky_linuxolator) of syscalls.master P4-Changes: similar to 98673 and 98675 but regenerated locally Sponsored by: Google SoC 2006 Submitted by: rdivacky	2006-06-13 18:48:30 +00:00
Alexander Leidinger	c8b579c182	MFP4 (soc2006/rdivacky_linuxolator) Update of syscall.master: o Adding of several new dummy syscalls (268-310) o Synchronization of amd64 syscall.master with i386 one o Auditing added to amd64 syscall.master o Change auditing type for lstat syscall (bugfix). [1] P4-Changes: 98672, 98674 Noticed by: rwatson [1] Sponsored by: Google SoC 2006 Submitted by: rdivacky	2006-06-13 18:43:55 +00:00
David Xu	b41f1452d9	Add scheduler CORE, the work I have done half a year ago, recent, I picked it up again. The scheduler is forked from ULE, but the algorithm to detect an interactive process is almost completely different with ULE, it comes from Linux paper "Understanding the Linux 2.6.8.1 CPU Scheduler", although I still use same word "score" as a priority boost in ULE scheduler. Briefly, the scheduler has following characteristic: 1. Timesharing process's nice value is seriously respected, timeslice and interaction detecting algorithm are based on nice value. 2. per-cpu scheduling queue and load balancing. 3. O(1) scheduling. 4. Some cpu affinity code in wakeup path. 5. Support POSIX SCHED_FIFO and SCHED_RR. Unlike scheduler 4BSD and ULE which using fuzzy RQ_PPQ, the scheduler uses 256 priority queues. Unlike ULE which using pull and push, the scheduelr uses pull method, the main reason is to let relative idle cpu do the work, but current the whole scheduler is protected by the big sched_lock, so the benefit is not visible, it really can be worse than nothing because all other cpu are locked out when we are doing balancing work, which the 4BSD scheduelr does not have this problem. The scheduler does not support hyperthreading very well, in fact, the scheduler does not make the difference between physical CPU and logical CPU, this should be improved in feature. The scheduler has priority inversion problem on MP machine, it is not good for realtime scheduling, it can cause realtime process starving. As a result, it seems the MySQL super-smack runs better on my Pentium-D machine when using libthr, despite on UP or SMP kernel.	2006-06-13 13:12:56 +00:00
Marius Strobl	acb8c14985	Make the ISAPNP code optional and only enable it on i386 and pc98 (used for CBUS-PNP cards there) by default, as there are no amd64 and sparc64 machines with ISA slots and which therefore could make use of this code known to exist. For sparc64 this additionally allows to get rid of the compat shims for in{b,w,l}()/out{b,w,l}() etc and the associated hacks. OK'ed by: imp, peter	2006-06-12 21:07:13 +00:00
John Baldwin	e3d7caf487	Enable a few more things in x86 NOTES to get broader LINT coverage: - Turn on iwi(4), ipw(4), and ndis(4) on amd64 and i386. - Turn on ral(4) and ural(4) on i386, pc98, and amd64.	2006-06-12 20:38:17 +00:00
Alan Cox	b74a62d602	Don't invalidate the TLB in pmap_qenter() unless the old mapping was valid. Most often, it isn't. Reviewed by: tegge@	2006-06-12 20:05:27 +00:00
Warner Losh	78878cef94	Add the ability to subset the devices that UART pulls in. This allows the arm to compile without all the extras that don't appear, at least not in the flavors of ARM I deal with. This helps us save about 100k. If I've botched the available devices on a platform, please let me know and I'll correct ASAP.	2006-06-12 04:21:50 +00:00
Nate Lawson	dd311cb41a	* Ask for a page-aligned page instead of an arbitrary address. This should not be necessary but might be helpful and at least reduce fragmentation. * Add an assert to detect if the wakecode ever grows too big. We include 1 KB for stack, which should be more than enough also. * Remove unnecessary initialization of static variables. * Add comments and a bootverbose print giving the page phys address.	2006-06-10 08:20:17 +00:00
Nate Lawson	716d09af5e	Minor tweaks to the resume code. Previous commit reverted alignment back to 4. There is no need to be more strict at assembly time since we copy the code anyway to a private page. * Clear the direction flag and eflags. Probably not necessary but it won't hurt to be safe. * Add prefixes to all instructions to prevent any assembler mistakes. * Remove zeroing of eax - edi. We use those registers immediately after to transfer values to protected mode so this was pointless. * Update comments to reflect info found during code review.	2006-06-10 08:20:03 +00:00
Nate Lawson	b46f4324ff	Move the reset beep tunable/sysctl to debug.acpi.resume_beep. This makes more sense than under hw.acpi. Also, document this in the man page.	2006-06-10 08:06:16 +00:00
Nate Lawson	64297e67ab	Minor tweaks to the resume code that might help people debug. * Add hw.acpi.resume_beep tunable and sysctl, default to 0. Beeps the PC speaker soon after waking to diagnose whether the wakeup code is even getting run before other drivers possibly hang the system. To stop the beep, cause another beep (i.e. keyboard bell). Submitted by takawata@, I changed the frequency to be lower. * Use 4096 instead of 4 byte alignment. Might be useful although doesn't seem to be necessary. * Remove a useless assignment to acpi_reset_video. It was overwritten by the default sysctl value anyway.	2006-06-08 17:54:10 +00:00
Alan Cox	ce142d9ec0	Introduce the function pmap_enter_object(). It maps a sequence of resident pages from the same object. Use it in vm_map_pmap_enter() to reduce the locking overhead of premapping objects. Reviewed by: tegge@	2006-06-05 20:35:27 +00:00
Ed Maste	f4eaa4b967	Fix cut-n-pasteo: use the i386 version #define for i386 dumps, not the amd64 one.	2006-06-05 18:21:29 +00:00
Alan Cox	62b5e735a6	MFamd64 Eliminate unnecessary, recursive acquisitions and releases of the page queues lock by free_pv_entry() and pmap_remove_pages(). Reduce the scope of the page queues lock in pmap_remove_pages().	2006-06-05 06:08:21 +00:00
Mike Silbersack	f25d341cfb	After much discussion with mjacob and scottl, change bus_dmamem_alloc so that it just warns the user with a printf when it misaligns a piece of memory that was requested through a busdma tag. Some drivers (such as mpt, and probably others) were asking for alignments that could not be satisfied, but as far as driver operation was concerned, that did not matter. In the theory that other drivers will fall into this same category, we agreed that panicing or making the allocation fail will cause more hardship than is necessary. The printf should be sufficient motivation to get the driver glitch fixed.	2006-06-01 04:49:29 +00:00
Matt Jacob	aa57a87a56	Turn the panic on not being able to meet alignment constraints in bus_dmamem_alloc into the more reasonable EINVAL return. Also, reclaim memory allocated but then not used if we had an error return.	2006-05-31 00:37:56 +00:00
David Xu	f1c313bff2	Clear invalid bits only if CPU supports SSE, otherwise, some fields in struct save87 will be cleared unexpectly.	2006-05-31 00:17:29 +00:00
David Xu	afedf1a7f1	Use the method described in IA-32 Intel Architecture Software Developer's Manual chapter 11.6.6 to get valid mxcsr bits, use the mxcsr mask to clear invalid bits passed by user code. Reviewed by: bde	2006-05-30 23:44:21 +00:00
David Xu	5d84379dd6	Backout changes trying to inherit floating-point environment, although POSIX (susv3) requires this, but it is unclear what should be inherited, duplicating whole 387 stack for new thread seems to be unnecessary and dangerous. Revert to previous code, force a new thread to be started with clean FP state.	2006-05-29 02:58:37 +00:00
Mike Silbersack	0d65566db8	Add a quick hack to ensure that bus_dmamem_alloc properly aligns small allocations with large alignment requirements. Add a panic to detect cases where we've still failed to properly align.	2006-05-28 18:30:36 +00:00
David Xu	4f56cbcbd5	Clear high 16 bits of mxcsr register, according to Intel document, if the high 16 bits is non-zero, fxrstor instruction will generate GP fault, resulting kernel crash, this bug can be triggered by setcontext and ptrace(PT_SETXMMREGS).	2006-05-28 06:51:57 +00:00
David Xu	1db0da9e2b	PCB_NPXINITDONE is cleared by npx_fork_thread.	2006-05-28 04:47:56 +00:00
David Xu	40310f021d	If parent thread never used FPU, the only work is to clear flag PCB_NPXINITDONE for new thread and let trap code initialize it.	2006-05-28 04:40:45 +00:00
David Xu	38fd748725	When creating a new thread, inherit floating-point environment from current thread, this is required by POSIX pthread_create document.	2006-05-28 02:03:13 +00:00
Warner Losh	d708737568	APM was calling the suspend process from a timeout. This meant that other timeouts could not happen while suspending, including timeouts for things like msleep. This caused the system to hang on suspend when the cbb was enabled, since its suspend path powered down the socket which used a timeout to wait for it to be done. APM now creates a thread when it is enabled, and deletes the thread when it is disabled. This thread takes the place of the timeout by doing its polling every ~.9s. When the thread is disabled, it will wakeup early, otherwise it times out and polls the varius things the old timeout polled (APM events, suspend delays, etc). This makes my Sony VAIO 505TS suspend/resume correctly when APM is enabled (ACPI is black listed on my 505TS). This will likely fix other problems with the suspend path where drivers would sleep with msleep and/or do other timeouts. Maybe there's some special case code that would use DELAY while suspending and msleep otherwise that can be revisited and removed. This was also tested by glebius@, who pointed out that in the patch I sent him, I'd forgotten apm_saver.c MFC After: 3 weeks	2006-05-25 23:06:38 +00:00
Maxim Sobolev	aa1807d5d6	Move clock_lock prototype into <machine/clock.h>, where it is more appropriate. Discussed with: jhb	2006-05-19 18:53:50 +00:00
Marius Strobl	136eda1dc3	- Add C-bus and ISA front-ends for le(4) so it can actually replace lnc(4) on PC98 and i386. The ISA front-end supports the same non-PNP network cards as lnc(4) did and additionally a couple of PNP ones. Like lnc(4), the C-bus front-end of le(4) only supports C-NET(98)S and is untested due to lack of such hardware, but given that's it's based on the respective lnc(4) and not too different from the ISA front-end it should be highly likely to work. - Remove the descriptions of le(4), which where converted from lnc(4), from sys/i386/conf/NOTES and sys/pc98/conf/NOTES as there's a common one in sys/conf/NOTES.	2006-05-17 21:25:23 +00:00
Marius Strobl	dcaf1a3834	- As only the PCI front-end of le(4) is common to all platforms move its entry to the PCI NICs section so it's in the same spot in all GENERIC config files. - Add a note to the description of pcn(4) informing that is has precedence over le(4).	2006-05-17 20:44:01 +00:00
Poul-Henning Kamp	f6ce2a64f7	Send the pcvt(4) driver off to retirement.	2006-05-17 09:33:15 +00:00
Poul-Henning Kamp	c40da00ca3	Since DELAY() was moved, most <machine/clock.h> #includes have been unnecessary.	2006-05-16 14:37:58 +00:00
Ruslan Ermilov	155d9f6a98	Kill more references to lnc(4). Submitted by: grep(1)	2006-05-16 12:15:39 +00:00
Marius Strobl	055abe9af2	Remove some remnants of lnc(4).	2006-05-14 18:49:25 +00:00
George V. Neville-Neil	3f0c418ad9	Prefer the le device driver for Lance (AMD7990 et al) hardware over the older, and less capable lnc driver. Reviewed by: imp	2006-05-14 01:40:41 +00:00
Peter Wemm	374757c7cb	Test commit after repoman upgrade. Remove one of my many email addresses from a copyright message.	2006-05-12 22:41:58 +00:00
Peter Wemm	b02a3351e8	Test commit after repoman upgrade. Remove one of my many email addresses from a coyright message.	2006-05-12 22:38:53 +00:00
Nate Lawson	c404dfeae1	Add support for the VIA C7-M processor family. Remove an unnecessary check of the table's bus clock. CPUs that support this feature export only the high/low settings via the MSR, packed into 32 bits. Hardware from: Centaur Technologies MFC after: 1 week	2006-05-11 17:35:44 +00:00
Poul-Henning Kamp	5405ab4889	Clean out sysctl machdep.* related defines. The cmos clock related stuff should really be in MI code.	2006-05-11 17:29:25 +00:00
Alexander Leidinger	ba5bd0001c	regen (linux rt_sigpending)	2006-05-10 18:19:51 +00:00
Alexander Leidinger	17138b619c	Implement rt_sigpending in the linuxolator. PR: 92671 Submitted by: Markus Niemist"o <markus.niemisto@gmx.net>	2006-05-10 18:17:29 +00:00
Sam Leffler	145ebf44d2	make tinderbox happy: GENERIC got ath and wlan added so we need to now mark these "nodevice" or we'll get undefined references	2006-05-10 05:19:21 +00:00
Doug Ambrisko	32397ce071	Add in linsysfs. A linux 2.6 like sys filesystem to pacify the Linux LSI MegaRAID SAS utility. Sponsored by: IronPort Systems Man page help from: brueffer	2006-05-09 22:27:01 +00:00
Maxim Konovalov	053a042047	o Add acpi_ibm to the build. PR: kern/96940 Submitted by: Rong-En Fan	2006-05-07 20:13:18 +00:00
Doug Ambrisko	060e488247	Enhance the Linux emulation layer to make MegaRAID SAS managements tool happy. Add back in a scheme to emulate old type major/minor numbers via hooks into stat, linprocfs to return major/minors that Linux app's expect. Currently only /dev/null is always registered. Drivers can register via the Linux type shim similar to the ioctl shim but by using linux_device_register_handler/linux_device_unregister_handler functions. The structure is: struct linux_device_handler { char bsd_driver_name; char linux_driver_name; char bsd_device_name; char linux_device_name; int linux_major; int linux_minor; int linux_char_device; }; Linprocfs uses this to display the major number of the driver. The soon to be available linsysfs will use it to fill in the driver name. Linux_stat uses it to translate the major/minor into Linux type values. Note major numbers are dynamically assigned via passing in a -1 for the major number so we don't need to keep track of them. This is somewhat needed due to us switching to our devfs. MegaCli will not run until I add in the linsysfs and mfi Linux compat changes. Sponsored by: IronPort Systems	2006-05-05 16:10:45 +00:00
Sam Leffler	8e84cc6b22	add ath and wlan crypto support Requested by: many MFC after: 1 month	2006-05-03 18:13:11 +00:00
Scott Long	8d59dfff98	Allow bus_dmamap_load() to pass ENOMEM back to the caller. This puts it into conformance with the mbuf and uio load routines. ENOMEM can only happen with BUS_DMA_NOWAIT is passed in, thus the deferals are disabled. I don't like doing this, but fixing this fixes assumptions in other important drivers, which is a net benefit for now.	2006-05-03 04:14:17 +00:00
John Baldwin	2b8a339c7e	Add various constants for the PAT MSR and the PAT PTE and PDE flags. Initialize the PAT MSR during boot to map PAT type 2 to Write-Combining (WC) instead of Uncached (UC-). MFC after: 1 month	2006-05-01 22:07:00 +00:00
John Baldwin	4ac60df584	Add a new 'pmap_invalidate_cache()' to flush the CPU caches via the wbinvd() instruction. This includes a new IPI so that all CPU caches on all CPUs are flushed for the SMP case. MFC after: 1 month	2006-05-01 21:36:47 +00:00
Peter Wemm	ada5d7d5b0	Using an idea from Stephan Uphoff, use the empty pte's that correspond to the unused kva in the pv memory block to thread a freelist through. This allows us to free pages that used to be used for pv entry chunks since we can now track holes in the kva memory block. Idea from: ups	2006-05-01 21:22:38 +00:00
Peter Wemm	4c8eff70f8	Fix missing changes required for the amd64->i386 conversion. Add the missing VM_ALLOC_WIRED flags to vm_page_alloc() calls I added. Submitted by: alc	2006-05-01 19:57:00 +00:00
Marcel Moolenaar	64220a7e28	Rewrite of puc(4). Significant changes are: o Properly use rman(9) to manage resources. This eliminates the need to puc-specific hacks to rman. It also allows devinfo(8) to be used to find out the specific assignment of resources to serial/parallel ports. o Compress the PCI device "database" by optimizing for the common case and to use a procedural interface to handle the exceptions. The procedural interface also generalizes the need to setup the hardware (program chipsets, program clock frequencies). o Eliminate the need for PUC_FASTINTR. Serdev devices are fast by default and non-serdev devices are handled by the bus. o Use the serdev I/F to collect interrupt status and to handle interrupts across ports in priority order. o Sync the PCI device configuration to include devices found in NetBSD and not yet merged to FreeBSD. o Add support for Quatech 2, 4 and 8 port UARTs. o Add support for a couple dozen Timedia serial cards as found in Linux.	2006-04-28 21:21:53 +00:00
Peter Wemm	7eeda22793	Interim fix for pmap problems I introduced with my last commit. Remove the code to dyanmically change the pv_entry limits. Go back to a single fixed kva reservation for pv entries, like was done before when using the uma zone. Go back to never freeing pages back to the free pool after they are no longer used, just like before. This stops the lock order reversal due to aquiring the kernel map lock while pmap was locked. This fixes the recursive panic if invariants are enabled. The problem was that allocating/freeing kva causes vm_map_entry nodes to be allocated/freed. That can recurse back into pmap as new pages are hooked up to kvm and hence all the problem. Allocating/freeing kva indirectly allocate/frees memory. So, by going back to a single fixed size kva block and an index, we avoid the recursion panics and the LOR. The problem is that now with a linear block of kva, we have no mechanism to track holes once pages are freed. UMA has the same problem when using custom object for a zone and a fixed reservation of kva. Simple solutions like having a bitmap would work, but would be very inefficient when there are hundreds of thousands of bits in the map. A first-free pointer is similarly flawed because pages can be freed at random and the first-free pointer would be rewinding huge amounts. If we could allocate memory for tree strucures or an external freelist, that would work. Except we cannot allocate/free memory here because we cannot allocate/free address space to use it in. Anyway, my change here reverts back to the UMA behavior of not freeing pages for now, thereby avoiding holes in the map. ups@ had a truely evil idea that I'll investigate. It should allow freeing unused pages again by giving us a no-cost way to track the holes in the kva block. But in the meantime, this should get people booting with witness and/or invariants again. Footnote: amd64 doesn't have this problem because of the direct map access method. I'd done all my witness/invariants testing there. I'd never considered that the harmless-looking kmem_alloc/kmem_free calls would cause such a problem and it didn't show up on the boot test.	2006-04-28 19:05:08 +00:00
Maxim Sobolev	ab39543373	Unbreak pc98. Sorry...	2006-04-28 03:38:23 +00:00
Alan Cox	7dece6c7d9	In general, bits in the page directory entry (PDE) and the page table entry (PTE) have the same meaning. The exception to this rule is the eighth bit (0x080). It is the PS bit in a PDE and the PAT bit in a PTE. This change avoids the possibility that pmap_enter() confuses a PAT bit with a PS bit, avoiding a panic(). Eliminate a diagnostic printf() from the i386 pmap_enter() that serves no current purpose, i.e., I've seen no bug reports in the last two years that are helped by this printf(). Reviewed by: jhb	2006-04-27 21:26:25 +00:00
Scott Long	bd02c63f13	Add the rr232x driver to the default kernels.	2006-04-27 20:58:24 +00:00
Maxim Sobolev	897f1917a4	In the case when reset via keyboard controller doesn't work for some reason (i.e. no keyboard controller present), try two other common methods for resetting i386 machine - pci reset and port 0x92 fast reset. Only if neither works warn user and resort to "unmap entire address space and hope for good" hack. This makes my MacBook Pro rebooting just fine and should also help other legacy-free hardware out there. Also, disable interrupts unconditionally in cpu_reset_real(), since we don't want any interference. MFC after: 1 week	2006-04-27 05:18:26 +00:00
Xin LI	027ed650da	Fix build on i386	2006-04-27 05:02:21 +00:00
Peter Wemm	041a991fa7	MFamd64: shrink pv entries from 24 bytes to about 12 bytes. (336 pv entries per page = effectively 12.19 bytes per pv entry after overheads). Instead of using a shared UMA zone for 24 byte pv entries (two 8-byte tailq nodes, a 4 byte pointer, and a 4 byte address), we allocate a page at a time per process. This provides 336 pv entries per process (actually, per pmap address space) and eliminates one of the 8-byte tailq entries since we now can track per-process pv entries implicitly. The pointer to the pmap can be eliminated by doing address arithmetic to find the metadata on the page headers to find a single pointer shared by all 336 entries. There is an 11-int bitmap for the freelist of those 336 entries. This is mostly a mechanical conversion from amd64, except: * i386 has to allocate kvm and map the pages, amd64 has them outside of kvm * native word size is smaller, so bitmaps etc become 32 bit instead of 64 * no dump_add_page() etc stuff because they are in kvm always. * various pmap internals tweaks because pmap uses direct map on amd64 but on i386 it has to use sched_pin and temporary mappings. Also, sysctl vm.pmap.pv_entry_max and vm.pmap.shpgperproc are now dynamic sysctls. Like on amd64, i386 can now tune the pv entry limits without a recompile or reboot. This is important because of the following scenario. If you have a 1GB file (262144 pages) mmap()ed into 50 processes, that requires 13 million pv entries. At 24 bytes per pv entry, that is 314MB of ram and kvm, while at 12 bytes it is 157MB. A 157MB saving is significant. Test-run by: scottl (Thanks!)	2006-04-26 21:49:20 +00:00
Jung-uk Kim	daea0aad84	Check if reported HTT cores are physical cores. This commit does not affect AMD CPUs at all because HTT bit is disabled earlier. Intel multicore CPUs and ULE scheduler may be affected.	2006-04-25 00:06:37 +00:00
Jung-uk Kim	091c9b4961	Add another Intel CPU feature flag, xTPR (Send Task Priority Messages).	2006-04-24 22:56:57 +00:00
Jung-uk Kim	cf24d86bcc	Check if deterministic cache parameters leaf is valid before use.	2006-04-24 22:23:52 +00:00
Colin Percival	8b4553119e	Adjust dangerous-shared-cache-detection logic from "all shared data caches are dangerous" to "a shared L1 data cache is dangerous". This is a compromise between paranoia and performance: Unlike the L1 cache, nobody has publicly demonstrated a cryptographic side channel which exploits the L2 cache -- this is harder due to the larger size, lower bandwidth, and greater associativity -- and prohibiting shared L2 caches turns Intel Core Duo processors into Intel Core Solo processors. As before, the 'machdep.hyperthreading_allowed' sysctl will allow even the L1 data cache to be shared. Discussed with: jhb, scottl Security: See FreeBSD-SA-05:09.htt for background material.	2006-04-24 21:17:01 +00:00
Xin LI	3b28c0c6f9	Move AHC_REG_PRETTY_PRINT and AHD_REG_PRETTY_PRINT below their corresponding devices.	2006-04-24 08:44:34 +00:00
Peter Wemm	4503a06eef	Merge minidumps from amd64 where they were originally developed. Major differences: * since there is no direct map region, there is no custom uma memory allocator to modify to include its pages in the dumps. * Various data entries are reduced from 64 bit to 32 bit to match the native size. dump_add_page() and dump_drop_page() are still present in case one wants to arrange for arbitary pages to be dumped. This is of marginal use though because libkvm+kgdb cannot address physical memory that isn't mapped into kvm.	2006-04-21 04:28:43 +00:00
Warner Losh	99b0e15695	Set the rid of the resource we're about to return to the user.	2006-04-20 04:10:27 +00:00
Colin Percival	2652af563e	Correct a local information leakage bug affecting AMD FPUs. Security: FreeBSD-SA-06:14.fpu	2006-04-19 07:00:19 +00:00
Mitsuru IWASAKI	858a52f464	Import ACPI Dock Station support. Note that this is still very young. Additional detach implementaions (or maybe improvement) for other deivce drivers is required. Reviewed by: njl, imp MFC after: 1 week	2006-04-15 12:31:34 +00:00
Alan Cox	826c207263	Retire pmap_track_modified(). We no longer need it because we do not create managed mappings within the clean submap. To prevent regressions, add assertions blocking the creation of managed mappings within the clean submap. Reviewed by: tegge	2006-04-12 04:22:52 +00:00
Paul Saab	d8636a9ab7	Hook bce up to the build	2006-04-10 20:04:22 +00:00
John Baldwin	0f2be07217	- Don't set CR0_NE and CR0_MP in npx_probe() as they are already set earlier in cpu_setregs(). - If we know this CPU has a FPU via cpuid, then just assume the INT16 interface and make the npx device quiet to not clutter the dmesg. This is true for all Pentium and later CPUs and even some of the later 486dx CPUs. Reviewed by: bde Tested by: ps MFC after: 1 week	2006-04-06 17:17:45 +00:00
John Baldwin	907d4d7f45	Cache the value of the lower half of each I/O APIC redirection table entry so that we only have to do an ioapic_write() instead of an ioapic_read() followed by an ioapic_write() every time we mask and unmask level triggered interrupts. This cuts the execution time for these operations roughly in half. Profiled by: Paolo Pisati <p.pisati@oltrelinux.com> MFC after: 1 week	2006-04-05 20:43:19 +00:00
Joseph Koshy	64e3ca8f48	Freshen a comment. Reviewed by: jhb	2006-04-04 02:26:45 +00:00
Marcel Moolenaar	bfcdefd8aa	Eliminate HAVE_STOPPEDPCBS. On ia64 the PCPU holds a pointer to the PCB in which the context of stopped CPUs is stored. To access this PCB from KDB, we introduce a new define, called KDB_STOPPEDPCB. The definition, when present, lives in <machine/kdb.h> and abstracts where MD code saves the context. Define KDB_STOPPEDPCB on i386, amd64, alpha and sparc64 in accordance to previous code.	2006-04-03 22:51:47 +00:00
Peter Wemm	b9eee07e36	Remove the unused sva and eva arguments from pmap_remove_pages().	2006-04-03 21:16:10 +00:00
Alan Cox	9c6a71e4ca	Introduce pmap_try_insert_pv_entry(), a function that conditionally creates a pv entry if the number of entries is below the high water mark for pv entries. Use pmap_try_insert_pv_entry() in pmap_copy() instead of pmap_insert_entry(). This avoids possible recursion on a pmap lock in get_pv_entry(). Eliminate the explicit low-memory checks in pmap_copy(). The check that the number of pv entries was below the high water mark was largely ineffective because it was located in the outer loop rather than the inner loop where pv entries were allocated. Instead of checking, we attempt the allocation and handle the failure. Reviewed by: tegge Reported by: kris MFC after: 5 days	2006-04-02 05:45:05 +00:00
Maksim Yevmenkin	9216fccdd9	Add kbdmux(4) to GENERIC Requested by: scottl	2006-03-31 19:03:37 +00:00
Scott Long	7f631a410c	Hook the MFI driver up to the build.	2006-03-29 09:57:22 +00:00
Dag-Erling Smørgrav	6f0f8cca25	Use wrapper macros for atomic pointer operations in order to perform the correct casts. This should probably be merged to other architectures.	2006-03-28 14:34:48 +00:00
John Baldwin	8283c726e7	If the XSDT address in the RSDP for an ACPI 2.0 machine is NULL, then fall back to using the RSDT instead. ACPI-CA already follows this same strategy as a workaround for yet another instance of brain-damaged BIOS writers. PR: i386/93963 Submitted by: Masayuki FUKUI <fukui.FreeBSD@fanet.net>	2006-03-27 15:59:48 +00:00
Alan Cox	fa8053e9a9	Eliminate unnecessary invalidations of the entire TLB by pmap_remove(). Specifically, on mappings with PG_G set pmap_remove() not only performs the necessary per-page invlpg invalidations but also performs an unnecessary invalidation of the entire set of non-PG_G entries. Reviewed by: tegge	2006-03-21 18:07:42 +00:00
David Xu	39d3e6198d	Remove stale KSE code. Reviewed by: alc	2006-03-21 06:46:27 +00:00
John Baldwin	aef8cd01ed	Drop some unneeded casts since we program the kernel in C rather than C++.	2006-03-20 19:39:08 +00:00
Alexander Leidinger	c85625bfe7	regen	2006-03-18 20:49:01 +00:00
Alexander Leidinger	d4a3f5ddb6	Fixup some problems in my previous commit (COMPAT_43). Pointyhat to: netchild	2006-03-18 20:47:36 +00:00
Alexander Leidinger	1f7642e058	regen after COMPAT_43 removal	2006-03-18 18:24:38 +00:00
Alexander Leidinger	5c8919adf4	Get rid of the need of COMPAT_43 in the linuxolator. Submitted by: Divacky Roman <xdivac02@stud.fit.vutbr.cz> Obtained from: DragonFly (some parts)	2006-03-18 18:20:17 +00:00
John Baldwin	39092e79ed	Don't allow userland to set hardware watch points on kernel memory at all. Previously, we tried to allow this only for root. However, we were calling suser() on the target process rather than the current process. This means that if you can ptrace() a process running as root you can set a hardware watch point in the kernel. In practice I think you probably have to be root in order to pass the p_candebug() checks in ptrace() to attach to a process running as root anyway. Rather than fix the suser(), I just axed the entire idea, as I can't think of any good reason _at all_ for userland to set hardware watch points for KVM. MFC after: 3 days Also thinks hardware watch points on KVM from userland are bad: bde, rwatson	2006-03-14 16:13:55 +00:00
David Xu	90a693f891	It is not necessary to read %gs twice.	2006-03-10 05:55:26 +00:00
David Xu	fc643048fe	Fix stack offset to allow gcc's stack aligment code to work correctly. MFC after: 3 days	2006-03-10 02:54:45 +00:00
John Baldwin	8e8f0765ab	Flip the switch and don't route interrupts to hyperthreads in a HT system. In at least one benchmark this showed around a 20% performance increase. If other workloads do benefit from having hyperthreads service interrupts, we can always make this a loader tunable. MFC after: 3 days Tested by: ps	2006-03-09 16:38:52 +00:00
Poul-Henning Kamp	6acae67129	Improve the advantech watchdog.	2006-03-06 07:43:28 +00:00
Yaroslav Tykhiy	375ce6798f	Take the functionality contained in the former "options TDFX_LINUX" into a separate module. Accordingly, convert the option into a device named similarly. Note for MFC: Perhaps the option should stay in RELENG_6 for POLA reasons. Suggested by: scottl Reviewed by: cokane MFC after: 5 days	2006-03-03 21:37:38 +00:00
Alexander Leidinger	fb0a379774	- use a more common style to print memory sizes - add some more cache sizes (2nd and 3rd level) [1] Submitted by: HATANOU Tomomi <hatanou@infolab.ne.jp> [1] PR: 91328 [1]	2006-03-03 18:54:05 +00:00
Rink Springer	5fa7c51ff6	Committed the xbox syscons(8)-able console driver. Reviewed by: arch@ (no comments) Approved by: imp (mentor)	2006-03-03 14:52:57 +00:00
Scott Long	a7f12baaca	iir works on PAE now.	2006-03-03 04:30:18 +00:00
John Baldwin	215e7c161a	Rework how we wire up interrupt sources to CPUs: - Throw out all of the logical APIC ID stuff. The Intel docs are somewhat ambiguous, but it seems that the "flat" cluster model we are currently using is only supported on Pentium and P6 family CPUs. The other "hierarchy" cluster model that is supported on all Intel CPUs with local APICs is severely underdocumented. For example, it's not clear if the OS needs to glean the topology of the APIC hierarchy from somewhere (neither ACPI nor MP Table include it) and setup the logical clusters based on the physical hierarchy or not. Not only that, but on certain Intel chipsets, even though there were 4 CPUs in a logical cluster, all the interrupts were only sent to one CPU anyway. - We now bind interrupts to individual CPUs using physical addressing via the local APIC IDs. This code has also moved out of the ioapic PIC driver and into the common interrupt source code so that it can be shared with MSI interrupt sources since MSI is addressed to APICs the same way that I/O APIC pins are. - Interrupt source classes grow a new method pic_assign_cpu() to bind an interrupt source to a specific local APIC ID. - The SMP code now tells the interrupt code which CPUs are avaiable to handle interrupts in a simpler and more intuitive manner. For one thing, it means we could now choose to not route interrupts to HT cores if we wanted to (this code is currently in place in fact, but under an #if 0 for now). - For now we simply do static round-robin of IRQs to CPUs when the first interrupt handler just as before, with the change that IRQs are now bound to individual CPUs rather than groups of up to 4 CPUs. - Because the IRQ to CPU mapping has now been moved up a layer, it would be easier to manage this mapping from higher levels. For example, we could allow drivers to specify a CPU affinity map for their interrupts, or we could allow a userland tool to bind IRQs to specific CPUs. The MFC is tentative, but I want to see if this fixes problems some folks had with UP APIC kernels on 6.0 on SMP machines (an SMP kernel would work fine, but a UP APIC kernel (such as GENERIC in RELENG_6) would lose interrupts). MFC after: 1 week	2006-02-28 22:24:55 +00:00
Colin Percival	69084095dc	Add frequency-voltage tables for Intel 778, 758, 773, 753, and 733J processors. Obtained from: Intel Datasheet 302189-008	2006-02-25 04:55:38 +00:00
Sam Leffler	3f676959ae	guard function decls with _KERNEL so user code can include this file MFC after: 1 week	2006-02-22 21:38:33 +00:00
John Baldwin	06ad42b2f7	Close some races between procfs/ptrace and exit(2): - Reorder the events in exit(2) slightly so that we trigger the S_EXIT stop event earlier. After we have signalled that, we set P_WEXIT and then wait for any processes with a hold on the vmspace via PHOLD to release it. PHOLD now KASSERT()'s that P_WEXIT is clear when it is invoked, and PRELE now does a wakeup if P_WEXIT is set and p_lock drops to zero. - Change proc_rwmem() to require that the processing read from has its vmspace held via PHOLD by the caller and get rid of all the junk to screw around with the vmspace reference count as we no longer need it. - In ptrace() and pseudofs(), treat a process with P_WEXIT set as if it doesn't exist. - Only do one PHOLD in kern_ptrace() now, and do it earlier so it covers FIX_SSTEP() (since on alpha at least this can end up calling proc_rwmem() to clear an earlier single-step simualted via a breakpoint). We only do one to avoid races. Also, by making the EINVAL error for unknown requests be part of the default: case in the switch, the various switch cases can now just break out to return which removes a _lot_ of duplicated PRELE and proc unlocks, etc. Also, it fixes at least one bug where a LWP ptrace command could return EINVAL with the proc lock still held. - Changed the locking for ptrace_single_step(), ptrace_set_pc(), and ptrace_clear_single_step() to always be called with the proc lock held (it was a mixed bag previously). Alpha and arm have to drop the lock while the mess around with breakpoints, but other archs avoid extra lock release/acquires in ptrace(). I did have to fix a couple of other consumers in kern_kse and a few other places to hold the proc lock and PHOLD. Tested by: ps (1 mostly, but some bits of 2-4 as well) MFC after: 1 week	2006-02-22 18:57:50 +00:00
Tor Egge	6bd7e81d83	Rounding addr upwards to next 4M or 2M boundary in pmap_growkernel() could cause addr to become 0, resulting in an early return without populating the last PDE. Reviewed by: alc	2006-02-16 22:10:57 +00:00
David Malone	0cbae93607	It seems bit 5 of cpu_feature2 is the VMX (Virtual Machine Extensions) bit. While I'm here, delete a comment that was cut and past from the cpu_features code that doesn't belong here.	2006-02-15 14:48:59 +00:00
Poul-Henning Kamp	e8444a7e6f	CPU time accounting speedup (step 2) Keep accounting time (in per-cpu) cputicks and the statistics counts in the thread and summarize into struct proc when at context switch. Don't reach across CPUs in calcru(). Add code to calibrate the top speed of cpu_tickrate() for variable cpu_tick hardware (like TSC on power managed machines). Don't enforce monotonicity (at least for now) in calcru. While the calibrated cpu_tickrate ramps up it may not be true. Use 27MHz counter on i386/Geode. Use TSC on amd64 & i386 if present. Use tick counter on sparc64	2006-02-11 09:33:07 +00:00
Rink Springer	424d9b482d	Cleaned the memory initialization up, moved some defines from the framebuffer to an include file. Reviewed by: imp Approved by: imp (mentor)	2006-02-10 18:48:22 +00:00
Yaroslav Tykhiy	84d8f1b027	Avoid calling CPUID function 0x02 if the CPU reports no support for it. The former code used to hang older Intel CPUs by trying to get non-existent TLB info 2^32 times. Reduce code duplication around the calls to CPUID 0x02 by using do-while loops. PR: i386/92977 Tested by: cy	2006-02-09 09:10:54 +00:00
Poul-Henning Kamp	eb2da9a51f	Simplify system time accounting for profiling. Rename struct thread's td_sticks to td_pticks, we will need the other name for more appropriately named use shortly. Reduce it from uint64_t to u_int. Clear td_pticks whenever we enter the kernel instead of recording its value as reference for userret(). Use the absolute value of td->pticks in userret() and eliminate third argument.	2006-02-08 08:09:17 +00:00
Poul-Henning Kamp	5b1a8eb397	Modify the way we account for CPU time spent (step 1) Keep track of time spent by the cpu in various contexts in units of "cputicks" and scale to real-world microsec^H^H^H^H^H^H^H^Hclock_t only when somebody wants to inspect the numbers. For now "cputicks" are still derived from the current timecounter and therefore things should by definition remain sensible also on SMP machines. (The main reason for this first milestone commit is to verify that hypothesis.) On slower machines, the avoided multiplications to normalize timestams at every context switch, comes out as a 5-7% better score on the unixbench/context1 microbenchmark. On more modern hardware no change in performance is seen.	2006-02-07 21:22:02 +00:00
Robert Watson	ce41b52994	Regenerate.	2006-02-06 22:15:00 +00:00
Robert Watson	68b21daa64	Assign audit event identifiers to ibcs2 system calls. Obtained from: TrustedBSD Project	2006-02-06 22:14:50 +00:00
John Baldwin	8917b8d28c	- Always call exec_free_args() in kern_execve() instead of doing it in all the callers if the exec either succeeds or fails early. - Move the code to call exit1() if the exec fails after the vmspace is gone to the bottom of kern_execve() to cut down on some code duplication.	2006-02-06 22:06:54 +00:00
John Baldwin	809f984b21	Add a kern_eaccess() function and use it to implement xenix_eaccess() rather than kern_access(). Suggested by: rwatson	2006-02-06 22:00:53 +00:00
Robert Watson	3f4b50a482	Regenerate.	2006-02-06 01:40:48 +00:00
Robert Watson	35d982a761	Assign audit event identifiers to Linux i386 system calls. Obtained from: TrustedBSD Project	2006-02-06 01:40:30 +00:00
Robert Watson	570d840b52	Regenerate.	2006-02-05 23:28:46 +00:00
Robert Watson	b11c14d6e9	Assign audit event identfiers to Xenix system calls. Note: AUE_EACCESS is assigned to xenix_eaccess() instead of AUE_ACCESS, as that is the intended meaning of the system call. xenix_eaccess() should be reimplemented using our native eaccess() implementation so that it works as intended. Obtained from: TrustedBSD Project	2006-02-05 23:28:01 +00:00
Robert Watson	ad3a6a7d36	Correct help line: list targets, not names of files generated by targets when no argument is provided to make. MFC after: 1 week	2006-02-05 23:25:19 +00:00
Robert Watson	4334cc4f81	Regenerate (accidentally also committed in commit that updated syscalls.isc).	2006-02-05 23:16:20 +00:00
Robert Watson	60be92dd8d	Assign audit event identifiers to ibcs2 ISC system calls. Obtained from: TrustedBSD Project	2006-02-05 23:15:22 +00:00
Ken Smith	4d286e9426	Move asr driver from global NOTES to i386-specific NOTES. Requestor reports it is neither endian-clean or 64-bit clean. :-) Requested by: scottl	2006-02-05 05:06:04 +00:00
Wayne Salamon	c62ccf20b2	Hook up the audit system to system call entry and exit. System calls will now be audited. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-02-04 14:11:33 +00:00
Rink Springer	aaadec7cac	Patch to allow XBox-users to use the onboard nve(4) nForce ethernet driver. The patch crudely forces the NIC out of operating mode before the nve(4) driver can initialize it; this is required to properly initialize the NIC. It is XBox-specific, as this condition can only occur on XBoxes (Most loaders will simply leave the NIC running, forcing us to use a crude workaround like this to get it in a workable condition). Due to the XBox-only aspect, this has been solved in XBox-specific initialization code and not within nve(4). Reviewed by: imp Approved by: imp (mentor) No objection: bz@, obrien@, q@ontheweb.com.au	2006-02-04 10:01:33 +00:00
David Xu	97794f4eb3	Clear carry flag in get_mcontext so that setcontext does not return a bogus error. PR: misc/92110	2006-02-03 02:33:01 +00:00
David Xu	165ba65d7b	Under verbose mode, correctly report L2 cache information for CPU which supports CPUID function 8000_0006h. Tested on: Pentum-M 750	2006-02-02 12:44:09 +00:00
David Xu	ec4275eefa	Fix bug in L2 cache size detection code for CPU which supports CPUID function 8000_0006h. Tested on: Pentum-M 750	2006-02-02 11:54:40 +00:00
David Xu	7aa9d01bf4	Correctly report L2 cache size according to its code comment. Tested on my Dual PIII machine.	2006-02-02 06:35:50 +00:00
Roman Kurakin	f2ca64ca71	Attach ce(4) to the build. MFC after: 3 days	2006-01-31 23:11:35 +00:00
Roman Kurakin	8edb110aa3	Prepare for sconfig(8) update. Change also my e-mail.	2006-01-30 13:34:57 +00:00
John Baldwin	6966c33482	Call WITNESS_CHECK() in the page fault handler and immediately assume it is a fatal fault if we are holding any non-sleepable locks. This should cut down on the number of bogus LORs we currently get when the kernel panics due to a NULL (or bogus) pointer dereference that goes wandering off into the VM system which tries to acquire locks and then kicks off the spurious LORs. This should probably be ported to all the archs at some point. Tested on: i386	2006-01-27 22:22:10 +00:00
Stephan Uphoff	3753c2c3f1	Fix race conditions. Tested by: kris@ MFC after: 3 days	2006-01-23 15:46:09 +00:00
Marius Strobl	a5f92d5a54	Remove the commented out entry of the old ISA-only le(4) driver which was retired 22 months ago. MFC after: 1 day	2006-01-21 12:38:35 +00:00
David Xu	74bcd9d280	Eliminate a stale instruction introduced in revision 1.136.	2006-01-18 06:42:42 +00:00
Scott Long	0af57729a6	Free the newtag if we exit with a failure from alloc_bounce_zone(). Found by: Coverity Prevent(tm)	2006-01-14 17:22:47 +00:00
Poul-Henning Kamp	d3e64681d6	Move the old BSD4.3 tty compatibility from (!BURN_BRIDGES && COMPAT_43) to COMPAT_43TTY. Add COMPAT_43TTY to NOTES and */conf/GENERIC Compile tty_compat.c only under the new option. Spit out #warning "Old BSD tty API used, please upgrade." if ioctl_compat.h gets #included from userland.	2006-01-10 09:19:10 +00:00
Warner Losh	d5e61c97a6	By popular demand, move __HAVE_ACPI and __PCI_REROUTE_INTERRUPT into param.h. Per request, I've placed these just after the _NO_NAMESPACE_POLLUTION ifndef. I've not renamed anything yet, but may since we don't need the __. Submitted by: bde, jhb, scottl, many others.	2006-01-09 06:05:57 +00:00
John Baldwin	04dda605c5	- Make pcib_devclass private to sys/dev/pci/pci_pci.c and change all the various pcib drivers to use their own private devclass_t variables for their modules. - Use the DEFINE_CLASS_0() macro to declare drivers for the various pcib drivers while I'm here.	2006-01-06 19:22:19 +00:00
John Baldwin	360c3c2d1a	Fix various places that were testing td_critnest to see if interrupts should remain disabled during a trap or not to check td_md.md_spinlock_count instead.	2006-01-06 18:02:12 +00:00
Alexander Leidinger	e21a562e99	We don't support I386_CPU in 6.0 and later. This file can be cleaned up some to assume that '#if defined(I486_CPU) \|\| defined(I586_CPU) \|\| defined(I686_CPU)' is true. Suggested by: jhb Reviewed by: jhb	2006-01-04 20:11:04 +00:00
Alexander Leidinger	d5231b4a7f	- Make sure the cpu_exthigh variable is initialized (page coloring case). [1] - Remove a conditional in the AMD cache detection, it's always false. [2] - Don't try to detect a cache if only compiled for i386. Analyzed by: Antoine Brodin <antoine.brodin@laposte.net> [1] Submitted by: Antoine Brodin <antoine.brodin@laposte.net> [2]	2006-01-04 12:57:02 +00:00
Poul-Henning Kamp	8c92c2096d	Use ttyalloc() instead of ttymalloc()	2006-01-04 09:46:20 +00:00
John Baldwin	ab2c368640	Fix a couple of issues with the ibcs2 module event handler. First, return success instead of EOPNOTSUPP when being loaded. Secondly, if there are no ibcs2 processes running when a MOD_UNLOAD request is made, break out to return success instead of falling through into the default case which returns EOPNOTSUPP. With these fixes, I can now kldload and subsequently kldunload the ibcs2 module. PR: kern/82026 (and several duplicates) Reported by: lots of folks MFC after: 1 week	2006-01-03 20:39:38 +00:00
Jung-uk Kim	dccb7faff6	- Explicitly validate an empty filter to match bpf_filter() comment[1]. - Do not use BPF JIT compiler for an empty filter. [1] Pointed out by: darrenr	2006-01-03 20:26:03 +00:00
Warner Losh	501755f4f6	Define __HAVE_ACPI and/or __PCI_REROUTE_INTERRUPT, as appropriate for each platform. These will be used in the pci code in preference to the complicated #ifdefs we have there now.	2006-01-01 20:59:28 +00:00
Alexander Leidinger	ef39c05baa	MI changes: - provide an interface (macros) to the page coloring part of the VM system, this allows to try different coloring algorithms without the need to touch every file [1] - make the page queue tuning values readable: sysctl vm.stats.pagequeue - autotuning of the page coloring values based upon the cache size instead of options in the kernel config (disabling of the page coloring as a kernel option is still possible) MD changes: - detection of the cache size: only IA32 and AMD64 (untested) contains cache size detection code, every other arch just comes with a dummy function (this results in the use of default values like it was the case without the autotuning of the page coloring) - print some more info on Intel CPU's (like we do on AMD and Transmeta CPU's) Note to AMD owners (IA32 and AMD64): please run "sysctl vm.stats.pagequeue" and report if the cache* values are zero (= bug in the cache detection code) or not. Based upon work by: Chad David <davidc@acns.ab.ca> [1] Reviewed by: alc, arch (in 2004) Discussed with: alc, Chad David, arch (in 2004)	2005-12-31 14:39:20 +00:00
David Xu	f71ba3d4a7	Remove pcb_switchout, it has not been used for a long time.	2005-12-29 13:23:48 +00:00
Maxim Sobolev	900b28f9f6	Remove kern.elf32.can_exec_dyn sysctl. Instead extend Brandinfo structure with flags bitfield and set BI_CAN_EXEC_DYN flag for all brands that usually allow executing elf dynamic binaries (aka shared libraries). When it is requested to execute ET_DYN elf image check if this flag is on after we know the elf brand allowing execution if so. PR: kern/87615 Submitted by: Marcin Koziej <creep@desk.pl>	2005-12-26 21:23:57 +00:00
David Xu	1bfa910843	Move global variable private_tss into per-cpu area. Reviewed by: jhb	2005-12-26 00:07:19 +00:00
Jeff Roberson	660002d398	- Improve the INKERNEL macro such that it can no longer give false positives. This fixes the stack(9) functionality. Submitted by: Antoine Brodin <antoine.brodin@laposte.net>	2005-12-23 21:33:55 +00:00
John Baldwin	b439e431bf	Tweak how the MD code calls the fooclock() methods some. Instead of passing a pointer to an opaque clockframe structure and requiring the MD code to supply CLKF_FOO() macros to extract needed values out of the opaque structure, just pass the needed values directly. In practice this means passing the pair (usermode, pc) to hardclock() and profclock() and passing the boolean (usermode) to hardclock_cpu() and hardclock_process(). Other details: - Axe clockframe and CLKF_FOO() macros on all architectures. Basically, all the archs were taking a trapframe and converting it into a clockframe one way or another. Now they can just extract the PC and usermode values directly out of the trapframe and pass it to fooclock(). - Renamed hardclock_process() to hardclock_cpu() as the latter is more accurate. - On Alpha, we now run profclock() at hz (profhz == hz) rather than at the slower stathz. - On Alpha, for the TurboLaser machines that don't have an 8254 timecounter, call hardclock() directly. This removes an extra conditional check from every clock interrupt on Alpha on the BSP. There is probably room for even further pruning here by changing Alpha to use the simplified timecounter we use on x86 with the lapic timer since we don't get interrupts from the 8254 on Alpha anyway. - On x86, clkintr() shouldn't ever be called now unless using_lapic_timer is false, so add a KASSERT() to that affect and remove a condition to slightly optimize the non-lapic case. - Change prototypeof arm_handler_execute() so that it's first arg is a trapframe pointer rather than a void pointer for clarity. - Use KCOUNT macro in profclock() to lookup the kernel profiling bucket. Tested on: alpha, amd64, arm, i386, ia64, sparc64 Reviewed by: bde (mostly)	2005-12-22 22:16:09 +00:00
Warner Losh	b0493a9984	Move device 'cs' into i386/pc98 specific NOTES. It is broken on ppc because it uses i386 specific calls. Maybe it could be added to amd64, but I'm not so sure it would work there so I've not added it there.	2005-12-20 23:00:11 +00:00
John Baldwin	5b2119223e	Move the hostb driver out of the i386 and amd64 PCI code (where it was duplicated anyways) and into a single MI driver. Extend the driver a bit to implement the bus and PCI kobj interfaces such that other drivers can attach to it and transparently act as if their parent device is the PCI bus (for the most part).	2005-12-20 21:09:45 +00:00
John Baldwin	410d857972	Remove linux_mib_destroy() (which I actually added in between 5.0 and 5.1) which existed to cleanup the linux_osname mutex. Now that MTX_SYSINIT() has grown a SYSUNINIT to destroy mutexes on unload, the extra destroy here was redundant and resulted in panics in debug kernels. MFC after: 1 week Reported by: Goran Gajic ggajic at afrodita dot rcub dot bg dot ac dot yu	2005-12-15 16:30:41 +00:00
John Baldwin	05ee80c796	Fix stale comment.	2005-12-14 21:47:02 +00:00
Peter Wemm	737429bc96	MFamd64 rev 1.223: Use the TSC to implement DELAY() if not marked broken and it has been calibrated.	2005-12-13 19:08:55 +00:00
John Baldwin	e83f6bcb75	Revert previous commit. The BIOS braindamage is even worse than I originally thought. The BIOS that cleared CPUID_APIC actually managed to disable the local APIC entirely and even Windows 64 doesn't boot on it. Reported by: bz	2005-12-13 18:29:10 +00:00
John Baldwin	15b7edbeaa	Don't check the CPUID_APIC bit in the cpu_features flags field to determine if the boot CPU has a local APIC because some BIOS vendors are not competent enough to set this bit. Instead, just assume that we always have a local APIC on amd64. For i386 the check is a bit more subtle. FreeBSD requires either an MP Table or an ACPI MADT table to enumerate APICs. The only systems that have one of those tables that don't have local APICs are some presumably rare (and old) SMP 486 systems using external APICs. Thus, instead of checking the CPUID_APIC flag, check the CPU class and abort if we are running on a 486. MFC after: 1 week Reported by: bz	2005-12-13 15:09:40 +00:00
Craig Rodrigues	16f99fe169	Add support for 7320 and 915 PCIe chipsets. Submitted by: Gavin Atkinson <gavin.atkinson at ury dot york dot ac dot uk> PR: kern/79139 Reviewed by: scottl	2005-12-08 18:55:15 +00:00
John Baldwin	47d65ba4d0	Whitespace: reduce diffs with amd64.	2005-12-08 18:33:48 +00:00
John Baldwin	696effb697	- Cleanup whitespace and extra ()s in vtophys() macros. - Move vtophys() macros next to vtopte() where vtopte() exists to match comments above vtopte(). - Remove references to the alternate address space in the comment above vtopte(). amd64 never had the alternate address space, and i386 lost it prior to PAE support being added. - s/entires/entries/ in comments. Reviewed by: alc	2005-12-06 21:09:01 +00:00
Jung-uk Kim	50c9fad9ce	Fix ZERO_EDX() macro from the previous commit. It was emitting `xor %ecx, %ecx', not `xor %edx, %edx'.	2005-12-06 20:11:07 +00:00
Ruslan Ermilov	224d140293	Drop _MACHINE_ARCH and _MACHINE defines (not to be confused with MACHINE_ARCH and MACHINE). Their purpose was to be able to test in cpp(1), but cpp(1) only understands integer type expressions. Using such unsupported expressions introduced a number of subtle bugs, which were discovered by compiling with -Wundef.	2005-12-06 13:27:21 +00:00
Jung-uk Kim	6a96c4832f	s/M_WAITOK/M_NOWAIT/ while mutex is held. Pointed out by: csjp	2005-12-06 07:22:01 +00:00
Jung-uk Kim	23a8fc28c2	- Micro-optimize `mov $0, %edx' ->` xor %edx, %edx'. - Correct amd64 macro style (no functional change).	2005-12-06 06:45:39 +00:00
Jung-uk Kim	ae275efcae	Add experimental BPF Just-In-Time compiler for amd64 and i386. Use the following kernel configuration option to enable: options BPF_JITTER If you want to use bpf_filter() instead (e. g., debugging), do: sysctl net.bpf.jitter.enable=0 to turn it off. Currently BIOCSETWF and bpf_mtap2() are unsupported, and bpf_mtap() is partially supported because 1) no need, 2) avoid expensive m_copydata(9). Obtained from: WinPcap 3.1 (for i386)	2005-12-06 02:58:12 +00:00
John Baldwin	2dce95a085	Change the i386 code to pass the interrupt vector as a separate argument rather than embedding it in the intrframe as if_vec. This reduces diffs with amd64 somewhat. - Remove cf_vec from clockframe (it wasn't used anyway) and stop pushing dummy vector arguments for ipi_bitmap_handler() and lapic_handle_timer() since clockframe == trapframe now. - Fix ddb to handle stack traces across interrupt entry points that just have a trapframe on their stack and not a trapframe + vector. - Change intr_execute_handlers() to take a trapframe rather than an intrframe pointer. - Change lapic_handle_intr() and atpic_handle_intr() to take a vector and trapframe rather than an intrframe. - GC struct intrframe now that nothing uses it anymore. - GC CLOCK_TO_TRAPFRAME() and INTR_TO_TRAPFRAME(). Reviewed by: bde Requested by: peter	2005-12-05 22:39:09 +00:00
John Baldwin	f0b9813920	- Move the code to deal with handling an IPI_STOP IPI out of ipi_nmi_handler() and into a new cpustop_handler() function. Change the Xcpustop IPI_STOP handler to call this function instead of duplicating all the same logic in assembly. - EOI the local APIC for the lapic timer interrupt in C rather than assembly. - Bump the lazypmap IPI counter if COUNT_IPIS is defined in C rather than assembly.	2005-12-05 22:25:41 +00:00
John Baldwin	ac7326e338	Don't panic if IRQ 13 doesn't exist. On some machines (see previous commit to atpic.c) there may not be an IRQ 13. Instead, just keep going. If the INT16 interface doesn't work then we will eventually panic anyway. FWIW: We could probably just axe the support for IRQ 13 altogether at this point. The only thing we'd lose support for are 486sx systems with external 487 FPUs. MFC after: 1 week	2005-12-05 22:11:44 +00:00
John Baldwin	5ae84c09e7	Really slam the door on mixed mode now that we don't depend on it for a working IRQ0 with APIC anymore. Previously, it was possible to have some other ATPIC IRQS "leak" through in a few edge cases. For example, on my x86 test machine, ACPI re-routes the SCI (IRQ 9) to intpin 13 on the first I/O APIC. This leaves a hole for IRQ 13 (since the APIC doesn't provide a source for IRQ 13 in that case) with the result that the ATPIC IRQ13 source was registered instead. This changes the 8259A drivers to only register their interrupt sources if none of the 16 ISA IRQs have an interrupt source already installed. MFC after: 1 week	2005-12-05 22:09:30 +00:00
John Baldwin	48c8cbcb82	- Move PUSH_FRAME and POP_FRAME into machine/asmacros.h. - Add a new SET_KERNEL_SREGS macro that sets up %ds and %es to point to kernel data and %fs to point to per-CPU data and use the new macro in several kernel entry points including trap and interrupt handlers. - Convert the IPI_STOP handler Xcpustop to push a standard trap frame rather than an application frame. - Make the TRAP() macro private to exception.s since it is only used there. - Move the PCPU_*() macros in asmacros.h out of the middle of the profiling macros. Reviewed by: bde Requested by: bde (4, 5)	2005-12-05 21:44:47 +00:00
Ruslan Ermilov	8c7a62275a	Prepare for MACHINE and hw.machine switching to "pc98" on FreeBSD/pc98. Reviewed by: nyan	2005-12-05 14:22:12 +00:00
Ruslan Ermilov	342ed5d948	Fix -Wundef warnings found when compiling i386 LINT, GENERIC and custom kernels.	2005-12-05 11:58:35 +00:00
Eric Anholt	69b9fffc84	Merge DRM CVS as of 2005-12-02, adding i915 DRM support thanks to Alexey Popov, and a new r300 PCI ID.	2005-12-03 01:23:50 +00:00
Eric Anholt	9fb0767374	Update DRM to CVS snapshot as of 2005-11-28. Notable changes: - S3 Savage driver ported. - Added support for ATI_fragment_shader registers for r200. - Improved r300 support, needed for latest r300 DRI driver. - (possibly) r300 PCIE support, needs X.Org server from CVS. - Added support for PCI Matrox cards. - Software fallbacks fixed for Rage 128, which used to render badly or hang. - Some issues reported by WITNESS are fixed. - i915 module Makefile added, as the driver may now be working, but is untested. - Added scripts for copying and preprocessing DRM CVS for inclusion in the kernel. Thanks to Daniel Stone for getting me started on that.	2005-11-28 23:13:57 +00:00
John Baldwin	d6ef938e56	If we get a stray interrupt, return after logging it. In the extremely rare case of a stray interrupt to an unregistered source (such as a stray interrupt from the 8259As when using APIC), this could result in a page fault when it tried to walk the list of interrupt handlers to execute INTR_FAST handlers. This bug was introduced with the intr_event changes, so it's not present in 5.x or 6.x. Submitted by: Mark Tinguely tinguely at casselton dot net	2005-11-28 20:18:43 +00:00
Ruslan Ermilov	6646524f34	- Allow duplicate "machine" directives with the same arguments. - Move existing "machine" directives to DEFAULTS.	2005-11-27 23:17:00 +00:00
Lukas Ertl	ae5a74ec72	Fix typo.	2005-11-24 15:28:32 +00:00
Ruslan Ermilov	1a581012df	Add missing "struct" in i386/i386/machdep.c,v 1.497 by deischen@.	2005-11-24 08:16:18 +00:00
John Baldwin	1dab802e37	Garbage collect machine/smptests.h now that it is empty and no longer used.	2005-11-22 22:55:48 +00:00
John Baldwin	c21ba8d166	Make COUNT_IPIS and COUNT_XINVLTLB_HITS real kernel options and take them out of machine/smptests.h.	2005-11-22 22:54:42 +00:00
John Baldwin	e36e973da9	Garbage collect unused {VERBOSE_,}CPUSTOP_ON_DDBBREAK macros.	2005-11-22 22:37:13 +00:00
John Baldwin	0a17b197d3	Garbage collect the code to store diagnostics codes in a CMOS register during SMP startup. We haven't had any issues with starting up the APs on i386 in quite a while now which is all this code is really useful for. If someone ever does really need it they can always dig it up out of the attic.	2005-11-22 22:34:14 +00:00
John Baldwin	7417e80b4e	Don't enable PUC_FASTINTR by default in the source. Instead, enable it via the DEFAULTS kernel configs. This allows folks to turn it that option off in the kernel configs if desired without having to hack the source. This is especially useful since PUC_FASTINTR hangs the kernel boot on my ultra60 which has two uart(4) devices hung off of a puc(4) device. I did not enable PUC_FASTINTR by default on powerpc since powerpc does not currently allow sharing of INTR_FAST with non-INTR_FAST like the other archs.	2005-11-21 20:22:35 +00:00
Alan Cox	97a0c226d6	Eliminate pmap_init2(). It's no longer used.	2005-11-20 06:09:49 +00:00
John Baldwin	7d0a7ec90c	- Always print the trap number so that we have something to start with for mystery traps. If we don't have a message for a given trap, just use UNKNOWN for the message. - Add trap messages for T_XMMFLT and T_RESERVED. MFC after: 1 week	2005-11-18 19:26:46 +00:00
David E. O'Brien	5ab591d4d9	Fix spelling mistake. Submitted by: kris	2005-11-17 02:32:39 +00:00
John Baldwin	db477d6cc8	Revert a part of the previous commits to these files that made the NMI IPI_STOP handling code use atomic_readandclear() to execute the restart function on the first CPU to resume and restore the behavior of always executing the restart function on the BSP since this is in fact what the non-NMI IPI_STOP handler does. I did add back in a statement to clear the restart function pointer after it is executed to match the behavior of the non-NMI IPI_STOP handler.	2005-11-16 20:58:40 +00:00
John Baldwin	fdb9ce3716	Revert previous commit to these files. There isn't a race necessitating an xchg instruction as we only try to execute the startup function if the CPU ID is 0 (i.e. the BSP). I missed this earlier.	2005-11-16 20:55:57 +00:00
John Baldwin	b60119eb02	Fix a typo in the check for an invalid APIC. If we are told about an I/O APIC that doesn't exist, then a read of the version register is going to return -1 which is 0xffffffff not 0xffffff. Tested on: i386 Tested by: Nikos Ntarmos ntarmos at ceid dot upatras dot gr MFC after: 1 week	2005-11-16 20:29:29 +00:00
Andre Oppermann	da12fc2370	Provide a link to the documentation of the I/O APIC at Intel.	2005-11-15 20:18:13 +00:00
Warner Losh	421552a580	Provide a dummy NO_XBOX option that lives in opt_xbox.h for pc98. This allows us to eliminate a three ifdef PC98 instances.	2005-11-14 00:43:44 +00:00
Warner Losh	2eb06f7ccb	Add xbox associated options/devices to LINT. Submitted by: Rink P.W. Springer	2005-11-14 00:20:27 +00:00
Alan Cox	65336314cf	In get_pv_entry() use PMAP_LOCK() instead of PMAP_TRYLOCK() when deadlock cannot possibly occur.	2005-11-13 02:17:05 +00:00
Ruslan Ermilov	6d8200ff0c	Add /dev/speaker support to amd64. The following repo-copies were made (by Mark Murray): sys/i386/isa/spkr.c -> sys/dev/speaker/spkr.c sys/i386/include/speaker.h -> sys/dev/speaker/speaker.h share/man/man4/man4.i386/spkr.4 -> share/man/man4/spkr.4	2005-11-11 09:57:32 +00:00
Yoshihiro Takahashi	1ba0023e33	Fix pc98 build.	2005-11-09 12:22:26 +00:00
Alan Cox	7a35a21e7b	Reimplement the reclamation of PV entries. Specifically, perform reclamation synchronously from get_pv_entry() instead of asynchronously as part of the page daemon. Additionally, limit the reclamation to inactive pages unless allocation from the PV entry zone or reclamation from the inactive queue fails. Previously, reclamation destroyed mappings to both inactive and active pages. get_pv_entry() still, however, wakes up the page daemon when reclamation occurs. The reason being that the page daemon may move some pages from the active queue to the inactive queue, making some new pages available to future reclamations. Print the "reclaiming PV entries" message at most once per minute, but don't stop printing it after the fifth time. This way, we do not give the impression that the problem has gone away. Reviewed by: tegge	2005-11-09 08:19:21 +00:00
Warner Losh	43ce714039	Remove obsolete options	2005-11-09 04:30:55 +00:00
Warner Losh	51ef421d92	Add support for XBOX to the FreeBSD port. The xbox architecture is nearly identical to wintel/ia32, with a couple of tweaks. Since it is so similar to ia32, it is optionally added to a i386 kernel. This port is preliminary, but seems to work well. Further improvements will improve the interaction with syscons(4), port Linux nforce driver and future versions of the xbox. This supports the 64MB and 128MB boxes. You'll need the most recent CVS version of Cromwell (the Linux BIOS for the XBOX) to boot. Rink will be maintaining this port, and is interested in feedback. He's setup a website http://xbox-bsd.nl to report the latest developments. Any silly mistakes are my fault. Submitted by: Rink P.W. Springer rink at stack dot nl and Ed Schouten ed at fxq dot nl	2005-11-09 03:55:40 +00:00
Scott Long	f6f8bc7ad5	The hptmv inherently believes that a 'long' can hold a physical address. This hasn't been true on i386 for at least a decade, probably longer, but I'm too lazy to look up the exact year that PAE support was introduced. Thus, this driver doesn't work on PAE. X-MFC After: now	2005-11-08 22:12:26 +00:00
Marcel Moolenaar	38195fdcaf	Add uart(4). When both sio(4) and uart(4) can handle a serial port, sio(4) will claim it. This change therefore only affects how ports are handled when they are not claimed by sio(4), and in principle will improve hardware support. MFC after: 2 months	2005-11-05 19:48:53 +00:00
Poul-Henning Kamp	d55d015667	Unbreak !SMP kernels	2005-11-05 10:42:40 +00:00
Peter Wemm	68a443c292	MFamd64: indent with tabs instead of spaces.	2005-11-04 22:53:44 +00:00
Alan Cox	e9cb1037da	Begin and end the initialization of pvzone in pmap_init(). Previously, pvzone's initialization was split between pmap_init() and pmap_init2(). This split initialization was the underlying cause of some UMA panics during initialization. Specifically, if the UMA boot pages was exhausted before the pvzone was fully initialized, then UMA, through no fault of its own, would use an inappropriate back-end allocator leading to a panic. (Previously, as a workaround, we have increased the UMA boot pages.) Fortunately, there is no longer any reason that pvzone's initialization cannot be completed in pmap_init(). Eliminate a check for whether pv_entry_high_water has been initialized or not from get_pv_entry(). Since pvzone's initialization is completed in pmap_init(), this check is no longer needed. Use cnt.v_page_count, the actual count of available physical pages, instead of vm_page_array_size to compute the maximum number of pv entries. Introduce the vm.pmap.pv_entries tunable on alpha and ia64. Eliminate some unnecessary white space. Discussed with: tegge (item #1) Tested by: marcel (ia64)	2005-11-04 18:03:24 +00:00
John Baldwin	c7362ff7fb	Change the x86 code to allocate IDT vectors on-demand when an interrupt source is first enabled similar to how intr_event's now allocate ithreads on-demand. Previously, we would map IDT vectors 1:1 to IRQs. Since we only have 191 available IDT vectors for I/O interrupts, this limited us to only supporting IRQs 0-190 corresponding to the first 190 I/O APIC intpins. On many machines, however, each PCI-X bus has its own APIC even though it only has 1 or 2 devices, thus, we were reserving between 24 and 32 IRQs just for 1 or 2 devices and thus 24 or 32 IDT vectors. With this change, a machine with 100 IRQs but only 5 in use will only use up 5 IDT vectors. Also, this change provides an API (apic_alloc_vector() and apic_free_vector()) that will allow a future MSI interrupt source driver to request IDT vectors for use by MSI interrupts on x86 machines. Tested on: amd64, i386	2005-11-02 20:11:47 +00:00
John Baldwin	d394d454b0	Throw the switch and turn on STOP_NMI on in GENERIC for amd64 and i386. Requested by: kris Ok'd by: scottl	2005-11-01 22:59:03 +00:00
Jung-uk Kim	e8d472a7af	Catch up with ACPI-CA 20051021 import	2005-11-01 22:44:08 +00:00
Alan Cox	f7118bdf3b	Instead of a panic()ing in pmap_insert_entry() if get_pv_entry() fails, reclaim a pv entry by destroying a mapping to an inactive page. Change the format strings in many of the assertions that were recently converted from PMAP_DIAGNOSTIC printf()s so that they are compatible with PAE. Avoid unnecessary differences between the amd64 and i386 format strings.	2005-10-31 21:25:33 +00:00
John Baldwin	296c4b1ad5	Hook nve(4) up in i386 and amd64 NOTES. MFC after: 1 week	2005-10-31 20:45:37 +00:00
Robert Watson	5bb84bc84b	Normalize a significant number of kernel malloc type names: - Prefer '_' to ' ', as it results in more easily parsed results in memory monitoring tools such as vmstat. - Remove punctuation that is incompatible with using memory type names as file names, such as '/' characters. - Disambiguate some collisions by adding subsystem prefixes to some memory types. - Generally prefer lower case to upper case. - If the same type is defined in multiple architecture directories, attempt to use the same name in additional cases. Not all instances were caught in this change, so more work is required to finish this conversion. Similar changes are required for UMA zone names.	2005-10-31 15:41:29 +00:00
Alan Cox	6fb8d0e3a7	Replace diagnostic printf()s by assertions. Use consistent style for similar assertions.	2005-10-30 20:47:42 +00:00
Joerg Wunsch	9b229abc8f	Finally complete some work on generalizing the PCF8584-based I2C drivers I started quite some time before. Retire the old i386-only pcf driver, and activate the new general driver that has been sitting in the tree already for quite some time. Build the i2c modules for sparc64 architectures as well (where I've been developing all this on).	2005-10-28 15:58:19 +00:00
John Baldwin	85d72e4a2e	Create a default kernel config for i386 and move 'device isa' and 'device npx' (both of which aren't really optional right now) and 'device io' and 'device mem' (to preserve POLA for 4.x users upgrading to 6.0) from GENERIC into DEFAULTS. Requested by: scottl Reviewed by: scottl	2005-10-27 17:34:35 +00:00
John Baldwin	e0f66ef861	Reorganize the interrupt handling code a bit to make a few things cleaner and increase flexibility to allow various different approaches to be tried in the future. - Split struct ithd up into two pieces. struct intr_event holds the list of interrupt handlers associated with interrupt sources. struct intr_thread contains the data relative to an interrupt thread. Currently we still provide a 1:1 relationship of events to threads with the exception that events only have an associated thread if there is at least one threaded interrupt handler attached to the event. This means that on x86 we no longer have 4 bazillion interrupt threads with no handlers. It also means that interrupt events with only INTR_FAST handlers no longer have an associated thread either. - Renamed struct intrhand to struct intr_handler to follow the struct intr_foo naming convention. This did require renaming the powerpc MD struct intr_handler to struct ppc_intr_handler. - INTR_FAST no longer implies INTR_EXCL on all architectures except for powerpc. This means that multiple INTR_FAST handlers can attach to the same interrupt and that INTR_FAST and non-INTR_FAST handlers can attach to the same interrupt. Sharing INTR_FAST handlers may not always be desirable, but having sio(4) and uhci(4) fight over an IRQ isn't fun either. Drivers can always still use INTR_EXCL to ask for an interrupt exclusively. The way this sharing works is that when an interrupt comes in, all the INTR_FAST handlers are executed first, and if any threaded handlers exist, the interrupt thread is scheduled afterwards. This type of layout also makes it possible to investigate using interrupt filters ala OS X where the filter determines whether or not its companion threaded handler should run. - Aside from the INTR_FAST changes above, the impact on MD interrupt code is mostly just 's/ithread/intr_event/'. - A new MI ddb command 'show intrs' walks the list of interrupt events dumping their state. It also has a '/v' verbose switch which dumps info about all of the handlers attached to each event. - We currently don't destroy an interrupt thread when the last threaded handler is removed because it would suck for things like ppbus(8)'s braindead behavior. The code is present, though, it is just under #if 0 for now. - Move the code to actually execute the threaded handlers for an interrrupt event into a separate function so that ithread_loop() becomes more readable. Previously this code was all in the middle of ithread_loop() and indented halfway across the screen. - Made struct intr_thread private to kern_intr.c and replaced td_ithd with a thread private flag TDP_ITHREAD. - In statclock, check curthread against idlethread directly rather than curthread's proc against idlethread's proc. (Not really related to intr changes) Tested on: alpha, amd64, i386, sparc64 Tested on: arm, ia64 (older version of patch by cognet and marcel)	2005-10-25 19:48:48 +00:00
Bill Paul	8a3a26385c	Undo the change to pci_cfgdisable() on i386 for now. It seems to fix the amd64 case, but makes the i386 case fail even more often.	2005-10-25 05:32:44 +00:00
Bill Paul	ba3af76df7	Modify the pci_cfgdisable() routine to bring it more in line with other OSes (Solaris, Linux, VxWorks). It's not necessary to write a 0 to the config address register when using config mechanism 1 to turn off config access. In fact, it can be downright troublesome, since it seems to confuse the PCI-PCI bridge in the AMD8111 chipset and cause it to sporadically botch reads from some devices. This is the cause of the missing USP ports problem I was experiencing with my Sun Opteron system. Also correct the case for mechanism 2: it's only necessary to write a 0 to the ENABLE port.	2005-10-25 04:53:29 +00:00
John Baldwin	58553b9925	Rename the KDB_STOP_NMI kernel option to STOP_NMI and make it apply to all IPI_STOP IPIs. - Change the i386 and amd64 MD IPI code to send an NMI if STOP_NMI is enabled if an attempt is made to send an IPI_STOP IPI. If the kernel option is enabled, there is also a sysctl to change the behavior at runtime (debug.stop_cpus_with_nmi which defaults to enabled). This includes removing stop_cpus_nmi() and making ipi_nmi_selected() a private function for i386 and amd64. - Fix ipi_all(), ipi_all_but_self(), and ipi_self() on i386 and amd64 to properly handle bitmapped IPIs as well as IPI_STOP IPIs when STOP_NMI is enabled. - Fix ipi_nmi_handler() to execute the restart function on the first CPU that is restarted making use of atomic_readandclear() rather than assuming that the BSP is always included in the set of restarted CPUs. Also, the NMI handler didn't clear the function pointer meaning that subsequent stop and restarts could execute the function again. - Define a new macro HAVE_STOPPEDPCBS on i386 and amd64 to control the use of stoppedpcbs[] and always enable it for i386 and amd64 instead of being dependent on KDB_STOP_NMI. It works fine in both the NMI and non-NMI cases.	2005-10-24 21:04:19 +00:00
John Baldwin	301268b8ca	When restarting the BSP during cpu_reset() use a membar to ensure that the updated cpustop_restartfunc is seen when the BSP resumes execution. This matches the membar already present in restart_cpus().	2005-10-24 20:53:52 +00:00
John Baldwin	95d84e5461	Use xchg in Xcpustop to close a race and make cpustop_restartfunc truly one-shot in the SMP case (before using the simple mov / cmp / mov sequence could allow multiple CPUs to execute the restart function on resume).	2005-10-24 20:52:26 +00:00
John Baldwin	6b1e0d75b0	- Various small whitespace and style nits. - Use PCPU_GET(cpumask) in preference to 1 << PCPU_GET(cpuid) in a few places.	2005-10-24 20:31:04 +00:00
Nate Lawson	b77b750ee9	Like acpi_throttle, set frequency to 100% in attach. Some BIOSen may set this value lower, making the system quite slow after booting.	2005-10-23 19:38:06 +00:00
Ade Lovett	8d228514fb	Specifically panic() in the case where pmap_insert_entry() fails to get a new pv under high system load where the available pv entries have been exhausted before the pagedaemon has a chance to wake up to reclaim some. Prior to this, the NULL pointer dereference ended up causing secondary panics with rather less than useful resulting tracebacks. Reviewed by: alc, jhb MFC after: 1 week	2005-10-21 19:42:43 +00:00
Robert Watson	273ae68f43	Use svr4_si_{addr,code,errno,signo,trap} in preference to si_{...}. Fix a debugging printf to printf after a variable is first assigned, not before. These are purely build fixes, and need inspection to make sure they were what the original author of the previous changes intended.	2005-10-19 14:59:54 +00:00
Jung-uk Kim	7c799f4520	Redo physical/logical CPU count. Suggested by: jhb	2005-10-17 23:23:20 +00:00
Jung-uk Kim	42fb42a399	Split displaying number of physical and logical cores.	2005-10-17 15:51:28 +00:00
David E. O'Brien	f5dce7aa6e	For AMD processors, nullify CPUID.HTT. FreeBSD has no need for the information it conveys, and it is only confusing people. This fixes incorrect output in the previous commit.	2005-10-16 08:58:27 +00:00
David Xu	98850c388a	Fix compiling.	2005-10-15 22:57:39 +00:00
Jung-uk Kim	9c3acb0bc1	- Print number of physical/logical cores and more CPUID info. - Add newer CPUID definitions for future use. Many thanks to Mike Tancsa <mike at sentex dot net> for providing test cases for Intel Pentium D and AMD Athlon 64 X2. Approved by: anholt (mentor)	2005-10-14 22:52:01 +00:00
John Baldwin	728ef95410	The signal code is now an int rather than a long, so update debug printfs.	2005-10-14 20:22:57 +00:00
John Baldwin	021eda1d85	Remove the sx(4) driver at the request of the author. The author originally wrote it for 4.x and hasn't really had the time to fully update it to 5.x and later. Also, the author doesn't use the hardware anymore as well. If someone does need this driver they can always resurrect it from the Attic. Requested by: Frank Mayhar frank at exit dot com	2005-10-14 18:24:58 +00:00
Ruslan Ermilov	6f6b430e2f	Sort ath_rate_* entries. Mark ath_rate_sample as the desired algorithm. Discussed with: sam	2005-10-14 17:22:28 +00:00
David Xu	9104847f21	1. Change prototype of trapsignal and sendsig to use ksiginfo_t *, most changes in MD code are trivial, before this change, trapsignal and sendsig use discrete parameters, now they uses member fields of ksiginfo_t structure. For sendsig, this change allows us to pass POSIX realtime signal value to user code. 2. Remove cpu_thread_siginfo, it is no longer needed because we now always generate ksiginfo_t data and feed it to libpthread. 3. Add p_sigqueue to proc structure to hold shared signals which were blocked by all threads in the proc. 4. Add td_sigqueue to thread structure to hold all signals delivered to thread. 5. i386 and amd64 now return POSIX standard si_code, other arches will be fixed. 6. In this sigqueue implementation, pending signal set is kept as before, an extra siginfo list holds additional siginfo_t data for signals. kernel code uses psignal() still behavior as before, it won't be failed even under memory pressure, only exception is when deleting a signal, we should call sigqueue_delete to remove signal from sigqueue but not SIGDELSET. Current there is no kernel code will deliver a signal with additional data, so kernel should be as stable as before, a ksiginfo can carry more information, for example, allow signal to be delivered but throw away siginfo data if memory is not enough. SIGKILL and SIGSTOP have fast path in sigqueue_add, because they can not be caught or masked. The sigqueue() syscall allows user code to queue a signal to target process, if resource is unavailable, EAGAIN will be returned as specification said. Just before thread exits, signal queue memory will be freed by sigqueue_flush. Current, all signals are allowed to be queued, not only realtime signals. Earlier patch reviewed by: jhb, deischen Tested on: i386, amd64	2005-10-14 12:43:47 +00:00
David Xu	ac2587e125	Add POSIX siginfo_t's si_code, this is for upcoming POSIX realtime signal support in kernel. Earlier patch reviewed by: jhb, deischen	2005-10-14 03:01:14 +00:00
Alan Cox	c71c8706fe	Restore the UP optimization to reduce the number of TLB invalidations. The previous revision only restored the MP optimization. Describe the optimization strategy for TLB invalidations in a comment. Reviewed by: ups@ MFC after: 3 days	2005-10-13 23:42:12 +00:00
Stephan Uphoff	f6f67ea993	Restore optimizations to reduce TLB shootdowns. Alan Cox pointed out that they are really useful for sendfile(). MFC after: 3 days	2005-10-13 03:55:25 +00:00
Stephan Uphoff	4c0e268adf	Ensure that a thread stays on same CPU when calculating per CPU TLB shootdown requirements. Otherwise a CPU may not get the needed TLB invalidation. The PTE valid and access flags can not be used here to avoid TLB shootdowns unless sf->cpumask == all_cpus. ( Otherwise some CPUs may still hold an even older entry in the TLB) Since sf_buf_alloc mappings are normally always used this is also not really useful and presetting accessed and modified allows the CPU to speculatively load the entry into the TLB. Both bugs can cause random data corruption. MFC after: 3 days	2005-10-12 01:41:48 +00:00
Gleb Smirnoff	6e65f82cd1	Polling is now configured with help of ifconfig(8), not sysctl. Prodded by: maxim	2005-10-07 08:55:58 +00:00
Gleb Smirnoff	4092996774	Big polling(4) cleanup. o Axe poll in trap. o Axe IFF_POLLING flag from if_flags. o Rework revision 1.21 (Giant removal), in such a way that poll_mtx is not dropped during call to polling handler. This fixes problem with idle polling. o Make registration and deregistration from polling in a functional way, insted of next tick/interrupt. o Obsolete kern.polling.enable. Polling is turned on/off with ifconfig. Detailed kern_poll.c changes: - Remove polling handler flags, introduced in 1.21. The are not needed now. - Forget and do not check if_flags, if_capenable and if_drv_flags. - Call all registered polling handlers unconditionally. - Do not drop poll_mtx, when entering polling handlers. - In ether_poll() NET_LOCK_GIANT prior to locking poll_mtx. - In netisr_poll() axe the block, where polling code asks drivers to unregister. - In netisr_poll() and ether_poll() do polling always, if any handlers are present. - In ether_poll_[de]register() remove a lot of error hiding code. Assert that arguments are correct, instead. - In ether_poll_[de]register() use standard return values in case of error or success. - Introduce poll_switch() that is a sysctl handler for kern.polling.enable. poll_switch() goes through interface list and enabled/disables polling. A message that kern.polling.enable is deprecated is printed. Detailed driver changes: - On attach driver announces IFCAP_POLLING in if_capabilities, but not in if_capenable. - On detach driver calls ether_poll_deregister() if polling is enabled. - In polling handler driver obtains its lock and checks IFF_DRV_RUNNING flag. If there is no, then unlocks and returns. - In ioctl handler driver checks for IFCAP_POLLING flag requested to be set or cleared. Driver first calls ether_poll_[de]register(), then obtains driver lock and [dis/en]ables interrupts. - In interrupt handler driver checks IFCAP_POLLING flag in if_capenable. If present, then returns.This is important to protect from spurious interrupts. Reviewed by: ru, sam, jhb	2005-10-01 18:56:19 +00:00
John Baldwin	29442a30e2	Add interrupt counters for IPIs. By default they are disabled, but they can be enabled by enabling COUNT_IPIS in smptests.h. When enabled, each CPU provides an interrupt counter for nearly all of the IPIs it receives (IPI_STOP currently doesn't have a counter) that can be examined using vmstat -i, etc. MFC after: 3 days Requested by: rwatson	2005-09-28 18:04:11 +00:00
John Baldwin	ea688ef40b	Rename the lapic timer interrupt counters from lapicX: timer to cpuX: timer since it's not always obvious that lapic == cpu. MFC after: 3 days	2005-09-28 18:01:41 +00:00
Marius Strobl	b7c96c0d0b	Add a font width argument to vi_load_font_t, vi_save_font_t and vi_putm_t and do some preparations for handling 12x22 fonts (currently lots of code implies and/or hardcodes a font width of 8 pixels). This will be required on sparc64 which uses a default font size of 12x22 in order to add font loading and saving support as well as to use a syscons(4)-supplied mouse pointer image. This API breakage is committed now so it can be MFC'ed in time for 6.0 and later on upcoming framebuffer drivers destined for use on sparc64 and which are expected to rely on using font loading internally and on a syscons(4)-supplied mouse pointer image can be easily MFC'ed to RELENG_6 rather than requiring a backport. Tested on: i386, sparc64, make universe MFC after: 1 week	2005-09-28 14:54:07 +00:00
Robert Watson	5f419982c2	Back out alpha/alpha/trap.c:1.124, osf1_ioctl.c:1.14, osf1_misc.c:1.57, osf1_signal.c:1.41, amd64/amd64/trap.c:1.291, linux_socket.c:1.60, svr4_fcntl.c:1.36, svr4_ioctl.c:1.23, svr4_ipc.c:1.18, svr4_misc.c:1.81, svr4_signal.c:1.34, svr4_stat.c:1.21, svr4_stream.c:1.55, svr4_termios.c:1.13, svr4_ttold.c:1.15, svr4_util.h:1.10, ext2_alloc.c:1.43, i386/i386/trap.c:1.279, vm86.c:1.58, unaligned.c:1.12, imgact_elf.c:1.164, ffs_alloc.c:1.133: Now that Giant is acquired in uprintf() and tprintf(), the caller no longer leads to acquire Giant unless it also holds another mutex that would generate a lock order reversal when calling into these functions. Specifically not backed out is the acquisition of Giant in nfs_socket.c and rpcclnt.c, where local mutexes are held and would otherwise violate the lock order with Giant. This aligns this code more with the eventual locking of ttys. Suggested by: bde	2005-09-28 07:03:03 +00:00
John Baldwin	3c2bc2bf26	Add a new atomic_fetchadd() primitive that atomically adds a value to a variable and returns the previous value of the variable. Tested on: i386, alpha, sparc64, arm (cognet) Reviewed by: arch@ Submitted by: cognet (arm) MFC after: 1 week	2005-09-27 17:39:11 +00:00
Poul-Henning Kamp	5d3ea8713c	__RMAN_RESOURCE_VISIBLE not necessary.	2005-09-25 20:21:48 +00:00
Robert Watson	84d2b7df26	Add GIANT_REQUIRED and WITNESS sleep warnings to uprintf() and tprintf(), as they both interact with the tty code (!MPSAFE) and may sleep if the tty buffer is full (per comment). Modify all consumers of uprintf() and tprintf() to hold Giant around calls into these functions. In most cases, this means adding an acquisition of Giant immediately around the function. In some cases (nfs_timer()), it means acquiring Giant higher up in the callout. With these changes, UFS no longer panics on SMP when either blocks are exhausted or inodes are exhausted under load due to races in the tty code when running without Giant. NB: Some reduction in calls to uprintf() in the svr4 code is probably desirable. NB: In the case of nfs_timer(), calling uprintf() while holding a mutex, or even in a callout at all, is a bad idea, and will generate warnings and potential upset. This needs to be fixed, but was a problem before this change. NB: uprintf()/tprintf() sleeping is generally a bad ideas, as is having non-MPSAFE tty code. MFC after: 1 week	2005-09-19 16:51:43 +00:00
Christian S.J. Peron	33cdc78d01	Introduce a kernel config for the Mandatory Access Control framework. This kernel config briefly describes some of the major MAC policies available on FreeBSD. The hope is that this will raise the awareness about MAC and get more people interested. Discussed with: scottl	2005-09-18 03:15:36 +00:00
Warner Losh	e429f92618	Expose legacy_pcib_alloc_resource, and use it in the mptable pci bus implementation, like other routines in the legacy bus. This should fix problems with resource allocation on MP systems without ACPI enabled.	2005-09-17 23:57:53 +00:00
Warner Losh	dca2069084	Commit a workaround to a problem with resource allocation. This helps with some Dell servers that booted w/o a problem[*] on 5.4, but failed with 6.0-BETA. On the PCI bus, when we do lazy resource allocation, we narrow the range requested as we pass through bridges to reflect how the bridges are programmed and what addresses they pass. However, when we're doing an allocation on a bus that's directly connected to a host bridge, no such translation can take place. We already had a fallback range for memory requests, but none for ioports. As such, provide a fallback for I/O ports so we don't allocate location 0, which will have undesired side effects when the resources are actually used. This fixes a problem with booting a Dell server with usb in the kernel. However, it is an unsatisfying solution. I don't like the hard coded value, and I think we should start narrowing the resources returned to not be in the so-called isa alias area (where the ranage & 0x0300 must be 0 iirc). Doing such filtering will have to wait for another day. This may be a good 6 candidate, maybe after its had a chance to be refined. Tested by: glebius@	2005-09-16 07:02:29 +00:00
John Baldwin	80d52f16da	Stop using the '+' constraint modifier with inline assembly. The '+' constraint is actually only allowed for register operands. Instead, use separate input and output memory constraints. Education from: alc Reviewed by: alc Tested on: i386, alpha MFC after: 1 week	2005-09-15 19:31:22 +00:00

... 3 4 5 6 7 ...

10868 Commits