freebsd-dev

Author	SHA1	Message	Date
Robert Watson	16aebf571f	Add some basic KTR tracing to busdma on i386. This is likely not the final set of traces -- someone with more busdma background will probably want to review and expand this, as well as port to other platforms. This tracing is sufficient to identify key busdma events on i386, and in particular to draw attention to bounce buffering events that may have a substantial performance impact.	2004-10-23 10:34:27 +00:00
Nate Lawson	cc62efa527	Remove a "needs Giant" flag from the /dev/apm compat device. MFC after: 2 weeks	2004-10-22 17:17:12 +00:00
Poul-Henning Kamp	95bc568977	Add new function ttyinitmode() which sets our systemwide default modes on a tty structure. Both the ".init" and the current settings are initialized allowing the function to be used both at attach and open time. The function takes an argument to decide if echoing should be enabled. Echoing should not be enabled for regular physical serial ports unless they are consoles, in which case they should be configured by ttyconsolemode() instead. Use the new function throughout.	2004-10-18 21:51:27 +00:00
Alan Cox	20351faf18	When sf_buf_alloc() replaces a virtual-to-physical mapping, it needn't invalidate the TLB(s) if the old mapping wasn't used by the CPU. With network interfaces that implement checksum off-loading, the old mapping is almost never used by the CPU, only by the device driver for setting up the DMA operation. Reviewed by: tegge@	2004-10-16 22:32:50 +00:00
Nate Lawson	ccd582b0fd	Let nexus print our flags for us. Also, clean up an obfuscated if stmt.	2004-10-14 22:37:51 +00:00
Nate Lawson	8f528832e5	Print flags in the nexus for child devices.	2004-10-14 22:36:47 +00:00
Nate Lawson	e5979322ba	Remove local hacks to set flags now that the device probe does this for us. Tested on every device except sio_pci and the pc98 fd.c. Perhaps something similar should be done for the "disabled" hints also. MFC after: 2 weeks	2004-10-14 22:21:59 +00:00
Poul-Henning Kamp	09af1b6cdd	Add zero flags argument to sysctl calls.	2004-10-12 07:59:02 +00:00
Poul-Henning Kamp	e1e785a3d4	Add missing zero flag arguments to sysctl calls. Add missing pointy hat to peter@	2004-10-12 07:58:13 +00:00
Warner Losh	fd492ee0e6	Make the lower range of the memory area 0x80000000 again. Also introduce hw.{pci,acpi}.host_mem_start tunable to change this. MFC: ASAP	2004-10-11 21:10:23 +00:00
Nate Lawson	7f35f90eae	Match surrounding style, not style(msmith).	2004-10-11 05:42:12 +00:00
Nate Lawson	31ad3b8802	Move the code for halting the CPU (acpi_cpu_c1) into machdep files. This removes the last MD portion of acpi_cpu.c. MFC after: 2 weeks	2004-10-11 05:39:15 +00:00
Warner Losh	905454c86c	Fix conflicts I didn't fix before I committed my busspace changes. Noticed by: ru@ (and likely tinderbox, I haven't checked)	2004-10-11 00:58:24 +00:00
Warner Losh	ac00fac23a	Convert to newbus. (chances are we could now move this to dev/pbio since I believe it is now MI, but that hasn't been done yet). Reviewed by: dds	2004-10-10 03:26:20 +00:00
David E. O'Brien	3b33d41dc2	style(9)	2004-10-09 08:31:21 +00:00
Alan Cox	aced26ce6e	Make pte_load_store() an atomic operation in all cases, not just i386 PAE. Restructure pmap_enter() to prevent the loss of a page modified (PG_M) bit in a race between processors. (This restructuring assumes the newly atomic pte_load_store() for correct operation.) Reviewed by: tegge@ PR: i386/61852	2004-10-08 08:23:43 +00:00
Warner Losh	bacb482d94	Port pbio to HEAD. OK'd by: dds	2004-10-07 16:21:03 +00:00
Warner Losh	e625cbacaf	Add missing 'static'	2004-10-06 15:18:12 +00:00
Warner Losh	0b3a486f21	For legacy PCI bridges, limit memory allocation to the top 32MB of RAM. Many older, legacy bridges only allow allocation from this range. This only applies to devices that don't have their memory assigned by the BIOS (since we allocate the ranges so assigned exactly), so should have minimal impact. However, CardBus bridges (cbb) rarely get their resources allocated by the BIOS, and this patch helps them greatly. Typically the 'bad Vcc' messages are caused by this problem.	2004-10-06 07:22:58 +00:00
Alan Cox	caa665aae3	Undo revision 1.251. This change was a performance pessimizing work-around that is no longer required. (In fact, it is not clear that it was ever required in HEAD or RELENG_4, only RELENG_3 required a work-around.) Now, as before revision 1.251, if the preexisting PTE is invalid, pmap_enter() does not call pmap_invalidate_page() to update the TLB(s). Note: Even with this change, the handling of a copy-on-write fault is inefficient, in such cases pmap_enter() calls pmap_invalidate_page() twice. Discussed with: bde@ PR: kern/16568	2004-10-03 20:14:07 +00:00
Alan Cox	8ceb3dcb60	The physical address stored in the vm_page is page aligned. There is no need to mask off the page offset bits. (This operation made some sense prior to i386/i386/pmap.c revision 1.254 when we passed a physical address rather than a vm_page pointer to pmap_enter().)	2004-10-03 00:16:43 +00:00
Alan Cox	07b3303943	Eliminate unnecessary uses of PHYS_TO_VM_PAGE() from pmap_enter(). These uses predate the change in the pmap_enter() interface that replaced the page's physical address by the address of its vm_page structure. The PHYS_TO_VM_PAGE() was being used to compute the address of the same vm_page structure that was being passed in.	2004-10-02 07:34:58 +00:00
Yoshihiro Takahashi	92f8f73a93	Fix BIOS default geometry on pc98. PR: kern/72225 Submitted by: Hirokazu WATANABE <wnabe@par.odn.ne.jp>	2004-10-01 15:57:23 +00:00
David Schultz	46ec41ecb4	Fix the following race: 1. Process p1 is currently being swapped in. 2. Process p2 calls linux_ptrace(PTRACE_GETFPXREGS, p1_pid, ...) 3. After acquiring a reference to FIRST_THREAD_IN_PROC(p1), p2 blocks in faultin() while p1 finishes being swapped in. This means p2 won't get back the lock on p1 until after p1's threads are runnable. 4. After p1 is swapped in, the first thread in p1 exits. 5. p2 now uses its dangling reference to p1's first thread.	2004-10-01 05:01:00 +00:00
Alan Cox	0a752e9843	Prevent the unexpected deallocation of a page table page while performing pmap_copy(). This entails additional locking in pmap_copy() and the addition of a "flags" parameter to the page table page allocator for specifying whether it may sleep when memory is unavailable. (Already, pmap_copy() checks the availability of memory, aborting if it is scarce. In theory, another CPU could, however, allocate memory between pmap_copy()'s check and the call to the page table page allocator, causing the current thread to release its locks and sleep. This change makes this scenario impossible.) Reviewed by: tegge@	2004-09-29 19:20:40 +00:00
John Baldwin	9eba48462e	Improve the panic message for a busted MP table with conflicting entries for the same PCI interrupt. Tested by: Pavel Gubin pg at ie dot tusur dot ru MFC after: 3 days	2004-09-24 18:42:54 +00:00
Roman Kurakin	9b27ceb6dc	Invalidate cache after changing pte entry. Discussed with: jhp and njl MFC after: 5 days	2004-09-23 16:06:27 +00:00
Matt Jacob	1db03259c9	PAE seems to work for isp, at least under minimal testing.	2004-09-23 05:26:19 +00:00
Alan Cox	a971139680	Correct a long-standing error in _pmap_unwire_pte_hold() affecting multiprocessors. Specifically, the error is conditioning the call to pmap_invalidate_page() on whether the pmap is active on the current CPU. This call must be unconditional. Regardless of whether the pmap is active on the CPU performing _pmap_unwire_pte_hold(), it could be active on another CPU. For example, a call to pmap_remove_all() by the page daemon could result in a call to _pmap_unwire_pte_hold() with the pmap inactive on the current CPU and active on another CPU. In such circumstances, failing to call pmap_invalidate_page() results in a stale TLB entry on the other CPU that still maps the now deallocated page table page. What happens next is typically a mysterious panic in pmap_enter() by the other CPU, either "pmap_enter: attempted pmap_enter on 4MB page" or "pmap_enter: pte vanished, va: 0x%lx". Both occur because the former page table page has been recycled and allocated to a new purpose. Consequently, it no longer contains zeroes. See also Peter's i386/i386/pmap.c revision 1.448 and the related e-mail thread last year. Many thanks to the engineers at Sandvine for providing clear and concise information until all of the pieces of the puzzle fell into place and for testing an earlier patch. MT5 Candidate	2004-09-22 05:01:48 +00:00
John Baldwin	76764432e4	- Add support for "paging" in stack trace output. That is, when you do a stack trace from ddb, the output will pause with a '--More--' prompt every 18 lines. If you hit Enter, it will print another line and prompt again. If you hit space it will output another page and then prompt. If you hit 'q' or 'x' it will abort the rest of the stack trace. - Fix the sparc64 userland stack trace to honor the total count of lines to print. This is useful if your trace happens to walk back onto 0xdeadc0de and gets stuck in an endless loop. MFC after: 1 month Tested on: i386, alpha, sparc64	2004-09-20 19:05:32 +00:00
Alan Cox	de6c3db01f	Simplify the reference counting of page table pages. Specifically, use the page table page's wired count rather than its hold count to contain the reference count. My rationale for this change is based on several factors: 1. The machine-independent and pmap layers used the same hold count field in subtly different ways. The machine-independent layer uses the hold count to implement a form of ephemeral wiring that is used by pipes, physio, etc. In other words, subsystems where we wish to temporarily block a page from being swapped out while it is mapped into the kernel's address space. Such pages are never removed from the page queues. Instead, the page daemon recognizes a non-zero hold count to mean "hands off this page." In contrast, page table pages are never in the page queues; they are wired from birth to death. The hold count was being used as a kind of reference count, specifically, the number of valid page table entries within the page. Not surprisingly, these two different uses imply different synchronization rules: in the machine-independent layer access to the hold count requires the page queues lock; whereas in the pmap layer the pmap lock is required. Thus, continued use by the pmap layer of vm_page_unhold(), which asserts that the page queues lock is held, made no sense. 2. _pmap_unwire_pte_hold() was too forgiving in its handling of the wired count. An unexpected wired count on a page table page was ignored and the underlying page leaked. 3. In a word, microoptimization. Using the wired count exclusively, rather than a combination of the wired and hold counts, makes the code slightly smaller and faster. Reviewed by: tegge@	2004-09-19 21:20:01 +00:00
Alan Cox	8478ea241b	Remove an outdated assertion from _pmap_allocpte(). (When vm_page_alloc() succeeds, the page's queue field is unconditionally set to PQ_NONE by vm_pageq_remove_nowakeup().)	2004-09-19 02:39:31 +00:00
Matt Jacob	b3940a8730	Put in a commented out ispfw device under isp and note that this is usually a module.	2004-09-19 00:52:22 +00:00
Alan Cox	7580b56bdc	Release the page queues lock earlier in pmap_protect() and pmap_remove() in order to reduce contention.	2004-09-18 22:56:58 +00:00
Julian Elischer	def46d58a6	Fix breakpoint handling for i386. not sure yet about 5.x... MFC if needed. Also fixes small problems with examining some registers and some specific gdb transfer problems. As the patch says: This is not a pretty patch and only meant as a temporary fix until a better solution is committed. PR: i386/71715 Submitted by: Stephan Uphoff <ups@tree.com> MFC after: 1 week	2004-09-15 23:26:49 +00:00
Poul-Henning Kamp	7ce1979be6	Add a new function isa_dma_init() which returns an errno when it fails and which takes a M_WAITOK/M_NOWAIT flag argument. Add compatibility isa_dmainit() macro which whines loudly if isa_dma_init() fails. Problem uncovered by: tegge	2004-09-15 12:09:50 +00:00
Poul-Henning Kamp	5757a0b985	Remove now unused #include files.	2004-09-15 12:02:35 +00:00
Alan Cox	031102cc7b	Use an atomic op to update the pte in pmap_protect(). This is to prevent the loss of a page modified (PG_M) bit in a race between processors. Quoting Tor: One scenario where the old code could cause a lost PG_M bit is a multithreaded linux program (or FreeBSD program using the linuxthreads port) where one thread was starting a subprocess. The thread doing fork() would call vmspace_fork(), which would then call vm_map_copy_entry() which would call pmap_protect() on an area possibly accessed by other threads. Additionally, make the clearing of PG_M by pmap_protect() unconditional if write permission is removed. Previously, PG_M could persist on a read-only unmanaged page. That seems inconsistent and confusing. In collaboration with: tegge@ MT5 candidate PR: 61852	2004-09-12 20:20:40 +00:00
Scott Long	1e7fad6b6a	Revert the previous round of changes to td_pinned. The scheduler isn't fully initialized when the pmap layer tries to call sched_pin() early in the boot, which results in a quick panic. Use ke_pinned instead as was originally done with Tor's patch. Approved by: julian	2004-09-11 10:07:22 +00:00
Scott Long	9e0c3bdf64	Double the number of kernel page tables for amd64 and for i386/PAE. The old value was only enough for 8GB of RAM, the new value can do 16GB. This still isn't optimal since it doesn't scale. Fixing this for amd64 looks to be fairly easy, but for i386 will be quite difficult. Reviewed by: peter	2004-09-11 01:31:26 +00:00
Julian Elischer	5c854accc1	Make up my mind if cpu pinning is stored in the thread structure or the scheduler specific extension to it. Put it in the extension as the implementation details of how the pinning is done needn't be visible outside the scheduler. Submitted by: tegge (of course!) (with changes) MFC after: 3 days	2004-09-10 22:28:33 +00:00
Bill Paul	a07bd003bf	Add device driver support for the VIA Networking Technologies VT6122 gigabit ethernet chip and integrated 10/100/1000 copper PHY. The vge driver has been added to GENERIC for i386, pc98 and amd64, but not to sparc or ia64 since I don't have the ability to test it there. The vge(4) driver supports VLANs, checksum offload and jumbo frames. Also added the lge(4) and nge(4) drivers to GENERIC for i386 and pc98 since I was in the neighborhood. There's no reason to leave them out anymore.	2004-09-10 20:57:46 +00:00
John Baldwin	64621fc5af	Teach the stack trace code how to step across a double fault when stepping across frames. Basically, if the current frame is for the 'dblfault_handler' function, then get the next %eip and %ebp values to use from the original TSS of the thread that has the saved state when the double fault triggered. MFC after: 4 days	2004-09-09 20:39:31 +00:00
Alan Cox	e232eb8288	Use atomic ops in pmap_clear_ptes() to prevent SMP races that could result in the loss of an accessed or modified bit from the pte. In collaboration with: tegge@ MT5 candidate	2004-09-08 18:58:29 +00:00
Scott Long	50736a153b	Fix a problem with tag->boundary inheritance that has existed since day one and was propagated to nearly every platform. The boundary of the child needs to consider the boundary of the parent and pick the minimum of the two, not the maximum. However, if either is 0 then pick the appropriate one. This bug was exposed by a recent change to ATA, which should now be fixed by this change. The alignment and maxsegsz tag attributes likely also need a similar review in the near future. This is a MT5 candidate. Reviewed by: marcel Submitted by: sos (in part)	2004-09-08 04:54:19 +00:00
Scott Long	4ef90982ca	Fix a cut-n-paste glitch with SCHED_4BSD.	2004-09-07 22:44:55 +00:00
Scott Long	444ba94513	Switch the default scheduler to 4BSD to match what will go into RELENG_5 soon. It can be switched back once 5.3 is tested and released. Also turn on PREEMPTION as many of the stability problems with it have been fixed. MT5: 3 days.	2004-09-07 22:37:43 +00:00
Doug Rabson	bd263739c1	Regen.	2004-09-06 09:33:30 +00:00
Doug Rabson	1bc85c0dea	Add a few stub syscalls to get TransGaming's winex a bit closer to working.	2004-09-06 09:32:59 +00:00
Julian Elischer	ed062c8d66	Refactor a bunch of scheduler code to give basically the same behaviour but with slightly cleaned up interfaces. The KSE structure has become the same as the "per thread scheduler private data" structure. In order to not make the diffs too great one is #defined as the other at this time. The KSE (or td_sched) structure is now allocated per thread and has no allocation code of its own. Concurrency for a KSEGRP is now kept track of via a simple pair of counters rather than using KSE structures as tokens. Since the KSE structure is different in each scheduler, kern_switch.c is now included at the end of each scheduler. Nothing outside the scheduler knows the contents of the KSE (aka td_sched) structure. The fields in the ksegrp structure that are to do with the scheduler's queueing mechanisms are now moved to the kg_sched structure. (per ksegrp scheduler private data structure). In other words how the scheduler queues and keeps track of threads is no-one's business except the scheduler's. This should allow people to write experimental schedulers with completely different internal structuring. A scheduler call sched_set_concurrency(kg, N) has been added that notifies the scheduler that no more than N threads from that ksegrp should be allowed to be concurrently scheduled. This is also used to enforce 'fairness' at this time so that a ksegrp with 10000 threads can not swamp the run queue and force out a process with 1 thread, since the current code will not set the concurrency above NCPU, and both schedulers will not allow more than that many onto the system run queue at a time. Each scheduler should eventually develop their own methods to do this now that they are effectively separated. Rejig libthr's kernel interface to follow the same code paths as libkse for scope system threads. This has slightly hurt libthr's performance but I will work to recover as much of it as I can. Thread exit code has been cleaned up greatly. exit and exec code now transitions a process back to 'standard non-threaded mode' before taking the next step. Reviewed by: scottl, peter MFC after: 1 week	2004-09-05 02:09:54 +00:00
Scott Long	9923b511ed	Turn PREEMPTION into a kernel option. Make sure that it's defined if FULL_PREEMPTION is defined. Add a runtime warning to ULE if PREEMPTION is enabled (code inspired by the PREEMPTION warning in kern_switch.c). This is a possible MT5 candidate.	2004-09-02 18:59:15 +00:00
Julian Elischer	df3a834f7e	Give up trying to make preemption dependent on SCHED_4BSD; the list of breakages was getting too long.	2004-09-01 20:41:18 +00:00
Alan Cox	3c3e8d1100	Correction to the previous revision: I forgot to apply the ones complement to a constant. This didn't show in testing because the broken expression produced the same result in my tests as the correct expression.	2004-09-01 19:04:09 +00:00
Julian Elischer	6222ded017	Don't ask for this for modules. No modules need to know about preemption at the moment.	2004-09-01 18:29:57 +00:00
Alan Cox	e33353b52b	Modify pmap_pte() to support its use on non-current, non-kernel pmaps without holding Giant.	2004-09-01 18:04:22 +00:00
Scott Long	f164d4148e	Protect the PREEMPTION logic with #ifdef _KERNEL to fix the build.	2004-09-01 10:12:08 +00:00
Julian Elischer	02ea3bcab9	Only turn on preemption for 4BSD. It's still poison for ULE.	2004-09-01 09:01:32 +00:00
Julian Elischer	6804a3ab6d	Give the 4bsd scheduler the ability to wake up idle processors when there is new work to be done. MFC after: 5 days	2004-09-01 06:42:02 +00:00
Julian Elischer	2630e4c90c	Give setrunqueue() and sched_add() more of a clue as to where they are coming from and what is expected from them. MFC after: 2 days	2004-09-01 02:11:28 +00:00
Matthew N. Dodd	c15033ef44	Clarify SDT feature word bits. Obtained from: NetBSD	2004-08-31 21:51:51 +00:00
Matthew N. Dodd	320be82c0f	Fix checksum calculation. Submitted by: Jean Delvare <khali@linux-fr.org>	2004-08-31 21:45:30 +00:00
Julian Elischer	5995adc206	Remove an unneeded argument. The removed argument could trivially be derived from the remaining one. That in turn should be the same as curthread, but it is possible that curthread could be expensive to derive on some systems, so leave it as an argument. Having both proc and thread as arguments just gives an opportunity for them to get out of sync. MFC after: 3 days	2004-08-31 07:34:54 +00:00
Julian Elischer	99e9dcb817	Remove sched_free_thread() which was only used in diagnostics. It has outlived its usefulness and has started causing panics for people who turn on DIAGNOSTIC, in what is otherwise good code. MFC after: 2 days	2004-08-31 06:12:13 +00:00
Peter Wemm	f37a929ca1	Kill count device support from config. I've changed the last few remaining consumers to have the count passed as an option. This is i4b, pc98/wdc, and coda. Bump configvers.h from 500013 to 600000. Remove heuristics that tried to parse "device ed5" as 5 units of the ed device. This broke things like the snd_emu10k1 device, which required quotes to make it parse right. The no-longer-needed quotes have been removed from NOTES, GENERIC etc. eg, I've removed the quotes from: device snd_maestro device "snd_maestro3" device snd_mss I believe everything will still compile and work after this.	2004-08-30 23:03:58 +00:00
Alan Cox	bfa15df9ba	Remove unnecessary check for curthread == NULL.	2004-08-30 03:52:05 +00:00
Dag-Erling Smørgrav	aa8f5987e0	Add a section for hardware watchdog timers, initially populated by ichwd. MFC after: 3 days	2004-08-29 11:11:31 +00:00
David E. O'Brien	dd68efd05b	s/smp_rv_mtx/smp_ipi_mtx/g Requested by: jhb	2004-08-28 00:49:55 +00:00
Marcel Moolenaar	0f2fe153bc	Move the kernel-specific logic to adjust frompc from MI to MD. For these two reasons: 1. On ia64 a function pointer does not hold the address of the first instruction of a function's implementation. It holds the address of a function descriptor. Hence the user(), btrap(), eintr() and bintr() prototypes are wrong for getting the actual code address. 2. The logic forces interrupt, trap and exception entry points to be laid out contiguously. This can not be achieved on ia64 and is generally just bad programming. The MCOUNT_FROMPC_USER macro is used to set the frompc argument to some kernel address which represents any frompc that falls outside the kernel text range. The macro can expand to ~0U to bail out in that case. The MCOUNT_FROMPC_INTR macro is used to set the frompc argument to some kernel address to represent a call to a trap or interrupt handler. This is to avoid having the trap or interrupt handler appear to be called from everywhere in the call graph. The macro can expand to ~0U to prevent adjusting frompc. Note that the argument is selfpc, not frompc. This commit defines the macros on all architectures equivalently to the original code in sys/libkern/mcount.c. People can take it from here... Compile-tested on: alpha, amd64, i386, ia64 and sparc64 Boot-tested on: i386	2004-08-27 19:42:35 +00:00
Alan Cox	8991a235cb	The machine-independent parts of the virtual memory system always pass a valid pmap to the pmap functions that require one. Remove the checks for NULL. (These checks have their origins in the Mach pmap.c that was integrated into BSD. None of the new code written specifically for FreeBSD included them.)	2004-08-27 19:06:17 +00:00
Andre Oppermann	c21fd23260	Always compile PFIL_HOOKS into the kernel and remove the associated kernel compile option. All FreeBSD packet filters now use the PFIL_HOOKS API and thus it becomes a standard part of the network stack. If no hooks are connected the entire packet filter hooks section and related activities are jumped over. This removes any performance impact if no hooks are active. Both OpenBSD and DragonFlyBSD have integrated PFIL_HOOKS permanently as well.	2004-08-27 15:16:24 +00:00
David E. O'Brien	2e262ac39b	Fix a bug in in_cksum_hdr w/o -O. The C code assumes that the carry bit is always kept from the previous operation. However, the pointer indexing requires another add operation. Thus, the carry bit from the first operation is tromped over by the "addl" operation that ends up following it, so the "adcl" that follows that has no effect because the carry bit is cleared before it. The result is checksum failure on received packets. The larger issue is that there isn't any other way of preventing the compiler inserting arbitrary instructions between different __asm statements (and that the commit message in revision 1.13 of in_cksum.h is wrong on this point). From http://developer.apple.com/documentation/DeveloperTools/gcc-3.3/gcc/Extended-Asm.html ---8<---8<---8<--- You can't expect a sequence of volatile asm instructions to remain perfectly consecutive. If you want consecutive output, use a single asm. Also, GCC will perform some optimizations across a volatile asm instruction; GCC does not "forget everything" when it encounters a volatile asm instruction the way some other compilers do. ---8<---8<---8<--- This change also makes the ASM code much easier to read. PR: 69257 Submitted by: Mike Bristow <mike@urgle.com>, Qing Li <qing.li@bluecoat.com>	2004-08-25 18:28:15 +00:00
John Baldwin	ef36ad6921	Correct the arguments to kern_sigaltstack() as they were reversed. PR: kern/68079 Submitted by: Georg-W. Koltermann gwk at rahn-koltermann dot de	2004-08-24 20:52:52 +00:00
John Baldwin	8a7aa72dec	Regenerate after fcntl() wrappers were marked MP safe.	2004-08-24 20:24:34 +00:00
John Baldwin	2ca25ab53e	Fix the ABI wrappers to use kern_fcntl() rather than calling fcntl() directly. This removes a few more users of the stackgap and also marks the syscalls using these wrappers MP safe where appropriate. Tested on: i386 with linux acroread5 Compiled on: i386, alpha LINT	2004-08-24 20:21:21 +00:00
Nate Lawson	9f65aa0340	Be sure to always unlock the sx lock when exiting the sysctl function. MFC after: 3 days	2004-08-24 17:53:25 +00:00
Peter Wemm	f1009e1e1f	Commit Doug White and Alan Cox's fix for the cross-ipi smp deadlock. We were obtaining different spin mutexes (which disable interrupts after acquisition) and spin waiting for delivery. For example, KSE processes do LDT operations which use smp_rendezvous, while other parts of the system are doing things like tlb shootdowns with a different mutex. This patch uses the common smp_rendezvous mutex for all MD home-grown IPIs that spinwait for delivery. Having the single mutex means that the spinloop to acquire it will enable interrupts periodically, thus avoiding the cross-ipi deadlock. Obtained from: dwhite, alc Reviewed by: jhb	2004-08-23 21:39:29 +00:00
Nate Lawson	59d039ecc6	Add a BUS_GET_RESOURCE_LIST method for nexus. MFC after: 3 days	2004-08-23 16:26:16 +00:00
Maxim Sobolev	c1466d61d9	My recent measurement shows that CPU_DISABLE_CMPXCHG is no longer necessary with VMware 4.x. At least with VMware version 4.5.2, the i386 version of atomic_cmpset_int() is about 30 times slower than the non-i386 version. This makes this delta a good 5.3 MFC candidate, since otherwise it will mislead users who run FreeBSD under modern VMware.	2004-08-23 15:55:03 +00:00
Maxim Sobolev	ae2f5301c8	o Fix whitespace bug introduced in the previous commit. Submitted by: ru o Simplify p4tcc_power_profile(). Submitted by: maxim	2004-08-23 10:09:29 +00:00
Maxim Sobolev	acac9ce485	o Extend boot output: print out minimum/maximum performance value and number of performance steps available; o similarly to Enhanced SpeedStep driver, export list of all available steps via hw.p4tcc.cpuperf_levels sysctl.	2004-08-23 09:47:56 +00:00
Alan Cox	0b6a0b955a	Properly free the temporary sf_buf in uiomove_fromphys() if a copyin or copyout fails. Obtained from: DragonFlyBSD	2004-08-21 18:50:34 +00:00
David E. O'Brien	f49f2ca64e	Unconditionally support the AMD64 GART HW.	2004-08-19 20:58:24 +00:00
Nate Lawson	d3bdd24ea9	Disable interrupts after using pmap_enter() to add the identity mapping. Since pmap_enter() calls pmap_invalidate_page(), which needs interrupts enabled in the SMP case, we defer the disable to right before saving the register context. This has been incorrect for about a year but caused no real problems because the identity page never actually replaces a previously mapped page and suspend/resume on SMP systems has been uncommon. Tested by: sos MFC after: 3 days	2004-08-19 18:48:17 +00:00
Justin T. Gibbs	6ec309a62f	Modify the "legacy bus" to pass all resource allocations through to its parent rather than track resources locally. The original code was incomplete in that it would only honor requests for resources that already exist in its resource list. This prevented many ISA identify routines from allocating temporary resources. Passing the requests up to legacy's parent loses no functionality and allows these requests to succeed. Reviewed by: imp, jhb Approved by: RE	2004-08-16 21:55:29 +00:00
David E. O'Brien	3c749e3fb1	AMD64 on-CPU GART support. This also applies to AMD64 HW running 'i386' OS. Submitted by: Jung-uk Kim <jkim@niksun.com> Integration by: obrien	2004-08-16 12:25:48 +00:00
David E. O'Brien	9c737de401	Increase the scaling of VM_KMEM_SIZE_MAX. Submitted by: alc	2004-08-16 08:35:22 +00:00
Tim J. Robbins	0e73a96209	Add a new type, l_uintptr_t, which is an unsigned integer type with the same width as a pointer under Linux. Add two new macros, PTRIN and PTROUT, which convert between l_uintptr_t and native pointers.	2004-08-16 07:05:44 +00:00
Robert Watson	273ad9acbc	Preemptive anti-footshooting: cause a #error if MP_WATCHDOG is compiled with SCHED_ULE.	2004-08-15 20:32:40 +00:00
Robert Watson	932431328b	Spell MP_WATCHDIG right: I fixed the build without MP_WATCHDOG after testing MP_WATCHDOG, and used an incorrect ifdef.	2004-08-15 19:57:14 +00:00
Robert Watson	a632deec30	Add an "options MP_WATCHDOG" to i386. This option allows one of the logical CPUs on a system to be used as a dedicated watchdog to cause a drop to the debugger and/or generate an NMI to the boot processor if the kernel ceases to respond. A sysctl enables the watchdog running out of the processor's idle thread; a callout is launched to reset a timer in the watchdog. If the callout fails to reset the timer for ten seconds, the watchdog will fire. The sysctl allows you to select which CPU will run the watchdog. A sample "debug.leak_schedlock" is included, which causes a sysctl to spin holding sched_lock in order to trigger the watchdog. On my Xeons, the watchdog is able to detect this failure mode and break into the debugger, which cannot otherwise be done without an NMI button. This option does not currently work with sched_ule due to ule's push notion of scheduling, similar to machdep.hlt_logical_cpus failing to work with that scheduler. On face value, this might seem somewhat inefficient, but there are a lot of dual-processor Xeons with HTT around, so using one as a watchdog for testing is not as inefficient as one might fear.	2004-08-15 18:02:09 +00:00
Nate Lawson	8781a7852f	MPSAFE locking * Serialize access to the sysctl routines and the notify handler * Assert that the sx lock is held in any functions they call. * Note that recursively calling to re-enable the hotkeys is sub-optimal.	2004-08-13 06:22:35 +00:00
Nate Lawson	e8a162f4f3	MPSAFE locking * Serialize access to the sysctl routines and the notify handler * Assert that the sx lock is held in any functions they call.	2004-08-13 06:22:31 +00:00
Nate Lawson	1051a7c2da	MPSAFE locking * Serialize access to the sysctl routines and the notify handler.	2004-08-13 06:22:29 +00:00
Marcel Moolenaar	4da47b2fec	Add __elfN(dump_thread). This function is called from __elfN(coredump) to allow dumping per-thread machine specific notes. On ia64 we use this function to flush the dirty registers onto the backingstore before we write out the PRSTATUS notes. Tested on: alpha, amd64, i386, ia64 & sparc64 Not tested on: arm, powerpc	2004-08-11 02:35:06 +00:00
Robert Watson	b505f47bef	Add ADAPTIVE_GIANT to GENERIC on i386, with the intent of making it a standard configuration similar to [NO_]ADAPTIVE_MUTEXES. This feature causes Giant to be included in the set of mutexes adaptively spun on. It appears to have a positive effect on performance on SMP across several workloads, including measurements of a 16% improvement on buildworld, and 30%+ improvement for MySQL using the supersmack benchmark with Giant over the network stack; a 6% improvement without Giant on the network stack (as a result of less giant contention).	2004-08-11 01:34:18 +00:00
Warner Losh	e0eb3c19a7	Remove commented out pcic driver. It is too broken to work (even if you fix the obvious bugs, nastier ones reside below the surface), and having it commented out here just encourages people to try it. # I'm not removing it from the base system, yet.	2004-08-09 17:36:19 +00:00
Alan Cox	a9cb79ba4e	With the advent of pmap locking it makes sense for pmap_copy() to be less forgiving about inconsistencies in the source pmap. Also, remove a newline character terminating a nearby panic string.	2004-08-08 00:31:58 +00:00
Robert Watson	f6b61a6442	Generate KTR trace records for syscall enter and exit in i386 system calls. Note that the information included is a bit different from the existing KTR traces generated on powerpc, as I'm primarily interested in kernel context (thread, syscall #, proc, etc), not the user arguments to the system call. Some convergence would be useful here.	2004-08-06 21:56:26 +00:00
Nate Lawson	0cafadacae	Remove the attempt to cache the previous page mapped at our identity location (for the wake code). It should not be needed since we don't map other pages at the same location and if there was an old mapping, it would be restored by a fault. The old code had serious problems, namely that it was restoring the new page it had just removed (not opage) and it could only guess at the right protection (since there's no pmap_extract_protect function). Thanks to Alan Cox for explaining much of this to me. Also, remove a commented-out initializecpu() call since it is not needed. Restoring the cpu context is better than attempting to init from scratch. Reviewed by: alc (earlier version)	2004-08-05 06:29:12 +00:00
Robert Watson	5fb2253e10	Move definition of mem_range_softc from mp_machdep.c to machdep.c so that it is defined for non-SMP builds, not just SMP ones.	2004-08-05 00:32:08 +00:00
John Baldwin	0e5a07e533	Remove a potential deadlock on i386 SMP by changing the lazypmap ipi and spin-wait code to use the same spin mutex (smp_tlb_mtx) as the TLB ipi and spin-wait code snippets so that you can't get into the situation of one CPU doing a TLB shootdown to another CPU that is doing a lazy pmap shootdown each of which are waiting on each other. With this change, only one of the CPUs would do an IPI and spin-wait at a time.	2004-08-04 20:31:19 +00:00
Mark Murray	a20ad05beb	Fix module builds for i386 and amd64.	2004-08-04 18:30:31 +00:00
Alan Cox	1b3b9cfe1d	Post-locking clean up/simplification, particularly, the elimination of vm_page_sleep_if_busy() and the page table page's busy flag as a synchronization mechanism on page table pages. Also, relocate the inline pmap_unwire_pte_hold() so that it can be used to shorten _pmap_unwire_pte_hold() on alpha and amd64. This places pmap_unwire_pte_hold() next to a comment that more accurately describes it than _pmap_unwire_pte_hold().	2004-08-04 18:04:44 +00:00
Philip Paeps	09003ac33f	Unbreak LINT by making sure that method is always defined. Submitted by: roam Pointy hat to: philip	2004-08-04 14:29:22 +00:00
Philip Paeps	f5296c9302	Further cleanup: merge the three led toggling functions into a single general function to handle all leds. Approved by: njl	2004-08-03 22:37:09 +00:00
Nate Lawson	8390cfe8a6	Use the acpi_{Get,Set}Integer functions instead of rolling custom ones. Clean up return path of each function to have a single exit point. This reduces diffs against the MPSAFE tree.	2004-08-03 21:17:36 +00:00
Mark Murray	d23a262fc5	Making a loadable null.ko for /dev/(null|zero) proved rather unpopular, so remove this (mis)feature. Encouragement provided by: jhb (and others)	2004-08-03 19:24:54 +00:00
Maxime Henrion	9f1b87f106	Instead of calling ia32_pause() conditionally on __i386__ or __amd64__ being defined, define and use a new MD macro, cpu_spinwait(). It only expands to something on i386 and amd64, so the compiled code should be identical. Name of the macro found by: jhb Reviewed by: jhb	2004-08-03 18:44:27 +00:00
Mark Murray	47eb78a768	Sort includes; minor whitespace.	2004-08-02 20:32:56 +00:00
Doug Rabson	4d84a58d1d	Add definitions for TLS relocations.	2004-08-02 19:12:17 +00:00
Scott Long	5ba0615c03	Optimize intr_execute_handlers() by combining the pic_disable_source() and pic_eoi_source() into one call. This halves the number of spinlock operations and indirect function calls in the normal case of handling a normal (ithread) interrupt. Optimize the atpic and ioapic drivers to use inlines where appropriate in supporting the intr_execute_handlers() change. This knocks 900ns, or roughly 1350 cycles, off of the time spent servicing an interrupt in the common case on my 1.5GHz P4 uniprocessor system. SMP systems likely won't see as much of a gain due to the ioapic being more efficient than the atpic. I'll investigate porting this to amd64 soon. Reviewed by: jhb	2004-08-02 15:31:10 +00:00
Mark Murray	b6527a3667	Add the I/O device for those architectures that have it.	2004-08-01 19:37:34 +00:00
Mark Murray	e54788c73d	Remove local hack that was not supposed to be committed. Spotted by: Antoine Brodin - antoine dot brodin at laposte dot net	2004-08-01 18:12:25 +00:00
Scott Long	9352fe30a0	Turn off PREEMPTION by default while it gets debugged. It's been causing 4 weeks of problems including deadlocks and instant panics. Note that the real bugs are likely in the scheduler.	2004-08-01 14:31:45 +00:00
Mark Murray	8ab2f5ecc5	Break out the MI part of the /dev/[k]mem and /dev/io drivers into their own directory and module, leaving the MD parts in the MD area (the MD parts _are_ part of the modules). /dev/mem and /dev/io are now loadable modules, thus taking us one step further towards a kernel created entirely out of modules. Of course, there is nothing preventing the kernel from having these statically compiled.	2004-08-01 11:40:54 +00:00
Alan Cox	c6bf9f0455	Add pmap locking to pmap_object_init_pt().	2004-07-31 06:42:05 +00:00
Alan Cox	a087914310	Advance the state of pmap locking on alpha, amd64, and i386. - Enable recursion on the page queues lock. This allows calls to vm_page_alloc(VM_ALLOC_NORMAL) and UMA's obj_alloc() with the page queues lock held. Such calls are made to allocate page table pages and pv entries. - The previous change enables a partial reversion of vm/vm_page.c revision 1.216, i.e., the call to vm_page_alloc() by vm_page_cowfault() now specifies VM_ALLOC_NORMAL rather than VM_ALLOC_INTERRUPT. - Add partial locking to pmap_copy(). (As a side-effect, pmap_copy() should now be faster on i386 SMP because it no longer generates IPIs for TLB shootdown on the other processors.) - Complete the locking of pmap_enter() and pmap_enter_quick(). (As of now, all changes to a user-level pmap on alpha, amd64, and i386 are performed with appropriate locking.)	2004-07-29 18:56:31 +00:00
Poul-Henning Kamp	0658bb8ef8	Move a relic to its correct location(s): Put nfs diskless initialization calls with the code they call. (Yet another example of mindless copy&paste).	2004-07-28 21:54:57 +00:00
Alexander Kabaev	24a06d1874	Avoid casts as lvalues. While here, avoid storing 32bit quantities in 16bit locations.	2004-07-28 06:32:28 +00:00
Robert Watson	1a8cfbc450	Pass a thread argument into cpu_critical_{enter,exit}() rather than dereference curthread. It is called only from critical_{enter,exit}(), which already dereferences curthread. This doesn't seem to affect SMP performance in my benchmarks, but improves MySQL transaction throughput by about 1% on UP on my Xeon. Head nodding: jhb, bmilekic	2004-07-27 16:41:01 +00:00
Tim J. Robbins	7ee771aa57	Use file2c instead of a combination of hexdump, sed and shell script to generate the wakecode[] array from acpi_wakecode.bin. The old method was not safe in multibyte locales.	2004-07-27 01:33:27 +00:00
Nate Lawson	6edc660c09	Get the acpi softc via the devclass, not by caching the device. Replace apm_softc with a single integer since the whole softc is not used.	2004-07-24 22:41:30 +00:00
Nate Lawson	be22348065	Whitespace cleanup and move static variables together.	2004-07-24 20:40:02 +00:00
Nate Lawson	b4cb140233	Remove unneeded parens and fix whitespace.	2004-07-24 20:39:25 +00:00
Scott Long	17ee0667eb	Arg! Revert local changes that were accidentally included in the previous version.	2004-07-22 15:55:03 +00:00
Scott Long	7c06f85c31	Don't count needed bounce pages if loading a buffer that was created with bus_dmamem_alloc() Submitted by: harti	2004-07-22 15:46:51 +00:00
Olivier Houchard	e1021dde8b	Using NULL as a malloc type when calling contigmalloc() is wrong, so introduce a new malloc type, and use it.	2004-07-21 15:52:34 +00:00
Yoshihiro Takahashi	ee6020c993	Add the ACPI Panasonic extras driver. Submitted by: OGAWA Takaya <t-ogawa@triaez.kaisei.org> and nyan	2004-07-21 14:47:54 +00:00
Marcel Moolenaar	fd32d93b97	Unify db_stack_trace_cmd(). All it did was look up the thread given the thread ID and call db_trace_thread(). Since arm has all the logic in db_stack_trace_cmd(), rename the new DB_COMMAND function to db_stack_trace to avoid conflicts on arm. While here, have db_stack_trace parse its own arguments so that we can use a more natural radix for IDs. If the ID is not a thread ID, or more precisely when no thread exists with the ID, try if there's a process with that ID and return the first thread in it. This makes it easier to print stack traces from the ps output. requested by: rwatson@ tested on: amd64, i386, ia64	2004-07-21 05:07:09 +00:00
David Xu	2396628bb4	Make end of frames for KSE thread, for system scope thread, without this change, debugger will dump a weird stack backtrace.	2004-07-20 01:38:59 +00:00
John Baldwin	788195c186	As a temporary hack, turn off deferred preemptions that are the result of a fast interrupt handler doing an swi_sched(). This fixed the lockups I saw on my laptop when using xmms in KDE and on rwatson's MySQL benchmarks on SMP. This will eventually be removed and/or modified when I figure out what the root cause is and fix that.	2004-07-19 16:37:47 +00:00
David Schultz	479f8d2214	Make FLT_ROUNDS correctly reflect the dynamic rounding mode.	2004-07-19 08:17:25 +00:00
Mike Silbersack	4ca037c6c8	Add a #error requiring KDB if DDB is specified. (This can probably be relocated to a better place, if one exists.)	2004-07-19 02:46:34 +00:00
Alan Cox	aec86de47b	Utilize pmap_pte_quick() rather than pmap_pte() in pmap_protect(). The reason being that pmap_pte_quick() requires the page queues lock, which is already held, rather than Giant.	2004-07-18 21:19:10 +00:00
Maxim Konovalov	aa355a2679	In -CURRENT pseudo devices are not statically assigned at compile time, remove a stale comment. PR: kern/62285	2004-07-18 09:03:12 +00:00
Alan Cox	b73cfbb3e4	Remedy my omission of one change in the previous revision: pmap_remove() must pin the current thread in order to call pmap_pte_quick().	2004-07-17 23:44:59 +00:00
Alan Cox	c9829537f4	- Utilize pmap_pte_quick() rather than pmap_pte() in pmap_remove() and pmap_remove_page(). The reason being that pmap_pte_quick() requires the page queues lock, which is already held, rather than Giant. - Assert that the page queues lock is held in pmap_remove_page() and pmap_remove_pte().	2004-07-17 22:20:53 +00:00
Poul-Henning Kamp	672c05d49c	Preparation commit for the tty cleanups that will follow in the near future: rename ttyopen() -> tty_open() and ttyclose() -> tty_close(). We need the ttyopen() and ttyclose() for the new generic cdevsw functions for tty devices in order to have consistent naming.	2004-07-15 20:47:41 +00:00
Alan Cox	3d2e54c317	Push down the acquisition and release of the page queues lock into pmap_protect() and pmap_remove(). In general, they require the lock in order to modify a page's pv list or flags. In some cases, however, pmap_protect() can avoid acquiring the lock.	2004-07-15 18:00:43 +00:00
John Baldwin	fe96955252	Fix a typo in a comment.	2004-07-15 16:37:48 +00:00
Poul-Henning Kamp	3e019deaed	Do a pass over all modules in the kernel and make them return EOPNOTSUPP for unknown events. A number of modules return EINVAL in this instance, and I have left those alone for now and instead taught MOD_QUIESCE to accept this as "didn't do anything".	2004-07-15 08:26:07 +00:00
John Baldwin	0c9cb34441	Correct bounds check in lapic_create(). Submitted by: "Ted Unangst" tedu at coverity.com	2004-07-14 18:12:15 +00:00
Poul-Henning Kamp	ec712659ef	Desupport M-Systems DiskOnChip driver "fla"	2004-07-13 17:43:03 +00:00
Warner Losh	4b5239229c	oldcard's card device no longer requires a count	2004-07-13 16:11:34 +00:00
David Xu	53dbf30349	Add ptrace_clear_single_step(); alpha has had it for years. The function will be used by ptrace to clear a thread's single step state.	2004-07-13 07:22:56 +00:00
Alan Cox	ce8da3091f	Push down the acquisition and release of the page queues lock into pmap_remove_pages(). (The implementation of pmap_remove_pages() is optional. If pmap_remove_pages() is unimplemented, the acquisition and release of the page queues lock is unnecessary.) Remove spl calls from the alpha, arm, and ia64 pmap_remove_pages().	2004-07-13 02:49:22 +00:00
Marcel Moolenaar	45cfc0a914	Partially revert previous commit. Calling getit() unconditionally fixed a problem that could also be fixed differently without reverting previous attempts to fix DELAY while the debugger is active (rev 1.204). The bug was that the i8254 implements a countdown timer, while for (k)db_active a countup timer was implemented. This resulted in premature termination and consequently the breakage of DELAY. The fix (relative to rev 1.211) is to implement a countdown timer for the kdb_active case. As such the ability to step clock initialization is preserved and DELAY does what is expected of it. Blushed: bde :-) Submitted by: bde	2004-07-11 17:50:59 +00:00
Marcel Moolenaar	8bcb1e9e84	Add options KDB and GDB. KDB takes on the function of what DDB used to be. Both DDB and GDB specify which KDB backends to include.	2004-07-11 03:20:09 +00:00
Marcel Moolenaar	1ca618fcaa	Remove the now unused GDB stubs. See src/sys/gdb/* for the new KDB backend.	2004-07-11 01:47:26 +00:00
Marcel Moolenaar	37224cd3fc	Mega update for the KDB framework: turn DDB into a KDB backend. Most of the changes are a direct result of adding thread awareness. Typically, DDB_REGS is gone. All registers are taken from the trapframe and backtraces use the PCB based contexts. DDB_REGS was defined to be a trapframe on all platforms anyway. Thread awareness introduces the following new commands: thread X switch to thread X (where X is the TID), show threads list all threads. The backtrace code has been made more flexible so that one can create backtraces for any thread by giving the thread ID as an argument to trace. With this change, ia64 has support for breakpoints.	2004-07-10 23:47:20 +00:00

10187 Commits