freebsd-skq

Author	SHA1	Message	Date
Scott Long	444ba94513	Switch the default scheduler to 4BSD to match what will go into RELENG_5 soon. It can be switched back once 5.3 is tested and released. Also turn on PREEMPTION as many of the stability problems with it have been fixed. MT5: 3 days.	2004-09-07 22:37:43 +00:00
Julian Elischer	ed062c8d66	Refactor a bunch of scheduler code to give basically the same behaviour but with slightly cleaned up interfaces. The KSE structure has become the same as the "per thread scheduler private data" structure. In order to not make the diffs too great one is #defined as the other at this time. The KSE (or td_sched) structure is now allocated per thread and has no allocation code of its own. Concurrency for a KSEGRP is now kept track of via a simple pair of counters rather than using KSE structures as tokens. Since the KSE structure is different in each scheduler, kern_switch.c is now included at the end of each scheduler. Nothing outside the scheduler knows the contents of the KSE (aka td_sched) structure. The fields in the ksegrp structure that are to do with the scheduler's queueing mechanisms are now moved to the kg_sched structure. (per ksegrp scheduler private data structure). In other words how the scheduler queues and keeps track of threads is no-one's business except the scheduler's. This should allow people to write experimental schedulers with completely different internal structuring. A scheduler call sched_set_concurrency(kg, N) has been added that notifies teh scheduler that no more than N threads from that ksegrp should be allowed to be on concurrently scheduled. This is also used to enforce 'fainess' at this time so that a ksegrp with 10000 threads can not swamp a the run queue and force out a process with 1 thread, since the current code will not set the concurrency above NCPU, and both schedulers will not allow more than that many onto the system run queue at a time. Each scheduler should eventualy develop their own methods to do this now that they are effectively separated. Rejig libthr's kernel interface to follow the same code paths as linkse for scope system threads. This has slightly hurt libthr's performance but I will work to recover as much of it as I can. Thread exit code has been cleaned up greatly. exit and exec code now transitions a process back to 'standard non-threaded mode' before taking the next step. Reviewed by: scottl, peter MFC after: 1 week	2004-09-05 02:09:54 +00:00
Julian Elischer	5995adc206	Remove an unneeded argument.. The removed argument could trivially be derived from the remaining one. That in turn should be the same as curthread, but it is possible that curthread could be expensive to derive on some syste,s so leave it as an argument. Having both proc and thread as an argumen tjust gives an opportunity for them to get out sync. MFC after: 3 days	2004-08-31 07:34:54 +00:00
Julian Elischer	99e9dcb817	Remove sched_free_thread() which was only used in diagnostics. It has outlived its usefulness and has started causing panics for people who turn on DIAGNOSTIC, in what is otherwise good code. MFC after: 2 days	2004-08-31 06:12:13 +00:00
Marcel Moolenaar	0f2fe153bc	Move the kernel-specific logic to adjust frompc from MI to MD. For these two reasons: 1. On ia64 a function pointer does not hold the address of the first instruction of a functions implementation. It holds the address of a function descriptor. Hence the user(), btrap(), eintr() and bintr() prototypes are wrong for getting the actual code address. 2. The logic forces interrupt, trap and exception entry points to be layed-out contiguously. This can not be achieved on ia64 and is generally just bad programming. The MCOUNT_FROMPC_USER macro is used to set the frompc argument to some kernel address which represents any frompc that falls outside the kernel text range. The macro can expand to ~0U to bail out in that case. The MCOUNT_FROMPC_INTR macro is used to set the frompc argument to some kernel address to represent a call to a trap or interrupt handler. This to avoid that the trap or interrupt handler appear to be called from everywhere in the call graph. The macro can expand to ~0U to prevent adjusting frompc. Note that the argument is selfpc, not frompc. This commit defines the macros on all architectures equivalently to the original code in sys/libkern/mcount.c. People can take it from here... Compile-tested on: alpha, amd64, i386, ia64 and sparc64 Boot-tested on: i386	2004-08-27 19:42:35 +00:00
Andre Oppermann	c21fd23260	Always compile PFIL_HOOKS into the kernel and remove the associated kernel compile option. All FreeBSD packet filters now use the PFIL_HOOKS API and thus it becomes a standard part of the network stack. If no hooks are connected the entire packet filter hooks section and related activities are jumped over. This removes any performance impact if no hooks are active. Both OpenBSD and DragonFlyBSD have integrated PFIL_HOOKS permanently as well.	2004-08-27 15:16:24 +00:00
Alan Cox	0b6a0b955a	Properly free the temporary sf_buf in uiomove_fromphys() if a copyin or copyout fails. Obtained from: DragonFlyBSD	2004-08-21 18:50:34 +00:00
Marius Strobl	34c8a85fc8	- Sync whitespace and comments with i386 GENERIC. - Add some commented out NICs from i386 GENERIC. Most of them look like they would work but I'm not sure if they are endian-clean and can't test. There was a report that sk(4) works on sparc64 but it doesn't look like it would because it doesn't use busdma. - Improve some of the descriptions of sparc64 specific devices. There's no functional change, i.e. no added or deleted uncommented devices or options, in this commit.	2004-08-16 23:23:21 +00:00
Marius Strobl	df774a6b16	Try to catch up with reality: - Chase the split of pcm(4). This unbreaks LINT compiles. - sc(4) basically works and a lot of its options should be supported. - Add the creator and ofw_console drivers. - vinum(4) should work, at least its module was turned on for sparc64 a while ago. - Don't build sio(4). Its EBus front-end was removed a while ago and the ISA one hardly works. Use uart(4) instead, it's not perfect yet but works much better.	2004-08-16 23:21:06 +00:00
Marius Strobl	39513fa664	Instead of "OpenFirmware", "openfirmware", etc. use the official spelling "Open Firmware" from IEEE 1275 and OpenFirmware.org (no pun intended). Ok'ed by: tmm	2004-08-16 15:45:27 +00:00
Marius Strobl	775f3c3c28	Correct some uses of the wrong members of the min()/max()-familiy, e.g. min() on unsigned long. None of these are believed to have been fatal though. Reviewed by: tmm	2004-08-15 21:37:52 +00:00
Marius Strobl	b7b6c9e6fa	- Make OF_getetheraddr() honour the "local-mac-address?" system config variable. If set to "true" OF_getetheraddr() will now return the unique MAC address stored in the "local-mac-address" property of the device's OFW node if present and the host address/system default MAC address if the node doesn't doesn't have such a property. If set to "false" the host address will be returned for all devices like before this change. This brings the behaviour of device drivers for NICs with OFW support/ FCode, i.e. dc(4) for on-board DM9102A on Sun machines, gem(4) and hme(4), regarding "local-mac-address?" in line with NetBSD and Solaris. The man pages of the respective drivers will be updated separately to reflect this change. - Remove OF_getetheraddr2() which was used as a stopgap in dc(4). Its functionality is now part of OF_getetheraddr().	2004-08-14 21:43:37 +00:00
Alan Cox	edf181d07e	Add pmap locking to pmap_remove_all().	2004-08-13 18:54:21 +00:00
Marius Strobl	26280d88d7	- Introduce an ofw_bus kobj-interface for retrieving the OFW node and a subset ("compatible", "device_type", "model" and "name") of the standard properties in drivers for devices on Open Firmware supported busses. The standard properties "reg", "interrupts" und "address" are not covered by this interface because they are only of interest in the respective bridge code. There's a remaining standard property "status" which is unclear how to support properly but which also isn't used in FreeBSD at present. This ofw_bus kobj-interface allows to replace the various (ebus_get_node(), ofw_pci_get_node(), etc.) and partially inconsistent (central_get_type() vs. sbus_get_device_type(), etc.) existing IVAR ones with a common one. This in turn allows to simplify and remove code-duplication in drivers for devices that can hang off of more than one OFW supported bus. - Convert the sparc64 Central, EBus, FHC, PCI and SBus bus drivers and the drivers for their children to use the ofw_bus kobj-interface. The IVAR- interfaces of the Central, EBus and FHC are entirely replaced by this. The PCI bus driver used its own kobj-interface and now also uses the ofw_bus one. The IVARs special to the SBus, e.g. for retrieving the burst size, remain. Beware: this causes an ABI-breakage for modules of drivers which used the IVAR-interfaces, i.e. esp(4), hme(4), isp(4) and uart(4), which need to be recompiled. The style-inconsistencies introduced in some of the bus drivers will be fixed by tmm@ in a generic clean-up of the respective drivers later (he requested to add the changes in the "new" style). - Convert the powerpc MacIO bus driver and the drivers for its children to use the ofw_bus kobj-interface. This invloves removing the IVARs related to the "reg" property which were unused and a leftover from the NetBSD origini of the code. There's no ABI-breakage caused by this because none of these driver are currently built as modules. There are other powerpc bus drivers which can be converted to the ofw_bus kobj-interface, e.g. the PCI bus driver, which should be done together with converting powerpc to use the OFW PCI code from sparc64. - Make the SBus and FHC front-end of zs(4) and the sparc64 eeprom(4) take advantage of the ofw_bus kobj-interface and simplify them a bit. Reviewed by: grehan, tmm Approved by: re (scottl) Discussed with: tmm Tested with: Sun AX1105, AXe, Ultra 2, Ultra 60; PPC cross-build on i386	2004-08-12 17:41:33 +00:00
Marcel Moolenaar	4da47b2fec	Add __elfN(dump_thread). This function is called from __elfN(coredump) to allow dumping per-thread machine specific notes. On ia64 we use this function to flush the dirty registers onto the backingstore before we write out the PRSTATUS notes. Tested on: alpha, amd64, i386, ia64 & sparc64 Not tested on: arm, powerpc	2004-08-11 02:35:06 +00:00
Alan Cox	2c0680e659	Add pmap locking to many of the functions. Implement the protection check required by the pmap_extract_and_hold() specification. Remove the acquisition and release of Giant from pmap_extract_and_hold() and pmap_protect(). Many thanks to Ken Smith for resolving a sparc64-specific initialization problem in my original patch. Tested by: kensmith@	2004-08-10 20:53:26 +00:00
Alan Cox	684a62b7bf	- Push down the acquisition and release of Giant into pmap_enter_quick() on those architectures without pmap locking. - Eliminate the acquisition and release of Giant in vm_map_pmap_enter().	2004-08-04 22:03:16 +00:00
Mark Murray	d23a262fc5	Making a loadable null.ko for /dev/(null\|zero) proved rather unpopular, so remove this (mis)feature. Encouragement provided by: jhb (and others)	2004-08-03 19:24:54 +00:00
Maxime Henrion	9f1b87f106	Instead of calling ia32_pause() conditionally on __i386__ or __amd64__ being defined, define and use a new MD macro, cpu_spinwait(). It only expands to something on i386 and amd64, so the compiled code should be identical. Name of the macro found by: jhb Reviewed by: jhb	2004-08-03 18:44:27 +00:00
Mark Murray	8ab2f5ecc5	Break out the MI part of the /dev/[k]mem and /dev/io drivers into their own directory and module, leaving the MD parts in the MD area (the MD parts _are_ part of the modules). /dev/mem and /dev/io are now loadable modules, thus taking us one step further towards a kernel created entirely out of modules. Of course, there is nothing preventing the kernel from having these statically compiled.	2004-08-01 11:40:54 +00:00
Alan Cox	9bb0e06861	- Push down the acquisition and release of Giant into pmap_protect() on those architectures without pmap locking. - Eliminate the acquisition and release of Giant from vm_map_protect(). (Translation: mprotect(2) runs to completion without touching Giant on alpha, amd64, i386 and ia64.)	2004-07-30 20:38:30 +00:00
Robert Watson	1a8cfbc450	Pass a thread argument into cpu_critical_{enter,exit}() rather than dereference curthread. It is called only from critical_{enter,exit}(), which already dereferences curthread. This doesn't seem to affect SMP performance in my benchmarks, but improves MySQL transaction throughput by about 1% on UP on my Xeon. Head nodding: jhb, bmilekic	2004-07-27 16:41:01 +00:00
Alan Cox	bfdf81ac47	Use kmem_alloc_nofault() rather than kmem_alloc_pageable() for allocating KVA for explicitly managed mappings, i.e., mappings created with pmap_qenter().	2004-07-23 06:49:49 +00:00
Marcel Moolenaar	fd32d93b97	Unify db_stack_trace_cmd(). All it did was look up the thread given the thread ID and call db_trace_thread(). Since arm has all the logic in db_stack_trace_cmd(), rename the new DB_COMMAND function to db_stack_trace to avoid conflicts on arm. While here, have db_stack_trace parse its own arguments so that we can use a more natural radix for IDs. If the ID is not a thread ID, or more precisely when no thread exists with the ID, try if there's a process with that ID and return the first thread in it. This makes it easier to print stack traces from the ps output. requested by: rwatson@ tested on: amd64, i386, ia64	2004-07-21 05:07:09 +00:00
Maxim Konovalov	aa355a2679	In -CURRENT pseudo devices are not statically assigned at compile time, remove a stale comment. PR: kern/62285	2004-07-18 09:03:12 +00:00
Alan Cox	3d2e54c317	Push down the acquisition and release of the page queues lock into pmap_protect() and pmap_remove(). In general, they require the lock in order to modify a page's pv list or flags. In some cases, however, pmap_protect() can avoid acquiring the lock.	2004-07-15 18:00:43 +00:00
David Xu	53dbf30349	Add ptrace_clear_single_step(), alpha already has it for years, the function will be used by ptrace to clear a thread's single step state.	2004-07-13 07:22:56 +00:00
Marcel Moolenaar	e8d5eed9c1	The SC_DISABLE_DDBKEY options has been renamed to SC_DISABLE_KDBKEY.	2004-07-11 03:21:24 +00:00
Marcel Moolenaar	8bcb1e9e84	Add options KDB and GDB. KDB takes on the function of what DDB used to be. Both DDB and GDB specify which KDB backends to include.	2004-07-11 03:20:09 +00:00
Marcel Moolenaar	37224cd3fc	Mega update for the KDB framework: turn DDB into a KDB backend. Most of the changes are a direct result of adding thread awareness. Typically, DDB_REGS is gone. All registers are taken from the trapframe and backtraces use the PCB based contexts. DDB_REGS was defined to be a trapframe on all platforms anyway. Thread awareness introduces the following new commands: thread X switch to thread X (where X is the TID), show threads list all threads. The backtrace code has been made more flexible so that one can create backtraces for any thread by giving the thread ID as an argument to trace. With this change, ia64 has support for breakpoints.	2004-07-10 23:47:20 +00:00
Marcel Moolenaar	a5a3d76272	Update for the KDB framework: o Make debugging code conditional upon KDB instead of DDB. o Call kdb_enter() instead of Debugger(). o Remove implementation of Debugger(). o Check kdb_active instead of db_active. o Call kdb_trap() according to the new world order.	2004-07-10 23:10:07 +00:00
Marcel Moolenaar	eda20064f9	Update for the KDB framework: o Call kdb_enter() instead of Debugger().	2004-07-10 23:06:41 +00:00
Marcel Moolenaar	2aaf890c2f	Remove obsolete prototype of kdb_trap().	2004-07-10 23:05:38 +00:00
Marcel Moolenaar	5a39cbaf69	Implement makectx(). The makectx() function is used by KDB to create a PCB from a trapframe for purposes of unwinding the stack. The PCB is used as the thread context and all but the thread that entered the debugger has a valid PCB. This function can also be used to create a context for the threads running on the CPUs that have been stopped when the debugger got entered. This however is not done at the time of this commit.	2004-07-10 19:56:00 +00:00
Marcel Moolenaar	cbc174356c	Introduce the KDB debugger frontend. The frontend provides a framework in which multiple (presumably different) debugger backends can be configured and which provides basic services to those backends. Besides providing services to backends, it also serves as the single point of contact for any and all code that wants to make use of the debugger functions, such as entering the debugger or handling of the alternate break sequence. For this purpose, the frontend has been made non-optional. All debugger requests are forwarded or handed over to the current backend, if applicable. Selection of the current backend is done by the debug.kdb.current sysctl. A list of configured backends can be obtained with the debug.kdb.available sysctl. One can enter the debugger by writing to the debug.kdb.enter sysctl.	2004-07-10 18:40:12 +00:00
Marcel Moolenaar	72d44f31a6	Introduce the GDB debugger backend for the new KDB framework. The backend improves over the old GDB support in the following ways: o Unified implementation with minimal MD code. o A simple interface for devices to register themselves as debug ports, ala consoles. o Compression by using run-length encoding. o Implements GDB threading support.	2004-07-10 17:47:22 +00:00
Marius Strobl	64e6e863ac	- Add missing <sys/module.h>. [1] - Remove unused includes. - Sort includes. Reported by: Pyun YongHyeon <yongari@kt-is.co.kr> [1]	2004-07-09 23:12:22 +00:00
Warner Losh	48375ebec9	These don't need RMAN_RESOURCE_VISIBLE now that rman is visible	2004-07-03 20:56:16 +00:00
Warner Losh	c82338ca27	Really remove __RMAN_RESORUCE_VISIBLE	2004-07-03 20:49:00 +00:00
Warner Losh	8acf75a06d	Use the rman_* functions in preference to reaching into struct resource. Remove __RMAN_RESOURCE_VISIBLE after compilation confirms it is now not needed.	2004-07-03 20:48:01 +00:00
John Baldwin	0c0b25ae91	Implement preemption of kernel threads natively in the scheduler rather than as one-off hacks in various other parts of the kernel: - Add a function maybe_preempt() that is called from sched_add() to determine if a thread about to be added to a run queue should be preempted to directly. If it is not safe to preempt or if the new thread does not have a high enough priority, then the function returns false and sched_add() adds the thread to the run queue. If the thread should be preempted to but the current thread is in a nested critical section, then the flag TDF_OWEPREEMPT is set and the thread is added to the run queue. Otherwise, mi_switch() is called immediately and the thread is never added to the run queue since it is switch to directly. When exiting an outermost critical section, if TDF_OWEPREEMPT is set, then clear it and call mi_switch() to perform the deferred preemption. - Remove explicit preemption from ithread_schedule() as calling setrunqueue() now does all the correct work. This also removes the do_switch argument from ithread_schedule(). - Do not use the manual preemption code in mtx_unlock if the architecture supports native preemption. - Don't call mi_switch() in a loop during shutdown to give ithreads a chance to run if the architecture supports native preemption since the ithreads will just preempt DELAY(). - Don't call mi_switch() from the page zeroing idle thread for architectures that support native preemption as it is unnecessary. - Native preemption is enabled on the same archs that supported ithread preemption, namely alpha, i386, and amd64. This change should largely be a NOP for the default case as committed except that we will do fewer context switches in a few cases and will avoid the run queues completely when preempting. Approved by: scottl (with his re@ hat)	2004-07-02 20:21:44 +00:00
Marius Strobl	4eae91a8f7	These need __RMAN_RESOURCE_VISIBLE, too.	2004-06-30 23:21:07 +00:00
Scott Long	770fffe05b	Retire BUS_DMAMAP_NSEGS for sparc64	2004-06-28 04:04:43 +00:00
Scott Long	8e0bfc6b32	Switch sparc64 busdma to use a dynamically allocated segment list rather than a a stack-limited list. This removes the artifical limit on s/g list size. cvs: ----------------------------------------------------------------------	2004-06-28 03:49:13 +00:00
David E. O'Brien	a82b25f9b2	Better OFW console support on Sun Ultra2 machines. Ultra2 users may want to set OFWCONS_POLL_HZ to a value of '20'. I have left default value at '4' as higher values can consume a more than is acceptable amount of CPU, and we don't have a consensus yet what is an optimal value. Submitted by: Pyun YongHyeon <yongari@kt-is.co.kr>	2004-06-24 02:57:11 +00:00
Bruce Evans	4c5f10a672	Backed out previous commit. Blind substitution of dev_t by `struct cdev *' was just wrong here because the dev_t's are user dev_t's.	2004-06-20 03:52:50 +00:00
Poul-Henning Kamp	89c9c53da0	Do the dreaded s/dev_t/struct cdev */ Bump __FreeBSD_version accordingly.	2004-06-16 09:47:26 +00:00
Scott Long	c08701fd03	Add esp to the sparc64 GENERIC	2004-06-10 05:24:34 +00:00
Scott Long	c31d0cf77b	Port the NetBSD esp(4) driver. This only includes the sbus front-end, so its primary use is for the FEPS/FAS366 SCSI found in Sun Ultra 1e and 2 machines. Once the pci front-end is ported, this driver can replace the amd(4) driver. The code as-is is fairly stable. I've disabled tagged-queueing until I can figure out a corruption bug related to it. I'm importing it now so that people with these machines can (finally) stop netbooting and report bugs before 5.3.	2004-06-10 05:11:39 +00:00
Poul-Henning Kamp	9a6dc4b647	Remove filename+line number from panic messages.	2004-06-06 21:26:49 +00:00
Poul-Henning Kamp	6b2f1cf005	Add missing <sys/module.h> #includes	2004-06-04 11:52:25 +00:00
Tim J. Robbins	cc05397ffc	Remove checks for curthread == NULL - it can't happen.	2004-06-03 10:22:47 +00:00
Poul-Henning Kamp	fd360128ff	Add missing <sys/module.h> instances which were shadowed by the nested include in <sys/kernel.h>	2004-06-03 05:58:30 +00:00
Tim J. Robbins	fa2a4d0595	Move TDF_DEADLKTREAT into td_pflags (and rename it accordingly) to avoid having to acquire sched_lock when manipulating it in lockmgr(), uiomove(), and uiomove_fromphys(). Reviewed by: jhb	2004-06-03 01:47:37 +00:00
Bosko Milekic	099a0e588c	Bring in mbuma to replace mballoc. mbuma is an Mbuf & Cluster allocator built on top of a number of extensions to the UMA framework, all included herein. Extensions to UMA worth noting: - Better layering between slab <-> zone caches; introduce Keg structure which splits off slab cache away from the zone structure and allows multiple zones to be stacked on top of a single Keg (single type of slab cache); perhaps we should look into defining a subset API on top of the Keg for special use by malloc(9), for example. - UMA_ZONE_REFCNT zones can now be added, and reference counters automagically allocated for them within the end of the associated slab structures. uma_find_refcnt() does a kextract to fetch the slab struct reference from the underlying page, and lookup the corresponding refcnt. mbuma things worth noting: - integrates mbuf & cluster allocations with extended UMA and provides caches for commonly-allocated items; defines several zones (two primary, one secondary) and two kegs. - change up certain code paths that always used to do: m_get() + m_clget() to instead just use m_getcl() and try to take advantage of the newly defined secondary Packet zone. - netstat(1) and systat(1) quickly hacked up to do basic stat reporting but additional stats work needs to be done once some other details within UMA have been taken care of and it becomes clearer to how stats will work within the modified framework. From the user perspective, one implication is that the NMBCLUSTERS compile-time option is no longer used. The maximum number of clusters is still capped off according to maxusers, but it can be made unlimited by setting the kern.ipc.nmbclusters boot-time tunable to zero. Work should be done to write an appropriate sysctl handler allowing dynamic tuning of kern.ipc.nmbclusters at runtime. Additional things worth noting/known issues (READ): - One report of 'ips' (ServeRAID) driver acting really slow in conjunction with mbuma. Need more data. Latest report is that ips is equally sucking with and without mbuma. - Giant leak in NFS code sometimes occurs, can't reproduce but currently analyzing; brueffer is able to reproduce but THIS IS NOT an mbuma-specific problem and currently occurs even WITHOUT mbuma. - Issues in network locking: there is at least one code path in the rip code where one or more locks are acquired and we end up in m_prepend() with M_WAITOK, which causes WITNESS to whine from within UMA. Current temporary solution: force all UMA allocations to be M_NOWAIT from within UMA for now to avoid deadlocks unless WITNESS is defined and we can determine with certainty that we're not holding any locks when we're M_WAITOK. - I've seen at least one weird socketbuffer empty-but- mbuf-still-attached panic. I don't believe this to be related to mbuma but please keep your eyes open, turn on debugging, and capture crash dumps. This change removes more code than it adds. A paper is available detailing the change and considering various performance issues, it was presented at BSDCan2004: http://www.unixdaemons.com/~bmilekic/netbuf_bmilekic.pdf Please read the paper for Future Work and implementation details, as well as credits. Testing and Debugging: rwatson, brueffer, Ketrien I. Saihr-Kesenchedra, ... Reviewed by: Lots of people (for different parts)	2004-05-31 21:46:06 +00:00
Thomas Moestl	65e29c4822	Retire cpu_sched_exit(); it is not used any more.	2004-05-26 12:09:39 +00:00
Thomas Moestl	3e519a2cf4	Move the per-CPU vmspace pointer fixup that is required before a struct vmspace is freed from cpu_sched_exit() to pmap_release(). This has the advantage of being able to rely on MI code to decide when a free should occur, instead of having to inspect the reference count ourselves. At the same time, turn the per-CPU vmspace pointer into a pmap pointer, so that pmap_release() can deal with pmaps exclusively. Reviewed (and embrassing bug spotted) by: jake	2004-05-26 12:06:52 +00:00
Marius Strobl	c9407be9ec	Use unsigned types for the arguments of the atomic(9) operations, like described in the man page and done on all other architectures. OK'ed by: tmm	2004-05-22 00:52:16 +00:00
Marius Strobl	980284e38f	Switch from BSD-style u_intXX_t to ISO C99 uintXX_t.	2004-05-22 00:47:26 +00:00
Thomas Moestl	048ec99ef2	In cpu_sched_exit(), we must check vm_refcnt against 0, not 1, since exit1() decrements the reference count before calling this function.	2004-05-20 18:41:07 +00:00
Bruce Evans	b2321e7cdb	Moved most of the "MI" definitions and declarations from <machine/profile.h> to <sys/gmon.h>. Cleaned them up a little by not attempting to ifdef for incomplete and out of date support for GUPROF in userland, as in the sparc64 version.	2004-05-19 15:41:26 +00:00
Stefan Farfeleder	b1aa0ba527	<stdint.h> should define WINT_M{AX,IN} independent from whether WCHAR_MIN is defined. Otherwise first including <wchar.h> and then <stdint.h> leads to no WINT_M{AX,IN} at all. PR: 64956 Approved by: das (mentor)	2004-05-18 16:04:57 +00:00
Peter Wemm	31f1cfb7e9	Oops, I left a duplicate 'relocbase' declaration. Submitted by: Koop Mast <kwm@rainbow-runner.nl>	2004-05-17 22:26:17 +00:00
Peter Wemm	e8855d4f97	Make a small revision to the api between the elf linker core and the elf_reloc() backends for two reasons. First, to support the possibility of there being two elf linkers in the kernel (eg: amd64), and second, to pass the relocbase explicitly (for relocating .o format kld files).	2004-05-16 20:00:28 +00:00
Maxime Henrion	d170b1ebab	Prefer explicit ints to implicit ints in the prototype as well as in the function definition.	2004-05-10 11:17:20 +00:00
Maxime Henrion	406e2f168e	- Fix a typo in a printf(). [1] - Fix some other style bugs while I'm here. Submitted by: Koop Mast <kwm@rainbow-runner.nl> [1] Fixes PR: sparc64/66448 [1]	2004-05-10 11:07:21 +00:00
Alan Cox	6b95d60a7f	Correct the implementation of pmap_page_is_mapped(): It should return TRUE only if the page has one or more managed mappings.	2004-05-09 19:09:14 +00:00
Alan Cox	ff8ea5e396	Since revision 1.280 of vm/vm_page.c, vm_page_grab() always returns a zeroed page when passed VM_ALLOC_ZERO. Thus, we can eliminate the check against PG_ZERO from pmap_pinit().	2004-05-08 23:26:11 +00:00
Marius Strobl	66a79ac95d	- Remove the old sparc64 OFW PCI code (as opposed to the former "options OFW_NEWPCI"). This is a bit overdue, the new sparc64 OFW PCI code which is meant to replace the old one is in place for 10 months and enabled by default in GENERIC for 8 months. FreeBSD 5.2 and 5.2.1 also shipped with the new code enabled by default. - Some minor clean-up, e.g. remove functions that encapsulated the #ifdefs for OFW_NEWPCI, remove unused resp. no longer required includes, etc. Approved by: tmm, no objections on freebsd-sparc64	2004-05-08 13:53:47 +00:00
Marius Strobl	9f7cc56b4e	Fix bug introduced in revision 1.9; in nexus_probe_nomatch() get device name and type for printing info about the device that didn't probe from child, not parent. This fixes a panic on systems where not yet supported devices hang off of the nexus, e.g. on E450. Reported by: joerg	2004-05-04 22:31:16 +00:00
Marcel Moolenaar	d1d9feac7c	Add option GEOM_GPT. This brings the ability to have a large number of partitions on a single disk.	2004-05-02 20:40:19 +00:00
David E. O'Brien	4e744b5e7f	Spell Ethernet correctly.	2004-05-02 18:57:29 +00:00
Thomas Moestl	aa1eb20ba2	Some cleanups to the nexus code: - Remove second license, the first was not that different and should be fine. - Add nexus_attach(), and do not perform its task in nexus_probe() any more. - Remove nexus_write_ivar(), since it was quite pointless. - Remove superfluous devinfo members. - Clean up some comments, minor style issues and prototypes.	2004-04-30 19:50:51 +00:00
Thomas Moestl	34d32da2fb	Remove inclusion of opt_ebus.h.	2004-04-30 19:19:55 +00:00
Marius Strobl	e9e3f95d4a	Remove the separate kernel option for Sun APB PCI-PCI bridges and make support for them conditional on the pci(4) kernel option instead. OK'ed by: tmm	2004-04-30 15:00:41 +00:00
Marius Strobl	a836fc005b	Enable _KPOSIX_PRIORITY_SCHEDULING by default, it works fine on sparc64. OK'ed by: tmm	2004-04-30 14:16:39 +00:00
Marius Strobl	13ad2f11d8	Update the reference to the FreeBSD sparc64 mailing list, its name has changed a while back.	2004-04-30 14:04:21 +00:00
Thomas Moestl	e6597c747f	Fix the EBus driver to work with the new PCI code. Unlike other PCI bridges, the EBus bridge has resource ranges it claims exclusively to map its children into in its BARs. Hence, we need to allocate these completely and manage them for the children, instead of just passing allocations through to the PCI layer as we did before. While being there, split ebus_probe(), which did also contain code normally belonging into the attach method, into ebus_probe() and ebus_attach(), and perform some minor cleanups.	2004-04-28 13:06:46 +00:00
David Schultz	be3930682a	Hide FLT_EVAL_METHOD and DECIMAL_DIG in pre-C99 compilation environments. PR: 63935 Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at>	2004-04-25 02:36:29 +00:00
Thomas Moestl	2699b91e48	Some code cleanup: - Fix some comments; remove numerous superfluous or outdated ones. - Correctly pass on the requesting device when handing requests up to the parent bus. - Use the complete device name, including unit number, to build the IOMMU instance name. - Inline a function that was only used once, and was trivial.	2004-04-25 00:30:28 +00:00
Thomas Moestl	1822d00dc3	Prefix a printf with the device name.	2004-04-25 00:08:15 +00:00
Alan Cox	1f51408ade	Remove avail_end. It is not used.	2004-04-11 06:02:24 +00:00
Warner Losh	2fcbca0d85	Remove advertising clause from University of California Regent's license, per letter dated July 22, 1999 and email from Peter Wemm, Alan Cox and Robert Watson. Approved by: core, peter, alc, rwatson	2004-04-07 05:00:01 +00:00
Alan Cox	c8607538c8	Remove avail_start on those platforms that no longer use it. (Only amd64 does anything with it beyond simple initialization.)	2004-04-05 04:08:00 +00:00
Alan Cox	bdb93eb248	Remove unused arguments from pmap_init().	2004-04-05 00:37:50 +00:00
Thomas Moestl	6e7272f69d	- Use an ihandle_t to store the stdout instance handle instead of a phandle_t. Since both are typedefed to unsigned int, this is more or less cosmetic. - Fix the code that determines whether a creator instance was used for firmware output (and should not be blanked on initialization). Since r1.2 of dev/fb/creator.c, this consisted comparing a handle of an instance of a package with a handle of the package itself. Use the test from r1.1, which utilizes OF_instance_to_package(). Submitted by: Marius Strobl <marius@alchemy.franken.de>	2004-04-04 12:52:22 +00:00
Alan Cox	121230a40d	In some cases, sf_buf_alloc() should sleep with pri PCATCH; in others, it should not. Add a new parameter so that the caller can specify which is the case. Reported by: dillon	2004-04-03 09:16:27 +00:00
Ken Smith	aa89d858c9	MFi386: correctly calculate the top-of-stack when a kthread is created with a larger kernel stack. Remove inclusion of opt_kstack_pages.h now that it's unused. Reviewed by: marcel Approved by: rwatson (mentor)	2004-03-29 01:24:13 +00:00
Thomas Moestl	38c174739c	The compatibility specification property is named "compatible", not "compat". Spotted by: Marius Strobl <marius@alchemy.franken.de>	2004-03-27 22:39:47 +00:00
Thomas Moestl	62b7be1be9	Correct the termination condition of the DVMA pruning loop in iommu_dvma_vallocseg(), which I botched in r1.32. This bug could cause an endless loop when a map was loaded and DVMA was scarce, or that map had a stringent alignment or boundary. Report and additional testing: Marius Strobl <marius@alchemy.franken.de>	2004-03-23 23:20:34 +00:00
Thomas Moestl	252cc8f6b2	Intitialize the frame pointer and return pc of a new process created in cpu_fork(). This prevents the stack tracer from running past the end of the stack (only the pc is checked in that case), which became fatal when db_print_backtrace() was introduced and called outside of ddb. Additional testing: kris	2004-03-23 23:12:02 +00:00
Alan Cox	1c5043680e	Add an implementation of uiomove_fromphys() to sparc64. This implementation could be characterized as a hybrid of the amd64 and i386 implementations. Specifically, the direct virtual-to-physical mapping is used if possible and sf_buf_alloc() is used if the direct map cannot.	2004-03-22 08:08:25 +00:00
Marcel Moolenaar	a36bdc0606	Introduce the cpumask_t type. The purpose of the type is to create a level of abstraction for any and all CPU mask and CPU bitmap variables so that platforms have the ability to break free from the hard limit of 32 CPUs, simply because we don't have more bits in an u_int. Note that the type is not supposed to solve massive parallelism, where the number of CPUs can be larger than the width of the widest integral type. As such, cpumask_t is not supposed to be a compound type. If such would be necessary in the future, we can deal with the issues then and there. For now, it can be assumed that the type is integral and unsigned. With this commit, all MD definitions start off as u_int. This allows us to phase-in cpumask_t at our leasure without breaking anything. Once cpumask_t is used consistently, platforms can switch to wider (or smaller) types if such would be beneficial (or not; whatever :-) Compile-tested on: i386	2004-03-20 20:41:40 +00:00
John-Mark Gurney	4de27366d1	sync comment with i386's isa.c.. This removes a comment that is YEARS old...	2004-03-17 21:45:55 +00:00
Nate Lawson	5f96beb9e0	Convert callers to the new bus_alloc_resource_any(9) API. Submitted by: Mark Santcroos <marks@ripe.net> Reviewed by: imp, dfr, bde	2004-03-17 17:50:55 +00:00
Alan Cox	90ecfebd82	Refactor the existing machine-dependent sf_buf_free() into a machine- dependent function by the same name and a machine-independent function, sf_buf_mext(). Aside from the virtue of making more of the code machine- independent, this change also makes the interface more logical. Before, sf_buf_free() did more than simply undo an sf_buf_alloc(); it also unwired and if necessary freed the page. That is now the purpose of sf_buf_mext(). Thus, sf_buf_alloc() and sf_buf_free() can now be used as a general-purpose emphemeral map cache.	2004-03-16 19:04:28 +00:00
Scott Long	11d905ecd8	Now that contigfree() does not require Giant, don't grab it in busdma.	2004-03-13 15:42:59 +00:00
Garance A Drosehn	ec9b318eaf	Change time_t from a 32-bit value to a 64-bit value, on FreeBSD/sparc64 only. This is a MAJOR incompatible change for the sparc64 platform, but will not effect FreeBSD on other architectures. Reviewed by: imp for UPDATING, freebsd-sparc for the change itself.	2004-03-10 17:39:05 +00:00
Alan Cox	fcffa790e9	Retire pmap_pinit2(). Alpha was the last platform that used it. However, ever since alpha/alpha/pmap.c revision 1.81 introduced the list allpmaps, there has been no reason for having this function on Alpha. Briefly, when pmap_growkernel() relied upon the list of all processes to find and update the various pmaps to reflect a growth in the kernel's valid address space, pmap_init2() served to avoid a race between pmap initialization and pmap_growkernel(). Specifically, pmap_pinit2() was responsible for initializing the kernel portions of the pmap and pmap_pinit2() was called after the process structure contained a pointer to the new pmap for use by pmap_growkernel(). Thus, an update to the kernel's address space might be applied to the new pmap unnecessarily, but an update would never be lost.	2004-03-07 21:06:48 +00:00
Lukas Ertl	1bcf24ee9d	Fix syntax errors and wrong function prototypes in several MD header files when using non-GNUC compilers. PR: kern/58515 Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at> Approved by: grog (mentor), obrien	2004-03-05 09:19:59 +00:00
Poul-Henning Kamp	dc08ffec87	Device megapatch 4/6: Introduce d_version field in struct cdevsw, this must always be initialized to D_VERSION. Flip sense of D_NOGIANT flag to D_NEEDGIANT, this involves removing four D_NOGIANT flags and adding 145 D_NEEDGIANT flags.	2004-02-21 21:10:55 +00:00
Ken Smith	8ecb006f39	- Fix for sparc64 to use new __panic() function Adapted from patch by: David Cornejo <dcornejo@firetide.com> Reviewed by: freebsd-sparc64 (harti) Approved by: rwatson (mentor)	2004-01-27 15:02:18 +00:00
Jeff Roberson	048ac395be	- Recruit some new ULE users by making it the default scheduler in GENERIC. ULE will be in a probationary period to determine whether it will be left as the default in 5.3 which would likely mean the rest of the 5.x series.	2004-01-24 21:38:52 +00:00
Jacques Vidrine	5864cda7c6	Add PFIL_HOOKS to the GENERIC kernel configuration, primarily so that one can load the IPFilter module (which requires PFIL_HOOKS). Requested by: Many, for over a year	2004-01-24 14:59:51 +00:00
Jacques Vidrine	e4dc8baa84	Provide sysarch(2) prototypes in the MD sysarch.h headers. While I'm at it, use the ANSI C generic pointer type for the second argument, thus matching the documentation. Remove the now extraneous (and now conflicting) function declarations in various libc sources. Remove now unnecessary casts. Reviewed by: bde	2004-01-09 16:52:09 +00:00
Maxime Henrion	ec6a729924	Some integrated Davicom cards in sparc64 boxes have an all zeros MAC address in the EEPROM, and we need to get it from OpenFirmware. This isn't very pretty but time is lacking to do this in a better way this near 5.2-RELEASE. This is a RELENG_5_2 candidate. Original version by: Marius Strobl <marius@alchemy.franken.de> Tested by: Pete Bentley <pete@sorted.org> Reviewed by: jake	2004-01-08 19:08:27 +00:00
David Xu	a30ec4b99c	Make sigaltstack as per-threaded, because per-process sigaltstack state is useless for threaded programs, multiple threads can not share same stack. The alternative signal stack is private for thread, no lock is needed, the orignal P_ALTSTACK is now moved into td_pflags and renamed to TDP_ALTSTACK. For single thread or Linux clone() based threaded program, there is no semantic changed, because those programs only have one kernel thread in every process. Reviewed by: deischen, dfr	2004-01-03 02:02:26 +00:00
Mike Silbersack	ddeb5b242e	Track three new sendfile-related statistics: - The number of times sendfile had to do disk I/O - The number of times sfbuf allocation failed - The number of times sfbuf allocation had to wait	2003-12-28 08:57:09 +00:00
Mike Silbersack	5caf2b00f0	Move the declaration of sfbufspeak and sfbufsused to mbuf.h, and use imax instead of max, as sfbufspeak and sfbufsused are signed. Submitted by: bde	2003-12-28 01:43:22 +00:00
Mike Silbersack	5eda9873e9	Track current and peak sfbuf usage, export the values via sysctl.	2003-12-27 07:52:47 +00:00
David E. O'Brien	8ae025dbda	Don't confuse NULL with 0.	2003-12-26 14:30:19 +00:00
David E. O'Brien	f01ac8a3dd	Don't confuse NULL with 0.	2003-12-26 14:22:26 +00:00
David E. O'Brien	bac2da2926	Don't confuse NULL with 0.	2003-12-26 14:04:19 +00:00
Peter Wemm	c15e347e22	GC the unused <machine/kse.h> file.	2003-12-24 00:51:30 +00:00
Peter Wemm	9b68618df0	Add an additional field to the elf brandinfo structure to support quicker exec-time replacement of the elf interpreter on an emulation environment where an entire /compat/* tree isn't really warranted.	2003-12-23 02:42:39 +00:00
John Baldwin	b6c71225a9	Fix all users of mp_maxid to use the same semantics, namely: 1) mp_maxid is a valid FreeBSD CPU ID in the range 0 .. MAXCPU - 1. 2) For all active CPUs in the system, PCPU_GET(cpuid) <= mp_maxid. Approved by: re (scottl) Tested on: i386, amd64, alpha	2003-12-03 14:57:26 +00:00
John Baldwin	798a45964d	- Split cpu_mp_probe() into two parts. cpu_mp_setmaxid() is still called very early (SI_SUB_TUNABLES - 1) and is responsible for setting mp_maxid. cpu_mp_probe() is now called at SI_SUB_CPU and determines if SMP is actually present and sets mp_ncpus and all_cpus. Splitting these up allows an architecture to probe CPUs later than SI_SUB_TUNABLES by just setting mp_maxid to MAXCPU in cpu_mp_setmaxid(). This could allow the CPU probing code to live in a module, for example, since modules sysinit's in modules cannot be invoked prior to SI_SUB_KLD. This is needed to re-enable the ACPI module on i386. - For the alpha SMP probing code, use LOCATE_PCS() instead of duplicating its contents in a few places. Also, add a smp_cpu_enabled() function to avoid duplicating some code. There is room for further code reduction later since much of this code is also present in cpu_mp_start(). - All archs besides i386 still set mp_maxid to the same values they set it to before this change. i386 now sets mp_maxid to MAXCPU. Tested on: alpha, amd64, i386, ia64, sparc64 Approved by: re (scottl)	2003-11-21 22:23:26 +00:00
Alan Cox	0ec3db3072	- Remove unnecessary synchronization from sf_buf_init(). (There is only one active CPU when sf_buf_init() is performed.)	2003-11-16 23:40:06 +00:00
Alan Cox	e45db9b837	- Modify alpha's sf_buf implementation to use the direct virtual-to- physical mapping. - Move the sf_buf API to its own header file; make struct sf_buf's definition machine dependent. In this commit, we remove an unnecessary field from struct sf_buf on the alpha, amd64, and ia64. Ultimately, we may eliminate struct sf_buf on those architecures except as an opaque pointer that references a vm page.	2003-11-16 06:11:26 +00:00
Jake Burkholder	ea6e3dd2b3	Fix more spammage with local changes. Didn't look at the diffs before committing. Pointy hat to: jake	2003-11-14 20:36:12 +00:00
Jake Burkholder	6f9e4d866d	Rev 1.61 accidentily added makeoptions NO_MODULES=yes. Funny that I add this to all my kernel configs. Submitted by: Hidetoshi Shimokawa <simokawa@sat.t.u-tokyo.ac.jp>	2003-11-14 05:31:50 +00:00
Hidetoshi Shimokawa	3055726d16	Respect RB_KDB flag.	2003-11-13 07:41:55 +00:00
Jake Burkholder	cabf3c5279	Remove reference to sio (superseded by uart).	2003-11-11 07:49:13 +00:00
Jake Burkholder	858866e77c	Add entries for creator, splash, uart and puc, commented out until they're made the default. These are needed to run X.	2003-11-11 07:48:02 +00:00
Jake Burkholder	9c91425b10	Remove references to atkbd, atkbdc, psm and vga.	2003-11-11 07:40:45 +00:00
Jake Burkholder	7b9346ced8	Assume that unit 0 is the graphics console initialized by syscons, instead of testing if the device's firmware node is stdout. This allows syscons to be used when the firmware's input and output is the serial console.	2003-11-11 06:47:00 +00:00
Jake Burkholder	0019917542	Fix a bug in the data access error recorvery. Before re-enabling the data cache after a data access error we must discard all cache lines. When disabled existing cache lines are not invalidated by stores to memory, so we risk reading stale data that was cached before the data access error if we don't flush them. This is especially fatal when the memory involved is the active part of the kernel or user stack. For good measure we also flush the instruction cache. This fixes random crashes when the X server probes the PCI bus through /dev/pci.	2003-11-11 06:41:54 +00:00
Jake Burkholder	fe6b8ecd5e	Rearrange slightly so that DELAY(9) works during cninit.	2003-11-11 06:08:10 +00:00
Marcel Moolenaar	fcaa2925a9	Change the clear_ret argument of get_mcontext() to be a flags argument. Since all callers either passed 0 or 1 for clear_ret, define bit 0 in the flags for use as clear_ret. Reserve bits 1, 2 and 3 for use by MI code for possible (but unlikely) future use. The remaining bits are for use by MD code. This change is triggered by a need on ia64 to have another knob for get_mcontext().	2003-11-09 20:31:04 +00:00
Peter Wemm	1b03813f65	Add a stub cpu_idle() function for sparc64, alpha, powerpc. This is a MI declared function so it should be everywhere.	2003-10-19 02:36:07 +00:00
Robert Drehmel	d65f2d0cff	Add an 'include' directive to pull in <sys/ptrace.h>.	2003-10-10 10:31:48 +00:00
Robert Drehmel	ea924c4cd3	Implement preliminary support for the PT_SYSCALL command to ptrace(2).	2003-10-09 10:17:16 +00:00
Bruce M Simpson	2bc7dd5661	Move pmap_resident_count() from the MD pmap.h to the MI pmap.h. Add a definition of pmap_wired_count(). Add a definition of vmspace_wired_count(). Reviewed by: truckman Discussed with: peter	2003-10-06 01:47:12 +00:00
Alan Cox	566526a957	Migrate pmap_prefault() into the machine-independent virtual memory layer. A small helper function pmap_is_prefaultable() is added. This function encapsulate the few lines of pmap_prefault() that actually vary from machine to machine. Note: pmap_is_prefaultable() and pmap_mincore() have much in common. Going forward, it's worth considering their merger.	2003-10-03 22:46:53 +00:00
Alan Cox	1b2d9f0653	Make PAGE_SIZE and related quantities signed on sparc64. (They are signed quantities on every other architecture.) This change is required in order to move pmap_prefault() out of the pmap and into the machine-independent layer.	2003-10-03 19:49:08 +00:00
Maxime Henrion	4e513b7a01	Allow the compiler to micro-optimize byte swapping functions by evaluating them at compile time rather than at run time. As for x86 and amd64, this requires GCC and it's enabled only if __OPTIMIZE__ is defined (ie, if at least -O is used). Reviewed by: jake	2003-09-30 22:35:27 +00:00
Alan Cox	79fa677d99	Add vm object locking to pmap_release().	2003-09-28 00:11:15 +00:00
Peter Wemm	c460ac3a00	Add sysentvec->sv_fixlimits() hook so that we can catch cases on 64 bit systems where the data/stack/etc limits are too big for a 32 bit process. Move the 5 or so identical instances of ELF_RTLD_ADDR() into imgact_elf.c. Supply an ia32_fixlimits function. Export the clip/default values to sysctl under the compat.ia32 heirarchy. Have mmap(0, ...) respect the current p->p_limits[RLIMIT_DATA].rlim_max value rather than the sysctl tweakable variable. This allows mmap to place mappings at sensible locations when limits have been reduced. Have the imgact_elf.c ld-elf.so.1 placement algorithm use the same method as mmap(0, ...) now does. Note that we cannot remove all references to the sysctl tweakable maxdsiz etc variables because /etc/login.conf specifies a datasize of 'unlimited'. And that causes exec etc to fail since it can no longer find space to mmap things.	2003-09-25 01:10:26 +00:00
Yoshihiro Takahashi	33e38a2cc8	Implement the bus_space_map() function to allocate resources and initialize a bus_handle, but currently it does only initializing a bus_handle.	2003-09-23 08:22:34 +00:00
Jake Burkholder	cfbe010793	Remove an invalid KASSERT. Apparently pmap_remove_all gets called on unmanaged pages.	2003-09-20 17:00:59 +00:00
Thomas Moestl	894b85b150	Handle ISA devices in OF_decode_addr(), with the same code that is used in the EBus case.	2003-09-12 20:04:29 +00:00
Alan Cox	b9850eb224	Add a new parameter to pmap_extract_and_hold() that is needed to eliminate Giant from vmapbuf(). Idea from: tegge	2003-09-12 07:07:49 +00:00
Alan Cox	ba2157f218	Introduce a new pmap function, pmap_extract_and_hold(). This function atomically extracts and holds the physical page that is associated with the given pmap and virtual address. Such a function is needed to make the memory mapping optimizations used by, for example, pipes and raw disk I/O MP-safe. Reviewed by: tegge	2003-09-08 02:45:03 +00:00
Bill Paul	a94100fa9b	Take the support for the 8139C+/8169/8169S/8110S chips out of the rl(4) driver and put it in a new re(4) driver. The re(4) driver shares the if_rlreg.h file with rl(4) but is a separate module. (Ultimately I may change this. For now, it's convenient.) rl(4) has been modified so that it will never attach to an 8139C+ chip, leaving it to re(4) instead. Only re(4) has the PCI IDs to match the 8169/8169S/8110S gigE chips. if_re.c contains the same basic code that was originally bolted onto if_rl.c, with the following updates: - Added support for jumbo frames. Currently, there seems to be a limit of approximately 6200 bytes for jumbo frames on transmit. (This was determined via experimentation.) The 8169S/8110S chips apparently are limited to 7.5K frames on transmit. This may require some more work, though the framework to handle jumbo frames on RX is in place: the re_rxeof() routine will gather up frames than span multiple 2K clusters into a single mbuf list. - Fixed bug in re_txeof(): if we reap some of the TX buffers, but there are still some pending, re-arm the timer before exiting re_txeof() so that another timeout interrupt will be generated, just in case re_start() doesn't do it for us. - Handle the 'link state changed' interrupt - Fix a detach bug. If re(4) is loaded as a module, and you do tcpdump -i re0, then you do 'kldunload if_re,' the system will panic after a few seconds. This happens because ether_ifdetach() ends up calling the BPF detach code, which notices the interface is in promiscuous mode and tries to switch promisc mode off while detaching the BPF listner. This ultimately results in a call to re_ioctl() (due to SIOCSIFFLAGS), which in turn calls re_init() to handle the IFF_PROMISC flag change. Unfortunately, calling re_init() here turns the chip back on and restarts the 1-second timeout loop that drives re_tick(). By the time the timeout fires, if_re.ko has been unloaded, which results in a call to invalid code and blows up the system. To fix this, I cleared the IFF_UP flag before calling ether_ifdetach(), which stops the ioctl routine from trying to reset the chip. - Modified comments in re_rxeof() relating to the difference in RX descriptor status bit layout between the 8139C+ and the gigE chips. The layout is different because the frame length field was expanded from 12 bits to 13, and they got rid of one of the status bits to make room. - Add diagnostic code (re_diag()) to test for the case where a user has installed a broken 32-bit 8169 PCI NIC in a 64-bit slot. Some NICs have the REQ64# and ACK64# lines connected even though the board is 32-bit only (in this case, they should be pulled high). This fools the chip into doing 64-bit DMA transfers even though there is no 64-bit data path. To detect this, re_diag() puts the chip into digital loopback mode and sets the receiver to promiscuous mode, then initiates a single 64-byte packet transmission. The frame is echoed back to the host, and if the frame contents are intact, we know DMA is working correctly, otherwise we complain loudly on the console and abort the device attach. (At the moment, I don't know of any way to work around the problem other than physically modifying the board, so until/unless I can think of a software workaround, this will have do to.) - Created re(4) man page - Modified rlphy.c to allow re(4) to attach as well as rl(4). Note that this code works for the sample 8169/Marvell 88E1000 NIC that I have, but probably won't work for the 8169S/8110S chips. RealTek has sent me some sample NICs, but they haven't arrived yet. I will probably need to add an rlgphy driver to handle the on-board PHY in the 8169S/8110S (it needs special DSP initialization).	2003-09-08 02:11:25 +00:00
Thomas Moestl	2cda2e47da	- Clear the CE AFSR bits which indicate the error condition when handling a correctable DMA error. Failing to do so can cause the error interrupt to be triggered over and over again. - Clean up the comments for UEAFSR_* constants, fix a typo (UEAFSR_BLK is (1 << 23), not (1 << 22)), and add two more. Also, add similar constants for the CE AFSR bits.	2003-09-04 15:25:10 +00:00
Marcel Moolenaar	2ec1dc3639	Add function OF_decode_addr(). This function obtains the physical address of the device identified by its phandle_t by traversing OFW's device tree. The space and address returned by this function can subsequently be passed to sparc64_fake_bustag() to construct a valid tag and handle for use by the newbus I/O functions. Use of this function is expected to be limited to pre-newbus access to devices, such as consoles and keyboards. Partially obtained from: tmm Reviewed by: jake, jmg, tmm SBus testing made possible by: jake Tested with: LINT	2003-09-02 20:32:12 +00:00
Marcel Moolenaar	11a91bffe5	Preparatory commit to allow prototypes in ofw_machdep.h to contain both newbus types and OFW types. This involves either including <machine/bus.h> or <dev/ofw/openfirm.h>. Reviewed by: jake, jmg, tmm	2003-09-02 20:24:42 +00:00
Alexander Kabaev	1d49585050	Standardize idempotentcy ifdefs. Consistently use _MACHINE_VARARGS_H_ symbol.	2003-09-01 03:01:45 +00:00
Jake Burkholder	93dee5536e	Implement cpu_set_upcall_kse. May need tweaking.	2003-08-31 22:58:56 +00:00
Alan Cox	411d10a600	Migrate the sf_buf allocator that is used by sendfile(2) and zero-copy sockets into machine-dependent files. The rationale for this migration is illustrated by the modified amd64 allocator. It uses the amd64's direct map to avoid emphemeral mappings in the kernel's address space. On an SMP, the emphemeral mappings result in an IPI for TLB shootdown for each transmitted page. Yuck. Maintainers of other 64-bit platforms with direct maps should be able to use the amd64 allocator as a reference implementation.	2003-08-29 20:04:10 +00:00
Marcel Moolenaar	3f51411974	Allow bus barrier operations on fake tags. The purpose of a fake bus tag is to allow bus space accesses prior to having newbus fully initialized, such as would be the case for console drivers. Since barriers are a fundamental part of bus space accesses, not allowing them on fake tags would defeat the purpose of these tags. We use the barrier function normally associated with nexus. This is the barrier used when subordinates haven't defined a barrier themselves.	2003-08-24 07:47:52 +00:00
John-Mark Gurney	3f6268f92a	reenable the caches when a PCI peek faults. Takes my kernel compile from 3770 real down to 1250 real. Submitted by: jake	2003-08-24 06:23:36 +00:00
Jake Burkholder	bf920dafa4	Add a driver for creator upa frame buffers found in many sparc64 machines. These are fixed resolution and operate only in pixel mode so they present a challenge to syscons (square peg, round hole, etc, etc). The driver provides a video driver interface for syscons and a separate character device for X to mmap. Wherever possible the creator's accelarated graphics functions are used so text mode is very fast. Based roughly on the openbsd driver.	2003-08-24 01:15:40 +00:00
Jake Burkholder	6cf3586b7e	"md" files for syscons.	2003-08-24 00:47:40 +00:00
Marcel Moolenaar	ac764ac32e	s#<mk48txx/mk48txxreg.h>#<dev/mk48txx/mk48txxreg.h>#	2003-08-23 05:56:58 +00:00
Warner Losh	3d11ce04c6	s=include <ofw/=include <dev/ofw/= to reflect removal of -I$S/dev	2003-08-23 00:11:16 +00:00
Warner Losh	d2c5276d96	Prefer new location of pci include files (which have only been in the tree for two or more years now), except in a few places where there's code to be compatible with older versions of FreeBSD.	2003-08-22 07:39:05 +00:00
Alan Cox	2094a9b6ce	Lock the pmap's tsb object when performing vm_page_grab() on it.	2003-08-20 06:11:39 +00:00
David E. O'Brien	05ca5d4af2	Enable OFW_NEWPCI until jmg's 2003/06/21 18:26:08 PDT bus commit is fixed that caused a 3-4 times slow down in performance. (the primary Sparc64 developers are all using OFW_NEWPCI already, so it is the best code path for users)	2003-08-19 21:57:29 +00:00
Gordon Tetlow	df3d69c217	Fixup the ELF branding information to point to the new home of rtld.	2003-08-17 08:08:38 +00:00
Marcel Moolenaar	710338e94f	In vm_thread_swap{in\|out}(), remove the alpha specific conditional compilation and replace it with a call to cpu_thread_swap{in\|out}(). This allows us to add similar code on ia64 without cluttering the code even more.	2003-08-16 23:15:15 +00:00
Marcel Moolenaar	26502503e5	Further cleanup <machine/cpu.h> and <machine/md_var.h>: move the MI prototypes of cpu_halt(), cpu_reset() and swi_vm() from md_var.h to cpu.h. This affects db_command.c and kern_shutdown.c. ia64: move all MD prototypes from cpu.h to md_var.h. This affects madt.c, interrupt.c and mp_machdep.c. Remove is_physical_memory(). It's not used (vm_machdep.c). alpha: the MD prototypes have been left in cpu.h with a comment that they should be there. Moving them is left for later. It was expected that the impact would be significant enough to be done in a seperate commit. powerpc: MD prototypes left in cpu.h. Comment added. Suggested by: bde Tested with: make universe (pc98 incomplete)	2003-08-16 16:57:57 +00:00
Warner Losh	06b4bf3e55	Expand inline the relevant parts of src/COPYRIGHT for Matt Dillon's copyrighted files. Approved by: Matt Dillon	2003-08-12 23:24:05 +00:00
Jake Burkholder	3014050b19	Fix sparc64 LINT build. <blush>	2003-08-11 07:05:55 +00:00
Jake Burkholder	2974095528	Use get_mcontext in sendsig and set_mcontext in sigreturn instead of frobbing things directly.	2003-08-09 23:14:33 +00:00
John Baldwin	3bdbd658f1	- Since td_critnest is now initialized in MI code, it doesn't have to be set in cpu_critical_fork_exit() anymore. - As far as I can tell, cpu_thread_link() has never been used, not even when it was originally added, so remove it.	2003-08-04 20:32:45 +00:00
Peter Wemm	ad7a226f9d	Deal with 'options KSTACK_PAGES' being a global option.	2003-07-31 01:31:32 +00:00
Thomas Moestl	416c84a212	Return 1 from pmap_protect_tte() instead of 0. When used with tsb_foreach(), 0 signals to terminate the tsb traversal, so when tsb_foreach() was used in pmap_protect() (which only happens when the area to be protected is larger than PMAP_TSB_THRESH = 16MB), only the first tsb entry in the specified range would be protected. Reported by: Andrew Belashov <bel@orel.ru>	2003-07-30 16:27:51 +00:00
Thomas Moestl	1cff8bd286	Respect BUS_DMA_ZERO in iommu_dvmamem_alloc().	2003-07-27 15:19:45 +00:00
Maxime Henrion	d5afecd068	- Introduce a new busdma flag BUS_DMA_ZERO to request for zero'ed memory in bus_dmamem_alloc(). This is possible now that contigmalloc() supports the M_ZERO flag. - Remove the locking of Giant around calls to contigmalloc() since contigmalloc() now grabs Giant itself.	2003-07-27 13:52:10 +00:00
Jake Burkholder	30c2333b1d	Avoid exposing declarations for kernel variables to userland. PR: 54528	2003-07-17 23:42:08 +00:00
John-Mark Gurney	f760fd6128	change CLASS depending upon __ELF_WORD_SIZE. This is necessary if someone wants to try to run 32bit binaries on sparc64.	2003-07-16 01:14:40 +00:00
John-Mark Gurney	e93581dc7a	add support for interrupt counting on sparc64. This copies part of the code from i386. The code has a slight bogon that interrupts are counted twice. Once on the ithread dispatch and once on the dispatch for the vector vmstat -i and systat -vm now contains interrupt counts. Reviewed by: jake	2003-07-16 00:08:43 +00:00
David Xu	20a2d71332	Rename thread_siginfo to cpu_thread_siginfo. Suggested by: jhb	2003-07-15 00:11:04 +00:00
Thomas Moestl	928a49644f	Lock down the IOMMU bus_dma implementation to make it safe to use without Giant held. A quick outline of the locking strategy: Since all IOMMUs are synchronized, there is a single lock, iommu_mtx, which protects the hardware registers (where needed) and the global and per-IOMMU software states. As soon as the IOMMUs are divorced, each struct iommu_state will have its own mutex (and the remaining global state will be moved into the struct). The dvma rman has its own internal mutex; the TSB slots may only be accessed by the owner of the corresponding resource, so neither needs extra protection. Since there is a second access path to maps via LRU queues, the consumer- provided locking is not sufficient; therefore, each map which is on a queue is additionally protected by iommu_mtx (in part, there is one member which only the map owner may access). Each map on a queue may be accessed and removed from or repositioned in a queue in any context as long as the lock is held; only the owner may insert a map. To reduce lock contention, some bus_dma functions remove the map from the queue temporarily (on behalf of the map owner) for some operations and reinsert it when they are done. Shorter operations and operations which are not done on behalf of the lock owner are completely covered by the lock. To facilitate the locking, reorganize the streaming buffer handling; while being there, fix an old oversight which would cause the streaming buffer to always be flushed, regardless of whether streaming was enabled in the TSB entry. The streaming buffer is still disabled for now, since there are a number of drivers which lack critical bus_dmamp_sync() calls. Additional testing by: jake	2003-07-10 23:27:35 +00:00
Maxime Henrion	fab712dcc6	Uncomment the dc(4) driver, it should work just fine now.	2003-07-09 15:04:27 +00:00
Alan Cox	6e3bf93111	MFi386 Updates to cnt.v_wire_count, the global count of wired pages, should be performed using atomic ops.	2003-07-06 20:32:42 +00:00
Alan Cox	1f78f902a8	Background: pmap_object_init_pt() premaps the pages of a object in order to avoid the overhead of later page faults. In general, it implements two cases: one for vnode-backed objects and one for device-backed objects. Only the device-backed case is really machine-dependent, belonging in the pmap. This commit moves the vnode-backed case into the (relatively) new function vm_map_pmap_enter(). On amd64 and i386, this commit only amounts to code rearrangement. On alpha and ia64, the new machine independent (MI) implementation of the vnode case is smaller and more efficient than their pmap-based implementations. (The MI implementation takes advantage of the fact that objects in -CURRENT are ordered collections of pages.) On sparc64, pmap_object_init_pt() hadn't (yet) been implemented.	2003-07-03 20:18:02 +00:00
Scott Long	f6b1c44d1f	Mega busdma API commit. Add two new arguments to bus_dma_tag_create(): lockfunc and lockfuncarg. Lockfunc allows a driver to provide a function for managing its locking semantics while using busdma. At the moment, this is used for the asynchronous busdma_swi and callback mechanism. Two lockfunc implementations are provided: busdma_lock_mutex() performs standard mutex operations on the mutex that is specified from lockfuncarg. dftl_lock() is a panic implementation and is defaulted to when NULL, NULL are passed to bus_dma_tag_create(). The only time that NULL, NULL should ever be used is when the driver ensures that bus_dmamap_load() will not be deferred. Drivers that do not provide their own locking can pass busdma_lock_mutex,&Giant args in order to preserve the former behaviour. sparc64 and powerpc do not provide real busdma_swi functions, so this is largely a noop on those platforms. The busdma_swi on is64 is not properly locked yet, so warnings will be emitted on this platform when busdma callback deferrals happen. If anyone gets panics or warnings from dflt_lock() being called, please let me know right away. Reviewed by: tmm, gibbs	2003-07-01 15:52:06 +00:00
Thomas Moestl	cb33c884cd	Add a commented-out entry for OFW_NEWPCI to GENERIC and NOTES, along with a comment describing it's advantages and the implication of changing it. While being there, fix a typo in NOTES. The option is not enabled in NOTES for now since large portions of code are conditional on it being disabled, too.	2003-07-01 15:13:07 +00:00
Thomas Moestl	1d80cb1b37	Add the new sparc64 OFW PCI framework, conditional on options OFW_NEWPCI for now. It introduces a OFW PCI bus driver and a generic OFW PCI-PCI bridge driver. By utilizing these, the PCI handling is much more elegant now. The advantages of the new approach are: - Device enumeration should hopefully be more like on Solaris now, so unit numbers should match what's printed on the box more closely. - Real interrupt routing is implemented now, so cardbus bridges etc. have at least a chance to work. - The quirk tables are gone and have been replaced by (hopefully sufficient) heuristics. - Much cleaner code. There was also a report that previously bogus interrupt assignments are fixed now, which can be attributed to the new heuristics. A pitfall, and the reason why this is not the default yet, is that it changes device enumeration, as mentioned above, which can make it necessary to change the system configuration if more than one unit of a device type is present (on a system with two hme cars, for example, it is possible that hme0 becomes hme1 and vice versa after enabling the option). Systems with multiple disk controllers may need to be booted into single user (and require manual specification of the root file system on boot) to adjust the fstab. Nevertheless, I would like to encourage users to use this option, so that it can be made the default soon. In detail, the changes are: - Introduce an OFW PCI bus driver; it inherits most methods from the generic PCI bus driver, but uses the firmware for enumeration, performs additional initialization for devices and firmware-specific interrupt routing. It also implements an OFW-specific method to allow child devices to get their firmware nodes. - Introduce an OFW PCI-PCI bridge driver; again, it inherits most of the generic PCI-PCI bridge driver; it has it's own method for interrupt routing, as well as some sparc64-specific methods (one to get the node again, and one to adjust the bridge bus range, since we need to reenumerate all PCI buses). - Convert the apb driver to the new way of handling things. - Provide a common framework for OFW bridge drivers, used be the two drivers above. - Provide a small common framework for interrupt routing (for all bridge types). - Convert the psycho driver to the new framework; this gets rid of a bunch of old kludges in pci_read_config(), and the whole preinitialization (ofw_pci_init()). - Convert the ISA MD part and the EBus driver to the new way interrupts and nodes are handled. - Introduce types for firmware interrupt properties. - Rename the old sparcbus_if to ofw_pci_if by repo copy (it is only required for PCI), and move it to a more correct location (new support methodsx were also added, and an old one was deprecated). - Fix a bunch of minor bugs, perform some cleanups. In some cases, I introduced some minor code duplication to keep the new code clean, in hopes that the old code will be unifdef'ed soon. Reviewed in part by: imp Tested by: jake, Marius Strobl <marius@alchemy.franken.de>, Sergey Mokryshev <mokr@mokr.net>, Chris Jackman <cjackNOSPAM@klatsch.org> Info on u30 firmware provided by: kris	2003-07-01 14:52:47 +00:00
Alan Cox	dca96f1adc	- Export pmap_enter_quick() to the MI VM. This will permit the implementation of a largely MI pmap_object_init_pt() for vnode-backed objects. pmap_enter_quick() is implemented via pmap_enter() on sparc64 and powerpc. - Correct a mismatch between pmap_object_init_pt()'s prototype and its various implementations. (I plan to keep pmap_object_init_pt() as the MD hook for device-backed objects on i386 and amd64.) - Correct an error in ia64's pmap_enter_quick() and adjust its interface to match the other versions. Discussed with: marcel	2003-06-29 21:20:04 +00:00
Thomas Moestl	d462b4f058	Small fixes for the IOMMU code: 1.) Handle maximum segment sizes which are smaller than the IOMMU page size by splitting up pages across multiple segments if needed; this case was previously unimplemented, and would cause panics. 2.) KASSERT that the physical address is in range; remove a KASSERT that has become pointless. 3.) Add a comment describing what remains to be fixed in the IOMMU code; I plan to address these issues soon. Desired by: dwhite (1)	2003-06-28 21:52:16 +00:00
David Xu	b8f480ab94	Add a machine depended function thread_siginfo, SA signal code will use the function to construct a siginfo structure and use the result to export to userland. Reviewed by: julian	2003-06-28 06:34:08 +00:00
John-Mark Gurney	090ef7b377	remove unnecessary comment. We do what the comments says we need to.	2003-06-24 21:37:49 +00:00
John-Mark Gurney	dffca5a624	add support for peeking at pci busses on UltraSparc systems. This prevents data access errors when trying to read/write to non-existant PCI devices. fix the psycho bridge to use peek for probing devices. This no longer fakes it if the OFW node doesn't exist (and the reg == 0). Reviewed by: jake, tmm	2003-06-22 01:26:08 +00:00
Jake Burkholder	d4c737a952	Avoid using v8 opcodes; use ba instead of b for unconditional branches.	2003-06-19 19:11:21 +00:00
Jake Burkholder	f96c24256c	- Rename the IPI_WAIT macro to IPI_DONE. - Don't require all receivers of ipis to wait for all other receivers, only that the sender wait for all receivers. This should reduce the amount of time spent with interrupts disabled, which may be a cause of ipi timeouts. Discussed with: tmm	2003-06-19 05:27:04 +00:00
Jake Burkholder	26f66ceae3	Ignore fake ttes in pmap_copy, its too hard to deal with them not having a real vm_page right now. This fixes a panic when processes with resident device mappings fork, such as the X server.	2003-06-18 17:03:04 +00:00
Thomas Moestl	6d3b2a3cad	Further cleanup of the sparc64 busdma implementation: - Move prototypes for sparc64-specific helper functions from bus.h to bus_private.h - Move the method pointers from struct bus_dma_tag into a separate structure; this saves some memory, and allows to use a single method table for each busdma backend, so that the bus drivers need no longer be changed if the methods tables need to be modified. - Remove the hierarchical tag method lookup. It was never really useful, since the layering is fixed, and the current implementations do not need to call into parent implementations anyway. Each tag inherits its method table pointer and cookie from the parent (or the root tag) now, and the method wrapper macros directly use the method table of the tag. - Add a method table to the non-IOMMU backend, remove unnecessary prototypes, remove the extra parent tag argument. - Rename sparc64_dmamem_alloc_map() and sparc64_dmamem_free_map() to sparc64_dma_alloc_map() and sparc64_dma_free_map(), move them to a better place and use them for all map allocations and deallocations. - Add a method table to the iommu backend, and staticize functions, remove the extra parent tag argument. - Change the psycho and sbus drivers to just set cookie and method table in the root tag. - Miscellaneous small fixes.	2003-06-18 16:41:36 +00:00
Alan Cox	40ebf3e43a	Fix a performance bug in all of the various implementations of uma_small_alloc(): They always zeroed the page regardless of what the caller requested.	2003-06-18 02:57:38 +00:00
Jake Burkholder	95343ec2e8	Handle recursion on the vm_page_queue_mtx manually in pmap_qenter and pmap_qremove, in order to avoid making the mutex recursable. Discussed with: alc	2003-06-17 23:22:35 +00:00
John-Mark Gurney	81cb12571a	free type too if we can't add the child.	2003-06-16 19:18:06 +00:00
John-Mark Gurney	ad0c7dea8c	fix misspelling of ORIR_NOTFOUND	2003-06-16 19:06:36 +00:00
Jake Burkholder	77b12dfe8f	The page queue lock is already held in pmap_remove, change acquire/release to assertion of ownership. Serves me right for not booting a witness kernel.	2003-06-15 21:06:49 +00:00
Jake Burkholder	86479a0840	- Mirror vm_page_queue_mtx assertions added to the i386 pmap. - Add vm page queue locking in certain places that are only needed on sparc64. This should make pmap_qenter and pmap_qremove MP-safe. Discussed with: alc	2003-06-15 19:54:50 +00:00
David Xu	0e2a4d3aeb	Rename P_THREADED to P_SA. P_SA means a process is using scheduler activations.	2003-06-15 00:31:24 +00:00
Alan Cox	49a2507bd1	Migrate the thread stack management functions from the machine-dependent to the machine-independent parts of the VM. At the same time, this introduces vm object locking for the non-i386 platforms. Two details: 1. KSTACK_GUARD has been removed in favor of KSTACK_GUARD_PAGES. The different machine-dependent implementations used various combinations of KSTACK_GUARD and KSTACK_GUARD_PAGES. To disable guard page, set KSTACK_GUARD_PAGES to 0. 2. Remove the (unnecessary) clearing of PG_ZERO in vm_thread_new. In 5.x, (but not 4.x,) PG_ZERO can only be set if VM_ALLOC_ZERO is passed to vm_page_alloc() or vm_page_grab().	2003-06-14 23:23:55 +00:00
Alan Cox	89f4fca265	Move the _new_altkstack() and _dispose_altkstack() functions out of the various pmap implementations into the machine-independent vm. They were all identical.	2003-06-14 06:20:25 +00:00
John-Mark Gurney	4966764cc1	Hardwire APB's PCI buses down. If we don't do this, pciconf -l returns selectors that are incorrect to use with pciconf -[rw] Fixes-PR: sparc64/50789 Ok's by: tmm	2003-06-13 17:44:03 +00:00

... 2 3 4 5 6 ...

1224 Commits