freebsd-skq

Author	SHA1	Message	Date
jkim	c44ef841d0	Fix style(9) and comments. Submitted by: Scot Hetzel (swhetzel at gmail dot com)	2007-04-18 20:12:05 +00:00
jkim	ded5de7de2	style(9) says sizeof's are not be followed by a space. Fix them.	2007-04-18 18:11:32 +00:00
jkim	691eeb86d4	Implement settimeofday() for Linuxulator/amd64. Submitted by: Scot Hetzel (swhetzel at gmail dot com)	2007-04-18 18:08:12 +00:00
jhb	ed9ca08793	Honor the BUS_DMA_NOCACHE flag to bus_dmamem_alloc() on amd64 and i386 by mapping the pages as UC (uncacheable) using pmap_change_attr(). MFC after: 1 week Requested by: ariff Reviewed by: scottl	2007-04-17 21:05:34 +00:00
alc	02ed5ecf42	Eliminate the misuse of PG_FRAME to truncate a virtual address to a virtual page boundary. Reviewed by: ru@	2007-04-13 16:07:29 +00:00
pjd	f4e110ebf2	Remove trailing '.' for consistency!	2007-04-10 21:40:13 +00:00
pjd	b159725895	Add UFS_GJOURNAL options to the GENERIC kernel. Approved by: re (kensmith)	2007-04-10 16:49:41 +00:00
jkim	b204c9cc13	MFP4: Turn emul_lock into a mutex. Submitted by: rdivacky	2007-04-02 18:38:13 +00:00
jkim	19c0211d4d	Correct BB-profiling and adjust comments. Pointed out by: bde Reviewed by: bde	2007-03-31 01:47:37 +00:00
jkim	cfce99a4a7	Fix off-by-4 error in address validation for i386, reduce PCB reloading, and fix more style(9) nits. Pointed out by: bde Discussed with: kib Reviewd by: bde	2007-03-30 23:19:08 +00:00
jkim	d64527e9e1	Fix more style(9) nits[1] and remove unnecessary use of '#if !defined(_KERNEL)'. Pointed out by: bde[1]	2007-03-30 19:33:53 +00:00
jkim	4f1754c08d	Use the same wisdom of sys/i386/i386/support.s 1.97 to remove obfuscation. Pointed out by: bde	2007-03-30 18:27:57 +00:00
jkim	0a5bf58f0f	MFP4: Fix style(9) nits and grammar in comments.	2007-03-30 17:27:13 +00:00
jkim	3704568725	MFP4: 114193, 114194 Dont "return" in linux_clone() after we forked the new process in a case of problems. Move the copyout of p2->p_pid outside the emul_lock coverage. Submitted by: Roman Divacky	2007-03-30 17:16:51 +00:00
jkim	19e6b257f6	MFP4: Linux futex support for amd64. Initial patch was submitted by kib and additional work was done by Divacky Roman. Tested by: emulation	2007-03-30 01:07:28 +00:00
jkim	cfcc6fe5fa	Regen for set_thread_area.	2007-03-30 00:08:21 +00:00
jkim	66aaf37941	MFP4: Linux set_thread_area syscall (aka TLS) support for amd64. Initial version was submitted by Divacky Roman and mostly rewritten by me. Tested by: emulation	2007-03-30 00:06:21 +00:00
julian	93fc8e768e	Implement the openat() linux syscall Submitted by: Roman Divacky (rdivacky@) MFC after: 2 weeks	2007-03-29 02:11:46 +00:00
kris	0e5e1ddffd	Remove unnecessary giant acquisition around panic in #ifdef DIAGNOSTIC code. # There is some question about whether this code is even relevant any # longer (it dates back to prehistoric times, i.e. present in r1.1), # especially on amd64. Reviewed by: jhb	2007-03-26 21:45:44 +00:00
njl	4933ca0aa0	Add an interface for drivers to be notified of changes to CPU frequency. cpufreq_pre_change is called before the change, giving each driver a chance to revoke the change. cpufreq_post_change provides the results of the change (success or failure). cpufreq_levels_changed gives the unit number of the cpufreq device whose number of available levels has changed. Hook in all the drivers I could find that needed it. * TSC: update TSC frequency value. When the available levels change, take the highest possible level and notify the timecounter set_cputicker() of that freq. This gets rid of the "calcru: runtime went backwards" messages. * identcpu: updates the sysctl hw.clockrate value * Profiling: if profiling is active when the clock changes, let the user know the results may be inaccurate. Reviewed by: bde, phk MFC after: 1 month	2007-03-26 18:03:29 +00:00
jkim	c06098a406	Catch up with ACPI-CA 20070320 import.	2007-03-22 18:16:43 +00:00
jhb	8b3222b80b	Change the amd64, i386, and ia64 nexus drivers to setup bus space tags and handles when activating a resource via bus_activate_resource() rather than doing some of the work in bus_alloc_resource() and some of it in bus_activate_resource(). One note is that when using isa_alloc_resourcev() on PC-98, drivers now need to just use bus_release_resource() without explicitly calling bus_deactivate_resource() first. nyan@ has already fixed all of the PC-98 drivers.	2007-03-21 15:36:38 +00:00
jhb	fe7d05b231	Add a new apic0 psuedo-device to claim memory resources for the memory address ranges used by local and I/O APICs in the system. Some systems also reserve these ranges as system resources via either PnPBIOS or ACPI, so this device currently attaches after acpi0 and legacy0 so that the system resources are given precedence.	2007-03-20 21:53:31 +00:00
jhb	9c3aae9403	Add a new ram0 pseudo-device that claims memory resouces for physical addresses corresponding to system RAM. On amd64 ram0 uses the SMAP and claims all the type 1 SMAP regions. On i386 ram0 uses the dump_avail[] array. Note that on i386 we have to ignore regions above 4G in PAE kernels since bus resources use longs.	2007-03-20 21:08:39 +00:00
jkim	d7f955fd67	- Add macros for newly added CPUID bits in the corresponding header files. - Use correct capticalization in xTPR as Intel uses in their documents. - Use proper description instead of vendor code name in comment.	2007-03-20 20:22:45 +00:00
jhb	d6f063aecf	Tweak the probe/attach order of devices on the x86 nexus devices. Various BIOS-related psuedo-devices are added at an order of 5. acpi0 is added at an order of 10, and legacy0 is added at an order of 11.	2007-03-20 20:21:44 +00:00
jhb	87cf8b87ee	MFi386 1.173: Display two new Intel feature bits.	2007-03-20 18:48:04 +00:00
jkim	94c4c2a79b	Add another CPUID for AMD CPUs and fix style(9) while I am here.	2007-03-12 20:27:21 +00:00
alc	b03ddb707b	Push down the implementation of PCPU_LAZY_INC() into the machine-dependent header file. Reimplement PCPU_LAZY_INC() on amd64 and i386 making it atomic with respect to interrupts. Reviewed by: bde, jhb	2007-03-11 05:54:29 +00:00
alc	793fdbb540	Completely eliminate "avail_start". It serves no useful purpose.	2007-03-10 20:26:43 +00:00
jhb	7894c3ad20	Defer calling lapic_init() until we've completed the 'MPTable: <...>' printf. Otherwise, printfs inside of lapic_init() (such as during a verbose boot) can uglify the output.	2007-03-09 15:49:57 +00:00
mohans	a332cb00d5	Over NFS, an open() call could result in multiple over-the-wire GETATTRs being generated - one from lookup()/namei() and the other from nfs_open() (for cto consistency). This change eliminates the GETATTR in nfs_open() if an otw GETATTR was done from the namei() path. Instead of extending the vop interface, we timestamp each attr load, and use this to detect whether a GETATTR was done from namei() for this syscall. Introduces a thread-local variable that counts the syscalls made by the thread and uses <pid, tid, thread syscalls> as the attrload timestamp. Thanks to jhb@ and peter@ for a discussion on thread state that could be used as the timestamp with minimal overhead.	2007-03-09 04:02:38 +00:00
scottl	32acf7e446	Don't increment total_bounced when doing no-op dmamap_sync ops.	2007-03-06 18:28:43 +00:00
jhb	432a1d8db5	Change the x86 interrupt code to use FreeBSD CPU IDs (i.e. PCPU_GET(cpuid)) rather than local APIC IDs to keep track of CPUs which can handle interrupts.	2007-03-06 17:16:47 +00:00
alc	e5d284af0e	Acquiring smp_ipi_mtx on every call to pmap_invalidate_() is wasteful. For example, during a buildworld more than half of the calls do not generate an IPI because the only TLB entry invalidated is on the calling processor. This revision pushes down the acquisition and release of smp_ipi_mtx into smp_tlb_shootdown() and smp_targeted_tlb_shootdown() and instead uses sched_pin() and sched_unpin() in pmap_invalidate_() so that thread migration doesn't lead to a missed TLB invalidation. Reviewed by: jhb MFC after: 3 weeks	2007-03-05 21:40:10 +00:00
jhb	7837841549	Use vm_paddr_t rather than uintptr_t when passing the physical address of APICs to lapic_init() and ioapic_create().	2007-03-05 20:35:17 +00:00
jhb	f650676d7d	Add a simple device driver to "eat" any I/O APICs that show up as PCI devices. MFC after: 1 week	2007-03-05 16:22:49 +00:00
jkim	554fb0a678	MFP4: 115220, 115222 - Fix style(9) and reduce diff between amd64 and i386. - Prefix Linuxulator macros with LINUX_ to prevent future collision.	2007-03-02 00:08:47 +00:00
jkim	2620bd06da	MFP4: 115094 Linux does not check file descriptor when MAP_ANONYMOUS is set. This should fix recent LTP test regressions. Reported by: Scot Hetzel (swhetzel at gmail dot com) netchild	2007-02-27 02:08:01 +00:00
netchild	888f5e57b2	Partial MFp4 of 114977: Whitespace commit: Fix grammar, spelling and punctuation. Submitted by: "Scot Hetzel" <swhetzel@gmail.com>	2007-02-24 16:49:25 +00:00
jhb	2e25469b76	Use ih_filter instead of ih_handler in a couple of places. This fixes most INTR_FAST handlers on i386. Reviewed by: piso	2007-02-23 20:03:24 +00:00
piso	6a2ffa86e5	o break newbus api: add a new argument of type driver_filter_t to bus_setup_intr() o add an int return code to all fast handlers o retire INTR_FAST/IH_FAST For more info: http://docs.freebsd.org/cgi/getmsg.cgi?fetch=465712+0+current/freebsd-current Reviewed by: many Approved by: re@	2007-02-23 12:19:07 +00:00
kib	a573a4d361	MFi386 rev. 1.544 of i386/i386/pmap.c: Rounding addr upwards to next 2M boundary in pmap_growkernel() could cause addr to become 0, resulting in an early return without populating the last PDE. Reported and tested by: kris Suggested by: alc MFC after: 1 week	2007-02-19 10:55:16 +00:00
alc	cc7fb68847	Eliminate some acquisitions and releases of the page queues lock that are no longer necessary.	2007-02-18 06:33:02 +00:00
jhb	13cb63c7a4	Add bootverbose printfs to indicate which IDT vectors are assigned to MSI interrupts.	2007-02-15 22:22:57 +00:00
jkim	baeb01e245	Fix accidental removal of an empty line from the previous commit.	2007-02-15 01:20:43 +00:00
jkim	22a98ec2d8	Regen.	2007-02-15 01:15:31 +00:00
jkim	ca68f71cb9	MFP4: 113033 Port iopl(2) from i386. This fixes LTP iopl01 and iopl02 on amd64.	2007-02-15 01:13:36 +00:00
jkim	df99d574b5	MFP4: 113025, 113146, 113177, 113203, 113500, 113546, 113570 - PROT_READ, PROT_WRITE, or PROT_EXEC implies PROT_READ and PROT_EXEC. Linux/ia64's i386 emulation layer does this and it complies with Linux header files. This fixes mmap05 LTP test case on amd64. - Do not adjust stack size when failure has occurred. - Synchronize i386 mmap/mprotect with amd64.	2007-02-15 00:54:40 +00:00
brooks	beaea8e48e	Include GEOM_LABEL in GENERIC. It's very useful and not well publicized enough. Approved by: pjd	2007-02-09 19:03:18 +00:00
jhb	62b1a5668d	Don't send interrupts to CPUs disabled via lapic hints. Reported by: Ludger Bolmerg <lbolmerg ! web.de> MFC after: 3 days Pointy hat to: jhb	2007-02-08 16:49:59 +00:00
marcel	0245423ad8	Evolve the ctlreq interface added to geom_gpt into a generic partitioning class that supports multiple schemes. Current schemes supported are APM (Apple Partition Map) and GPT. Change all GEOM_APPLE anf GEOM_GPT options into GEOM_PART_APM and GEOM_PART_GPT (resp). The ctlreq interface supports verbs to create and destroy partitioning schemes on a disk; to add, delete and modify partitions; and to commit or undo changes made.	2007-02-07 18:55:31 +00:00
bde	2847eb0a1f	Fixed some style bugs. Routine except: - don't use __GNUCLIKE___OFFSETOF, since __offsetof() is a standard FreeBSD implementaion detail which has nothing to do with GNUC.	2007-02-06 18:04:02 +00:00
bde	a1801d3dbe	Simplified PCPU_GET() and PCPU_SET(). We must copy through a temporary variable to avoid invalid constraints in dead code. Use an array of u_char's (inside a struct) instead of a char/short/int/long variable so that the variable and its accesses can be spelled in the same way in all cases and code doesn't need to be cloned just to hold the spelling differences. Fixed strict-aliasing errors in PCPU_SET() and in the amd64 PCPU_GET(). Cast to (void ) as in rev.1.37 of the i386 version where the errors were fixed for the i386 PCPU_GET() only. It would be more correct to copy to and from the temp. variable using memcpy(), but then an ifdef tangle would be required to ensure using the builtin memcpy(). We depend on fairly aggressive optimization to put the temp. variable only in a register despite it being copied using (type )(void )&anothertype and could depend on this when using memcpy() too. This seems to work right even for -O0, but the -O0 case has not been completely tested. This change gives identical object code for all object files in LINT on amd64 (except for one file with a __TIME__ stamp). For LINT on i386 it gives unimportant differences in instruction order and padding in a few object files. This was only tested for -O. This change (actually a previous version of it) gives the following reductions in the number of object files in LINT that fail to compile with -O2 but without the -fno-strict-aliasing kludge: - amd64: 29 (down from 211) - i386: 36 (down from 47) gcc-3.4.6 actually allows the invalid constraints that result from not using the temp. variable, at least with -O[1-2], but gcc-3.3.3 crashes on them and I don't want to depend on compiler bugs.	2007-02-06 16:21:09 +00:00
jhb	adbe57597a	Change GDB_BUFSZ to be large enough to hold a register dump where each register takes 16 characters (64-bit register in hex). In practice this is a slight bit of overkill as 7 of the 56 registers are only 32-bit, but having the buffer too small results in remote kgdb trashing kernel memory when it connects. PR: amd64/108673 Submitted by: Ravi Murty, Nikhil Rao @ Intel MFC after: 3 days	2007-02-05 21:48:32 +00:00
kib	8f812418c1	Introduce some more SO_ option equivalents from Linux to FreeBSD. The msg variable in linux_recvmsg() was not initialized. Copy it from userspace. Submitted by: rdivacky	2007-02-01 13:36:19 +00:00
kib	b9ce1aaa2a	Fix LOR that occurs because proctree_lock was acquired while holding emuldata lock by moving the code upwards outside the emul_lock coverage. Submitted by: rdivacky	2007-02-01 13:27:52 +00:00
kib	84f6f6c749	MFi386: Use LINUX_SIG_VALID macro. Submitted by: rdivacky	2007-02-01 13:24:40 +00:00
jkoshy	2664c129a9	Use a known good stack at the time of servicing an NMI --- reuse the space allocated for the double fault handler since this space is otherwise unused till the time a double fault occurs. This change should have been committed alongside r1.127 of "exception.S", but I somehow missed doing so. Problem reported by: jeff Pointy hat to: jkoshy	2007-01-27 18:13:24 +00:00
jeff	474b917526	- Remove setrunqueue and replace it with direct calls to sched_add(). setrunqueue() was mostly empty. The few asserts and thread state setting were moved to the individual schedulers. sched_add() was chosen to displace it for naming consistency reasons. - Remove adjustrunqueue, it was 4 lines of code that was ifdef'd to be different on all three schedulers where it was only called in one place each. - Remove the long ifdef'd out remrunqueue code. - Remove the now redundant ts_state. Inspect the thread state directly. - Don't set TSF_* flags from kern_switch.c, we were only doing this to support a feature in one scheduler. - Change sched_choose() to return a thread rather than a td_sched. Also, rely on the schedulers to return the idlethread. This simplifies the logic in choosethread(). Aside from the run queue links kern_switch.c mostly does not care about the contents of td_sched. Discussed with: julian - Move the idle thread loop into the per scheduler area. ULE wants to do something different from the other schedulers. Suggested by: jhb Tested on: x86/amd64 sched_{4BSD, ULE, CORE}.	2007-01-23 08:46:51 +00:00
jeff	f53a7830f7	- Allow the schedulers to IPI_PREEMPT idlethread. This puts the decision for this behavior on the initiator side.	2007-01-23 08:38:39 +00:00
bde	b12ed0640c	Cleaned up declaration and initialization of clock_lock. It is only used by clock code, so don't export it to the world for machdep.c to initialize. There is a minor problem initializing it before it is used, since although clock initialization is split up so that parts of it can be done early, the first part was never done early enough to actually work. Split it up a bit more and do the first part as late as possible to document the necessary order. The functions that implement the split are still bogusly exported. Cleaned up initialization of the i8254 clock hardware using the new split. Actually initialize it early enough, and don't work around it not being initialized in DELAY() when DELAY() is called early for initialization of some console drivers. This unfortunately moves a little more code before the early debugger breakpoint so that it is harder to debug. The ordering of console and related initialization is delicate because we want to do as little as possible before the breakpoint, but must initialize a console.	2007-01-23 08:01:20 +00:00
jhb	3624354c54	Expand the MSI/MSI-X API to address some deficiencies in the MSI-X support. - First off, device drivers really do need to know if they are allocating MSI or MSI-X messages. MSI requires allocating powerof2() messages for example where MSI-X does not. To address this, split out the MSI-X support from pci_msi_count() and pci_alloc_msi() into new driver-visible functions pci_msix_count() and pci_alloc_msix(). As a result, pci_msi_count() now just returns a count of the max supported MSI messages for the device, and pci_alloc_msi() only tries to allocate MSI messages. To get a count of the max supported MSI-X messages, use pci_msix_count(). To allocate MSI-X messages, use pci_alloc_msix(). pci_release_msi() still handles both MSI and MSI-X messages, however. As a result of this change, drivers using the existing API will only use MSI messages and will no longer try to use MSI-X messages. - Because MSI-X allows for each message to have its own data and address values (and thus does not require all of the messages to have their MD vectors allocated as a group), some devices allow for "sparse" use of MSI-X message slots. For example, if a device supports 8 messages but the OS is only able to allocate 2 messages, the device may make the best use of 2 IRQs if it enables the messages at slots 1 and 4 rather than default of using the first N slots (or indicies) at 1 and 2. To support this, add a new pci_remap_msix() function that a driver may call after a successful pci_alloc_msix() (but before allocating any of the SYS_RES_IRQ resources) to allow the allocated IRQ resources to be assigned to different message indices. For example, from the earlier example, after pci_alloc_msix() returned a value of 2, the driver would call pci_remap_msix() passing in array of integers { 1, 4 } as the new message indices to use. The rid's for the SYS_RES_IRQ resources will always match the message indices. Thus, after the call to pci_remap_msix() the driver would be able to access the first message in slot 1 at SYS_RES_IRQ rid 1, and the second message at slot 4 at SYS_RES_IRQ rid 4. Note that the message slots/indices are 1-based rather than 0-based so that they will always correspond to the rid values (SYS_RES_IRQ rid 0 is reserved for the legacy INTx interrupt). To support this API, a new PCIB_REMAP_MSIX() method was added to the pcib interface to change the message index for a single IRQ. Tested by: scottl	2007-01-22 21:48:44 +00:00
netchild	42392e7a0b	MFp4 (113077, 113083, 113103, 113124, 113097): Dont expose em->shared to the outside world before its properly initialized. Might not affect anything but its at least a better coding style. Dont expose em via p->p_emuldata until its properly initialized. This also enables us to get rid of some locking and simplify the code because we are workin on a local copy. In linux_fork and linux_vfork create the process in stopped state to be sure that the new process runs with fully initialized emuldata structure [1]. Also fix the vfork (both in linux_clone and linux_vfork) race that could result in never woken up process [2]. Reported by: Scot Hetzel [1] Suggested by: jhb [2] Reviewed by: jhb (at least some important parts) Submitted by: rdivacky Tested by: Scot Hetzel (on amd64) Change 2 comments (in the new code) to comply to style(9). Suggested by: jhb	2007-01-20 14:58:59 +00:00
rodrigc	abfdc2d6f3	Revert previous change. Requested by: kan	2007-01-18 05:46:32 +00:00
rodrigc	d7b980406d	Forward declare __pcpu as a pointer type instead of an array type to eliminate GCC 4.1 error: "array type has incomplete element type".	2007-01-18 02:00:04 +00:00
netchild	4ffc7bc7ea	MFp4 (112893): Make linux_vfork() actually work. This enables make to work again with 2.6. It also fixes the LTP vfork tests. Submitted by: rdivacky	2007-01-14 16:20:37 +00:00
imp	9109b1ceb8	Remove 3rd clause, renumber, ok per email	2007-01-12 07:26:21 +00:00
jhb	0f2597e7c2	Remove magic from rman_activate_resource() that uses the direct map at KERNBASE for the first 1 MB of RAM instead of calling pmap_mapdev(). pmap_mapdev() knows how to handle the first 1 MB (and has known for a while now) and properly maps the memory as UC to boot. MFC after: 2 weeks	2007-01-11 19:40:19 +00:00
jeff	4385f99f93	- Use the correct test in the ipi bitmask handler for IPI_PREEMPT so that we actually issue preemptions. - Remove the #ifdef IPI_PREEMPTION so it is always compiled in. Leave the option which optionally enables support in sched_4bsd. sched_ule.c will soon use this functionality as a run time rather than compile time option. - Compare against the idlethread rather than the priority. There are some idle prio tasks that we can preempt. Discussed with: ups Tested on: i386, amd64	2007-01-11 00:17:02 +00:00
jkim	24024d4f50	Add SSSE3 extensions and correct CNXT-ID spelling for Intel processors.	2007-01-09 19:23:22 +00:00
netchild	977ef4a8bc	MFp4 (112498): Rename the locking flags to EMUL_DOLOCK and EMUL_DONTLOCK to prevent confusion. Submitted by: rdivacky	2007-01-07 19:00:38 +00:00
netchild	d49e0fd374	MFi386 rev 1.56: Bring the linux mmap code more into line with how linux (2.4.x) behaves. Tested by: Scot Hetzel <swhetzel@gmail.com> on amd64 without PROT_EXEC Additionally to the i386 version always use PROT_EXEC in the mapping like the previous version of the amd64 code did. We need to examinate this further to decide what the right thing to do is. For now this fixes several problems in the LTP test runs and should behave regarding PROT_EXEC like before.	2007-01-06 15:58:34 +00:00
netchild	f87d2b65bb	regen after addition of linux_utimes and linux_rt_sigtimedwait	2006-12-31 13:20:31 +00:00
netchild	33166d619b	MFp4 (111746, 108671, 108945, 112352): - add linux utimes syscall [1] - add linux rt_sigtimedwait syscall [2] Submitted by: "Scot Hetzel" <swhetzel@gmail.com> [1] Submitted by: Bruce Becker <hostmaster@whois.gts.net> [2] PR: 93199 [2]	2006-12-31 13:16:00 +00:00
bde	b616a20d08	Fixed some style bugs (mainly assorted errors in comments, and inconsistent spelling of `result').	2006-12-29 15:29:49 +00:00
bde	0a6fe7fb48	Fixed some style bugs (whitespace only).	2006-12-29 14:28:23 +00:00
bde	6a28e42ead	Try harder to garbage-collect the "LOCORE" (really asm) version of MPLOCKED. The cleaning in rev.1.25 was supposed to have been undone by rev.1.26, but 1.26 could never have actually affected asm files since atomic.h is full of C declarations so including it in asm files would just give syntax errors. The asm MPLOCKED is even less needed than when misplaced definitions of it were first removed, and is now unused in any asm file in the src tree except in anachronismns in sys/i386/i386/support.s.	2006-12-29 13:36:26 +00:00
rwatson	a15ca76770	Regenerate.	2006-12-29 01:17:09 +00:00
rwatson	a0002f4c0b	Assign or clean up audit identifiers for a number of additional Linux system calls on the amd64 architecture. Some minor white space tweaks for consistency with other syscalls.master files. Obtained from: TrustedBSD Project	2006-12-29 01:17:02 +00:00
bde	54df184a65	Removed gratuitous cosmetic differences with the i386 version. This mainly involves removing all __CC_SUPPORTS___INLINE__ ifdefs. These ifdefs are even less needed for amd64 than for i386, but the i386 atomic.h never had them. The ifdefs here were just an optimization of obsolescent compatibility cruft (__inline) for a null set of compilers. I think null sets of compilers should only be supported in cases where this is more than an optimization, doesn't require extensive ifdefs, and only involves not-so-obsolescent compatibility cruft (plain inline here).	2006-12-28 08:15:14 +00:00
bde	9d0b590514	Avoid an instruction in atomic_cmpset_{int_long)() in most cases. These functions are used a lot for mutexes, so this reduces the text size of an average kernel by about 0.75%. This wasn't intended to be a significant optimization, but it somehow increased the maximum number of packets per second that can be transmitted by my bge hardware from 320000 to 460000 (this benchmark is CPU-bound and remarkably sensitive to changes in the text section). Details: we would prefer to leave the result of the cmpxchg in %al, but cannot tell gcc that it is there, so we have to convert it to an integer register. We converted to %al, then to %[re]ax, but the latter step is usually wasted since gcc usually only wants the condition code and can recover it from %al just as easily as from %[re]ax. Let gcc promote %al in the few cases where this is needed. Nearby style fixes; - let gcc manage the load of `res', and don't abuse `res' for a copy of `exp' - don't echo `res's name in comments - consistently spell the condition code as 'e' after comparison for equality - don't hard-code %al anywhere except in constraints - for the version that doesn't use cmpxchg, there is no requirement to use %al anywhere, so don't hard-code it in the constraints either. Style non-fix: - for the versions that use cmpxchg, keep using "a" (was %[re]ax, now %al) for the main output operand, although this is not required. The input and output operands that use the "a" constraint are now decoupled, and this makes things clearer except for the reason that the output register is hard-coded. It is now just a hack to tell gcc that the input "a" has been clobbered without increasing the number of operands.	2006-12-27 20:26:00 +00:00
davidxu	ed23d7715b	Fix a panic when rebooting a SMP machine, when option STOP_NMI is used, nmi handler is used to stop other processors, nmi hander calls trap(), however, trap() now accepts a pointer rather than a reference, this was changed by kmacy@.	2006-12-23 03:30:50 +00:00
jkim	3b05cb0c58	MFP4: 109655 - Move linux_nanosleep() from src/sys/amd64/linux32/linux32_machdep.c to src/sys/compat/linux/linux_time.c. - Validate timespec ranges before use as Linux kernel does. - Fix l_timespec structure. - Clean up style(9) nits.	2006-12-20 20:17:35 +00:00
davidxu	5a984630fa	Add a lwpid field into per-cpu structure, the lwpid represents current running thread's id on each cpu. This allow us to add in-kernel adaptive spin for user level mutex. While spinning in user space is possible, without correct thread running state exported from kernel, it hardly can be implemented efficiently without wasting cpu cycles, however exporting thread running state unlikely will be implemented soon as it has to design and stablize interfaces. This implementation is transparent to user space, it can be disabled dynamically. With this change, mutex ping-pong program's performance is improved massively on SMP machine. performance of mysql super-smack select benchmark is increased about 7% on Intel dual dual-core2 Xeon machine, it indicates on systems which have bunch of cpus and system-call overhead is low (athlon64, opteron, and core-2 are known to be fast), the adaptive spin does help performance. Added sysctls: kern.threads.umtx_dflt_spins if the sysctl value is non-zero, a zero umutex.m_spincount will cause the sysctl value to be used a spin cycle count. kern.threads.umtx_max_spins the sysctl sets upper limit of spin cycle count. Tested on: Athlon64 X2 3800+, Dual Xeon 5130	2006-12-20 04:40:39 +00:00
kmacy	80aca2a305	Evidently neither GENERIC nor kan's config had isa in it :-0. As Doug Barton says, "embrace the LINT".	2006-12-17 21:51:44 +00:00
kmacy	2061110785	Newer versions of gcc don't support treating structures passed by value as if they were really passed by reference. Specifically, the dead stores elimination pass in the GCC 4.1 optimiser breaks the non-compliant behavior on which FreeBSD relied. This change brings FreeBSD up to date by switching trap frames to being explicitly passed by reference. Reviewed by: kan Tested by: kan	2006-12-17 06:48:40 +00:00
yongari	12c67e24f1	Add msk(4) to the list of drivers supported by GENERIC kernel.	2006-12-13 03:41:47 +00:00
jhb	dfb2d953f1	Give Host-PCI bridge drivers their own pcib_alloc_msi() and pcib_alloc_msix() methods instead of using the method from the generic PCI-PCI bridge driver as the PCI-PCI methods will be gaining some PCI-PCI specific logic soon.	2006-12-12 19:27:01 +00:00
jhb	f74d404e39	Sort function prototypes.	2006-12-12 19:24:45 +00:00
jhb	65d8bd30a0	Add a function to return the MD interrupt source cookie associated with an interrupt event. Use this in the x86 code to fixup the intrcnt names when an interrupt handler is removed.	2006-12-12 19:20:19 +00:00
sobomax	8563c840b5	Allow machdep.cpu_idle_hlt to be set from the loader. This should allow to workaround the problem with SMP kernels on Turion64 X2 processors described in kern/104678 and may be useful in other situations too. MFC after: 3 days	2006-12-06 18:27:17 +00:00
julian	396ed947f6	Threading cleanup.. part 2 of several. Make part of John Birrell's KSE patch permanent.. Specifically, remove: Any reference of the ksegrp structure. This feature was never fully utilised and made things overly complicated. All code in the scheduler that tried to make threaded programs fair to unthreaded programs. Libpthread processes will already do this to some extent and libthr processes already disable it. Also: Since this makes such a big change to the scheduler(s), take the opportunity to rename some structures and elements that had to be moved anyhow. This makes the code a lot more readable. The ULE scheduler compiles again but I have no idea if it works. The 4bsd scheduler still reqires a little cleaning and some functions that now do ALMOST nothing will go away, but I thought I'd do that as a separate commit. Tested by David Xu, and Dan Eischen using libthr and libpthread.	2006-12-06 06:34:57 +00:00
ru	0d6f98c10f	Use a different bitmask for superpages' base address so that it doesn't conflict with the PG_PDE_PAT bit. (We still don't mask off all the reserved bits but that's okay for now.) Reviewed by: alc	2006-12-05 11:31:33 +00:00
netchild	8035e46d9a	MFP4 (110939): MFi386: return EOPNOTSUPP for unknown module events. Submitted by: rdivacky	2006-12-03 21:06:07 +00:00
netchild	b2ff0e08c0	Sync with i386 (remove the LINUX stuff) now that the module is usable.	2006-12-03 21:02:09 +00:00
bde	852d9800f0	Optimized RTC accesses by avoiding null writes to the index register and by only delaying when an RTC register is written to. The delay after writing to the data register is now not just a workaround. This reduces the number of ISA accesses in the usual case from 4 to 1. The usual case is 2 rtcin()'s for each RTC interrupt. The index register is almost always RTC_INTR for this. The 3 extra ISA accesses were 1 for writing the index and 2 for delays. Some delays are needed in theory, but in practice they now just slow down slow accesses some more since almost eveyone including us does them wrong so modern systems enforce sufficient delays in hardware. I used to have the delays ifdefed out, but with the index register optimization the delays are rarely executed so the old magic ones can be kept or even implemented non- magically without significant cost. Optimizing RTC interrupt handling is more interesting than it used to be because RTC interrupts are currently needed to fix the more efficient apic timer interrupts on some systems. apic_timer_hz is normally 2000 so the RTC interrupt rate needs to be 2048 to keep the apic timer firing on such systems. Without these changes, each RTC interrupt normally took 10 ISA accesses (2 PIC accesses and 2 sets of 4 RTC accesses). Each ISA access takes 1-1.5uS so 10 of then at 2048 Hz takes 2-3% of a CPU. Now 4 of them take 0.8-1.2% of a CPU.	2006-12-03 03:49:28 +00:00
jb	da35e3e55f	Turn console printf buffering into a kernel option and only on by default for sun4v where it is absolutely required. This change moves the buffer from struct pcpu to the stack to avoid using the critical section which created a LOR in a couple of cases due to interaction with the tty code and kqueue. The LOR can't be fixed with the critical section and the pcpu buffer can't be used without the critical section. Putting the buffer on the stack was my initial solution, but it was pointed out that the stress on the stack might cause problems depending on the call path. We don't have a way of creating tests for those possible cases, so it's best to leave this as an option for the time being. In time we may get enough data to enable this option more generally.	2006-11-30 04:17:05 +00:00
ru	dfdf4ba035	Differentiate between data and instruction fetch in the fatal page fault trap handler. Reviewed by: alc	2006-11-28 20:04:00 +00:00
ru	6b57c74ca8	Use a define instead of a "magic" value.	2006-11-23 21:37:04 +00:00

1 2 3 4 5 ...

4918 Commits