freebsd-skq

Author	SHA1	Message	Date
alc	4fcd318e7c	Eliminate an unnecessary initialization from trap_pfault() that also happens to contain a style error.	2006-08-14 19:53:53 +00:00
jhb	98def9ff62	Don't try to preserve PAT bits in pmap_enter(). We currently on pages that aren't mapped via pmap_enter() (KVA). We will eventually support PAT bits on user pages, but those will require some sort of MI caching mode stored in the vm_page. Reviewed by: alc	2006-08-14 15:39:41 +00:00
jhb	ce9f8963fd	First pass at allowing memory to be mapped using cache modes other than WB (write-back) on x86 via control bits in PTEs and PDEs (including making use of the PAT MSR). Changes include: - A new pmap_mapdev_attr() function for amd64 and i386 which takes an additional parameter (relative to pmap_mapdev()) specifying the cache mode for this mapping. Note that on amd64 only WB mappings are done with the direct map, all other modes result in a private mapping. - pmap_mapdev() on i386 and amd64 now defaults to using UC (uncached) mappings rather than WB. Previously we relied on the BIOS setting up MTRR's to enforce memio regions being treated as UC. This might make hw.cbb_start_memory unnecessary in some cases now for example. - A new pmap_mapbios()/pmap_unmapbios() API has been added to allow places that used pmap_mapdev() to map non-device memory (such as ACPI tables) to do so using WB as before. - A new pmap_change_attr() function for amd64 and i386 that changes the caching mode for a range of KVA. Reviewed by: alc	2006-08-11 19:22:57 +00:00
netchild	1f1a93f2ab	Add some more errno mappings (bsd -> linux) and a comment about the status.. Submitted by: "Intron" <mag@intron.ac>	2006-08-10 22:05:25 +00:00
imp	b7167a2ca5	Eliminate one set of XBOX #ifdefs. The Xbox code just needs to set a different TIMER_FREQ value than default. Accomplish this via the config file rather than via an #ifdef.	2006-08-09 23:47:38 +00:00
imp	af62585f2d	Minor style(9) nit.	2006-08-09 23:37:30 +00:00
njl	6b5ea55333	If a beep was enabled, turn it off 3 seconds after resume. MFC after: 3 days	2006-08-08 01:30:54 +00:00
alc	99dcbcf3fd	Eliminate the acquisition and release of the page queues lock around a call to vm_page_sleep_if_busy().	2006-08-06 06:29:16 +00:00
mr	77027a81d4	Dont overwrite cpu_model in the case of Via's C3-CPU. Noticed by: Mike Tancsa MFC after: 2 days	2006-08-04 13:49:16 +00:00
yar	209e4786e7	Commit the results of the typo hunt by Darren Pilgrim. This change affects documentation and comments only, no real code involved. PR: misc/101245 Submitted by: Darren Pilgrim <darren pilgrim bitfreak org> Tested by: md5(1) MFC after: 1 week	2006-08-04 07:56:35 +00:00
alc	a152234cf9	Complete the transition from pmap_page_protect() to pmap_remove_write(). Originally, I had adopted sparc64's name, pmap_clear_write(), for the function that is now pmap_remove_write(). However, this function is more like pmap_remove_all() than like pmap_clear_modify() or pmap_clear_reference(), hence, the name change. The higher-level rationale behind this change is described in src/sys/amd64/amd64/pmap.c revision 1.567. The short version is that I'm trying to clean up and fix our support for execute access. Reviewed by: marcel@ (ia64)	2006-08-01 19:06:06 +00:00
obrien	040ba91ea8	Correct spelling of 3DNow!.	2006-08-01 01:23:39 +00:00
marcel	7067faff16	Remove sio(4) and related options from MI files to amd64, i386 and pc98 MD files. Remove nodevice and nooption lines specific to sio(4) from ia64, powerpc and sparc64 NOTES. There were no such lines for arm yet. sio(4) is usable on less than half the platforms, not counting a future mips platform. Its presence in MI files is therefore increasingly becoming a burden.	2006-07-29 18:38:54 +00:00
jhb	3a707d012d	Retire SYF_ARGMASK and remove both SYF_MPSAFE and SYF_ARGMASK. sy_narg is now back to just being an argument count.	2006-07-28 20:22:58 +00:00
jhb	dee1b3da95	Regen for MPSAFE flag removal.	2006-07-28 19:08:37 +00:00
jhb	c62c38439f	Now that all system calls are MPSAFE, retire the SYF_MPSAFE flag used to mark system calls as being MPSAFE: - Stop conditionally acquiring Giant around system call invocations. - Remove all of the 'M' prefixes from the master system call files. - Remove support for the 'M' prefix from the script that generates the syscall-related files from the master system call files. - Don't explicitly set SYF_MPSAFE when registering nfssvc.	2006-07-28 19:05:28 +00:00
jhb	6a211b6d81	Various fixes to comments in the syscall master files including removing cruft from the audit import and adding mention of COMPAT4 to freebsd32.	2006-07-28 18:55:18 +00:00
jhb	12302c47d0	Unify the checking for lock misbehavior in the various syscall() implementations and adjust some of the checks while I'm here: - Add a new check to make sure we don't return from a syscall in a critical section. - Add a new explicit check before userret() to make sure we don't return with any locks held. The advantage here is that we can include the syscall number and name in syscall() whereas that info is not available in userret(). - Drop the mtx_assert()'s of sched_lock and Giant. They are replaced by the more general checks just added. MFC after: 2 weeks	2006-07-27 22:32:30 +00:00
jhb	dc69447236	Argh, fix compile with XBOX enabled. Somehow I missed a LINT compile. :(	2006-07-27 22:19:02 +00:00
jhb	c95747d9a1	Don't allow MAXMEM or hw.physmem to extend the top of memory if our memory map was obtained from the SMAP. SMAP is trustworthy, and the memory extending feature is a band-aid for older systems where FreeBSD's methods of detecting memory were not always trustworthy. This fixes the issue where using hw.physmem could result in the ACPI tables getting trashed breaking ACPI. MFC after: 3 days Tested on: i386	2006-07-27 19:47:22 +00:00
yongari	9b54b752db	Add stge(4) to the list of drivers supported by GENERIC kernel.	2006-07-25 01:06:32 +00:00
jhb	e96f2e292b	Regen.	2006-07-21 20:41:33 +00:00
jhb	675c87997e	- Pass the MPSAFE flag to namei() in linux_uselib() and handle conditional Giant VFS locking in that function. - Remove bogus code to handle the case where namei() returns success but a NULL vnode pointer. - Note that this code duplicates exec_check_permissions() and annotate where it differs. - Hold the vnode lock longer to protect the write to set VV_TEXT in v_vflag. - Mark linux_uselib() MPSAFE. Reviewed by: rwatson	2006-07-21 20:22:13 +00:00
alc	004ef88e09	Add pmap_clear_write() to the interface between the virtual memory system's machine-dependent and machine-independent layers. Once pmap_clear_write() is implemented on all of our supported architectures, I intend to replace all calls to pmap_page_protect() by calls to pmap_clear_write(). Why? Both the use and implementation of pmap_page_protect() in our virtual memory system has subtle errors, specifically, the management of execute permission is broken on some architectures. The "prot" argument to pmap_page_protect() should behave differently from the "prot" argument to other pmap functions. Instead of meaning, "give the specified access rights to all of the physical page's mappings," it means "don't take away the specified access rights from all of the physical page's mappings, but do take away the ones that aren't specified." However, owing to our i386 legacy, i.e., no support for no-execute rights, all but one invocation of pmap_page_protect() specifies VM_PROT_READ only, when the intent is, in fact, to remove only write permission. Consequently, a faithful implementation of pmap_page_protect(), e.g., ia64, would remove execute permission as well as write permission. On the other hand, some architectures that support execute permission have basically ignored whether or not VM_PROT_EXECUTE is passed to pmap_page_protect(), e.g., amd64 and sparc64. This change represents the first step in replacing pmap_page_protect() by the less subtle pmap_clear_write() that is already implemented on amd64, i386, and sparc64. Discussed with: grehan@ and marcel@	2006-07-20 17:48:41 +00:00
alc	f0337456d9	MFamd64 pmap_clear_ptes() is already convoluted. This will worsen with the implementation of superpages. Eliminate it and add pmap_clear_write(). There are no functional changes. Checked by: md5	2006-07-18 03:17:12 +00:00
alc	45cb178426	Now that free_pv_entry() accesses the pmap, call free_pv_entry() in pmap_remove_all() before rather than after the pmap is unlocked. At present, the page queues lock provides sufficient sychronization. In the future, the page queues lock may not always be held when free_pv_entry() is called.	2006-07-17 03:10:17 +00:00
alc	ae11c9115b	MFamd64 Make three simplifications to pmap_ts_referenced(): Eliminate an initialized but otherwise unused variable. Eliminate an unnecessary test. Exit the loop in a shorter way.	2006-07-16 21:05:58 +00:00
alc	5afff0eadf	Eliminate the remaining uses of "register". Convert the remaining K&R-style function declarations to ANSI-style. Eliminate excessive white space from pmap_ts_referenced().	2006-07-16 19:43:49 +00:00
alc	8f169c00cb	Make pc_freemask an array of uint32_t, rather than uint64_t. (I believe that the use of the latter is simply an oversight in porting the new pv entry code from amd64.)	2006-07-15 07:24:30 +00:00
jhb	df5064de23	Regen.	2006-07-14 15:42:47 +00:00
jhb	9b1ba3b554	Somewhat surprisingly, ibcs2_ioctl() is MPSAFE as it is without needing any further fixes.	2006-07-14 15:42:21 +00:00
jhb	917f450cf6	Regen.	2006-07-14 15:31:01 +00:00
jhb	ebe022b0c4	Mark ibcs2_mount() (just returns EINVAL) and ibcs2_umount() (just calls unmount(2)) MPSAFE.	2006-07-14 15:30:50 +00:00
jhb	6ae97a774e	Regen.	2006-07-14 15:11:46 +00:00
jhb	e860523612	ibcs2_sigprocmask() is already marked MPSAFE in syscalls.xenix, so mark it MPSAFE in syscalls.isc.	2006-07-14 15:11:20 +00:00
jkim	03e0206d84	Sync specialreg.h changes between amd64 and i386 with few fixes.	2006-07-13 16:09:40 +00:00
jhb	a72b0bcd7f	Simplify the pager support in DDB. Allowing different db commands to install custom pager functions didn't actually happen in practice (they all just used the simple pager and passed in a local quit pointer). So, just hardcode the simple pager as the only pager and make it set a global db_pager_quit flag that db commands can check when the user hits 'q' (or a suitable variant) at the pager prompt. Also, now that it's easy to do so, enable paging by default for all ddb commands. Any command that wishes to honor the quit flag can do so by checking db_pager_quit. Note that the pager can also be effectively disabled by setting $lines to 0. Other fixes: - 'show idt' on i386 and pc98 now actually checks the quit flag and terminates early. - 'show intr' now actually checks the quit flag and terminates early.	2006-07-12 21:22:44 +00:00
mr	0130801813	Initialise (if necessary) the VIA C3/C7 features. Store the capabilities for further use by random(4), padlock(4), ... Obtained from: mostly OpenBSD MFC after: 1 week	2006-07-12 19:46:08 +00:00
mr	cb9048aebc	fix typo in identcpu.c and add one define to specialreg.h. MFC after: 1 week	2006-07-12 16:52:56 +00:00
mr	83b3720abd	First step to identify and initialize the newer VIA C7 CPU as found in a VIA EPIA EN-15000 board. Obtained from: large parts from OpenBSD	2006-07-12 14:52:32 +00:00
jkim	3117fa3da4	Add two new CPUID bits for AMD CPUs, i. e., SVM and extended APIC register.	2006-07-12 06:04:12 +00:00
jhb	286a0ec5a8	Regen.	2006-07-11 20:55:23 +00:00
jhb	9569e81b84	- Add conditional VFS Giant locking to getdents_common() (linux ABIs), ibcs2_getdents(), ibcs2_read(), ogetdirentries(), svr4_sys_getdents(), and svr4_sys_getdents64() similar to that in getdirentries(). - Mark ibcs2_getdents(), ibcs2_read(), linux_getdents(), linux_getdents64(), linux_readdir(), ogetdirentries(), svr4_sys_getdents(), and svr4_sys_getdents64() MPSAFE.	2006-07-11 20:52:08 +00:00
jhb	0e6d9ac511	Retire the stackgap macros from ibcs2 as they are no longer used. Push the includes of <sys/exec.h> and <sys/sysent.h> down into the only files that now need them.	2006-07-10 17:59:26 +00:00
jhb	3924866a78	Regen.	2006-07-10 15:55:38 +00:00
jhb	387e004f72	Mark ibcs2_msgsys(), ibcs2_semsys(), and ibcs2_shmsys() MPSAFE.	2006-07-10 15:55:17 +00:00
twinterg	3e2c62d319	Extend i4b to support CAPI manager based ISDN controllers (CAPI manager is part of c4b, CAPI for BSD). This is a preparation to add CAPI for BSD to the source tree. Approved by: hm (mentor) MFC after: 2 weeks	2006-07-09 21:16:06 +00:00
mjacob	6850136348	Make the firmware assist driver resident in preparation for isp using it. Reviewed by: sam, max	2006-07-09 16:41:22 +00:00
mjacob	4574ebad6a	If PAE is built w/o modules, make sure that isp(4) has its firmware resident as well.	2006-07-09 16:38:58 +00:00
jhb	fff357912c	Regen.	2006-07-08 20:14:34 +00:00
jhb	094306d69d	- Split ioctl() up into ioctl() and kern_ioctl(). The kern_ioctl() assumes that the 'data' pointer is already setup to point to a valid KVM buffer or contains the copied-in data from userland as appropriate (ioctl(2) still does this). kern_ioctl() takes care of looking up a file pointer, implementing FIONCLEX and FIOCLEX, and calling fi_ioctl(). - Use kern_ioctl() to implement xenix_rdchk() instead of using the stackgap and mark xenix_rdchk() MPSAFE.	2006-07-08 20:12:14 +00:00
jhb	28bb163264	Use kern_connect() in spx_open() to avoid the need for the stackgap. I also used kern_close() for simplicity though close(2) wasn't requiring the use of the stackgap.	2006-07-08 20:05:04 +00:00
jhb	df27227bab	- Split the IBCS2 ipc foosys() system calls up into subfunctions matching the organization in svr4_ipc.c. - Use kern_msgctl(), kern_semctl(), and kern_shmctl() instead of the stackgap.	2006-07-08 19:54:12 +00:00
jhb	9f226f3f9d	Use ibsc2_key_t rather than key_t.	2006-07-08 19:52:49 +00:00
jhb	a63b63284f	Regen.	2006-07-06 21:43:14 +00:00
jhb	4d231459c7	- Protect the list of linux ioctl handlers with an sx lock. - Hold Giant while calling linux ioctl handlers for now as they aren't all known to be MPSAFE yet. - Mark linux_ioctl() MPSAFE.	2006-07-06 21:42:36 +00:00
jhb	e216ca9f3b	Regen.	2006-07-06 21:33:14 +00:00
jhb	54c687571c	Add kern_setgroups() and kern_getgroups() and use them to implement ibcs2_[gs]etgroups() rather than using the stackgap. This also makes ibcs2_[gs]etgroups() MPSAFE. Also, it cleans up one bit of weirdness in the old setgroups() where it allocated an entire credential just so it had a place to copy the group list into. Now setgroups just allocates a NGROUPS_MAX array on the stack that it copies into and then passes to kern_setgroups().	2006-07-06 21:32:20 +00:00
jhb	6fe08fdbd3	Use the regular poll(2) function to implement poll(2) for the IBCS2 compat ABI as FreeBSD's poll(2) is ABI compatible. The ibcs2_poll() function attempted to implement poll(2) using a wrapper around select(2). Besides being somewhat ugly, it also had at least one bug in that instead of allocating complete fdset's on the stack via the stackgap it just allocated pointers to fdsets.	2006-07-06 21:29:05 +00:00
davidxu	41e65e69dc	Temporarily remove SCHED_CORE, it seems I have so many works can do now, one example is POSIX priority mutex for libthr.	2006-07-05 02:32:55 +00:00
alc	4748a85152	Correct an error in the new pmap_collect(), thus only affecting HEAD. Specifically, the pv entry was always being freed to the caller's pmap instead of the pmap to which the pv entry belongs.	2006-07-02 18:22:47 +00:00
rink	6131479c85	Updated the XBOX kernel to use the new nfe(4) driver obtained from OpenBSD. This driver seems to give a small performance increase, and should lead to better maintainability in the future. The nForce Ethernet-specific hack in sys/i386/xbox/xbox.c is still required, judging from dev/nfe/if_nfe.c. The condition it hacks will almost certainly only occur on XBOX-es anyway, so it is best left there. Approved by: imp (mentor)	2006-06-27 20:22:32 +00:00
jhb	693417c025	Regen.	2006-06-27 18:32:16 +00:00
jhb	dff69a853e	- Add a kern_semctl() helper function for __semctl(). It accepts a pointer to a copied-in copy of the 'union semun' and a uioseg to indicate which memory space the 'buf' pointer of the union points to. This is then used in linux_semctl() and svr4_sys_semctl() to eliminate use of the stackgap. - Mark linux_ipc() and svr4_sys_semsys() MPSAFE.	2006-06-27 18:28:50 +00:00
jhb	db4d1f72c7	Regen.	2006-06-27 14:47:08 +00:00
jhb	5ceeece21b	- Expand the scope of Giant some in mount(2) to protect the vfsp structure from going away. mount(2) is now MPSAFE. - Expand the scope of Giant some in unmount(2) to protect the mp structure (or rather, to handle concurrent unmount races) from going away. umount(2) is now MPSAFE, as well as linux_umount() and linux_oldumount(). - nmount(2) and linux_mount() were already MPSAFE.	2006-06-27 14:46:31 +00:00
alc	49b81721c7	Correct a very old and very obscure bug: vmspace_fork() calls pmap_copy() if the mapping is VM_INHERIT_SHARE. Suppose the mapping is also wired. vmspace_fork() clears the wiring attributes in the vm map entry but pmap_copy() copies the PG_W attribute in the PTE. I don't think this is catastrophic. It blocks pmap_remove_pages() from destroying the mapping and corrupts the pmap's wiring count. This revision fixes the problem by changing pmap_copy() to clear the PG_W attribute. Reviewed by: tegge@	2006-06-27 04:28:23 +00:00
obrien	5094b5a232	Add a pure open source nForce Ethernet driver, under BSDL. This driver was ported from OpenBSD by Shigeaki Tagashira <shigeaki@se.hiroshima-u.ac.jp> and posted at http://www.se.hiroshima-u.ac.jp/~shigeaki/software/freebsd-nfe.html It was additionally cleaned up by me. It is still a work-in-progress and thus is purposefully not in GENERIC. And it conflicts with nve(4), so only one should be loaded.	2006-06-26 23:41:07 +00:00
babkin	f0555f2de9	Backed out the change by request from rwatson. PR: kern/14584	2006-06-26 22:03:22 +00:00
jhb	368eefb9bf	Regen.	2006-06-26 18:37:36 +00:00
jhb	ddfdf64e37	linux_brk() is MPSAFE.	2006-06-26 18:36:16 +00:00
alc	1b735c58a6	Eliminate a comment that became stale after revision 1.547.	2006-06-25 22:15:02 +00:00
babkin	3d8be823b0	The common UID/GID space implementation. It has been discussed on -arch in 1999, and there are changes to the sysctl names compared to PR, according to that discussion. The description is in sys/conf/NOTES. Lines in the GENERIC files are added in commented-out form. I'll attach the test script I've used to PR. PR: kern/14584 Submitted by: babkin	2006-06-25 18:37:44 +00:00
alc	13b4d64335	Change get_pv_entry() such that the call to vm_page_alloc() specifies VM_ALLOC_NORMAL instead of VM_ALLOC_SYSTEM when try is TRUE. In other words, when get_pv_entry() is permitted to fail, it no longer tries as hard to allocate a page. Change pmap_enter_quick_locked() to fail rather than wait if it is unable to allocate a page table page. This prevents a race between pmap_enter_object() and the page daemon. Specifically, an inactive page that is a successor to the page that was given to pmap_enter_quick_locked() might become a cache page while pmap_enter_quick_locked() waits and later pmap_enter_object() maps the cache page violating the invariant that cache pages are never mapped. Similarly, change pmap_enter_quick_locked() to call pmap_try_insert_pv_entry() rather than pmap_insert_entry(). Generally speaking, pmap_enter_quick_locked() is used to create speculative mappings. So, it should not try hard to allocate memory if free memory is scarce. Add an assertion that the object containing m_start is locked in pmap_enter_object(). Remove a similar assertion from pmap_enter_quick_locked() because that function no longer accesses the containing object. Remove a stale comment. Reviewed by: ups@	2006-06-20 20:52:11 +00:00
netchild	64550de991	regen after change to syscalls.master	2006-06-20 20:41:29 +00:00
netchild	247b98ef25	Switch to using the DUMMY infrastructure instead of UNIMPL for the new syscalls. This way there will be a log message printed to the console (this time for real). Note: UNIMPL should be used for syscalls we do not implement ever, e.g. syscalls to load linux kernel modules. Submitted by: rdivacky Sponsored by: Goole SoC 2006 P4 IDs: 99600, 99602	2006-06-20 20:38:44 +00:00
yar	7e90b114e3	We no longer need to disable interrupts in MD trap machinery when we're about to call kdb_trap() because the latter MI function can disable interrupts by itself now. Pointed out by: bde X-MFC remark: depends on kern/subr_kdb.c#1.18 Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-20 12:44:21 +00:00
davidxu	76a64a293b	Style fix, use low-case.	2006-06-19 07:55:29 +00:00
davidxu	9ef6a74011	Clear bit 22 in MSR IA32_MISC_ENABLE, according to Intel document, when the bit 22 is set to 1, CPUID with EAX=0 returns a maximum value in EAX[7..0] of 3, when set to 0(default), CPUID with EAX=0 returns the number corresponding to the maximum standard function supported. On my machine, BIOS sets the bit to 1 to make it to be compatible with old OS, this causes dual-core Pentium-D (two physical cores) to be identified as hyperthreading (two logical cores) by function mp_topology().	2006-06-19 07:51:47 +00:00
yar	2ace0191b7	Fix style while I'm here.	2006-06-18 12:13:49 +00:00
yar	e95f07384c	The i386 "call" instruction works as follows: it pushes the return address on the stack and only then "dereferences" %pc. Therefore, in the case of a call to an invalid address, we arrive to the trap handler with the invalid value in tf_eip. This used to prevent db_backtrace() from assigning the most recent and interesting frame on the stack to the right spot in the right function, from which the invalid call was attempted. Try to detect and work around that by recovering the return address from the stack. The work-around requires the fault address be passed to db_backtrace(). Smuggle it as tf_err. MFC after: 1 month Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-18 12:07:00 +00:00
mjacob	5292755b7e	Unbreak tinderbox- fix device_printf arg to accomodate different sizes of vm_paddr_t in different contexts (e.g., PAE vs. non PAE).	2006-06-16 14:04:21 +00:00
yar	4d78a1cd9f	Return -1 from db_numargs() if number of args couldn't be guessed. Use this later to indicate in backtrace output that args shown are uncertain. Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-16 11:49:37 +00:00
yar	f92d46b4f1	Guess the number of arguments to a function somewhat better. Now GCC likes to stick a "mov %eax, %FOO" instruction before "addl $BAR, %esp" if the function just called returns an int, which is a very common case in the kernel. Sponsored by: RiNet (Cronyx Plus LLC)	2006-06-16 11:14:54 +00:00
netchild	11681ee0b5	Remove COMPAT_43 from GENERIC (and other kernel configs). For amd64 there's an explicit comment that it's needed for the linuxolator. This is not the case anymore. For all other architectures there was only a "KEEP THIS". I'm (and other people too) running a COMPAT_43-less kernel since it's not necessary anymore for the linuxolator. Roman is running such a kernel for a for longer time. No problems so far. And I doubt other (newer than ia32 or alpha) architectures really depend on it. This may result in a small performance increase for some workloads. If the removal of COMPAT_43 results in a not working program, please recompile it and all dependencies and try again before reporting a problem. The only place where COMPAT_43 is needed (as in: does not compile without it) is in the (outdated/not usable since too old) svr4 code. Note: this does not remove the COMPAT_43TTY option. Nagging by: rdivacky	2006-06-15 19:58:53 +00:00
ups	b3a7439a45	Remove mpte optimization from pmap_enter_quick(). There is a race with the current locking scheme and removing it should have no measurable performance impact. This fixes page faults leading to panics in pmap_enter_quick_locked() on amd64/i386. Reviewed by: alc,jhb,peter,ps	2006-06-15 01:01:06 +00:00
netchild	de5cf4e1bd	regen after MFP4 (soc2006/rdivacky_linuxolator) of syscalls.master P4-Changes: similar to 98673 and 98675 but regenerated locally Sponsored by: Google SoC 2006 Submitted by: rdivacky	2006-06-13 18:48:30 +00:00
netchild	a561ebc3f4	MFP4 (soc2006/rdivacky_linuxolator) Update of syscall.master: o Adding of several new dummy syscalls (268-310) o Synchronization of amd64 syscall.master with i386 one o Auditing added to amd64 syscall.master o Change auditing type for lstat syscall (bugfix). [1] P4-Changes: 98672, 98674 Noticed by: rwatson [1] Sponsored by: Google SoC 2006 Submitted by: rdivacky	2006-06-13 18:43:55 +00:00
davidxu	82b666ed4a	Add scheduler CORE, the work I have done half a year ago, recent, I picked it up again. The scheduler is forked from ULE, but the algorithm to detect an interactive process is almost completely different with ULE, it comes from Linux paper "Understanding the Linux 2.6.8.1 CPU Scheduler", although I still use same word "score" as a priority boost in ULE scheduler. Briefly, the scheduler has following characteristic: 1. Timesharing process's nice value is seriously respected, timeslice and interaction detecting algorithm are based on nice value. 2. per-cpu scheduling queue and load balancing. 3. O(1) scheduling. 4. Some cpu affinity code in wakeup path. 5. Support POSIX SCHED_FIFO and SCHED_RR. Unlike scheduler 4BSD and ULE which using fuzzy RQ_PPQ, the scheduler uses 256 priority queues. Unlike ULE which using pull and push, the scheduelr uses pull method, the main reason is to let relative idle cpu do the work, but current the whole scheduler is protected by the big sched_lock, so the benefit is not visible, it really can be worse than nothing because all other cpu are locked out when we are doing balancing work, which the 4BSD scheduelr does not have this problem. The scheduler does not support hyperthreading very well, in fact, the scheduler does not make the difference between physical CPU and logical CPU, this should be improved in feature. The scheduler has priority inversion problem on MP machine, it is not good for realtime scheduling, it can cause realtime process starving. As a result, it seems the MySQL super-smack runs better on my Pentium-D machine when using libthr, despite on UP or SMP kernel.	2006-06-13 13:12:56 +00:00
marius	9e60ac43b5	Make the ISAPNP code optional and only enable it on i386 and pc98 (used for CBUS-PNP cards there) by default, as there are no amd64 and sparc64 machines with ISA slots and which therefore could make use of this code known to exist. For sparc64 this additionally allows to get rid of the compat shims for in{b,w,l}()/out{b,w,l}() etc and the associated hacks. OK'ed by: imp, peter	2006-06-12 21:07:13 +00:00
jhb	3ec293f314	Enable a few more things in x86 NOTES to get broader LINT coverage: - Turn on iwi(4), ipw(4), and ndis(4) on amd64 and i386. - Turn on ral(4) and ural(4) on i386, pc98, and amd64.	2006-06-12 20:38:17 +00:00
alc	cbeb562815	Don't invalidate the TLB in pmap_qenter() unless the old mapping was valid. Most often, it isn't. Reviewed by: tegge@	2006-06-12 20:05:27 +00:00
imp	038d1db25e	Add the ability to subset the devices that UART pulls in. This allows the arm to compile without all the extras that don't appear, at least not in the flavors of ARM I deal with. This helps us save about 100k. If I've botched the available devices on a platform, please let me know and I'll correct ASAP.	2006-06-12 04:21:50 +00:00
njl	547b3085ee	* Ask for a page-aligned page instead of an arbitrary address. This should not be necessary but might be helpful and at least reduce fragmentation. * Add an assert to detect if the wakecode ever grows too big. We include 1 KB for stack, which should be more than enough also. * Remove unnecessary initialization of static variables. * Add comments and a bootverbose print giving the page phys address.	2006-06-10 08:20:17 +00:00
njl	66b0070261	Minor tweaks to the resume code. Previous commit reverted alignment back to 4. There is no need to be more strict at assembly time since we copy the code anyway to a private page. * Clear the direction flag and eflags. Probably not necessary but it won't hurt to be safe. * Add prefixes to all instructions to prevent any assembler mistakes. * Remove zeroing of eax - edi. We use those registers immediately after to transfer values to protected mode so this was pointless. * Update comments to reflect info found during code review.	2006-06-10 08:20:03 +00:00
njl	00c07c3991	Move the reset beep tunable/sysctl to debug.acpi.resume_beep. This makes more sense than under hw.acpi. Also, document this in the man page.	2006-06-10 08:06:16 +00:00
njl	b8f1ff9a05	Minor tweaks to the resume code that might help people debug. * Add hw.acpi.resume_beep tunable and sysctl, default to 0. Beeps the PC speaker soon after waking to diagnose whether the wakeup code is even getting run before other drivers possibly hang the system. To stop the beep, cause another beep (i.e. keyboard bell). Submitted by takawata@, I changed the frequency to be lower. * Use 4096 instead of 4 byte alignment. Might be useful although doesn't seem to be necessary. * Remove a useless assignment to acpi_reset_video. It was overwritten by the default sysctl value anyway.	2006-06-08 17:54:10 +00:00
alc	ff4adb11fe	Introduce the function pmap_enter_object(). It maps a sequence of resident pages from the same object. Use it in vm_map_pmap_enter() to reduce the locking overhead of premapping objects. Reviewed by: tegge@	2006-06-05 20:35:27 +00:00
emaste	b9360f5c27	Fix cut-n-pasteo: use the i386 version #define for i386 dumps, not the amd64 one.	2006-06-05 18:21:29 +00:00
alc	efb5d1da26	MFamd64 Eliminate unnecessary, recursive acquisitions and releases of the page queues lock by free_pv_entry() and pmap_remove_pages(). Reduce the scope of the page queues lock in pmap_remove_pages().	2006-06-05 06:08:21 +00:00
silby	89bd691dee	After much discussion with mjacob and scottl, change bus_dmamem_alloc so that it just warns the user with a printf when it misaligns a piece of memory that was requested through a busdma tag. Some drivers (such as mpt, and probably others) were asking for alignments that could not be satisfied, but as far as driver operation was concerned, that did not matter. In the theory that other drivers will fall into this same category, we agreed that panicing or making the allocation fail will cause more hardship than is necessary. The printf should be sufficient motivation to get the driver glitch fixed.	2006-06-01 04:49:29 +00:00
mjacob	1b7bd7c5ee	Turn the panic on not being able to meet alignment constraints in bus_dmamem_alloc into the more reasonable EINVAL return. Also, reclaim memory allocated but then not used if we had an error return.	2006-05-31 00:37:56 +00:00
davidxu	42175dc944	Clear invalid bits only if CPU supports SSE, otherwise, some fields in struct save87 will be cleared unexpectly.	2006-05-31 00:17:29 +00:00
davidxu	fa9df4abe1	Use the method described in IA-32 Intel Architecture Software Developer's Manual chapter 11.6.6 to get valid mxcsr bits, use the mxcsr mask to clear invalid bits passed by user code. Reviewed by: bde	2006-05-30 23:44:21 +00:00
davidxu	b60160771c	Backout changes trying to inherit floating-point environment, although POSIX (susv3) requires this, but it is unclear what should be inherited, duplicating whole 387 stack for new thread seems to be unnecessary and dangerous. Revert to previous code, force a new thread to be started with clean FP state.	2006-05-29 02:58:37 +00:00
silby	7f96e8451a	Add a quick hack to ensure that bus_dmamem_alloc properly aligns small allocations with large alignment requirements. Add a panic to detect cases where we've still failed to properly align.	2006-05-28 18:30:36 +00:00
davidxu	dc6d8065e6	Clear high 16 bits of mxcsr register, according to Intel document, if the high 16 bits is non-zero, fxrstor instruction will generate GP fault, resulting kernel crash, this bug can be triggered by setcontext and ptrace(PT_SETXMMREGS).	2006-05-28 06:51:57 +00:00
davidxu	9cd9aeea7a	PCB_NPXINITDONE is cleared by npx_fork_thread.	2006-05-28 04:47:56 +00:00
davidxu	03c7322bae	If parent thread never used FPU, the only work is to clear flag PCB_NPXINITDONE for new thread and let trap code initialize it.	2006-05-28 04:40:45 +00:00
davidxu	bf6b4844d3	When creating a new thread, inherit floating-point environment from current thread, this is required by POSIX pthread_create document.	2006-05-28 02:03:13 +00:00
imp	7854550aa7	APM was calling the suspend process from a timeout. This meant that other timeouts could not happen while suspending, including timeouts for things like msleep. This caused the system to hang on suspend when the cbb was enabled, since its suspend path powered down the socket which used a timeout to wait for it to be done. APM now creates a thread when it is enabled, and deletes the thread when it is disabled. This thread takes the place of the timeout by doing its polling every ~.9s. When the thread is disabled, it will wakeup early, otherwise it times out and polls the varius things the old timeout polled (APM events, suspend delays, etc). This makes my Sony VAIO 505TS suspend/resume correctly when APM is enabled (ACPI is black listed on my 505TS). This will likely fix other problems with the suspend path where drivers would sleep with msleep and/or do other timeouts. Maybe there's some special case code that would use DELAY while suspending and msleep otherwise that can be revisited and removed. This was also tested by glebius@, who pointed out that in the patch I sent him, I'd forgotten apm_saver.c MFC After: 3 weeks	2006-05-25 23:06:38 +00:00
sobomax	210b6777a4	Move clock_lock prototype into <machine/clock.h>, where it is more appropriate. Discussed with: jhb	2006-05-19 18:53:50 +00:00
marius	70daffddff	- Add C-bus and ISA front-ends for le(4) so it can actually replace lnc(4) on PC98 and i386. The ISA front-end supports the same non-PNP network cards as lnc(4) did and additionally a couple of PNP ones. Like lnc(4), the C-bus front-end of le(4) only supports C-NET(98)S and is untested due to lack of such hardware, but given that's it's based on the respective lnc(4) and not too different from the ISA front-end it should be highly likely to work. - Remove the descriptions of le(4), which where converted from lnc(4), from sys/i386/conf/NOTES and sys/pc98/conf/NOTES as there's a common one in sys/conf/NOTES.	2006-05-17 21:25:23 +00:00
marius	0d3e65af24	- As only the PCI front-end of le(4) is common to all platforms move its entry to the PCI NICs section so it's in the same spot in all GENERIC config files. - Add a note to the description of pcn(4) informing that is has precedence over le(4).	2006-05-17 20:44:01 +00:00
phk	537a82e24b	Send the pcvt(4) driver off to retirement.	2006-05-17 09:33:15 +00:00
phk	ef310efff8	Since DELAY() was moved, most <machine/clock.h> #includes have been unnecessary.	2006-05-16 14:37:58 +00:00
ru	c249b5bd38	Kill more references to lnc(4). Submitted by: grep(1)	2006-05-16 12:15:39 +00:00
marius	be5f202f36	Remove some remnants of lnc(4).	2006-05-14 18:49:25 +00:00
gnn	d1e0397ab9	Prefer the le device driver for Lance (AMD7990 et al) hardware over the older, and less capable lnc driver. Reviewed by: imp	2006-05-14 01:40:41 +00:00
peter	c0cb1adae1	Test commit after repoman upgrade. Remove one of my many email addresses from a copyright message.	2006-05-12 22:41:58 +00:00
peter	a7162e4983	Test commit after repoman upgrade. Remove one of my many email addresses from a coyright message.	2006-05-12 22:38:53 +00:00
njl	1d3b84d7cb	Add support for the VIA C7-M processor family. Remove an unnecessary check of the table's bus clock. CPUs that support this feature export only the high/low settings via the MSR, packed into 32 bits. Hardware from: Centaur Technologies MFC after: 1 week	2006-05-11 17:35:44 +00:00
phk	5d8c57a08b	Clean out sysctl machdep.* related defines. The cmos clock related stuff should really be in MI code.	2006-05-11 17:29:25 +00:00
netchild	021fd75458	regen (linux rt_sigpending)	2006-05-10 18:19:51 +00:00
netchild	24c492f42c	Implement rt_sigpending in the linuxolator. PR: 92671 Submitted by: Markus Niemist"o <markus.niemisto@gmx.net>	2006-05-10 18:17:29 +00:00
sam	0b63676c43	make tinderbox happy: GENERIC got ath and wlan added so we need to now mark these "nodevice" or we'll get undefined references	2006-05-10 05:19:21 +00:00
ambrisko	f7d4a6b03b	Add in linsysfs. A linux 2.6 like sys filesystem to pacify the Linux LSI MegaRAID SAS utility. Sponsored by: IronPort Systems Man page help from: brueffer	2006-05-09 22:27:01 +00:00
maxim	d447c4f045	o Add acpi_ibm to the build. PR: kern/96940 Submitted by: Rong-En Fan	2006-05-07 20:13:18 +00:00
ambrisko	31b22ce017	Enhance the Linux emulation layer to make MegaRAID SAS managements tool happy. Add back in a scheme to emulate old type major/minor numbers via hooks into stat, linprocfs to return major/minors that Linux app's expect. Currently only /dev/null is always registered. Drivers can register via the Linux type shim similar to the ioctl shim but by using linux_device_register_handler/linux_device_unregister_handler functions. The structure is: struct linux_device_handler { char bsd_driver_name; char linux_driver_name; char bsd_device_name; char linux_device_name; int linux_major; int linux_minor; int linux_char_device; }; Linprocfs uses this to display the major number of the driver. The soon to be available linsysfs will use it to fill in the driver name. Linux_stat uses it to translate the major/minor into Linux type values. Note major numbers are dynamically assigned via passing in a -1 for the major number so we don't need to keep track of them. This is somewhat needed due to us switching to our devfs. MegaCli will not run until I add in the linsysfs and mfi Linux compat changes. Sponsored by: IronPort Systems	2006-05-05 16:10:45 +00:00
sam	15945a996f	add ath and wlan crypto support Requested by: many MFC after: 1 month	2006-05-03 18:13:11 +00:00
scottl	7ef1f80fdd	Allow bus_dmamap_load() to pass ENOMEM back to the caller. This puts it into conformance with the mbuf and uio load routines. ENOMEM can only happen with BUS_DMA_NOWAIT is passed in, thus the deferals are disabled. I don't like doing this, but fixing this fixes assumptions in other important drivers, which is a net benefit for now.	2006-05-03 04:14:17 +00:00
jhb	00bb13261b	Add various constants for the PAT MSR and the PAT PTE and PDE flags. Initialize the PAT MSR during boot to map PAT type 2 to Write-Combining (WC) instead of Uncached (UC-). MFC after: 1 month	2006-05-01 22:07:00 +00:00
jhb	ca8d347695	Add a new 'pmap_invalidate_cache()' to flush the CPU caches via the wbinvd() instruction. This includes a new IPI so that all CPU caches on all CPUs are flushed for the SMP case. MFC after: 1 month	2006-05-01 21:36:47 +00:00
peter	4db7dec298	Using an idea from Stephan Uphoff, use the empty pte's that correspond to the unused kva in the pv memory block to thread a freelist through. This allows us to free pages that used to be used for pv entry chunks since we can now track holes in the kva memory block. Idea from: ups	2006-05-01 21:22:38 +00:00
peter	b5fd7cea55	Fix missing changes required for the amd64->i386 conversion. Add the missing VM_ALLOC_WIRED flags to vm_page_alloc() calls I added. Submitted by: alc	2006-05-01 19:57:00 +00:00
marcel	193a6144b9	Rewrite of puc(4). Significant changes are: o Properly use rman(9) to manage resources. This eliminates the need to puc-specific hacks to rman. It also allows devinfo(8) to be used to find out the specific assignment of resources to serial/parallel ports. o Compress the PCI device "database" by optimizing for the common case and to use a procedural interface to handle the exceptions. The procedural interface also generalizes the need to setup the hardware (program chipsets, program clock frequencies). o Eliminate the need for PUC_FASTINTR. Serdev devices are fast by default and non-serdev devices are handled by the bus. o Use the serdev I/F to collect interrupt status and to handle interrupts across ports in priority order. o Sync the PCI device configuration to include devices found in NetBSD and not yet merged to FreeBSD. o Add support for Quatech 2, 4 and 8 port UARTs. o Add support for a couple dozen Timedia serial cards as found in Linux.	2006-04-28 21:21:53 +00:00
peter	725e9bf143	Interim fix for pmap problems I introduced with my last commit. Remove the code to dyanmically change the pv_entry limits. Go back to a single fixed kva reservation for pv entries, like was done before when using the uma zone. Go back to never freeing pages back to the free pool after they are no longer used, just like before. This stops the lock order reversal due to aquiring the kernel map lock while pmap was locked. This fixes the recursive panic if invariants are enabled. The problem was that allocating/freeing kva causes vm_map_entry nodes to be allocated/freed. That can recurse back into pmap as new pages are hooked up to kvm and hence all the problem. Allocating/freeing kva indirectly allocate/frees memory. So, by going back to a single fixed size kva block and an index, we avoid the recursion panics and the LOR. The problem is that now with a linear block of kva, we have no mechanism to track holes once pages are freed. UMA has the same problem when using custom object for a zone and a fixed reservation of kva. Simple solutions like having a bitmap would work, but would be very inefficient when there are hundreds of thousands of bits in the map. A first-free pointer is similarly flawed because pages can be freed at random and the first-free pointer would be rewinding huge amounts. If we could allocate memory for tree strucures or an external freelist, that would work. Except we cannot allocate/free memory here because we cannot allocate/free address space to use it in. Anyway, my change here reverts back to the UMA behavior of not freeing pages for now, thereby avoiding holes in the map. ups@ had a truely evil idea that I'll investigate. It should allow freeing unused pages again by giving us a no-cost way to track the holes in the kva block. But in the meantime, this should get people booting with witness and/or invariants again. Footnote: amd64 doesn't have this problem because of the direct map access method. I'd done all my witness/invariants testing there. I'd never considered that the harmless-looking kmem_alloc/kmem_free calls would cause such a problem and it didn't show up on the boot test.	2006-04-28 19:05:08 +00:00
sobomax	1e82b0de25	Unbreak pc98. Sorry...	2006-04-28 03:38:23 +00:00
alc	da3edd51a2	In general, bits in the page directory entry (PDE) and the page table entry (PTE) have the same meaning. The exception to this rule is the eighth bit (0x080). It is the PS bit in a PDE and the PAT bit in a PTE. This change avoids the possibility that pmap_enter() confuses a PAT bit with a PS bit, avoiding a panic(). Eliminate a diagnostic printf() from the i386 pmap_enter() that serves no current purpose, i.e., I've seen no bug reports in the last two years that are helped by this printf(). Reviewed by: jhb	2006-04-27 21:26:25 +00:00
scottl	8bb256220b	Add the rr232x driver to the default kernels.	2006-04-27 20:58:24 +00:00
sobomax	b5181c79ef	In the case when reset via keyboard controller doesn't work for some reason (i.e. no keyboard controller present), try two other common methods for resetting i386 machine - pci reset and port 0x92 fast reset. Only if neither works warn user and resort to "unmap entire address space and hope for good" hack. This makes my MacBook Pro rebooting just fine and should also help other legacy-free hardware out there. Also, disable interrupts unconditionally in cpu_reset_real(), since we don't want any interference. MFC after: 1 week	2006-04-27 05:18:26 +00:00
delphij	fc73c02211	Fix build on i386	2006-04-27 05:02:21 +00:00
peter	8165896908	MFamd64: shrink pv entries from 24 bytes to about 12 bytes. (336 pv entries per page = effectively 12.19 bytes per pv entry after overheads). Instead of using a shared UMA zone for 24 byte pv entries (two 8-byte tailq nodes, a 4 byte pointer, and a 4 byte address), we allocate a page at a time per process. This provides 336 pv entries per process (actually, per pmap address space) and eliminates one of the 8-byte tailq entries since we now can track per-process pv entries implicitly. The pointer to the pmap can be eliminated by doing address arithmetic to find the metadata on the page headers to find a single pointer shared by all 336 entries. There is an 11-int bitmap for the freelist of those 336 entries. This is mostly a mechanical conversion from amd64, except: * i386 has to allocate kvm and map the pages, amd64 has them outside of kvm * native word size is smaller, so bitmaps etc become 32 bit instead of 64 * no dump_add_page() etc stuff because they are in kvm always. * various pmap internals tweaks because pmap uses direct map on amd64 but on i386 it has to use sched_pin and temporary mappings. Also, sysctl vm.pmap.pv_entry_max and vm.pmap.shpgperproc are now dynamic sysctls. Like on amd64, i386 can now tune the pv entry limits without a recompile or reboot. This is important because of the following scenario. If you have a 1GB file (262144 pages) mmap()ed into 50 processes, that requires 13 million pv entries. At 24 bytes per pv entry, that is 314MB of ram and kvm, while at 12 bytes it is 157MB. A 157MB saving is significant. Test-run by: scottl (Thanks!)	2006-04-26 21:49:20 +00:00
jkim	18e73c2320	Check if reported HTT cores are physical cores. This commit does not affect AMD CPUs at all because HTT bit is disabled earlier. Intel multicore CPUs and ULE scheduler may be affected.	2006-04-25 00:06:37 +00:00
jkim	eefd58df92	Add another Intel CPU feature flag, xTPR (Send Task Priority Messages).	2006-04-24 22:56:57 +00:00
jkim	6b218fc19f	Check if deterministic cache parameters leaf is valid before use.	2006-04-24 22:23:52 +00:00
cperciva	900c118819	Adjust dangerous-shared-cache-detection logic from "all shared data caches are dangerous" to "a shared L1 data cache is dangerous". This is a compromise between paranoia and performance: Unlike the L1 cache, nobody has publicly demonstrated a cryptographic side channel which exploits the L2 cache -- this is harder due to the larger size, lower bandwidth, and greater associativity -- and prohibiting shared L2 caches turns Intel Core Duo processors into Intel Core Solo processors. As before, the 'machdep.hyperthreading_allowed' sysctl will allow even the L1 data cache to be shared. Discussed with: jhb, scottl Security: See FreeBSD-SA-05:09.htt for background material.	2006-04-24 21:17:01 +00:00
delphij	da32f1fb9a	Move AHC_REG_PRETTY_PRINT and AHD_REG_PRETTY_PRINT below their corresponding devices.	2006-04-24 08:44:34 +00:00
peter	3fd2125c99	Merge minidumps from amd64 where they were originally developed. Major differences: * since there is no direct map region, there is no custom uma memory allocator to modify to include its pages in the dumps. * Various data entries are reduced from 64 bit to 32 bit to match the native size. dump_add_page() and dump_drop_page() are still present in case one wants to arrange for arbitary pages to be dumped. This is of marginal use though because libkvm+kgdb cannot address physical memory that isn't mapped into kvm.	2006-04-21 04:28:43 +00:00
imp	2a2593f381	Set the rid of the resource we're about to return to the user.	2006-04-20 04:10:27 +00:00
cperciva	51d1ca0f6e	Correct a local information leakage bug affecting AMD FPUs. Security: FreeBSD-SA-06:14.fpu	2006-04-19 07:00:19 +00:00
iwasaki	0613b693d0	Import ACPI Dock Station support. Note that this is still very young. Additional detach implementaions (or maybe improvement) for other deivce drivers is required. Reviewed by: njl, imp MFC after: 1 week	2006-04-15 12:31:34 +00:00
alc	a7e3d6f83b	Retire pmap_track_modified(). We no longer need it because we do not create managed mappings within the clean submap. To prevent regressions, add assertions blocking the creation of managed mappings within the clean submap. Reviewed by: tegge	2006-04-12 04:22:52 +00:00
ps	cc2c59e66f	Hook bce up to the build	2006-04-10 20:04:22 +00:00
jhb	dca0aae557	- Don't set CR0_NE and CR0_MP in npx_probe() as they are already set earlier in cpu_setregs(). - If we know this CPU has a FPU via cpuid, then just assume the INT16 interface and make the npx device quiet to not clutter the dmesg. This is true for all Pentium and later CPUs and even some of the later 486dx CPUs. Reviewed by: bde Tested by: ps MFC after: 1 week	2006-04-06 17:17:45 +00:00
jhb	1dfdfa5677	Cache the value of the lower half of each I/O APIC redirection table entry so that we only have to do an ioapic_write() instead of an ioapic_read() followed by an ioapic_write() every time we mask and unmask level triggered interrupts. This cuts the execution time for these operations roughly in half. Profiled by: Paolo Pisati <p.pisati@oltrelinux.com> MFC after: 1 week	2006-04-05 20:43:19 +00:00
jkoshy	33b6f0c5ea	Freshen a comment. Reviewed by: jhb	2006-04-04 02:26:45 +00:00
marcel	8278e2d5fb	Eliminate HAVE_STOPPEDPCBS. On ia64 the PCPU holds a pointer to the PCB in which the context of stopped CPUs is stored. To access this PCB from KDB, we introduce a new define, called KDB_STOPPEDPCB. The definition, when present, lives in <machine/kdb.h> and abstracts where MD code saves the context. Define KDB_STOPPEDPCB on i386, amd64, alpha and sparc64 in accordance to previous code.	2006-04-03 22:51:47 +00:00
peter	0f363b7d24	Remove the unused sva and eva arguments from pmap_remove_pages().	2006-04-03 21:16:10 +00:00
alc	af01e3f809	Introduce pmap_try_insert_pv_entry(), a function that conditionally creates a pv entry if the number of entries is below the high water mark for pv entries. Use pmap_try_insert_pv_entry() in pmap_copy() instead of pmap_insert_entry(). This avoids possible recursion on a pmap lock in get_pv_entry(). Eliminate the explicit low-memory checks in pmap_copy(). The check that the number of pv entries was below the high water mark was largely ineffective because it was located in the outer loop rather than the inner loop where pv entries were allocated. Instead of checking, we attempt the allocation and handle the failure. Reviewed by: tegge Reported by: kris MFC after: 5 days	2006-04-02 05:45:05 +00:00
emax	bce2a6b523	Add kbdmux(4) to GENERIC Requested by: scottl	2006-03-31 19:03:37 +00:00
scottl	725c458dc3	Hook the MFI driver up to the build.	2006-03-29 09:57:22 +00:00
des	af5e05fb0b	Use wrapper macros for atomic pointer operations in order to perform the correct casts. This should probably be merged to other architectures.	2006-03-28 14:34:48 +00:00
jhb	3718b3713e	If the XSDT address in the RSDP for an ACPI 2.0 machine is NULL, then fall back to using the RSDT instead. ACPI-CA already follows this same strategy as a workaround for yet another instance of brain-damaged BIOS writers. PR: i386/93963 Submitted by: Masayuki FUKUI <fukui.FreeBSD@fanet.net>	2006-03-27 15:59:48 +00:00
alc	108c9331c3	Eliminate unnecessary invalidations of the entire TLB by pmap_remove(). Specifically, on mappings with PG_G set pmap_remove() not only performs the necessary per-page invlpg invalidations but also performs an unnecessary invalidation of the entire set of non-PG_G entries. Reviewed by: tegge	2006-03-21 18:07:42 +00:00
davidxu	9f834e1bd5	Remove stale KSE code. Reviewed by: alc	2006-03-21 06:46:27 +00:00
jhb	e11865e4b1	Drop some unneeded casts since we program the kernel in C rather than C++.	2006-03-20 19:39:08 +00:00
netchild	39276e2b1e	regen	2006-03-18 20:49:01 +00:00
netchild	d1db96cb48	Fixup some problems in my previous commit (COMPAT_43). Pointyhat to: netchild	2006-03-18 20:47:36 +00:00
netchild	8fd6664412	regen after COMPAT_43 removal	2006-03-18 18:24:38 +00:00
netchild	c1829f604c	Get rid of the need of COMPAT_43 in the linuxolator. Submitted by: Divacky Roman <xdivac02@stud.fit.vutbr.cz> Obtained from: DragonFly (some parts)	2006-03-18 18:20:17 +00:00
jhb	aaa33da2ed	Don't allow userland to set hardware watch points on kernel memory at all. Previously, we tried to allow this only for root. However, we were calling suser() on the target process rather than the current process. This means that if you can ptrace() a process running as root you can set a hardware watch point in the kernel. In practice I think you probably have to be root in order to pass the p_candebug() checks in ptrace() to attach to a process running as root anyway. Rather than fix the suser(), I just axed the entire idea, as I can't think of any good reason _at all_ for userland to set hardware watch points for KVM. MFC after: 3 days Also thinks hardware watch points on KVM from userland are bad: bde, rwatson	2006-03-14 16:13:55 +00:00
davidxu	651e183fe9	It is not necessary to read %gs twice.	2006-03-10 05:55:26 +00:00
davidxu	b36ad9d674	Fix stack offset to allow gcc's stack aligment code to work correctly. MFC after: 3 days	2006-03-10 02:54:45 +00:00
jhb	329536bd48	Flip the switch and don't route interrupts to hyperthreads in a HT system. In at least one benchmark this showed around a 20% performance increase. If other workloads do benefit from having hyperthreads service interrupts, we can always make this a loader tunable. MFC after: 3 days Tested by: ps	2006-03-09 16:38:52 +00:00
phk	67fc39f642	Improve the advantech watchdog.	2006-03-06 07:43:28 +00:00
yar	0ac62e02bd	Take the functionality contained in the former "options TDFX_LINUX" into a separate module. Accordingly, convert the option into a device named similarly. Note for MFC: Perhaps the option should stay in RELENG_6 for POLA reasons. Suggested by: scottl Reviewed by: cokane MFC after: 5 days	2006-03-03 21:37:38 +00:00
netchild	3d39f08ccd	- use a more common style to print memory sizes - add some more cache sizes (2nd and 3rd level) [1] Submitted by: HATANOU Tomomi <hatanou@infolab.ne.jp> [1] PR: 91328 [1]	2006-03-03 18:54:05 +00:00
rink	c057d8091b	Committed the xbox syscons(8)-able console driver. Reviewed by: arch@ (no comments) Approved by: imp (mentor)	2006-03-03 14:52:57 +00:00
scottl	d849f4e1ca	iir works on PAE now.	2006-03-03 04:30:18 +00:00
jhb	3478c467ee	Rework how we wire up interrupt sources to CPUs: - Throw out all of the logical APIC ID stuff. The Intel docs are somewhat ambiguous, but it seems that the "flat" cluster model we are currently using is only supported on Pentium and P6 family CPUs. The other "hierarchy" cluster model that is supported on all Intel CPUs with local APICs is severely underdocumented. For example, it's not clear if the OS needs to glean the topology of the APIC hierarchy from somewhere (neither ACPI nor MP Table include it) and setup the logical clusters based on the physical hierarchy or not. Not only that, but on certain Intel chipsets, even though there were 4 CPUs in a logical cluster, all the interrupts were only sent to one CPU anyway. - We now bind interrupts to individual CPUs using physical addressing via the local APIC IDs. This code has also moved out of the ioapic PIC driver and into the common interrupt source code so that it can be shared with MSI interrupt sources since MSI is addressed to APICs the same way that I/O APIC pins are. - Interrupt source classes grow a new method pic_assign_cpu() to bind an interrupt source to a specific local APIC ID. - The SMP code now tells the interrupt code which CPUs are avaiable to handle interrupts in a simpler and more intuitive manner. For one thing, it means we could now choose to not route interrupts to HT cores if we wanted to (this code is currently in place in fact, but under an #if 0 for now). - For now we simply do static round-robin of IRQs to CPUs when the first interrupt handler just as before, with the change that IRQs are now bound to individual CPUs rather than groups of up to 4 CPUs. - Because the IRQ to CPU mapping has now been moved up a layer, it would be easier to manage this mapping from higher levels. For example, we could allow drivers to specify a CPU affinity map for their interrupts, or we could allow a userland tool to bind IRQs to specific CPUs. The MFC is tentative, but I want to see if this fixes problems some folks had with UP APIC kernels on 6.0 on SMP machines (an SMP kernel would work fine, but a UP APIC kernel (such as GENERIC in RELENG_6) would lose interrupts). MFC after: 1 week	2006-02-28 22:24:55 +00:00
cperciva	8a3d42569d	Add frequency-voltage tables for Intel 778, 758, 773, 753, and 733J processors. Obtained from: Intel Datasheet 302189-008	2006-02-25 04:55:38 +00:00
sam	116633743d	guard function decls with _KERNEL so user code can include this file MFC after: 1 week	2006-02-22 21:38:33 +00:00
jhb	ff9c76bccd	Close some races between procfs/ptrace and exit(2): - Reorder the events in exit(2) slightly so that we trigger the S_EXIT stop event earlier. After we have signalled that, we set P_WEXIT and then wait for any processes with a hold on the vmspace via PHOLD to release it. PHOLD now KASSERT()'s that P_WEXIT is clear when it is invoked, and PRELE now does a wakeup if P_WEXIT is set and p_lock drops to zero. - Change proc_rwmem() to require that the processing read from has its vmspace held via PHOLD by the caller and get rid of all the junk to screw around with the vmspace reference count as we no longer need it. - In ptrace() and pseudofs(), treat a process with P_WEXIT set as if it doesn't exist. - Only do one PHOLD in kern_ptrace() now, and do it earlier so it covers FIX_SSTEP() (since on alpha at least this can end up calling proc_rwmem() to clear an earlier single-step simualted via a breakpoint). We only do one to avoid races. Also, by making the EINVAL error for unknown requests be part of the default: case in the switch, the various switch cases can now just break out to return which removes a _lot_ of duplicated PRELE and proc unlocks, etc. Also, it fixes at least one bug where a LWP ptrace command could return EINVAL with the proc lock still held. - Changed the locking for ptrace_single_step(), ptrace_set_pc(), and ptrace_clear_single_step() to always be called with the proc lock held (it was a mixed bag previously). Alpha and arm have to drop the lock while the mess around with breakpoints, but other archs avoid extra lock release/acquires in ptrace(). I did have to fix a couple of other consumers in kern_kse and a few other places to hold the proc lock and PHOLD. Tested by: ps (1 mostly, but some bits of 2-4 as well) MFC after: 1 week	2006-02-22 18:57:50 +00:00
tegge	a9e07140a7	Rounding addr upwards to next 4M or 2M boundary in pmap_growkernel() could cause addr to become 0, resulting in an early return without populating the last PDE. Reviewed by: alc	2006-02-16 22:10:57 +00:00
dwmalone	fab7fda621	It seems bit 5 of cpu_feature2 is the VMX (Virtual Machine Extensions) bit. While I'm here, delete a comment that was cut and past from the cpu_features code that doesn't belong here.	2006-02-15 14:48:59 +00:00
phk	79081baaf0	CPU time accounting speedup (step 2) Keep accounting time (in per-cpu) cputicks and the statistics counts in the thread and summarize into struct proc when at context switch. Don't reach across CPUs in calcru(). Add code to calibrate the top speed of cpu_tickrate() for variable cpu_tick hardware (like TSC on power managed machines). Don't enforce monotonicity (at least for now) in calcru. While the calibrated cpu_tickrate ramps up it may not be true. Use 27MHz counter on i386/Geode. Use TSC on amd64 & i386 if present. Use tick counter on sparc64	2006-02-11 09:33:07 +00:00
rink	34f7cafe2a	Cleaned the memory initialization up, moved some defines from the framebuffer to an include file. Reviewed by: imp Approved by: imp (mentor)	2006-02-10 18:48:22 +00:00
yar	89626cec4b	Avoid calling CPUID function 0x02 if the CPU reports no support for it. The former code used to hang older Intel CPUs by trying to get non-existent TLB info 2^32 times. Reduce code duplication around the calls to CPUID 0x02 by using do-while loops. PR: i386/92977 Tested by: cy	2006-02-09 09:10:54 +00:00
phk	74f8e63a10	Simplify system time accounting for profiling. Rename struct thread's td_sticks to td_pticks, we will need the other name for more appropriately named use shortly. Reduce it from uint64_t to u_int. Clear td_pticks whenever we enter the kernel instead of recording its value as reference for userret(). Use the absolute value of td->pticks in userret() and eliminate third argument.	2006-02-08 08:09:17 +00:00
phk	bb2f62f536	Modify the way we account for CPU time spent (step 1) Keep track of time spent by the cpu in various contexts in units of "cputicks" and scale to real-world microsec^H^H^H^H^H^H^H^Hclock_t only when somebody wants to inspect the numbers. For now "cputicks" are still derived from the current timecounter and therefore things should by definition remain sensible also on SMP machines. (The main reason for this first milestone commit is to verify that hypothesis.) On slower machines, the avoided multiplications to normalize timestams at every context switch, comes out as a 5-7% better score on the unixbench/context1 microbenchmark. On more modern hardware no change in performance is seen.	2006-02-07 21:22:02 +00:00
rwatson	69e67d72fa	Regenerate.	2006-02-06 22:15:00 +00:00
rwatson	a3306012b7	Assign audit event identifiers to ibcs2 system calls. Obtained from: TrustedBSD Project	2006-02-06 22:14:50 +00:00
jhb	ae432f93f2	- Always call exec_free_args() in kern_execve() instead of doing it in all the callers if the exec either succeeds or fails early. - Move the code to call exit1() if the exec fails after the vmspace is gone to the bottom of kern_execve() to cut down on some code duplication.	2006-02-06 22:06:54 +00:00
jhb	1f0c541bd1	Add a kern_eaccess() function and use it to implement xenix_eaccess() rather than kern_access(). Suggested by: rwatson	2006-02-06 22:00:53 +00:00
rwatson	3a79f09166	Regenerate.	2006-02-06 01:40:48 +00:00
rwatson	59732048da	Assign audit event identifiers to Linux i386 system calls. Obtained from: TrustedBSD Project	2006-02-06 01:40:30 +00:00
rwatson	e04cb6becd	Regenerate.	2006-02-05 23:28:46 +00:00
rwatson	a31938806b	Assign audit event identfiers to Xenix system calls. Note: AUE_EACCESS is assigned to xenix_eaccess() instead of AUE_ACCESS, as that is the intended meaning of the system call. xenix_eaccess() should be reimplemented using our native eaccess() implementation so that it works as intended. Obtained from: TrustedBSD Project	2006-02-05 23:28:01 +00:00
rwatson	7b3f1796f8	Correct help line: list targets, not names of files generated by targets when no argument is provided to make. MFC after: 1 week	2006-02-05 23:25:19 +00:00
rwatson	5570c55a29	Regenerate (accidentally also committed in commit that updated syscalls.isc).	2006-02-05 23:16:20 +00:00
rwatson	ee99b2f3a5	Assign audit event identifiers to ibcs2 ISC system calls. Obtained from: TrustedBSD Project	2006-02-05 23:15:22 +00:00
kensmith	92ff892e7e	Move asr driver from global NOTES to i386-specific NOTES. Requestor reports it is neither endian-clean or 64-bit clean. :-) Requested by: scottl	2006-02-05 05:06:04 +00:00
wsalamon	8ef4ce651a	Hook up the audit system to system call entry and exit. System calls will now be audited. Obtained from: TrustedBSD Project Approved by: rwatson (mentor)	2006-02-04 14:11:33 +00:00
rink	fc05a84231	Patch to allow XBox-users to use the onboard nve(4) nForce ethernet driver. The patch crudely forces the NIC out of operating mode before the nve(4) driver can initialize it; this is required to properly initialize the NIC. It is XBox-specific, as this condition can only occur on XBoxes (Most loaders will simply leave the NIC running, forcing us to use a crude workaround like this to get it in a workable condition). Due to the XBox-only aspect, this has been solved in XBox-specific initialization code and not within nve(4). Reviewed by: imp Approved by: imp (mentor) No objection: bz@, obrien@, q@ontheweb.com.au	2006-02-04 10:01:33 +00:00
davidxu	5e2e272cd9	Clear carry flag in get_mcontext so that setcontext does not return a bogus error. PR: misc/92110	2006-02-03 02:33:01 +00:00
davidxu	c013564a3c	Under verbose mode, correctly report L2 cache information for CPU which supports CPUID function 8000_0006h. Tested on: Pentum-M 750	2006-02-02 12:44:09 +00:00
davidxu	6ccd8f649b	Fix bug in L2 cache size detection code for CPU which supports CPUID function 8000_0006h. Tested on: Pentum-M 750	2006-02-02 11:54:40 +00:00
davidxu	c19e41cb59	Correctly report L2 cache size according to its code comment. Tested on my Dual PIII machine.	2006-02-02 06:35:50 +00:00
rik	34e5ca5b02	Attach ce(4) to the build. MFC after: 3 days	2006-01-31 23:11:35 +00:00
rik	655acdb819	Prepare for sconfig(8) update. Change also my e-mail.	2006-01-30 13:34:57 +00:00
jhb	2abda0c117	Call WITNESS_CHECK() in the page fault handler and immediately assume it is a fatal fault if we are holding any non-sleepable locks. This should cut down on the number of bogus LORs we currently get when the kernel panics due to a NULL (or bogus) pointer dereference that goes wandering off into the VM system which tries to acquire locks and then kicks off the spurious LORs. This should probably be ported to all the archs at some point. Tested on: i386	2006-01-27 22:22:10 +00:00
ups	e67b09cd21	Fix race conditions. Tested by: kris@ MFC after: 3 days	2006-01-23 15:46:09 +00:00
marius	2d1ab68b16	Remove the commented out entry of the old ISA-only le(4) driver which was retired 22 months ago. MFC after: 1 day	2006-01-21 12:38:35 +00:00
davidxu	f99d64bc80	Eliminate a stale instruction introduced in revision 1.136.	2006-01-18 06:42:42 +00:00
scottl	ec415604ac	Free the newtag if we exit with a failure from alloc_bounce_zone(). Found by: Coverity Prevent(tm)	2006-01-14 17:22:47 +00:00
phk	57be8af642	Move the old BSD4.3 tty compatibility from (!BURN_BRIDGES && COMPAT_43) to COMPAT_43TTY. Add COMPAT_43TTY to NOTES and */conf/GENERIC Compile tty_compat.c only under the new option. Spit out #warning "Old BSD tty API used, please upgrade." if ioctl_compat.h gets #included from userland.	2006-01-10 09:19:10 +00:00
imp	c2b2965b6a	By popular demand, move __HAVE_ACPI and __PCI_REROUTE_INTERRUPT into param.h. Per request, I've placed these just after the _NO_NAMESPACE_POLLUTION ifndef. I've not renamed anything yet, but may since we don't need the __. Submitted by: bde, jhb, scottl, many others.	2006-01-09 06:05:57 +00:00
jhb	170b22254d	- Make pcib_devclass private to sys/dev/pci/pci_pci.c and change all the various pcib drivers to use their own private devclass_t variables for their modules. - Use the DEFINE_CLASS_0() macro to declare drivers for the various pcib drivers while I'm here.	2006-01-06 19:22:19 +00:00
jhb	42cfa2cc9e	Fix various places that were testing td_critnest to see if interrupts should remain disabled during a trap or not to check td_md.md_spinlock_count instead.	2006-01-06 18:02:12 +00:00
netchild	24387e86c9	We don't support I386_CPU in 6.0 and later. This file can be cleaned up some to assume that '#if defined(I486_CPU) \|\| defined(I586_CPU) \|\| defined(I686_CPU)' is true. Suggested by: jhb Reviewed by: jhb	2006-01-04 20:11:04 +00:00
netchild	e426666183	- Make sure the cpu_exthigh variable is initialized (page coloring case). [1] - Remove a conditional in the AMD cache detection, it's always false. [2] - Don't try to detect a cache if only compiled for i386. Analyzed by: Antoine Brodin <antoine.brodin@laposte.net> [1] Submitted by: Antoine Brodin <antoine.brodin@laposte.net> [2]	2006-01-04 12:57:02 +00:00
phk	44d6de75f9	Use ttyalloc() instead of ttymalloc()	2006-01-04 09:46:20 +00:00
jhb	9a6ec66269	Fix a couple of issues with the ibcs2 module event handler. First, return success instead of EOPNOTSUPP when being loaded. Secondly, if there are no ibcs2 processes running when a MOD_UNLOAD request is made, break out to return success instead of falling through into the default case which returns EOPNOTSUPP. With these fixes, I can now kldload and subsequently kldunload the ibcs2 module. PR: kern/82026 (and several duplicates) Reported by: lots of folks MFC after: 1 week	2006-01-03 20:39:38 +00:00
jkim	ae104d9814	- Explicitly validate an empty filter to match bpf_filter() comment[1]. - Do not use BPF JIT compiler for an empty filter. [1] Pointed out by: darrenr	2006-01-03 20:26:03 +00:00
imp	8d9b67a0e3	Define __HAVE_ACPI and/or __PCI_REROUTE_INTERRUPT, as appropriate for each platform. These will be used in the pci code in preference to the complicated #ifdefs we have there now.	2006-01-01 20:59:28 +00:00
netchild	507a9b3e93	MI changes: - provide an interface (macros) to the page coloring part of the VM system, this allows to try different coloring algorithms without the need to touch every file [1] - make the page queue tuning values readable: sysctl vm.stats.pagequeue - autotuning of the page coloring values based upon the cache size instead of options in the kernel config (disabling of the page coloring as a kernel option is still possible) MD changes: - detection of the cache size: only IA32 and AMD64 (untested) contains cache size detection code, every other arch just comes with a dummy function (this results in the use of default values like it was the case without the autotuning of the page coloring) - print some more info on Intel CPU's (like we do on AMD and Transmeta CPU's) Note to AMD owners (IA32 and AMD64): please run "sysctl vm.stats.pagequeue" and report if the cache* values are zero (= bug in the cache detection code) or not. Based upon work by: Chad David <davidc@acns.ab.ca> [1] Reviewed by: alc, arch (in 2004) Discussed with: alc, Chad David, arch (in 2004)	2005-12-31 14:39:20 +00:00
davidxu	86c8c90d82	Remove pcb_switchout, it has not been used for a long time.	2005-12-29 13:23:48 +00:00
sobomax	34fa5a81a5	Remove kern.elf32.can_exec_dyn sysctl. Instead extend Brandinfo structure with flags bitfield and set BI_CAN_EXEC_DYN flag for all brands that usually allow executing elf dynamic binaries (aka shared libraries). When it is requested to execute ET_DYN elf image check if this flag is on after we know the elf brand allowing execution if so. PR: kern/87615 Submitted by: Marcin Koziej <creep@desk.pl>	2005-12-26 21:23:57 +00:00
davidxu	b1b4f02f94	Move global variable private_tss into per-cpu area. Reviewed by: jhb	2005-12-26 00:07:19 +00:00
jeff	f1d333e1f5	- Improve the INKERNEL macro such that it can no longer give false positives. This fixes the stack(9) functionality. Submitted by: Antoine Brodin <antoine.brodin@laposte.net>	2005-12-23 21:33:55 +00:00
jhb	cb0d490ebe	Tweak how the MD code calls the fooclock() methods some. Instead of passing a pointer to an opaque clockframe structure and requiring the MD code to supply CLKF_FOO() macros to extract needed values out of the opaque structure, just pass the needed values directly. In practice this means passing the pair (usermode, pc) to hardclock() and profclock() and passing the boolean (usermode) to hardclock_cpu() and hardclock_process(). Other details: - Axe clockframe and CLKF_FOO() macros on all architectures. Basically, all the archs were taking a trapframe and converting it into a clockframe one way or another. Now they can just extract the PC and usermode values directly out of the trapframe and pass it to fooclock(). - Renamed hardclock_process() to hardclock_cpu() as the latter is more accurate. - On Alpha, we now run profclock() at hz (profhz == hz) rather than at the slower stathz. - On Alpha, for the TurboLaser machines that don't have an 8254 timecounter, call hardclock() directly. This removes an extra conditional check from every clock interrupt on Alpha on the BSP. There is probably room for even further pruning here by changing Alpha to use the simplified timecounter we use on x86 with the lapic timer since we don't get interrupts from the 8254 on Alpha anyway. - On x86, clkintr() shouldn't ever be called now unless using_lapic_timer is false, so add a KASSERT() to that affect and remove a condition to slightly optimize the non-lapic case. - Change prototypeof arm_handler_execute() so that it's first arg is a trapframe pointer rather than a void pointer for clarity. - Use KCOUNT macro in profclock() to lookup the kernel profiling bucket. Tested on: alpha, amd64, arm, i386, ia64, sparc64 Reviewed by: bde (mostly)	2005-12-22 22:16:09 +00:00
imp	39e3e683e5	Move device 'cs' into i386/pc98 specific NOTES. It is broken on ppc because it uses i386 specific calls. Maybe it could be added to amd64, but I'm not so sure it would work there so I've not added it there.	2005-12-20 23:00:11 +00:00
jhb	fde66b5a2e	Move the hostb driver out of the i386 and amd64 PCI code (where it was duplicated anyways) and into a single MI driver. Extend the driver a bit to implement the bus and PCI kobj interfaces such that other drivers can attach to it and transparently act as if their parent device is the PCI bus (for the most part).	2005-12-20 21:09:45 +00:00
jhb	feebef55c2	Remove linux_mib_destroy() (which I actually added in between 5.0 and 5.1) which existed to cleanup the linux_osname mutex. Now that MTX_SYSINIT() has grown a SYSUNINIT to destroy mutexes on unload, the extra destroy here was redundant and resulted in panics in debug kernels. MFC after: 1 week Reported by: Goran Gajic ggajic at afrodita dot rcub dot bg dot ac dot yu	2005-12-15 16:30:41 +00:00
jhb	2bc0431d83	Fix stale comment.	2005-12-14 21:47:02 +00:00
peter	9b3d762efb	MFamd64 rev 1.223: Use the TSC to implement DELAY() if not marked broken and it has been calibrated.	2005-12-13 19:08:55 +00:00
jhb	3bd2c66449	Revert previous commit. The BIOS braindamage is even worse than I originally thought. The BIOS that cleared CPUID_APIC actually managed to disable the local APIC entirely and even Windows 64 doesn't boot on it. Reported by: bz	2005-12-13 18:29:10 +00:00
jhb	76cd6764a4	Don't check the CPUID_APIC bit in the cpu_features flags field to determine if the boot CPU has a local APIC because some BIOS vendors are not competent enough to set this bit. Instead, just assume that we always have a local APIC on amd64. For i386 the check is a bit more subtle. FreeBSD requires either an MP Table or an ACPI MADT table to enumerate APICs. The only systems that have one of those tables that don't have local APICs are some presumably rare (and old) SMP 486 systems using external APICs. Thus, instead of checking the CPUID_APIC flag, check the CPU class and abort if we are running on a 486. MFC after: 1 week Reported by: bz	2005-12-13 15:09:40 +00:00
rodrigc	82f361258c	Add support for 7320 and 915 PCIe chipsets. Submitted by: Gavin Atkinson <gavin.atkinson at ury dot york dot ac dot uk> PR: kern/79139 Reviewed by: scottl	2005-12-08 18:55:15 +00:00
jhb	a1d5c4c24e	Whitespace: reduce diffs with amd64.	2005-12-08 18:33:48 +00:00
jhb	0b37b8af54	- Cleanup whitespace and extra ()s in vtophys() macros. - Move vtophys() macros next to vtopte() where vtopte() exists to match comments above vtopte(). - Remove references to the alternate address space in the comment above vtopte(). amd64 never had the alternate address space, and i386 lost it prior to PAE support being added. - s/entires/entries/ in comments. Reviewed by: alc	2005-12-06 21:09:01 +00:00
jkim	18c4e589cc	Fix ZERO_EDX() macro from the previous commit. It was emitting `xor %ecx, %ecx', not `xor %edx, %edx'.	2005-12-06 20:11:07 +00:00
ru	f9739084f5	Drop _MACHINE_ARCH and _MACHINE defines (not to be confused with MACHINE_ARCH and MACHINE). Their purpose was to be able to test in cpp(1), but cpp(1) only understands integer type expressions. Using such unsupported expressions introduced a number of subtle bugs, which were discovered by compiling with -Wundef.	2005-12-06 13:27:21 +00:00
jkim	9fbde6681e	s/M_WAITOK/M_NOWAIT/ while mutex is held. Pointed out by: csjp	2005-12-06 07:22:01 +00:00
jkim	3bd9b70058	- Micro-optimize `mov $0, %edx' ->` xor %edx, %edx'. - Correct amd64 macro style (no functional change).	2005-12-06 06:45:39 +00:00
jkim	055dc8e121	Add experimental BPF Just-In-Time compiler for amd64 and i386. Use the following kernel configuration option to enable: options BPF_JITTER If you want to use bpf_filter() instead (e. g., debugging), do: sysctl net.bpf.jitter.enable=0 to turn it off. Currently BIOCSETWF and bpf_mtap2() are unsupported, and bpf_mtap() is partially supported because 1) no need, 2) avoid expensive m_copydata(9). Obtained from: WinPcap 3.1 (for i386)	2005-12-06 02:58:12 +00:00
jhb	c77d4150b7	Change the i386 code to pass the interrupt vector as a separate argument rather than embedding it in the intrframe as if_vec. This reduces diffs with amd64 somewhat. - Remove cf_vec from clockframe (it wasn't used anyway) and stop pushing dummy vector arguments for ipi_bitmap_handler() and lapic_handle_timer() since clockframe == trapframe now. - Fix ddb to handle stack traces across interrupt entry points that just have a trapframe on their stack and not a trapframe + vector. - Change intr_execute_handlers() to take a trapframe rather than an intrframe pointer. - Change lapic_handle_intr() and atpic_handle_intr() to take a vector and trapframe rather than an intrframe. - GC struct intrframe now that nothing uses it anymore. - GC CLOCK_TO_TRAPFRAME() and INTR_TO_TRAPFRAME(). Reviewed by: bde Requested by: peter	2005-12-05 22:39:09 +00:00
jhb	aa9c5f3cdd	- Move the code to deal with handling an IPI_STOP IPI out of ipi_nmi_handler() and into a new cpustop_handler() function. Change the Xcpustop IPI_STOP handler to call this function instead of duplicating all the same logic in assembly. - EOI the local APIC for the lapic timer interrupt in C rather than assembly. - Bump the lazypmap IPI counter if COUNT_IPIS is defined in C rather than assembly.	2005-12-05 22:25:41 +00:00
jhb	ec69f8e34c	Don't panic if IRQ 13 doesn't exist. On some machines (see previous commit to atpic.c) there may not be an IRQ 13. Instead, just keep going. If the INT16 interface doesn't work then we will eventually panic anyway. FWIW: We could probably just axe the support for IRQ 13 altogether at this point. The only thing we'd lose support for are 486sx systems with external 487 FPUs. MFC after: 1 week	2005-12-05 22:11:44 +00:00

... 3 4 5 6 7 ...

10964 Commits