freebsd-skq

Author	SHA1	Message	Date
Marcel Moolenaar	f2c49dd248	Revamp of the syscall path, exception and context handling. The prime objectives are: o Implement a syscall path based on the epc inststruction (see sys/ia64/ia64/syscall.s). o Revisit the places were we need to save and restore registers and define those contexts in terms of the register sets (see sys/ia64/include/_regset.h). Secundairy objectives: o Remove the requirement to use contigmalloc for kernel stacks. o Better handling of the high FP registers for SMP systems. o Switch to the new cpu_switch() and cpu_throw() semantics. o Add a good unwinder to reconstruct contexts for the rare cases we need to (see sys/contrib/ia64/libuwx) Many files are affected by this change. Functionally it boils down to: o The EPC syscall doesn't preserve registers it does not need to preserve and places the arguments differently on the stack. This affects libc and truss. o The address of the kernel page directory (kptdir) had to be unstaticized for use by the nested TLB fault handler. The name has been changed to ia64_kptdir to avoid conflicts. The renaming affects libkvm. o The trapframe only contains the special registers and the scratch registers. For syscalls using the EPC syscall path no scratch registers are saved. This affects all places where the trapframe is accessed. Most notably the unaligned access handler, the signal delivery code and the debugger. o Context switching only partly saves the special registers and the preserved registers. This affects cpu_switch() and triggered the move to the new semantics, which additionally affects cpu_throw(). o The high FP registers are either in the PCB or on some CPU. context switching for them is done lazily. This affects trap(). o The mcontext has room for all registers, but not all of them have to be defined in all cases. This mostly affects signal delivery code now. The *context syscalls are as of yet still unimplemented. Many details went into the removal of the requirement to use contigmalloc for kernel stacks. The details are mostly CPU specific and limited to exception_save() and exception_restore(). The few places where we create, destroy or switch stacks were mostly simplified by not having to construct physical addresses and additionally saving the virtual addresses for later use. Besides more efficient context saving and restoring, which of course yields a noticable speedup, this also fixes the dreaded SMP bootup problem as a side-effect. The details of which are still not fully understood. This change includes all the necessary backward compatibility code to have it handle older userland binaries that use the break instruction for syscalls. Support for break-based syscalls has been pessimized in favor of a clean implementation. Due to the overall better performance of the kernel, this will still be notived as an improvement if it's noticed at all. Approved by: re@ (jhb)	2003-05-16 21:26:42 +00:00
Don Lewis	1e9bc9f889	Detect that a vnode has been reclaimed while vflush() was waiting to lock the vnode and restart the loop. Vflush() is vulnerable since it does not hold a reference to the vnode and it holds no other locks while waiting for the vnode lock. The vnode will no longer be on the list when the loop is restarted. Approved by: re (rwatson)	2003-05-16 19:46:51 +00:00
Marcel Moolenaar	baf74b8876	o In pmap_install, don't prevent switching the pmap if we're switching to kernel_pmap. The pmap is not special enough. o Clear the active bit on the pmap we're switching out. o Fix some nearby style(9) bugs. Approved by: re@	2003-05-16 07:57:44 +00:00
Alan Cox	f820bc501e	Use vm_object_deallocate(), not vm_pager_deallocate(), to destroy a vm object. (vm_pager_deallocate() does not, in fact, destroy a vm object.) Approved by: re (scottl) Reviewed by: phk	2003-05-16 07:28:27 +00:00
Marcel Moolenaar	906f065725	Indent a comment. This makes 1.100. Still approved by: re@ (blanket)	2003-05-16 07:05:08 +00:00
Marcel Moolenaar	164d4986fd	Turn pmap_growkernel() into a critical section. While here, initialize kernel_vm_end in pmap_bootstrap. Don't delay the initialization until we need to grow the kernel VM space. This BTW happens twice before we enter either single- or multi-user mode. Don't adjust kernel_vm_end while growing based on whether the KPT contains a non-NULL entry. We trust kernel_vm_end to be correct and we make sure it's still correct after growing. Define virtual_avail and virtual_end in terms of VM_MIN_KERNEL_ADDRESS and VM_MAX_KERNEL_ADDRESS (resp). Don't hardcode region knowledge.	2003-05-16 07:03:15 +00:00
Marcel Moolenaar	8cc31ae5be	Revamp the RID allocation code: o Limit the size of the region ID map to 64KB. This gives a bitmap that is large enough to keep track of 2^19 numbers. The minimal map size is 32KB. The reason we limit the map size is that processor models may have implemented a 24-bit region ID, which would give a 2MB bitmap while the maximum number of allocations is always less than PID_MAX*5, which is less than 2^19. o Allocate all region IDs up-front. The slight downside of reserving more RIDs then a process needs (3 for ia64 native and 1 for ia32) is preferable over the call to pmap_ensure_rid() where RIDs are allocated on demand. On SMP systems this may lead to a race condition. o When allocating a region ID, don't use arc4random(). We're not interested in randomness or uniform distribution across the spectrum. We only need uniqueness. Random numbers may easily collide when the number of allocated RIDs is high, creating a possibly unbounded retry rate.	2003-05-16 06:40:40 +00:00
Marcel Moolenaar	75189cff08	Move the conditional definition of KSTACK_MAX_PAGES up ahead where it's more visible. Approved by: re@ (blanket)	2003-05-16 06:17:34 +00:00
Marcel Moolenaar	5551d84398	Sync the linker script with the one used by default for userland. Since ia64 only uses relocations with addend, remove the sections specific to non-addend relocations (.rel.*). Also remove C++ specific sections. Approved by: re@ (blanket)	2003-05-16 06:03:45 +00:00
Murray Stokely	a8a084fc17	Add variables for missing network drivers. PR: kern/51911 Submitted by: David Yeske <dyeske@yahoo.com> Approved by: re	2003-05-16 04:31:00 +00:00
Murray Stokely	4001e1ee2e	Add E-Tech ISA PnP modem ID. PR: kern/36692 Submitted by: Theo van Klaveren <t.vanklaveren@student.utwente.nl> Approved by: re (murray) MFC After: 3 days	2003-05-16 04:04:04 +00:00
David E. O'Brien	04ddc5dea6	Run $S/kern/genassym.sh with the correct NM. Approved by: re(blanket)	2003-05-16 02:27:17 +00:00
David E. O'Brien	8d542cb56d	Fix long standing bug that prevents the PT_CONTINUE, PT_KILL and PT_DETACH ptrace(2) requests from functioning as advertised in the manual page. As described in kern/35175, the PT_DETACH request will, under certain circumstances, pass an unwanted signal on to the traced process upan detaching from it. The PT_CONTINUE request will sometimes fail if you make it pass a signal that has "properties" that differ from the properties of the signal that origionally caused the traced process to be stopped. Since PT_KILL is nothing than PT_CONTINUE with SIGKILL, it is broken too. In the PT_KILL case, this leads to an unkillable process. PR: 44011 Submitted by: Mark Kettenis <kettenis@chello.nl> Approved by: re(jhb)	2003-05-16 01:34:23 +00:00
Robert Watson	98b2788832	Add a tunable/sysctl "hw.fxp_noflow" which disables flow control support on if_fxp cards. When flow control is enabled, if the operating system doesn't acknowledge the packet buffer filling, the card will begin to generate ethernet quench packets, but appears to get into a feedback loop of some sort, hosing local switches. This is a temporary workaround for 5.1: the ability to configure flow control should probably be exposed by some or another management interface on ethernet link layer devices. Approved by: re (bmah) Reviewed by: mux	2003-05-16 01:13:16 +00:00
Thomas Moestl	a93b6bf5e9	In cpu_fork(), initialize pcb_psl for the new process to PSL_KERNEL, instead of taking the (userland) eflags from the trap frame and masking out PSL_I. There is no need to inherit any flags from the forking process; the old method however can cause flags set in userland for the forking process to be bogusly set in kernel mode when the newly forked process runs for the first time (in particular PSL_T, which is set for userland when the process is single-stepped; this would cause trace traps in kernel mode). Approved by: re (jhb)	2003-05-16 01:10:33 +00:00
Robert Watson	c1dca9ab07	VOP_PATHCONF() requires a vnode lock; this patch adds locking to fpathconf(). The lock is held for direct calls to VOP_PATHCONF() in pathconf() already. Approved by: re (jhb) Pointed out by: DEBUG_VFS_LOCKS	2003-05-15 21:13:08 +00:00
Robert Watson	7042ac8cd7	This change grabs the vnode lock for NFS client vnodes when calling VOP_SETATTR() or VOP_GETATTR(); without these locks (a) VFS_DEBUG_LOCKS will panic, and (b) it may be possible to corrupt entries in the cached vnode attributes in the nfsnode, since nfsnode attribute cache data is also protected by the vnode lock. Approved by: re (jhb) Pointed out by: VFS_DEBUG_LOCKS	2003-05-15 21:12:08 +00:00
Robert Watson	62d4b85ec1	Jeff added locking assertions that the VV_ flags on vnodes were modified only while holding appropriate vnode locks. This patch slides the lock release for ufs_extattr_enable() to continue to hold the active vnode lock on a backing file until after the flag change; it also acquires a vnode lock when disabling an attribute and hence clearing a flag on the backing vnode. This permits VFS_DEBUG_LOCKS to run UFS1 extended attributes without panicking, as well as preventing a potential race and vnode flag problem. Approved by: re (jhb) Pointed out by: DEBUG_VFS_LOCKS	2003-05-15 21:07:33 +00:00
Bosko Milekic	11583f6c93	Make the mb_alloc low-watermark sysctl-tunable read-only and make netstat(1) not display it for now because its effects are not yet completely implemented and we're about to cut 5.2-RELEASE. This is temporary. Approved by: re (scottl, rwatson)	2003-05-15 19:05:28 +00:00
Julian Elischer	95f04def4b	fix a cut-n-paste error. in the case where the bridge node was closed down but a timeout still applied to it, the final reference to the node was freeing the private data structure using the wrong malloc type. Approved by: re@	2003-05-15 18:51:28 +00:00
Nate Lawson	d6061de923	Generalize a quirk for Asahi Optical-based cameras (i.e. Pentax). It appears all of the Optio series have the same problems. It might be a better approach eventually to add wildcard support to USB quirks. PR: kern/50271, kern/46369 Approved by: re (rwatson)	2003-05-15 17:36:22 +00:00
Nate Lawson	f410510b09	Add a quirk for OTi USB flash key. PR: kern/51825 Approved by: re (rwatson)	2003-05-15 17:35:35 +00:00
Thomas Moestl	18100346d1	Miscellaneous fixes: - Fix compilation without GEM_DEBUG. - Do not #define GEM_DEBUG by default; it adds overhead (due to bzero()ing RX space) and is not needed any more, since the driver is quite stable now. - Fix watchdog timeouts when failing to load TX packets. - Do not forcibly limit the number of descriptors used for a packet to GEM_NTXSEGS, by passing this number to bus_dma_tag_create(). There is no requirement for a limit any lower than the total number of available descriptors, and the present limit caused network problems due to mbuf chains requiring more descriptors. GEM_NTXSEGS is still used to estimate the interrupt window size, for which we just need an estimate. Approved by: re (rwatson)	2003-05-15 16:57:55 +00:00
Martin Blapp	f956e0b3f0	Only use a SIA/SYM media info block if no MII block is detected. The submitter of PR 32118 told me that this patch also fixes autoselecting for znyx 4 port cards (10baseT, 100baseTX did work already). PR: 32118 Reviewed by: imp Approved by: rwatson (re)	2003-05-15 16:53:29 +00:00
Marcel Moolenaar	794518cd6d	This file creates register sets based on the runtime specification. The advantage of using register sets is that you don't focus on each register seperately, but instead instroduce a level of abstraction. This reduces the chance of errors, and also simplifies the code. The register sers form the basis of everything register. The sets in this file are: struct _special contains all of the control related registers, such as instruction pointer and stack pointer. It also contains interrupt specific registers like the faulting address. The set is roughly split in 3 groups. The first contains the registers that define a context or thread. This is the only group that the kernel needs to switch threads. The second group contains registers needed in addition to the first group needed to switch userland threads. This group contains the thread pointer and the FP control register. The third group contains those registers we need for execption handling and are used on top of the first two groups. struct _callee_saved, struct _callee_saved_fp These sets contain the preserved registers, including the NaT after spilling. The general registers (including branch registers) are seperated from the FP registers for ptrace(2). struct _caller_saved, struct _caller_saved_fp These sets contain the scratch registers based on SDM 2.1, This means that both ar.csd and ar.ccd are included here, even though they contain ia32 segment register descriptions. We keep seperate NaT bits for scratch and preserved registers, because they are never saved/restored at the same time. struct _high_fp The upper 96 FP registers that can be enabled/disabled seperately on the CPU from the lower 32 FP registers. Due to the size of this set, we treat them specially, even though they are defined as scratch registers. CVS ----------------------------------------------------------------------	2003-05-15 08:36:03 +00:00
Marcel Moolenaar	4bae872201	This file contains elementary context related functions used to save and restore "sets" of registers in various places. The restorectx and swapctx functions are used by cpu_switch() and deal with the special registers, as well as the preserved registers. The callee_saved functions are used to save and restore the preserved registers (integer and floating-point). They are useful for signal delivery and ptrace support. The save_high_fp and restore_high_fp functions are used to "load" and "unload" to and from the CPU as part of lazy context switching. The ia32 specific context functions have been kept with the ia32 code. Approved by: re@ (blanket)	2003-05-15 08:08:32 +00:00
Marcel Moolenaar	1d67adffd6	This file contains the code that implements the syscall path based on the epc instruction. The epc instruction, given the permissions of the page in which the epc is located, allows the privilege level to be increased with little or no overhead. The previous privilege level is recorded in the current frame marker and is restored by a regular (function) return. Since the epc instruction has to live in a page with non-standard properties, we hardwire a "gateway" page in the address space. The address of the gateway page is exported to userland in ar.k7. This allows us to rewire the page without breaking the ABI. The syscall stubs in libc are regular function calls that slightly differ from the normal runtime. The difference is mostly to simplify the stubs themselves by by moving some of the logic to the kernel. The libc stubs call into the gateway page (offset 0), from where the kernel trampolines to the code that sets up a minimal trapframe and arranges to execute from the kernel stack. The way back is basicly the same. The kernel returns to the gateway page, whereby privilege is dropped, and jumps back to the syscall stub. Only the special registers are saved in the trapframe. None of the scratch registers are preserved and since the kernel follows the same runtime model, none of the preserved registers are saved. Future enhancements can include the implementation of lightweight syscalls, where kernel functions are performed without setting up a trapframe. Good candidates are the *context syscalls for example. Now that there's a gateway page from which code can be executed in a non-privileged context, we also have the ideal place to put the signal trampolines. By moving the signal trampolines from the user stack to the gateway page, we open up the doors to unexecutable stacks. The gateway page contains signal trampolines for both the "legacy" break-based syscall code and the new and improved epc- based syscall code. Approved: re@ (blanket)	2003-05-15 07:51:22 +00:00
Alan Cox	4a0d6dfd2c	Initialize logical_cpus_mask when the logical CPUs are enumerated in the mptable. (Previously, logical_cpus_mask was only initialized if the hyperthreading fixup was executed.) Approved by: re (jhb) Reviewed by: ps	2003-05-15 05:12:24 +00:00
Marcel Moolenaar	2a9fc22645	This commit was generated by cvs2svn to compensate for changes in r115013, which included commits to RCS files with non-trunk default branches.	2003-05-15 05:04:44 +00:00
Marcel Moolenaar	35859e5946	This is beta4 of libuwx; an ia64 stack unwinder. This code is made available by Hewlett-Packard under the MIT license. The unwinder is small, clean and fast and needed little adaptation for use in the kernel. This import has embedded in it the changes needed to make it build in a kernel environment. To optimize the common case, the kernel will minimize the number of registers saved by not saving the preserved registers. In case access to preserved registers is needed (signal handling, ptrace) the kernel will unwind to the context of the syscall or exception. For this we need an unwinder. Approved by: re (blanket)	2003-05-15 05:04:44 +00:00
Juli Mallett	7bbf05a2c3	Clear up that COMPAT_43 may not do the same thing on every architecture and clear up that COMPAT_SUNOS is similarly MI, and does something relatively similar. Approved by: re/rwatson	2003-05-15 02:10:30 +00:00
Peter Wemm	c0a54ff621	Collect the nastiness for preserving the kernel MSR_GSBASE around the load_gs() calls into a single place that is less likely to go wrong. Eliminate the per-process context switching of MSR_GSBASE, because it should be constant for a single cpu. Instead, save/restore it during the loading of the new %gs selector for the new process. Approved by: re (amd64/* blanket)	2003-05-15 00:23:40 +00:00
Peter Wemm	be52ef1399	Use compile time constants for things like PTmap[] etc because they're about to move outside of the +/- 2GB range Suggested by: jake Approved by: re (amd64/* blanket)	2003-05-15 00:20:17 +00:00
Maxime Henrion	4d340ec485	GCC 3.3 complains about anonymous structures in unions, so give the fxp_ipcb structure a name in the fxp_rfa structure. Submitted by: peter Approved by: re (jhb)	2003-05-14 20:33:41 +00:00
John Baldwin	aa7ba84232	Fix a typo that broke the pc98 kernel build. Reported by: des@'s tinderbox Pointy hat to: jhb Approved by: re (blanket/scottl)	2003-05-14 20:21:42 +00:00
John Baldwin	ce130a9573	Add <sys/queue.h> to unbreak world. Approved by: re (scottl)	2003-05-14 15:00:24 +00:00
Thomas Quinot	b3c957133a	In atapi_cam_reinit_bus, only call reinit_bus if the ATAPI channel has already been registered with ATAPI/CAM (else there is nothing to do). atapi_cam_reinit_bus may be called before the bus is registered if an ATAPI command times out during the boot sequence. PR: i386/51421 Reviewed by: roberto Approved by: re (rwatson) MFC after: 1 week	2003-05-14 14:20:22 +00:00
Wilko Bulte	5adbf8fb4d	add support for NetMos 4S0P PCI: 4S, 0P tested on -current: ceri tested on -stable: wilko approved: re (scottl)	2003-05-14 09:37:46 +00:00
Peter Wemm	e14528b349	Regen Approved by: re (amd64 blanket)	2003-05-14 04:11:25 +00:00
Peter Wemm	d85631c4ac	Add BASIC i386 binary support for the amd64 kernel. This is largely stolen from the ia64/ia32 code (indeed there was a repocopy), but I've redone the MD parts and added and fixed a few essential syscalls. It is sufficient to run i386 binaries like /bin/ls, /usr/bin/id (dynamic) and p4. The ia64 code has not implemented signal delivery, so I had to do that. Before you say it, yes, this does need to go in a common place. But we're in a freeze at the moment and I didn't want to risk breaking ia64. I will sort this out after the freeze so that the common code is in a common place. On the AMD64 side, this required adding segment selector context switch support and some other support infrastructure. The %fs/%gs etc code is hairy because loading %gs will clobber the kernel's current MSR_GSBASE setting. The segment selectors are not used by the kernel, so they're only changed at context switch time or when changing modes. This still needs to be optimized. Approved by: re (amd64/* blanket)	2003-05-14 04:10:49 +00:00
Peter Wemm	5d5ca6d75e	Fix some misunderstandings about 64 bit extension. Fix fuword/suword - they're supposed to be 'long' - ie: point them at fuword64/suword64 instead of the incorrect 32 bit versions.	2003-05-14 03:38:13 +00:00
Paul Saab	13d56a9a90	p_sigignore moved into struct sigacts. move one which was missed. Approved by: re (scottl)	2003-05-14 00:03:55 +00:00
John Baldwin	90af4afacb	- Merge struct procsig with struct sigacts. - Move struct sigacts out of the u-area and malloc() it using the M_SUBPROC malloc bucket. - Add a small sigacts_*() API for managing sigacts structures: sigacts_alloc(), sigacts_free(), sigacts_copy(), sigacts_share(), and sigacts_shared(). - Remove the p_sigignore, p_sigacts, and p_sigcatch macros. - Add a mutex to struct sigacts that protects all the members of the struct. - Add sigacts locking. - Remove Giant from nosys(), kill(), killpg(), and kern_sigaction() now that sigacts is locked. - Several in-kernel functions such as psignal(), tdsignal(), trapsignal(), and thread_stopped() are now MP safe. Reviewed by: arch@ Approved by: re (rwatson)	2003-05-13 20:36:02 +00:00
John Baldwin	25b4d3a8a6	In setitimer(2), if the it_value of the new itimer value is clear, then don't add the current time to it, but leave it as clear so that when the timer is disabled, the it_value is always clear. Reviewed by: bde Approved by: re (rwatson)	2003-05-13 19:21:46 +00:00
John Baldwin	dea7cce585	Add some extra #ifdef stubs so that this compiles on 4.8. Approved by: re (rwatson/bmah)	2003-05-13 16:59:46 +00:00
Yoshihiro Takahashi	3e4e484918	Move the ips driver from ${MACHINE_ARCH} == "i386" to ${MACHINE} == "i386". Approved by: re (scottl)	2003-05-13 11:26:08 +00:00
Alan Cox	099e981aa1	Optimize the use of splay in gbincore(). During a "make buildworld" the desired buffer is found at one of the roots more than 60% of the time. Thus, checking both roots before performing either splay eliminates unnecessary splays on the first tree splayed. Approved by: re (jhb)	2003-05-13 04:36:02 +00:00
Poul-Henning Kamp	3eb8c738fd	When a disk disappears, destroy the class from the event thread to avoid race condtion. Approved by: re/rwatson	2003-05-12 20:15:28 +00:00
Martin Blapp	7eac366be1	Add support for 3Com OfficeConnect 10/100B. PR: 49059, 50747 Submitted by: Dax Eckenberg <daxbert@dweebsoft.com> Reviewed by: imp, jhb Approved by: jhb MFC after: 2 weeks	2003-05-12 19:50:21 +00:00
Peter Wemm	8a6d52c3f8	Really stop the loader from trying to load the acpi module by lying and pretending that it is already here. Approved by: re (amd64/* stuff)	2003-05-12 18:37:56 +00:00

1 2 3 4 5 ...

39907 Commits