freebsd-dev

Author	SHA1	Message	Date
pjd	92b377bb00	Allow assert that the current thread does not hold the sx(9) lock. Reviewed by: jhb In cooperation with: juli, jhb Approved by: jhb, scottl (mentor)	2004-02-04 08:14:58 +00:00
silby	48076c274f	Style fixes Submitted by: bde	2004-02-04 08:14:47 +00:00
rwatson	95d7d14f32	A variety of further cleanups to ttyinfo(): - Rename temporary variable names ("tmp", "tmp2") to more informative names ("load", "pctcpu", "rss", ...) - Unclutter indentation and return paths: rather than lots of nested ifs, simply return earlier if it's not going to work out. Simplify general structure and avoid "deep" code. - Comment on the thread/process selection and locking. - Correct handling of "running"/"runnable" states, avoid "unknown" that people were seeing for running processes. This was due to a misunderstanding of the more complex state machine / inhibitors behavior of KSE. - Do perform ttyinfo() printing on KSE (P_SA) processes, it seems generally to work. While I initially attempted to formulate this as two commits (one layout, the other content), I concluded that the layout changes were really structural changes. Many elements submitted by: bde	2004-02-04 05:46:05 +00:00
jhb	55eda735c5	Remove a bogus assertion. Noticed by: bde Pointy hat to: jhb	2004-02-03 15:14:27 +00:00
deischen	35142b05af	Regen after adding ksem_timedwait().	2004-02-03 05:11:31 +00:00
deischen	1c435d9c21	Add ksem_timedwait() to complement ksem_wait(). Glanced at by: alfred	2004-02-03 05:08:32 +00:00
rwatson	baad160992	Don't dec/inc the amountpipes counter every time we resize a pipe -- instead, just dec/inc in the ctor/dtor. For now, increment/decrement in two's, since we're now performing the operation once per pair, not once per pipe. Not really any measurable performance change in my micro-benchmarks, but doing less work is good, especially when it comes to atomic operations. Suggested by: alc	2004-02-03 04:55:24 +00:00
rwatson	bbfa440448	Catch instances of (pipe == NULL) that were obsoleted with recent changes to jointly allocated pipe pairs. Replace these checks with pipe_present checks. This avoids a NULL pointer dereference when a pipe is half-closed. Submitted by: Peter Edwards <peter.edwards@openet-telecom.com>	2004-02-03 02:50:51 +00:00
jhb	3d93c0a101	- Assert that witness_cold is not true in enroll(). - Only check witness_watch once in enroll(). Reported by: ru (2)	2004-02-02 22:15:17 +00:00
pjd	12bcfaa377	Fix many issues related to mount/unmount: 1. Root from inside a jail was able to unmount any file system (except /). 2. Unprivileged root was able to unmount file systems mounted by privileged root (execpt /). 3. User from inside a jail was able to mount file system when sysctl vfs.usermount was set to 1. 4. User was able to mount file system when vfs.usermount was set to 1 (that's ok) and unmount it even if vfs.usermount was equal to 0 (that's not correct). Possibility from point 1 was reported by: Dariusz Kowalski <darek@76.pl> Only a part of this fix will be MFC'ed (if approved). PR: kern/60149 Reviewed by: rwatson Approved by: scottl (mentor) MFC after: 3 days	2004-02-02 19:02:05 +00:00
silby	e574da4ff8	Remove debugging code that slipped into the previous commit. Spotted by: bde	2004-02-02 09:09:59 +00:00
jeff	04d3613e88	- style fixes to the critical_exit() KASSERT(). Submitted by: bde	2004-02-02 08:13:27 +00:00
jeff	0951227a5f	- Allow interactive tasks to use the maximum time-slice. This is not as detrimental as I thought it would be in the case of massive process storms from a shell and it makes regular desktop usage noticeably better.	2004-02-01 10:38:13 +00:00
silby	b993e863e1	Rewrite sendfile's header support so that headers are now sent in the first packet along with data, instead of in their own packet. When serving files of size (packetsize - headersize) or smaller, this will result in one less packet crossing the network. Quick testing with thttpd and http_load has shown a noticeable performance improvement in this case (350 vs 330 fetches per second.) Included in this commit are two support routines, iov_to_uio, and m_uiotombuf; these routines are used by sendfile to construct the header mbuf chain that will be linked to the rest of the data in the socket buffer.	2004-02-01 07:56:44 +00:00
jeff	71972f4168	- Disable ithread binding in all cases for now. This doesn't make as much sense with sched_4bsd as it does with sched_ule. - Use P_NOLOAD instead of the absence of td->td_ithd to determine whether or not a thread should be accounted for in sched_tdcnt.	2004-02-01 06:20:18 +00:00
rwatson	b61320448d	Coalesce pipe allocations and frees. Previously, the pipe code would allocate two 'struct pipe's from the pipe zone, and malloc a mutex. - Create a new "struct pipepair" object holding the two 'struct pipe' instances, struct mutex, and struct label reference. Pipe structures now have a back-pointer to the pipe pair, and a 'pipe_present' flag to indicate whether the half has been closed. - Perform mutex init/destroy in zone init/destroy, avoiding reallocating the mutex for each pipe. Perform most pipe structure setup in zone constructor. - VM memory mappings for pageable buffers are still done outside of the UMA zone. - Change MAC API to speak 'struct pipepair' instead of 'struct pipe', update many policies. MAC labels are also handled outside of the UMA zone for now. Label-only policy modules don't have to be recompiled, but if a module is recompiled, its pipe entry points will need to be updated. If a module actually reached into the pipe structures (unlikely), that would also need to be modified. These changes substantially simplify failure handling in the pipe code as there are many fewer possible failure modes. On half-close, pipes no longer free the 'struct pipe' for the closed half until a full-close takes place. However, VM mapped buffers are still released on half-close. Some code refactoring is now possible to clean up some of the back references, etc; this patch attempts not to change the structure of most of the pipe implementation, only allocation/free code paths, so as to avoid introducing bugs (hopefully). This cuts about 8%-9% off the cost of sequential pipe allocation and free in system call tests on UP and SMP in my micro-benchmarks. May or may not make a difference in macro-benchmarks, but doing less work is good. Reviewed by: juli, tjr Testing help: dwhite, fenestro, scottl, et al	2004-02-01 05:56:51 +00:00
jeff	8dd6e29330	- Revert rev 1.240 we no longer need a kthread for loadav().	2004-02-01 05:37:36 +00:00
jeff	d1598189c6	- Use sched_load() rather than grabbing the sx lock and traversing the proc table to discover the load.	2004-02-01 02:51:33 +00:00
jeff	ea0f9d8868	- Add a new member to struct kseq called ksq_sysload. This is intended to track the load for the sched_load() function. In the SMP case this member is not defined because it would be redundant with the ksg_load member which already tracks the non ithd load. - For sched_load() in the UP case simply return ksq_sysload. In the SMP case traverse the list of kseq groups and sum up their ksg_load fields.	2004-02-01 02:48:36 +00:00
jeff	e80368acbc	- Keep a variable 'sched_tdcnt' that is used for the local implementation of sched_load(). This variable tracks the number of running and runnable non ithd threads. This removes the need to traverse the proc table and discover how many threads are runnable.	2004-02-01 02:46:47 +00:00
rwatson	b222d68370	Move KASSERT regarding td_critnest to after the value of td is set to curthread, to avoid warning and incorrect behavior. Hoped not to mind: jeff	2004-02-01 02:31:36 +00:00
jeff	1f34b17ed2	- Assert that td_critnest > 0 in critical_exit() to catch cases of unbalanced uses of the critical_* api.	2004-02-01 01:24:54 +00:00
rwatson	abd5dd28b4	Fix an error in a KASSERT string: it's pipe_free_kmem(), not pipespace(), that contains this KASSERT.	2004-01-31 23:03:22 +00:00
phk	9ae9f17a96	Introduce the SO_BINTIME option which takes a high-resolution timestamp at packet arrival. For benchmarking purposes SO_BINTIME is preferable to SO_TIMEVAL since it has higher resolution and lower overhead. Simultaneous use of the two options is possible and they will return consistent timestamps. This introduces an extra test and a function call for SO_TIMEVAL, but I have not been able to measure that.	2004-01-31 10:40:25 +00:00
rwatson	1a0f1d7ef9	Assert process lock in ptracestop(), since we're going to rely on it, and later unlock it.	2004-01-29 00:58:21 +00:00
rwatson	751378e616	Add a reset sysctl for mutex profiling: zeros all of the mutex profiling buffers and hash table. This makes it a lot easier to do multiple profiling runs without rebooting or performing gratuitous arithmetic. Sysctl is named debug.mutex.prof.reset. Reviewed by: jake	2004-01-28 22:11:53 +00:00
jhb	40204c6793	Move the loadav() callout into its own kthread since it uses allproc_lock which is a sleepable lock and thus is not safe to acquire from a callout routine.	2004-01-28 20:44:41 +00:00
jhb	2176f751c3	Rework witness_lock() to make it slightly more useful and flexible. - witness_lock() is split into two pieces: witness_checkorder() and witness_lock(). Witness_checkorder() determines if acquiring a specified lock at the time it is called would result in a lock order. It optionally adds a new lock order relationship as well. witness_lock() updates witness's data structures to assume that a lock has been acquired by stick a new lock instance in the appropriate lock instance list. - The mutex and sx lock functions now call checkorder() prior to trying to acquire a lock and continue to call witness_lock() after the acquire is completed. This will let witness catch a deadlock before it happens rather than trying to do so after the threads have deadlocked (i.e. never actually report it). - A new function witness_defineorder() has been added that adds a lock order between two locks at runtime without having to acquire the locks. If the lock order cannot be added it will return an error. This function is available to programmers via the WITNESS_DEFINEORDER() macro which accepts either two mutexes or two sx locks as its arguments. - A few simple wrapper macros were added to allow developers to call witness_checkorder() anywhere as a way of enforcing locking assertions in code that might acquire a certain lock in some situations. The macros are: witness_check_{mutex,shared_sx,exclusive_sx} and take an appropriate lock as the sole argument. - The code to remove a lock instance from a lock list in witness_unlock() was unnested by using a goto to vastly improve the readability of this function.	2004-01-28 20:39:57 +00:00
jhb	4212174151	Use mtx_assert() rather than using a home-rolled version.	2004-01-28 20:26:39 +00:00
kan	16c14cce63	Move the part of the comment which applies to osigsuspend where it belongs. The current sigsuspend syscall does expect a pointer to the mask as argument. Submitted by: Igor Sysoev <is at rambler-co dot ru>	2004-01-28 06:06:04 +00:00
des	dff7f5a96a	Rename the kern.vm.kmem.size tunable to the more logical vm.kmem_size. To assure backward compatibility (conditional on !BURN_BRIDGES), look it up by its old name first, and log a warning (but accept the setting) if it was found. If both the old and new name are defined, the new name takes precedence. Also export vm.kmem_size as a read-only sysctl variable; I find it hard to tune a parameter when I don't know its default value, especially when that default value is computed at boot time.	2004-01-27 15:59:38 +00:00
rwatson	44532c56de	When aborting fork() due to a failure, if using MAC, make sure to clean up the p_label field. Obtained from: TrustedBSD Project Sponsored by: DARPA, McAfee Research	2004-01-25 18:42:18 +00:00
ru	ee3ae6a8dc	Register the uart(4)'s spin lock with witness(4).	2004-01-25 15:04:37 +00:00
jeff	7d2010756f	- sched_strict has been dead for a long time now. Get rid of it.	2004-01-25 08:58:14 +00:00
jeff	540495ca30	- Clean up KASSERTS.	2004-01-25 08:57:38 +00:00
jeff	b2d2d3a214	- Correct function names listed in KASSERTs. These were copied from other code and it was sloppy of me not to adjust these sooner.	2004-01-25 08:21:46 +00:00
jeff	ed8d4e8703	- Implement cpu pinning and binding. This is acomplished by keeping a per- cpu run queue that is only used for pinned or bound threads. Submitted by: Chris Bradfield <chrisb@ation.org>	2004-01-25 08:00:04 +00:00
jeff	c133d855d2	- Use a unique string for the sched_setup SYSINIT and rename sched_setup to synch_setup. The schedulers use the sched_setup function name.	2004-01-25 07:49:45 +00:00
jeff	8ac75a1ec6	- Add a flags parameter to mi_switch. The value of flags may be SW_VOL or SW_INVOL. Assert that one of these is set in mi_switch() and propery adjust the rusage statistics. This is to simplify the large number of users of this interface which were previously all required to adjust the proper counter prior to calling mi_switch(). This also facilitates more switch and locking optimizations. - Change all callers of mi_switch() to pass the appropriate paramter and remove direct references to the process statistics.	2004-01-25 03:54:52 +00:00
rwatson	1d962a2810	Add some basic support for measuring sleep mutex contention to the mutex profiling code. As with existing mutex profiling, measurement is done with respect to mtx_lock() instances in the code, as opposed to specific mutexes. In particular, measure two things: (1) Lock contention. How often did this mtx_lock() call get made and have to sleep (or almost sleep) waiting for the lock. This helps identify the "victims" of contention. (2) Hold contention. How often, while the lock was held by a thread as a result of this mtx_lock(), did another thread try to acquire the same mutex. This helps identify the causes of contention. I'm currently exploring adding measurement of "time waited for the lock", but the current implementation has proven useful to me so far so I figured I'd commit it so others could try it out. Note that this increases the size of mutexes when MUTEX_PROFILING is enabled, so you might find you need to further bump UMA_BOOT_PAGES. Fixes welcome. The once over: des, others	2004-01-25 01:59:27 +00:00
phk	4a674f916d	Deal with MOD_FREQUENCY before MOD_OFFSET because the latter is the one which runs the actual update. This fixes a bug where there were a delay in applying the frequency adjustment. In extreme cases this could result in marginal stability of the kernel-pll.	2004-01-24 21:48:43 +00:00
jeff	1e5b101989	- Move smp_topology to subr_smp.c so that it is defined on all architectures.	2004-01-24 19:52:48 +00:00
rwatson	b67d5948b0	Don't grab Giant in crfree(), since prison_free() no longer requires it. The uidinfo code appears to be MPSAFE, and is referenced without Giant elsewhere. While this grab of Giant was only made in fairly rare circumstances (actually GC'ing on refcount==0), grabbing Giant here potentially introduces lock order issues with any locks held by the caller. So this probably won't help performance much unless you change credentials a lot in an application, and leave a lot of file descriptors and cached credentials around. However, it simplifies locking down consumers of the credential interfaces. Bumped into by: sam Appeased: tjr	2004-01-23 21:07:52 +00:00
rwatson	97a5db182a	Defer the vrele() on a jail's root vnode reference from prison_free() to a new prison_complete() task run by a task queue. This removes a requirement for grabbing Giant in crfree(). Embed the 'struct task' in 'struct prison' so that we don't have to allocate memory from prison_free() (which means we also defer the FREE()). With this change, I believe grabbing Giant from crfree() can now be removed, but need to check the uidinfo code paths. To avoid header pollution, move the definition of 'struct task' to _task.h, and recursively include from taskqueue.h and jail.h; much preferably to all files including jail.h picking up a requirement to include taskqueue.h. Bumped into by: sam Reviewed by: bde, tjr	2004-01-23 20:44:26 +00:00
phk	083a677018	Write 100 times for tomorrow: "Always print time_t as %jd, you never know what width it has"	2004-01-22 19:50:06 +00:00
rse	809b1aa946	Fix generation of random multicast MAC address. In case no real/physical IEEE 802 address is available, both the expired "draft-leach-uuids-guids-01" (section "4. Node IDs when no IEEE 802 network card is available") and RFC 2518 (section "6.4.1 Node Field Generation Without the IEEE 802 Address") recommend (quoted from RFC 2518): "The ideal solution is to obtain a 47 bit cryptographic quality random number, and use it as the low 47 bits of the node ID, with the _most_ significant bit of the first octet of the node ID set to 1. This bit is the unicast/multicast bit, which will never be set in IEEE 802 addresses obtained from network cards; hence, there can never be a conflict between UUIDs generated by machines with and without network cards." Unfortunately, this incorrectly explains how to implement this and the FreeBSD UUID generator code inherited this generation bug from the broken reference code in the standards draft. They should instead specify the "_least_ significant bit of the first octet of the node ID" as the multicast bit in a memory and hexadecimal string representation of a 48-bit IEEE 802 MAC address. This standards bug arised from a false interpretation, as the multicast bit is actually the _most_ significant bit in IEEE 802.3 (Ethernet) _transmission order_ of an IEEE 802 MAC address. The standards authors forgot that the bitwise order of an _octet_ from a MAC address _memory_ and hexadecimal string representation is still always from left (MSB, bit 7) to right (LSB, bit 0). Fortunately, this UUID generation bug could have occurred on systems without any Ethernet NICs only.	2004-01-22 13:34:11 +00:00
phk	25628e44b9	Add a sysctl (default: off) which enables a log(LOG_INFO...) warning if the clock is stepped.	2004-01-21 21:05:40 +00:00
rwatson	eb2019485b	Reduce gratuitous includes: don't include jail.h if it's not needed. Presumably, at some point, you had to include jail.h if you included proc.h, but that is no longer required. Result of: self injury involving adding something to struct prison	2004-01-21 17:10:47 +00:00
ache	f8d77065a1	pread/pwrite: follow lseek spirit - return EINVAL on negative offset for non-VCHR	2004-01-20 01:27:42 +00:00
phk	4967bebd9f	Add linenumber and source filename to panic(9) output. Ideally a traceback should be printed too, any takers ?	2004-01-19 21:27:11 +00:00

1 2 3 4 5 ...

6960 Commits