freebsd-dev

Author	SHA1	Message	Date
Mateusz Guzik	b584eb2e90	locks: pass the found lock value to unlock slow path This avoids an explicit read later. While here whack the cheaply obtainable 'tid' argument.	2017-11-22 22:04:04 +00:00
Mateusz Guzik	013c0b493f	locks: remove the file + line argument from internal primitives when not used The pair is of use only in debug or LOCKPROF kernels, but was passed (zeroed) for many locks even in production kernels. While here whack the tid argument from wlock hard and xlock hard. There is no kbi change of any sort - "external" primitives still accept the pair.	2017-11-22 21:51:17 +00:00
Mark Johnston	755230eb9f	Clean up the SYSINIT_FLAGS definitions for rwlock(9) and rmlock(9). Avoid duplication in their macro definitions, and document them. No functional change intended. MFC after: 1 week	2017-11-21 14:59:23 +00:00
Mateusz Guzik	8fef6b2c67	rwlock: unlock before traversing threads to wake up While here perform a minor cleanup of the unlock path.	2017-11-17 02:26:15 +00:00
Mateusz Guzik	ae7d25a4d7	locks: pull up PMC_SOFT_CALLs out of slow path loops	2017-11-17 02:22:51 +00:00
Mateusz Guzik	3af300592c	rwlock: avoid branches in the slow path if lockstat is disabled	2017-11-17 02:21:24 +00:00
Mateusz Guzik	c7e4e92ecd	rwlock: use fcmpset for setting RW_LOCK_WRITE_SPINNER	2017-11-11 09:34:11 +00:00
Mateusz Guzik	db520fdd46	rwlock: fix up compilation without KDTRACE_HOOKS after r324787	2017-11-06 05:14:05 +00:00
Mateusz Guzik	2567807c32	rwlock: reduce lockstat branches in the slowpath MFC after: 1 week	2017-10-20 03:32:42 +00:00
Mateusz Guzik	d07e22cdd8	locks: take the number of readers into account when waiting Previous code would always spin once before checking the lock. But a lock with e.g. 6 readers is not going to become free in the duration of once spin even if they start draining immediately. Conservatively perform one for each reader. Note that the total number of allowed spins is still extremely small and is subject to change later. MFC after: 1 week	2017-10-05 19:18:02 +00:00
Mateusz Guzik	20a15d1752	locks: partially tidy up waiting on readers spin first instant of instantly re-readoing and don't re-read after spinning is finished - the state is already known. Note the code is subject to significant changes later. MFC after: 1 week	2017-10-05 13:01:18 +00:00
Mateusz Guzik	574adb65c8	Sprinkle __read_frequently on few obvious places. Note that some of annotated variables should probably change their types to something smaller, preferably bit-sized.	2017-09-06 20:33:33 +00:00
Mateusz Guzik	3f7830a31e	rwlock: perform the typically false td_rw_rlocks check later Check if the lock is available first instead. MFC after: 1 week	2017-07-02 01:05:16 +00:00
Mark Johnston	704cb42f2a	Fix the !TD_IS_IDLETHREAD(curthread) locking assertions. Most of the lock slowpaths assert that the calling thread isn't an idle thread. However, this may not be true if the system has panicked, and in some cases the assertion appears before a SCHEDULER_STOPPED() check. MFC after: 3 days Sponsored by: Dell EMC Isilon	2017-06-19 21:09:50 +00:00
Mateusz Guzik	a21018063b	locks: ensure proper barriers are used with atomic ops when necessary Unclear how, but the locking routine for mutexes was using the release barrier instead of acquire. This must have been either a copy-pasto or bad completion. Going through other uses of atomics shows no barriers in: - upgrade routines (addressed in this patch) - sections protected with turnstile locks - this should be fine as necessary barriers are in the worst case provided by turnstile unlock I would like to thank Mark Millard and andreast@ for reporting the problem and testing previous patches before the issue got identified. ps. .-'---`-. ,' `. \| \ \| \ \ _ \ ,\ _ ,'-,/-)\ ( * \ \,' ,' ,'-) `._,) -',-') \/ ''/ ) / / / ,'-' Hardware provided by: IBM LTC	2017-03-01 05:06:21 +00:00
Mateusz Guzik	b247fd395d	locks: make trylock routines check for 'unowned' value Since fcmpset can fail without lock contention e.g. on arm, it was possible to get spurious failures when the caller was expecting the primitive to succeed. Reported by: mmel	2017-02-19 16:28:46 +00:00
Mateusz Guzik	5c5df0d99b	locks: clean up trylock primitives In particular thius reduces accesses of the lock itself.	2017-02-18 22:06:03 +00:00
Mateusz Guzik	ffd5c94c4f	locks: let primitives for modules unlock without always goging to the slsow path It is only needed if the LOCK_PROFILING is enabled. It has to always check if the lock is about to be released which requires an avoidable read if the option is not specified..	2017-02-17 05:39:40 +00:00
Mateusz Guzik	afa39f7a32	locks: remove SCHEDULER_STOPPED checks from primitives for modules They all fallback to the slow path if necessary and the check is there. This means a panicked kernel executing code from modules will be able to succeed doing actual lock/unlock, but this was already the case for core code which has said primitives inlined.	2017-02-17 05:09:51 +00:00
Mateusz Guzik	8eaaf58a5f	rwlock: fix r313454 The runlock slow path would update wrong variable before restarting the loop, in effect corrupting the state. Reported by: pho	2017-02-09 13:32:19 +00:00
Mateusz Guzik	3b3cf014fc	locks: tidy up unlock fallback paths Update comments to note these functions are reachable if lockstat is enabled. Check if the lock has any bits set before attempting unlock, which saves an unnecessary atomic operation.	2017-02-09 08:19:30 +00:00
Mateusz Guzik	b0a61642d4	rwlock: implemenet rlock/runlock fast path This improves singlethreaded throughput on my test machine from ~247 mln ops/s to ~328 mln. It is mostly about avoiding the setup cost of lockstat. Reviewed by: jhb (previous version)	2017-02-08 19:28:46 +00:00
Mateusz Guzik	dbccc8105c	rwlock: implement RW_LOCK_WRITER_RECURSED bit This moves recursion handling out of the inlined wunlock path and in particular saves a read and a branch. Discussed with:	2017-02-07 17:04:31 +00:00
Mateusz Guzik	8e5a3e9a9d	locks: change backoff to exponential Previous implementation would use a random factor to spread readers and reduce chances of starvation. This visibly reduces effectiveness of the mechanism. Switch to the more traditional exponential variant. Try to limit starvation by imposing an upper limit of spins after which spinning is half of what other threads get. Note the mechanism is turned off by default. Reviewed by: kib (previous version)	2017-02-07 14:49:36 +00:00
Mateusz Guzik	c1aaf63cb5	locks: fix recursion support after recent changes When a relevant lockstat probe is enabled the fallback primitive is called with a constant signifying a free lock. This works fine for typical cases but breaks with recursion, since it checks if the passed value is that of the executing thread. Read the value if necessary.	2017-02-06 09:40:14 +00:00
Mateusz Guzik	993ddec44d	rwlock: move lockstat handling out of inline primitives See r313275 for details. One difference here is that recursion handling was removed from the fallback routine. As it is it was never supposed to see a recursed lock in the first place. Future changes will move it out of inline variants, but right now there is no easy to way to test if the lock is recursed without reading additional words.	2017-02-05 13:37:23 +00:00
Mateusz Guzik	c84f347985	rwlock: switch to fcmpset Discussed with: jhb Tested by: pho	2017-02-05 04:53:13 +00:00
Mateusz Guzik	290511163d	Sprinkle __read_mostly on backoff and lock profiling code. MFC after: 1 month	2017-01-27 15:03:51 +00:00
Mateusz Guzik	3f0a0612e8	rwlock: reduce lock accesses similarly to r311172 Discussed with: jhb Tested by: pho (previous version)	2017-01-18 17:53:57 +00:00
Mateusz Guzik	fa5000a4f3	locks: fix compilation for KDTRACE_HOOKS && !ADAPTIVE_* case Reported by: Michael Butler <imb protected-networks.net>	2016-08-02 03:05:59 +00:00
Mateusz Guzik	0412689595	locks: fix up ifdef guards introduced in r303643 Both sx and rwlocks had copy-pasted ADAPTIVE_MUTEXES instead of the correct define. MFC after: 1 week	2016-08-02 00:15:08 +00:00
Mateusz Guzik	1ada904147	Implement trivial backoff for locking primitives. All current spinning loops retry an atomic op the first chance they get, which leads to performance degradation under load. One classic solution to the problem consists of delaying the test to an extent. This implementation has a trivial linear increment and a random factor for each attempt. For simplicity, this first thouch implementation only modifies spinning loops where the lock owner is running. spin mutexes and thread lock were not modified. Current parameters are autotuned on boot based on mp_cpus. Autotune factors are very conservative and are subject to change later. Reviewed by: kib, jhb Tested by: pho MFC after: 1 week	2016-08-01 21:48:37 +00:00
Mateusz Guzik	61852185ba	locks: change sleep_cnt and spin_cnt types to u_int Both variables are uint64_t, but they only count spins or sleeps. All reasonable values which we can get here comfortably hit in 32-bit range. Suggested by: kib MFC after: 1 week	2016-07-31 12:11:55 +00:00
Mateusz Guzik	7a54be1870	rwlock: s/READER/WRITER/ in wlock lockstat annotation	2016-07-30 22:21:48 +00:00
Mateusz Guzik	fc4f686d59	Microoptimize locking primitives by avoiding unnecessary atomic ops. Inline version of primitives do an atomic op and if it fails they fallback to actual primitives, which immediately retry the atomic op. The obvious optimisation is to check if the lock is free and only then proceed to do an atomic op. Reviewed by: jhb, vangyzen	2016-06-01 18:32:20 +00:00
Mark Johnston	ce1c953ee0	Don't modify curthread->td_locks unless INVARIANTS is enabled. This field is only used in a KASSERT that verifies that no locks are held when returning to user mode. Moreover, the td_locks accounting is only correct when LOCK_DEBUG > 0, which is implied by INVARIANTS. Reviewed by: jhb MFC after: 1 week Differential Revision: https://reviews.freebsd.org/D3205	2015-08-02 00:03:08 +00:00
Mark Johnston	97cc6870f6	Don't increment the spin count until after the first attempt to acquire a rwlock read lock. Otherwise the lockstat:::rw-spin probe will fire spuriously. MFC after: 1 week	2015-07-19 22:26:02 +00:00
Mark Johnston	de2c95cc00	Consistently use a reader/writer flag for lockstat probes in rwlock(9) and sx(9), rather than using the probe function name to determine whether a given lock is a read lock or a write lock. Update lockstat(1) accordingly.	2015-07-19 22:24:33 +00:00
Mark Johnston	32cd0147fa	Implement the lockstat provider using SDT(9) instead of the custom provider in lockstat.ko. This means that lockstat probes now have typed arguments and will utilize SDT probe hot-patching support when it arrives. Reviewed by: gnn Differential Revision: https://reviews.freebsd.org/D2993	2015-07-19 22:14:09 +00:00
Mark Johnston	e2b25737ee	Pass the lock object to lockstat_nsecs() and return immediately if LO_NOPROFILE is set. Some timecounter handlers acquire a spin mutex, and we don't want to recurse if lockstat probes are enabled. PR: 201642 Reviewed by: avg MFC after: 3 days	2015-07-18 00:57:30 +00:00
Andriy Gapon	076dd8eb2e	several lockstat improvements 0. For spin events report time spent spinning, not a loop count. While loop count is much easier and cheaper to obtain it is hard to reason about the reported numbers, espcially for adaptive locks where both spinning and sleeping can happen. So, it's better to compare apples and apples. 1. Teach lockstat about FreeBSD rw locks. This is done in part by changing the corresponding probes and in part by changing what probes lockstat should expect. 2. Teach lockstat that rw locks are adaptive and can spin on FreeBSD. 3. Report lock acquisition events for successful rw try-lock operations. 4. Teach lockstat about FreeBSD sx locks. Reporting of events for those locks completely mirrors rw locks. 5. Report spin and block events before acquisition event. This is behavior documented for the upstream, so it makes sense to stick to it. Note that because of FreeBSD adaptive lock implementations both the spin and block events may be reported for the same acquisition while the upstream reports only one of them. Differential Revision: https://reviews.freebsd.org/D2727 Reviewed by: markj MFC after: 17 days Relnotes: yes Sponsored by: ClusterHQ	2015-06-12 10:01:24 +00:00
Dmitry Chagin	fd07ddcf6f	Add _NEW flag to mtx(9), sx(9), rmlock(9) and rwlock(9). A _NEW flag passed to _init_flags() to avoid check for double-init. Differential Revision: https://reviews.freebsd.org/D1208 Reviewed by: jhb, wblock MFC after: 1 Month	2014-12-13 21:00:10 +00:00
John Baldwin	2cba8dd301	Add a new thread state "spinning" to schedgraph and add tracepoints at the start and stop of spinning waits in lock primitives.	2014-11-04 16:35:56 +00:00
John Baldwin	e432d5f6a7	Drop the 3rd clause from all 3 clause BSD licenses where I am the sole holder to convert them to 2 clause BSD licenses. MFC after: 1 week	2014-02-05 18:13:27 +00:00
Attilio Rao	e7a9eed7a8	- Assert for not leaking readers rw locks counter on userland return. - Use a correct spin_cnt for KDTRACE_HOOK case in rw read lock. Sponsored by: EMC / Isilon storage division	2013-12-17 13:37:02 +00:00
Attilio Rao	54366c0bd7	- For kernel compiled only with KDTRACE_HOOKS and not any lock debugging option, unbreak the lock tracing release semantic by embedding calls to LOCKSTAT_PROFILE_RELEASE_LOCK() direclty in the inlined version of the releasing functions for mutex, rwlock and sxlock. Failing to do so skips the lockstat_probe_func invokation for unlocking. - As part of the LOCKSTAT support is inlined in mutex operation, for kernel compiled without lock debugging options, potentially every consumer must be compiled including opt_kdtrace.h. Fix this by moving KDTRACE_HOOKS into opt_global.h and remove the dependency by opt_kdtrace.h for all files, as now only KDTRACE_FRAMES is linked there and it is only used as a compile-time stub [0]. [0] immediately shows some new bug as DTRACE-derived support for debug in sfxge is broken and it was never really tested. As it was not including correctly opt_kdtrace.h before it was never enabled so it was kept broken for a while. Fix this by using a protection stub, leaving sfxge driver authors the responsibility for fixing it appropriately [1]. Sponsored by: EMC / Isilon storage division Discussed with: rstone [0] Reported by: rstone [1] Discussed with: philip	2013-11-25 07:38:45 +00:00
Davide Italiano	cf6b879fad	Consistently use the same value to indicate exclusively-held and shared-held locks for all the primitives in lc_lock/lc_unlock routines. This fixes the problems introduced in r255747, which indeed introduced an inversion in the logic. Reported by: many Tested by: bdrewery, pho, lme, Adam McDougall, O. Hartmann Approved by: re (glebius)	2013-09-22 14:09:07 +00:00
Davide Italiano	7faf4d90e8	Fix lc_lock/lc_unlock() support for rmlocks held in shared mode. With current lock classes KPI it was really difficult because there was no way to pass an rmtracker object to the lock/unlock routines. In order to accomplish the task, modify the aforementioned functions so that they can return (or pass as argument) an uinptr_t, which is in the rm case used to hold a pointer to struct rm_priotracker for current thread. As an added bonus, this fixes rm_sleep() in the rm shared case, which right now can communicate priotracker structure between lc_unlock()/lc_lock(). Suggested by: jhb Reviewed by: jhb Approved by: re (delphij)	2013-09-20 23:06:21 +00:00
John Baldwin	b5fb43e572	A few mostly cosmetic nits to aid in debugging: - Call lock_init() first before setting any lock_object fields in lock init routines. This way if the machine panics due to a duplicate init the lock's original state is preserved. - Somewhat similarly, don't decrement td_locks and td_slocks until after an unlock operation has completed successfully.	2013-06-25 20:23:08 +00:00
John Baldwin	95d28652af	- Handle the recursed/not recursed flags with RA_RLOCKED in rw_assert(). - Tweak a panic message.	2013-06-03 17:38:57 +00:00

1 2 3

110 Commits