mjg f9bb72ec71 sx: uninline slock/sunlock
Shared locking routines explicitly read the value and test it. If the
change attempt fails, they fall back to a regular function which would
retry in a loop.

The problem is that with many concurrent readers the risk of failure is pretty
high and even the value returned by fcmpset is very likely going to be stale
by the time the loop in the fallback routine is reached.

Uninline said primitives. It gives a throughput increase when doing concurrent
slocks/sunlocks with 80 hardware threads from ~50 mln/s to ~56 mln/s.

Interestingly, rwlock primitives are already not inlined.
2017-02-05 05:20:29 +00:00
..
2017-02-04 06:24:49 +00:00
2017-01-28 16:31:23 +00:00
2017-01-28 02:22:15 +00:00
2017-01-28 16:31:23 +00:00
2017-01-28 16:31:23 +00:00
2017-01-28 02:22:15 +00:00
2017-02-05 05:20:29 +00:00
2017-02-04 18:25:09 +00:00
2017-02-05 05:20:29 +00:00
2017-01-19 16:46:05 +00:00
2017-02-04 19:16:19 +00:00
2017-01-28 02:22:15 +00:00