Konstantin Belousov 6bfe9e8ccf Optimize tls_get_addr_common(). The change provides around 30% speedup
for TLS microbenchmark using global-dynamic TLS model on amd64 (which is
default for PIC dso objects).

Split the slow path into tls_get_addr_slow(), for which inlining is
disabled. This prevents the registers spill on tls_get_addr_common()
entry.

Provide static branch hint to the compiler, indicating that slow path
is not likely to be taken.

While there, do some minimal style adjustments.

Reported and tested by:	davidxu
MFC after:	1 week
2012-03-10 08:49:44 +00:00
..
2012-01-07 16:09:54 +00:00
2012-01-07 16:09:54 +00:00
2012-01-07 16:09:54 +00:00
2012-01-07 16:09:54 +00:00
2012-01-07 16:09:54 +00:00
2012-01-07 16:09:54 +00:00
2010-02-26 09:41:16 +00:00
2010-12-19 13:05:17 +00:00
2012-01-07 16:09:54 +00:00
2012-01-07 16:09:54 +00:00
2009-12-03 15:48:24 +00:00