freebsd-nq

History

Bruce Evans 8299eb7e3e Moved all the optimizations for |x| <= 9pi/2 from __ieee754_rem_pio2f() to its 3 callers and manually inlined them.

On Athlons, with favourable compiler flags and optimizations and
favourable pipeline conditions, this gives a speedup of 30-40 cycles
for cosf(), sinf() and tanf() on the range pi/4 < |x| <= 9pi/4, so
these functions are now significantly faster than the hardware trig
functions in many cases.  E.g., in a benchmark with uniformly distributed
x in [-2pi, 2pi], A64 hardware fcos took 72-129 cycles and cosf() took
37-55 cycles.  Out-of-order execution is needed to get both of these
times.  The optimizations in this commit apparently work more by
removing 1 serialization point than by reducing latency.

2005-11-19 02:38:27 +00:00

alpha

Replace fegetmask() and fesetmask() with feenableexcept(),

2005-03-16 19:03:46 +00:00

amd64

Add a missing ldexpf() alias for amd64.

2005-09-12 20:54:00 +00:00

arm

Replace fegetmask() and fesetmask() with feenableexcept(),

2005-03-16 19:03:46 +00:00

bsdsrc

Removed an unused declaration which was so old that it wasn't a prototype