freebsd-nq

History

Bruce Evans 8299eb7e3e Moved all the optimizations for \|x\| <= 9pi/2 from __ieee754_rem_pio2f() to its 3 callers and manually inline them. On Athlons, with favourable compiler flags and optimizations and favourable pipeline conditions, this gives a speedup of 30-40 cycles for cosf(), sinf() and tanf() on the range pi/4 < \|x\| <= 9pi/4, so thes functions are now signifcantly faster than the hardware trig functions in many cases. E.g., in a benchmark with uniformly distributed x in [-2pi, 2pi], A64 hardware fcos took 72-129 cycles and cosf() took 37-55 cycles. Out-of-order execution is needed to get both of these times. The optimizations in this commit apparently work more by removing 1 serialization point than by reducing latency.		2005-11-19 02:38:27 +00:00
..
alpha	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
amd64	Add a missing ldexpf() alias for amd64.	2005-09-12 20:54:00 +00:00
arm	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
bsdsrc	Removed an unused declaration which was so old that it wasn't a prototype	2005-11-18 05:03:12 +00:00
i387	Fixed some comments added in rev.1.5.	2005-10-30 12:21:02 +00:00
ia64	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
man	-mdoc sweep.	2005-11-17 13:00:00 +00:00
powerpc	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
sparc64	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
src	Moved all the optimizations for \|x\| <= 9pi/2 from	2005-11-19 02:38:27 +00:00
Makefile	Detach k_rem_pio2f.c from the build since it is now unused. It is a libm	2005-11-06 17:59:40 +00:00