freebsd-skq

History

bde f77d7dfd70 Inline __ieee754__rem_pio2f(). On amd64 (A64) and i386 (A64), this

gives an average speedup of about 12 cycles or 17% for
9pi/4 < |x| <= 2**19pi/2 and a smaller speedup for larger x, and a
small speeddown for |x| <= 9pi/4 (only 1-2 cycles average, but that
is 4%).

Inlining this is less likely to bust caches than inlining the float
version since it is much smaller (about 220 bytes text and rodata) and
has many fewer branches.  However, the float version was already large
due to its manual inlining of the branches and also the polynomial
evaluations.

2008-02-25 22:19:17 +00:00

amd64

Use hardware remainder on amd64 since it is 5 to 10 times faster than

2008-02-13 06:01:48 +00:00

arm

Use C comments since we now preprocess these files with CPP.

2007-04-29 14:05:22 +00:00

bsdsrc

Eliminate some warnings.

2008-02-22 02:26:51 +00:00

i387

Implement rintl(), nearbyintl(), lrintl(), and llrintl().