freebsd-skq

History

Bruce Evans 828f7b4a82 Rearrange the polynomial evaluation for better parallelism. This is faster on all machines tested (old Celeron (P2), A64 (amd64 and i386) and ia64) except on ia64 when compiled with -O1. It takes 2 more multiplications, so it will be slower on old machines. The speedup is about 8 cycles = 17% on A64 (amd64 and i386) with best CFLAGS and some parallelism in the caller. Move the evaluation of 2**k up a bit so that it doesn't compete too much with the new polynomial evaluation. Unlike the previous optimization, this rearrangement cannot change the result, so compilers and CPU schedulers can do it, but they don't do it quite right yet. This saves a whole 1 or 2 cycles on A64.		2008-02-13 08:36:13 +00:00
..
amd64	Use hardware remainder on amd64 since it is 5 to 10 times faster than	2008-02-13 06:01:48 +00:00
arm	Use C comments since we now preprocess these files with CPP.	2007-04-29 14:05:22 +00:00
bsdsrc	Fix tgamma() on some special args:	2007-05-02 15:24:49 +00:00
i387	Implement rintl(), nearbyintl(), lrintl(), and llrintl().	2008-01-14 02:12:07 +00:00
ia64	Use C comments since we now preprocess these files with CPP.	2007-04-29 14:05:22 +00:00
ld80	Use a better method of scaling by 2**k. Instead of adding to the	2008-02-07 03:17:05 +00:00
ld128	Use a better method of scaling by 2**k. Instead of adding to the	2008-02-07 03:17:05 +00:00
man	Introduce a new log(3) manpage and move the relevant functions there.	2008-01-18 21:43:00 +00:00
powerpc	Use C comments since we now preprocess these files with CPP.	2007-04-29 14:05:22 +00:00
sparc64	Use C comments since we now preprocess these files with CPP.	2007-04-29 14:05:22 +00:00
src	Rearrange the polynomial evaluation for better parallelism. This is	2008-02-13 08:36:13 +00:00
Makefile	Hook up exp2l() and related docs to the build.	2008-01-18 21:43:10 +00:00
Symbol.map	Hook up exp2l() and related docs to the build.	2008-01-18 21:43:10 +00:00