freebsd-skq

Author	SHA1	Message	Date
Warner Losh	a8197ad3aa	Remove sparc64 specific parts of libm and fix comments Once upon a time, sparc64 was the only ld128 architecture. However, both aarch64 and riscv are now such architectures. Many of the comments about how slow multiplication was on old sparc64 processors are now no longer true. However, since no evaluation has been done for aarch64 yet, it's unclear if they are still relevant or not. If not, the code should be changed. If so, the comments should remove the uncertainty. Reviewed by: emaste@ Differential Revision: https://reviews.freebsd.org/D23658	2020-02-26 18:55:03 +00:00
Bruce Evans	27aa844253	Centralize the complications for special efficient rounding to integers. This was open-coded in range reduction for trig and exp functions. Now there are 3 static inline functions rnint[fl]() that replace open-coded expressions, and type-generic irint() and i64rint() macros that hide the complications for efficiently using non-generic irint() and irintl() functions and casts. Special details: ld128/e_rem_pio2l.h needs to use i64rint() since it needs a 46-bit integer result. Everything else only needs a (less than) 32-bit integer result so uses irint(). Float and double cases now use float_t and double_t locally instead of STRICT_ASSIGN() to avoid bugs in extra precision. On amd64, inline asm is now only used for irint() on long doubles. The SSE asm for irint() on amd64 only existed because the ifdef tangles made the correct method of simply casting to int for this case non-obvious.	2018-07-20 12:42:24 +00:00
Pedro F. Giffuni	5e53a4f90f	lib: further adoption of SPDX licensing ID tags. Mainly focus on files that use BSD 2-Clause license, however the tool I was using mis-identified many licenses so this was mostly a manual - error prone - task. The Software Package Data Exchange (SPDX) group provides a specification to make it easier for automated tools to detect and summarize well known opensource licenses. We are gradually adopting the specification, noting that the tags are considered only advisory and do not, in any way, supersede or replace the license texts.	2017-11-26 02:00:33 +00:00
Dimitry Andric	6202fb7bd3	In lib/msun/ld128/s_expl.c, remove '/*' within block comment, to avoid a warning.	2014-02-21 21:54:36 +00:00
Steve Kargl	5f63fbd67f	* ld80/k_expl.h: * ld128/k_expl.h: . Split out a computational kernel, __k_expl(x, &hi, &lo, &k) from expl(x). x must be finite and not tiny or huge. The kernel returns hi and lo values for extra precision and an exponent k for a 2**k scale factor. . Define additional kernels k_hexpl() and hexpl() that include a 1/2 scaling and are used by the hyperbolic functions. ld80/s_expl.c: * ld128/s_expl.c: . Use the __k_expl() kernel. Obtained from: bde	2013-12-30 00:51:25 +00:00
Steve Kargl	1a287d1ddf	Change a comma to a semicolon. Remove a blank line that crept into the declarations. Fix a comment to show a sign on a NaN.	2013-06-03 20:09:22 +00:00
Steve Kargl	3ffff4bad5	ld80 and ld128 implementations of expm1l(). This code started life as a fairly faithful implementation of the algorithm found in PTP Tang, "Table-driven implementation of the Expm1 function in IEEE floating-point arithmetic," ACM Trans. Math. Soft., 18, 211-222 (1992). Over the last 18-24 months, the code has undergone significant optimization and testing. Reviewed by: bde Obtained from: bde (most of the optimizations)	2013-06-03 19:51:32 +00:00
Steve Kargl	42e4111cab	Fix two comments that got lost in the disentanglement of the larger diff.	2013-06-03 19:29:03 +00:00
Steve Kargl	8cc74771f2	ld80/s_expl.c: * Use integral numerical constants, and let the compiler do the conversion to long double. ld128/s_expl.c: * Use integral numerical constants, and let the compiler do the conversion to long double. * Use the ENTERI/RETURNI macros, which are no-ops on ld128. This however makes the ld80 and ld128 identical. Reviewed by: bde (as part of larger diff)	2013-06-03 19:13:44 +00:00
Steve Kargl	35cbca6a7f	Micro-optimization: move the unary minus operator to operate on a literal constant. Obtained from: bde	2013-06-03 18:57:35 +00:00
Steve Kargl	a3f70b4ed8	Add a comment to note that bde supplied most, if not all, of the optimizations.	2013-06-03 18:53:40 +00:00
Steve Kargl	1783063f18	ld80/s_expl.c: * In the special case x = -Inf or -NaN, use a micro-optimization to eliminate the need to access u.xbits.man. * Fix an off-by-one for small arguments |x| < 0x1p-65. ld128/s_expl.c: * In the special case x = -Inf or -NaN, use a micro-optimization to eliminate the need to access u.xbits.manh and u.xbits.manl. * Fix an off-by-one for small arguments |x| < 0x1p-114. Obtained from: bde	2013-06-03 18:51:34 +00:00
Steve Kargl	31407861b8	ld80/s_expl.c: * Update the evaluation of the polynomial. This allows the removal of the now unused variables t23 and t45. ld128/s_expl.c: * Update the evaluation of the polynomial and the intermediate result t. This update allows several numerical constants to be written as double rather than long double constants. Update the constants as appropriate. Obtained from: bde	2013-06-03 18:40:00 +00:00
Steve Kargl	f3049ab5f3	Update a comment to reflect that we are using an endpoint of an interval instead of a midpoint.	2013-06-03 18:14:18 +00:00
Steve Kargl	4aa8c9453f	Introduce the macro LOG2_INTERVAL, which is log2(number of intervals). Use the macro as a micro-optimization to convert a subtraction and division to a shift. Obtained from: bde	2013-06-03 17:51:08 +00:00
Steve Kargl	03e1315345	Whitespace.	2013-06-03 17:40:52 +00:00
Steve Kargl	bb23de67bb	* Rename the polynomial coefficients from P2, P3, ... to A2, A3, .... The names now coincide with the name used in PTP Tang's paper. * Rename the variable from s to tbl to better reflect that this is a table, and to be consistent with the naming scheme in s_exp2l.c Reviewed by: bde (as part of larger diff)	2013-06-03 17:36:26 +00:00
Steve Kargl	a1d69112c1	ld80/s_expl.c: * Update Copyright years to include 2013. ld128/s_expl.c: * Correct and update Copyright years. This code originated from the ld80 version, so it should reflect the same time period. Reviewed by: bde (as part of larger diff)	2013-06-03 17:21:43 +00:00
Steve Kargl	dba466c344	* ld80/s_expl.c: . Fix the threshold for expl(x) where |x| is small. . Also update the previously incorrect comment to match the new threshold. * ld128/s_expl.c: . Re-order logic in exceptional cases to match the logic used in other long double functions. . Fix the threshold for expl(x) where |x| is small. . Also update the previously incorrect comment to match the new threshold. Submitted by: bde Approved by: das (mentor)	2012-09-23 18:32:03 +00:00
Steve Kargl	8f647ffd7f	* ld80/s_expl.c: . Guard a comment from reformatting by indent(1). . Re-order variables in declarations to alphabetical order. . Remove a banal comment. * ld128/s_expl.c: . Add a comment to point to ld80/s_expl.c for implementation details. . Move the #define of INTERVAL to reduce the diff with ld80/s_expl.c. . twom10000 does not need to be volatile, so move its declaration. . Re-order variables in declarations to alphabetical order. . Add a comment that describes the argument reduction. . Remove the same banal comment found in ld80/s_expl.c. Reviewed by: bde Approved by: das (mentor)	2012-09-23 18:06:27 +00:00
Steve Kargl	ca50c4b871	Whitespace. Submitted by: bde Approved by: das (pre-approved)	2012-07-30 21:55:49 +00:00
Steve Kargl	8345cbd275	Replace the macro name NUM with INTERVALS. This change provides compatibility with the INTERVALS macro used in the soon-to-be-committed expm1l() and someday-to-be-committed log*l() functions. Add a comment into ld128/s_expl.c noting a gcc issue that was deleted when rewriting ld80/e_expl.c as ld128/s_expl.c. Requested by: bde Approved by: das (mentor)	2012-07-26 04:05:08 +00:00
Steve Kargl	f7cfe68f59	* ld80/expl.c: . Remove a few #ifdefs that should have been removed in the initial commit. . Sort fpmath.h to its rightful place. * ld128/s_expl.c: . Replace EXPMASK with its actual value. . Sort fpmath.h to its rightful place. Requested by: bde Approved by: das (mentor)	2012-07-26 03:59:33 +00:00
Steve Kargl	b83ccea32c	Compute the exponential of x for Intel 80-bit format and IEEE 128-bit format. These implementations are based on PTP Tang, "Table-driven implementation of the exponential function in IEEE floating-point arithmetic," ACM Trans. Math. Soft., 15, 144-157 (1989). PR: standards/152415 Submitted by: kargl Reviewed by: bde, das Approved by: das (mentor)	2012-07-23 19:13:55 +00:00

24 Commits