freebsd-dev

Author	SHA1	Message	Date
Bruce Evans	23f6483e0a	Restored a cleanup in rev.1.9 tthat was lost in rev.1.10.	2005-11-20 20:17:04 +00:00
Bruce Evans	8299eb7e3e	Moved all the optimizations for \|x\| <= 9pi/2 from __ieee754_rem_pio2f() to its 3 callers and manually inline them. On Athlons, with favourable compiler flags and optimizations and favourable pipeline conditions, this gives a speedup of 30-40 cycles for cosf(), sinf() and tanf() on the range pi/4 < \|x\| <= 9pi/4, so thes functions are now signifcantly faster than the hardware trig functions in many cases. E.g., in a benchmark with uniformly distributed x in [-2pi, 2pi], A64 hardware fcos took 72-129 cycles and cosf() took 37-55 cycles. Out-of-order execution is needed to get both of these times. The optimizations in this commit apparently work more by removing 1 serialization point than by reducing latency.	2005-11-19 02:38:27 +00:00
Bruce Evans	3f1a8f462c	Removed an unused declaration which was so old that it wasn't a prototype and thus just broke building at any nonzero WARNS level. Fixed nearby style bugs.	2005-11-18 05:03:12 +00:00
Ruslan Ermilov	110e1704d3	-mdoc sweep.	2005-11-17 13:00:00 +00:00
Bruce Evans	75ff209cbb	Minor cleanups: s_cosf.c and s_sinf.c: Use a non-bogus magic constant for the threshold of pi/4. It was 2 ulps smaller than pi/4 rounded down, but its value is not critical so it should be the result of natural rounding. s_cosf.c and s_tanf.c: Use a literal 0.0 instead of an unnecessary variable initialized to [(float)]0.0. Let the function prototype convert to 0.0F. Improved wording in some comments. Attempted to improve indentation of comments.	2005-11-17 03:53:22 +00:00
Bruce Evans	123e5d3dae	Rearranged the the optimizations for special cases to reduce the average number of branches. Use a non-bogus magic constant for the threshold of pi/4. It was 2 ulps smaller than pi/4 rounded down, but its value is not critical so it should be the result of natural rounding. Use "<=" comparisons with rounded- down thresholds for all small multiples of pi/4. Cleaned up previous commit: - use static const variables instead of expressions for multiples of pi/2 to ensure that they are evaluated at compile time. gcc currently evaluates them at compile time but C99 compilers are not required to do so. We want compile time evaluation for optimization and don't care about side effects. - use M_PI_2 instead of a magic constant for pi/2. We need magic constants related to pi/2 elsewhere but not here since we just want pi/2 rounded to double and even prefer it to be rounded in the default rounding mode. We can depend on the cmpiler being C99ish enough to round M_PI_2 correctly just as much as we depended on it handling hex constants correctly. This also fixes a harmless rounding error in the hex constant. - keep using expressions n<value for pi/2> in the initializers for the static const variables. 2M_PI_2 and 4M_PI_2 are obviously rounded in the same way as the corresponding infinite precision expressions for multiples of pi/2, and 3M_PI_2 happens to be rounded like this, so we don't need magic constants for the multiples. - fixed and/or updated some comments.	2005-11-17 02:20:04 +00:00
Bruce Evans	25efbfb212	Fixed some magic numbers. The threshold for not being tiny was too small. Use the usual 2*-12 threshold. This change is not just an optimization, since the general code that we fell into has accuracy problems even for tiny x. Avoiding it fixes 21366 args with errors of more than 1 ulp, with a maximum error of 1.167 ulps. The magic number 22 is log(DBL_EPSILON)/2 plus slop. This is bogus for float precision. Use 9 (~log(FLT_EPSILON)/2 plus less slop than for double precision). The code for handling the interval [2*-28, 9_was_22] has accuracy problems even for [9, 22], so this change happens to fix errors of more than 1 ulp in about 217000 cases. It leaves such errors in about 21074000 cases, with a max error of 1.242 ulps. The threshold for switching from returning exp(x)/2 to returning exp(x/2)^2/2 was a little smaller than necessary. As for coshf(), This was not quite harmless since the exp(x/2)^2/2 case is inaccurate, and fixing it avoids accuracy problems in 26 cases, leaving problems in 2*19997 cases. Fixed naming errors in pseudo-code in comments.	2005-11-13 00:41:46 +00:00
Bruce Evans	c24b7984fc	Fixed some magic numbers. The threshold for not being tiny was confusing and too small. Use the usual 2*-12 threshold and simplify the algorithm slightly so that this threshold works (now use the threshold for sinhf() instead of one for 1+expm1()). This is just a small optimization. The magic number 22 is log(DBL_EPSILON)/2 plus slop. This is bogus for float precision. Use 9 (~log(FLT_EPSILON)/2 plus less slop than for double precision). The threshold for switching from returning exp(x)/2 to returning exp(x/2)^2/2 was a little smaller than necessary. This was not quite harmless since the exp(x/2)^2/2 case is inaccurate. Fixing it happens to avoid accuracy problems for 26 of the 2151 args that were handled by the exp(x)/2 case. This leaves accuracy problems for about 219997 args near the overflow threshold (~89); the maximum error there is 2.5029 ulps. There are also accuracy probles for args in +-[0.5ln2, 9] -- 2188885 args with errors of more than 1 ulp, with a maximum error of 1.384 ulps. Fixed a syntax error and naming errors in pseudo-code in comments.	2005-11-13 00:08:23 +00:00
Bruce Evans	e96c4fd9f7	Imoproved comments for the minimax polynomial. Removed an unused variable. Fixed some wrong comments and some nearby misformatting.	2005-11-12 20:06:04 +00:00
Bruce Evans	6e10a447f8	Tweaked the minimax polynomial and improved its comments.	2005-11-12 19:56:35 +00:00
Bruce Evans	787d6d77d5	Improved comments for the minimax polynomial.	2005-11-12 19:54:45 +00:00
Bruce Evans	d4a74de9fc	As for the float trig functions, use a minimax polynomial that is specialized for float precision. The new polynomial has degree 8 instead of 14, and a maximum error of 2-34.34 (absolute) instead of 2-30.66. This doesn't affect the final error significantly; the maximum error was and is about 0.8879 ulps on amd64 -01. The fdlibm expf() is not used on i386's (the "optimized" asm version is used), but probably should be since it was already significantly faster than the asm version on athlons. The asm version has the advantage of being more accurate, so keep using it for now.	2005-11-12 18:20:09 +00:00
Bruce Evans	c01611e437	As for __kernel_cosf() and __kernel_sinf(), use a fairly optimal minimax polynomial for __kernel_tanf(). The old one was the double-precision polynomial with coefficients truncated to float. Truncation is not a good way to convert minimax polynomials to lower precision. Optimize for efficiency and use the lowest-degree polynomial that gives a relative error of less than 1 ulp. It has degree 13 instead of 27, and happens to be 2.5 times more accurate (in infinite precision) than the old polynomial (the maximum error is 0.017 ulps instead of 0.041 ulps). Unlike for cosf and sinf, the old accuracy was close to being inadequate -- the polynomial for double precision has a max error of 0.014 ulps and nearly this small an error is needed. The new accuracy is also a bit small, but exhaustive checking shows that even the old accuracy was enough. The increased accuracy reduces the maximum relative error in the final result on amd64 -O1 from 0.9588 ulps to 0.9044 ulps.	2005-11-10 17:43:49 +00:00
Bruce Evans	2b6ca0f6a5	Detach k_rem_pio2f.c from the build since it is now unused. It is a libm internal so this shouldn't cause version problems.	2005-11-06 17:59:40 +00:00
Bruce Evans	efff995f3b	Use a 53-bit approximation to pi/2 instead of a 33+53 bit one for the special case pi/4 <= \|x\| < 3pi/4. This gives a tiny optimization (it saves 2 subtractions, which are scheduled well so they take a whole 1 cycle extra on an AthlonXP), and simplifies the code so that the following optimization is not so ugly. Optimize for the range 3pi/4 < \|x\| < 9Pi/2 in the same way. On Athlon{XP,64} systems, this gives a 25-40% optimization (depending a lot on CFLAGS) for the cosf() and sinf() consumers on this range. Relative to i387 hardware fcos and fsin, it makes the software versions faster in most cases instead of slower in most cases. The relative optimization is smaller for tanf() the inefficient part is elsewhere. The 53-bit approximation to pi/2 is good enough for pi/4 <= \|x\| < 3pi/4 because after losing up to 24 bits to subtraction, we still have 29 bits of precision and only need 25 bits. Even with only 5 extra bits, it is possible to get perfectly rounded results starting with the reduced x, since if x is nearly a multiple of pi/2 then x is not near a half-way case and if x is not nearly a multiple of pi/2 then we don't lose many bits. With our intentionally imperfect rounding we get the same results for cosf(), sinf() and tanf() as without this optimization.	2005-11-06 17:48:02 +00:00
Bruce Evans	32948b81c4	The logb() functions are not just ieee754 "test" functions, but are standard in C99 and POSIX.1-2001+. They are also not deprecated, since apart from being standard they can handle special args slightly better than the ilogb() functions. Move their documentation to ilogb.3. Try to use consistent and improved wording for both sets of functions. All of ieee854, C99 and POSIX have better wording and more details for special args. Add history for the logb() functions and ilogbl(). Fix history for ilogb().	2005-11-06 12:18:27 +00:00
Bruce Evans	cb92d4d58f	Moved the optimization for tiny x from __kernel_tan[f](x) to tan[f](x) so that it can be faster for tiny x and avoided for reduced x. This improves things a little differently than for cosine and sine. We still need to reclassify x in the "kernel" functions, but we get an extra optimization for tiny x, and an overall optimization since tiny reduced x rarely happens. We also get optimizations for space and style. A large block of poorly duplicated code to fix a special case is no longer needed. This supersedes the fixes in k_sin.c revs 1.9 and 1.11 and k_sinf.c 1.8 and 1.10. Fixed wrong constant for the cutoff for "tiny" in tanf(). It was 2-28, but should be almost the same as the cutoff in sinf() (2-12). The incorrect cutoff protected us from the bugs fixed in k_sinf.c 1.8 and 1.10, except 4 cases of reduced args passed the cutoff and needed special handling in theory although not in practice. Now we essentially use a cutoff of 0 for the case of reduced args, so we now have 0 special args instead of 4. This change makes no difference to the results for sinf() (since it only changes the algorithm for the 4 special args and the results for those happen not to change), but it changes lots of results for sin(). Exhaustive testing is impossible for sin(), but exhaustive testing for sinf() (relative to a version with the old algorithm and a fixed cutoff) shows that the changes in the error are either reductions or from 0.5-epsilon ulps to 0.5+epsilon ulps. The new method just uses some extra terms in approximations so it tends to give more accurate results, and there are apparently no problems from having extra accuracy. On amd64 with -O1, on all float args the error range in ulps is reduced from (0.500, 0.665] to [0.335, 0.500) in 24168 cases and increased from 0.500-epsilon to 0.500+epsilon in 24 cases. Non- exhaustive testing by ucbtest shows no differences.	2005-11-02 14:01:45 +00:00
Bruce Evans	4f8d68d6ca	Updated the comment about the optimization for tiny x (the previous commit moved it). This includes a comment that the "kernel" sine no longer works on arg -0, so callers must now handle this case. The kernel sine still works on all other tiny args; without the optimization it is just a little slower on these args. I intended it to keep working on all tiny args, but that seems to be impossible without losing efficiency or accuracy. (sin(x) ~ x * (1 + S1x2 + ...) would preserve -0, but the approximation must be written as x + S1x**3 + ... for accuracy.)	2005-11-02 13:06:49 +00:00
Bruce Evans	639a1e1106	Removed dead code for handling tan[f]() on odd multiples of pi/2. This case never occurs since pi/2 is irrational so no multiple of it can be represented as a float and we have precise arg reduction so we never end up with a remainder of 0 in the "kernel" function unless the original arg is 0. If this case occurs, then we would now fall through to general code that returns +-Inf (depending on the sign of the reduced arg) instead of forcing +Inf. The correct handling would be to return NaN since we would have lost so much precision that the correct result can be anything _except_ +-Inf. Don't reindent the else clause left over from this, although it was already bogusly indented ("if (foo) return; else ..." just marches the indentation to the right), since it will be removed too. Index: k_tan.c =================================================================== RCS file: /home/ncvs/src/lib/msun/src/k_tan.c,v retrieving revision 1.10 diff -r1.10 k_tan.c 88,90c88 < if (((ix \| low) \| (iy + 1)) == 0) < return one / fabs(x); < else { --- > {	2005-11-02 06:45:21 +00:00
Bruce Evans	16622bffd4	Fixed some of the silliness related to rev.1.8. In 1.8, "double" in a declaration was not translated to "float" although bit fiddling on double variables was translated. This resulted in garbage being put into the low word of one of the doubles instead of non-garbage being put into the only word of the intended float. This had no effect on any result because: - with doubles, the algorithm for calculating -1/(x+y) is unnecessarily complicated. Just returning -1/((double)x+y) would work, and the misdeclaration gave something like that except for messing up some low bits with the bit fiddling. - doubles have plenty of bits to spare so messing up some of the low bits is unlikely to matter. - due to other bugs, the buggy code is reached for a whole 4 args out of all 232 float args. The bug fixed by 1.8 only affects a small percentage of cases and a small percentage of 4 is 0. The 4 args happen to cause no problems without 1.8, so they are even less likely to be affected by the bug in 1.8 than average args; in fact, neither 1.8 nor this commit makes any difference to the result for these 4 args (and thus for all args). Corrections to the log message in 1.8: the bug only applies to tan() and not tanf(), not because the float type can't represent numbers large enough to trigger the problem (e.g., the example in the fdlibm-5.3 readme which is > 1.0e269), but because: - the float type can't represent small enough numbers. For there to be a possible problem, the original arg for tanf() must lie very near an odd multiple of pi/2. Doubles can get nearer in absolute units. In ulps there should be little difference, but ... - ... the cutoff for "small" numbers is bogus in k_tanf.c. It is still the double value (2-28). Since this is 32 times smaller than FLT_EPSILON and large float values are not very uniformly distributed, only 6 args other than ones that are initially below the cutoff give a reduced arg that passes the cutoff (the 4 problem cases mentioned above and 2 non-problem cases). Fixing the cutoff makes the bug affect tanf() and much easier to detect than for tan(). With a cutoff of 2**-12 on amd64 with -O1, 670102 args pass the cutoff; of these, there are 337604 cases where there might be an error of >= 1 ulp and 5826 cases where there is such an error; the maximum error is 1.5382 ulps. The fix in 1.8 works with the reduced cutoff in all cases despite the bug in it. It changes the result in 84492 cases altogether to fix the 5826 broken cases. Fixing the fix by translating "double" to "float" changes the result in 42 cases relative to 1.8. In 24 cases the (absolute) error is increased and in 18 cases it is reduced, but it remains less than 1 ulp in all cases.	2005-11-02 05:37:31 +00:00
Bruce Evans	053d1689b1	Fixed spelling of remquof() in its prototype.	2005-10-30 12:34:58 +00:00
Bruce Evans	f964c6ecfb	Fixed some comments added in rev.1.5. The log message for 1.5 said that some small (one or two ulp) inaccuracies were fixed, and a comment implied that the critical change is to switch the rounding mode to to-nearest, with a switch of the precision to extended at no extra cost. Actually, the errors are very large (ucbtest finds ones of several hundred ulps), and it is the switch of the precision that is critical. Another comment was wrong about NaNs being handled sloppily.	2005-10-30 12:21:02 +00:00
Bruce Evans	19b114da0e	Implement inline functions to give the complex result x+Iy from float or double args x and y. x+Iy cannot be used directly yet due to compiler bugs. Submitted by: Steve Kargl <sgk@troutmask.apl.washington.edu>	2005-10-29 17:14:11 +00:00
Bruce Evans	8b438ea8dd	Use double precision to simplify and optimize arg reduction for small and medium size args too: instead of conditionally subtracting a float 17+24, 17+17+24 or 17+17+17+24 bit approximation to pi/2, always subtract a double 33+53 bit one. The float version is now closer to the double version than to old versions of itself -- it uses the same 33+53 bit approximation as the simplest cases in the double version, and where the float version had to switch to the slow general case at \|x\| == 2^7pi/2, it now switches at \|x\| == 2^19pi/2 the same as the double version. This speeds up arg reduction by a factor of 2 for \|x\| between 3pi/4 and 2^7pi/4, and by a factor of 7 for \|x\| between 2^7pi/4 and 2^19pi/4.	2005-10-29 16:34:50 +00:00
Bruce Evans	21b0341c80	Start trying to make the float precision trig functions actually worth using under FreeBSD. Before this commit, all float precision functions except exp2f() were implemented using only float precision, apparently because Cygnus needed this in 1993 for embedded systems with slow or inefficient double precision. For FreeBSD, except possibly on systems that do floating point entirely in software (very old i386 and now arm), this just gives a more complicated implementation, many bugs, and usually worse performance for float precision than for double precision. The bugs and worse performance were particulary large in arg reduction for trig functions. We want to divide by an approximation to pi/2 which has as many as 1584 bits, so we should use the widest type that is efficient and/or easy to use, i.e., double. Use fdlibm's __kernel_rem_pio2() to do this as Sun apparently intended. Cygnus's k_rem_pio2f.c is now unused. e_rem_pio2f.c still needs to be separate from e_rem_pio2.c so that it can be optimized for float args. Similarly for long double precision. This speeds up cosf(x) on large args by a factor of about 2. Correct arg reduction on large args is still inherently very slow, so hopefully these args rarely occur in practice. There is much more efficiency to be gained by using double precision to speed up arg reduction on medium and small float args.	2005-10-29 08:15:29 +00:00
Bruce Evans	11dc241777	Use fairly optimal minimax polynomials for __kernel_cosf() and __kernel_sinf(). The old ones were the double-precision polynomials with coefficients truncated to float. Truncation is not a good way to convert minimax polynomials to lower precision. Optimize for efficiency and use the lowest-degree polynomials that give a relative error of less than 1 ulp -- degree 8 instead of 14 for cosf and degree 9 instead of 13 for sinf. For sinf, the degree 8 polynomial happens to be 6 times more accurate than the old degree 14 one, but this only gives a tiny amount of extra accuracy in results -- we just need to use a a degree high enough to give a polynomial whose relative accuracy in infinite precision (but with float coefficients) is a small fraction of a float ulp (fdlibm generally uses 1/32 for the small fraction, and the fraction for our degree 8 polynomial is about 1/600). The maximum relative errors for cosf() and sinf() are now 0.7719 ulps and 0.7969 ulps, respectively.	2005-10-28 13:36:58 +00:00
Bruce Evans	3b46e988e7	Use a better algorithm for reducing the error in __kernel_cos[f](). This supersedes the fix for the old algorithm in rev.1.8 of k_cosf.c. I want this change mainly because it is an optimization. It helps make software cos[f](x) and sin[f](x) faster than the i387 hardware versions for small x. It is also a simplification, and reduces the maximum relative error for cosf() and sinf() on machines like amd64 from about 0.87 ulps to about 0.80 ulps. It was validated for cosf() and sinf() by exhaustive testing. Exhaustive testing is not possible for cos() and sin(), but ucbtest reports a similar reduction for the worst case found by non-exhaustive testing. ucbtest's non-exhaustive testing seems to be good enough to find problems in algorithms but not maximum relative errors when there are spikes. E.g., short runs of it find only 3 ulp error where the i387 hardware cos() has an error of about 2**40 ulps near pi/2.	2005-10-26 12:36:18 +00:00
Bruce Evans	a92cb60b4e	More fixes for arg reduction near pi/2 on systems with broken assignment to floats (mainly i386's). All errors of more than 1 ulp for float precision trig functions were supposed to have been fixed; however, compiling with gcc -O2 uncovered 18250 more such errors for cosf(), with a maximum error of 1.409 ulps. Use essentially the same fix as in rev.1.8 of k_rem_pio2f.c (access a non-volatile variable as a volatile). Here the -O1 case apparently worked because the variable is in a 2-element array and it takes -O2 to mess up such a variable by putting it in a register. The maximum error for cosf() on i386 with gcc -O2 is now 0.5467 (it is still 0.5650 with gcc -O1). This shows that -O2 still causes some extra precision, but the extra precision is now good. Extra precision is harmful mainly for implementing extra precision in software. We want to represent x+y as w+r where both "+" operations are in infinite precision and r is tiny compared with w. There is a standard algorithm for this (Knuth (1981) 4.2.2 Theorem C), and fdlibm uses this routinely, but the algorithm requires w and r to have the same precision as x and y. w is just x+y (calculated in the same finite precision as x and y), and r is a tiny correction term. The i386 gcc bugs tend to give extra precision in w, and then using this extra precision in the calculation of r results in the correction mostly staying in w and being missing from r. There still tends to be no problem if the result is a simple expression involving w and r -- modulo spills, w keeps its extra precision and r remains the right correction for this wrong w. However, here we want to pass w and r to extern functions. Extra precision is not retained in function args, so w gets fixed up, but the change to the tiny r is tinier, so r almost remains as a wrong correction for the right w.	2005-10-25 12:13:37 +00:00
Bruce Evans	4339c67c48	Moved the optimization for tiny x from __kernel_{cos,sin}[f](x) to {cos_sin}[f](x) so that x doesn't need to be reclassified in the "kernel" functions to determine if it is tiny (it still needs to be reclassified in the cosine case for other reasons that will go away). This optimization is quite large for exponentially distributed x, since x is tiny for almost half of the domain, but it is a pessimization for uniformally distributed x since it takes a little time for all cases but rarely applies. Arg reduction on exponentially distributed x rarely gives a tiny x unless the reduction is null, so it is best to only do the optimization if the initial x is tiny, which is what this commit arranges. The imediate result is an average optimization of 1.4% relative to the previous version in a case that doesn't favour the optimization (double cos(x) on all float x) and a large pessimization for the relatively unimportant cases of lgamma[f][_r](x) on tiny, negative, exponentially distributed x. The optimization should be recovered for lgamma() as part of fixing lgamma()'s low-quality arg reduction. Fixed various wrong constants for the cutoff for "tiny". For cosine, the cutoff is when x2/2! == {FLT or DBL}_EPSILON/2. We round down to an integral power of 2 (and for cos() reduce the power by another 1) because the exact cutoff doesn't matter and would take more work to determine. For sine, the exact cutoff is larger due to the ration of terms being x2/3! instead of x2/2!, but we use the same cutoff as for cosine. We now use a cutoff of 2-27 for double precision and 2-12 for single precision. 2-27 was used in all cases but was misspelled 2**27 in comments. Wrong and sloppy cutoffs just cause missed optimizations (provided the rounding mode is to nearest -- other modes just aren't supported).	2005-10-24 14:08:36 +00:00
Bruce Evans	74bbe8ed42	Fixed range reduction for large multiples of pi/2 on systems with broken assignment to floats (e.g., i386 with gcc -O, but not amd64 or ia64; i386 with gcc -O0 worked accidentally). Use an unnamed volatile temporary variable to trick gcc -O into clipping extra precision on assignment. It's surprising that only 1 place needed to be changed. For tanf() on i386 with gcc -O, the bug caused errors > 1 ulp with a density of 2.3% for args larger in magnitude than 128pi/2, with a maximum error of 1.624 ulps. After this fix, exhaustive testing shows that range reduction for floats works as intended assuming that it is in within a factor of about 2^16 of working as intended for doubles. It provides >= 8 extra bits of precision for all ranges. On i386: range max error in double/single ulps extra precision ----- ------------------------------- --------------- 0 to 3pi/4 0x000d3132 / 0.0016 9+ bits 3pi/4 to 128pi/2 0x00160445 / 0.0027 8+ 128pi/2 to +Inf 0x00000030 / 0.00000009 23+ 128pi/2 up, -O0 before fix 0x00000030 / 0.00000009 23+ 128*pi/2 up, -O1 before fix 0x10000000 / 0.5 1 The 23+ bits of extra precision for large multiples corresponds to almost perfect reduction to a pair of floats (24 extra would be perfect). After this fix, the maximum relative error (relative to the corresponding fdlibm double precision function) is < 1 ulp for all basic trig functions on all 2^32 float args on all machines tested: amd64 ia64 i386-O0 i386-O1 ------ ------ ------ ------ cosf: 0.8681 0.8681 0.7927 0.5650 sinf: 0.8733 0.8610 0.7849 0.5651 tanf: 0.9708 0.9329 0.9329 0.7035	2005-10-11 07:56:05 +00:00
Bruce Evans	59b8fc1535	Fixed range reduction near (but not very near) medium-sized multiples of pi/2 (1 line) and expand a comment about related magic (many lines). The bug was essentially the same as for the +-pi/2 case (a mistranslated mask), but was smaller so it only significantly affected multiples starting near +-13*pi/2. At least on amd64, for cosf() on all 2^32 float args, the bug caused 128 errors of >= 1 ulp, with a maximum error of 1.2393 ulps.	2005-10-10 20:02:02 +00:00
Bruce Evans	11cba99f67	Fix numerous errors of >= 1 ulp for cosf(x) and sinf(x) (1 line) and add a comment about related magic (many lines)). __kernel_cos[f]() needs a trick to reduce the error to below 1 ulp when \|x\| >= 0.3 for the range-reduced x. Modulo other bugs, naive code that doesn't use the trick would have an error of >= 1 ulp in about 0.00006% of cases when \|x\| >= 0.3 for the unreduced x, with a maximum relative error of about 1.03 ulps. Mistransation of the trick from the double precision case resulted in errors in about 0.2% of cases, with a maximum relative error of about 1.3 ulps. The mistranslation involved not doing implicit masking of the 32-bit float word corresponding to to implicit masking of the lower 32-bit double word by clearing it. sinf() uses __kernel_cosf() for half of all cases so its errors from this bug are similar. tanf() is not affected. The error bounds in the above and in my other recent commit messages are for amd64. Extra precision for floats on i386's accidentally masks this bug, but only if k_cosf.c is compiled with -O. Although the extra precision helps here, this is accidental and depends on longstanding gcc precision bugs (not clipping extra precision on assignment...), and the gcc bugs are mostly avoided by compiling without -O. I now develop libm mainly on amd64 systems to simplify error detection and debugging.	2005-10-09 21:07:23 +00:00
Bruce Evans	a0e34da09f	Oops, the last-minute optimization in rev.1.8 wasn't a good idea. The 17+17+24 bit pi/2 must only be used when subtraction of the first 2 terms in it from the arg is exact. This happens iff the the arg in bits is one of the 2**17[-1] values on each side of (float)(pi/2). Revert to the algorithm in rev.1.7 and only fix its threshold for using the 3-term pi/2. Use the threshold that maximizes the number of values for which the 3-term pi/2 is used, subject to not changing the algorithm for comparing with the threshold. The 3-term pi/2 ends up being used for about half of its usable range (about 64K values on each side).	2005-10-09 04:29:08 +00:00
Bruce Evans	cd604283af	Fixed syntax error (a missing brace) in previous commit.	2005-10-08 22:55:36 +00:00
Bruce Evans	a7b8acac04	Fixed range reduction near (but not very near) +-pi/2. A bug caused a maximum error of 2.905 ulps for cosf(), but the algorithm for cosf() is good for < 1 ulps and happens to give perfect rounding (< 0.5 ulps) near +-pi/2 except for the bug. The extra relative errors for tanf() were similar (slightly larger). The bug didn't affect sinf() since sinf'(+-pi/2) is 0. For range reduction in ~[-3pi/4, -pi/4] and ~[pi/4, 3pi/4] we must subtract +-pi/2 and the only complication is that this must be done in extra precision. We have handy 17+24-bit and 17+17+24-bit approximations to pi/2. If we always used the former then we would lose up to 24 bits of accuracy due to cancelation of leading bits, but we need to keep at least 24 bits plus a guard digit or 2, and should keep as many guard bits as efficiency permits. So we used the less-precise pi/2 not very near +-pi/2 and switched to using the more-precise pi/2 very near +-pi/2. However, we got the threshold for the switch wrong by allowing 19 bits to cancel, so we ended up with only 21 or 22 bits of accuracy in some cases, which is even worse than naively subtracting pi/2 would have done. Exhaustive checking shows that allowing only 17 bits to cancel (min. accuracy ~24 bits) is sufficient to reduce the maximum error for cosf() near +-pi/2 to 0.726 ulps, but allowing only 6 bits to cancel (min. accuracy ~35-bits) happens to give perfect rounding for cosf() at little extra cost so we prefer that. We actually (in effect) allow 0 bits to cancel and always use the 17+17+24-bit pi/2 (min. accuracy ~41 bits). This is simpler and probably always more efficient too. Classifying args to avoid using this pi/2 when it is not needed takes several extra integer operations and a branch, but just using it takes only 1 FP operation. The patch also fixes misspelling of 17 as 24 in many comments. For the double-precision version, the magic numbers include 33+53 bits for the less-precise pi/2 and (53-32-1 = 20) bits being allowed to cancel, so there are ~33-20 = 13 guard bits. This is sufficient except probably for perfect rounding. The more-precise pi/2 has 33+33+53 bits and we still waste time classifying args to avoid using it. The bug is apparently from mistranslation of the magic 32 in 53-32-1. The number of bits allowed to cancel is not critical and we use 32 for double precision because it allows efficient classification using a 32-bit comparison. For float precision, we must use an explicit mask, and there are fewer bits so there is less margin for error in their allocation. The 32 got reduced to 4 but should have been reduced almost in proportion to the reduction of mantissa bits.	2005-10-08 22:43:55 +00:00
Bruce Evans	0b42281ee9	Fixed aliasing bugs in TRUNC() by using the fdlibm macros for access to doubles as bits. fdlibm-1.1 had similar aliasing bugs, but these were fixed by NetBSD or Cygnus before a modified version of fdlibm was imported in 1994. TRUNC() is only used by tgamma() and some implementation-detail functions. The aliasing bugs were detected by compiling with gcc -O2 but don't seem to have broken tgamma() on i386's or amd64's. They broke my modified version of tgamma(). Moved the definition of TRUNC() to mathimpl.h so that it can be fixed in one place, although the general version is even slower than necessary because it has to operate on pointers to volatiles to handle its arg sometimes being volatile. Inefficiency of the fdlibm macros slows down libm generally, and tgamma() is a relatively unimportant part of libm. The macros act as if on 32-bit words in memory, so they are hard to optimize to direct actions on 64-bit double registers for (non-i386) machines where this is possible. The optimization is too hard for gcc on amd64's, and declaring variables as volatile makes it impossible.	2005-09-19 11:28:19 +00:00
David Schultz	26bd283f2a	Add a missing ldexpf() alias for amd64. Noticed by: bz@, tjr@	2005-09-12 20:54:00 +00:00
Ken Smith	a84020c2b9	Bump the shared library version number of all libraries that have not been bumped since RELENG_5. Reviewed by: ru Approved by: re (not needed for commit check but in principle...)	2005-07-22 17:19:05 +00:00
Ruslan Ermilov	01293bdb90	Markup nit. Approved by: re (blanket)	2005-06-16 21:56:03 +00:00
Ruslan Ermilov	70db9cd000	Fixed compile warning. Approved by: re (blanket)	2005-06-16 21:55:45 +00:00
Ruslan Ermilov	f789cb8293	Assorted markup fixes. Approved by: re	2005-06-15 19:04:04 +00:00
Daniel Eischen	7f8fa2cf47	Prevent these functions from using stack outside of their frame. Reported by: Marc Olzheim <marcolz at stack dot nl> OK'd by: das	2005-05-06 15:44:20 +00:00
Stefan Farfeleder	66116c07a7	Revert the last change, the conversion from long double to double can raise unwanted underflow exceptions. Pointed out by: das	2005-04-28 19:45:55 +00:00
Stefan Farfeleder	8f58ab910f	Use double additions to raise the inexact exception to work around problems with long double addition on sparc64.	2005-04-22 09:57:55 +00:00
Stefan Farfeleder	9eb30792de	Fix raising the inexact exception (FE_INEXACT) if the result differs from the argument. Noticed by: das	2005-04-22 08:30:33 +00:00
Andrey A. Chernov	db7354df52	Fix truncl.3 MLINKS	2005-04-17 19:57:52 +00:00
David Schultz	a4ca7ca8ac	More optimized math functions.	2005-04-16 21:12:55 +00:00
David Schultz	2f2ee27de4	Implement truncl() based on floorl().	2005-04-16 21:12:47 +00:00
David Schultz	07f3bc5b9c	Add roundl(), lroundl(), and llroundl().	2005-04-08 01:24:08 +00:00
David Schultz	4bb190a74b	These files should include s_lround.c instead of s_lrint.c. This only matters for efficiency, not for correctness.	2005-04-08 00:52:27 +00:00
David Schultz	fc87986708	Fix a (coincidentally harmless) bug.	2005-04-08 00:52:16 +00:00
David Schultz	46691dfbe7	Fix a long-standing bug in k_rem_pio2(), which led to large errors when tanf() was called with big arguments close to multiples of pi/2. Reported by: ucbtest via bde	2005-04-05 23:27:47 +00:00
David Schultz	d06a0070af	Build exp2(), exp2f(), and related documentation.	2005-04-05 02:57:39 +00:00
David Schultz	90232fdf16	Document exp2() and exp2f(), and make other minor tweaks and updates.	2005-04-05 02:57:28 +00:00
David Schultz	f8d6ede6b5	Implement exp2() and exp2f().	2005-04-05 02:57:15 +00:00
David Schultz	3b9141ee91	Implement and document remquo() and remquof().	2005-03-25 04:40:44 +00:00
David Schultz	2c2435825a	Fix the double rounding problem with subnormals, and remove the XXX comments, which no longer apply.	2005-03-18 02:27:59 +00:00
David Schultz	21122bea01	Add missing prototypes for fma() and fmaf(), and remove an inaccurate comment.	2005-03-18 01:47:42 +00:00
David Schultz	9233b45ad9	Make the fenv.h routines work for programs that use SSE for floating-point arithmetic on i386. Now I'm going to make excuses for why this code is kinda scary: - To avoid breaking the ABI with 5.3-RELEASE, we can't change sizeof(fenv_t). I stuck the saved mxcsr in some discontiguous reserved bits in the existing structure. - Attempting to access the mxcsr on older processors results in an illegal instruction exception, so support for SSE must be detected at runtime. (The extra baggage is optimized away if either the application or libm is compiled with -msse{,2}.) I didn't run tests to ensure that this doesn't SIGILL on older 486's lacking the cpuid instruction or on other processors lacking SSE. Results from running the fenv regression test on these processors would be appreciated. (You'll need to compile the test with -DNO_STRICT_DFL_ENV.) If you have an 80386, or if your processor supports SSE but the kernel didn't enable it, then you're probably out of luck. Also, I un-inlined some of the functions that grew larger as a result of this change, moving them from fenv.h to fenv.c.	2005-03-17 22:21:46 +00:00
David Schultz	56ad27535a	Spell 'fedisableexcept' correctly.	2005-03-16 22:34:14 +00:00
David Schultz	2e5fb44003	Document feenableexcept(), fedisableexcept(), and fegetexcept().	2005-03-16 19:04:28 +00:00
David Schultz	10b01832c3	Replace fegetmask() and fesetmask() with feenableexcept(), fedisableexcept(), and fegetexcept(). These two sets of routines provide the same functionality. I implemented the former as an undocumented internal interface to make the regression test easier to write. However, fe(enable\|disable\|get)except() is already part of glibc, and I would like to avoid gratuitous differences. The only major flaw in the glibc API is that there's no good way to report errors on processors that don't support all the unmasked exceptions.	2005-03-16 19:03:46 +00:00
David Schultz	3d266bde6d	Replace strong references with weak references. There's no particularly good reason to do this, except that __strong_reference does type checking, whereas __weak_reference does not. On Alpha, the compiler won't accept a 'long double' parameter in place of a 'double' parameter even thought the two types are identical.	2005-03-07 21:27:37 +00:00
Stefan Farfeleder	3ddc6e9440	Remove an obsolete sentence from a comment.	2005-03-07 20:28:26 +00:00
David Schultz	c8642491d5	- If z is 0, one of x or y is 0, and the other is infinite, raise an invalid exception and return an NaN. - If a long double has 113 bits of precision, implement fma in terms of simple long double arithmetic instead of complicated double arithmetic. - If a long double is the same as a double, alias fma as fmal.	2005-03-07 05:02:09 +00:00
David Schultz	388bf3b630	Document scalbnl and scalblnl.	2005-03-07 05:00:44 +00:00
David Schultz	6af2c5a60c	Document nextafterl and nexttoward{,f,l}.	2005-03-07 05:00:29 +00:00
David Schultz	15a53f77fd	Add nexttoward to the list of implemented functions, and explicitly list the four that are still missing.	2005-03-07 04:59:53 +00:00
David Schultz	66d672d8cb	Document fmal.	2005-03-07 04:59:43 +00:00
David Schultz	94e03502dc	Remove ldexp and ldexpf. The former is in libc, and the latter is identical to scalbnf, which is now aliased as ldexpf. Note that the old implementations made the mistake of setting errno and were the only libm routines to do so.	2005-03-07 04:59:30 +00:00
David Schultz	aeb5e711f3	- Remove s_ldexpf.c (now aliased to scalbn.) - Add nexttoward{,f,l} and nextafterl. On all platforms, nexttowardl is an alias for nextafterl. - Add fmal. - Add man pages for new routines: fmal, nextafterl, nexttoward{,f,l}, scalb{,l}nl. Note that on platforms where long double is the same as double, we generally just alias the double versions of the routines, since doing so avoids extra work on the source code level and redundant code in the binary. In particular: ldbl53 ldbl64/113 fmal s_fma.c s_fmal.c ldexpl s_scalbn.c s_scalbnl.c nextafterl s_nextafter.c s_nextafterl.c nexttoward s_nextafter.c s_nexttoward.c nexttowardf s_nexttowardf.c s_nexttowardf.c nexttowardl s_nextafter.c s_nextafterl.c scalbnl s_scalbn.c s_scalbnl.c	2005-03-07 04:59:11 +00:00
David Schultz	228ad57d05	- Define FP_FAST_FMA for sparc64, since fma() is now implemented using sparc64's 128-bit long doubles. - Define FP_FAST_FMAL for ia64. - Prototypes for fmal, frexpl, ldexpl, nextafterl, nexttoward{,f,l}, scalblnl, and scalbnl.	2005-03-07 04:58:43 +00:00
David Schultz	beed720c37	Alias scalbn as ldexpl and scalbnl on platforms where long double is the same as double.	2005-03-07 04:58:03 +00:00
David Schultz	7b6a19039d	- Implement scalblnl. - In scalbln and scalblnf, check the bounds of the second argument. This is probably unnecessary, but strictly speaking, we should report an error if someone tries to compute scalbln(x, INT_MAX + 1ll).	2005-03-07 04:57:50 +00:00
David Schultz	caacab9b5f	Implement nexttowardf. This is used on both platforms with 11-bit exponents and platforms with 15-bit exponents for long doubles.	2005-03-07 04:57:38 +00:00
David Schultz	ef94de735a	Implement nexttoward and nextafterl; the latter is also known as nexttowardl. These are not needed on machines where long doubles look like IEEE-754 doubles, so the implementation only supports the usual long double formats with 15-bit exponents. Anything bizarre, such as machines where floating-point and integer data have different endianness, will cause problems. This is the case with big endian ia64 according to libc/ia64/_fpmath.h. Please contact me if you managed to get a machine running this way.	2005-03-07 04:56:46 +00:00
David Schultz	a506506a1c	- Try harder to trick gcc into not optimizing away statements that are intended to raise underflow and inexact exceptions. - On systems where long double is the same as double, nextafter should be aliased as nexttoward, nexttowardl, and nextafterl.	2005-03-07 04:55:58 +00:00
David Schultz	e0fe8e4440	Implement frexpl.	2005-03-07 04:54:51 +00:00
David Schultz	f8a40fca14	Alias frexp as frexpl on platforms where a long double is the same as a double.	2005-03-07 04:54:39 +00:00
David Schultz	65e60ab108	Implement fmal.	2005-03-07 04:54:20 +00:00
David Schultz	b1f37dcef4	- Define the LDBL_PREC to be the number of significant bits in a long double's mantissa. - Add an assembly version of fmal.	2005-03-07 04:54:02 +00:00
David Schultz	99401fa2e9	- Define the LDBL_PREC to be the number of significant bits in a long double's mantissa. - Add an assembly version of scalbnl.	2005-03-07 04:53:48 +00:00
David Schultz	4be31f0664	Define the LDBL_PREC to be the number of significant bits in a long double's mantissa.	2005-03-07 04:53:36 +00:00
David Schultz	4442891961	Add an assembly version of fmal.	2005-03-07 04:53:11 +00:00
David Schultz	cd7d05b5a2	Add scalbnl, also known as as ldexpl.	2005-03-07 04:52:58 +00:00
David Schultz	4b2011300b	Alias scalbnf as ldexpf. The two are identical in binary floating-point formats.	2005-03-07 04:52:43 +00:00
David Schultz	1b32579f23	Fix a mistake in the exponent range.	2005-03-06 19:08:18 +00:00
David Schultz	f4a5643005	Work around a gcc bug. This fixes feholdexcept() et al. at -O1. Symptoms of the problem included assembler warnings and nondeterministic runtime behavior when a fe*() call that affects the fpsr is closely followed by a float point op. The bug (at least, I think it's a bug) is that gcc does not insert a break between a volatile asm and a dependent instruction if the volatile asm came from an inlined function. Volatile asms seem to be fine in other circumstances, even without -mvolatile-asm-stop, so perhaps the compiler adds the stop bits before inlining takes place. The problem does not occur at -O0 because inlining is disabled, and it doesn't happen at -O2 because -fschedule-insns2 knows better.	2005-03-05 20:34:45 +00:00
David Schultz	57276bb6ea	Un-document the non-extant exp10() and exp10f() functions. exp10() was a casualty of the transition away from the VAX.	2005-02-26 08:54:45 +00:00
David Schultz	aa28340df9	Revert rev 1.8, which causes small (e.g. 2 ulp) errors for some inputs. The trouble with replacing two floats with a double is that the latter has 6 extra bits of precision, which actually hurts accuracy in many cases. All of the constants are optimal when float arithmetic is used, and would need to be recomputed to do this right. Noticed by: bde (ucbtest)	2005-02-24 06:32:13 +00:00
David Schultz	adec44c08b	Use hardware instructions for sqrt() and sqrtf().	2005-02-21 18:27:57 +00:00
David Schultz	96efaf6c36	Use double arithmetic instead of simulating it with two floats. This results in a performance gain on the order of 10% for amd64 (sledge), ia64 (pluto1), i386+SSE (Pentium 4), and sparc64 (panther), and a negligible improvement for i386 without SSE. (The i386 port still uses the hardware instruction, though.)	2005-02-21 17:44:57 +00:00
David Schultz	f674c13c78	Remove the i387 versions of atan(), atan2(), and atan2f(). They are slower than the MI routines on modern hardware, except for degenerate cases such as the Pentium 4. PR: 67469	2005-02-21 16:04:23 +00:00
David Schultz	c4691a5da9	Remove i387 versions of asin() and acos(). Although the hardware instruction was faster on the 486, it's slower than our MD version on modern processors. Determined by: bde PR: 67469	2005-02-20 22:51:08 +00:00
David Schultz	dab1571b90	Remove the float versions of the i387 trig functions obtained from NetBSD. They're buggy, giving particularly for inputs larger in magnitude than 2**63. Noticed by: bde PR: 67469	2005-02-20 22:50:40 +00:00
David Schultz	e02846ce13	Fix a small scripting snafu in the previous revision.	2005-02-04 20:05:39 +00:00
David Schultz	b21154f677	Remove another vestige of support for a non-IEEE libm.	2005-02-04 18:32:13 +00:00
David Schultz	3f70824172	Reduce diffs against vendor source (Sun fdlibm 5.3).	2005-02-04 18:26:06 +00:00
David Schultz	79b990338f	Move machine-dependent crud to its own makefile.	2005-02-04 14:33:39 +00:00
David Schultz	e1b61b5b93	Remove wrappers and other cruft intended to support SVID, mistakes in C90, and other arcana. Most of these features were never fully supported or enabled by default. Ok: bde, stefanf	2005-02-04 14:08:32 +00:00
Ruslan Ermilov	1f8ee0e102	Typo.	2005-01-28 21:14:16 +00:00
Ruslan Ermilov	d7a604cc33	Properly terminate sentence.	2005-01-28 21:13:34 +00:00
David Schultz	29bf6af890	- Move the functions presently described in in ieee(3) to their own manpages. They are not very related, so separating them makes it easier to add meaningful cross-references and extend some of the descriptions. - Move the part of math(3) that discusses IEEE 754 to the ieee(3) manpage.	2005-01-27 05:46:17 +00:00
Olivier Houchard	15d3b4db61	Define FE_TONEAREST, FE_TOWARDZERO, FE_UPWARD, FE_DOWNWARD and _ROUND_MASK to unbreak the build for arm.	2005-01-24 00:35:02 +00:00
David Schultz	cb2d2321cd	Update comment to reflect the code change in the previous revision. Noticed by: ceri	2005-01-23 22:56:08 +00:00
David Schultz	52611c608e	Many changes, including the following major ones: - Rearrange the list of functions into categories. - Remove the ulps column. It was appropriate for only some of the functions in the list, and correct for even fewer of them. - Add some new paragraphs, and remove some old ones about NaNs that may do more harm than good. - Document precisions other than double-precision.	2005-01-23 22:05:33 +00:00
David Schultz	3c4d0a0973	If x == y, return y, not x. C99 (though not IEEE 754) requires that nextafter(+0.0, -0.0) returns -0.0 and nextafter(-0.0, +0.0) returns +0.0.	2005-01-23 15:46:22 +00:00
David Schultz	d5580d091a	Add fma() and fmaf(), which implement a fused multiply-add operation.	2005-01-22 09:53:18 +00:00
Ruslan Ermilov	24a0682c64	Sort sections.	2005-01-20 09:17:07 +00:00
Ruslan Ermilov	5391441c05	Use the \*(If string provided by mdoc(7), to represent infinity.	2005-01-16 16:49:10 +00:00
Ruslan Ermilov	1fbb01b7f0	Removed redundant .br call.	2005-01-16 16:46:14 +00:00
David Schultz	cd3cc47033	amd64 assembly versions of sqrt(), lrint(), and llrint() using SSE2.	2005-01-15 03:32:28 +00:00
David Schultz	b6e65225a6	Most libm routines depend on the rounding mode and/or set exception flags, so they are not pure. Remove the __pure2 annotation from them. I believe that the following routines and their float and long double counterparts are the only ones here that can be __pure2: copysign is* fabs finite fmax fmin fpclassify ilogb nan signbit When gcc supports FENV_ACCESS, perhaps there will be a new annotation that allows the other functions to be considered pure when FENV_ACCESS is off. Discussed with: bde	2005-01-15 02:55:10 +00:00
David Schultz	71936f351e	Braino. Revert rev 1.50. Pointy hat to: das	2005-01-15 00:37:31 +00:00
David Schultz	8e26469445	Remove numerous references to VAX floating-point and the setting of errno, replacing them with a discussion of IEEE exceptions where appropriate. Cross-reference fenv(3) whenever exceptions are mentioned.	2005-01-14 23:28:28 +00:00
David Schultz	ce4e53c460	Set math_errhandling to MATH_ERREXCEPT. Now that we have fenv.h, we basically support this, subject to gcc's lack of FENV_ACCESS support. In any case, the previous setting of math_errhandling to 0 is not allowed by POSIX.	2005-01-14 22:03:27 +00:00
David Schultz	c165c4b9aa	Remove some #if 0'd code.	2005-01-14 21:51:46 +00:00
Ruslan Ermilov	e880667b92	Tiny markup nits.	2005-01-14 09:12:05 +00:00
David Schultz	f365db00e5	Mark all inline asms that read the floating-point control or status registers as volatile. Instructions that wrote to FP state were already marked volatile, but apparently gcc has license to move non-volatile asms past volatile asms. This broke amd64's feupdateenv at -O2 due to a WAR conflict between fnstsw and fldenv there.	2005-01-14 07:09:23 +00:00
Stefan Farfeleder	749f5f532e	Fixed too many of "the", and enclose multi-word argument in double quotes. Obtained from: ru	2005-01-13 20:33:42 +00:00
David Schultz	fe69257da2	Import the subset of J.T. Conklin's single-precision x86-optimized math routines that appear to be (a) correct and (b) faster than their MI counterparts on my Pentium 4. Obtained from: NetBSD	2005-01-13 18:58:25 +00:00
David Schultz	0d8f9eca28	The isnormal() in rev 1.2 should have been isfinite() so subnormals round correctly. Noticed by: stefanf	2005-01-13 15:43:41 +00:00
David Schultz	3cdb8115d7	Things that are broken, unneeded, and unused since 1997 belong in the attic.	2005-01-13 15:43:22 +00:00
Ruslan Ermilov	83e0359d53	Markup nits.	2005-01-13 10:43:01 +00:00
Ruslan Ermilov	113ed1bb1d	Fixed too many of "the", and enclose multi-word argument in double quotes.	2005-01-13 09:35:47 +00:00
Stefan Farfeleder	43295fac79	Implement and document ceill().	2005-01-13 09:11:41 +00:00
Stefan Farfeleder	4067ee86a5	Bump .Dd for the last commit.	2005-01-13 09:08:16 +00:00
Stefan Farfeleder	7e2ee1f065	Hook up and document floorl().	2005-01-12 22:16:26 +00:00
Stefan Farfeleder	17f418f9f4	Implement floorl().	2005-01-12 22:10:46 +00:00
Stefan Farfeleder	a7d82b7150	Whitespace nit.	2005-01-12 22:05:41 +00:00
David Schultz	10c9ffa425	Add MI implementations of [l]lrint[f]() and [l]lround[f](). Discussed with: bde	2005-01-11 23:12:55 +00:00
David Schultz	2aac156d2e	Document [l]lrint[f]() and [l]lround[f]().	2005-01-11 23:12:17 +00:00
David Schultz	439e59cf85	Faster lrint() and llrint() implementations for x86.	2005-01-11 23:10:53 +00:00
David Schultz	c1b70ced4f	Mark inline stmxcsr instructions as volatile, since this appears to be the only way to convince gcc that they read the MXCSR. The volatile annotation may be needed elsewhere as well.	2005-01-11 22:10:43 +00:00
Ruslan Ermilov	2d82ac3110	Scheduled mdoc(7) sweep.	2005-01-11 20:50:51 +00:00
Ruslan Ermilov	4e05ab77a8	Sanitize the markup, as prompted.	2005-01-11 20:16:03 +00:00
David Schultz	527055d12f	GC unused declaration	2004-12-16 20:40:49 +00:00
David Schultz	17519e9b79	Cosmetic changes only: - style - remove unused variables - de-support VAX Inspired by: bin/42388	2004-12-16 20:40:37 +00:00
David Schultz	dbc8f2b5ce	More updates for math(3): - Make some minor rearrangements in the introduction. - Mention the problem with argument reduction on i386. - Add recently-implemented functions to the table. - Un-document the error bounds that only apply to the old 4BSD math library, and fill in the correct values where I know them. No attempt has been made to document bounds lower than 1 ulp, although smaller bounds are usually achievable in round-to-nearest mode.	2004-10-11 20:13:52 +00:00
Stefan Farfeleder	2fd3a32ee1	Add and document ilogbl(), a long double version of ilogb().	2004-10-11 18:13:52 +00:00
Stefan Farfeleder	552ebda9dd	Use the FP_ILOG macros from <math.h> rather than hardcoded return values. Also be prepared for FP_ILOGBNAN != INT_MAX. Reviewed by: md5	2004-10-09 17:14:28 +00:00
Ken Smith	85a8b887df	Bump the library version numbers for the following libraries: /lib/{libm,libreadline} /usr/lib/{libhistory,libopie,libpcap} in preparation for doing the same thing to RELENG_5. HUGE amounts of help for determining what to bump provided by kris. Discussed on: freebsd-current Approved by: re (not required for commit but something like this should be)	2004-10-01 15:38:07 +00:00
David Schultz	d622ef6993	Further refine some #ifs: - Simplify the logic by using __GNUC_PREREQ__. Suggested by stefanf. - Make math.h compile with old (pre-8.0) versions of icc. Submitted by sf [sic].	2004-09-17 05:15:33 +00:00
Stefan Farfeleder	bef5493789	Add man pages for the cimag(), conj() and creal() functions.	2004-08-07 23:03:36 +00:00
Olivier Houchard	60b22cf1c2	Only use rfs and wfs if ARM_HARD_FLOAT is defined, and use stubs if it is not, in order to unbreak arm make world. The right way to do it with soft floats will be figured out later. Discussed with: das	2004-08-05 14:07:24 +00:00
David Schultz	2208ce0a06	Replace s_isnan.c and s_isnanf.c with the more compact s_isnan.c from libc. The externally-visible effect of this is to add __isnanl() to libm, which means that libm.so.2 can once again link against libc.so.4 when LD_BIND_NOW is set. This was broken by the addition of fdiml(), which calls __isnanl().	2004-08-05 01:46:11 +00:00
David Schultz	8dc56b6821	Use isnormal() instead of fpclassify() to avoid dependency on libc.so.5.	2004-08-05 01:44:55 +00:00
Alexander Kabaev	dd86691ec8	Work around known GCC 3.4.x problem and use ANSI prototype for dremf().	2004-07-28 05:53:18 +00:00
David Schultz	ec79bc0da9	Fix two bugs in the signbit() macro, which was implemented last year: - It was added to libc instead of libm. Hopefully no programs rely on this mistake. - It didn't work properly on large long doubles because its argument was converted to type double, resulting in undefined behavior.	2004-07-19 08:16:10 +00:00
Stefan Farfeleder	9979bae3e7	Fix minor namespace pollution: The prototypes for f{dim,max,min}(), nearbyint(), round() and trunc() shouldn't be visible when compiling with -D_XOPEN_SOURCE=500.	2004-07-17 15:03:52 +00:00
David Schultz	205d3300b8	Tweak the conditions under which certain gcc builtins are used: - Unlike the builtin relational operators, builtin floating-point constants were not available until gcc 3.3, so account for this.[1] - Apparently some versions of the Intel C Compiler fallaciously define __GNUC__ without actually being compatible with the claimed gcc version. Account for this, too.[2] [1] Noticed by: Christian Hiris <4711@chello.at> [2] Submitted by: Alexander Leidinger <Alexander@Leidinger.net>	2004-07-16 06:21:56 +00:00
David Schultz	9fc5c45bad	Remove the declaration of isnan() from this file. It is no longer needed as of math.h v1.40, and its prototype is incorrect here.	2004-07-09 10:01:10 +00:00
David Schultz	240dbabfa8	Implement the classification macros isfinite(), isinf(), isnan(), and isnormal() the hard way, rather than relying on fpclassify(). This is a lose in the sense that we need a total of 12 functions, but it is necessary for binary compatibility because we have never bumped libm's major version number. In particular, isinf(), isnan(), and isnanf() were BSD libc functions before they were C99 macros, so we can't reimplement them in terms of fpclassify() without adding a dependency on libc.so.5. I have tried to arrange things so that programs that could be compiled in FreeBSD 4.X will generate the same external references when compiled in 5.X. At the same time, the new macros should remain C99-compliant. The isinf() and isnan() functions remain in libc for historical reasons; however, I have moved the functions that implement the macros isfinite() and isnormal() to libm where they belong. Moreover, half a dozen MD versions of isinf() and isnan() have been replaced with MI versions that work equally well. Prodded by: kris	2004-07-09 03:32:40 +00:00
David Schultz	b2d5d0b376	Define the following macros in terms of [gi]cc builtins when the builtins are available: HUGE_VAL, HUGE_VALF, HUGE_VALL, INFINITY, and NAN. These macros now expand to floating-point constant expressions rather than external references, as required by C99. Other compilers will retain the historical behavior. Note that it is not possible say, e.g. #define HUGE_VAL 1.0e9999 because the above may result in diagnostics at translation time and spurious exceptions at runtime. Hence the need for compiler support for these features. Also use builtins to implement the macros isgreater(), isgreaterequal(), isless(), islessequal(), islessgreater(), and isunordered() when such builtins are available. Although the old macros are correct, the builtin versions are much faster, and they avoid double-expansion problems.	2004-07-09 03:31:09 +00:00
David Schultz	9428e108c9	Add C99's nearbyint{,f}() functions as wrappers around rint(). These trivial implementations are about 25 times slower than rint{,f}() on x86 due to the FP environment save/restore. They should eventually be redone in terms of fegetround() and bit fiddling.	2004-07-06 04:46:08 +00:00
Ruslan Ermilov	30950a21e1	Eliminate double whitespace.	2004-07-03 22:30:10 +00:00
Ruslan Ermilov	1a0a934547	Mechanically kill hard sentence breaks.	2004-07-02 23:52:20 +00:00
Ruslan Ermilov	862b46f607	Markup, grammar, punctuation.	2004-07-01 18:20:57 +00:00
David Schultz	4f82cb46c4	Implement and document fdim{,f,l}, fmax{,f,l}, and fmin{,f,l}.	2004-06-30 07:04:01 +00:00
Marcel Moolenaar	c987479dd0	s/ARCH/ARCH_SUBDIR/g -- This reduces the chance of possible conflicts with the user's environment. Wondered why his cross-builds kept failing: marcel	2004-06-24 00:02:32 +00:00
Stefan Farfeleder	c8764bba5a	Completely remove s_ilogb.S as the assembler implementation gives very little speed improvement to none at all over the MI version. Submitted by: bde	2004-06-20 10:42:23 +00:00
David Schultz	f7748f6e01	Uncomment some functions that we now support.	2004-06-20 10:39:09 +00:00
David Schultz	a9a0bf07f3	Cross-reference round(3) and trunc(3) as appropriate.	2004-06-20 09:27:17 +00:00
David Schultz	209547598d	Connect scalbln(), trunc(), and the associated documentation to the build.	2004-06-20 09:27:03 +00:00
David Schultz	62247e9034	Declare scalbln(), scalblnf(), trunc(), and truncf().	2004-06-20 09:26:41 +00:00
David Schultz	7ffaea8021	Implement trunc() and truncf().	2004-06-20 09:25:43 +00:00
David Schultz	2f90a15e14	Add trivial implementations of scalbln() and scalblnf(). These routines are specified in C99 for the sake of architectures where an int isn't big enough to represent the full range of floating-point exponents. However, even the 128-bit long double format has an exponent smaller than 15 bits, so for all practical purposes, scalbln() and scalblnf() are aliases for scalbn() and scalbnf(), respectively.	2004-06-20 09:25:27 +00:00
Stefan Farfeleder	32ef5abfe3	Document ilogb()'s return values in terms of the FP_ILOGB* macros.	2004-06-19 09:33:29 +00:00
Stefan Farfeleder	b6161bb16a	Return the same result as the MI version for 0.0, INFINITY and NaN. Reviewed by: standards@	2004-06-19 09:30:00 +00:00
Stefan Farfeleder	83bc89312c	Our MI implementation of ilogb() returns -INT_MAX for the argument 0.0 rather than INT_MIN, so adjust FP_ILOGB0 to reflect this. Use <machine/_limits.h> for INT_MAX's value while there. Reviewed by: standards@	2004-06-19 09:25:21 +00:00
David Schultz	2a6bf1fadb	Memory's free, but all the world ain't a VAX anymore. Bring math.3 kicking and screaming into the 1980's. This change converts most of the markup from man(7) to mdoc(7) format, and I believe it removes or updates everything that was flat out wrong. However, much work is still needed to sanitize the markup, improve coverage, and reduce overlap with other manpages. Some of the sections would better belong in a philosophy_of_w_kahan.3 manpage, but they are informative and remain at least as reminders of topics to cover. Reviewed by: doc@, trhodes@	2004-06-19 03:25:28 +00:00
David Schultz	9772caa388	The references to scalbn and scalbnf should be scalb and scalbf. (The former are actually useful, and ieee_test(3) only documents functions that aren't.) Add a sentence describing the domain of scalb() and scalbf().	2004-06-12 04:40:47 +00:00
David Schultz	16919a6cf7	Shift the FPSR contents by the correct amount so feupdateenv() raises the correct exceptions from the old environment.	2004-06-11 02:35:30 +00:00
David Schultz	0d2354c6fd	Insert a missing '~' in feholdexcept(), so that it correctly clears the exception flags in the mxcsr as well as the x87 FPU.	2004-06-11 02:35:19 +00:00
David Schultz	c4da2324a3	Fix a bug where rintf() rounded the wrong way in round-to-nearest mode on all inputs of the form x.75, where x is an even integer and log2(x) = 21. A similar problem occurred when rounding upward. The bug involves the following snippet copied from rint(): i>>=1; if((i0&i)!=0) i0 = (i0&(~i))\|((0x100000)>>j0); The constant 0x100000 should be 0x200000. Apparently this case was never tested. It turns out that the bit manipulation is completely superfluous anyway, so remove it. (It tries to simulate 90% of the rounding process that the FPU does anyway.) Also, the special case of +-0 is handled twice (in different ways), so remove the second instance. Throw in some related simplifications from bde: - Work around a bug where gcc fails to clip to float precision by declaring two float variables as volatile. Previously, we tricked gcc into generating correct code by declaring some float constants as doubles. - Remove additional superfluous bit manipulation. - Minor reorganization. - Include <sys/types.h> explicitly. Note that some of the equivalent lines in rint() also appear to be unnecessary, but I'll defer to the numerical analysts who wrote it, since I can't test all 2^64 cases. Discussed with: bde	2004-06-09 21:24:52 +00:00
David Schultz	207bc1d79b	Include <sys/cdefs.h> earlier to get the various visibility constants. Previously, we were relying on <sys/_types.h> to include it implicitly.	2004-06-09 10:32:05 +00:00
David Schultz	d0f1363370	Add round(3) and roundf(3) and the associated documentation. PR: 59797 Submitted by: "Steven G. Kargl" <kargl@troutmask.apl.washington.edu> Reviewed by: bde (earlier version, last year)	2004-06-07 08:05:36 +00:00
David Schultz	54dd6976a8	Add fenv.h, fenv.c, and the associated documentation to the libm build. To facilitate this, add ${.CURDIR}/${ARCH} to make's search path unconditionally. Reviewed by: standards@	2004-06-06 10:06:57 +00:00
David Schultz	07235cc8f7	Add documentation for: - fenv(3) - feclearexcept(3), fegetexceptflag(3), feraiseexcept(3), fesetexceptflag(3), fetestexcept(3) - fegetround(3), fesetround(3) - fegetenv(3), feholdexcept(3), fesetenv(3), feupdateenv(3) Reviewed by: standards@	2004-06-06 10:06:26 +00:00
David Schultz	7ab6d2aa74	Add an fenv.h implementation for the sparc64 port. Reviewed by: standards@	2004-06-06 10:05:57 +00:00
David Schultz	122e138072	Add an fenv.h implementation for the powerpc port. Reviewed by: standards@	2004-06-06 10:05:10 +00:00
David Schultz	50c4f20324	Add an fenv.h implementation for the ia64 port. Reviewed by: standards@	2004-06-06 10:04:43 +00:00
David Schultz	0b71a226d1	Add an fenv.h implementation for the i386 port. Reviewed by: standards@	2004-06-06 10:04:17 +00:00
David Schultz	19220bc13f	Add an fenv.h implementation for the arm port. It does not appear to be possible to cross-build arm from i386 at the moment, and I have no ARM hardware anyway. Thus, I'm sure there are bugs. I will gladly fix these when the arm port is more mature. Reviewed by: standards@	2004-06-06 10:03:59 +00:00
David Schultz	fc27daefcd	Add an fenv.h implementation for the amd64 port. Reviewed by: standards@	2004-06-06 10:03:25 +00:00
David Schultz	7993050251	Add an fenv.h implementation for the alpha port. All of the standard features appear to work, subject to the caveat that you tell gcc you want standard rather than recklessly fast behavior (-mieee-with-inexact -mfp-rounding-mode=d). The non-standard feature of delivering a SIGFPE when an application raises an unmasked exception does not work, presumably due to a kernel bug. This isn't so bad given that floating-point exceptions on the Alpha architecture are not precise, so making them useful in userland requires a significant amount of wizardry. Reviewed by: standards@	2004-06-06 09:58:55 +00:00
Bruce Evans	4f8f819975	Fixed lots of 1 ULP errors caused by a broken approximation for pi/2. We approximate pi with more than float precision using pi_hi+pi_lo in the usual way (pi_hi is actually spelled pi in the source code), and expect (float)0.5pi_lo to give the low part of the corresponding approximation for pi/2. However, the high part for pi/2 (pi_o_2) is rounded to nearest, which happens to round up, while the high part for pi was rounded down. Thus pi_o_2+(float)0.5pi (in infinite precision) was a very bad approximation for pi/2 -- the low term has the wrong sign and increases the error drom less than half an ULP to a full ULP. This fix rounds up instead of down for pi_hi. Consistently rounding down instead of up should work, and is the method used in e_acosf.c and e_asinf.c. The reason for the difference is that we sometimes want to return precisely pi/2 in e_atan2f.c, so it is convenient to have a correctly rounded (to nearest) value for pi/2 in a variable. a_acosf.c and e_asinf.c also differ in directly approximating pi/2 instead pi; they multiply by 2.0 instead of dividing by 0.5 to convert the approximation. These complications are not directly visible in the double precision versions because rounding to nearest happens to round down.	2004-06-02 17:09:05 +00:00
David Schultz	73fbb89dd6	Port a bugfix from FDLIBM 5.3. The bug really only applies to tan() and not tanf() because float type can't represent numbers large enough to trigger the problem. However, there seems to be a precedent that the float versions of the fdlibm routines should mirror their double counterparts. Also update to the FDLIBM 5.3 license. Obtained from: FDLIBM Reviewed by: exhaustive comparison	2004-06-02 04:39:44 +00:00
David Schultz	21d39caaee	Merge a bugfix from FDLIBM 5.3 to ensure that the error in tan() is always less than 1 ulp. Also update to the 5.3 license. Obtained from: FDLIBM	2004-06-02 04:39:29 +00:00
Bruce Evans	f88a48cc43	Merged from double precision case (e_pow.c 1.10: sign fixes).	2004-06-01 19:33:30 +00:00
Bruce Evans	f083533b68	Fixed the sign of the result in some overflow and underflow cases (ones where the exponent is an odd integer and the base is negative). Obtained from: fdlibm-5.3 Sun finally released a new version of fdlibm just a coupe of weeks ago. It only fixes 3 bugs (this one, another one in pow() that we already have (rev.1.9), and one in tan(). I've learned too much about powf() lately, so this fix was easy to merge. The patch is not verbatim, because our base version has many differences for portability and I didn't like global renaming of an unrelated variable to keep it separate from the sign variable. This patch uses a new variable named sn for the sign.	2004-06-01 19:28:38 +00:00
Bruce Evans	5f20e5ce7f	Fixed another precision bug in powf(). This one is in the computation [t=p_l+p_h High]. We multiply t by lg2_h, and want the result to be exact. For the bogus float case of the high-low decomposition trick, we normally discard the lowest 12 bits of the fraction for the high part, keeping 12 bits of precision. That was used for t here, but it doesnt't work because for some reason we only discard the lowest 9 bits in the fraction for lg2_h. Discard another 3 bits of the fraction for t to compensate. This bug gave wrong results like: powf(0.9999999, -2.9999995) = 1.0000002 (should be 1.0000001) hex values: 3F7FFFFF C03FFFFE 3F800002 3F800001 As explained in the log for the previous commit, the bug is normally masked by doing float calculations in extra precision on i386's, but is easily detected by ucbtest on systems that don't have accidental extra precision. This completes fixing all the bugs in powf() that were routinely found by ucbtest.	2004-06-01 19:03:31 +00:00
Bruce Evans	12be4e0d5a	Fixed 2 bugs in the computation /* t_h=ax+bp[k] High */. (1) The bit for the 1.0 part of bp[k] was right shifted by 4. This seems to have been caused by a typo in converting e_pow.c to e_powf.c. (2) The lower 12 bits of ax+bp[k] were not discarded, so t_h was actually plain ax+bp[k]. This seems to have been caused by a logic error in the conversion. These bugs gave wrong results like: powf(-1.1, 101.0) = -15158.703 (should be -15158.707) hex values: BF8CCCCD 42CA0000 C66CDAD0 C66CDAD4 Fixing (1) gives a result wrong in the opposite direction (hex C66CDAD8), and fixing (2) gives the correct result. ucbtest has been reporting this particular wrong result on i386 systems with unpatched libraries for 9 years. I finally figured out the extent of the bugs. On i386's they are normally hidden by extra precision. We use the trick of representing floats as a sum of 2 floats (one much smaller) to get extra precision in intermediate calculations without explicitly using more than float precision. This trick is just a pessimization when extra precision is available naturally (as it always is when dealing with IEEE single precision, so the float precision part of the library is mostly misimplemented). (1) and (2) break the trick in different ways, except on i386's it turns out that the intermediate calculations are done in enough precision to mask both the bugs and the limited precision of the float variables (as far as ucbtest can check). ucbtest detects the bugs because it forces float precision, but this is not a normal mode of operation so the bug normally has little effect on i386's. On systems that do float arithmetic in float precision, e.g., amd64's, there is no accidental extra precision and the bugs just give wrong results.	2004-06-01 18:08:39 +00:00
Stefan Farfeleder	8b5cd5a662	Add implementations for cimag{,f,l}, creal{,f,l} and conj{,f,l}. They are needed for cases where GCC's builtin functions cannot be used and for compilers that don't know about them. Approved by: das (mentor)	2004-05-30 09:21:56 +00:00
David Schultz	6955d806c0	Remove some kludges designed to ensure that the compiler didn't round constants the wrong way on the VAX. Instead, use C99 hexadecimal floating-point constants, which are guaranteed to be exact on binary IEEE machines. (The correct hexadecimal values were already provided in the source, but not used.) Also, convert the constants to lowercase to work around a gcc bug that wasn't fixed until gcc 3.4.0. Prompted by: stefanf	2004-05-17 01:04:37 +00:00
Stefan Farfeleder	b60cb13f76	Add an implementation of copysignl(), a long double version of copysign(). Approved by: das (mentor)	2004-05-07 18:56:31 +00:00
Stefan Farfeleder	325152e8fb	Add an MLINK for fabsl(). Approved by: das (mentor)	2004-05-07 17:55:07 +00:00
Stefan Farfeleder	89c5bc6db4	The prototypes for cabs() and cabsf() are in <complex.h>. Fix their arguments' types and describe them briefly. Reviewed by: ru, bde Approved by: das (mentor)	2004-05-06 13:11:18 +00:00
David Schultz	8f3f7c66d0	Make sure that symbols are declared in math.h iff the appropriate namespaces are visible. Previously, math.h failed to hide some C99-, XSI-, and BSD-specific symbols in certain compilation environments. The referenced PR has a nice listing of the appropriate conditions for making symbols visible in math.h. The only non-stylistic difference between the patch in the PR and this commit is that I superfluously test for __BSD_VISIBLE in a few places to be more explicit about which symbols have historically been part of the FreeBSD environment. PR: 65939 Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at>	2004-04-25 02:35:42 +00:00
David Schultz	334c760eea	Remove a stale comment referring to values.h, which has never been part of FreeBSD. PR: 65939	2004-04-25 02:32:46 +00:00
Bruce Evans	6eb2d83e44	Initial support for C99's (or is it POSIX.1-2001's?) MATH_ERRNO, MATH_ERREXCEPTION and math_errhandling, so that C99 applications at least have the possibility of determining that errno is not set for math functions. Set math_errhandling to the non-standard-conforming value of 0 for now to indicate that we don't support either method of reporting errors. We intentionally don't support MATH_ERRNO because errno is a mistake, and we are missing support for MATH_ERREXCEPTION (<fenv.h>, compiler support for <fenv.h>, and actually setting the exception flags correctly).	2004-03-12 12:02:03 +00:00
David Schultz	7a773faadc	Fix a problem where libm compiled under 5.X would depend on features that are only in libc.so.5. This broke some 4.X applications linked to libm and run under 5.X. Background: In C99, isinf() and isnan() cannot be implemented as regular functions. We use macros that call libc functions in 5.X, but for libm-internal use, we need to use the old versions until the next time libm's major version number is bumped. Submitted by: bde Reported by: imp, kris	2003-10-27 01:28:07 +00:00
Dag-Erling Smørgrav	2a063d30f6	Better safe than clever. Submitted by: das	2003-10-25 19:53:28 +00:00
Dag-Erling Smørgrav	801517fd4e	Document fabsl(3). Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at>	2003-10-25 13:45:11 +00:00
Dag-Erling Smørgrav	e334ea2edc	- fabsl.c should be named s_fabsl.c for consistency with libmsun's documented naming scheme (unfortunately the documentation isn't in the tree as far as I can tell); no repocopy is required as there is no history to preserve. - replace simple and almost-correct implementation with slightly hackish but definitely correct implementation (tested on i386, alpha, sparc64) which requires pulling in fpmath.h and the MD _fpmath.h from libc. - try not to make a mess of the Makefile in the process. - enterprising minds are encouraged to implement more C99 long double functions.	2003-10-25 09:32:18 +00:00
Dag-Erling Smørgrav	4318dce616	Connect fabsl.c to the build.	2003-10-23 08:23:51 +00:00
Dag-Erling Smørgrav	29bd23abf0	Add prototypes for all long double functions in C99. Leave them all #if 0'd out, except for fabsl(3) which I've implemented.	2003-10-23 08:23:38 +00:00
Dag-Erling Smørgrav	017e4316ae	Implement fabsl(3), allowing the world to build with -fno-builtin.	2003-10-23 08:20:47 +00:00
Gordon Tetlow	41d8423f71	Stage 3 of dynamic root support. Make all the libraries needed to run binaries in /bin and /sbin installed in /lib. Only the versioned files reside in /lib, the .so symlink continues to live /usr/lib so the toolchain doesn't need to be modified.	2003-08-17 08:28:46 +00:00
Bruce Evans	262e4c00bd	Fixed some style bugs (misplacement and misformatting of some commented-out code).	2003-07-23 09:24:44 +00:00
Peter Wemm	3819e84017	Only provide one copy of the math functions. If we provide a MD function, do not also provide a __generic_XXX version as well. This is how we used to runtime select the generic vs i387 versions on the i386 platform. This saves a pile of #defines in the src/math_private.h file to undo the __generic_XXX renames in some of the *.c files.	2003-07-23 04:53:47 +00:00
Peter Wemm	d48084b9e5	No longer need the internal __get_hw_float() function.	2003-07-23 04:25:04 +00:00
Peter Wemm	c3e6df78e1	Now that we do not need to do runtime detection for the broken default fp emulator, stop doing the runtime selection of hardware or emulated floating point operations on i386. Note that I have not suppressed the duplicate compiles yet. While here, fix the alpha. It has provided specific copysign/copysignf functions since the beginning of time, but they have never been used.	2003-07-23 04:23:36 +00:00
Mike Barcroft	6f9622a926	Fix two misuses of __BSD_VISIBLE. Submitted by: bde Approved by: re	2003-05-22 17:07:57 +00:00
Peter Wemm	8e80f8a438	AMD64 support (another IEEEFP platform)	2003-04-30 21:06:30 +00:00
David Schultz	6d3bd9530d	Fix braino in definition of isfinite(). Noticed by: marcus Pointy hat to: das	2003-04-04 13:27:47 +00:00
Ruslan Ermilov	3892c30012	mdoc(7) police: Nits.	2003-03-02 21:04:21 +00:00
Warner Losh	457f6cd2d6	- gamma_r, lgamma_r, gammaf_r, and lgammaf_r were protected by _REENTRANT in math.h; the consensus here was that __BSD_VISIBLE was correct instead. - gamma_r, lgamma_r, gammaf_r, and lgammaf_r had no documentation in the lgamma(3) manpage. Reviewed by: standards@ Submitted by: Ben Mesander	2003-02-26 13:12:03 +00:00
Mike Barcroft	5d62092f94	o Implement C99 classification macros isfinite(), isinf(), isnan(), isnormal(). The current isinf() and isnan() are perserved for binary compatibility with 5.0, but new programs will use the macros. o Implement C99 comparison macros isgreater(), isgreaterequal(), isless(), islessequal(), islessgreater(), isunordered(). Submitted by: David Schultz <dschultz@uclink.Berkeley.EDU>	2003-02-12 20:03:41 +00:00
Mike Barcroft	8e9b28311e	Implement C99's signbit() macro.	2003-02-11 21:56:21 +00:00
Mike Barcroft	8cf5ed5125	Implement fpclassify(): o Add a MD header private to libc called _fpmath.h; this header contains bitfield layouts of MD floating-point types. o Add a MI header private to libc called fpmath.h; this header contains bitfield layouts of MI floating-point types. o Add private libc variables to lib/libc/$arch/gen/infinity.c for storing NaN values. o Add __double_t and __float_t to <machine/_types.h>, and provide double_t and float_t typedefs in <math.h>. o Add some C99 manifest constants (FP_ILOGB0, FP_ILOGBNAN, HUGE_VALF, HUGE_VALL, INFINITY, NAN, and return values for fpclassify()) to <math.h> and others (FLT_EVAL_METHOD, DECIMAL_DIG) to <float.h> via <machine/float.h>. o Add C99 macro fpclassify() which calls __fpclassify{d,f,l}() based on the size of its argument. __fpclassifyl() is never called on alpha because (sizeof(long double) == sizeof(double)), which is good since __fpclassifyl() can't deal with such a small `long double'. This was developed by David Schultz and myself with input from bde and fenner. PR: 23103 Submitted by: David Schultz <dschultz@uclink.Berkeley.EDU> (significant portions) Reviewed by: bde, fenner (earlier versions)	2003-02-08 20:37:55 +00:00
Jens Schweikhardt	9d5abbddbf	Correct typos, mostly s/ a / an / where appropriate. Some whitespace cleanup, especially in troff files.	2003-01-01 18:49:04 +00:00
Jens Schweikhardt	57bd0fc6e8	english(4) police.	2002-12-27 12:15:40 +00:00
Archie Cobbs	83999f5a32	Re-apply the previously backed-out commit that fixes the problem where HUGE_VAL is not properly aligned on some architectures. The previous fix now works because the two versions of 'math.h' (include/math.h and lib/msun/src/math.h) have since been merged into one. PR: bin/43544	2002-10-31 23:05:20 +00:00
Mark Murray	bf2f52b5fa	Remove duplicate declaration.	2002-10-23 17:35:11 +00:00
Bruce Evans	54e9b36765	Fixed a last-minute editing error in previous commit. nfs and/or cvs replaced a 14-byte change in the middle of the file with 14 NULs at EOF despite or because of aborting the initial commit to pick up the change.	2002-10-01 11:44:35 +00:00
Bruce Evans	219cbe1087	Merged all interesting difference between the old math.h and the current one into the latter and removed the former. This works around the bug that some broken Makefiles add -I.../src/include to CFLAGS, resulting in the old math.h being preferred and differences between the headers possibly being fatal. The merge mainly involves declaring some functions as __pure2 although they are not yet all strictly free of side effects. PR: 43544	2002-10-01 11:34:42 +00:00
Archie Cobbs	ae8a4b2f36	Revert previous commit to unbreak world until we figure out the right way to do it.	2002-09-20 15:43:26 +00:00
Archie Cobbs	f5f1272284	Fix a problem with the definition of HUGE_VAL causing the gcc warning "cast increases required alignment of target type" on some platforms. Reviewed by: bde	2002-09-19 19:47:27 +00:00
Bruce Evans	3e2ec6ea88	e_pow.c: Fixed pow(x, y) when x is very close to -1.0 and y is a very large odd integer. E.g., pow(-1.0 - pow(2.0, -52.0), 1.0 + pow(2.0, 52.0)) was 0.0 instead of being very close to -exp(1.0). PR: 39236 Submitted by: Stephen L Moshier <steve@moshier.net> e_powf.c: Apply the same patch although it is just cosmetic because odd integers large enough to cause the problem are too large to be precisely represented as floats. MFC after: 1 week	2002-06-17 15:28:59 +00:00
Alfred Perlstein	59b19ff14a	Fix formatting, this is hard to explain, so I'll show one example. - float ynf(int n, float x) /* wrapper ynf / +float +ynf(int n, float x) / wrapper ynf */ This is because the __STDC__ stuff was indented. Reviewed by: md5	2002-05-28 18:15:04 +00:00
Alfred Perlstein	2dcc228679	Assume __STDC__, remove non-__STDC__ code. Reviewed by: md5	2002-05-28 17:51:46 +00:00
Alfred Perlstein	a82bbc730e	Assume __STDC__, remove non-__STDC__ code. Submitted by: keramida	2002-05-28 17:03:12 +00:00
Benno Rice	7191eaa757	Spread the word of PowerPC.	2002-05-21 04:00:47 +00:00
Ruslan Ermilov	c7b111cba8	Added new bsd.incs.mk which handles installing of header files via INCS. Implemented INCSLINKS (equivalent to SYMLINKS) to handle symlinking include files. Allow for multiple groups of include files to be installed, with the powerful INCSGROUPS knob. Documentation to follow. Added standard `includes' and `incsinstall' targets, use them in Makefile.inc1. Headers from the following makefiles were not installed before (during `includes' in Makefile.inc1): kerberos5/lib/libtelnet/Makefile lib/libbz2/Makefile lib/libdevinfo/Makefile lib/libform/Makefile lib/libisc/Makefile lib/libmenu/Makefile lib/libmilter/Makefile lib/libpanel/Makefile Replaced all `beforeinstall' targets for installing includes with the INCS stuff. Renamed INCDIR to INCSDIR, for consistency with FILES and SCRIPTS, and for compatibility with NetBSD. Similarly for INCOWN, INCGRP, and INCMODE. Consistently use INCLUDEDIR instead of /usr/include. gnu/lib/libstdc++/Makefile and gnu/lib/libsupc++/Makefile changes were only lightly tested due to the missing contrib/libstdc++-v3. I fully tested the pre-WIP_GCC31 version of this patch with the contrib/libstdc++.295 stuff. These changes have been tested on i386 with the -DNO_WERROR "make world" and "make release".	2002-05-12 16:01:00 +00:00
Bruce Evans	46d7c2979e	Resurrect Lite1's gamma() as C99's tgamma(). Minimal changes.	2002-03-26 11:59:29 +00:00
Bruce Evans	675902aa73	Fixed some bugs in the description of plain gamma() (and gammaf()). Give a more detailed and correct history of when gamma() was actually the gamma function.	2002-03-26 10:18:20 +00:00
Bruce Evans	6898f8c48e	Fixed some minor style bugs.	2002-03-26 09:18:09 +00:00
David E. O'Brien	69160b1eb7	Remove __P() usage.	2002-03-21 23:54:04 +00:00
David E. O'Brien	84c63a156a	Fix SCM ID's.	2002-03-21 18:06:09 +00:00
David E. O'Brien	118ce04e39	We need an frexp() function.	2002-03-01 01:58:20 +00:00
Jake Burkholder	fbeabbfad6	Add ifdef sparc64.	2002-01-02 06:54:18 +00:00
Alexey Zelkin	ef1ee63e3c	Fix style bugs (mostly remove 'extern' from function prototypes) Inspired by: conversation with bde	2001-12-13 17:22:17 +00:00
Alexey Zelkin	4b667ee0db	* remove reference to m68k-dependent sources * fix comment	2001-12-13 17:18:26 +00:00
Ruslan Ermilov	f598d0519c	Grammar nit.	2001-11-21 09:25:14 +00:00
Ruslan Ermilov	487d13c0e7	mdoc(7) police: fixed bugs from rev. 1.15.	2001-11-20 16:40:04 +00:00
David Malone	a9dbc63dc2	gamma(x) actually returns \log(\|\Gamma(x)\|), so correct the man page and add an historical note explaining this. This patch is based on Stephen's. We still need someone to implement tgamma. PR: 28972, 31764 Submitted by: Stephen Montgomery-Smith <stephen@math.missouri.edu>	2001-11-05 10:10:33 +00:00
Dima Dorfman	211feb6175	Match parenthesis and don't give names to return values. PR: 31214	2001-10-15 13:34:43 +00:00
Bruce Evans	aa842e6a12	Fixed missing quoting of >= (in ceil.3) and <= (in floor.3) by reverting to describing these operators in English. This completes the fix in rev.1.3 (rev.1.2 got this wrong by describing wrong operators in English). Fixed bitrot and improved English in the DESCRIPTION section.	2001-10-13 13:57:32 +00:00
Bruce Evans	aa00e9d96e	Fixed missing quoting of [-1, +1]. Submitted by: phantom	2001-10-13 12:29:25 +00:00

... 3 4 5 6 7 ...

536 Commits