awk: revert upstream's attempt to disallow hex strings

Upstream one-true-awk decided to disallow hex strings as numbers. This
is in line with awk's behavior prior to C99, and allowed by the POSIX
standard. The standard, however, allows them to be treated as numbers
because that's what the standard said in the 2001 through 2004 editions.
Since 2001, the nawk in FreeBSD has treated them as numbers, so restore
that behavior, allowed by the standard.

A number of scripts in the FreeBSD tree depend on this interpretation,
including scripts to build the kernel which had mysteriously started
failing for some people and not others. By re-allowing 0x hex numbers,
this fixes those scripts and restores POLA.

Upstream issue:		https://github.com/onetrueawk/awk/issues/126
Sponsored by:		Netflix
Reviewed by:		kevans
MFC After:		asap due to regression alrady merged to stable
Differential Revision:	https://reviews.freebsd.org/D31199
This commit is contained in:
Warner Losh 2021-07-15 16:46:06 -06:00
parent 7fb152d229
commit d4d252c499

View File

@ -793,9 +793,18 @@ bool is_valid_number(const char *s, bool trailing_stuff_ok,
while (isspace(*s))
s++;
/*
* This test, while allowed by newer POSIX standards, represents a regression
* where hex strings were treated as numbers in nawk the whole time it has been
* in FreeBSD (since 2001). The POSIX 2001 through 2004 standards mandated this
* behavior and the current standard allows it. Deviate from upstream by restoring
* the prior FreeBSD behavior.
*/
#if 0
// no hex floating point, sorry
if (s[0] == '0' && tolower(s[1]) == 'x')
return false;
#endif
// allow +nan, -nan, +inf, -inf, any other letter, no
if (s[0] == '+' || s[0] == '-') {