Warner Losh 23f24377b1 awk: Merge 20210729 from One True Awk upstream (0592de4a)
July 27, 2021:
	As per IEEE Std 1003.1-2008, -F "str" is now consistent with
	-v FS="str" when str is null. Thanks to Warner Losh.

July 24, 2021:
	Fix readrec's definition of a record. This fixes an issue
	with NetBSD's RS regular expression support that can cause
	an infinite read loop. Thanks to Miguel Pineiro Jr.

	Fix regular expression RS ^-anchoring. RS ^-anchoring needs to
	know if it is reading the first record of a file. This change
	restores a missing line that was overlooked when porting NetBSD's
	RS regex functionality. Thanks to Miguel Pineiro Jr.

	Fix size computation in replace_repeat() for special case
	REPEAT_WITH_Q. Thanks to Todd C. Miller.

Also, included the tests from upstream, though they aren't yet connected
to the tree.

Sponsored by:		Netflix
2021-08-01 10:22:39 -06:00

38 lines
914 B
Plaintext
Executable File

echo T.latin1: tests of 8-bit input
awk=${awk-../a.out}
$awk '
{ print $0 }
' latin1 >foo1
diff latin1 foo1 || echo 'BAD: T.latin1 1'
$awk '{ gsub(/\351/, "\370"); print }' latin1 >foo0
$awk '{ gsub(/é/, "ø"); print }' latin1 >foo1
diff foo0 foo1 || echo 'BAD: T.latin1 3'
$awk '{ gsub(/[^\300-\370]/, ""); print }' latin1 >foo0
$awk '{ gsub(/[^À-ø]/, ""); print } ' latin1 >foo1
diff foo0 foo1 || echo 'BAD: T.latin1 4'
echo '/á/' >foo1
$awk -f foo1 foo1 >foo2
diff foo1 foo2 || echo 'BAD: T.latin1 5'
echo /[áé]/ >foo1
$awk -f foo1 foo1 >foo2
diff foo1 foo2 || echo 'BAD: T.latin1 6'
echo 'This is a line.
Patterns like /[áé]/ do not work yet. Example, run awk /[áé]/
over a file containing just á.
This is another line.' >foo0
echo 'Patterns like /[áé]/ do not work yet. Example, run awk /[áé]/
over a file containing just á.' >foo1
$awk '/[áé]/' foo0 >foo2
diff foo1 foo2 || echo 'BAD: T.latin1 7'