freebsd-nq

Author	SHA1	Message	Date
Tim J. Robbins	e5996857ad	Make regular expression matching aware of multibyte characters. The general idea is that we perform multibyte->wide character conversion while parsing and compiling, then convert byte sequences to wide characters when they're needed for comparison and stepping through the string during execution. As with tr(1), the main complication is to efficiently represent sets of characters in bracket expressions. The old bitmap representation is replaced by a bitmap for the first 256 characters combined with a vector of individual wide characters, a vector of character ranges (for [A-Z] etc.), and a vector of character classes (for [[:alpha:]] etc.). One other point of interest is that although the Boyer-Moore algorithm had to be disabled in the general multibyte case, it is still enabled for UTF-8 because of its self-synchronizing nature. This greatly speeds up matching by reducing the number of multibyte conversions that need to be done.	2004-07-12 07:35:59 +00:00
Jacques Vidrine	e0554a531f	Eliminate 61 warnings emitted at WARNS=2 (leaving 53 to go). Only warnings that could be fixed without changing the generated object code and without restructuring the source code have been handled. Reviewed by: /sbin/md5	2003-02-16 17:29:11 +00:00
Mike Barcroft	4047df8d24	Add restrict type-qualifier.	2002-10-02 07:49:35 +00:00
David E. O'Brien	8fb3f3f682	Remove 'register' keyword.	2002-03-21 18:49:23 +00:00
John Birrell	cfc1614a48	int -> long changes that reduce the diffs with the NetBSD version to work in a 64-bit environment.	1998-05-14 21:45:18 +00:00
Rodney W. Grimes	58f0484fa2	BSD 4.4 Lite Lib Sources	1994-05-27 05:00:24 +00:00

6 Commits
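
The bracket-expression representation described in commit e5996857ad (a bitmap for the first 256 characters plus vectors of individual wide characters, ranges, and character classes) can be sketched roughly as below. This is an illustrative sketch only, not the code from that commit: the names `charset`, `cs_range`, `charset_add_small`, and `charset_contains` are hypothetical, and the lookup order is an assumption.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <wchar.h>
#include <wctype.h>

/* Hypothetical types; the real FreeBSD structures may differ. */
struct cs_range {
    uint32_t min, max;              /* inclusive range, e.g. [A-Z] */
};

struct charset {
    unsigned char bmp[256 / 8];     /* one bit per character 0..255 */
    const uint32_t *wides;          /* individual wide characters */
    size_t nwides;
    const struct cs_range *ranges;  /* character ranges */
    size_t nranges;
    const wctype_t *classes;        /* classes such as [[:alpha:]] */
    size_t nclasses;
    bool invert;                    /* true for a negated set, [^...] */
};

/* Set the bitmap bit for a character in the first 256. */
static void charset_add_small(struct charset *cs, unsigned char c)
{
    cs->bmp[c >> 3] |= (unsigned char)(1u << (c & 7));
}

/* Test membership: bitmap first, then the three vectors. */
static bool charset_contains(const struct charset *cs, uint32_t ch)
{
    bool found = false;
    size_t i;

    if (ch < 256)
        found = (cs->bmp[ch >> 3] & (1u << (ch & 7))) != 0;
    for (i = 0; !found && i < cs->nwides; i++)
        found = (cs->wides[i] == ch);
    for (i = 0; !found && i < cs->nranges; i++)
        found = (ch >= cs->ranges[i].min && ch <= cs->ranges[i].max);
    for (i = 0; !found && i < cs->nclasses; i++)
        found = iswctype((wint_t)ch, cs->classes[i]) != 0;

    return found != cs->invert;
}
```

In this sketch the bitmap keeps the common single-byte case to one shift and mask, and the vectors are scanned only when the bitmap does not already answer the question.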