Mateusz Guzik a8e3f99ec1 amd64: implement memcmp in assembly
Both the in-kernel C variant and libc asm variant have very poor performance.
The former compiles to a single byte comparison loop, which breaks down even
for small sizes. The latter uses rep cmpsq/b which turn out to have very poor
throughput and are slower than a hand-coded 32-byte comparison loop.

Depending on size this is about 3-4 times faster than the current routines.

Reviewed by:	kib
Approved by:	re (gjb)
Differential Revision:	https://reviews.freebsd.org/D17328
2018-09-27 14:05:44 +00:00
..
2018-09-27 14:05:44 +00:00
2018-09-27 14:05:44 +00:00
2018-09-27 14:05:44 +00:00
2018-09-27 14:05:44 +00:00
2018-09-03 14:43:16 +00:00