Mateusz Guzik a8e3f99ec1 amd64: implement memcmp in assembly
Both the in-kernel C variant and the libc asm variant perform very poorly.
The former compiles to a byte-at-a-time comparison loop, which is slow even
for small sizes. The latter uses rep cmpsq/cmpsb, which turn out to have very
poor throughput and are slower than a hand-coded 32-byte comparison loop.

Depending on size, this is about 3-4 times faster than the current routines.
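
For illustration only, the idea of a 32-byte comparison loop can be sketched
in C. This is not the committed assembly: the names sketch_memcmp and
bytewise_cmp are hypothetical, and the choice to resolve a mismatch by
rescanning the differing block byte by byte is an assumption made to keep the
sketch short.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: byte-wise compare, used for tails and mismatches. */
static int
bytewise_cmp(const unsigned char *a, const unsigned char *b, size_t len)
{
	for (size_t i = 0; i < len; i++) {
		if (a[i] != b[i])
			return (a[i] - b[i]);
	}
	return (0);
}

/* Sketch: compare 32 bytes per iteration as four 64-bit words. */
int
sketch_memcmp(const void *p1, const void *p2, size_t len)
{
	const unsigned char *a = p1, *b = p2;

	while (len >= 32) {
		uint64_t x[4], y[4];

		/* memcpy avoids unaligned-access and aliasing problems. */
		memcpy(x, a, sizeof(x));
		memcpy(y, b, sizeof(y));
		if (x[0] != y[0] || x[1] != y[1] ||
		    x[2] != y[2] || x[3] != y[3]) {
			/* Some byte in this block differs; locate it. */
			return (bytewise_cmp(a, b, 32));
		}
		a += 32;
		b += 32;
		len -= 32;
	}
	/* Fewer than 32 bytes remain. */
	return (bytewise_cmp(a, b, len));
}
```

The word comparisons only test for equality, so byte order does not matter;
the sign of the result always comes from the first differing byte.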

Reviewed by:	kib
Approved by:	re (gjb)
Differential Revision:	https://reviews.freebsd.org/D17328
2018-09-27 14:05:44 +00:00