freebsd-dev

History

Romain Dolbeau 24cdeaf12e Fletcher4 algorithm implemented in pure NEON for Aarch64 / ARMv8 64 bits This is not useful on micro-architecture with a weak NEON implementation (only 64 bits); the native version is slower & the byteswap barely faster than scalar. On A53 or A57, it's a small improvement on scalar but OK for byteswap. Results from an A53 system: 0 0 0x01 -1 0 1499068294333000 1499101101878000 implementation native byteswap scalar 1008227510 755880264 aarch64_neon 1198098720 1044818671 fastest aarch64_neon aarch64_neon Results from a A57 system: 0 0 0x01 -1 0 4407214734807033 4407233933777404 implementation native byteswap scalar 2302071241 1124873346 aarch64_neon 2542214946 2245570352 fastest aarch64_neon aarch64_neon Reviewed-by: Gvozden Neskovic <neskovic@gmail.com> Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Romain Dolbeau <romain.dolbeau@atos.net> Closes #5248		2016-10-21 10:55:49 -07:00
..
kernel.c	Fix coverity defects: CID 147452, 147447, 147446	2016-10-11 11:32:34 -07:00
Makefile.am	Fletcher4 algorithm implemented in pure NEON for Aarch64 / ARMv8 64 bits	2016-10-21 10:55:49 -07:00
taskq.c	Fix strncpy in taskq_create	2016-09-20 11:27:15 -07:00
util.c	Fix coverity defects: CID 147565-147567	2016-10-07 13:19:43 -07:00