freebsd-dev/lib/libzpool
Romain Dolbeau 24cdeaf12e Fletcher4 algorithm implemented in pure NEON for Aarch64 / ARMv8 64 bits
This is not useful on micro-architecture with a weak NEON
implementation (only 64 bits); the native version is slower &
the byteswap barely faster than scalar.  On A53 or A57, it's
a small improvement on scalar but OK for byteswap.

Results from an A53 system:
0 0 0x01 -1 0 1499068294333000 1499101101878000
implementation   native         byteswap       
scalar           1008227510     755880264      
aarch64_neon     1198098720     1044818671     
fastest          aarch64_neon   aarch64_neon 

Results from a A57 system:
0 0 0x01 -1 0 4407214734807033 4407233933777404
implementation   native         byteswap       
scalar           2302071241     1124873346     
aarch64_neon     2542214946     2245570352     
fastest          aarch64_neon   aarch64_neon 

Reviewed-by: Gvozden Neskovic <neskovic@gmail.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Signed-off-by: Romain Dolbeau <romain.dolbeau@atos.net>
Closes #5248
2016-10-21 10:55:49 -07:00
..
kernel.c Fix coverity defects: CID 147452, 147447, 147446 2016-10-11 11:32:34 -07:00
Makefile.am Fletcher4 algorithm implemented in pure NEON for Aarch64 / ARMv8 64 bits 2016-10-21 10:55:49 -07:00
taskq.c Fix strncpy in taskq_create 2016-09-20 11:27:15 -07:00
util.c Fix coverity defects: CID 147565-147567 2016-10-07 13:19:43 -07:00