Thanks for your tips! Crafty 23.6 now runs fine on my ARM-based Linux system (without SMP). Your bitscan intrinsics give a nice speed boost of nearly 8%.
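For anyone following along, here is a minimal sketch of what a bitscan built on GCC intrinsics can look like. The function names `lsb_index`, `msb_index` and `lsb_index_loop` are made up for illustration and are not Crafty's own identifiers; the only real API used is GCC's `__builtin_ctzll` / `__builtin_clzll`. The loop version is just a stand-in for a generic portable fallback, not necessarily what Crafty uses without intrinsics.

```c
#include <stdint.h>

/* Index (0..63) of the least significant set bit.
   Precondition: bb != 0, because __builtin_ctzll(0) is undefined. */
static inline int lsb_index(uint64_t bb) {
    return __builtin_ctzll((unsigned long long)bb);
}

/* Index (0..63) of the most significant set bit.
   Precondition: bb != 0, because __builtin_clzll(0) is undefined. */
static inline int msb_index(uint64_t bb) {
    return 63 - __builtin_clzll((unsigned long long)bb);
}

/* Hypothetical portable fallback: shift right until bit 0 is set.
   Precondition: bb != 0, otherwise the loop never terminates. */
static inline int lsb_index_loop(uint64_t bb) {
    int i = 0;
    while (!(bb & 1)) {
        bb >>= 1;
        i++;
    }
    return i;
}
```

The builtins let the compiler pick whatever bit-counting instructions the target offers, which is presumably where the gain comes from; callers have to guard against an empty bitboard themselves.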
After playing with some compiler options (-O2, -O3, -Ofast, -flto and -mtune=cortex-a15), it looks like -O2 produces the fastest executables, and the gcc 4.7.3 optimizations enabled by -mtune=cortex-a15 don't help much (about 1% speedup).
$ ./crafty236
unable to open book file [/usr/share/crafty/book.bin].
book is disabled
unable to open book file [/usr/share/crafty/books.bin].
pondering disabled.
Warning-- xboard 'memory' option disabled
hash table memory = 256M bytes (16M entries).
Warning-- xboard 'memory' option disabled
pawn hash table memory = 64M bytes (2M entries).
Crafty v23.6 (1 cpus)
White(1): bench
Running benchmark. . .
......
Some values from the above benchmark with different compiler options:
Raw nodes per second: 1159129, -O2 + GCC intrinsics + -mtune=cortex-a15
Raw nodes per second: 1149211, -O2 + GCC intrinsics
Raw nodes per second: 1142880, -O2 + GCC intrinsics + -flto
Raw nodes per second: 1105610, -Ofast + GCC intrinsics
Raw nodes per second: 1078021, -O2 + -mtune=cortex-a15
Raw nodes per second: 1065739, -O2
Raw nodes per second: 984421, -O3 + -mtune=cortex-a15
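To turn these raw numbers into percentages, a small helper like the following is enough. The name `speedup_percent` is my own, and it is only a sketch of the arithmetic, not part of Crafty.

```c
/* Relative change of measured_nps versus baseline_nps, in percent.
   Positive means faster than the baseline, negative means slower. */
static double speedup_percent(double baseline_nps, double measured_nps) {
    return (measured_nps - baseline_nps) / baseline_nps * 100.0;
}
```

For example, `speedup_percent(1065739, 1149211)` gives roughly 7.8 (plain -O2 versus -O2 with intrinsics), `speedup_percent(1149211, 1159129)` gives roughly 0.9 (adding -mtune=cortex-a15 on top of the intrinsics), and `speedup_percent(1078021, 984421)` gives roughly -8.7 (-O3 versus -O2, both with -mtune=cortex-a15).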
Max

