AlexChess wrote: ↑Thu Sep 09, 2021 6:13 pmI have restarted Arena

Oh that's great, now it's all the same node counts and PV as it should be!

So we can compare the speed directly for depth 20:
ARM64, native: 100%
x86-64, emulated: 64.7%
x86-32, emulated: 63.4%
Interestingly, it's different for depth 21 where ARM64 (native) sees a sudden 15% uplift in NPS (relative to depth 20) while the x86-64 emulation doesn't, so the emulation has only 55.3% speed for that depth. I'm not sure where that 15% uplift comes from - I'd guess it's how Windows runs the M1 hardware. Maybe the Windows scheduler isn't quite prepared for the asymmetric cores and schedules the engine somewhat randomly across the fast and slow cores. That would of course be something to keep in mind for engine tournaments.
AlexChess wrote: ↑Thu Sep 09, 2021 6:29 pmSince you are an ARM expert, do you know that the Raspberry PI 4 can execute Windows 11 ARM64 and his engines are compatibles with Ubuntu ARM64 out-of-the-box?
That should be due to the WSL (Windows Subsystem for Linux) that allows running Linux binaries on Windows. Though my engine can already be built directly under Linux, including Ubuntu ARM64, using the
make_ct800_linux.sh script, executed on the Raspi itself.
Technical Data
Note that this does not refer to ARM Linux or Windows such as on the Raspi, it's instead about the ARM Cortex-M4 target. That's a small microcontroller that has no operating system at all. 1MB ROM, 192kB RAM, 168MHz. That's what's inside the physical CT800 build.
