https://en.wikipedia.org/wiki/AI_accelerator
Apple has ~38 TOPS in the M4, Intel has ~48 TOPS and AMD ~50 TOPS in their new mobile processors, ARM has SVE2 for mat-mul, and Microsoft Copilot+ now requires a 40 TOPS NPU:
https://arstechnica.com/gadgets/2024/05 ... l-and-amd/
"Microsoft requires an NPU with performance rated at 40 trillion operations per second (TOPS), a high-level performance figure that Microsoft, Qualcomm, Apple, and others use for NPU performance comparisons. Right now, that requirement can only be met by a single chip in the Windows PC ecosystem, one that isn't even quite available yet: Qualcomm's Snapdragon X Elite and X Plus, launching in the new Surface and a number of PCs from the likes of Dell, Lenovo, HP, Asus, Acer, and other major PC OEMs in the next couple of months. All of those chips have NPUs capable of 45 TOPS, just a shade more than Microsoft's minimum requirement."
What is the SF devs' take? Can Stockfish make use of a 40+ TOPS NPU for NNUE, or maybe switch to a CNN architecture?
If Microsoft is the driver, there must be a unified way to program these?
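For a rough sense of scale, here is a back-of-the-envelope sketch of what a TOPS rating would mean as an upper bound on dense-layer evaluations per second. It assumes the common convention that one multiply-accumulate counts as two operations. The layer sizes are hypothetical placeholders of my own, not Stockfish's actual NNUE dimensions, and the helper names are made up for illustration.

```python
# Back-of-the-envelope: theoretical peak network evaluations per second
# at a given TOPS rating. Layer sizes are HYPOTHETICAL placeholders,
# not Stockfish's real NNUE dimensions.

def dense_layer_ops(n_in: int, n_out: int) -> int:
    """Operations for one dense layer, counting each multiply-accumulate as 2 ops."""
    return 2 * n_in * n_out


def peak_evals_per_second(tops: float, layers: list[tuple[int, int]]) -> float:
    """Upper bound only: ignores memory traffic, per-call latency and batching."""
    ops_per_eval = sum(dense_layer_ops(n_in, n_out) for n_in, n_out in layers)
    return tops * 1e12 / ops_per_eval


# Hypothetical small fully connected network: (inputs, outputs) per layer.
HYPOTHETICAL_LAYERS = [(1024, 16), (16, 32), (32, 1)]

if __name__ == "__main__":
    for tops in (40, 45):
        rate = peak_evals_per_second(tops, HYPOTHETICAL_LAYERS)
        print(f"{tops} TOPS -> at most {rate:.3g} evals/s (theoretical peak)")
```

This is a peak figure only. Whether an engine could get anywhere near it would depend on how positions are batched and on the latency of each NPU call, which the sketch does not model.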
--
Srdja