Ipman has 2 interesting SF benchmarks

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

Jouni
Posts: 3769
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Ipman has 2 interesting SF benchmarks

Post by Jouni »

102.542.506.414 nps for 131.072 threads cluster!
85.835.086 nps for AWS Graviton3 64 threads. What's this?
Jouni
Vinvin
Posts: 5310
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: Ipman has 2 interesting SF benchmarks

Post by Vinvin »

Jouni wrote: Sun Nov 27, 2022 11:16 pm 102.542.506.414 nps for 131.072 threads cluster!
https://ipmanchess.yolasite.com/amd--in ... ckfish.php

It would be interesting to have a short video (even took with a smartphone) of this benchmark.
And a short video (15 to 30 seconds) displaying NPS in the starting positions with this monster.
CornfedForever
Posts: 650
Joined: Mon Jun 20, 2022 4:08 am
Full name: Brian D. Smith

Re: Ipman has 2 interesting SF benchmarks

Post by CornfedForever »

I've been wondering if there is a rule of thumb regarding NPS and # of threads. Ex:

NPS: 69.539.701 AMD Ryzen 9 7950X @4.9Ghz DDR5 6000 CL30 32threads

I doubt it would be fair to say therefore:
16 threads would = about 34,769.850 NPS
and 8 threads = 17,384,925 NPS

But perhaps it is NOT a linear relationship like that. Would anyone have a 'rule of thumb' if not?
smatovic
Posts: 3480
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Ipman has 2 interesting SF benchmarks

Post by smatovic »

Jouni wrote: Sun Nov 27, 2022 11:16 pm [...]
85.835.086 nps for AWS Graviton3 64 threads. What's this?
Amazon AWS cloud Graviton3 CPU, ARM Neoverse V1 based with 64 cores (up to two sockets?):

https://en.wikipedia.org/wiki/AWS_Graviton#Graviton3

https://en.wikipedia.org/wiki/ARM_Neoverse#Neoverse_V1

--
Srdja
smatovic
Posts: 3480
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Ipman has 2 interesting SF benchmarks

Post by smatovic »

CornfedForever wrote: Mon Nov 28, 2022 3:57 am I've been wondering if there is a rule of thumb regarding NPS and # of threads. Ex:

NPS: 69.539.701 AMD Ryzen 9 7950X @4.9Ghz DDR5 6000 CL30 32threads

I doubt it would be fair to say therefore:
16 threads would = about 34,769.850 NPS
and 8 threads = 17,384,925 NPS

But perhaps it is NOT a linear relationship like that. Would anyone have a 'rule of thumb' if not?
Generally, Stockfish NNUE NPS depends also on the network size, currently it makes 1M to 2M NPS per core on modern CPUs (depends for example on frequency and vector-unit like SSE, AVX2, NEON), and with HyperThreading on (SMT2, 2 threads per core) you gain roughly 1.5x more NPS.

--
Srdja
Werewolf
Posts: 2059
Joined: Thu Sep 18, 2008 10:24 pm

Re: Ipman has 2 interesting SF benchmarks

Post by Werewolf »

Jouni wrote: Sun Nov 27, 2022 11:16 pm 102.542.506.414 nps for 131.072 threads cluster!
Presumably this is a cloud setup he has access to? I'd love to know the elo gain...
Jouni
Posts: 3769
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Re: Ipman has 2 interesting SF benchmarks

Post by Jouni »

Obviously confidential tester has access to one TOP500 supercomputer and can run SF bench!
Jouni
Jouni
Posts: 3769
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Re: Ipman has 2 interesting SF benchmarks

Post by Jouni »

In discord is even faster bench:

graviton3 / ARM64
sf 15:
git checkout e6e324eb28fd49c1fc44b3b65784f85a773ec61c
make -j ARCH=armv8 profile-build
./stockfish bench 128 64 24 default depth
===========================
Total time (ms) : 57573
Nodes searched : 7099323910
Nodes/second : 123309952

Threadripper speed :!: .
Jouni
CornfedForever
Posts: 650
Joined: Mon Jun 20, 2022 4:08 am
Full name: Brian D. Smith

Re: Ipman has 2 interesting SF benchmarks

Post by CornfedForever »

smatovic wrote: Mon Nov 28, 2022 9:14 am
CornfedForever wrote: Mon Nov 28, 2022 3:57 am I've been wondering if there is a rule of thumb regarding NPS and # of threads. Ex:

NPS: 69.539.701 AMD Ryzen 9 7950X @4.9Ghz DDR5 6000 CL30 32threads

I doubt it would be fair to say therefore:
16 threads would = about 34,769.850 NPS
and 8 threads = 17,384,925 NPS

But perhaps it is NOT a linear relationship like that. Would anyone have a 'rule of thumb' if not?
Generally, Stockfish NNUE NPS depends also on the network size, currently it makes 1M to 2M NPS per core on modern CPUs (depends for example on frequency and vector-unit like SSE, AVX2, NEON), and with HyperThreading on (SMT2, 2 threads per core) you gain roughly 1.5x more NPS.

--
Srdja
So, in general, in doubling the thread count (say from 8 to 16) instead of a 2x gain in NPS, you get closer to 1.5x gain in NPS - instead of from say 100 to 200, you actually get closer to 150?
smatovic
Posts: 3480
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Ipman has 2 interesting SF benchmarks

Post by smatovic »

CornfedForever wrote: Mon Nov 28, 2022 2:48 pm
smatovic wrote: Mon Nov 28, 2022 9:14 am
CornfedForever wrote: Mon Nov 28, 2022 3:57 am I've been wondering if there is a rule of thumb regarding NPS and # of threads. Ex:

NPS: 69.539.701 AMD Ryzen 9 7950X @4.9Ghz DDR5 6000 CL30 32threads

I doubt it would be fair to say therefore:
16 threads would = about 34,769.850 NPS
and 8 threads = 17,384,925 NPS

But perhaps it is NOT a linear relationship like that. Would anyone have a 'rule of thumb' if not?
Generally, Stockfish NNUE NPS depends also on the network size, currently it makes 1M to 2M NPS per core on modern CPUs (depends for example on frequency and vector-unit like SSE, AVX2, NEON), and with HyperThreading on (SMT2, 2 threads per core) you gain roughly 1.5x more NPS.

--
Srdja
So, in general, in doubling the thread count (say from 8 to 16) instead of a 2x gain in NPS, you get closer to 1.5x gain in NPS - instead of from say 100 to 200, you actually get closer to 150?
If thread and core count is equal you can expcect an doubling of NPS, 8cores with 8threads to 16cores with 16threads doubles the NPS, assuming same architecture and frequency cos most engines scale linear NPS wise nowadays across cores on a single socket, then you turn SMT resp. HT on, and you get further ~1.5x NPS by this, depends, on architecture and engine.

https://en.wikipedia.org/wiki/Simultane ... mentations

Modern Intel and AMD processors profit both from two threads per core, 2-way SMT, in varying percentages, some people prefer SMT resp. HT off during testing.

--
Srdja