SPCC: Testrun of SFnnue sv200728_1104 finished

Discussion of computer chess matches and engine tournaments.

Moderators: bob, hgm, Harvey Williamson

Forum rules
This textbox is used to restore diagrams posted with the [d] tag before the upgrade.
Post Reply
pohl4711
Posts: 1303
Joined: Sat Sep 03, 2011 5:25 am
Location: Berlin, Germany
Contact:

SPCC: Testrun of SFnnue sv200728_1104 finished

Post by pohl4711 » Thu Jul 30, 2020 12:45 pm

AB-testrun of SFnnue sv200728_1104 (nodchip avx2-compile) finished. Another huge Elo-gain !!!

https://www.sp-cc.de

pohl4711
Posts: 1303
Joined: Sat Sep 03, 2011 5:25 am
Location: Berlin, Germany
Contact:

Re: SPCC: Testrun of SFnnue sv200728_1104 finished

Post by pohl4711 » Thu Jul 30, 2020 12:56 pm

Code: Select all

Individual statistics:

1 SFnnue sv200728_1104   : 3648 7000 (+4475,=2425,-100), 81.3 %

Slow Chess 2.2 popc      : 1000 (+722,=268,- 10), 85.6 %
Ethereal 12.25 pext      : 1000 (+715,=280,-  5), 85.5 %
Komodo 14 bmi2           : 1000 (+548,=435,- 17), 76.5 %
Stockfish 11 200118      : 1000 (+357,=597,- 46), 65.5 %
Xiphos 0.6 bmi2          : 1000 (+759,=236,-  5), 87.7 %
Fire 7.1 popc            : 1000 (+769,=227,-  4), 88.3 %
Houdini 6 pext           : 1000 (+605,=382,- 13), 79.6 %
Only 100 losses out of 7000 games versus 7 strong AB-engines...
Classical, official Stockfish is released, when around +50 Elo are reached, compared to the latest official release. So SFnnue now plays nearly at the level of Stockfish 13 (!)

marsell
Posts: 56
Joined: Tue Feb 07, 2012 10:14 am

Re: SPCC: Testrun of SFnnue sv200728_1104 finished

Post by marsell » Thu Jul 30, 2020 2:06 pm

thanks for the test, Stockfish nnue is great.

mehmet123
Posts: 135
Joined: Sun Jan 26, 2020 9:38 pm
Location: Turkey
Full name: Mehmet Karaman

Re: SPCC: Testrun of SFnnue sv200728_1104 finished

Post by mehmet123 » Thu Jul 30, 2020 2:16 pm

Stockfish NNUE SV 200728_1104 vs Stockfish 11 :1000 (+357,=597,- 46), 65.5 %

+112 elo difference (elostat). Really great

Gary Internet
Posts: 53
Joined: Thu Jan 04, 2018 6:09 pm

Re: SPCC: Testrun of SFnnue sv200728_1104 finished

Post by Gary Internet » Thu Jul 30, 2020 4:20 pm

This has to be the end of an era that people have talked about. This is a real step change in strength gain for an engine that runs on CPU only. Stefan's words say more than mine could:
AB-testrun of SFnnue sv200728_1104 (nodchip avx2-compile) finished. Another huge Elo-gain: +18 Elo to SFnnue sv200724_0123, +61 Elo to Stockfish 200717 (latest SF-dev) and +94 Elo to Stockfish 11 !!!
Even if NNUE stopped gaining Elo forever by mid August 2020, it would probably take Stockfish Dev about 3 years to catch up. At the moment NNUE seems to making months of progress in just days, and years of progress in a week or two.

EDIT: As Mehmet says, if you just look at the head-to-head results against SF 11 and calculate Elo from that, 112 Elo is light years ahead of where SF Dev would be in the same head-to-head match right now. Yes, it's "only" 1,000 games, but when the winning margin is so great, you can't seriously expect that if they played another 1,000 games, the scores would balance out to a draw. Doesn't look like SF11 would stand a chance.

Post Reply