SPCC: Testruns of Stockfish 230531 finished

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
pohl4711
Posts: 2822
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

SPCC: Testruns of Stockfish 230531 finished

Post by pohl4711 »

Ratinglist-testruns of Stockfish 230531 finished. First Stockfish Dev with the new, bigger nnue-net architecture...


https://www.sp-cc.de

https://www.sp-cc.de/uho_ratinglist.htm

Also take a look at the EAS-Ratinglist, the world's first engine-ratinglist not measuring strength of engines but engines's style of play:
https://www.sp-cc.de/eas-ratinglist.htm

(Perhaps you have to clear your browsercache (press STRG+SHIFT+DEL) or reload the website))
ernest
Posts: 2053
Joined: Wed Mar 08, 2006 8:30 pm

Re: SPCC: Testruns of Stockfish 230531 finished

Post by ernest »

+7 Elo... Amazing !
But what made you interrupt/cancel your previous test-run ?

BTW, your "competitor" (? :D ) NCM finds a -10 Elo regression...
User avatar
pohl4711
Posts: 2822
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

Re: SPCC: Testruns of Stockfish 230531 finished

Post by pohl4711 »

ernest wrote: Wed Jun 07, 2023 4:53 am +7 Elo... Amazing !
But what made you interrupt/cancel your previous test-run ?

BTW, your "competitor" (? :D ) NCM finds a -10 Elo regression...
NCM uses extremly short thinking-time. Stockfish with the new and bigger net runs -20% slower. The tests on fishtests of the new Stockfish have shown clearly, that there is Elo progress on longer thinking-times and Elo loss with hyper-bullet speed.

Here you can see it clearly (Fishtest results of the new and bigger net): The longer the thinkig-time gets, the better the result gets...

Failed STC
https://tests.stockfishchess.org/tests/ ... e4cfa75f97
LLR: -2.94 (-2.94,2.94) <0.00,2.00>
Total: 13728 W: 3588 L: 3829 D: 6311 Elo -6.10
Ptnml(0-2): 71, 1661, 3610, 1482, 40

Failing LTC
https://tests.stockfishchess.org/tests/ ... 3c4c9f3618
LLR: -1.91 (-2.94,2.94) <0.50,2.50>
Total: 35424 W: 9522 L: 9603 D: 16299 Elo -0.79
Ptnml(0-2): 24, 3579, 10585, 3502, 22

Passed VLTC 180+1.8
https://tests.stockfishchess.org/tests/ ... 3c4c9f3638
LLR: 2.95 (-2.94,2.94) <0.50,2.50>
Total: 47616 W: 13174 L: 12863 D: 21579 Elo +2.27
Ptnml(0-2): 13, 4261, 14952, 4566, 16

Passed VLTC SMP 60+0.6 th 8
https://tests.stockfishchess.org/tests/ ... e4cfa761e5
LLR: 2.94 (-2.94,2.94) <0.50,2.50>
Total: 19942 W: 5694 L: 5451 D: 8797 Elo +4.23
Ptnml(0-2): 6, 1504, 6707, 1749, 5
ernest
Posts: 2053
Joined: Wed Mar 08, 2006 8:30 pm

Re: SPCC: Testruns of Stockfish 230531 finished

Post by ernest »

pohl4711 wrote: Wed Jun 07, 2023 8:20 am
ernest wrote: Wed Jun 07, 2023 4:53 am +7 Elo... Amazing !
But what made you interrupt/cancel your previous test-run ?

BTW, your "competitor" (? :D ) NCM finds a -10 Elo regression...
NCM uses extremly short thinking-time. Stockfish with the new and bigger net runs -20% slower. The tests on fishtests of the new Stockfish have shown clearly, that there is Elo progress on longer thinking-times and Elo loss with hyper-bullet speed.
New Stockfish from abrok.eu/stockfish 230612 with "a net with reordered weights" shows a +10 Elo on short time controls (STC).

Seems confirmed by your "competitor" NCM...

Awaiting your test ! :)