SPCC: Testruns of Koivisto 9 finished

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
pohl4711
Posts: 2824
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

SPCC: Testruns of Koivisto 9 finished

Post by pohl4711 »

Ratinglist-testrun of Koivisto 9 finished.
(I did the testrun with the Ipman-compile of Koivisto 8.17, but Koivisto 9 is identical to 8.17 (after "go depth 22" same pv-line and nodes). Only difference is, that the official Koivisto 9 binary is around +2% faster on my machines - such a small speed increase is meanigless for the Elo-performance in my ratinglist)


https://www.sp-cc.de

Also take a look at the EAS-Ratinglist, the world's first engine-ratinglist not measuring strength of engines but engines's style of play:
https://www.sp-cc.de/eas-ratinglist.htm

(Perhaps you have to clear your browsercache (press STRG+SHIFT+DEL) or reload the website))
Jouni
Posts: 3717
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Re: SPCC: Testruns of Koivisto 9 finished

Post by Jouni »

Nice they are back! But I can't detect any improvement here:

Code: Select all

Score of Koivisto9 vs Koivisto8.16: 46 - 58 - 396 [0.488]
...      Koivisto9 playing White: 35 - 10 - 205  [0.550] 250
...      Koivisto9 playing Black: 11 - 48 - 191  [0.426] 250
...      White vs Black: 83 - 21 - 396  [0.562] 500
Elo difference: -8.3 +/- 13.9, LOS: 12.0 %, DrawRatio: 79.2 %
500 of 500 games finished.
I used HERT book and 60+0,6 games. Compile from github. It's faster than Ipman compile. Github 2,19 Mnps and Ipman 2,14 Mnps.
Jouni
Jouni
Posts: 3717
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Re: SPCC: Testruns of Koivisto 9 finished

Post by Jouni »

Also weaker in test suites: Arasan suite Koivisto8.16 scored 158/200 but Koivisto9 150/200 :? .
Jouni
User avatar
pohl4711
Posts: 2824
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

Re: SPCC: Testruns of Koivisto 9 finished

Post by pohl4711 »

Jouni wrote: Mon Jan 16, 2023 5:37 pm
I used HERT book and 60+0,6 games. Compile from github. It's faster than Ipman compile. Github 2,19 Mnps and Ipman 2,14 Mnps.
As I said: Official avx2 binary is around 2% faster. No big deal. 1 Elo or less...
User avatar
pohl4711
Posts: 2824
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

Re: SPCC: Testruns of Koivisto 9 finished

Post by pohl4711 »

Jouni wrote: Mon Jan 16, 2023 5:37 pm Nice they are back! But I can't detect any improvement here:

Code: Select all

Score of Koivisto9 vs Koivisto8.16: 46 - 58 - 396 [0.488]
...      Koivisto9 playing White: 35 - 10 - 205  [0.550] 250
...      Koivisto9 playing Black: 11 - 48 - 191  [0.426] 250
...      White vs Black: 83 - 21 - 396  [0.562] 500
Elo difference: -8.3 +/- 13.9, LOS: 12.0 %, DrawRatio: 79.2 %
500 of 500 games finished.
I used HERT book and 60+0,6 games. Compile from github. It's faster than Ipman compile. Github 2,19 Mnps and Ipman 2,14 Mnps.
Last tested dev-version in my list was 8.13 and from 8.13 to 8.17(=9.0), the progress was +10 Elo. So, I can not say anything about progress from 8.16 to 8.17(=9.0)
Only change from 8.16 to 8.17(=9.0) was the new nnue-net. In the selftest on Openbench, there was a clear progress, using the new net:
ELO | 19.41 +- 6.65 (95%)
SPRT | 40.0+0.40s Threads=1 Hash=64MB
LLR | 2.95 (-2.94, 2.94) [0.00, 2.50]
GAMES | N: 5072 W: 1369 L: 1086 D: 2617

Of course, this test was done, using my UHO-openings, which are spreading the Elo at least factor 2. And the thinking-time was small. But there is definitly a progress. In your 500 game test, the regression is clearly inside errorbar.
Jouni
Posts: 3717
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Re: SPCC: Testruns of Koivisto 9 finished

Post by Jouni »

After more games I got:

Code: Select all

                        
1   Koivisto9       +5  +176/=845/-159 50.72%  598.5/1180
2   Koivisto8.16    -5  +159/=845/-176 49.28%  581.5/1180

Reminder, that even 500 games is not enough among equal engines!
Jouni