NCM Stockfish Dev testing

Jouni
Posts: 3283
Joined: Wed Mar 08, 2006 8:15 pm

NCM Stockfish Dev testing

Post by Jouni »

https://nextchessmove.com/dev-builds

How is +60 Elo over SF8 explained? Unbalanced book?
Jouni
JJJ
Posts: 1346
Joined: Sat Apr 19, 2014 1:47 pm

Re: NCM Stockfish Dev testing

Post by JJJ »

It is compared to Stockfish 7, not 8. That might explain why the Elo gain is larger than it should be.

I also see the time control is shorter than in Stockfish's long time control test. 30 sec + 0.3 would explain the larger Elo gain.
Jouni
Posts: 3283
Joined: Wed Mar 08, 2006 8:15 pm

Re: NCM Stockfish Dev testing

Post by Jouni »

They use 30+0.3s with 2 CPUs. Shouldn't that be close to the framework's 60+0.6s?
Jouni
JJJ
Posts: 1346
Joined: Sat Apr 19, 2014 1:47 pm

Re: NCM Stockfish Dev testing

Post by JJJ »

Many patches do better at shorter time controls.
Andre
Posts: 98
Joined: Thu Jul 23, 2009 5:40 am

Re: NCM Stockfish Dev testing

Post by Andre »

It's +130 compared to SF7 and around +60 compared to SF8
JJJ
Posts: 1346
Joined: Sat Apr 19, 2014 1:47 pm

Re: NCM Stockfish Dev testing

Post by JJJ »

Andre wrote: It's +130 compared to SF7 and around +60 compared to SF8
Of course it should be +30 compared to SF8 in reality, so I guess time control helps a lot here.
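
For readers who want to see what the gap between +30 and +60 means in match results, here is a minimal sketch using the standard logistic Elo model. The function names are made up for illustration, and this is not necessarily how NCM or the Stockfish framework compute their reported figures.

```python
import math

def expected_score(elo_diff):
    """Expected score (0..1) for the side that is elo_diff points stronger."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))

def elo_from_score(score):
    """Elo difference implied by a match score strictly between 0 and 1."""
    return -400.0 * math.log10(1.0 / score - 1.0)

if __name__ == "__main__":
    # Compare the two figures discussed in this thread.
    for diff in (30, 60):
        print(f"+{diff} Elo -> expected score {expected_score(diff):.3f}")
```

Under this model +30 corresponds to scoring roughly 54% and +60 to roughly 58.5%, so a modest shift in score percentage from a shorter time control or a different book is enough to move the reported number that much.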
Jouni
Posts: 3283
Joined: Wed Mar 08, 2006 8:15 pm

Re: NCM Stockfish Dev testing

Post by Jouni »

So they were testing WITHOUT a book! And now they are re-testing all versions with an 8-move book. Waste of time?
Jouni
Jouni
Posts: 3283
Joined: Wed Mar 08, 2006 8:15 pm

Re: NCM Stockfish Dev testing

Post by Jouni »

Now everything is displayed exactly and the PGN is available, great! But a draw after move 34 seems a little premature?
Jouni
Ozymandias
Posts: 1534
Joined: Sun Oct 25, 2009 2:30 am

Re: NCM Stockfish Dev testing

Post by Ozymandias »

Jouni wrote: PGN available
Not anymore.
Jouni
Posts: 3283
Joined: Wed Mar 08, 2006 8:15 pm

Re: NCM Stockfish Dev testing

Post by Jouni »

Let's see what happens next. But this is definitely bad:

Code:

-draw \
    movenumber=34 \
    movecount=8 \
    score=20 \

My suggestion is to re-test one version per month after SF8, plus all the 2018 versions.
Jouni
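
To make the effect of the -draw settings quoted above concrete, here is a rough sketch of how a rule of this kind is commonly applied. It assumes the game is called a draw once at least `movenumber` full moves have been played and both engines have reported scores within `score` centipawns of zero for `movecount` consecutive moves each. The function name and the simplified logic are illustrative assumptions, not the match runner's actual code.

```python
def draw_adjudication_ply(scores_cp, movenumber=34, movecount=8, score=20):
    """Return the 1-based ply at which a draw would be adjudicated, or None.

    scores_cp: engine-reported scores in centipawns, one per ply.
    Simplified assumption: a ply is "quiet" when |score| <= threshold, and
    the rule fires after 2 * movecount consecutive quiet plies (movecount
    moves by each side) once movenumber full moves have been completed.
    """
    quiet_plies = 0
    for ply, cp in enumerate(scores_cp, start=1):
        quiet_plies = quiet_plies + 1 if abs(cp) <= score else 0
        full_moves_played = ply // 2
        if full_moves_played >= movenumber and quiet_plies >= 2 * movecount:
            return ply
    return None

if __name__ == "__main__":
    # A game where every reported score is 0.00 ends as soon as move 34 is complete.
    print(draw_adjudication_ply([0] * 200))
```

With these settings, a game that has stayed near equality stops right at move 34 regardless of how much play is left in the position, which is the behaviour being questioned here.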