Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Mon Jun 11, 2018 10:38 am
by corres
TCEC proves that the difference between the developer version of Stockfish and the other top engines has decreased by now.
I think it is time to change the base engine (Stockfish 7) to a stronger one (Stockfish 8).
I suppose it may give a more realistic picture of the strength of Stockfish dev. than Stockfish 7 does.
Moreover, it may stimulate the developers to make changes that gain Elo instead of making cosmetic code modifications with noisy Elo results.

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Mon Jun 11, 2018 1:18 pm
by Vinvin
Where do you see "Stockfish 7"?

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Mon Jun 11, 2018 2:09 pm
by Uri Blass
corres wrote: Mon Jun 11, 2018 10:38 am TCEC proves that the difference between the developer version of Stockfish and the other top engines has decreased by now.
I think it is time to change the base engine (Stockfish 7) to a stronger one (Stockfish 8).
I suppose it may give a more realistic picture of the strength of Stockfish dev. than Stockfish 7 does.
Moreover, it may stimulate the developers to make changes that gain Elo instead of making cosmetic code modifications with noisy Elo results.
TCEC proves nothing because it does not have enough games.

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Mon Jun 11, 2018 6:23 pm
by corres
Uri Blass wrote: Mon Jun 11, 2018 2:09 pm
TCEC proves nothing because it does not have enough games.
The number of games is important only when the Elo difference is small.
If the difference is big enough, even two games played from the same starting position with reversed colors may show the relative strength of the two engines.

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Mon Jun 11, 2018 10:53 pm
by Uri Blass
corres wrote: Mon Jun 11, 2018 6:23 pm
Uri Blass wrote: Mon Jun 11, 2018 2:09 pm
TCEC proves nothing because it does not have enough games.
The number of games is important only when the Elo difference is small.
If the difference is big enough, even two games played from the same starting position with reversed colors may show the relative strength of the two engines.

My point is that there is no proof that the difference between Stockfish and the rest of the programs got smaller.

Stockfish has 12 wins and 32 draws in the Premier Division of season 12, for a total result of 63.636%.
Stockfish had 39 wins, one loss and 44 draws in the Premier Division of season 11, for a total result of 72.619%.

I think the expected result may be about 69%, with Stockfish having been lucky in season 11 and unlucky in season 12.
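
The percentages above can be checked with a few lines of Python. This is only a sketch of the arithmetic: a win counts as 1 point, a draw as 0.5 and a loss as 0, and the "pooled" figure simply adds the two seasons together, which is an assumption made here for illustration and ignores that the opponents were not the same in both seasons.

```python
def score(wins, draws, losses):
    """Fraction of points scored: win = 1, draw = 0.5, loss = 0."""
    games = wins + draws + losses
    return (wins + 0.5 * draws) / games

# Win/draw/loss records quoted in the post above.
season12 = (12, 32, 0)   # 44 games
season11 = (39, 44, 1)   # 84 games

# Hypothetical pooling of both seasons (illustration only).
pooled = tuple(a + b for a, b in zip(season11, season12))

for name, record in (("season 11", season11),
                     ("season 12", season12),
                     ("pooled", pooled)):
    print(f"{name}: {100 * score(*record):.3f}%")
```

The pooled score comes out a little above 69%, which is in line with the rough estimate given above.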

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Tue Jun 12, 2018 12:09 am
by corres
In my opinion the top chess engines are: Stockfish, Komodo, Houdini.
The recent standing is:
Stockfish 160518... -110 Elo
Komodo 12............+59 Elo
Houdini 6.03..........-27 Elo
No comment...

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Tue Jun 12, 2018 1:35 am
by AndrewGrant
No comment...
Edgy closers don't replace statistics.

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Tue Jun 12, 2018 1:36 pm
by corres
Uri Blass wrote: Mon Jun 11, 2018 10:53 pm
I think the expected result may be about 69%, with Stockfish having been lucky in season 11 and unlucky in season 12.
You would be right if the participants were the same.
But that is not the case.
Stockfish is now relatively weaker than it was.

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Tue Jun 12, 2018 1:40 pm
by corres
AndrewGrant wrote: Tue Jun 12, 2018 1:35 am
No comment...
Edgy closers don't replace statistics.
If you are interested in statistics, please produce those statistics yourself.
The data are public.
To me it is obvious that losing 110 Elo over 44 games is not a statistical issue.
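
For anyone who wants to run the numbers, here is a minimal sketch of how an Elo difference and a rough 95% interval can be derived from a win/draw/loss record. It assumes the usual logistic Elo model and a normal approximation for the score, and it uses the 12 wins and 32 draws in 44 games quoted earlier in the thread. Note that it estimates the performance against the average opponent, not the rating change listed in the standings, and the function names are made up for this example.

```python
import math

def elo_diff(score):
    """Elo difference implied by a score fraction under the logistic model."""
    return -400 * math.log10(1 / score - 1)

def elo_interval(wins, draws, losses, z=1.96):
    """Return (low, mid, high) Elo estimates using a normal approximation."""
    games = wins + draws + losses
    score = (wins + 0.5 * draws) / games
    # Per-game variance of the result (1, 0.5 or 0) around the mean score.
    var = (wins * (1 - score) ** 2
           + draws * (0.5 - score) ** 2
           + losses * (0 - score) ** 2) / games
    se = math.sqrt(var / games)
    # Clamp so the logistic formula stays defined at the extremes.
    low = max(score - z * se, 1e-6)
    high = min(score + z * se, 1 - 1e-6)
    return elo_diff(low), elo_diff(score), elo_diff(high)

low, mid, high = elo_interval(12, 32, 0)
print(f"Elo vs. field: {mid:.0f} (approx. 95% interval {low:.0f} to {high:.0f})")
```

With only 44 games the interval spans roughly 50 Elo in each direction, so the sample size is worth keeping in mind when reading the standings.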

Re: Developer tests of Stockfish need Stockfish 8 instead of Stockfish 7

Posted: Tue Jun 12, 2018 1:59 pm
by sovaz1997
TC you don't understand, why are you talking about this?