I was not sure if CEGT reported larger gains, thanks for confirm it.Uri Blass wrote:There are rating lists that show more than 56 elo for stockfish4 relative to stockfish3
CEGT 40/4 rating list shows 66 elo improvement so I think that the main reason is not that the 56 elo is too much but the fact that the IPON use slower time control.
http://www.husvankempen.de/nunn/40_4_Ra ... liste.html
18 Stockfish 4.0 x64 1CPU 3032 13 13 1700 66.6% 2911 41.2%
53 Stockfish 3.0 x64 1CPU 2966 12 12 2000 61.0% 2889 43.6%
Note also that the best stockfish in the CEGT 40/4 rating list is a different stockfish and 30 elo may not enough for stockfish developement version with 1 cpu to catch the first place there.
7 Stockfish 2.2.2 x64 4CPU 3081 13 13 1600 68.4% 2947 40.0%
------------------------
Sure. Just as a side note I want to remember that the Elo difference of two engines is a function of wins - loses:Adam Hair wrote:The regression results inside the Stockfish testing framework are not directly comparable to any rating list due to the large draw rate.
Code: Select all
W: wins/games.
L: loses/games:
(Elo difference) = 400*log{[1 + (W - L)]/[1 - (W - L)]}
------------------------
Thanks for your insights, Uri and Adam. They are helpful, as usual.
Regards from Spain.
Ajedrecista.