Re: CCRL update (1st September 2007)
Posted: Tue Sep 04, 2007 2:12 pm
Hi Kiril,
If both "ponder hits" and "draw analysis" point in the same direction, then the two engines are likely to be similar-thinking engines.
Are there any "control" experiments with "ponder hits" to establish reference points for the stats? For example, calculating "ponder hits" with two identical engines, or with two intra-family engines with a 100 Elo difference in strength, or with two unrelated engines whose source code is published, so that there is no doubt that the engines are unrelated.
I think the "in between" issue with the "draw analysis" of Strelka printed above is due to the mixture of Rybka+Toga+Fruit. If I just focus on Strelka v Rybka 1.0 Beta, there is a clearer picture.
Strelka 1.0b 32-bit v Rybka 1.0 Beta 32-bit: 9 wins, 32 draws, 23 losses
draw rate: 50.0%
Strelka 1.8 32-bit v Rybka 1.0 Beta 32-bit: 10 wins, 33 draws, 21 losses
draw rate: 51.6%
(note: Rybka 1.0 Beta 32-bit was sometimes previously mentioned as Rybka Beta 32-bit)
(inter-family draw rate 29.8%, intra-family draw rate 50.1%)
(note: These Strelka v Rybka 1.0 games were included in the inter-family group, not the intra-family group).
Based on these 128 games, there is an indication of similar thinking. But I do not think the data is sufficient for a "statistical" conclusion. I imagine that it would need 20 times as many games.
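For anyone who wants to check the arithmetic, here is a rough Python sketch that recomputes the draw rates above and puts an approximate 95% Wilson interval around the pooled draw rate. It assumes every game is an independent trial with the same draw probability, which is a simplification (it ignores strength differences, openings and time controls), and the function names are made up for illustration. It is not meant as a proper test of "similar thinking".

```python
import math

# (wins, draws, losses) from Strelka's side, as quoted in the post
results = {
    "Strelka 1.0b 32-bit v Rybka 1.0 Beta 32-bit": (9, 32, 23),
    "Strelka 1.8 32-bit v Rybka 1.0 Beta 32-bit": (10, 33, 21),
}

# Reference draw rates quoted in the post
INTER_FAMILY = 0.298
INTRA_FAMILY = 0.501


def draw_rate(wins, draws, losses):
    """Fraction of games that ended in a draw."""
    return draws / (wins + draws + losses)


def wilson_interval(successes, n, z=1.96):
    """Approximate Wilson score interval for a binomial proportion.

    Assumes independent games with a constant draw probability
    (an illustrative simplification, not something the post claims).
    """
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


for name, (w, d, l) in results.items():
    print(f"{name}: draw rate {100 * draw_rate(w, d, l):.1f}%")

total_draws = sum(d for _, d, _ in results.values())
total_games = sum(w + d + l for w, d, l in results.values())
low, high = wilson_interval(total_draws, total_games)

print(f"pooled: {total_draws} draws in {total_games} games")
print(f"approx. 95% interval: {100 * low:.1f}% to {100 * high:.1f}%")
print(f"reference rates: inter-family {100 * INTER_FAMILY:.1f}%, "
      f"intra-family {100 * INTRA_FAMILY:.1f}%")
```

The interval only says something about sampling noise under those assumptions. It does not address whether draw rate is a fair proxy for engine similarity in the first place, which is why control experiments would still be needed.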
-Norm