I believe that this tournament is showing that the differences in power between the various top-engines, are smaller than some would have us believe.S.Taylor wrote:That would be reassuring to those who want to feel secure that they know which is the strongest program, and use it for that reason.Leto wrote:While the chance of Houdini 3 performing this poorly at the start of the tournament is very low it is still a statistical probability. In all likelihood Houdini 3 will pull ahead shortly and win this tournament with a comfortable lead thanks to the long format.S.Taylor wrote:OK, so it's a genuine one. That wasn't something i was doubting.Graham Banks wrote:Houdini 3 64-bit 4CPU is a legitimate copy and is using the correct default settings.S.Taylor wrote:I just now noticed this whole tournament.
Is there possible a reason why Houdini 3 is taking it easy, and not showing fully convincing results in this tournament?
I had thought that it often DOES, against these opponents.
Is there something wrong in its settings? in its power due to something technical? Or is it weaker in shorter times, like these games are played?
These things can happen in one off tournaments, as people should be well aware by now.
That's why rating lists with hundreds or thousands of games give a more accurate indication of the relative strengths of engines.
All games of this tournament are available for scrutiny.
But, it seems to be in a completely different mood this time. Almost every game, no matter who it is against, is doing a favour to be a draw, sometimes not even that. And if it wins, once in a while, it looks like it was by accident.
But the way it looks is like it might even have 2 more losses, one win (perhaps) and the rest draws.
(with rybka and critter getting straight wins, or maybe with 2 draws)
But on the other hand, i may see it differently if i actually watched the games.
OK, so it is statistically possible. But it shows there are still enough areas that need to be strengthened in its playing.
As Graham said, to have the rankings statistically reliable, it takes thousands of games, but this tournament (and other tournaments) is showing that the gap between the various engines is not that great.
Another interesting point: the old Rybka is proving to have much more to say.
This shows that this "old clone" was not so bad!
