Modern Times wrote:
Hi Sedat,
Yes, it is a very complex issue, that is certain.
The results are what they are and it is easy to get too hung up with over-analysing them. Only with tens of thousands of games can you have any real confidence to draw firm conclusions. With reference to the AMD games, they were run by me in a meticulous manner, with a wide variety of different books and opening sets, and all GUI adjudication turned off. So I stand by those games 100%. It is kind of a sense of deja-vu for me though, because I had similarly disappointing results for Komodo 4 on the same machine.
Hello Ray,
Yes, this issue is something like deja-vu for me too.
Actually, I have played thousands of games for Hardware Elo measurements, but not tens of thousands of games per player.
And I guess you mean 'tens of thousands of games' per player, right?
If so, it sounds good, but then I am afraid that we will not find such a hero.
Forget tens of thousands of games per player: there is not even any rating list based on a similar number of games per player.
Plus, when running tens of thousands of games per player with a small, decent, neutral book, there is a BIG chance that many similar or duplicate games will appear.
Another very important factor is the openings issue: to get fewer duplicate or similar games, we need a wide variety of openings.
For example, to produce tens of thousands of games per player, we would have to work with hundreds or maybe thousands of openings...
Then it will be no surprise to see many hundreds of games per player lost because of critical or disadvantageous openings.
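To put a rough number on the openings point, here is a minimal sketch. It assumes a plain round-robin in which every player meets every other player equally often, and that each opening is played once with each colour per pairing. The function name and the figure of 77 participants (taken from the last rank in the table further down) are only illustrative assumptions. It counts how many distinct openings are needed so that no pairing has to replay the same opening with the same colours; it says nothing about games that diverge after the book ends.

```python
import math

def min_openings_needed(games_per_player: int, num_players: int,
                        both_colors: bool = True) -> int:
    """Smallest opening set so that no pairing must replay an opening
    with the same colours (hypothetical round-robin model)."""
    opponents = num_players - 1
    # Games each pair of players has to play against each other.
    games_per_pairing = math.ceil(games_per_player / opponents)
    # Each opening can be used twice per pairing if colours are reversed.
    games_per_opening = 2 if both_colors else 1
    return math.ceil(games_per_pairing / games_per_opening)

# 77 players is an assumption based on the last rank shown in the table.
for games in (152, 2_000, 20_000):
    print(games, "games per player ->",
          min_openings_needed(games, num_players=77), "openings")
```

Under these assumptions, 20,000 games per player already needs well over a hundred distinct openings, which is in line with the "hundreds of openings" estimate above.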
Of course, there is no doubt that an engine performs better or worse depending on hardware speed.
However, the openings are another very important factor that can lead to different standings...
Btw, please check the table below:
http://www.sedatcanbaz.com/chess/tourna ... ournament/
Note also that the current Elo difference is approx. 160 Elo (between first and last place)
Rank  Participant   Elo    +   –   games  score  oppo.  draws
   1  Hitman H15a   3384   45  44  152    63%    3299   42%
  77  Jakal H15a    3225   44  45  152    38%    3301   43%
In other words, even tens of thousands of games per player will not give us data accurate enough to have the confidence to draw firm conclusions.
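For anyone who wants to see how the error bars in such a table depend on the number of games, here is a minimal sketch. It assumes a simple normal approximation on the score percentage against average opposition, with a 95% interval (z = 1.96). This is not necessarily how the rating tool behind the table computes its + and – columns, and the function names are my own. It only covers sampling noise, so it does not capture any bias coming from the choice of openings.

```python
import math

def elo_from_score(score: float) -> float:
    """Elo difference implied by an expected score (logistic model)."""
    return -400.0 * math.log10(1.0 / score - 1.0)

def elo_margin(games: int, score: float, draw_ratio: float, z: float = 1.96):
    """Approximate (+, -) Elo error margins for a score over `games` games.

    Assumes independent games and a normal approximation; z = 1.96
    corresponds to a 95% interval.
    """
    wins = score - draw_ratio / 2.0
    losses = 1.0 - wins - draw_ratio
    # Per-game variance of the result (win = 1, draw = 0.5, loss = 0).
    variance = (wins * (1.0 - score) ** 2
                + draw_ratio * (0.5 - score) ** 2
                + losses * (0.0 - score) ** 2)
    std_err = math.sqrt(variance / games)
    centre = elo_from_score(score)
    upper = elo_from_score(score + z * std_err) - centre
    lower = centre - elo_from_score(score - z * std_err)
    return upper, lower

# First row of the table: 152 games, 63% score, 42% draws.
for n in (152, 1_000, 2_000, 20_000):
    plus, minus = elo_margin(n, score=0.63, draw_ratio=0.42)
    print(f"{n:>6} games: +{plus:.0f} / -{minus:.0f} Elo")
```

With the first-place figures (152 games, 63%, 42% draws) this gives margins in the low 40s, close to the +45/–44 shown in the table. At 2,000 games the same model gives roughly ±12 Elo, and at 20,000 games roughly ±4.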
And in my opinion, the Hardware Elo measurements should be done with well-optimized openings (up to 10-12 moves).
Plus, data from between 1,000 and 2,000 games per player will be quite a good indicator of which processor or engine is stronger in Elo for chess.
Best,
Sedat