Dedicated Chess Machine Elo vs Human Elo, a least squares analysis
Posted: Fri Aug 23, 2019 11:35 pm
Inspired by something said by GM Kaufman here: http://www.talkchess.com/forum3/viewtop ... 2&start=30 I went ahead and found some data and below is the correlation between dedicated chess machine Elo (SSDF) and human Elos, the human pool was apparently from 1987-1991 in Europe. Data below, some of the links are now dead, but I had saved a local copy of the data.
The conclusion is that there's a strong correlation between engine Elo and human Elo, which is comforting for people like me who don't play in human tournaments but track their Elo by playing vs an engine.
I suspect software is similar to this dedicated hardware, I don't see why not, at least at the below super GM level.
JR
Human pool from 1987-1991 in Europe.
For similar hardware I combined the games played and averaged the Elos. The numbers from left to right are: SSDF rating, FIDE vs humans , #Number of Games, and the final entry are comments.
SSDF rating FIDE vs humans # Games Comments
1621 1617 23 Novag Super Forte A 6502 5 MHz, Fidelity Excellence 6502 3 MHz, 24. Novag Super Constellation 6502 4MHz
1716 1861 28 17. Novag Forte B 6502 5 MHz
1780 1999 28 8. CXG Sphinx Galaxy 6502 4 MHz
1815 1866 28 15. Mephisto Mega IV 6502 5 MHz
1816 1722 18 21. Saitek Maestro D 6502 10 MHz
1817.5 1894 18 13. Saitek Maestro A 6502 6 MHz, 14. Novag Super Expert B 6502 6 MHz
1839 2057 21 6. Mephisto Academy 6502 5 MHz
1860 1996 15 9. Novag Super Forte C 6502 6 MHz
1872.3 1967.0 23 10. Mephisto Roma 68020 14 MHz, 11. Novag Diablo 68000 16 MHz, 12. Psion Atari 68000 8 MHz
1893 2067 25 5. Fidelity Mach III 68000 16 MHz
1923 1866 15 15. Mephisto Dallas 68000 12 MHz
1973 2030 26 7. Mephisto Almeria 68020 12 Mhz
1977 2177 19 3. Fidelity Mach IV 68020 12 MHz
2145 2217 22 1. Mephisto Lyon 68020 12 MHz & 2. Mephisto Portorose 68020 12 MHz, combined
Correlation: 0.80126309 very high for this sample
Least fit equations and graphic: Red is the Fide rating vs human players, Blue is the SSDF rating:
https://pasteboard.co/Iu4XoPE.png
raw data from: (dead link) http://home.interact.se/~w100107/level.htm
The conclusion is that there's a strong correlation between engine Elo and human Elo, which is comforting for people like me who don't play in human tournaments but track their Elo by playing vs an engine.
I suspect software is similar to this dedicated hardware, I don't see why not, at least at the below super GM level.
JR
Human pool from 1987-1991 in Europe.
For similar hardware I combined the games played and averaged the Elos. The numbers from left to right are: SSDF rating, FIDE vs humans , #Number of Games, and the final entry are comments.
SSDF rating FIDE vs humans # Games Comments
1621 1617 23 Novag Super Forte A 6502 5 MHz, Fidelity Excellence 6502 3 MHz, 24. Novag Super Constellation 6502 4MHz
1716 1861 28 17. Novag Forte B 6502 5 MHz
1780 1999 28 8. CXG Sphinx Galaxy 6502 4 MHz
1815 1866 28 15. Mephisto Mega IV 6502 5 MHz
1816 1722 18 21. Saitek Maestro D 6502 10 MHz
1817.5 1894 18 13. Saitek Maestro A 6502 6 MHz, 14. Novag Super Expert B 6502 6 MHz
1839 2057 21 6. Mephisto Academy 6502 5 MHz
1860 1996 15 9. Novag Super Forte C 6502 6 MHz
1872.3 1967.0 23 10. Mephisto Roma 68020 14 MHz, 11. Novag Diablo 68000 16 MHz, 12. Psion Atari 68000 8 MHz
1893 2067 25 5. Fidelity Mach III 68000 16 MHz
1923 1866 15 15. Mephisto Dallas 68000 12 MHz
1973 2030 26 7. Mephisto Almeria 68020 12 Mhz
1977 2177 19 3. Fidelity Mach IV 68020 12 MHz
2145 2217 22 1. Mephisto Lyon 68020 12 MHz & 2. Mephisto Portorose 68020 12 MHz, combined
Correlation: 0.80126309 very high for this sample
Least fit equations and graphic: Red is the Fide rating vs human players, Blue is the SSDF rating:
https://pasteboard.co/Iu4XoPE.png
raw data from: (dead link) http://home.interact.se/~w100107/level.htm