Hello Geert:
countrychess wrote:Code: Select all
CountryChess 02-07 Germany d1
Halle, 2012.11.16 - 2012.11.19
Score SB O G A N H C L G S I A Y
-------------------------------------------------------------------------
1: Octochess revision 4741 8.0 / 11 40.75 X 1 0 1 = 1 0 1 1 = 1 1
2: Ghost 2.01 8.0 / 11 37.00 0 X 0 1 = 1 1 = 1 1 1 1
3: Aristarch 4.50 7.5 / 11 39.75 1 1 X 0 0 1 = 1 = = 1 1
4: N2 0.4 6.0 / 11 31.00 0 0 1 X = 1 1 0 1 1 0 =
5: Homer 2.01 UCI 6.0 / 11 31.00 = = 1 = X 0 = 0 0 1 1 1
6: Comet B68 5.5 / 11 25.25 0 0 0 0 1 X = 1 1 1 = =
7: Little Goliath Evolution 5.0 / 11 28.50 1 0 = 0 = = X 1 = 0 1 0
8: Gromit 3.8.2 5.0 / 11 26.00 0 = 0 1 1 0 0 X 1 0 1 =
9: Snitch 1.6.2 5.0 / 11 22.25 0 0 = 0 1 0 = 0 X 1 1 1
10: Ikarus V0.18 3.5 / 11 19.25 = 0 = 0 0 0 1 1 0 X 0 =
11: Abrok 5.0 3.5 / 11 15.25 0 0 0 1 0 = 0 0 0 1 X 1
12: Yace 0.99.87 3.0 / 11 15.00 0 0 0 = 0 = 1 = 0 = 0 X
-------------------------------------------------------------------------
66 games: +27 =16 -23
Thanks for your kind words in
CountryChess 02: Germany, second division topic.
In this case, I get exactly the same ratings in both of my lists (I use this arbitrary scale = min_score + max_score; here: (3 + 8)/11 = 1), so you can see that my programme is lucky sometimes, sometimes not and of course it is not a true rating programme as BayesElo, EloSTAT and Ordo.
I downloaded 173077.pgn and I got these ratings (setting the average rating to 0):
BayesElo:
Code: Select all
version 0057.2, Copyright (C) 1997-2010 Remi Coulom.
compiled Apr 5 2012 17:26:01.
This program comes with ABSOLUTELY NO WARRANTY.
This is free software, and you are welcome to redistribute it
under the terms and conditions of the GNU General Public License.
See http://www.gnu.org/copyleft/gpl.html for details.
ResultSet>readpgn 173077.pgn
66 game(s) loaded, 0 game(s) with unknown result ignored.
ResultSet>elo
ResultSet-EloRating>mm 1 1
00:00:00,00
ResultSet-EloRating>confidence 0.95
0.95
ResultSet-EloRating>ratings
Rank Name Elo Diff + - Games Score Oppo. Draws Win W-L-D
1 Octochess revision 4741 132.11 0.00 155.89 155.89 11 72.73% -12.01 18.18% 63.64% 7-2-2
2 Ghost 2.01 127.34 -4.77 151.75 151.75 11 72.73% -11.58 18.18% 63.64% 7-2-2
3 Aristarch 4.50 91.29 -36.05 151.74 151.74 11 68.18% -8.30 27.27% 54.55% 6-2-3
4 Homer 2.01 UCI 34.67 -56.62 143.84 143.84 11 54.55% -3.15 36.36% 36.36% 4-3-4
5 N2 0.4 20.66 -14.01 152.16 152.16 11 54.55% -1.88 18.18% 45.45% 5-4-2
6 Comet B68 -12.40 -33.05 144.90 144.90 11 50.00% 1.13 27.27% 36.36% 4-4-3
7 Snitch 1.6.2 -20.02 -7.62 148.50 148.50 11 45.45% 1.82 18.18% 36.36% 4-5-2
8 Little Goliath Evolution -21.22 -1.20 146.02 146.02 11 45.45% 1.93 36.36% 27.27% 3-4-4
9 Gromit 3.8.2 -24.00 -2.78 151.69 151.69 11 45.45% 2.18 18.18% 36.36% 4-5-2
10 Ikarus V0.18 -97.69 -73.70 151.02 151.02 11 31.82% 8.88 27.27% 18.18% 2-6-3
11 Abrok 5.0 -110.31 -12.61 155.42 155.42 11 31.82% 10.03 9.09% 27.27% 3-7-1
12 Yace 0.99.87 -120.42 -10.11 145.67 145.67 11 27.27% 10.95 36.36% 9.09% 1-6-4
ResultSet-EloRating>
EloSTAT (using BayesElo):
Code: Select all
version 0057.2, Copyright (C) 1997-2010 Remi Coulom.
compiled Apr 5 2012 17:26:01.
This program comes with ABSOLUTELY NO WARRANTY.
This is free software, and you are welcome to redistribute it
under the terms and conditions of the GNU General Public License.
See http://www.gnu.org/copyleft/gpl.html for details.
ResultSet>read 173077.pgn
Unknown command: read
type '?' for help
ResultSet>readpgn 173077.pgn
66 game(s) loaded, 0 game(s) with unknown result ignored.
ResultSet>elo
ResultSet-EloRating>elostat
8 iterations
00:00:00,00
ResultSet-EloRating>ratings
Rank Name Elo Diff + - Games Score Oppo. Draws Win W-L-D
1 Octochess revision 4741 155.71 0.00 374.35 173.03 11 72.73% -14.16 18.18% 63.64% 7-2-2
2 Ghost 2.01 155.71 -0.00 374.35 173.03 11 72.73% -14.16 18.18% 63.64% 7-2-2
3 Aristarch 4.50 120.88 -34.82 268.98 164.53 11 68.18% -10.99 27.27% 54.55% 6-2-3
4 N2 0.4 28.55 -92.33 221.77 196.14 11 54.55% -2.60 18.18% 45.45% 5-4-2
5 Homer 2.01 UCI 28.55 -0.00 187.82 169.63 11 54.55% -2.60 36.36% 36.36% 4-3-4
6 Comet B68 -0.48 -29.03 192.69 192.69 11 50.00% 0.04 27.27% 36.36% 4-4-3
7 Gromit 3.8.2 -29.52 -29.03 196.14 221.77 11 45.45% 2.68 18.18% 36.36% 4-5-2
8 Snitch 1.6.2 -29.52 -0.00 196.14 221.77 11 45.45% 2.68 18.18% 36.36% 4-5-2
9 Little Goliath Evolution -29.52 -0.00 169.63 187.82 11 45.45% 2.68 36.36% 27.27% 3-4-4
10 Abrok 5.0 -121.85 -92.33 187.48 352.64 11 31.82% 11.08 9.09% 27.27% 3-7-1
11 Ikarus V0.18 -121.85 -0.00 164.53 268.98 11 31.82% 11.08 27.27% 18.18% 2-6-3
12 Yace 0.99.87 -156.67 -34.82 147.02 256.20 11 27.27% 14.24 36.36% 9.09% 1-6-4
ResultSet-EloRating>
Ordo 0.6:
Code: Select all
[...]\ordo-windows-v0.6>ordo-win32 -a 0 -p 173077.pgn -o ordo.txt
Loading data (2000 games x dot):
|
Total games 66
White wins 27
Draws 16
Black wins 23
No result 0
Unique head to head 0.00%
Reference rating 0.0 (average of the pool)
Convergence rating calculation
phase iteration deviation resolution
0 1 76.63472 130.44612
1 22 0.00000 0.00000
done
Post-Convergence rating estimation
# ENGINE : RATING POINTS PLAYED (%)
1 Ghost 2.01 : 169.1 8.0 11 72.7%
2 Octochess revision 4741 : 169.1 8.0 11 72.7%
3 Aristarch 4.50 : 132.2 7.5 11 68.2%
4 Homer 2.01 UCI : 31.5 6.0 11 54.5%
5 N2 0.4 : 31.5 6.0 11 54.5%
6 Comet B68 : -0.5 5.5 11 50.0%
7 Little Goliath Evolution : -32.5 5.0 11 45.5%
8 Gromit 3.8.2 : -32.5 5.0 11 45.5%
9 Snitch 1.6.2 : -32.5 5.0 11 45.5%
10 Abrok 5.0 : -132.9 3.5 11 31.8%
11 Ikarus V0.18 : -132.9 3.5 11 31.8%
12 Yace 0.99.87 : -169.5 3.0 11 27.3%
At least I get an almost equal list to EloSTAT:
Code: Select all
Round Robin with 12 engines and 11 games per engine.
Total number of games: 66 games.
154.42 (engine 01).
154.42 (engine 02).
119.88 (engine 03).
28.31 (engine 04).
28.31 (engine 05).
-0.48 (engine 06).
-29.27 (engine 07).
-29.27 (engine 08).
-29.27 (engine 09).
-120.84 (engine 10).
-120.84 (engine 11).
-155.38 (engine 12).
Mean of ratings: 0.00 Elo.
A comparison with EloSTAT is acceptable and even encouraging, in this case; other comparisons are embarrasing, specially with BayesElo! It is a true achievement for me to obtain very similar results to at least one reputed rating programme (EloSTAT in this case), even more if I do not know how EloSTAT (or other rating programme) internally works.
It is better to take my Elo performance list as a funny coincidence than taking it as a serious thing. Thanks again for your tournaments!
Regards from Spain.
Ajedrecista.