CountryChess 02 Germany: first division

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

countrychess
Posts: 295
Joined: Fri Sep 16, 2011 7:56 pm
Location: Belgium

CountryChess 02 Germany: first division

Post by countrychess »

Code: Select all


CountryChess 02-07 Germany d1,

Listing of Players
 N     Name                     Ratin 
  1    Octochess R4751 X64 6Cpu 2582  
  2    Aristarch 4.50           2511  
  3    N2 0.4 X64               2500  
  4    Little Goliath Evolution 2465  
  5    Yace 0.99.87             2442  
  6    Homer 2.01               2429  
  7    Ikarus 0.18              2412  
  8    Ghost 2.01               2380  
  9    Gromit 3.82              2376  
 10    Comet B68 Cb             2344  
 11    Abrok 5.0                2335  
 12    Snitch 1.62 X64          2244  
Engines ranked 1-2 promote to the premier division; engines ranked 11-12 demote to the second division.
countrychess
Posts: 295
Joined: Fri Sep 16, 2011 7:56 pm
Location: Belgium

Standings after first leg

Post by countrychess »

Code: Select all


CountryChess 02-07 Germany d1
Halle, 2012.11.16 - 2012.11.19
                                Score       SB   O G A N H C L G S I A Y
-------------------------------------------------------------------------
 1: Octochess revision 4741    8.0 / 11   40.75  X 1 0 1 = 1 0 1 1 = 1 1
 2: Ghost 2.01                 8.0 / 11   37.00  0 X 0 1 = 1 1 = 1 1 1 1
 3: Aristarch 4.50             7.5 / 11   39.75  1 1 X 0 0 1 = 1 = = 1 1
 4: N2 0.4                     6.0 / 11   31.00  0 0 1 X = 1 1 0 1 1 0 =
 5: Homer 2.01 UCI             6.0 / 11   31.00  = = 1 = X 0 = 0 0 1 1 1
 6: Comet B68                  5.5 / 11   25.25  0 0 0 0 1 X = 1 1 1 = =
 7: Little Goliath Evolution   5.0 / 11   28.50  1 0 = 0 = = X 1 = 0 1 0
 8: Gromit 3.8.2               5.0 / 11   26.00  0 = 0 1 1 0 0 X 1 0 1 =
 9: Snitch 1.6.2               5.0 / 11   22.25  0 0 = 0 1 0 = 0 X 1 1 1
10: Ikarus V0.18               3.5 / 11   19.25  = 0 = 0 0 0 1 1 0 X 0 =
11: Abrok 5.0                  3.5 / 11   15.25  0 0 0 1 0 = 0 0 0 1 X 1
12: Yace 0.99.87               3.0 / 11   15.00  0 0 0 = 0 = 1 = 0 = 0 X
-------------------------------------------------------------------------
66 games: +27 =16 -23

User avatar
Ajedrecista
Posts: 2181
Joined: Wed Jul 13, 2011 9:04 pm
Location: Madrid, Spain.

Re: Standings after first leg.

Post by Ajedrecista »

Hello Geert:
countrychess wrote:

Code: Select all


CountryChess 02-07 Germany d1
Halle, 2012.11.16 - 2012.11.19
                                Score       SB   O G A N H C L G S I A Y
-------------------------------------------------------------------------
 1: Octochess revision 4741    8.0 / 11   40.75  X 1 0 1 = 1 0 1 1 = 1 1
 2: Ghost 2.01                 8.0 / 11   37.00  0 X 0 1 = 1 1 = 1 1 1 1
 3: Aristarch 4.50             7.5 / 11   39.75  1 1 X 0 0 1 = 1 = = 1 1
 4: N2 0.4                     6.0 / 11   31.00  0 0 1 X = 1 1 0 1 1 0 =
 5: Homer 2.01 UCI             6.0 / 11   31.00  = = 1 = X 0 = 0 0 1 1 1
 6: Comet B68                  5.5 / 11   25.25  0 0 0 0 1 X = 1 1 1 = =
 7: Little Goliath Evolution   5.0 / 11   28.50  1 0 = 0 = = X 1 = 0 1 0
 8: Gromit 3.8.2               5.0 / 11   26.00  0 = 0 1 1 0 0 X 1 0 1 =
 9: Snitch 1.6.2               5.0 / 11   22.25  0 0 = 0 1 0 = 0 X 1 1 1
10: Ikarus V0.18               3.5 / 11   19.25  = 0 = 0 0 0 1 1 0 X 0 =
11: Abrok 5.0                  3.5 / 11   15.25  0 0 0 1 0 = 0 0 0 1 X 1
12: Yace 0.99.87               3.0 / 11   15.00  0 0 0 = 0 = 1 = 0 = 0 X
-------------------------------------------------------------------------
66 games: +27 =16 -23

Thanks for your kind words in CountryChess 02: Germany, second division topic.

In this case, I get exactly the same ratings in both of my lists (I use this arbitrary scale = min_score + max_score; here: (3 + 8)/11 = 1), so you can see that my programme is lucky sometimes, sometimes not and of course it is not a true rating programme as BayesElo, EloSTAT and Ordo.

I downloaded 173077.pgn and I got these ratings (setting the average rating to 0):

BayesElo:

Code: Select all

version 0057.2, Copyright (C) 1997-2010 Remi Coulom.
compiled Apr  5 2012 17:26:01.
This program comes with ABSOLUTELY NO WARRANTY.
This is free software, and you are welcome to redistribute it
under the terms and conditions of the GNU General Public License.
See http://www.gnu.org/copyleft/gpl.html for details.
ResultSet>readpgn 173077.pgn
66 game(s) loaded, 0 game(s) with unknown result ignored.
ResultSet>elo
ResultSet-EloRating>mm 1 1
00:00:00,00
ResultSet-EloRating>confidence 0.95
0.95
ResultSet-EloRating>ratings
Rank Name                         Elo     Diff     +     -      Games  Score    Oppo.   Draws    Win          W-L-D
   1 Octochess revision 4741   132.11     0.00 155.89 155.89       11  72.73%  -12.01  18.18%  63.64%         7-2-2
   2 Ghost 2.01                127.34    -4.77 151.75 151.75       11  72.73%  -11.58  18.18%  63.64%         7-2-2
   3 Aristarch 4.50             91.29   -36.05 151.74 151.74       11  68.18%   -8.30  27.27%  54.55%         6-2-3
   4 Homer 2.01 UCI             34.67   -56.62 143.84 143.84       11  54.55%   -3.15  36.36%  36.36%         4-3-4
   5 N2 0.4                     20.66   -14.01 152.16 152.16       11  54.55%   -1.88  18.18%  45.45%         5-4-2
   6 Comet B68                 -12.40   -33.05 144.90 144.90       11  50.00%    1.13  27.27%  36.36%         4-4-3
   7 Snitch 1.6.2              -20.02    -7.62 148.50 148.50       11  45.45%    1.82  18.18%  36.36%         4-5-2
   8 Little Goliath Evolution  -21.22    -1.20 146.02 146.02       11  45.45%    1.93  36.36%  27.27%         3-4-4
   9 Gromit 3.8.2              -24.00    -2.78 151.69 151.69       11  45.45%    2.18  18.18%  36.36%         4-5-2
  10 Ikarus V0.18              -97.69   -73.70 151.02 151.02       11  31.82%    8.88  27.27%  18.18%         2-6-3
  11 Abrok 5.0                -110.31   -12.61 155.42 155.42       11  31.82%   10.03   9.09%  27.27%         3-7-1
  12 Yace 0.99.87             -120.42   -10.11 145.67 145.67       11  27.27%   10.95  36.36%   9.09%         1-6-4
ResultSet-EloRating>
EloSTAT (using BayesElo):

Code: Select all

version 0057.2, Copyright (C) 1997-2010 Remi Coulom.
compiled Apr  5 2012 17:26:01.
This program comes with ABSOLUTELY NO WARRANTY.
This is free software, and you are welcome to redistribute it
under the terms and conditions of the GNU General Public License.
See http://www.gnu.org/copyleft/gpl.html for details.
ResultSet>read 173077.pgn
Unknown command: read
type '?' for help
ResultSet>readpgn 173077.pgn
66 game(s) loaded, 0 game(s) with unknown result ignored.
ResultSet>elo
ResultSet-EloRating>elostat
8 iterations
00:00:00,00
ResultSet-EloRating>ratings
Rank Name                         Elo     Diff     +     -      Games  Score    Oppo.   Draws    Win          W-L-D
   1 Octochess revision 4741   155.71     0.00 374.35 173.03       11  72.73%  -14.16  18.18%  63.64%         7-2-2
   2 Ghost 2.01                155.71    -0.00 374.35 173.03       11  72.73%  -14.16  18.18%  63.64%         7-2-2
   3 Aristarch 4.50            120.88   -34.82 268.98 164.53       11  68.18%  -10.99  27.27%  54.55%         6-2-3
   4 N2 0.4                     28.55   -92.33 221.77 196.14       11  54.55%   -2.60  18.18%  45.45%         5-4-2
   5 Homer 2.01 UCI             28.55    -0.00 187.82 169.63       11  54.55%   -2.60  36.36%  36.36%         4-3-4
   6 Comet B68                  -0.48   -29.03 192.69 192.69       11  50.00%    0.04  27.27%  36.36%         4-4-3
   7 Gromit 3.8.2              -29.52   -29.03 196.14 221.77       11  45.45%    2.68  18.18%  36.36%         4-5-2
   8 Snitch 1.6.2              -29.52    -0.00 196.14 221.77       11  45.45%    2.68  18.18%  36.36%         4-5-2
   9 Little Goliath Evolution  -29.52    -0.00 169.63 187.82       11  45.45%    2.68  36.36%  27.27%         3-4-4
  10 Abrok 5.0                -121.85   -92.33 187.48 352.64       11  31.82%   11.08   9.09%  27.27%         3-7-1
  11 Ikarus V0.18             -121.85    -0.00 164.53 268.98       11  31.82%   11.08  27.27%  18.18%         2-6-3
  12 Yace 0.99.87             -156.67   -34.82 147.02 256.20       11  27.27%   14.24  36.36%   9.09%         1-6-4
ResultSet-EloRating>
Ordo 0.6:

Code: Select all

[...]\ordo-windows-v0.6>ordo-win32 -a 0 -p 173077.pgn -o ordo.txt

Loading data (2000 games x dot):

|

Total games               66
White wins                27
Draws                     16
Black wins                23
No result                  0
Unique head to head     0.00%
Reference rating         0.0 (average of the pool)


Convergence rating calculation

phase iteration  deviation resolution
  0       1       76.63472  130.44612
  1      22        0.00000    0.00000
done

Post-Convergence rating estimation



   # ENGINE                      : RATING    POINTS  PLAYED    (%)
   1 Ghost 2.01                  :  169.1       8.0      11   72.7%
   2 Octochess revision 4741     :  169.1       8.0      11   72.7%
   3 Aristarch 4.50              :  132.2       7.5      11   68.2%
   4 Homer 2.01 UCI              :   31.5       6.0      11   54.5%
   5 N2 0.4                      :   31.5       6.0      11   54.5%
   6 Comet B68                   :   -0.5       5.5      11   50.0%
   7 Little Goliath Evolution    :  -32.5       5.0      11   45.5%
   8 Gromit 3.8.2                :  -32.5       5.0      11   45.5%
   9 Snitch 1.6.2                :  -32.5       5.0      11   45.5%
  10 Abrok 5.0                   : -132.9       3.5      11   31.8%
  11 Ikarus V0.18                : -132.9       3.5      11   31.8%
  12 Yace 0.99.87                : -169.5       3.0      11   27.3%
At least I get an almost equal list to EloSTAT:

Code: Select all

Round Robin with 12 engines and     11 games per engine.
Total number of games:        66 games.
 
  154.42 (engine 01).
  154.42 (engine 02).
  119.88 (engine 03).
   28.31 (engine 04).
   28.31 (engine 05).
   -0.48 (engine 06).
  -29.27 (engine 07).
  -29.27 (engine 08).
  -29.27 (engine 09).
 -120.84 (engine 10).
 -120.84 (engine 11).
 -155.38 (engine 12).
 
Mean of ratings:     0.00 Elo.
A comparison with EloSTAT is acceptable and even encouraging, in this case; other comparisons are embarrasing, specially with BayesElo! It is a true achievement for me to obtain very similar results to at least one reputed rating programme (EloSTAT in this case), even more if I do not know how EloSTAT (or other rating programme) internally works.

It is better to take my Elo performance list as a funny coincidence than taking it as a serious thing. Thanks again for your tournaments!

Regards from Spain.

Ajedrecista.
countrychess
Posts: 295
Joined: Fri Sep 16, 2011 7:56 pm
Location: Belgium

Standings after second leg.

Post by countrychess »

Code: Select all


CountryChess 02-07 Germany d1
Halle, 2012.11.16 - 2012.11.22
                                Score       SB   Oc Ar N2 Ho Gh Sn Gr Co Li Ik Ya Ab
-------------------------------------------------------------------------------------
 1: Octochess revision 4741   17.0 / 22  171.75  XX 00 11 == 11 11 11 11 0= =1 11 11
 2: Aristarch 4.50            14.5 / 22  157.00  11 XX 0= 0= 11 =1 10 11 == == 11 10
 3: N2 0.4                    13.5 / 22  138.00  00 1= XX == 01 1= 01 11 11 11 == 0=
 4: Homer 2.01 UCI            11.5 / 22  124.75  == 1= == XX =1 00 0= 0= =1 1= 10 1=
 5: Ghost 2.01                11.0 / 22  107.25  00 00 10 =0 XX 11 == 1= 10 10 10 11
 6: Snitch 1.6.2              11.0 / 22  105.25  00 =0 0= 11 00 XX 01 00 =1 11 1= 11
 7: Gromit 3.8.2              10.0 / 22  104.00  00 01 10 1= == 10 XX 00 01 00 =1 11
 8: Comet B68                 10.0 / 22   99.00  00 00 00 1= 0= 11 11 XX =1 1= =0 =0
 9: Little Goliath Evolution   9.0 / 22  101.25  1= == 00 =0 01 =0 10 =0 XX 0= 01 1=
10: Ikarus V0.18               8.5 / 22   90.50  =0 == 00 0= 01 00 11 0= 1= XX == 0=
11: Yace 0.99.87               8.5 / 22   86.50  00 00 == 01 01 0= =0 =1 10 == XX 01
12: Abrok 5.0                  7.5 / 22   81.25  00 01 1= 0= 00 00 00 =1 0= 1= 10 XX
-------------------------------------------------------------------------------------
132 games: +57 =38 -37

OctoChess and Aristarch promote to the premier division; Yace and Abrok demote to the second division.
Games at the CountryChess blog.
User avatar
Ajedrecista
Posts: 2181
Joined: Wed Jul 13, 2011 9:04 pm
Location: Madrid, Spain.

Re: Standings after second leg.

Post by Ajedrecista »

Hello Geert:
countrychess wrote:

Code: Select all


CountryChess 02-07 Germany d1
Halle, 2012.11.16 - 2012.11.22
                                Score       SB   Oc Ar N2 Ho Gh Sn Gr Co Li Ik Ya Ab
-------------------------------------------------------------------------------------
 1: Octochess revision 4741   17.0 / 22  171.75  XX 00 11 == 11 11 11 11 0= =1 11 11
 2: Aristarch 4.50            14.5 / 22  157.00  11 XX 0= 0= 11 =1 10 11 == == 11 10
 3: N2 0.4                    13.5 / 22  138.00  00 1= XX == 01 1= 01 11 11 11 == 0=
 4: Homer 2.01 UCI            11.5 / 22  124.75  == 1= == XX =1 00 0= 0= =1 1= 10 1=
 5: Ghost 2.01                11.0 / 22  107.25  00 00 10 =0 XX 11 == 1= 10 10 10 11
 6: Snitch 1.6.2              11.0 / 22  105.25  00 =0 0= 11 00 XX 01 00 =1 11 1= 11
 7: Gromit 3.8.2              10.0 / 22  104.00  00 01 10 1= == 10 XX 00 01 00 =1 11
 8: Comet B68                 10.0 / 22   99.00  00 00 00 1= 0= 11 11 XX =1 1= =0 =0
 9: Little Goliath Evolution   9.0 / 22  101.25  1= == 00 =0 01 =0 10 =0 XX 0= 01 1=
10: Ikarus V0.18               8.5 / 22   90.50  =0 == 00 0= 01 00 11 0= 1= XX == 0=
11: Yace 0.99.87               8.5 / 22   86.50  00 00 == 01 01 0= =0 =1 10 == XX 01
12: Abrok 5.0                  7.5 / 22   81.25  00 01 1= 0= 00 00 00 =1 0= 1= 10 XX
-------------------------------------------------------------------------------------
132 games: +57 =38 -37

OctoChess and Aristarch promote to the premier division; Yace and Abrok demote to the second division.
Games at the CountryChess blog.
Octochess won convincingly this time... it is strong! OTOH it is a pity to find a historic and extremely useful engine as Yace is in eleventh position out of twelve participants.

I got rid of my 'scale' parameter in my clumsy rating programme, so the whole code has around 200 lines now, and the part of the calculation of ratings has less than fifteen lines! It is very minimal indeed, but I get very similar results to EloSTAT, as always:

Code: Select all

Elo_ratings_for_Round_Robin_tournaments, ® 2012.

Write down the full name of the Notepad (including .txt), up to 64 characters:

a.txt

Write down the number of engines of the Round Robin tournament (up to 64):

12

Write down the number of games of each engine (up to 400000):

22

Write down your desired mean of ratings:

0

Write down the clock rate of the CPU (in GHz), only for timing the elapsed time of the calculations:

3

End of the calculations. Elo performances have been saved in Elo_rating_list.txt file.

Approximated elapsed time:   27 ms.

Thanks for using Elo_ratings_for_Round_Robin_tournaments. Press Enter to exit.

Code: Select all

Round Robin with 12 engines and     22 games per engine.
Total number of games:       132 games.
 
  191.69 (engine 01).
  102.54 (engine 02).
   71.48 (engine 03).
   12.79 (engine 04).
   -1.58 (engine 05).
   -1.58 (engine 06).
  -30.37 (engine 07).
  -30.37 (engine 08).
  -59.65 (engine 09).
  -74.64 (engine 10).
  -74.64 (engine 11).
 -105.69 (engine 12).
 
Mean of ratings:     0.00 Elo.
I downloaded EloSTAT 1.3 two few days ago, so there is no need for me to use EloSTAT algorithm under BayesElo again:

Code: Select all

EloStat Version 1.3
by Dr. Frank Schubert, 01/2005
-------------------------------

Choose program mode:

(1) Rating list
(2) Tournament
(3) Single competition
1

Name of pgn-file = 173524
Start Elo = 0
Minimum number of games = 22

Reading 173524.pgn...
12 programs, 132 games
Analysing database structure... 1 cluster(s)

Calculating Elo ratings...
16 iterations

General statistics in 'general.dat'
Individual statistics in 'programs.dat'
Rating list in 'rating.dat'
Database structure in 'cluster.dat'

CPU time = 0 sec

Ende der Berechnung (Taste)

Code: Select all

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Octochess revision 4741        :  193  165 152    22    77.3 %    -19   18.2 %
  2 Aristarch 4.50                 :  103  131 126    22    65.9 %    -11   31.8 %
  3 N2 0.4                         :   72  128 125    22    61.4 %     -8   31.8 %
  4 Homer 2.01 UCI                 :   13  106 105    22    52.3 %     -3   50.0 %
  5 Ghost 2.01                     :   -2  138 138    22    50.0 %     -2   18.2 %
  6 Snitch 1.6.2                   :   -2  138 138    22    50.0 %     -2   18.2 %
  7 Gromit 3.8.2                   :  -31  138 140    22    45.5 %      1   18.2 %
  8 Comet B68                      :  -31  129 131    22    45.5 %      1   27.3 %
  9 Little Goliath Evolution       :  -60  120 123    22    40.9 %      4   36.4 %
 10 Ikarus V0.18                   :  -75  115 118    22    38.6 %      5   40.9 %
 11 Yace 0.99.87                   :  -75  125 128    22    38.6 %      5   31.8 %
 12 Abrok 5.0                      : -107  136 142    22    34.1 %      8   22.7 %
The differences are less than 1%!

Code: Select all

Max.(EloSTAT) - min.(EloSTAT) = 193 - (-107) = 300
Max.(my list) - min.(my list) = 191.69 - (-105.69) = 297.38

300/297.38 ~ 1.0088; 297.38/300 ~ 0.9913
I am quite happy considering that I do not know the EloSTAT algorithm while I use my own algorithm. For a quick comparison:

Code: Select all

    Program                          Elo     My list:

  1 Octochess revision 4741        :  193  |  191.69
  2 Aristarch 4.50                 :  103  |  102.54
  3 N2 0.4                         :   72  |   71.48
  4 Homer 2.01 UCI                 :   13  |   12.79
  5 Ghost 2.01                     :   -2  |   -1.58
  6 Snitch 1.6.2                   :   -2  |   -1.58
  7 Gromit 3.8.2                   :  -31  |  -30.37
  8 Comet B68                      :  -31  |  -30.37
  9 Little Goliath Evolution       :  -60  |  -59.65
 10 Ikarus V0.18                   :  -75  |  -74.64
 11 Yace 0.99.87                   :  -75  |  -74.64
 12 Abrok 5.0                      : -107  | -105.69
Once again: thank you very much for your tournaments! I will not spam more with my ratings.

Regards from Spain.

Ajedrecista.
countrychess
Posts: 295
Joined: Fri Sep 16, 2011 7:56 pm
Location: Belgium

Re: Standings after second leg.

Post by countrychess »

Hello Jesús,

Oh, I don't see your posts as spam :-), the only thing is that I don't know much about algorithms. But go ahead if you like to try out your programme :-).

Best regards,

Geert