First Gambit Rating List

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Rebel, chrisw

User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

First Gambit Rating List

Post by Rebel »

http://rebel13.nl/misc/gambit-rating-list.html

Code: Select all

   Gambit Rating List  : 2021-05-01
   Time Control        : 40 moves in 2 minutes repeating
   Software            : cutechess-cli 1.1.0.f.1.0
   Elo calculation     : Ordo 1.2.6
   Games               : 13.500
   
   # PLAYER            :  RATING  POINTS  PLAYED   (%)    W    D    L  D(%)  OppAvg
   1 Stockfish 13      :  3579.3   724.5     900    81  563  323   14    36  3320.1
   2 Komodo Dragon     :  3486.8   629.5     900    70  427  405   68    45  3330.4
   3 Lc0 v27 RTX 1060  :  3442.3   434.0     700    62  265  338   97    48  3351.0
   4 Komodo 14         :  3337.0   443.5     900    49  235  417  248    46  3347.1
   5 Anchor engine     :  3300.0   827.0    1400    59  550  554  296    40  3250.4
   6 RubiChess 2.1     :  3276.6   365.0     900    41  162  406  332    45  3353.8
   7 SlowChess 2.5     :  3269.2   355.5     900    40  148  415  337    46  3354.6
   8 Pedone 3.1        :  3267.2   529.0     800    66  386  286  128    36  3143.3
   9 Igel 3.0.5        :  3261.6   274.5     700    39  100  349  251    50  3351.0
  10 Ethereal 12.75    :  3260.1   344.0     900    38  156  376  368    42  3355.6
  11 Igel 3.0.0        :  3247.8   328.5     900    37  111  435  354    48  3357.0
  12 Nemorino 6.00     :  3207.9   461.0     800    58  332  258  210    32  3150.7
  13 Pedone 3.0        :  3173.9   420.0     800    53  265  310  225    39  3155.0
  14 rofChade 2.3      :  3168.5   413.5     800    52  262  303  235    38  3155.7
  15 Booot 6.5         :  3166.8   411.5     800    51  242  339  219    42  3155.9
  16 Anchor engine     :  3064.7  1245.5    2200    57  907  677  616    31  3016.6
  17 Wasp 4.50         :  3058.7   284.5     800    36  145  279  376    35  3169.4
  18 Halogen 10        :  3044.3   422.0     700    60  315  214  171    31  2968.8
  19 Seer 2.0.0        :  3021.6   552.0     900    61  425  254  221    28  2938.7
  20 Tucano 9.0        :  3018.6   493.5     700    71  420  147  133    21  2850.8
  21 Berserk 3.3.0     :  3016.4   491.5     700    70  407  169  124    24  2851.1
  22 Weiss 1.3         :  3014.1   542.0     900    60  403  278  219    31  2939.5
  23 Minic 3.04        :  3006.8   381.0     700    54  272  218  210    31  2974.2
  24 Arasan 2.22       :  3006.1   227.5     800    28  117  221  462    28  3176.0
  25 Mr Bob 1.0.0      :  3005.4   481.5     700    69  413  137  150    20  2852.7
  26 Marvin 5.0        :  2982.9   354.5     700    51  232  245  223    35  2977.6
  27 Orion 0.8         :  2961.7   332.0     700    47  217  230  253    33  2980.6
  28 Stash 29.0        :  2959.9   330.0     700    47  217  226  257    32  2980.9
  29 Vajolet2 2.8      :  2950.8   320.0     700    46  214  212  274    30  2982.2
  30 Topple 0.8.0      :  2945.7   449.5     900    50  300  299  301    33  2947.1
  31 Cheng 4.41        :  2928.3   425.5     900    47  287  277  336    31  2949.0
  32 Counter 3.7       :  2928.0   425.0     900    47  274  302  324    34  2949.1
  33 Seer 1.2.1        :  2924.9   359.5     800    45  245  229  326    29  2961.8
  34 FabChess 1.16     :  2920.2   413.5     900    46  277  273  350    30  2949.9
  35 Cheng 4.40        :  2872.0   348.5     900    39  224  249  427    28  2955.3
  36 Amoeba 3.3        :  2855.2   218.5     700    31  122  193  385    28  2995.9
  37 Anchor engine     :  2850.0   482.5    1300    37  309  347  644    27  2927.8
  38 Cheese 2.2        :  2836.9   314.0     700    45  228  172  300    25  2876.8
  39 ProDeo 3.1        :  2808.8   285.5     700    41  205  161  334    23  2880.8
  40 ProDeo 3.0        :  2745.4   224.0     700    32  154  140  406    20  2889.9
  41 Benjamin          :  2713.5   356.5     600    59  294  125  181    21  2632.6
  42 K2 0.99           :  2699.3   343.5     600    57  270  147  183    25  2635.0
  43 Dumb 1.8          :  2693.3   338.0     600    56  271  134  195    22  2636.0
  44 Anchor engine     :  2692.9   485.5    1200    40  368  235  597    20  2770.7
  45 Fridolin 3.10     :  2667.4   314.0     600    52  246  136  218    23  2640.3
  46 Supernova 2.3     :  2666.3   313.0     600    52  249  128  223    21  2640.5
  47 Foxsee 7.8        :  2376.5    85.0     600    14   44   82  474    14  2688.8
Next:
. Berserk 4.0.0
. Drofa 3.0.0
. galjoen 0.41.1
. Olithink 5.9.3
. Velvet 1.2.0
90% of coding is debugging, the other 10% is writing bugs.
User avatar
xr_a_y
Posts: 1871
Joined: Sat Nov 25, 2017 2:28 pm
Location: France

Re: First Gambit Rating List

Post by xr_a_y »

Do you plan to extract figures about those gambit ? like winning % for white and black, draw %.
How does engine Elo impact the way those gambit are succesful or not ?
connor_mcmonigle
Posts: 530
Joined: Sun Sep 06, 2020 4:40 am
Full name: Connor McMonigle

Re: First Gambit Rating List

Post by connor_mcmonigle »

Thanks for the tournament Ed! Guenther has pointed this out in the Seer v2 thread, but I'll make a secondary post here. Seer v2.0.0 played many illegal moves in your tournament due to running out of time (getting so low on time that the bestmove from the previous position wouldn't be updated) as a consequence of buggy TM code which resulted in Seer sometimes going greater than 100ms over the allocated time budget. Many of these illegal moves were in completely winning positions as well.

Technically, the TM bug affects V2.0.0 at both cyclical and incremental TC, though is most problematic for cyclical TC where precise time usage is critical. In addition to resolving the buggy TM code, I've also tweaked my cyclical TM code to be less aggressive resulting in about 50 elo in my testing as apparently my old cyclical TM constants were quite terrible.
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: First Gambit Rating List

Post by Rebel »

connor_mcmonigle wrote: Sat May 01, 2021 9:28 pm Thanks for the tournament Ed! Guenther has pointed this out in the Seer v2 thread, but I'll make a secondary post here. Seer v2.0.0 played many illegal moves in your tournament due to running out of time (getting so low on time that the bestmove from the previous position wouldn't be updated) as a consequence of buggy TM code which resulted in Seer sometimes going greater than 100ms over the allocated time budget. Many of these illegal moves were in completely winning positions as well.

Technically, the TM bug affects V2.0.0 at both cyclical and incremental TC, though is most problematic for cyclical TC where precise time usage is critical. In addition to resolving the buggy TM code, I've also tweaked my cyclical TM code to be less aggressive resulting in about 50 elo in my testing as apparently my old cyclical TM constants were quite terrible.
Seer 2.0.1 will be next :wink:
90% of coding is debugging, the other 10% is writing bugs.
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: First Gambit Rating List

Post by Rebel »

xr_a_y wrote: Sat May 01, 2021 6:23 pm Do you plan to extract figures about those gambit ? like winning % for white and black, draw %.
I am new to Ordo, I will look if it has those features, else I write them myself.
xr_a_y wrote: Sat May 01, 2021 6:23 pm How does engine Elo impact the way those gambit are succesful or not ?
I don't understand the question, can you rephrase?
90% of coding is debugging, the other 10% is writing bugs.
User avatar
xr_a_y
Posts: 1871
Joined: Sat Nov 25, 2017 2:28 pm
Location: France

Re: First Gambit Rating List

Post by xr_a_y »

Rebel wrote: Sat May 01, 2021 9:44 pm
xr_a_y wrote: Sat May 01, 2021 6:23 pm How does engine Elo impact the way those gambit are succesful or not ?
I don't understand the question, can you rephrase?
I was wondering if 2800 engines handle those positions very differently than 3200 engines for instance.
So maybe, the win rate for one gambit on a specific engine range is different from another range in a way that it is different from common knowledge (like draw rate increasing at a specific paste when Elo increase). Is it more understandable ?
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: First Gambit Rating List

Post by Rebel »

xr_a_y wrote: Sat May 01, 2021 9:58 pm
Rebel wrote: Sat May 01, 2021 9:44 pm
xr_a_y wrote: Sat May 01, 2021 6:23 pm How does engine Elo impact the way those gambit are succesful or not ?
I don't understand the question, can you rephrase?
I was wondering if 2800 engines handle those positions very differently than 3200 engines for instance.
So maybe, the win rate for one gambit on a specific engine range is different from another range in a way that it is different from common knowledge (like draw rate increasing at a specific paste when Elo increase). Is it more understandable ?
I think that's a good question. I could run (as an experiment) a massive robin round between 2800-3200 engines. As an experiment because I am not sure if the games such a tournament produces are suited to include in the rating list. As a novice I leave this as an open question to the rating list experts.
90% of coding is debugging, the other 10% is writing bugs.
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: First Gambit Rating List

Post by Rebel »

Added the head-to-head statistics, if you look well you can see the anchor engines in use :wink:

http://rebel13.nl/text/head-to-head.txt
90% of coding is debugging, the other 10% is writing bugs.
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: First Gambit Rating List

Post by Rebel »

Added some sort of live support for running matches.

http://rebel13.nl/pgn4web/test.html

PGN live support, click on navigation to explore further, click on View Live Results for current results.

Navigation options

Code: Select all

  MOST IMPORTANT SQUARES
. square B8 shows the EPD.
. square C8 shows the current PGN.
. square D8 shows the complete PGN.
. square E7 flips the board.
. square F7 removes comments.

. square C6 loads previous PGN.
. square F6 loads next PGN.

. square B5 search in PGN.
Current match is not very interesting, it just creates one of the anchor engines for the 40/15 time control.
90% of coding is debugging, the other 10% is writing bugs.
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: First Gambit Rating List

Post by Rebel »

Added my second PC to the internet, division one 40/15 online now.

http://rebel13.nl/pgn4web/match2.html
http://rebel13.nl/b/grl.htm

Current standings

Code: Select all

Gambit Rating List
Running      : Divison One
Time Control : 40 moves in 15 minutes repeating
Games        : 1500

Results from file division-one-40-15.pgn:

No. Name           Win Draw Loss Unf.  Score Games       %
----------------------------------------------------------
  1 Pedone 3.1    +147 =153  -33   *0  223.5   333   67.1%
  2 Nemorino 6.00 +136 =152  -45   *0  212.0   333   63.7%
  3 Booot 6.5     +106 =160  -66   *0  186.0   332   56.0%
  4 rofChade 2.3  +108 =141  -83   *0  178.5   332   53.8%
  5 Arasan 2.22    +38 =129 -165   *0  102.5   332   30.9%
  6 Wasp 4.50      +29 =131 -172   *0   94.5   332   28.5%

Total Games:     997
White Wins:      282 (28.3%)
Black Wins:      282 (28.3%)
Draws:           433 (43.4%)
Unfinished:        0 (0.0%)
90% of coding is debugging, the other 10% is writing bugs.