FREEWARE FINAL: Group 2 Concludes- Last 4 Rounds!

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

FREEWARE FINAL: Group 2 Concludes- Last 4 Rounds!

Post by geots »

What can I say- some mighty tough engines playing some astounding chess. In the end, Hannibal and Quazar had just built up a lead after 26 rounds that no engine could hardly cut into much- it relied too much on what they did as well as what Doch and Protector did.

So they move on to the finals of Group 2:

1.Hannibal 1.2 x64
2.Quazar 0.4 x64

Special congratulations to Sam and Edsel, along with Dmitry. Fine work they have done. In fact- congrats to Doch, Protector and all the other engines in this free-for-all. None have anything to be ashamed of.

I am offering here the final standings in a different view. Just call it a different perspective. But on to the games:






Inspiron 620 Intel i5-4 True Cores
Fritz 11 gui/Fritz 13 gui
1CPU/64-bit where available
128MB hash
Bases=NONE
Ponder_Learning=OFF
Perfect 2012b.ctg w/12-move limit
40/21 Repeating (Benched to adapt to CCRL 40/40)
RR with 2 cycles




Group 2
----------




Round 27
-----------

Texel 1.01 64-bit v Glaurung 2.2 x64 (draw)
Cyclone xTreme v Tornado 4.88 x64 SSE42 (0-1)
Grapefruit 1.0 v Hannibal 1.2 x64 (0-1)
Toga II 3.1.2SE v Spike 1.4 (draw)
Booot 5.1.0 v Protector 1.4.0 x64 (0-1)
Nemo SP64o 1.0.1b v spark 1.0 corei x64 (0-1)
MinkoChess x64 1.3 Popcnt v Quazar 0.4 x64 (draw)
Thinker 5.4c Inert 64-bit v Doch64 1.3.4 (draw)

Round 28
-----------

Glaurung 2.2 x64 v Thinker 5.4c Inert 64-bit (draw)
Doch64 1.3.4 v MinkoChess x64 1.3 Popcnt (1-0)
Quazar 0.4 x64 v Nemo SP64o 1.0.1b (0-1)
spark 1.0 corei x64 v Booot 5.1.0 (1-0)
Protector 1.4.0 x64 v Toga II 3.1.2SE (1-0)
Spike 1.4 v Grapefruit 1.0 (draw)
Hannibal 1.2 x64 v Cyclone xTreme (draw)
Tornado 4.88 x64 SSE42 v Texel 1.01 64-bit (1-0)

Round 29
-----------

Tornado 4.88 x64 SSE42 v Glaurung 2.2 x64 (1-0)
Texel 1.01 64-bit v Hannibal 1.2 x64 (0-1)
Cyclone xTreme v Spike 1.4 (draw)
Grapefruit 1.0 v Protector 1.4.0 x64 (draw)
Toga II 3.1.2SE v spark 1.0 corei x64 (0-1)
Booot 5.1.0 v Quazar 0.4 x64 (0-1)
Nemo SP64o 1.0.1b v Doch64 1.3.4 (0-1)
MinkoChess x64 1.3 Popcnt v Thinker 5.4c Inert 64-bit (0-1)

Round 30
-----------

Glaurung 2.2 x64 v MinkoChess x64 1.3 Popcnt (0-1)
Thinker 5.4c Inert 64-bit v Nemo SP64o 1.0.1b (draw)
Doch64 1.3.4 v Booot 5.1.0 (draw)
Quazar 0.4 x64 v Toga II 3.1.2SE (draw)
spark 1.0 corei x64 v Grapefruit 1.0 (1-0)
Protector 1.4.0 x64 v Cyclone xTreme (1-0)
Spike 1.4 v Texel 1.01 64-bit (1-0)
Hannibal 1.2 x64 v Tornado 4.88 x64 SSE42 (draw)



Code: Select all


Group 2 SP8-Core-i5  2012

                               1  2  3  4  5  6  7  8  9  0  1  2  3  4  5  6  
1   Hannibal 1.2 x64           ** ½½ 01 11 1½ ½½ 11 11 01 ½1 11 1½ 1½ ½1 ½½ ½½  21.5/30
2   Quazar 0.4 x64             ½½ ** 00 ½½ 11 11 ½½ 10 ½½ 11 11 11 ½½ 10 ½1 11  20.5/30
3   Doch64 1.3.4 JA            10 11 ** ½0 ½0 ½0 ½1 01 ½1 ½½ 11 1½ ½1 ½1 1½ 1½  19.0/30  270.00
4   Protector 1.4.0 x64        00 ½½ ½1 ** 1½ 1½ 01 1½ ½1 11 01 ½½ 10 ½½ 1½ 11  19.0/30  267.75
5   spark-1.0 corei x64        0½ 00 ½1 0½ ** ½1 ½0 11 ½1 11 ½1 ½1 ½1 ½1 10 ½½  18.0/30
6   Spike 1.4                  ½½ 00 ½1 0½ ½0 ** 10 ½1 ½½ 0½ ½1 1½ 1½ 1½ 11 1½  17.0/30
7   MinkoChess x64 1.3 Popcnt  00 ½½ ½0 10 ½1 01 ** ½½ 1½ ½½ ½0 ½0 ½1 1½ 11 1½  16.0/30
8   Nemo SP64o 1.0.1 Beta      00 01 10 0½ 00 ½0 ½½ ** 00 ½1 1½ ½½ ½1 ½½ 1½ 11  14.0/30
9   Toga II 3.1.2SE            10 ½½ ½0 ½0 ½0 ½½ 0½ 11 ** 00 01 1½ ½0 ½½ 11 0½  13.5/30
10  Booot 5.1.0                ½0 00 ½½ 00 00 1½ ½½ ½0 11 ** ½½ 11 ½0 0½ ½½ ½1  13.0/30  179.25
11  Texel 1.01 JA 64-bit       00 00 00 10 ½0 ½0 ½1 0½ 10 ½½ ** ½1 1½ 1½ 10 1½  13.0/30  171.75
12  Thinker 5.4c Inert 64-bit  0½ 00 0½ ½½ ½0 0½ ½1 ½½ 0½ 00 ½0 ** 1½ ½½ 11 1½  12.5/30
13  Glaurung 2.2 JA x64        0½ ½½ ½0 01 ½0 0½ ½0 ½0 ½1 ½1 0½ 0½ ** 11 ½0 00  11.5/30  173.25
14  Grapefruit 1.0             ½0 01 ½0 ½½ ½0 0½ 0½ ½½ ½½ 1½ 0½ ½½ 00 ** 0½ ½1  11.5/30  170.25
15  Tornado 4.88 x64 SSE42     ½½ ½0 0½ 0½ 01 00 00 0½ 00 ½½ 01 00 ½1 1½ ** 11  11.0/30
16  Cyclone xTreme             ½½ 00 0½ 00 ½½ 0½ 0½ 00 1½ ½0 0½ 0½ 11 ½0 00 **   9.0/30


So now we are down to the last and strongest group- Group 1. Which will start this weekend, or the first of the week. As there are still some decisions still hanging out there.



george
User avatar
Ajedrecista
Posts: 2181
Joined: Wed Jul 13, 2011 9:04 pm
Location: Madrid, Spain.

Re: FREEWARE FINAL: Group 2 concludes - last 4 rounds!

Post by Ajedrecista »

Hello George:
geots wrote:What can I say- some mighty tough engines playing some astounding chess. In the end, Hannibal and Quazar had just built up a lead after 26 rounds that no engine could hardly cut into much- it relied too much on what they did as well as what Doch and Protector did.

So they move on to the finals of Group 2:

1.Hannibal 1.2 x64
2.Quazar 0.4 x64

Special congratulations to Sam and Edsel, along with Dmitry. Fine work they have done. In fact- congrats to Doch, Protector and all the other engines in this free-for-all. None have anything to be ashamed of.

I am offering here the final standings in a different view. Just call it a different perspective. But on to the games:






Inspiron 620 Intel i5-4 True Cores
Fritz 11 gui/Fritz 13 gui
1CPU/64-bit where available
128MB hash
Bases=NONE
Ponder_Learning=OFF
Perfect 2012b.ctg w/12-move limit
40/21 Repeating (Benched to adapt to CCRL 40/40)
RR with 2 cycles




Group 2
----------




Round 27
-----------

Texel 1.01 64-bit v Glaurung 2.2 x64 (draw)
Cyclone xTreme v Tornado 4.88 x64 SSE42 (0-1)
Grapefruit 1.0 v Hannibal 1.2 x64 (0-1)
Toga II 3.1.2SE v Spike 1.4 (draw)
Booot 5.1.0 v Protector 1.4.0 x64 (0-1)
Nemo SP64o 1.0.1b v spark 1.0 corei x64 (0-1)
MinkoChess x64 1.3 Popcnt v Quazar 0.4 x64 (draw)
Thinker 5.4c Inert 64-bit v Doch64 1.3.4 (draw)

Round 28
-----------

Glaurung 2.2 x64 v Thinker 5.4c Inert 64-bit (draw)
Doch64 1.3.4 v MinkoChess x64 1.3 Popcnt (1-0)
Quazar 0.4 x64 v Nemo SP64o 1.0.1b (0-1)
spark 1.0 corei x64 v Booot 5.1.0 (1-0)
Protector 1.4.0 x64 v Toga II 3.1.2SE (1-0)
Spike 1.4 v Grapefruit 1.0 (draw)
Hannibal 1.2 x64 v Cyclone xTreme (draw)
Tornado 4.88 x64 SSE42 v Texel 1.01 64-bit (1-0)

Round 29
-----------

Tornado 4.88 x64 SSE42 v Glaurung 2.2 x64 (1-0)
Texel 1.01 64-bit v Hannibal 1.2 x64 (0-1)
Cyclone xTreme v Spike 1.4 (draw)
Grapefruit 1.0 v Protector 1.4.0 x64 (draw)
Toga II 3.1.2SE v spark 1.0 corei x64 (0-1)
Booot 5.1.0 v Quazar 0.4 x64 (0-1)
Nemo SP64o 1.0.1b v Doch64 1.3.4 (0-1)
MinkoChess x64 1.3 Popcnt v Thinker 5.4c Inert 64-bit (0-1)

Round 30
-----------

Glaurung 2.2 x64 v MinkoChess x64 1.3 Popcnt (0-1)
Thinker 5.4c Inert 64-bit v Nemo SP64o 1.0.1b (draw)
Doch64 1.3.4 v Booot 5.1.0 (draw)
Quazar 0.4 x64 v Toga II 3.1.2SE (draw)
spark 1.0 corei x64 v Grapefruit 1.0 (1-0)
Protector 1.4.0 x64 v Cyclone xTreme (1-0)
Spike 1.4 v Texel 1.01 64-bit (1-0)
Hannibal 1.2 x64 v Tornado 4.88 x64 SSE42 (draw)



Code: Select all


Group 2 SP8-Core-i5  2012

                               1  2  3  4  5  6  7  8  9  0  1  2  3  4  5  6  
1   Hannibal 1.2 x64           ** ½½ 01 11 1½ ½½ 11 11 01 ½1 11 1½ 1½ ½1 ½½ ½½  21.5/30
2   Quazar 0.4 x64             ½½ ** 00 ½½ 11 11 ½½ 10 ½½ 11 11 11 ½½ 10 ½1 11  20.5/30
3   Doch64 1.3.4 JA            10 11 ** ½0 ½0 ½0 ½1 01 ½1 ½½ 11 1½ ½1 ½1 1½ 1½  19.0/30  270.00
4   Protector 1.4.0 x64        00 ½½ ½1 ** 1½ 1½ 01 1½ ½1 11 01 ½½ 10 ½½ 1½ 11  19.0/30  267.75
5   spark-1.0 corei x64        0½ 00 ½1 0½ ** ½1 ½0 11 ½1 11 ½1 ½1 ½1 ½1 10 ½½  18.0/30
6   Spike 1.4                  ½½ 00 ½1 0½ ½0 ** 10 ½1 ½½ 0½ ½1 1½ 1½ 1½ 11 1½  17.0/30
7   MinkoChess x64 1.3 Popcnt  00 ½½ ½0 10 ½1 01 ** ½½ 1½ ½½ ½0 ½0 ½1 1½ 11 1½  16.0/30
8   Nemo SP64o 1.0.1 Beta      00 01 10 0½ 00 ½0 ½½ ** 00 ½1 1½ ½½ ½1 ½½ 1½ 11  14.0/30
9   Toga II 3.1.2SE            10 ½½ ½0 ½0 ½0 ½½ 0½ 11 ** 00 01 1½ ½0 ½½ 11 0½  13.5/30
10  Booot 5.1.0                ½0 00 ½½ 00 00 1½ ½½ ½0 11 ** ½½ 11 ½0 0½ ½½ ½1  13.0/30  179.25
11  Texel 1.01 JA 64-bit       00 00 00 10 ½0 ½0 ½1 0½ 10 ½½ ** ½1 1½ 1½ 10 1½  13.0/30  171.75
12  Thinker 5.4c Inert 64-bit  0½ 00 0½ ½½ ½0 0½ ½1 ½½ 0½ 00 ½0 ** 1½ ½½ 11 1½  12.5/30
13  Glaurung 2.2 JA x64        0½ ½½ ½0 01 ½0 0½ ½0 ½0 ½1 ½1 0½ 0½ ** 11 ½0 00  11.5/30  173.25
14  Grapefruit 1.0             ½0 01 ½0 ½½ ½0 0½ 0½ ½½ ½½ 1½ 0½ ½½ 00 ** 0½ ½1  11.5/30  170.25
15  Tornado 4.88 x64 SSE42     ½½ ½0 0½ 0½ 01 00 00 0½ 00 ½½ 01 00 ½1 1½ ** 11  11.0/30
16  Cyclone xTreme             ½½ 00 0½ 00 ½½ 0½ 0½ 00 1½ ½0 0½ 0½ 11 ½0 00 **   9.0/30


So now we are down to the last and strongest group- Group 1. Which will start this weekend, or the first of the week. As there are still some decisions still hanging out there.



george
I stay tuned for Group 1. Thanks for this Mega-freeware tournament! :)

You usually upload the PGN file of each group... as I do not see the PGN file of Group 2, I calculate rating performances of this group with an own algorithm without iterations, that give very similar result to EloSTAT output:

Code: Select all

Round Robin with 16 engines and     30 games per engine.
Total number of games:       240 games.
 
 2850.00 (engine 01).
 2824.24 (engine 02).
 2788.15 (engine 03).
 2788.15 (engine 04).
 2765.28 (engine 05).
 2743.03 (engine 06).
 2721.19 (engine 07).
 2677.89 (engine 08).
 2667.00 (engine 09).
 2656.04 (engine 10).
 2656.04 (engine 11).
 2644.98 (engine 12).
 2622.45 (engine 13).
 2622.45 (engine 14).
 2610.92 (engine 15).
 2562.16 (engine 16).
 
Mean of ratings:  2700.00 Elo.
Randomly, I decided to set the average of all ratings to 2700. The more important thing for me is the difference between the first engine and the last one: 2850 - 2562.16 = 287.84 Elo; I guess that EloSTAT will have a wideness of circa 290 Elo for this tournament.

Just for the record, the time of calculation of the ratings and print the results was 25 ms more less.

Regards from Spain.

Ajedrecista.
User avatar
Ajedrecista
Posts: 2181
Joined: Wed Jul 13, 2011 9:04 pm
Location: Madrid, Spain.

Re: FREEWARE FINAL: Group 2 concludes - last 4 rounds!

Post by Ajedrecista »

Hi again:

Thank you very much for your effort!
Ajedrecista wrote:

Code: Select all

Round Robin with 16 engines and     30 games per engine. 
Total number of games:       240 games. 
  
 2850.00 (engine 01). 
 2824.24 (engine 02). 
 2788.15 (engine 03). 
 2788.15 (engine 04). 
 2765.28 (engine 05). 
 2743.03 (engine 06). 
 2721.19 (engine 07). 
 2677.89 (engine 08). 
 2667.00 (engine 09). 
 2656.04 (engine 10). 
 2656.04 (engine 11). 
 2644.98 (engine 12). 
 2622.45 (engine 13). 
 2622.45 (engine 14). 
 2610.92 (engine 15). 
 2562.16 (engine 16). 
  
Mean of ratings:  2700.00 Elo.
Here is EloSTAT output:

Code: Select all


    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Hannibal 1.2 x64               : 2851  100  93    30    71.7 %   2689   43.3 %
  2 Quazar 0.4 x64                 : 2825  106 102    30    68.3 %   2691   36.7 %
  3 Protector 1.4.0 x64            : 2789  101  99    30    63.3 %   2694   40.0 %
  4 Doch64 1.3.4 JA                : 2789  101  99    30    63.3 %   2694   40.0 %
  5 spark-1.0 corei x64            : 2766  101  99    30    60.0 %   2695   40.0 %
  6 Spike 1.4                      : 2743   94  92    30    56.7 %   2697   46.7 %
  7 MinkoChess x64 1.3 Popcnt      : 2721   93  93    30    53.3 %   2698   46.7 %
  8 Nemo SP64o 1.0.1 Beta          : 2678   99  99    30    46.7 %   2701   40.0 %
  9 Toga II 3.1.2SE                : 2667   96  97    30    45.0 %   2702   43.3 %
 10 Texel 1.01 JA 64-bit           : 2656  104 106    30    43.3 %   2702   33.3 %
 11 Booot 5.1.0                    : 2656   92  94    30    43.3 %   2702   46.7 %
 12 Thinker 5.4c Inert 64-bit      : 2645   89  91    30    41.7 %   2703   50.0 %
 13 Glaurung 2.2 JA x64            : 2622   95  98    30    38.3 %   2705   43.3 %
 14 Grapefruit 1.0                 : 2622   80  85    30    38.3 %   2705   56.7 %
 15 Tornado 4.88 x64 SSE42         : 2611  105 108    30    36.7 %   2705   33.3 %
 16 Cyclone xTreme                 : 2562   98 103    30    30.0 %   2709   40.0 %
The rating wideness is 2851 - 2562 = 289 Elo (I predicted 290). Well, EloSTAT rounds up to 1 Elo, but at least I obtained almost the same ratings than EloSTAT without PGN files! :) My wideness is always a bit bigger (less than 1%) than EloSTAT.

I stay tuned for Group 1.

Regards from Spain.

Ajedrecista.
User avatar
geots
Posts: 4790
Joined: Sat Mar 11, 2006 12:42 am

Re: FREEWARE FINAL: Group 2 concludes - last 4 rounds!

Post by geots »

Ajedrecista wrote:Hi again:

Thank you very much for your effort!
Ajedrecista wrote:

Code: Select all

Round Robin with 16 engines and     30 games per engine. 
Total number of games:       240 games. 
  
 2850.00 (engine 01). 
 2824.24 (engine 02). 
 2788.15 (engine 03). 
 2788.15 (engine 04). 
 2765.28 (engine 05). 
 2743.03 (engine 06). 
 2721.19 (engine 07). 
 2677.89 (engine 08). 
 2667.00 (engine 09). 
 2656.04 (engine 10). 
 2656.04 (engine 11). 
 2644.98 (engine 12). 
 2622.45 (engine 13). 
 2622.45 (engine 14). 
 2610.92 (engine 15). 
 2562.16 (engine 16). 
  
Mean of ratings:  2700.00 Elo.
Here is EloSTAT output:

Code: Select all


    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Hannibal 1.2 x64               : 2851  100  93    30    71.7 %   2689   43.3 %
  2 Quazar 0.4 x64                 : 2825  106 102    30    68.3 %   2691   36.7 %
  3 Protector 1.4.0 x64            : 2789  101  99    30    63.3 %   2694   40.0 %
  4 Doch64 1.3.4 JA                : 2789  101  99    30    63.3 %   2694   40.0 %
  5 spark-1.0 corei x64            : 2766  101  99    30    60.0 %   2695   40.0 %
  6 Spike 1.4                      : 2743   94  92    30    56.7 %   2697   46.7 %
  7 MinkoChess x64 1.3 Popcnt      : 2721   93  93    30    53.3 %   2698   46.7 %
  8 Nemo SP64o 1.0.1 Beta          : 2678   99  99    30    46.7 %   2701   40.0 %
  9 Toga II 3.1.2SE                : 2667   96  97    30    45.0 %   2702   43.3 %
 10 Texel 1.01 JA 64-bit           : 2656  104 106    30    43.3 %   2702   33.3 %
 11 Booot 5.1.0                    : 2656   92  94    30    43.3 %   2702   46.7 %
 12 Thinker 5.4c Inert 64-bit      : 2645   89  91    30    41.7 %   2703   50.0 %
 13 Glaurung 2.2 JA x64            : 2622   95  98    30    38.3 %   2705   43.3 %
 14 Grapefruit 1.0                 : 2622   80  85    30    38.3 %   2705   56.7 %
 15 Tornado 4.88 x64 SSE42         : 2611  105 108    30    36.7 %   2705   33.3 %
 16 Cyclone xTreme                 : 2562   98 103    30    30.0 %   2709   40.0 %
The rating wideness is 2851 - 2562 = 289 Elo (I predicted 290). Well, EloSTAT rounds up to 1 Elo, but at least I obtained almost the same ratings than EloSTAT without PGN files! :) My wideness is always a bit bigger (less than 1%) than EloSTAT.

I stay tuned for Group 1.

Regards from Spain.

Ajedrecista.



It might be of interest that CCRL had Hannibal rated at the top with 2948 and Texel last with 2799. Being that CCRL's list of these 16 is probably very accurate- of the top 8 engines in their list- 7 of them actually finished in the top 8- could it be that possibly assigning an elo rating to engines in a 2 cycle RR could cause a lot of inflated as well as deflated ratings more often than not? I would have no idea, but the case could be made that they basically finished close to where expected. The biggest inaccuracies being Texel higher than expected, and Thinker lower than expected. Tho I have never been one to go about trumpeting Thinker's strength- it is strong but always seems to me to be overrated- whichever list I look at.

And thank you very much again for your interesting analysis.



Best,

george