4'+2" UPDATE: Loop 13.5 is here... matches underway

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Rebel, chrisw

Erik Roggenburg

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Erik Roggenburg »

I figured I'd give an update with a couple of ratings lists.

Full List:

Code: Select all

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Rybka 2.3.1 mp 32-bit          : 2830   16  15  1600    74.3 %   2645   27.9 %
  2 Rybka 2.2 mp 32-bit            : 2818   15  15  1580    72.7 %   2648   30.4 %
  3 Rybka 2.3 LK mp 32-bit         : 2813   18  18  1140    73.1 %   2639   28.8 %
  4 Rybka 2.3 mp 32-bit            : 2812   18  18  1140    73.0 %   2639   30.4 %
  5 Loop 13.5.32 2CPU              : 2737   24  24   600    64.3 %   2635   31.3 %
  6 HIARCS 11.1 MP UCI             : 2730   14  14  1600    61.5 %   2649   34.9 %
  7 Deep Fritz 10                  : 2724   14  14  1680    59.6 %   2657   30.7 %
  8 LoopMP 12.32 2CPU              : 2704   14  14  1580    57.4 %   2653   35.1 %
  9 HIARCS 11 MP UCI               : 2698   14  14  1480    58.1 %   2641   37.2 %
 10 Deep Shredder 10 UCI           : 2679   14  14  1580    53.6 %   2654   29.1 %
 11 Naum 2.1 MP                    : 2666   14  14  1400    50.6 %   2662   37.9 %
 12 HIARCS 11 UCI                  : 2655   15  15  1380    52.7 %   2636   35.7 %
 13 Deep Junior 10.1               : 2642   14  14  1680    47.4 %   2660   31.0 %
 14 Toga II 1.2.1a                 : 2640   14  14  1580    47.9 %   2655   33.2 %
 15 Spike 1.2 Turin                : 2636   14  14  1580    47.2 %   2655   36.4 %
 16 Hiarcs X54 UCI                 : 2631   16  16  1140    51.2 %   2623   35.7 %
 17 Hiarcs X50 UCI                 : 2629   16  16  1140    51.0 %   2623   36.7 %
 18 Fruit 2.2.1                    : 2625   14  14  1580    45.6 %   2656   32.8 %
 19 Fritz 9                        : 2625   15  15  1380    48.1 %   2638   29.9 %
 20 Glaurung 1.2 SMP               : 2586   15  15  1580    39.9 %   2657   28.0 %
 21 Chess Tiger 2007 UCI           : 2564   14  14  1680    36.1 %   2663   32.0 %
 22 Naum 2.0                       : 2550   16  16  1200    38.9 %   2629   36.2 %
 23 Scorpio 1.91 2CPU              : 2544   16  16  1440    31.8 %   2677   30.3 %
 24 Deep Pharaon 3.5.1             : 2527   15  15  1580    31.8 %   2659   29.1 %
 25 Chess Tiger 15.0               : 2527   17  17  1140    35.8 %   2628   34.5 %
 26 Deep Frenzee 3.0               : 2509   15  15  1680    29.0 %   2665   24.6 %
 27 Scorpio 1.8 2CPU               : 2505   16  16  1380    31.1 %   2643   30.0 %
Pruned List:

Code: Select all

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Rybka 2.3.1 mp 32-bit          : 2826   16  16  1500    75.4 %   2632   26.2 %
  2 Loop 13.5.32 2CPU              : 2734   24  24   600    64.3 %   2632   31.3 %
  3 HIARCS 11.1 MP UCI             : 2727   15  15  1400    63.4 %   2631   33.1 %
  4 Deep Fritz 10                  : 2722   18  18  1020    59.8 %   2653   29.7 %
  5 LoopMP 12.32 2CPU              : 2708   18  18   920    58.9 %   2645   36.7 %
  6 Deep Shredder 10 UCI           : 2666   19  19   920    52.6 %   2648   27.6 %
  7 Naum 2.1 MP                    : 2662   18  18   920    52.0 %   2648   36.3 %
  8 Deep Junior 10.1               : 2639   18  18  1020    47.3 %   2658   31.4 %
  9 Toga II 1.2.1a                 : 2633   19  19   920    47.5 %   2650   32.0 %
 10 Fruit 2.2.1                    : 2632   18  18   920    47.4 %   2650   33.5 %
 11 Spike 1.2 Turin                : 2627   18  18   920    46.6 %   2650   37.2 %
 12 Glaurung 1.2 SMP               : 2594   19  19   920    41.7 %   2653   29.2 %
 13 Chess Tiger 2007 UCI           : 2556   18  18  1020    35.1 %   2663   30.6 %
 14 Scorpio 1.91 2CPU              : 2545   18  18  1020    33.5 %   2663   32.4 %
 15 Deep Pharaon 3.5.1             : 2531   20  20   920    32.6 %   2657   28.9 %
 16 Deep Frenzee 3.0               : 2509   20  20  1020    28.9 %   2665   23.5 %
And, of course:

Code: Select all

Time control: 4'+2" 
Hash: 128 MB 
EGTBs: 3, 4, 5, and some 6 piece tables are available. Engines access or do not access based upon their default settings. 
Testset: Noomen 2006 (through 2007 Mar 03); Silver Suite v2 (FULL) 
Ponder: OFF 
Hardware: AMD X2 4400+ with 2GB of RAM 

GUIs: Primarily Deep Fritz 10, but I have used Arena, Lokasoft's ERT, and Shredderchess too.  Received a version of Scorpio 1.91 from Daniel Sharwul on 3/16/07.  That version is being used in DF10 GUI, and so far, no crashes!
 
Cores and such: If an engine is capable of running on multiple cores, it does so in my tests. Since my hardware is dual core, I can't run an engine on anything greater than 2 cores. 

I wouldn't call an engine "Blah-blah 3.98 MP" or "Deep Flarbin Blah 7.3XY" if it were not running on 2 cores. If I ever did run Deep Fritz 10 on 1 CPU, I'd add the tag "1CPU" to its name. Similar to how I handle Scorpio 1.8 - I have to add the 2CPU tag so everyone knows it is running on 2 cores.
Spock

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Spock »

Ah that is interesting, so you do show a good improvement for 13.5 32-bit 2CPU

I used the 64-bit exe, and I wonder if there is a problem with that
Spock

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Spock »

Spock wrote: I used the 64-bit exe, and I wonder if there is a problem with that
Our 32-bit blitz results aren't good either so far, so 64-bit version is probably fine
Tony Thomas

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Tony Thomas »

Spock wrote:Ah that is interesting, so you do show a good improvement for 13.5 32-bit 2CPU

I used the 64-bit exe, and I wonder if there is a problem with that
Its only a 30 point improvement, but at that level it is still a very good improvement. It is very well possible that loop doesnt like your computer Ray.
Spock

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Spock »

Tony Thomas wrote: Its only a 30 point improvement, but at that level it is still a very good improvement. It is very well possible that loop doesnt like your computer Ray.
Not just my computer, other CCRL testers as well :)

Just a matter of different testing conditions presumably. I'm particularly keen to see CEGT blitz results
Tony Thomas

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Tony Thomas »

Spock wrote:
Tony Thomas wrote: Its only a 30 point improvement, but at that level it is still a very good improvement. It is very well possible that loop doesnt like your computer Ray.
Not just my computer, other CCRL testers as well :)

Just a matter of different testing conditions presumably. I'm particularly keen to see CEGT blitz results
I should seriously consider buying loop, but Tony is broke and Tony doesnt want to spend money from his savings account.
User avatar
Dr.Wael Deeb
Posts: 9773
Joined: Wed Mar 08, 2006 8:44 pm
Location: Amman,Jordan

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Dr.Wael Deeb »

Tony Thomas wrote:
Spock wrote:
Tony Thomas wrote: Its only a 30 point improvement, but at that level it is still a very good improvement. It is very well possible that loop doesnt like your computer Ray.
Not just my computer, other CCRL testers as well :)

Just a matter of different testing conditions presumably. I'm particularly keen to see CEGT blitz results
I should seriously consider buying loop, but Tony is broke and Tony doesnt want to spend money from his savings account.
It can't rain all the time brother 8-)
_No one can hit as hard as life.But it ain’t about how hard you can hit.It’s about how hard you can get hit and keep moving forward.How much you can take and keep moving forward….
Tony Thomas

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Tony Thomas »

Dr.Wael Deeb wrote:
Tony Thomas wrote:
Spock wrote:
Tony Thomas wrote: Its only a 30 point improvement, but at that level it is still a very good improvement. It is very well possible that loop doesnt like your computer Ray.
Not just my computer, other CCRL testers as well :)

Just a matter of different testing conditions presumably. I'm particularly keen to see CEGT blitz results
I should seriously consider buying loop, but Tony is broke and Tony doesnt want to spend money from his savings account.
It can't rain all the time brother 8-)
I think it would be insane to spend money from a savings account to buy a chess program. If I dont have the money, I just have to wait until I have some to buy the program. I am working 42 hrs this week because the stupid girl at my work didnt want to work today evening so she changed my schedule without my permission. I guess I can go talk shit to her, but I want the money so I would glady work. :wink:
rdan1987

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by rdan1987 »

Erik Roggenburg wrote:I figured I'd give an update with a couple of ratings lists.

Full List:

Code: Select all

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Rybka 2.3.1 mp 32-bit          : 2830   16  15  1600    74.3 %   2645   27.9 %
  2 Rybka 2.2 mp 32-bit            : 2818   15  15  1580    72.7 %   2648   30.4 %
  3 Rybka 2.3 LK mp 32-bit         : 2813   18  18  1140    73.1 %   2639   28.8 %
  4 Rybka 2.3 mp 32-bit            : 2812   18  18  1140    73.0 %   2639   30.4 %
  5 Loop 13.5.32 2CPU              : 2737   24  24   600    64.3 %   2635   31.3 %
  6 HIARCS 11.1 MP UCI             : 2730   14  14  1600    61.5 %   2649   34.9 %
  7 Deep Fritz 10                  : 2724   14  14  1680    59.6 %   2657   30.7 %
  8 LoopMP 12.32 2CPU              : 2704   14  14  1580    57.4 %   2653   35.1 %
  9 HIARCS 11 MP UCI               : 2698   14  14  1480    58.1 %   2641   37.2 %
 10 Deep Shredder 10 UCI           : 2679   14  14  1580    53.6 %   2654   29.1 %
 11 Naum 2.1 MP                    : 2666   14  14  1400    50.6 %   2662   37.9 %
 12 HIARCS 11 UCI                  : 2655   15  15  1380    52.7 %   2636   35.7 %
 13 Deep Junior 10.1               : 2642   14  14  1680    47.4 %   2660   31.0 %
 14 Toga II 1.2.1a                 : 2640   14  14  1580    47.9 %   2655   33.2 %
 15 Spike 1.2 Turin                : 2636   14  14  1580    47.2 %   2655   36.4 %
 16 Hiarcs X54 UCI                 : 2631   16  16  1140    51.2 %   2623   35.7 %
 17 Hiarcs X50 UCI                 : 2629   16  16  1140    51.0 %   2623   36.7 %
 18 Fruit 2.2.1                    : 2625   14  14  1580    45.6 %   2656   32.8 %
 19 Fritz 9                        : 2625   15  15  1380    48.1 %   2638   29.9 %
 20 Glaurung 1.2 SMP               : 2586   15  15  1580    39.9 %   2657   28.0 %
 21 Chess Tiger 2007 UCI           : 2564   14  14  1680    36.1 %   2663   32.0 %
 22 Naum 2.0                       : 2550   16  16  1200    38.9 %   2629   36.2 %
 23 Scorpio 1.91 2CPU              : 2544   16  16  1440    31.8 %   2677   30.3 %
 24 Deep Pharaon 3.5.1             : 2527   15  15  1580    31.8 %   2659   29.1 %
 25 Chess Tiger 15.0               : 2527   17  17  1140    35.8 %   2628   34.5 %
 26 Deep Frenzee 3.0               : 2509   15  15  1680    29.0 %   2665   24.6 %
 27 Scorpio 1.8 2CPU               : 2505   16  16  1380    31.1 %   2643   30.0 %
Pruned List:

Code: Select all

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Rybka 2.3.1 mp 32-bit          : 2826   16  16  1500    75.4 %   2632   26.2 %
  2 Loop 13.5.32 2CPU              : 2734   24  24   600    64.3 %   2632   31.3 %
  3 HIARCS 11.1 MP UCI             : 2727   15  15  1400    63.4 %   2631   33.1 %
  4 Deep Fritz 10                  : 2722   18  18  1020    59.8 %   2653   29.7 %
  5 LoopMP 12.32 2CPU              : 2708   18  18   920    58.9 %   2645   36.7 %
  6 Deep Shredder 10 UCI           : 2666   19  19   920    52.6 %   2648   27.6 %
  7 Naum 2.1 MP                    : 2662   18  18   920    52.0 %   2648   36.3 %
  8 Deep Junior 10.1               : 2639   18  18  1020    47.3 %   2658   31.4 %
  9 Toga II 1.2.1a                 : 2633   19  19   920    47.5 %   2650   32.0 %
 10 Fruit 2.2.1                    : 2632   18  18   920    47.4 %   2650   33.5 %
 11 Spike 1.2 Turin                : 2627   18  18   920    46.6 %   2650   37.2 %
 12 Glaurung 1.2 SMP               : 2594   19  19   920    41.7 %   2653   29.2 %
 13 Chess Tiger 2007 UCI           : 2556   18  18  1020    35.1 %   2663   30.6 %
 14 Scorpio 1.91 2CPU              : 2545   18  18  1020    33.5 %   2663   32.4 %
 15 Deep Pharaon 3.5.1             : 2531   20  20   920    32.6 %   2657   28.9 %
 16 Deep Frenzee 3.0               : 2509   20  20  1020    28.9 %   2665   23.5 %
Nice rating list , Erik!
But why don't you test Toga II 1.3X4?
Erik Roggenburg

Re: 4'+2" UPDATE: Loop 13.5 is here... matches underwa

Post by Erik Roggenburg »

I'll try and get around to the next Toga soon. I had heard of some troubles with the egbb access. I would have to make a decision to test them with or without if that is the case. If there are issues, I will test them without the egbb and make note of that.

If anyone can help me out and clarify the issues around the latest Toga release and its access of egbb, please let me know.