Something wrong with the testing at CEGT?

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

Dann Corbit
Posts: 12803
Joined: Wed Mar 08, 2006 8:57 pm
Location: Redmond, WA USA

Re: Something wrong with the testing at CEGT?

Post by Dann Corbit »

Simlar results for CCRL:

Deep Fritz 10 2CPU 2956 +57 -56 61.2% -71.1 45.7% 94 53.0%
Deep Fritz 10 4CPU 2928 +21 -21 49.5% +3.9 40.0% 757 52.5%
-28 Elo with a +/-68 Elo window. We can't tell if it is better or worse, but worse might be slightly more probable.

Deep Junior 10 2CPU 2902 +21 -21 54.1% -30.3 36.9% 758 57.8%
Deep Junior 10 4CPU 2912 +19 -19 48.1% +15.6 34.8% 891 49.0%
+10 Elo with a +/- 40 Elo window. We can't tell if it is better or worse, but better might be slightly more probable.

Deep Shredder 10 32-bit 2CPU 2920 +19 -19 57.2% -47.7 37.7% 951 52.4%
Deep Shredder 10 32-bit 4CPU 2959 +100 -98 54.8% -31.0 38.7% 31 49.7%
+39 Elo with a +119/-117 Elo window. We can't tell if it is better or worse, but better might be slightly more probable.

Deep Shredder 10 64-bit 2CPU 2898 +37 -37 47.5% +14.6 42.4% 224 52.2%
Deep Shredder 10 64-bit 4CPU 2931 +18 -18 50.5% -2.8 40.0% 1019 56.5%
+33 Elo with a +55/-55 Elo window. We can't tell if it is better or worse, but better might be slightly more probable.

Glaurung 1.2.1 64-bit 2CPU 2819 +24 -24 38.2% +84.2 35.0% 605 88.2%
Glaurung 1.2.1 64-bit 4CPU 2859 +21 -21 38.3% +77.1 39.0% 735 64.5%
+40 Elo with a +45/-45 Elo window. We can't tell if it is better or worse, but better might be more probable.

Hiarcs 11 2CPU 2926 +29 -29 48.3% +9.1 32.6% 393 63.6%
Hiarcs 11 4CPU 2968 +32 -32 53.8% -26.8 39.1% 312 50.6%
+42 Elo with a +61/-61 Elo window. We can't tell if it is better or worse, but better might slightly be more probable.

Hiarcs 11.1 2CPU 2944 +20 -20 57.6% -49.5 40.8% 799 57.4%
Hiarcs 11.1 4CPU 2985 +23 -23 55.7% -36.0 41.8% 619 63.4%
+41 Elo with a +43/-43 Elo window. We can't tell if it is better or worse, but better might be more probable.

Hiarcs 11.2 2CPU 2940 +49 -48 57.7% -48.7 38.0% 137 56.8%
Hiarcs 11.2 4CPU 2959 +84 -84 50.0% +0.0 46.3% 41 52.4%
+19 Elo with a +133/-132 Elo window. We can't tell if it is better or worse, but better might slightly be more probable.

Loop 13.6 64-bit 2CPU 2927 +33 -33 48.8% +3.7 33.9% 304 51.0%
Loop 13.6 64-bit 4CPU 2941 +25 -25 51.3% -7.2 41.2% 495 51.9%
+14 Elo

Loop M1-T 64-bit 2CPU 2950 +29 -29 47.4% +14.5 45.4% 368 53.1%
Loop M1-T 64-bit 4CPU 2948 +23 -23 52.4% -15.3 47.3% 577 57.9%
-2 Elo

Naum 2.1 64-bit 2CPU 2929 +24 -24 49.2% +4.7 43.6% 562 50.5%
Naum 2.1 64-bit 4CPU 2968 +21 -21 53.8% -24.3 44.7% 700 56.7%
+39 Elo

Naum 2.2 64-bit 2CPU 3016 +99 -95 61.3% -66.0 45.2% 31 59.4%
Naum 2.2 64-bit 4CPU 2975 +49 -49 53.7% -19.8 45.1% 122 58.2%
+41 Elo

Rybka 2.1 32-bit 2CPU 3003 +26 -26 70.8% -141.5 41.1% 508 83.5%
Rybka 2.1 32-bit 4CPU 3083 +47 -45 77.5% -192.3 30.8% 182 60.2%
+83 Elo

Rybka 2.1 64-bit 2CPU 3076 +26 -25 73.4% -166.7 34.3% 571 92.4%
Rybka 2.1 64-bit 4CPU 3087 +43 -41 74.4% -174.1 34.9% 209 54.0%
+11 Elo

Rybka 2.2 64-bit 2CPU 3052 +23 -23 73.0% -163.3 35.0% 697 53.1%
Rybka 2.2 64-bit 4CPU 3107 +30 -29 73.2% -156.5 38.7% 403 57.2%
+55 Elo

Rybka 2.3.2a 64-bit 2CPU 3103 +27 -26 78.6% -197.6 34.8% 529 73.5%
Rybka 2.3.2a 64-bit 4CPU 3118 +29 -28 75.3% -173.0 35.1% 447 68.3%
+15 Elo

Zap!Chess Paderborn 64-bit 2CPU 2909 +27 -27 55.3% -34.0 40.9% 430 64.7%
Zap!Chess Paderborn 64-bit 4CPU 2954 +24 -24 65.2% -102.0 43.7% 558 51.4%
+45 Elo

Zap!Chess Zanzibar 64-bit 2CPU 3023 +42 -41 56.5% -44.9 45.2% 177 56.7%
Zap!Chess Zanzibar 64-bit 4CPU 3051 +22 -22 65.1% -93.9 46.0% 661 85.8%
+28 Elo
User avatar
Werner
Posts: 2999
Joined: Wed Mar 08, 2006 10:09 pm
Location: Germany
Full name: Werner Schüle

Re: Something wrong with the testing at CEGT?

Post by Werner »

Thanks a lot Dann for the statistics.
Nice to read your posts here in my holidays.
Werner