CCRL update (31st March 2007)

Graham Banks
Posts: 44736
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

CCRL update (31st March 2007)

Post by Graham Banks »

The March 31st update of the CCRL Rating Lists and Statistics is now available for viewing at:
http://www.computerchess.org.uk/ccrl/4040/

The links to the various rating lists can be found just beneath the default Best Versions list.

Our standard testing is at 40 moves in 40 minutes repeating, while our current blitz testing is at both 40 moves in 4 minutes repeating and 40 moves in 12 minutes repeating, with all time controls adjusted to the AMD64 X2 4600+ (2.4GHz).

Our team of testers currently includes:
Graham Banks, Ray Banks, Sarah Bird, Shaun Brewer, Kirill Kryukov, Dom Leste, Tom Logan, Andreas Schwartmann, Joshua Shriver, Charles Smith, George Speight, Chris Taylor, Martin Thoresen, Chuck Wilson and Gabor Szots.

We remain on the lookout for a few more reliable, trustworthy and known testers to help, so please contact one of us if you're interested.


40/40 Notes

743 games were added to our 40/40 database this week, making a total of 52,529 games.
A big thanks to all testers as usual.

Many engines on our list have few games and in many cases their ratings are likely to fluctuate (markedly for some) until a lot more games are played. Therefore no conclusions should be drawn about their strength yet.
To illustrate this point, when an engine has played 200 games, the error margin is still approximately +/-40 ELO; after 500 games it is +/-25 ELO, after 1000 games +/-17 ELO, and even after 2000 games there is a +/-13 ELO error margin!
This of course highlights the importance of looking at other rating lists that are also available in order to draw comparisons and get a more accurate overall picture.
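
For anyone curious where margins of this size come from, here is a rough back-of-the-envelope sketch. It is an illustration only and not the method used to produce the CCRL lists. It assumes evenly matched opposition, a per-game score standard deviation of 0.4 (which corresponds to a 36% draw rate) and a 95% confidence level; the function names are made up for this example.

```python
import math


def elo_from_score(p):
    """Convert an expected score fraction (0 < p < 1) to an Elo difference."""
    return -400.0 * math.log10(1.0 / p - 1.0)


def error_margin(games, sigma=0.4, z=1.96):
    """Approximate Elo error margin after `games` games.

    sigma is the ASSUMED standard deviation of a single game's score
    (0.4 matches evenly matched engines with a 36% draw rate).
    z is the normal quantile for the confidence level (1.96 for 95%).
    """
    # Half-width of the confidence interval on the average score.
    half_width = z * sigma / math.sqrt(games)
    # Translate a score of 50% plus that half-width into Elo.
    return elo_from_score(0.5 + half_width)


for n in (200, 500, 1000, 2000):
    print(f"{n:5d} games: +/-{error_margin(n):.0f} ELO")
```

Under these assumptions the results land close to the margins quoted above, and they show why quadrupling the number of games only halves the margin.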


Multi CPU Engines
Rybka 2.3 has been removed from our 40/40 testing and we will wait for Rybka 2.3.2 to be released.

Rybka 2.2 64 bit 4CPU therefore still heads the rating list at around 3100 ELO.

Zap!Chess Zanzibar 64 bit 4CPU is a clear second on around 3040 ELO.

Hiarcs 11.1 4CPU comes in third at around 2980 ELO.

At about 2960 ELO, Naum 2.1 64-bit 4CPU comes next, well ahead of Deep Shredder 10 64 bit 4CPU on 2928 ELO, Deep Fritz 10 4CPU on 2919 ELO and Deep Junior 10 4CPU on 2914 ELO.

Glaurung 1.2.1 64 bit 4CPU is rated at 2871 ELO.

We have only just started testing Loop 13.5.


Single CPU Engines
Due to the removal of Rybka 2.3 from our testing, Rybka 2.2 64 bit still tops this list at around 3010 ELO.

After a similar number of games, Hiarcs 11.1 lies in second spot on 2903 ELO just ahead of Zap!Chess Zanzibar 64 bit on 2894 ELO.

Fritz 10 on 2879 ELO has opened up a small gap over Loop 13.5 32 bit and Shredder 10 on 2874 ELO.
However, many more games are required for Loop 13.5 before we can say whether it's better or worse than its predecessors.

Toga II 1.2.1a on 2861 ELO comes in next, just ahead of Spike 1.2 Turin on 2851 ELO.
We won't be testing Toga II 1.3 at 40/40 until the egbb bug has been addressed.

Fruit 2.2.1, Junior 10 and Naum 2.1 32 bit are all rated 2840 ELO.

Ktulu 8.0 is on 2805 ELO and Chess Tiger 2007 is on 2794 ELO.

Glaurung 1.2.1 Avalanche 32 bit on 2771 ELO, CM10th Paralyse on 2761 ELO, SmarThink 1.00 32 bit on 2757 ELO and Scorpio 1.91 on 2755 ELO are the others above the 2750 ELO mark.

Amateur engines to keep an eye on as we test them over the coming weeks are Alaric 703, Delfi 5.1 and Twisted Logic 0.085x3x1.


3rd CCRL Amateur Championship (32-bit 1CPU):
Division 1 - Scorpio 1.91 took the crown on tiebreak from Spike 1.2 Turin while Colossus 2006f and Pharaon 3.5.1 were relegated.

Division 2 - Frenzee 3.0, Crafty 21.5 and E.T Chess 18.11.05 earned promotion while Gosu 0.16, AnMon 5.60 and Djinn 0.925x got relegated.

Division 3 - Comet B68, Post Modernist 1016 and Amateur 2.82 gained promotion while Diablo 0.5.1, Aice 0.99.2, Popochin 2.9, Anechka 0.08 and Xpdnt 061120 will drop down a division.

Division 4 - After 25 of 30 rounds played, Arion 1.7, Homer 2.0 and Hamsters 0.2 are the top three. Averno 0.81, Natwarlal 0.14, AliChess 4.06, Tornado 1.0 and Zeus 1.29 are all struggling at the wrong end.

Division 5 - With 14 of the 30 rounds having been played, it is tight at the top with Adam 3.1, Pupsi 0.18, BBChess 1.10 and EveAnn 1.62 currently scrapping it out for promotion. Monarch 1.7, Buzz 0.07 and Prophet 2.0 lie in the three relegation spots.

The amateur engines lower in our list are steadily getting more games. Our aim is that every engine in our list gets 150-200 games at least.

We have a number of tournaments in progress and most of these can also be followed in our public forum.


Blitz Notes

The 40/4 list has just been updated and can currently be viewed here:
http://www.computerchess.org.uk/ccrl/40 ... index.html

Rybka 2.3.1 64 bit 2CPU on 3084 ELO tops the list ahead of Zap!Chess Zanzibar 64 bit 4CPU on 3023 ELO and Hiarcs 11.1 4CPU on 2994 ELO.

Closely grouped together are Deep Shredder 10 64 bit 4CPU on 2947 ELO, Deep Fritz 10 4CPU on 2944 ELO and Naum 2.1 64 bit 4CPU on 2939 ELO.

Deep Junior 10 4CPU on 2917 ELO is next.

Amongst the best version free engines, Toga II 1.3x4 on 2883 ELO heads off Glaurung 1.2.1 32 bit 4CPU on 2861 ELO and Spike 1.2 Turin on 2828 ELO.

We have only just started testing Loop 13.5.

Again, the list of engines that we are testing is extensive with a large range of amateur engines in addition to the commercial ones.


FRC Notes

Ray tests only those engines that can play FRC through the Shredder Classic GUI.
As no new engines that can play FRC have recently come onto the scene, the list hasn't changed since the last update.

Hiarcs 11.1 edges out Shredder 10 for the top spot, but there's very little between the two.

Loop 10.32f is in third place.

Spike 1.2 Turin and Fruit 2.2.1 are neck and neck in the 4th and 5th spots.


Stats/Presentation Notes

The LOS stats on the right-hand side of each rating list are "likelihood of superiority" stats. They tell you the likelihood, in percentage terms, of each engine being superior to the engine directly below it.
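
To give a feel for what an LOS figure means, here is a minimal sketch of a commonly used head-to-head approximation. It is not necessarily how the LOS columns on the rating lists are calculated: it only looks at wins and losses between two engines (draws are ignored) and uses a normal approximation. The function name and the example numbers are hypothetical.

```python
import math


def los_percent(wins, losses):
    """Approximate likelihood of superiority, in percent.

    Uses the normal approximation
        LOS = 0.5 * (1 + erf((wins - losses) / sqrt(2 * (wins + losses))))
    Draws carry no information in this approximation and are left out.
    """
    decisive = wins + losses
    if decisive == 0:
        # No decisive games: nothing favours either engine.
        return 50.0
    x = (wins - losses) / math.sqrt(2.0 * decisive)
    return 50.0 * (1.0 + math.erf(x))


# Hypothetical result: 60 wins and 40 losses against the engine below.
print(f"LOS: {los_percent(60, 40):.1f}%")
```

An even score gives 50%, and the figure moves towards 100% as the winning margin grows relative to the number of decisive games.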

All games are also available for download through the link given at the top of this post. They can be downloaded by engine or by month.
ELO ratings are now saved in all game databases for those engines that have 150 games or more.

A list of games played this week per engine can be found in the update thread in the CCRL public forum, accessible through the link given at the top of this post.
Spock

Re: CCRL update (31st March 2007)

Post by Spock »

Should something happen to the main server, the update is also available on the backup server here:

http://ccrl.org.uk/ccrl/4040/index.html
Alessandro Scotti

Re: CCRL update (31st March 2007)

Post by Alessandro Scotti »

Graham Banks wrote:FRC Notes

Ray tests only those engines that can play FRC through the Shredder Classic GUI.
As no new engines that can play FRC have recently come onto the scene, the list hasn't changed since the last update.
I don't know if there's a cap on the program strength, but Hamsters should be able to play FRC under Shredder. 8-) (I'm not suggesting you start testing it, this is just a reminder that those little Hamsters can be quite versatile at times! ;-))
Dr.Wael Deeb
Posts: 9773
Joined: Wed Mar 08, 2006 8:44 pm
Location: Amman,Jordan

Re: CCRL update (31st March 2007)

Post by Dr.Wael Deeb »

An awesome effort from the CCRL team, as always :D
The most remarkable engine performance that caught my eye was Naum's. I mean, look at the fourth place occupied by this amazing engine, leaving behind well-known giants...
No one can hit as hard as life. But it ain’t about how hard you can hit. It’s about how hard you can get hit and keep moving forward. How much you can take and keep moving forward...
Graham Banks
Posts: 44736
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

Re: CCRL update (31st March 2007)

Post by Graham Banks »

Alessandro Scotti wrote:
Graham Banks wrote:FRC Notes

Ray tests only those engines that can play FRC through the Shredder Classic GUI.
As no new engines that can play FRC have recently come onto the scene, the list hasn't changed since the last update.
I don't know if there's a cap on the program strength, but Hamsters should be able to play FRC under Shredder. 8-) (I'm not suggesting you start testing it, this is just a reminder that those little Hamsters can be quite versatile at times! ;-))
Hi Alessandro,

I'll draw this to Ray's attention. 8-)

Regards, Graham.
Spock

Re: CCRL update (31st March 2007)

Post by Spock »

Dr.Wael Deeb wrote: The most remarkable engine performance that caught my eye was Naum's. I mean, look at the fourth place occupied by this amazing engine, leaving behind well-known giants...
Yes - and considering it cost less than 20 Euros, totally amazing value for money as well :D
Tony Thomas

Re: CCRL update (31st March 2007)

Post by Tony Thomas »

Did anyone notice that a two-week-old Romibeta scored 2479 on the blitz list? It is ahead of previous versions of Arasan. However, I wasn't able to find Danasah on the list; my eyes fool me sometimes.
pedrox
Posts: 1056
Joined: Fri Mar 10, 2006 6:07 am
Location: Basque Country (Spain)

Re: CCRL update (31st March 2007)

Post by pedrox »

Hi Tony,

Thanks for your interest in DanaSah. If you send me an email at pecastro@msn.com, I can send you version 2.99 (tested by Leo: http://f27.parsimony.net/forum67828/messages/5450.htm ; I do not know whether it is an improvement over 2.85k, but it is intended to play better at long time controls) or version 3.01, which supports the ProDeo book.

The same goes for anyone else.

Best,

Pedro
Spock

Re: CCRL update (31st March 2007)

Post by Spock »

I will start testing Hamsters 0.2 under FRC in a few days' time and see how it goes :)
Shaun
Posts: 323
Joined: Wed Mar 08, 2006 9:55 pm
Location: Brighton - UK

Re: CCRL update (31st March 2007)

Post by Shaun »

Tony Thomas wrote: Did anyone notice that a two-week-old Romibeta scored 2479 on the blitz list? It is ahead of previous versions of Arasan. However, I wasn't able to find Danasah on the list; my eyes fool me sometimes.
Hi Tony,

All RomiChess and DanaSah results at 40/4

mmmm DanaSah is lacking some games - must fix that - which version should I test?

I have a 2nd RomiChess Gauntlet finishing; once that is done, I will probably look at DanaSah.

Shaun