CEGT - rating lists February 12th 2012

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
Werner
Posts: 2991
Joined: Wed Mar 08, 2006 10:09 pm
Location: Germany
Full name: Werner Schüle

CEGT - rating lists February 12th 2012

Post by Werner »

Hi all, :D

our actual rating lists are online and can be found under the attached links. We have adjusted our lists. New reference engine is now Deep Shredder 12 x64 1CPU with 2800 points. The difference in startelo was (-)181 points here in our 40/20 list.

40 / 20:
New games: 1862 ; 52 different engines
Total: 573.179

NEW Engines

785 DanaSah 4.88: 2379 - 6000 games (-1 to version 4.66 here - and +26 at the moment in our blitz-tests)

UPDATES
2 Houdini 2.0c x64 4CPU: 3097 - 2642 games (+2)
88 Deep Junior 13 x64 4CPU: 2867 - 252 games (+10 and +5 to version 12.5)
191 Deep Junior 13 x64 1CPU: 2767 - 1048 games (-14 and +5 to version 12.5)

40 / 4:
New games: 8600
All games now: 978.710
New startelo here is 2588 (-204). New reference engine with 2800 points is Deep Shredder 12 x64 1CPU!

New Engines
202 Deep Junior 13 x64 4CPU : 2744 - 1200 games (+9 to version 12.5)
263 Deep Junior 13 w32 1CPU : 2693 - 800 games (+-0 to version 12.5)
542 Cheng 3 v1.07 x64: 2496 - 1000 games (+23 to v. 1.06)
766 DanaSah 4.88: 2388 - 1000 games (+26 to v. 4.66)
807 GreKo 9.0 x64: 2362 - 1000 games (+5 to v. 8.2 here)
1036 EveAnn 1.67: 2154 - 800 games (+45 to v. 1.66)
1074 Waxmann 2011: 2086 - 800 games (-13 to v. 2010)

Updates
3 Critter 1.4 x64 4CPU : 3063 - 2100 games (+-0)
746 Arasan 13.4 w32 1CPU : 2397 - 1100 games (+15)
750 Tornado 4.25 w32 1CPU: 2395 - 1200 games (+1)
857 Murka 2.0 x64 : 2330 - 1400 games (-2)
1029 ECE 12.01: 2165 - 900 games (+11)

40/120
See here our new single-list:
http://www.husvankempen.de/nunn//40120n ... liste.html

A big „Thank you“ to all testers as usual!!

Links

40/20: http://www.husvankempen.de/nunn/rating.htm
Blitz: http://www.husvankempen.de/nunn/blitz.htm
40/120: http://www.husvankempen.de/nunn/rating120.htm
Tester: http://www.husvankempen.de/nunn/testers/testers.htm
Elo-comparison: http://www.husvankempen.de/nunn/Replay/ ... arison.htm
Games of the week: http://www.husvankempen.de/nunn/40_40%2 ... on/gow.jpg

Werner Schuele
CEGT-Team
lucasart
Posts: 3241
Joined: Mon May 31, 2010 1:29 pm
Full name: lucasart

Re: CEGT - rating lists February 12th 2012

Post by lucasart »

Good stuff. Thank you Werner!

Regarding the elo calculation, have you ever tried Bayeselo instead of Elostat ? Just so you know, there are some severe flaws in the Elostat algorithm, and it's fairly accepted and understood by all experts now that Bayeselo is best
User avatar
Werner
Posts: 2991
Joined: Wed Mar 08, 2006 10:09 pm
Location: Germany
Full name: Werner Schüle

Re: CEGT - rating lists February 12th 2012

Post by Werner »

Hi Lucas,
thanks - we will discuss it intern here.
... and DanaSah 4.88 has 600 games - not 6000 :oops:

best wishes
Werner
Werner
mar
Posts: 2665
Joined: Fri Nov 26, 2010 2:00 pm
Location: Czech Republic
Full name: Martin Sedlak

Re: CEGT - rating lists February 12th 2012

Post by mar »

Thanks Werner.
You renormalized the rating list?
And I was so happy I was getting close to 2700 in blitz :)
Elo is relative, sure, but still :) More looks better :wink:

Martin
lucasart
Posts: 3241
Joined: Mon May 31, 2010 1:29 pm
Full name: lucasart

Re: CEGT - rating lists February 12th 2012

Post by lucasart »

mar wrote: Elo is relative, sure, but still :) More looks better :wink:
lol
Hugo
Posts: 782
Joined: Tue Dec 01, 2009 11:10 am

Re: CEGT - rating lists February 12th 2012

Post by Hugo »

Hello Werner

many thanks to the CEGT team for this great lists!
Since when is the list to shredder 2800 based? Thats great!

Regards, Clemens
User avatar
Werner
Posts: 2991
Joined: Wed Mar 08, 2006 10:09 pm
Location: Germany
Full name: Werner Schüle

Re: CEGT - rating lists February 12th 2012

Post by Werner »

Thanks Clemens,
this base is quite new - since today :wink:
I too like to watch your tournaments. I´ve tried Deep Junior 13 x64 1CPU - Houdini 1.5a x64 1CPU in a 50 games match with CEGT 40/120 conditions. And I have had not such a good result as you had. I have had a difference of 187 points. Maybe the smp works quite well on DJ 13?
best wishes
Werner
Werner
IWB
Posts: 1539
Joined: Thu Mar 09, 2006 2:02 pm

Re: CEGT - rating lists February 12th 2012

Post by IWB »

Werner wrote:Thanks Clemens,
this base is quite new - since today :wink:
Ahhh, I think about changing this and you changed first - very good.
Now you change to bayes (I provide a batch file if needed), clean up the 40/20 a bit (remove engines with a low number of games) and things would be perfect.

Bye
Ingo
lkaufman
Posts: 6258
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: CEGT - rating lists February 12th 2012

Post by lkaufman »

Regarding the renormalizing of the list to DS 12 (1 core) = 2800, I think it is a good change, although I think it is also clear that DS 12 on 1 core would crush Carlsen, Aronian, Kasparov, or Anand in a match. The reason I think it is a good change despite this is that engine vs engine ratings show larger differences than would be obtained if they only played top human players, so the result is that while 2800 engines will be underrated now in human terms, the ones at the very top will have ratings that are fairly close to what they would get in human tournaments, in my opinion.
IWB
Posts: 1539
Joined: Thu Mar 09, 2006 2:02 pm

Re: CEGT - rating lists February 12th 2012

Post by IWB »

lkaufman wrote:Regarding the renormalizing of the list to DS 12 (1 core) = 2800, I think it is a good change, although I think it is also clear that DS 12 on 1 core would crush Carlsen, Aronian, Kasparov, or Anand in a match. The reason I think it is a good change despite this is that engine vs engine ratings show larger differences than would be obtained if they only played top human players, so the result is that while 2800 engines will be underrated now in human terms, the ones at the very top will have ratings that are fairly close to what they would get in human tournaments, in my opinion.
I agree that S12 1core is rated to low compared to humans. Everything else is guessing as we do not have any reliable number ...

Anyhow, if the CEGT will change to Bayes as well AND they want to raise the raiting to something else I am open for suggestions and willing to go with them to a reasonable point ...

Bye
Ingo

PS: Hello CEGT: Put together S12 x64 and S12 w32. The Engines ARE identical and you have a much better base then!