CEGT - rating lists February 12th 2012

Werner · Post by **Werner** » Sun Feb 12, 2012 1:16 pm

Hi all,

our current rating lists are online and can be found at the links below. We have adjusted our lists: the new reference engine is now Deep Shredder 12 x64 1CPU, set to 2800 points. The difference in startelo was -181 points in our 40/20 list.
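A minimal sketch of what such a renormalization amounts to, assuming it is a constant shift applied to every engine so that the reference engine lands on 2800. The function name `renormalize` and the old ratings below are hypothetical, chosen only so that the shift comes out at -181; they are not taken from the CEGT data.

```python
def renormalize(ratings, reference, target=2800):
    """Shift every rating by the same offset so that `reference` equals `target`.

    Rating differences between engines are unchanged; only the anchor moves.
    """
    offset = target - ratings[reference]
    shifted = {name: rating + offset for name, rating in ratings.items()}
    return shifted, offset


# Hypothetical old ratings (illustration only, not real CEGT values).
old_ratings = {
    "Deep Shredder 12 x64 1CPU": 2981,
    "Engine A": 3100,
    "Engine B": 2560,
}

new_ratings, offset = renormalize(old_ratings, "Deep Shredder 12 x64 1CPU")
print(offset)
print(new_ratings)
```

Because the same offset is applied to every entry, the gap between any two engines stays the same before and after the adjustment.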

40 / 20:
New games: 1862 ; 52 different engines
Total: 573.179

NEW Engines

785 DanaSah 4.88: 2379 - 6000 games (-1 to version 4.66 here - and +26 at the moment in our blitz-tests)

UPDATES
2 Houdini 2.0c x64 4CPU: 3097 - 2642 games (+2)
88 Deep Junior 13 x64 4CPU: 2867 - 252 games (+10 and +5 to version 12.5)
191 Deep Junior 13 x64 1CPU: 2767 - 1048 games (-14 and +5 to version 12.5)

40 / 4:
New games: 8600
All games now: 978.710
New startelo here is 2588 (-204). New reference engine with 2800 points is Deep Shredder 12 x64 1CPU!

New Engines
202 Deep Junior 13 x64 4CPU : 2744 - 1200 games (+9 to version 12.5)
263 Deep Junior 13 w32 1CPU : 2693 - 800 games (+-0 to version 12.5)
542 Cheng 3 v1.07 x64: 2496 - 1000 games (+23 to v. 1.06)
766 DanaSah 4.88: 2388 - 1000 games (+26 to v. 4.66)
807 GreKo 9.0 x64: 2362 - 1000 games (+5 to v. 8.2 here)
1036 EveAnn 1.67: 2154 - 800 games (+45 to v. 1.66)
1074 Waxmann 2011: 2086 - 800 games (-13 to v. 2010)

Updates
3 Critter 1.4 x64 4CPU : 3063 - 2100 games (+-0)
746 Arasan 13.4 w32 1CPU : 2397 - 1100 games (+15)
750 Tornado 4.25 w32 1CPU: 2395 - 1200 games (+1)
857 Murka 2.0 x64 : 2330 - 1400 games (-2)
1029 ECE 12.01: 2165 - 900 games (+11)

40/120
See here our new single-list:
http://www.husvankempen.de/nunn//40120n ... liste.html

A big "Thank you" to all testers, as usual!

Links

40/20: http://www.husvankempen.de/nunn/rating.htm
Blitz: http://www.husvankempen.de/nunn/blitz.htm
40/120: http://www.husvankempen.de/nunn/rating120.htm
Tester: http://www.husvankempen.de/nunn/testers/testers.htm
Elo-comparison: http://www.husvankempen.de/nunn/Replay/ ... arison.htm
Games of the week: http://www.husvankempen.de/nunn/40_40%2 ... on/gow.jpg

Werner Schuele
CEGT-Team

lucasart · Post by **lucasart** » Sun Feb 12, 2012 1:38 pm

Good stuff. Thank you Werner!

Regarding the Elo calculation, have you ever tried Bayeselo instead of Elostat? Just so you know, there are some severe flaws in the Elostat algorithm, and it is now fairly well accepted and understood by all experts that Bayeselo is best.

Werner · Post by **Werner** » Sun Feb 12, 2012 1:52 pm

Hi Lucas,
thanks - we will discuss it internally here.
... and DanaSah 4.88 has 600 games - not 6000

best wishes
Werner

mar · Post by **mar** » Sun Feb 12, 2012 2:35 pm

Thanks Werner.
You renormalized the rating list?
And I was so happy I was getting close to 2700 in blitz

Elo is relative, sure, but still

More looks better

Martin

lucasart · Post by **lucasart** » Sun Feb 12, 2012 2:48 pm

mar wrote: Elo is relative, sure, but still More looks better

lol

Hugo · Post by **Hugo** » Sun Feb 12, 2012 5:26 pm

Hello Werner

many thanks to the CEGT team for these great lists!
Since when has the list been based on Shredder at 2800? That's great!

Regards, Clemens

Werner · Post by **Werner** » Sun Feb 12, 2012 5:54 pm

Thanks Clemens,
this base is quite new - since today

I like to watch your tournaments too. I tried Deep Junior 13 x64 1CPU against Houdini 1.5a x64 1CPU in a 50-game match under CEGT 40/120 conditions, and I did not get as good a result as you did: I got a difference of 187 points. Maybe the SMP works quite well on DJ 13?
best wishes
Werner
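For readers who want to relate a rating difference like the 187 points above to a match score, here is a rough sketch using the standard logistic Elo formula. It is only an assumption that this matches how the figure was computed; Elostat and Bayeselo each use their own procedure. The function names are made up for illustration.

```python
import math


def expected_score(elo_diff):
    """Expected score (0..1) of the stronger side for a given Elo difference,
    using the standard logistic Elo curve."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


def elo_diff_from_score(score):
    """Inverse of expected_score: Elo difference implied by a score fraction."""
    if not 0.0 < score < 1.0:
        raise ValueError("score must be strictly between 0 and 1")
    return -400.0 * math.log10(1.0 / score - 1.0)


games = 50
score = expected_score(187)
print(f"expected score: {score:.3f}")
print(f"expected points out of {games}: {score * games:.1f}")
```

Under this formula a 187-point gap corresponds to the stronger engine scoring roughly 75% of the points. With only 50 games the uncertainty around such a figure is large.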

IWB · Post by **IWB** » Sun Feb 12, 2012 6:56 pm

Werner wrote:Thanks Clemens,
this base is quite new - since today

Ahhh, I was thinking about changing this and you changed first - very good.
Now if you change to Bayes (I can provide a batch file if needed) and clean up the 40/20 a bit (remove engines with a low number of games), things would be perfect.

Bye
Ingo

lkaufman · Post by **lkaufman** » Sun Feb 12, 2012 7:03 pm

Regarding the renormalizing of the list to DS 12 (1 core) = 2800, I think it is a good change, although I think it is also clear that DS 12 on 1 core would crush Carlsen, Aronian, Kasparov, or Anand in a match. The reason I think it is a good change despite this is that engine vs engine ratings show larger differences than would be obtained if they only played top human players, so the result is that while 2800 engines will be underrated now in human terms, the ones at the very top will have ratings that are fairly close to what they would get in human tournaments, in my opinion.

IWB · Post by **IWB** » Sun Feb 12, 2012 7:08 pm

lkaufman wrote:Regarding the renormalizing of the list to DS 12 (1 core) = 2800, I think it is a good change, although I think it is also clear that DS 12 on 1 core would crush Carlsen, Aronian, Kasparov, or Anand in a match. The reason I think it is a good change despite this is that engine vs engine ratings show larger differences than would be obtained if they only played top human players, so the result is that while 2800 engines will be underrated now in human terms, the ones at the very top will have ratings that are fairly close to what they would get in human tournaments, in my opinion.

I agree that S12 1core is rated too low compared to humans. Everything else is guessing, as we do not have any reliable numbers ...

Anyhow, if the CEGT changes to Bayes as well AND wants to raise the rating to something else, I am open to suggestions and willing to go with them to a reasonable point ...

Bye
Ingo

PS: Hello CEGT: Put S12 x64 and S12 w32 together. The engines ARE identical, and you would have a much better base then!
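A rough sketch of why pooling the games of two identical builds gives a firmer base: the standard error of the mean score shrinks with the square root of the number of games. The win/draw/loss counts below are invented for illustration and are not CEGT results; the helper names are hypothetical too.

```python
import math


def score_and_stderr(wins, draws, losses):
    """Mean score per game and its standard error from a W/D/L record."""
    n = wins + draws + losses
    mean = (wins + 0.5 * draws) / n
    # Per-game variance of the result (1, 0.5 or 0) around the mean score.
    variance = (
        wins * (1.0 - mean) ** 2
        + draws * (0.5 - mean) ** 2
        + losses * (0.0 - mean) ** 2
    ) / n
    return mean, math.sqrt(variance / n)


def pool(record_a, record_b):
    """Add two W/D/L records, treating both builds as the same engine."""
    return tuple(a + b for a, b in zip(record_a, record_b))


# Hypothetical records (wins, draws, losses) for the two builds.
x64_record = (400, 400, 200)
w32_record = (400, 400, 200)

mean_x64, se_x64 = score_and_stderr(*x64_record)
mean_pooled, se_pooled = score_and_stderr(*pool(x64_record, w32_record))
print(f"x64 only: score {mean_x64:.3f} +/- {se_x64:.4f}")
print(f"pooled:   score {mean_pooled:.3f} +/- {se_pooled:.4f}")
```

With twice the games at the same score, the standard error drops by a factor of about 1.41, which is the sense in which the combined entry would be a steadier reference.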
