CEGT - rating lists February 10th 2013

Werner · Post by **Werner** » Sun Feb 10, 2013 11:22 am

Hi all,

our actual rating lists are online and can be found under the attached links.

40 / 20:
New games: 1597; 41 different engines
Total: 643.061

NEW Engines
676 Rodent 0.18 x64: 2473 - 250 games (starts with around +20 to Version 0.17)
763 Djinn 0.979 x64 : 2427 - 400 games (+77 to Version 0.969)

UPDATES
97 Hannibal 1.3 x64 4CPU: 2879 - 1604 games (-6)
405 Gaviota 0.86 x64 1CPU: 2648 - 820 games (+2)
533 DiscoCheck 4.00 x64 : 2563 - 496 games (-37 and now close to Version 3.62)
876 Nebula 1.5 x64 1CPU: 2356 - 653 games (+27 and now +78 to Version 1.0)
711 Rodent 0.17 x64: 2454 - 701 games (-15!)

40 / 4:
no update this week!

40/120
See here our new single-list ):
http://www.husvankempen.de/nunn//40120n ... liste.html from 21th of January.

40/20 pb=on
Last update was 23th Jan.

A big „Thank you“ to all testers as usual!!

Links

40/20: http://www.husvankempen.de/nunn/rating.htm
Blitz: http://www.husvankempen.de/nunn/blitz.htm
40/120: http://www.husvankempen.de/nunn/rating120.htm
Tester: http://www.husvankempen.de/nunn/testers/testers.htm
40/20 pb=on: http://www.husvankempen.de/nunn/rating4020PBON.htm
Games of the week: http://www.husvankempen.de/nunn/40_40%2 ... on/gow.jpg

Werner Schuele
CEGT-Team

Günther Höhne · Post by **Günther Höhne** » Sun Feb 10, 2013 4:56 pm

Thanks Werner, Djinn is a new very interesting engine for me.

Dragan · Post by **Dragan** » Sun Feb 10, 2013 5:22 pm

Thanks Werner and CEGT team.
I am very happy with almost 80 ELO improvement from my evaluation changes.
My bullet tests indicated this, but I was afraid it will not translate into the longer time controls.
Current development version with LMR and singular extensions is already 100 ELO stronger than 1.5, but I would like to reach 2500 with it.
After that, ELO improvements will be much harder to achieve, because the LMR and SE were the last 'easy ELO' techniques that were missing in Nebula.

Tom Likens · Post by **Tom Likens** » Sun Feb 10, 2013 5:47 pm

Dragan wrote:Thanks Werner and CEGT team.
I am very happy with almost 80 ELO improvement from my evaluation changes.
My bullet tests indicated this, but I was afraid it will not translate into the longer time controls.
Current development version with LMR and singular extensions is already 100 ELO stronger than 1.5, but I would like to reach 2500 with it.
After that, ELO improvements will be much harder to achieve, because the LMR and SE were the last 'easy ELO' techniques that were missing in Nebula.

Dragan,

I've been very impressed with your progress on Nebula in a short amount of time. It's becoming a very strong engine, well done.

regards,
--tom

Dragan · Post by **Dragan** » Sun Feb 10, 2013 6:56 pm

Thanks Tom.
As I mentioned, improvements come easy at first, because you just implement standard techniques that every engine has.
The trick is in tuning and making it all work together and doing it without introducing major bugs.
After 2.0 my rate of improvement will most likely slow down. All the standard stuff will be implemented and I want to work on engine style a bit.
Don't like the way Nebula evaluates some positions. My positional eval scores seem to be too low.
Cheers, Dragan

Tom Likens · Post by **Tom Likens** » Sun Feb 10, 2013 7:12 pm

Dragan wrote:Thanks Tom.
As I mentioned, improvements come easy at first, because you just implement standard techniques that every engine has.
The trick is in tuning and making it all work together and doing it without introducing major bugs.
After 2.0 my rate of improvement will most likely slow down. All the standard stuff will be implemented and I want to work on engine style a bit.
Don't like the way Nebula evaluates some positions. My positional eval scores seem to be too low.
Cheers, Dragan

Yeah, I've only started to make real progress again by using "cutechess-cli" to run thousands and thousands of games. It's slow and tedious but I don't know of any other way to make real progress. I'm also better now at resisting the urge to change 10 things at once, which makes a hash out of trying to really understand the results. It's an asymptotic hill, but at least it's still going up!

regards,
--tom

Dragan · Post by **Dragan** » Sun Feb 10, 2013 7:33 pm

I spent $2000 on i7 3930 just to do the testing. My electricity bill more than doubled

Learned the hard way you can't test multiple changes at once. No matter how small or 'obvious' they are.

BTW, you made a tactical mistake replying to my post. Djinn just entered my test opponent list (hope it can handle bullet controls)

Tom Likens · Post by **Tom Likens** » Sun Feb 10, 2013 9:17 pm

Dragan wrote:I spent $2000 on i7 3930 just to do the testing. My electricity bill more than doubled
Learned the hard way you can't test multiple changes at once. No matter how small or 'obvious' they are.

BTW, you made a tactical mistake replying to my post. Djinn just entered my test opponent list (hope it can handle bullet controls)

Dragan,

I'm glad it made the cut. Actually, I've spent a lot of time recently on the time management of Djinn, in fact I even added it to my webpage,

http://webpages.charter.net/tlikens/tec ... trols.html

So it should handle just about any (semi-)reasonable time control (I think I tested it down to 1 sec. + 0.1). If it crashes or loses on time, please let me know as I really want it to be rock solid at the faster time controls.

regards,
--tom

CEGT - rating lists February 10th 2013

CEGT - rating lists February 10th 2013

Re: CEGT - rating lists February 10th 2013

Re: CEGT - rating lists February 10th 2013

Re: CEGT - rating lists February 10th 2013

Re: CEGT - rating lists February 10th 2013

Re: CEGT - rating lists February 10th 2013

Re: CEGT - rating lists February 10th 2013

Re: CEGT - rating lists February 10th 2013