The Speedy Rating List

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Rebel, chrisw

User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: The Speedy Rating List

Post by Rebel »

Terje wrote: Sat May 30, 2020 1:59 pm Given that a lot of engines have exactly 41 errors it would be interesting to know which positions (or even 1 of them) these occur in.
4Rk2/2R5/2p3p1/3p4/3P3r/P7/1P3P1p/7K b - - bm Kxe8; ce -32000; acd 0;
k5rb/3r4/1ppP4/p1p1p1p1/P1P1Pp1p/1P1R4/3R1PPN/3q1K2 w - - bm Rxd1; ce -32000; acd 0;
8/3P2k1/5p2/6pp/4p3/1b2P2P/6r1/2BK4 w - - bm Ke1; ce -32000; acd 0;
3r2k1/6p1/4p3/1p1nB1Qp/1PpP3P/R1P2P2/1r4qK/8 w - - bm Qxg2; ce -32000; acd 0;

'What they have in common, it's called the chopper move.
90% of coding is debugging, the other 10% is writing bugs.
Ras
Posts: 2488
Joined: Tue Aug 30, 2016 8:19 pm
Full name: Rasmus Althoff

Re: The Speedy Rating List

Post by Ras »

Rebel wrote: Sat May 30, 2020 3:29 pm'What they have in common, it's called the chopper move.
How can an engine have trouble selecting the one and only legal move in a position?
Rasmus Althoff
https://www.ct800.net
Terje
Posts: 347
Joined: Tue Nov 19, 2019 4:34 am
Location: https://github.com/TerjeKir/weiss
Full name: Terje Kirstihagen

Re: The Speedy Rating List

Post by Terje »

Ras wrote: Sat May 30, 2020 3:35 pm
Rebel wrote: Sat May 30, 2020 3:29 pm'What they have in common, it's called the chopper move.
How can an engine have trouble selecting the one and only legal move in a position?
I'm guessing they select it and play it without giving an info string. Also not spending their time thinking, wasting that turns time which they could use to fill TT. Playing only-moves instantly is an error in 'go infinite' and a bad choice in 'go movetime'.

Confirmed to be the case in rofChade (works correctly in 'go infinite').
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: The Speedy Rating List

Post by Rebel »

Added 13 engines to the 4000ms list

Winter 0.8
ProDeo 2.2
Benjamin
CT800 1.40
Minic 2.32
Fire 7.1
Weiss 0.10
Andscacs 0.95
Arasan 22
Defenchess 2.2
Fruit 2.1
Shredder 13
Amoeba 3.1

And a new leader on the scaling list.

http://rebel13.nl/download/speedy-rating-list.html
90% of coding is debugging, the other 10% is writing bugs.
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: The Speedy Rating List

Post by lkaufman »

Rebel wrote: Sat May 30, 2020 6:45 pm Added 13 engines to the 4000ms list

Winter 0.8
ProDeo 2.2
Benjamin
CT800 1.40
Minic 2.32
Fire 7.1
Weiss 0.10
Andscacs 0.95
Arasan 22
Defenchess 2.2
Fruit 2.1
Shredder 13
Amoeba 3.1

And a new leader on the scaling list.

http://rebel13.nl/download/speedy-rating-list.html
Comparing to the CEGT blitz list (I picked CEGT because it uses ORDO, which agrees with normal elo calculations for match results), the difference between Stockfish 11 and Fruite 2.1 (I picked it as it's near the bottom of your list, easy to find on cegt, and a very well known reference point) is 990 elo on CEGT, 532 elo on your list. So doubling your ratings and subtracting a constant would not be way off, but multiplying by 1.86 with a suitable subtraction would be ideal. Of course a proper analysis of all engines on both lists will give a somewhat different ratio, but probably not too far from this 1.86 value.
Komodo rules!
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: The Speedy Rating List

Post by Rebel »

Yes, I will give it attention for the next update.
90% of coding is debugging, the other 10% is writing bugs.
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: The Speedy Rating List

Post by Rebel »

@Larry

On a second thought -

Code: Select all

CEGT 40/4       SF11 3486
CEGT 40/20      SF11 3450
CEGT 40/120     SF11 3409

CCRL 40/2       SF11 3534
CCRL 40/15      SF11 3433

FGRL 60s+0.6    SF11 3497
FGRL 10m+6.0    SF11 3426
FGRL 60m+15.0   SF11 3391
On all 3 rating lists the longer time control the more the elo of an engine drops, with SRL it's exactly the other way around.

Instinctively more natural, one ply deeper it is not, I understand :D

I think I will keep my elo formula.

BTW, the NICE tool is developed for engine tuning, that it also can produce a reasonable reliable rating list was an unexpected bonus.
90% of coding is debugging, the other 10% is writing bugs.
cucumber
Posts: 144
Joined: Sun Oct 14, 2018 8:21 pm
Full name: JSmith

Re: The Speedy Rating List

Post by cucumber »

Rebel wrote: Sun May 31, 2020 9:31 am @Larry

On a second thought -

Code: Select all

CEGT 40/4       SF11 3486
CEGT 40/20      SF11 3450
CEGT 40/120     SF11 3409

CCRL 40/2       SF11 3534
CCRL 40/15      SF11 3433

FGRL 60s+0.6    SF11 3497
FGRL 10m+6.0    SF11 3426
FGRL 60m+15.0   SF11 3391
On all 3 rating lists the longer time control the more the elo of an engine drops, with SRL it's exactly the other way around.

Instinctively more natural, one ply deeper it is not, I understand :D

I think I will keep my elo formula.

BTW, the NICE tool is developed for engine tuning, that it also can produce a reasonable reliable rating list was an unexpected bonus.
I had no idea that you made the tools behind this public or that they were meant for tuning!

I’ve spent all week working on engine tuning using EPDs! I even made a thread asking for help getting quiescent positions. I wish I knew about your tool and datasets beforehand. NICE is a really cool project and really well executed. Thanks for making it public.
Patrice Duhamel
Posts: 193
Joined: Sat May 25, 2013 11:17 am
Location: France
Full name: Patrice Duhamel

Re: The Speedy Rating List

Post by Patrice Duhamel »

I'm curious to see Cheese score, it's possible to add it in the list ? or to download LC1.epd ?
Anything that can go wrong will go wrong.
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: The Speedy Rating List

Post by Rebel »

Patrice Duhamel wrote: Sun May 31, 2020 5:37 pm I'm curious to see Cheese score, it's possible to add it in the list ? or to download LC1.epd ?
If Cheese is still developed I will include it.

The lc1.epd is part of NICE.

http://rebel13.nl/misc/nice.html
90% of coding is debugging, the other 10% is writing bugs.