Super Tournament XXXVII

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
Graham Banks
Posts: 44799
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

Re: Super Tournament XXXVII

Post by Graham Banks »

SUPER TOURNAMENT XXXVII

Intel i7-4770k Quad
ChessGUI
256mb hash each where possible
3-4-5 piece tablebases
Ponder off
LowDraw100.cgb book
40 moves in 16 minutes repeating (adapted for CCRL)
6 cycles 54 rounds
New opening book that is fair, but expected to produce a lowish draw rate.
Standings after Round 18

13.0 - Stockfish 16 64-bit
10.5 - Dragon 3.2 by Komodo 64-bit
9.5 - CS Tal 2.00 64-bit
9.0 - RubiChess 20230410 64-bit
9.0 - Berserk 11.1 64-bit
8.5 - Ethereal 14.00 64-bit
8.0 - Koivisto 9.0 64-bit
8.0 - Clover 6.0 64-bit
7.5 - Igel 3.5.0 64-bit
7.0 - Revenge 3.0 64-bit


Web based link for live viewing (courtesy of Jay - Berserk author).
https://ccrl.live/16092

Alternatively, if you install TLCV (Tom's Live Chess Viewer) on your computer, you can watch the games live move by move. You'll also be able to chat to others following the tournament in the chatroom there.
http://kirill-kryukov.com/chess/discuss ... p?id=42959
Host - GrahamCCRL.dyndns.org Port - 16092

Linux users can use Livius:
https://github.com/kmar/livius

There is also a Livius windows version.
It has live pv boards as a nice addition.
http://www.crabaware.com/livius/
gbanksnz at gmail.com
chrisw
Posts: 4661
Joined: Tue Apr 03, 2012 4:28 pm
Location: Midi-Pyrénées
Full name: Christopher Whittington

Re: Super Tournament XXXVII

Post by chrisw »

Graham Banks wrote: Tue Sep 05, 2023 12:07 am
chrisw wrote: Mon Sep 04, 2023 5:33 pm Why was this game declared a draw? They were shuffling, but the "winning" side was going to make a pawn move and change everything sooner or later.
Hi Chris,

I've probably mentioned this before, but I've always used the default ChessGUI adjudication settings:

Image

98% of the time, they're pretty much right.

You just made that statistic up, no?!

Those default values are bonkers and they penalise the "winning" engine in the pair, probably the stronger engine. Result? The entire rating list 40/15 is skewed against the stronger engines, Elos are depressed at the top end.
There were plenty of examples of premature draws I noticed in the past days, but this is the first one I checked on, and, Wow, I'm flabbergasted.

What happens in the the games where strong engines shuffle and then resolves by pushing a pawn as rule 50 approaches? Pawn push changes everything. They're all declared as draws though. This is a really bad inaccuracy and skew bias.
Add to that the elimination of games where an arbitrary lower eval limit isn't breached in the opening and you again skew the rating list by decreasing the possibility that a stronger engine (in the aborted game) then gets a win the next game.

So, two effects we know about:
1. The limit rules are increasing the draw rate
2. Stronger engines are being penalised

You have Komodo assistant author and Berserk author and CSTal author all telling you the limits rules you're using are crazy. CSTal author is telling you the Elo results of the rating lists prepared using these rules are skewed. I suspect Jay and Larry are going to agree with that too.
User avatar
Graham Banks
Posts: 44799
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

Re: Super Tournament XXXVII

Post by Graham Banks »

If you compare the CCRL and CEGT lists, you'll see a similar scenario, in that Stockfish and Dragon by Komodo leave a good gap back to the next 10 or so engines, which are all compressed into a 55-70 Elo range.
Stefan Pohl's list using the Hert500 book also shows this pattern.

That's just the way that things are.

I looked at the first 97 games in this tournament, and find these games that fit into the category you're referring to. It's unfortunate that they both feature CS Tal 2.00:

[pgn][Event "Super Tournament XXXVII"] [Site "ChessGUI2"] [Date "2023.09.03"] [Round "7.3"] [White "Clover 6.0 64-bit"] [Black "CS Tal 2.00 64-bit"] [Result "1/2-1/2"] [Time "10:33:28 AM"] [ECO "B40"] [Opening "Kveinis Variation, Sicilian"] [TimeControl "40/960:40/960:40/960"] [PlyCount "122"] [Number "33"] [Termination "GUI adjudication"] [BlackType "program"] [WhiteType "program"] [Variant "normal"] { Intel i7 Quad } 1.e4 {[%eval 0,1] [%emt 00:00:00]} c5 {[%eval 0,1] [%emt 00:00:00]} 2.Nf3 {[%eval 0,1] [%emt 00:00:00]} e6 {[%eval 0,1] [%emt 00:00:00]} 3.d4 {[%eval 0,1] [%emt 00:00:00]} cxd4 {[%eval 0,1] [%emt 00:00:00]} 4.Nxd4 {[%eval 0,1] [%emt 00:00:00]} Qb6 {[%eval 0,1] [%emt 00:00:00]} 5.Nb3 {[%eval 0,1] [%emt 00:00:00]} Qc7 {[%eval 0,1] [%emt 00:00:00]} 6.g3 {[%eval 0,1] [%emt 00:00:00]} Nf6 {[%eval -74,23] [%emt 00:00:24]} 7.Bg2 {(Bg2) [%eval 42,29] [%emt 00:00:27]} Nc6 {(d6) [%eval -55,24] [%emt 00:00:34]} 8.c4 {(O-O) [%eval 42,32] [%emt 00:00:47]} Bb4 {(Be7) [%eval -78,23] [%emt 00:00:44]} 9.Bd2 {(Bd2) [%eval 35,33] [%emt 00:00:40]} O-O {(O-O) [%eval -88,23] [%emt 00:00:21]} 10.O-O {(O-O) [%eval 57,33] [%emt 00:00:26]} Be7 {(a5) [%eval -79,24] [%emt 00:00:52]} 11.Qe2 {(Qe2) [%eval 66,31] [%emt 00:00:32]} d6 {(d6) [%eval -84,25] [%emt 00:00:20]} 12.Rc1 {(Be3) [%eval 58,34] [%emt 00:00:35]} b6 {(Bd7) [%eval -86,23] [%emt 00:00:55]} 13.Nc3 {(Nc3) [%eval 62,31] [%emt 00:00:34]} Bb7 {(Ba6) [%eval -93,23] [%emt 00:00:30]} 14.Nd5 {(Nd5) [%eval 80,29] [%emt 00:00:46]} Qd7 {(Qd8) [%eval -88,24] [%emt 00:00:28]} 15.Nxe7 {(Nxe7) [%eval 71,31] [%emt 00:00:36]} Qxe7 {(Qxe7) [%eval -88,28] [%emt 00:00:48]} 16.Re1 {(Bc3) [%eval 75,31] [%emt 00:00:35]} Nd7 {(Nd7) [%eval -95,23] [%emt 00:00:19]} 17.Be3 {(Rad1) [%eval 66,32] [%emt 00:00:37]} Nc5 {(Nc5) [%eval -83,22] [%emt 00:00:27]} 18.Rad1 {(Rad1) [%eval 53,30] [%emt 00:00:30]} h6 {(h6) [%eval -91,24] [%emt 00:00:19]} 19.f3 {(Qc2) [%eval 59,30] [%emt 00:00:29]} Rad8 {(Rad8) [%eval -50,23] [%emt 00:00:18]} 20.Qd2 {(Nd4) [%eval 63,32] [%emt 00:00:26]} Qc7 {(Kh8) [%eval -53,24] [%emt 00:00:24]} 21.Nd4 {(Nd4) [%eval 72,35] [%emt 00:00:25]} Nxd4 {(Nxd4) [%eval -66,24] [%emt 00:00:26]} 22.Bxd4 {(Bxd4) [%eval 45,36] [%emt 00:00:27]} e5 {(e5) [%eval -32,27] [%emt 00:00:39]} 23.Be3 {(Be3) [%eval 50,35] [%emt 00:00:22]} f5 {(f5) [%eval -43,24] [%emt 00:00:18]} 24.b4 {(b4) [%eval 65,33] [%emt 00:00:27]} Na6 {(Na6) [%eval -22,27] [%emt 00:00:23]} 25.Qd3 {(Qd3) [%eval 63,31] [%emt 00:00:27]} fxe4 {(fxe4) [%eval -7,22] [%emt 00:00:26]} 26.fxe4 {(fxe4) [%eval 33,35] [%emt 00:00:23]} Bc8 {(Bc8) [%eval -27,26] [%emt 00:00:17]} 27.Rc1 {(a3) [%eval 20,34] [%emt 00:01:07]} Kh7 {(Kh7) [%eval -3,27] [%emt 00:00:17]} 28.Qa3 {(a3) [%eval 32,34] [%emt 00:01:29]} Nb8 {(Nb8) [%eval 0,27] [%emt 00:00:15]} 29.c5 {(c5) [%eval 9,33] [%emt 00:00:15]} dxc5 {(dxc5) [%eval 35,29] [%emt 00:00:22]} 30.bxc5 {(bxc5) [%eval -22,32] [%emt 00:00:21]} b5 {(b5) [%eval 8,28] [%emt 00:00:18]} 31.Red1 {(Qb2) [%eval -2,31] [%emt 00:00:21]} Nc6 {(Rxd1) [%eval 67,22] [%emt 00:00:19]} 32.Rd5 {(Qb2) [%eval -40,33] [%emt 00:00:29]} a6 {(a6) [%eval 77,25] [%emt 00:00:22]} 33.h3 {(h3) [%eval -55,30] [%emt 00:00:13]} Rde8 {(Rde8) [%eval 70,26] [%emt 00:00:21]} 34.Ra1 {(Kh2) [%eval -40,34] [%emt 00:00:19]} Na5 {(Na5) [%eval 69,26] [%emt 00:00:21]} 35.Qd3 {(Kh2) [%eval -50,32] [%emt 00:00:16]} Nc4 {(Nc4) [%eval 73,28] [%emt 00:00:26]} 36.Kh2 {(Kh2) [%eval -50,31] [%emt 00:00:02]} Rf7 {(Be6) [%eval 66,28] [%emt 00:00:17]} 37.Bg1 {(Bg1) [%eval -50,30] [%emt 00:00:03]} Ref8 {(Ref8) [%eval 74,27] [%emt 00:00:24]} 38.a4 {(Qe2) [%eval -50,31] [%emt 00:00:03]} Bb7 {(Be6) [%eval 95,24] [%emt 00:00:21]} 39.Qe2 {(Qe2) [%eval -50,30] [%emt 00:00:02]} Bc6 {(Bc6) [%eval 76,25] [%emt 00:00:19]} 40.Rad1 {(h4) [%eval -50,26] [%emt 00:00:01]} Qe7 {(Rf6) [%eval 102,23] [%emt 00:00:09]} 41.Rc1 {(Ra1) [%eval -61,33] [%emt 00:00:30]} Rf6 {(Rf6) [%eval 83,27] [%emt 00:00:26]} 42.Rcd1 {(Rcd1) [%eval -52,32] [%emt 00:00:34]} R6f7 {(Qe6) [%eval 0,4] [%emt 00:00:00]} 43.Rc1 {(hashfull) [%eval -52,35] [%emt 00:00:14]} Rf6 {(Qc7) [%eval 0,4] [%emt 00:00:00]} 44.Rcd1 {[%eval -52,38] [%emt 00:00:29]} Qe8 {(Qf7) [%eval 78,27] [%emt 00:00:21]} 45.Ra1 {(Ra1) [%eval -52,35] [%emt 00:00:06]} Qe6 {(R6f7) [%eval 79,28] [%emt 00:00:26]} 46.Rad1 {(Rad1) [%eval -52,37] [%emt 00:00:20]} Qe8 {(Qe7) [%eval 0,4] [%emt 00:00:00]} 47.Ra1 {(hashfull) [%eval -52,32] [%emt 00:00:01]} Qe6 {(Qe6) [%eval 0,4] [%emt 00:00:00]} 48.Rad1 {[%eval -52,39] [%emt 00:00:35]} Qf7 {(Qe7) [%eval 78,29] [%emt 00:00:18]} 49.Ra1 {(Ra1) [%eval -52,37] [%emt 00:00:14]} Qe8 {(Qc7) [%eval 78,29] [%emt 00:00:22]} 50.Rc1 {(Rad1) [%eval -52,39] [%emt 00:00:27]} R6f7 {(R6f7) [%eval 79,30] [%emt 00:00:20]} 51.Rcd1 {(Ra1) [%eval -52,40] [%emt 00:00:24]} Kg8 {(Qc8) [%eval 79,28] [%emt 00:00:21]} 52.Rc1 {(Rb1) [%eval -52,40] [%emt 00:00:28]} Kh7 {(Kh8) [%eval 0,4] [%emt 00:00:00]} 53.Rcd1 {(hashfull) [%eval -52,38] [%emt 00:00:11]} Kg8 {(Qc8) [%eval 0,4] [%emt 00:00:00]} 54.Rc1 {[%eval -52,42] [%emt 00:00:40]} Rf6 {(Qa8) [%eval 78,28] [%emt 00:00:30]} 55.Ra1 {(Rcd1) [%eval -52,39] [%emt 00:00:23]} Kh7 {(Qe7) [%eval 0,4] [%emt 00:00:00]} 56.Rad1 {(hashfull) [%eval -52,38] [%emt 00:00:21]} Qf7 {(Qc8) [%eval 0,4] [%emt 00:00:00]} 57.Ra1 {[%eval -52,39] [%emt 00:00:19]} Qb7 {(Kh8) [%eval 77,29] [%emt 00:00:33]} 58.Rc1 {(Rb1) [%eval -52,39] [%emt 00:00:32]} Qc8 {(Qc8) [%eval 77,30] [%emt 00:01:09]} 59.Rcd1 {(Rcd1) [%eval -52,37] [%emt 00:00:11]} R6f7 {(R6f7) [%eval 77,30] [%emt 00:00:23]} 60.Ra1 {(Rc1) [%eval -52,38] [%emt 00:00:19]} Qc7 {(Qb7) [%eval 0,4] [%emt 00:00:00]} 61.Ra2 {(hashfull) [%eval -52,39] [%emt 00:00:26]} Rf6 {(Rf6) [%eval 77,27] [%emt 00:00:37]} 1/2-1/2[/pgn]

[pgn][Event "Super Tournament XXXVII"] [Site "ChessGUI2"] [Date "2023.09.05"] [Round "16.3"] [White "CS Tal 2.00 64-bit"] [Black "Clover 6.0 64-bit"] [Result "1/2-1/2"] [Time "1:57:27 AM"] [ECO "B40"] [Opening "Kveinis Variation, Sicilian"] [TimeControl "40/960:40/960:40/960"] [PlyCount "122"] [Number "78"] [Termination "GUI adjudication"] [BlackType "program"] [WhiteType "program"] [Variant "normal"] { Intel i7 Quad } 1.e4 {[%eval 0,1] [%emt 00:00:00]} c5 {[%eval 0,1] [%emt 00:00:00]} 2.Nf3 {[%eval 0,1] [%emt 00:00:00]} e6 {[%eval 0,1] [%emt 00:00:00]} 3.d4 {[%eval 0,1] [%emt 00:00:00]} cxd4 {[%eval 0,1] [%emt 00:00:00]} 4.Nxd4 {[%eval 0,1] [%emt 00:00:00]} Qb6 {[%eval 0,1] [%emt 00:00:00]} 5.Nb3 {[%eval 0,1] [%emt 00:00:00]} Qc7 {[%eval 0,1] [%emt 00:00:00]} 6.g3 {[%eval 0,1] [%emt 00:00:00]} Nf6 {[%eval -55,31] [%emt 00:00:51]} 7.Bg2 {(Bg2) [%eval 64,25] [%emt 00:00:24]} Be7 {(Be7) [%eval -40,33] [%emt 00:00:31]} 8.O-O {(O-O) [%eval 65,23] [%emt 00:00:22]} O-O {(Nc6) [%eval -24,33] [%emt 00:00:24]} 9.Nc3 {(Qe2) [%eval 56,25] [%emt 00:00:26]} Nc6 {(Nc6) [%eval -33,34] [%emt 00:00:25]} 10.Nb5 {(Nb5) [%eval 43,25] [%emt 00:00:21]} Qb8 {(Qb8) [%eval -28,36] [%emt 00:00:29]} 11.Be3 {(Qe2) [%eval 60,26] [%emt 00:01:06]} a6 {[%eval -7,31] [%emt 00:00:26]} 12.N5d4 {(Nc3) [%eval 49,27] [%emt 00:00:30]} Qc7 {(Qc7) [%eval -19,32] [%emt 00:01:32]} 13.c4 {(h3) [%eval 57,27] [%emt 00:00:21]} d6 {(d6) [%eval -29,34] [%emt 00:00:40]} 14.Rc1 {(Qe2) [%eval 49,26] [%emt 00:00:41]} Bd7 {(Bd7) [%eval -28,32] [%emt 00:00:27]} 15.Nxc6 {(Nxc6) [%eval 55,27] [%emt 00:00:17]} Bxc6 {(Bxc6) [%eval -29,32] [%emt 00:00:34]} 16.Qd3 {(Qd3) [%eval 56,26] [%emt 00:00:19]} b6 {(b6) [%eval -31,33] [%emt 00:00:32]} 17.Nd4 {(Nd4) [%eval 50,26] [%emt 00:00:22]} Bb7 {(Bb7) [%eval -31,30] [%emt 00:01:04]} 18.Rfd1 {(b3) [%eval 57,26] [%emt 00:00:18]} Rac8 {(Rac8) [%eval -22,31] [%emt 00:00:24]} 19.b3 {(b3) [%eval 49,26] [%emt 00:00:32]} h6 {(h6) [%eval -20,30] [%emt 00:00:39]} 20.Ne2 {(Bd2) [%eval 59,27] [%emt 00:00:42]} Qb8 {(Nd7) [%eval -26,35] [%emt 00:00:24]} 21.a4 {(Nc3) [%eval 64,27] [%emt 00:00:53]} Nd7 {(Nd7) [%eval -35,35] [%emt 00:00:15]} 22.Nc3 {(Nc3) [%eval 63,27] [%emt 00:00:28]} Rfd8 {(Nc5) [%eval -35,35] [%emt 00:01:05]} 23.Bd4 {(Bd4) [%eval 44,23] [%emt 00:00:25]} Qc7 {(Nc5) [%eval -29,34] [%emt 00:00:19]} 24.Rb1 {(Qe2) [%eval 46,24] [%emt 00:00:16]} Ba8 {(Bg5) [%eval -41,35] [%emt 00:00:47]} 25.Qe2 {(Qe2) [%eval 51,28] [%emt 00:00:51]} Bf8 {(Bf6) [%eval -35,33] [%emt 00:00:32]} 26.h3 {(Rd2) [%eval 75,26] [%emt 00:00:22]} Bc6 {(Be7) [%eval -35,34] [%emt 00:00:13]} 27.Kh2 {(Kh2) [%eval 74,28] [%emt 00:00:26]} Qb7 {(Ba8) [%eval -35,34] [%emt 00:00:30]} 28.Re1 {(Rd2) [%eval 74,23] [%emt 00:00:30]} Re8 {(Qc7) [%eval -35,34] [%emt 00:00:15]} 29.h4 {(Red1) [%eval 74,23] [%emt 00:00:26]} Qc7 {(Nc5) [%eval -32,35] [%emt 00:00:17]} 30.Red1 {(Red1) [%eval 74,26] [%emt 00:00:19]} Be7 {(Bb7) [%eval -32,34] [%emt 00:00:12]} 31.Rd2 {(Rd2) [%eval 74,26] [%emt 00:00:23]} Qb7 {(Ba8) [%eval -31,32] [%emt 00:00:10]} 32.Re1 {(Rdb2) [%eval 73,25] [%emt 00:00:18]} Bf8 {(Qc7) [%eval -31,32] [%emt 00:00:28]} 33.Red1 {(Qd1) [%eval 74,25] [%emt 00:00:16]} Qc7 {(Nc5) [%eval -32,33] [%emt 00:00:14]} 34.Rc1 {(Be3) [%eval 73,26] [%emt 00:00:15]} Qb7 {(Qb7) [%eval -31,32] [%emt 00:00:08]} 35.Rcd1 {(Rcd1) [%eval 0,4] [%emt 00:00:00]} Qc7 {(hashfull) [%eval -31,33] [%emt 00:00:06]} 36.Rc1 {(Rb1) [%eval 0,4] [%emt 00:00:00]} Qb7 {[%eval -31,33] [%emt 00:00:24]} 37.Rb2 {(Be3) [%eval 70,26] [%emt 00:00:55]} Qb8 {(Qb8) [%eval -31,31] [%emt 00:00:04]} 38.Rd1 {(Rcb1) [%eval 70,26] [%emt 00:00:25]} Nc5 {(Ba8) [%eval -31,30] [%emt 00:00:06]} 39.Rbb1 {(Rdd2) [%eval 70,24] [%emt 00:00:15]} Nd7 {(Nd7) [%eval -31,30] [%emt 00:00:03]} 40.Rb2 {(Rd2) [%eval 0,4] [%emt 00:00:00]} Nc5 {(hashfull) [%eval -31,30] [%emt 00:00:01]} 41.Rbb1 {(Rdd2) [%eval 0,4] [%emt 00:00:00]} Nd7 {[%eval -31,37] [%emt 00:00:22]} 42.Rd2 {(Rd2) [%eval 69,25] [%emt 00:00:20]} Qb7 {(Qc7) [%eval -31,38] [%emt 00:00:26]} 43.Ra2 {(Ra2) [%eval 69,27] [%emt 00:01:12]} Qc7 {(Qc7) [%eval -32,37] [%emt 00:00:24]} 44.Qd1 {(Rc2) [%eval 69,27] [%emt 00:00:54]} Be7 {(Be7) [%eval -32,37] [%emt 00:00:27]} 45.Rd2 {(Rc2) [%eval 69,27] [%emt 00:00:17]} Bf8 {(Bb7) [%eval -31,36] [%emt 00:00:21]} 46.Ra2 {(Rbb2) [%eval 0,4] [%emt 00:00:00]} Qb7 {(hashfull) [%eval -31,40] [%emt 00:00:27]} 47.Qe2 {(Rd2) [%eval 0,4] [%emt 00:00:00]} Be7 {[%eval -31,37] [%emt 00:00:16]} 48.Kg1 {(Rd1) [%eval 69,26] [%emt 00:00:18]} Qb8 {(Qc7) [%eval -31,38] [%emt 00:00:28]} 49.Rd1 {(Rd1) [%eval 69,26] [%emt 00:00:20]} Bb7 {(Nc5) [%eval -31,38] [%emt 00:00:24]} 50.Rad2 {(Rad2) [%eval 69,27] [%emt 00:00:32]} Bc6 {(Qc7) [%eval -31,39] [%emt 00:00:31]} 51.Ra2 {(Kh2) [%eval 0,4] [%emt 00:00:00]} Bf8 {(hashfull) [%eval -31,30] [%emt 00:00:02]} 52.Kh2 {(Be3) [%eval 69,25] [%emt 00:00:20]} Qb7 {(Qb7) [%eval -31,36] [%emt 00:00:31]} 53.Rb2 {(Rb2) [%eval 68,28] [%emt 00:01:09]} Nc5 {(Qb8) [%eval -31,39] [%emt 00:00:38]} 54.Ra1 {(Rdd2) [%eval 68,27] [%emt 00:00:19]} Nd7 {(Nd7) [%eval -31,40] [%emt 00:00:26]} 55.Rc1 {(Rd1) [%eval 0,4] [%emt 00:00:00]} Qb8 {(hashfull) [%eval -31,38] [%emt 00:00:24]} 56.Rcb1 {(Rcb1) [%eval 69,26] [%emt 00:00:35]} Qc7 {(Bb7) [%eval -31,37] [%emt 00:00:21]} 57.Rd1 {(Rc2) [%eval 68,27] [%emt 00:00:38]} Bb7 {(Be7) [%eval -31,36] [%emt 00:00:32]} 58.Rdd2 {(Qg4) [%eval 68,27] [%emt 00:00:48]} Bc6 {(Bc6) [%eval -29,36] [%emt 00:00:31]} 59.Rd1 {(Qg4) [%eval 0,4] [%emt 00:00:00]} Bb7 {(hashfull) [%eval -29,37] [%emt 00:00:18]} 60.Rdd2 {(Rc2) [%eval 0,4] [%emt 00:00:00]} Bc6 {[%eval -29,38] [%emt 00:00:24]} 61.Qd1 {(Be3) [%eval 69,28] [%emt 00:00:37]} Be7 {(Ba8) [%eval -29,39] [%emt 00:00:24]} 1/2-1/2[/pgn]
gbanksnz at gmail.com
chrisw
Posts: 4661
Joined: Tue Apr 03, 2012 4:28 pm
Location: Midi-Pyrénées
Full name: Christopher Whittington

Re: Super Tournament XXXVII

Post by chrisw »

Graham Banks wrote: Tue Sep 05, 2023 12:10 pm If you compare the CCRL and CEGT lists, you'll see a similar scenario, in that Stockfish and Dragon by Komodo leave a good gap back to the next 10 or so engines, which are all compressed into a 55-70 Elo range.
Stefan Pohl's list using the Hert500 book also shows this pattern.

That's just the way that things are.

I looked at the first 97 games in this tournament, and find these games that fit into the category you're referring to. It's unfortunate that they both feature CS Tal 2.00:

[pgn][Event "Super Tournament XXXVII"] [Site "ChessGUI2"] [Date "2023.09.03"] [Round "7.3"] [White "Clover 6.0 64-bit"] [Black "CS Tal 2.00 64-bit"] [Result "1/2-1/2"] [Time "10:33:28 AM"] [ECO "B40"] [Opening "Kveinis Variation, Sicilian"] [TimeControl "40/960:40/960:40/960"] [PlyCount "122"] [Number "33"] [Termination "GUI adjudication"] [BlackType "program"] [WhiteType "program"] [Variant "normal"] { Intel i7 Quad } 1.e4 {[%eval 0,1] [%emt 00:00:00]} c5 {[%eval 0,1] [%emt 00:00:00]} 2.Nf3 {[%eval 0,1] [%emt 00:00:00]} e6 {[%eval 0,1] [%emt 00:00:00]} 3.d4 {[%eval 0,1] [%emt 00:00:00]} cxd4 {[%eval 0,1] [%emt 00:00:00]} 4.Nxd4 {[%eval 0,1] [%emt 00:00:00]} Qb6 {[%eval 0,1] [%emt 00:00:00]} 5.Nb3 {[%eval 0,1] [%emt 00:00:00]} Qc7 {[%eval 0,1] [%emt 00:00:00]} 6.g3 {[%eval 0,1] [%emt 00:00:00]} Nf6 {[%eval -74,23] [%emt 00:00:24]} 7.Bg2 {(Bg2) [%eval 42,29] [%emt 00:00:27]} Nc6 {(d6) [%eval -55,24] [%emt 00:00:34]} 8.c4 {(O-O) [%eval 42,32] [%emt 00:00:47]} Bb4 {(Be7) [%eval -78,23] [%emt 00:00:44]} 9.Bd2 {(Bd2) [%eval 35,33] [%emt 00:00:40]} O-O {(O-O) [%eval -88,23] [%emt 00:00:21]} 10.O-O {(O-O) [%eval 57,33] [%emt 00:00:26]} Be7 {(a5) [%eval -79,24] [%emt 00:00:52]} 11.Qe2 {(Qe2) [%eval 66,31] [%emt 00:00:32]} d6 {(d6) [%eval -84,25] [%emt 00:00:20]} 12.Rc1 {(Be3) [%eval 58,34] [%emt 00:00:35]} b6 {(Bd7) [%eval -86,23] [%emt 00:00:55]} 13.Nc3 {(Nc3) [%eval 62,31] [%emt 00:00:34]} Bb7 {(Ba6) [%eval -93,23] [%emt 00:00:30]} 14.Nd5 {(Nd5) [%eval 80,29] [%emt 00:00:46]} Qd7 {(Qd8) [%eval -88,24] [%emt 00:00:28]} 15.Nxe7 {(Nxe7) [%eval 71,31] [%emt 00:00:36]} Qxe7 {(Qxe7) [%eval -88,28] [%emt 00:00:48]} 16.Re1 {(Bc3) [%eval 75,31] [%emt 00:00:35]} Nd7 {(Nd7) [%eval -95,23] [%emt 00:00:19]} 17.Be3 {(Rad1) [%eval 66,32] [%emt 00:00:37]} Nc5 {(Nc5) [%eval -83,22] [%emt 00:00:27]} 18.Rad1 {(Rad1) [%eval 53,30] [%emt 00:00:30]} h6 {(h6) [%eval -91,24] [%emt 00:00:19]} 19.f3 {(Qc2) [%eval 59,30] [%emt 00:00:29]} Rad8 {(Rad8) [%eval -50,23] [%emt 00:00:18]} 20.Qd2 {(Nd4) [%eval 63,32] [%emt 00:00:26]} Qc7 {(Kh8) [%eval -53,24] [%emt 00:00:24]} 21.Nd4 {(Nd4) [%eval 72,35] [%emt 00:00:25]} Nxd4 {(Nxd4) [%eval -66,24] [%emt 00:00:26]} 22.Bxd4 {(Bxd4) [%eval 45,36] [%emt 00:00:27]} e5 {(e5) [%eval -32,27] [%emt 00:00:39]} 23.Be3 {(Be3) [%eval 50,35] [%emt 00:00:22]} f5 {(f5) [%eval -43,24] [%emt 00:00:18]} 24.b4 {(b4) [%eval 65,33] [%emt 00:00:27]} Na6 {(Na6) [%eval -22,27] [%emt 00:00:23]} 25.Qd3 {(Qd3) [%eval 63,31] [%emt 00:00:27]} fxe4 {(fxe4) [%eval -7,22] [%emt 00:00:26]} 26.fxe4 {(fxe4) [%eval 33,35] [%emt 00:00:23]} Bc8 {(Bc8) [%eval -27,26] [%emt 00:00:17]} 27.Rc1 {(a3) [%eval 20,34] [%emt 00:01:07]} Kh7 {(Kh7) [%eval -3,27] [%emt 00:00:17]} 28.Qa3 {(a3) [%eval 32,34] [%emt 00:01:29]} Nb8 {(Nb8) [%eval 0,27] [%emt 00:00:15]} 29.c5 {(c5) [%eval 9,33] [%emt 00:00:15]} dxc5 {(dxc5) [%eval 35,29] [%emt 00:00:22]} 30.bxc5 {(bxc5) [%eval -22,32] [%emt 00:00:21]} b5 {(b5) [%eval 8,28] [%emt 00:00:18]} 31.Red1 {(Qb2) [%eval -2,31] [%emt 00:00:21]} Nc6 {(Rxd1) [%eval 67,22] [%emt 00:00:19]} 32.Rd5 {(Qb2) [%eval -40,33] [%emt 00:00:29]} a6 {(a6) [%eval 77,25] [%emt 00:00:22]} 33.h3 {(h3) [%eval -55,30] [%emt 00:00:13]} Rde8 {(Rde8) [%eval 70,26] [%emt 00:00:21]} 34.Ra1 {(Kh2) [%eval -40,34] [%emt 00:00:19]} Na5 {(Na5) [%eval 69,26] [%emt 00:00:21]} 35.Qd3 {(Kh2) [%eval -50,32] [%emt 00:00:16]} Nc4 {(Nc4) [%eval 73,28] [%emt 00:00:26]} 36.Kh2 {(Kh2) [%eval -50,31] [%emt 00:00:02]} Rf7 {(Be6) [%eval 66,28] [%emt 00:00:17]} 37.Bg1 {(Bg1) [%eval -50,30] [%emt 00:00:03]} Ref8 {(Ref8) [%eval 74,27] [%emt 00:00:24]} 38.a4 {(Qe2) [%eval -50,31] [%emt 00:00:03]} Bb7 {(Be6) [%eval 95,24] [%emt 00:00:21]} 39.Qe2 {(Qe2) [%eval -50,30] [%emt 00:00:02]} Bc6 {(Bc6) [%eval 76,25] [%emt 00:00:19]} 40.Rad1 {(h4) [%eval -50,26] [%emt 00:00:01]} Qe7 {(Rf6) [%eval 102,23] [%emt 00:00:09]} 41.Rc1 {(Ra1) [%eval -61,33] [%emt 00:00:30]} Rf6 {(Rf6) [%eval 83,27] [%emt 00:00:26]} 42.Rcd1 {(Rcd1) [%eval -52,32] [%emt 00:00:34]} R6f7 {(Qe6) [%eval 0,4] [%emt 00:00:00]} 43.Rc1 {(hashfull) [%eval -52,35] [%emt 00:00:14]} Rf6 {(Qc7) [%eval 0,4] [%emt 00:00:00]} 44.Rcd1 {[%eval -52,38] [%emt 00:00:29]} Qe8 {(Qf7) [%eval 78,27] [%emt 00:00:21]} 45.Ra1 {(Ra1) [%eval -52,35] [%emt 00:00:06]} Qe6 {(R6f7) [%eval 79,28] [%emt 00:00:26]} 46.Rad1 {(Rad1) [%eval -52,37] [%emt 00:00:20]} Qe8 {(Qe7) [%eval 0,4] [%emt 00:00:00]} 47.Ra1 {(hashfull) [%eval -52,32] [%emt 00:00:01]} Qe6 {(Qe6) [%eval 0,4] [%emt 00:00:00]} 48.Rad1 {[%eval -52,39] [%emt 00:00:35]} Qf7 {(Qe7) [%eval 78,29] [%emt 00:00:18]} 49.Ra1 {(Ra1) [%eval -52,37] [%emt 00:00:14]} Qe8 {(Qc7) [%eval 78,29] [%emt 00:00:22]} 50.Rc1 {(Rad1) [%eval -52,39] [%emt 00:00:27]} R6f7 {(R6f7) [%eval 79,30] [%emt 00:00:20]} 51.Rcd1 {(Ra1) [%eval -52,40] [%emt 00:00:24]} Kg8 {(Qc8) [%eval 79,28] [%emt 00:00:21]} 52.Rc1 {(Rb1) [%eval -52,40] [%emt 00:00:28]} Kh7 {(Kh8) [%eval 0,4] [%emt 00:00:00]} 53.Rcd1 {(hashfull) [%eval -52,38] [%emt 00:00:11]} Kg8 {(Qc8) [%eval 0,4] [%emt 00:00:00]} 54.Rc1 {[%eval -52,42] [%emt 00:00:40]} Rf6 {(Qa8) [%eval 78,28] [%emt 00:00:30]} 55.Ra1 {(Rcd1) [%eval -52,39] [%emt 00:00:23]} Kh7 {(Qe7) [%eval 0,4] [%emt 00:00:00]} 56.Rad1 {(hashfull) [%eval -52,38] [%emt 00:00:21]} Qf7 {(Qc8) [%eval 0,4] [%emt 00:00:00]} 57.Ra1 {[%eval -52,39] [%emt 00:00:19]} Qb7 {(Kh8) [%eval 77,29] [%emt 00:00:33]} 58.Rc1 {(Rb1) [%eval -52,39] [%emt 00:00:32]} Qc8 {(Qc8) [%eval 77,30] [%emt 00:01:09]} 59.Rcd1 {(Rcd1) [%eval -52,37] [%emt 00:00:11]} R6f7 {(R6f7) [%eval 77,30] [%emt 00:00:23]} 60.Ra1 {(Rc1) [%eval -52,38] [%emt 00:00:19]} Qc7 {(Qb7) [%eval 0,4] [%emt 00:00:00]} 61.Ra2 {(hashfull) [%eval -52,39] [%emt 00:00:26]} Rf6 {(Rf6) [%eval 77,27] [%emt 00:00:37]} 1/2-1/2[/pgn]

[pgn][Event "Super Tournament XXXVII"] [Site "ChessGUI2"] [Date "2023.09.05"] [Round "16.3"] [White "CS Tal 2.00 64-bit"] [Black "Clover 6.0 64-bit"] [Result "1/2-1/2"] [Time "1:57:27 AM"] [ECO "B40"] [Opening "Kveinis Variation, Sicilian"] [TimeControl "40/960:40/960:40/960"] [PlyCount "122"] [Number "78"] [Termination "GUI adjudication"] [BlackType "program"] [WhiteType "program"] [Variant "normal"] { Intel i7 Quad } 1.e4 {[%eval 0,1] [%emt 00:00:00]} c5 {[%eval 0,1] [%emt 00:00:00]} 2.Nf3 {[%eval 0,1] [%emt 00:00:00]} e6 {[%eval 0,1] [%emt 00:00:00]} 3.d4 {[%eval 0,1] [%emt 00:00:00]} cxd4 {[%eval 0,1] [%emt 00:00:00]} 4.Nxd4 {[%eval 0,1] [%emt 00:00:00]} Qb6 {[%eval 0,1] [%emt 00:00:00]} 5.Nb3 {[%eval 0,1] [%emt 00:00:00]} Qc7 {[%eval 0,1] [%emt 00:00:00]} 6.g3 {[%eval 0,1] [%emt 00:00:00]} Nf6 {[%eval -55,31] [%emt 00:00:51]} 7.Bg2 {(Bg2) [%eval 64,25] [%emt 00:00:24]} Be7 {(Be7) [%eval -40,33] [%emt 00:00:31]} 8.O-O {(O-O) [%eval 65,23] [%emt 00:00:22]} O-O {(Nc6) [%eval -24,33] [%emt 00:00:24]} 9.Nc3 {(Qe2) [%eval 56,25] [%emt 00:00:26]} Nc6 {(Nc6) [%eval -33,34] [%emt 00:00:25]} 10.Nb5 {(Nb5) [%eval 43,25] [%emt 00:00:21]} Qb8 {(Qb8) [%eval -28,36] [%emt 00:00:29]} 11.Be3 {(Qe2) [%eval 60,26] [%emt 00:01:06]} a6 {[%eval -7,31] [%emt 00:00:26]} 12.N5d4 {(Nc3) [%eval 49,27] [%emt 00:00:30]} Qc7 {(Qc7) [%eval -19,32] [%emt 00:01:32]} 13.c4 {(h3) [%eval 57,27] [%emt 00:00:21]} d6 {(d6) [%eval -29,34] [%emt 00:00:40]} 14.Rc1 {(Qe2) [%eval 49,26] [%emt 00:00:41]} Bd7 {(Bd7) [%eval -28,32] [%emt 00:00:27]} 15.Nxc6 {(Nxc6) [%eval 55,27] [%emt 00:00:17]} Bxc6 {(Bxc6) [%eval -29,32] [%emt 00:00:34]} 16.Qd3 {(Qd3) [%eval 56,26] [%emt 00:00:19]} b6 {(b6) [%eval -31,33] [%emt 00:00:32]} 17.Nd4 {(Nd4) [%eval 50,26] [%emt 00:00:22]} Bb7 {(Bb7) [%eval -31,30] [%emt 00:01:04]} 18.Rfd1 {(b3) [%eval 57,26] [%emt 00:00:18]} Rac8 {(Rac8) [%eval -22,31] [%emt 00:00:24]} 19.b3 {(b3) [%eval 49,26] [%emt 00:00:32]} h6 {(h6) [%eval -20,30] [%emt 00:00:39]} 20.Ne2 {(Bd2) [%eval 59,27] [%emt 00:00:42]} Qb8 {(Nd7) [%eval -26,35] [%emt 00:00:24]} 21.a4 {(Nc3) [%eval 64,27] [%emt 00:00:53]} Nd7 {(Nd7) [%eval -35,35] [%emt 00:00:15]} 22.Nc3 {(Nc3) [%eval 63,27] [%emt 00:00:28]} Rfd8 {(Nc5) [%eval -35,35] [%emt 00:01:05]} 23.Bd4 {(Bd4) [%eval 44,23] [%emt 00:00:25]} Qc7 {(Nc5) [%eval -29,34] [%emt 00:00:19]} 24.Rb1 {(Qe2) [%eval 46,24] [%emt 00:00:16]} Ba8 {(Bg5) [%eval -41,35] [%emt 00:00:47]} 25.Qe2 {(Qe2) [%eval 51,28] [%emt 00:00:51]} Bf8 {(Bf6) [%eval -35,33] [%emt 00:00:32]} 26.h3 {(Rd2) [%eval 75,26] [%emt 00:00:22]} Bc6 {(Be7) [%eval -35,34] [%emt 00:00:13]} 27.Kh2 {(Kh2) [%eval 74,28] [%emt 00:00:26]} Qb7 {(Ba8) [%eval -35,34] [%emt 00:00:30]} 28.Re1 {(Rd2) [%eval 74,23] [%emt 00:00:30]} Re8 {(Qc7) [%eval -35,34] [%emt 00:00:15]} 29.h4 {(Red1) [%eval 74,23] [%emt 00:00:26]} Qc7 {(Nc5) [%eval -32,35] [%emt 00:00:17]} 30.Red1 {(Red1) [%eval 74,26] [%emt 00:00:19]} Be7 {(Bb7) [%eval -32,34] [%emt 00:00:12]} 31.Rd2 {(Rd2) [%eval 74,26] [%emt 00:00:23]} Qb7 {(Ba8) [%eval -31,32] [%emt 00:00:10]} 32.Re1 {(Rdb2) [%eval 73,25] [%emt 00:00:18]} Bf8 {(Qc7) [%eval -31,32] [%emt 00:00:28]} 33.Red1 {(Qd1) [%eval 74,25] [%emt 00:00:16]} Qc7 {(Nc5) [%eval -32,33] [%emt 00:00:14]} 34.Rc1 {(Be3) [%eval 73,26] [%emt 00:00:15]} Qb7 {(Qb7) [%eval -31,32] [%emt 00:00:08]} 35.Rcd1 {(Rcd1) [%eval 0,4] [%emt 00:00:00]} Qc7 {(hashfull) [%eval -31,33] [%emt 00:00:06]} 36.Rc1 {(Rb1) [%eval 0,4] [%emt 00:00:00]} Qb7 {[%eval -31,33] [%emt 00:00:24]} 37.Rb2 {(Be3) [%eval 70,26] [%emt 00:00:55]} Qb8 {(Qb8) [%eval -31,31] [%emt 00:00:04]} 38.Rd1 {(Rcb1) [%eval 70,26] [%emt 00:00:25]} Nc5 {(Ba8) [%eval -31,30] [%emt 00:00:06]} 39.Rbb1 {(Rdd2) [%eval 70,24] [%emt 00:00:15]} Nd7 {(Nd7) [%eval -31,30] [%emt 00:00:03]} 40.Rb2 {(Rd2) [%eval 0,4] [%emt 00:00:00]} Nc5 {(hashfull) [%eval -31,30] [%emt 00:00:01]} 41.Rbb1 {(Rdd2) [%eval 0,4] [%emt 00:00:00]} Nd7 {[%eval -31,37] [%emt 00:00:22]} 42.Rd2 {(Rd2) [%eval 69,25] [%emt 00:00:20]} Qb7 {(Qc7) [%eval -31,38] [%emt 00:00:26]} 43.Ra2 {(Ra2) [%eval 69,27] [%emt 00:01:12]} Qc7 {(Qc7) [%eval -32,37] [%emt 00:00:24]} 44.Qd1 {(Rc2) [%eval 69,27] [%emt 00:00:54]} Be7 {(Be7) [%eval -32,37] [%emt 00:00:27]} 45.Rd2 {(Rc2) [%eval 69,27] [%emt 00:00:17]} Bf8 {(Bb7) [%eval -31,36] [%emt 00:00:21]} 46.Ra2 {(Rbb2) [%eval 0,4] [%emt 00:00:00]} Qb7 {(hashfull) [%eval -31,40] [%emt 00:00:27]} 47.Qe2 {(Rd2) [%eval 0,4] [%emt 00:00:00]} Be7 {[%eval -31,37] [%emt 00:00:16]} 48.Kg1 {(Rd1) [%eval 69,26] [%emt 00:00:18]} Qb8 {(Qc7) [%eval -31,38] [%emt 00:00:28]} 49.Rd1 {(Rd1) [%eval 69,26] [%emt 00:00:20]} Bb7 {(Nc5) [%eval -31,38] [%emt 00:00:24]} 50.Rad2 {(Rad2) [%eval 69,27] [%emt 00:00:32]} Bc6 {(Qc7) [%eval -31,39] [%emt 00:00:31]} 51.Ra2 {(Kh2) [%eval 0,4] [%emt 00:00:00]} Bf8 {(hashfull) [%eval -31,30] [%emt 00:00:02]} 52.Kh2 {(Be3) [%eval 69,25] [%emt 00:00:20]} Qb7 {(Qb7) [%eval -31,36] [%emt 00:00:31]} 53.Rb2 {(Rb2) [%eval 68,28] [%emt 00:01:09]} Nc5 {(Qb8) [%eval -31,39] [%emt 00:00:38]} 54.Ra1 {(Rdd2) [%eval 68,27] [%emt 00:00:19]} Nd7 {(Nd7) [%eval -31,40] [%emt 00:00:26]} 55.Rc1 {(Rd1) [%eval 0,4] [%emt 00:00:00]} Qb8 {(hashfull) [%eval -31,38] [%emt 00:00:24]} 56.Rcb1 {(Rcb1) [%eval 69,26] [%emt 00:00:35]} Qc7 {(Bb7) [%eval -31,37] [%emt 00:00:21]} 57.Rd1 {(Rc2) [%eval 68,27] [%emt 00:00:38]} Bb7 {(Be7) [%eval -31,36] [%emt 00:00:32]} 58.Rdd2 {(Qg4) [%eval 68,27] [%emt 00:00:48]} Bc6 {(Bc6) [%eval -29,36] [%emt 00:00:31]} 59.Rd1 {(Qg4) [%eval 0,4] [%emt 00:00:00]} Bb7 {(hashfull) [%eval -29,37] [%emt 00:00:18]} 60.Rdd2 {(Rc2) [%eval 0,4] [%emt 00:00:00]} Bc6 {[%eval -29,38] [%emt 00:00:24]} 61.Qd1 {(Be3) [%eval 69,28] [%emt 00:00:37]} Be7 {(Ba8) [%eval -29,39] [%emt 00:00:24]} 1/2-1/2[/pgn]
Graham, it's not relevant what CEGT, Pohl, Hert500, do or show, to the central critique that you are adjudicating draws at 80 cp. Nor is it relevant that "this is the way that it is".

First off 80 cp is way off any draw value and second, engines nowadays are not eval-normalised and 80 cp is no way 80 cp across all engines.

It's completely nuts to adjudicate draws at 80 cp. You can adjudicate a draw if both engines show below 4 or 5 cp for N moves. That's the way it is.

It also makes no sense to be culling games on your 70 cp out of book for N moves criteria. Evals are not consistent. 70 cp is not 70 cp. You should leave it up to the book to decide if it's lines are okay or not. I know many book are just amateur affairs and could contain anything - if you want a book with start move controlled to some normalised cp value, send me the pgns or epds and I'll filter them for you using SF15 or some other accurate/normalised evaluator.
Modern Times
Posts: 3758
Joined: Thu Jun 07, 2012 11:02 pm

Re: Super Tournament XXXVII

Post by Modern Times »

chrisw wrote: Tue Sep 05, 2023 11:31 am Those default values are bonkers and they penalise the "winning" engine in the pair, probably the stronger engine.
I totally agree that the 80cp value is bonkers.

chrisw wrote: Tue Sep 05, 2023 11:31 am Result? The entire rating list 40/15 is skewed against the stronger engines, Elos are depressed at the top end.
That depends entirely on the number of incorrect adjudications. If the number is tiny, then the effect may not be as great as you think. Graham probably doesn't have any facts to back up his "98% are OK" number, but I don't think you have any facts either to be making claim such as "the entire ratings list is skewed" etc etc.

Maybe Graham might be convinced to change that 80cp value going forward at least.
lkaufman
Posts: 6260
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Super Tournament XXXVII

Post by lkaufman »

Modern Times wrote: Tue Sep 05, 2023 9:04 pm
chrisw wrote: Tue Sep 05, 2023 11:31 am Those default values are bonkers and they penalise the "winning" engine in the pair, probably the stronger engine.
I totally agree that the 80cp value is bonkers.

chrisw wrote: Tue Sep 05, 2023 11:31 am Result? The entire rating list 40/15 is skewed against the stronger engines, Elos are depressed at the top end.
That depends entirely on the number of incorrect adjudications. If the number is tiny, then the effect may not be as great as you think. Graham probably doesn't have any facts to back up his "98% are OK" number, but I don't think you have any facts either to be making claim such as "the entire ratings list is skewed" etc etc.

Maybe Graham might be convinced to change that 80cp value going forward at least.
Chris doesn't state how skewed the list is due to the 80 cp adjudication; clearly it is skewed by it, but perhaps not dramatically so. Yes, the solution is to change the 80cp value to some tiny number or even 0 in the future, but I think this should be done simultaneously with switching to increment play for the Rapid games (as is already done for blitz), because the two issue are related. Using a minimal adjudication threshhold will result in many more super-long games, which will be very boring and a huge waste of resources with 40/x repeating time controls. One of the big benefits of increment is that games can be played to the end without need for adjudication. We already benefitted from this in human chess; I had a major tournament game adjudicated by none other than Bobbby Fischer around 1965; now with increment we just play to the end. Perhaps standardizing the opening book to whatever you settle on here could also be done at the same time, with the old list archived as you do with blitz. Mixing games played with a low-draw book together with games played with normal books is statistically unsound. If all three issues are fixed at once, the new list would be much sounder and very highly respected.
Komodo rules!
User avatar
Graham Banks
Posts: 44799
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

Re: Super Tournament XXXVII

Post by Graham Banks »

I have no inclination or desire to start, or to be involved with, a new rating list.

If/when I stop my CCRL testing, I'll happily continue to run my Amateur Series tournaments for those still interested, although probably with an incremental time control.

I run engine v engine testing because I enjoy it - the tournaments in particular.
I like watching some of the games, which is why bullet or blitz hold no interest for me whatsoever.
gbanksnz at gmail.com
lkaufman
Posts: 6260
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Super Tournament XXXVII

Post by lkaufman »

Graham Banks wrote: Wed Sep 06, 2023 2:07 am I have no inclination or desire to start, or to be involved with, a new rating list.

If/when I stop my CCRL testing, I'll happily continue to run my Amateur Series tournaments for those still interested, although probably with an incremental time control.

I run engine v engine testing because I enjoy it - the tournaments in particular.
I like watching some of the games, which is why bullet or blitz hold no interest for me whatsoever.
No one was suggesting blitz games to replace Rapid. My suggestion was to retain the current pace of your 40/15 games for the first 60 moves or so, then use increment to speed up the long endgames (which are usually drawn) a bit, especially if the adjudication rules are tightened. Maybe it wouldn't even need a new list, perhaps the change would be deemed minor enough to combine them. I would think that would make them more enjoyable to watch with no loss in average quality.
Komodo rules!
User avatar
Graham Banks
Posts: 44799
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

Re: Super Tournament XXXVII

Post by Graham Banks »

lkaufman wrote: Wed Sep 06, 2023 7:48 am
Graham Banks wrote: Wed Sep 06, 2023 2:07 am I have no inclination or desire to start, or to be involved with, a new rating list.

If/when I stop my CCRL testing, I'll happily continue to run my Amateur Series tournaments for those still interested, although probably with an incremental time control.

I run engine v engine testing because I enjoy it - the tournaments in particular.
I like watching some of the games, which is why bullet or blitz hold no interest for me whatsoever.
No one was suggesting blitz games to replace Rapid. My suggestion was to retain the current pace of your 40/15 games for the first 60 moves or so, then use increment to speed up the long endgames (which are usually drawn) a bit, especially if the adjudication rules are tightened. Maybe it wouldn't even need a new list, perhaps the change would be deemed minor enough to combine them. I would think that would make them more enjoyable to watch with no loss in average quality.
Do you think that 30 minutes with 10 second increments would be on a par with 40/15 games, which on average take 50 minutes to complete (if using 40/16)?
On my 5950x, I use 40/11 repeating, with the average games taking around 35 minutes, so I'm guessing that would be about 20 minutes with 7 second increments?

I could also drop the draw adjudication from 10 consecutive moves past move 60 with less than 80, to less than 30.

I do draw the line at using what I perceive to be unfair opening lines though.
gbanksnz at gmail.com
chrisw
Posts: 4661
Joined: Tue Apr 03, 2012 4:28 pm
Location: Midi-Pyrénées
Full name: Christopher Whittington

Re: Super Tournament XXXVII

Post by chrisw »

Graham Banks wrote: Wed Sep 06, 2023 8:04 am
lkaufman wrote: Wed Sep 06, 2023 7:48 am
Graham Banks wrote: Wed Sep 06, 2023 2:07 am I have no inclination or desire to start, or to be involved with, a new rating list.

If/when I stop my CCRL testing, I'll happily continue to run my Amateur Series tournaments for those still interested, although probably with an incremental time control.

I run engine v engine testing because I enjoy it - the tournaments in particular.
I like watching some of the games, which is why bullet or blitz hold no interest for me whatsoever.
No one was suggesting blitz games to replace Rapid. My suggestion was to retain the current pace of your 40/15 games for the first 60 moves or so, then use increment to speed up the long endgames (which are usually drawn) a bit, especially if the adjudication rules are tightened. Maybe it wouldn't even need a new list, perhaps the change would be deemed minor enough to combine them. I would think that would make them more enjoyable to watch with no loss in average quality.
Do you think that 30 minutes with 10 second increments would be on a par with 40/15 games, which on average take 50 minutes to complete (if using 40/16)?
On my 5950x, I use 40/11 repeating, with the average games taking around 35 minutes, so I'm guessing that would be about 20 minutes with 7 second increments?

I could also drop the draw adjudication from 10 consecutive moves past move 60 with less than 80, to less than 30.

I do draw the line at using what I perceive to be unfair opening lines though.
You want to negotiate the value of a draw down from 80 to 30? This is like you say 2+2=5, we say 2+2=4 and you offer to compromise on 2+2=4.5
Draw = 0.0, maybe +/-5 if using engine output, some of whom appear to add some small random value to 0.0 during search.

In the context of a back to back opening book, there is no concept of an unfair opening line. The correct term is unbalanced. Unbalanced opening book are perfectly "fair", as long as each side gets to play each opening from both black and white perspectives.
You can use the term grotesquely unbalanced, where the white side will always win - these lines are useless in an opening book because they provide no information, so there is a case for culling grotesquely unbalanced lines.

Data on openings? Well you can do an SF15 eval on the exit point. You can get a result-based value from a suitable number of samples of the line found in comp-comp games. Two values in fact, whitebias = (W+D) / (W+D+L), and nondrawrate = (W+L) / (W+D+L)

Criteria for being in book:
Sufficient number of samples (I am using 50 plus)
The highest nondrawrates

If SFeval < BORING_EVAL_LO then cull
if SFeval > GROTESQUE_EVAL_HI then cull
if whitebias > TOO_MUCH_WHITEBIAS then cull

Then we don't need to do any culling where you stop the game in the first N moves and restart with another opening, because that culling is done at the book building stage.

Parameters:
DRAW = 0
BORING_EVAL_LO = 60
GROTESQUE_EVAL_HI = 150
TOO_MUCH_WHITEBIAS = 0.75

That's a scientifically generated book.