Super Tournament XXXVII

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
Rebel
Posts: 7403
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: Super Tournament XXXVII

Post by Rebel »

Graham Banks wrote: Thu Sep 07, 2023 1:33 am
Rebel wrote: Thu Sep 07, 2023 1:11 am
Graham Banks wrote: Tue Sep 05, 2023 12:07 am
chrisw wrote: Mon Sep 04, 2023 5:33 pm Why was this game declared a draw? They were shuffling, but the "winning" side was going to make a pawn move and change everything sooner or later.
Hi Chris,

I've probably mentioned this before, but I've always used the default ChessGUI adjudication settings:

Image

98% of the time, they're pretty much right.
I don't even fully understand the ChessGUI picture, but lemme give it a try.

1. Looks like you rewrote the 50 move rule to 30 moves with an abs(score<80) for last 10 moves.

Haven't changed anything yet.

2. To declare a win when the last 4 moves are within 4.75 is strange with nowadays high scores, I use 9.99 and never decide a game when the losing engine still has a queen.

3. No idea what Max Game Length = 50 means. Probably don't do any adjudication before move 50? Please enlighten me.

For testing to avoid long silly draws I use a game length of at least 160 moves when the mutual scores have remained under one pawn, for serious rating games 160 is a bit too low, I would suggest 200.

I don't set a move limit.
The picture is about Game adjudication, please describe the meaning of the 8 listed settings. I get the feeling you don't understand them and neither do I.
90% of coding is debugging, the other 10% is writing bugs.
Modern Times
Posts: 3758
Joined: Thu Jun 07, 2012 11:02 pm

Re: Super Tournament XXXVII

Post by Modern Times »

Very clear to me, and will be to Graham too.
User avatar
Graham Banks
Posts: 44799
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

Re: Super Tournament XXXVII

Post by Graham Banks »

Modern Times wrote: Thu Sep 07, 2023 2:26 am Very clear to me, and will be to Graham too.
Having run well over a million games at this time control (most using ChessGUI), I find it quite disrespectful to have it insinuated that I don't understand the settings I use.
gbanksnz at gmail.com
lkaufman
Posts: 6260
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Super Tournament XXXVII

Post by lkaufman »

Graham Banks wrote: Thu Sep 07, 2023 12:35 am If my memory serves me correctly, TCEC aim for a draw rate below 70%.

I think that is reasonable.
70% draw rate for top engines at long time controls on big hardware is hard to achieve, they only can do it by very unbalanced openings. But if you start with a database of blitz games on 1 or a few cores by a wide range of engines, it is just a normal result with normal openings. That's roughly the draw rate in the Hiarcs book, which is overwhelmingly good openings. Such openings will produce 99% or more draws with TCEC conditions, at least in the final stages. My point is that you have to set the limit far below 70% in a database such as Hiarcs opening book or CCRL blitz games, in order to get anywhere near a 70% draw rate among the top few engines on 4 or more threads at Rapid TC.
Komodo rules!
Krzysztof Grzelak
Posts: 1585
Joined: Tue Jul 15, 2014 12:47 pm

Re: Super Tournament XXXVII

Post by Krzysztof Grzelak »

Hi Graham,
please enter 0.15 and it should be fine.
Modern Times
Posts: 3758
Joined: Thu Jun 07, 2012 11:02 pm

Re: Super Tournament XXXVII

Post by Modern Times »

Back in the day when I used ChessGUI, I had the following:

Game > 59 moves and last 10 moves less then 5 centipawns = adjudicate draw
Adjudicate a tablebase result (draw or win) but not before the distance got to less than 10 moves away
Adjudicate a win if eval was > 950 centipawns for 5 moves

The latter two meant that the games were almost played out to their conclusion anyway, so maybe I shouldn't have bothered. When I did watch games I liked to see how the engines played them out though.

These days I only use CuteChess, and there I don't have any evaluation based adjudication at all, I only have tablebase adjudication turned on. That is what SPCC does and I decided to do the same.
User avatar
Rebel
Posts: 7403
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: Super Tournament XXXVII

Post by Rebel »

Graham Banks wrote: Thu Sep 07, 2023 2:59 am
Modern Times wrote: Thu Sep 07, 2023 2:26 am Very clear to me, and will be to Graham too.
Having run well over a million games at this time control (most using ChessGUI), I find it quite disrespectful to have it insinuated that I don't understand the settings I use.
Maybe so and I am happy to withdraw that, but you did not answer my question and explain the 8 settings you use for game adjudication.
90% of coding is debugging, the other 10% is writing bugs.
Krzysztof Grzelak
Posts: 1585
Joined: Tue Jul 15, 2014 12:47 pm

Re: Super Tournament XXXVII

Post by Krzysztof Grzelak »

Modern Times wrote: Thu Sep 07, 2023 9:36 am Back in the day when I used ChessGUI, I had the following:

Game > 59 moves and last 10 moves less then 5 centipawns = adjudicate draw
Adjudicate a tablebase result (draw or win) but not before the distance got to less than 10 moves away
Adjudicate a win if eval was > 950 centipawns for 5 moves

The latter two meant that the games were almost played out to their conclusion anyway, so maybe I shouldn't have bothered. When I did watch games I liked to see how the engines played them out though.

These days I only use CuteChess, and there I don't have any evaluation based adjudication at all, I only have tablebase adjudication turned on. That is what SPCC does and I decided to do the same.
SPCC can't be taken seriously just humorously. Because of a bad debut book.
Modern Times
Posts: 3758
Joined: Thu Jun 07, 2012 11:02 pm

Re: Super Tournament XXXVII

Post by Modern Times »

Rebel wrote: Thu Sep 07, 2023 10:48 am
Graham Banks wrote: Thu Sep 07, 2023 2:59 am
Modern Times wrote: Thu Sep 07, 2023 2:26 am Very clear to me, and will be to Graham too.
Having run well over a million games at this time control (most using ChessGUI), I find it quite disrespectful to have it insinuated that I don't understand the settings I use.
Maybe so and I am happy to withdraw that, but you did not answer my question and explain the 8 settings you use for game adjudication.
It is 3 settings not 8, look at my post and you'll understand what his screenshot is saying.
chrisw
Posts: 4661
Joined: Tue Apr 03, 2012 4:28 pm
Location: Midi-Pyrénées
Full name: Christopher Whittington

Re: Super Tournament XXXVII

Post by chrisw »

lkaufman wrote: Wed Sep 06, 2023 10:36 pm
carldaman wrote: Wed Sep 06, 2023 8:11 pm
lkaufman wrote: Wed Sep 06, 2023 5:49 pm
Graham Banks wrote: Wed Sep 06, 2023 8:04 am
lkaufman wrote: Wed Sep 06, 2023 7:48 am
Graham Banks wrote: Wed Sep 06, 2023 2:07 am I have no inclination or desire to start, or to be involved with, a new rating list.

If/when I stop my CCRL testing, I'll happily continue to run my Amateur Series tournaments for those still interested, although probably with an incremental time control.

I run engine v engine testing because I enjoy it - the tournaments in particular.
I like watching some of the games, which is why bullet or blitz hold no interest for me whatsoever.
No one was suggesting blitz games to replace Rapid. My suggestion was to retain the current pace of your 40/15 games for the first 60 moves or so, then use increment to speed up the long endgames (which are usually drawn) a bit, especially if the adjudication rules are tightened. Maybe it wouldn't even need a new list, perhaps the change would be deemed minor enough to combine them. I would think that would make them more enjoyable to watch with no loss in average quality.
Do you think that 30 minutes with 10 second increments would be on a par with 40/15 games, which on average take 50 minutes to complete (if using 40/16)?
On my 5950x, I use 40/11 repeating, with the average games taking around 35 minutes, so I'm guessing that would be about 20 minutes with 7 second increments?

I could also drop the draw adjudication from 10 consecutive moves past move 60 with less than 80, to less than 30.

I do draw the line at using what I perceive to be unfair opening lines though.
I think you meant to say 15 minutes (per player) plus 10 second increment would be equal to 40/16, since 15 min plus 10 sec games would take fifty minutes for sixty moves with all time used. For your machine, you mean to say 10 minutes per player plus 7 seconds, which would take 34 minutes for sixty moves with all time used. Probably without adjudication (or with minimal adjudication) the average would be somewhat more than sixty moves, so I would recommend 12 minutes plus 8 seconds on your 5950x as the most equivalent to your current time control. I can assure you that the quality of play will in general be higher with 12 minutes plus 8 seconds, though of course it will be slightly lower when playing out very long endgames.

Adjudication at 30 cp is of course much better than at 80 cp, but as far as I know no one else uses adjudications at any double-digit centipawn value. I agree with Chris than zero would be best (with increment to make it practical), but any single-digit value is okay from my perspective.

Regarding "unfair" opening lines, I suppose you mean lines that are likely winning, or perhaps very close to winning, for one side (presumably White). I think that we all agree that winning lines should be excluded; it's just a question of where to draw the line for likely or nearly winning. I would specify a maximum White score, definitely below 75% since above that indicates a likely won position; 70% seems a good value to me. Setting it too low would mean almost all draws when the top engines meet.
I would argue that 70% is a bit low, since it would leave out a bunch of King's Indians and Benonis, which are too big a part of the common chess heritage. While White gets a sizable advantage in theory, I would not say they're automatic losses for Black, and nowhere near "grotesquely unbalanced lines", as Chris put it.
Checking the HIarcs database, which I believe is mostly strong engine games now, the mar del plata king's indian after 8...Ne7, presumably the main line, is only showing 55.5% for White. The main line Benoni with 8.h3 and 9.Bd3 shows 61.3% for White, still far below 70%. Of course you can find some minor sidelines that exceed 70%, but no need for such lines in a book, the lines actually played by grandmasters with any frequency are almost always below 70% score for White. Probably it would be best to use stats from a book such as Hiarcs book, which has 1.8+ million games and an average Elo showing over 2900. With samples and quality like this, no need to even consider engine evals, just set a maximum White score like 70% and a maximum total draw percentage, perhaps 55 to 60% (the draw % in the Hiarcs book is in the 68-69% range), and a minimum number of total games from the position, whatever value produces the desired number of total games.
Here you are talking about making unbalanced human openings based on game results, something which has already been done.

The first book I have to Graham for this tourney was same idea, only using comp-comp games and with a high eval cull (0.70). This did no better than a random book for decreasing draw rate.

To lower the draw rate, one MUST not cull high evals, one must cull low evals (out of book).
Alternatively, no culling of high nondrawrate from results, culling of lownondrawrate.