On-line engine blitz tourney July

Discussion of chess software programming and technical issues.

Moderators: hgm, Rebel, chrisw

chrisw
Posts: 4319
Joined: Tue Apr 03, 2012 4:28 pm

Re: On-line engine blitz tourney July

Post by chrisw »

Rebel wrote: Sat Jul 18, 2020 11:23 am Seems Chris is working on the long awaited Chess System Tal III :D
Do you have a stats tool for determining (well, in quite restricted set up, find an engine with which yours is roughly equal at 50%) the average length of game wins? Or even the statistical spread?
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: On-line engine blitz tourney July

Post by Rebel »

chrisw wrote: Sat Jul 18, 2020 8:26 pm
Rebel wrote: Sat Jul 18, 2020 11:23 am Seems Chris is working on the long awaited Chess System Tal III :D
Do you have a stats tool for determining (well, in quite restricted set up, find an engine with which yours is roughly equal at 50%) the average length of game wins? Or even the statistical spread?
I looked, but don't have.
90% of coding is debugging, the other 10% is writing bugs.
chrisw
Posts: 4319
Joined: Tue Apr 03, 2012 4:28 pm

Re: On-line engine blitz tourney July

Post by chrisw »

Rebel wrote: Sun Jul 19, 2020 8:42 am
chrisw wrote: Sat Jul 18, 2020 8:26 pm
Rebel wrote: Sat Jul 18, 2020 11:23 am Seems Chris is working on the long awaited Chess System Tal III :D
Do you have a stats tool for determining (well, in quite restricted set up, find an engine with which yours is roughly equal at 50%) the average length of game wins? Or even the statistical spread?
I looked, but don't have.
Useful? Get a bunch of engines, say around 3000, that are mostly equal in matches against each other.
Play tournament, let’s say 10000 games each at bullet.
You get the usual roughly 30/30/30 split WDL. Same random book for all, same end of game decision rules.
Take the wins for each engine and compute the mean length of game. On the basis that short wins are probably connected to mating attacks, you should be able to list the engines on an exciting/boring scale.
chrisw
Posts: 4319
Joined: Tue Apr 03, 2012 4:28 pm

Re: On-line engine blitz tourney July

Post by chrisw »

Rebel wrote: Sun Jul 19, 2020 8:42 am
chrisw wrote: Sat Jul 18, 2020 8:26 pm
Rebel wrote: Sat Jul 18, 2020 11:23 am Seems Chris is working on the long awaited Chess System Tal III :D
Do you have a stats tool for determining (well, in quite restricted set up, find an engine with which yours is roughly equal at 50%) the average length of game wins? Or even the statistical spread?
I looked, but don't have.

Well, it kind of works. I grabbed two recent 10000 game test runs against a couple of equal-ish sparring partners, I think around 3000 CCRL Elo, won't say who thjey, because that is not the point, but one is in the Stockfish paradigm, and the other I don't know, save that Ed suggested it as an opponent because it plays interesting chess. Did the histograms of winning game length for mine paired with each of the other two.
Results are, (what I expected)
Mine, mean game length to win vs engine1: 97.6 ply
Engine1, mean game length to win vs mine: 111.1 ply

Mine, mean game length to win vs engine2: 100.0 ply
Engine2, mean game length to win vs mine: 111.8 ply

With a bit of refinement his might be a possible metric for play style. Fireworks engines should win their wins quickly. They'll probably go down in flames quickly too, of course.

Edit: median is useful. 50% of the wins are shorter than median value.
vs engine1, median is 91, opponent 104
vs engine2, 91, 108 (everything is half moves, btw)
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: On-line engine blitz tourney July

Post by Rebel »

Nice way to measure playing style. Is the tool available? Else I must write it myself and I am so busy.










































Correction, that should read as lazy.


























:D
90% of coding is debugging, the other 10% is writing bugs.
chrisw
Posts: 4319
Joined: Tue Apr 03, 2012 4:28 pm

Re: On-line engine blitz tourney July

Post by chrisw »

Rebel wrote: Sun Jul 19, 2020 5:00 pm Nice way to measure playing style. Is the tool available? Else I must write it myself and I am so busy.
Sure, what do you want it to do? Ideally, give it paired PGN batches for a bunch of similar engines. Each PGN batch off same opening book, same ply depth, same termination rule, same time control (fast).
Then process the EPDs, report median and mean winning game lengths and save some pretty histograms? If you want to prepare masses of paired games, I'll construct the results analyser for you.









































Correction, that should read as lazy.


























:D
Joost Buijs
Posts: 1563
Joined: Thu Jul 16, 2009 10:47 am
Location: Almere, The Netherlands

Re: On-line engine blitz tourney July

Post by Joost Buijs »

Brian Richardson noticed me that there were some unfinished games in the final tournament grid.
I'm afraid that in the heat of the battle I made a mistake by posting the wrong grid.
Since the games are still available, I manually corrected the grid, standings only.
The correction has a big impact on the standing for Amoeba because he won the last game.

The corrected final standings for the July tourney are:

Code: Select all

:Tourney Players: Round 9 of 9 
:
:     Name              Rating Score Perfrm Upset  Results 
:     ----------------- ------ ----- ------ ------ ------- 
:  1 +BelFish           [2959]  7.0  [3036] [  59] =14w +07b =02w +12b +09w =03b =04w +05b +06w 
:  2 +NightmareX        [2844]  6.5  [2975] [ 432] +26w =03b =01b +18w +08w =04b +05w =09b =11w 
:  3 +LczTinker         [3055]  6.0  [2979] [   0] +18w =02w =06b +17w =05b =01w +11b =04w =07b 
:  4 +Honey             [2982]  6.0  [2955] [  36] =07w =06b +14w +11w +13b =02w =01b =03b =08w 
:  5 +Schooner          [2926]  5.5  [2832] [  68] +20w +17w =13b =08b =03w +07w -02b -01w +16b 
:  6 +WaspX             [2846]  5.5  [2871] [ 288] +22b =04w =03w -10b +14b +08w =09w +17b -01b 
:  7 +ArasanX           [2934]  5.5  [2839] [   0] +19w +10w =09b =05w -02b -06b +07w +12b =03w
:  8 +LeelaRB           [2902]  5.5  [2881] [  16] +15w +11b =08w =13w -01b +17w =06b =02w =04b
:  9 +atomNightmare     [2600]  5.5  [2720] [ 344] =04b -01w +23b +19w +10w -05b -08b +18w +17w
: 10 +Marvin            [2749]  5.0  [2620] [  97] +16w -08b =19w +06w -07b =12b =15w =11b +18b 
: 11 +Goldbar           [2736]  5.0  [2712] [ 189] +24b -09w +21w -04b +15w +13b -03w =10w =02b
: 12 +Amoeba            [2739]  5.0  [2617] [   0] +23w -05b +15w -03b +21w -09b +14w -06w +19b 
: 13 +WaDuuttie         [2635]  5.0  [2512] [  57] +25b -13b +22w -01w =18b =10w +21b -08w +24b 
: 14 -Nemorino          [2865]  4.5  [2615] [  48] +21b +12w =05w =09b -04w -11w -18b =19b +23w 
: 15 +atomGoldbar       [2541]  4.5  [2456] [ 240] =01b =18w -04b +22b -06w =16w -17b +26w +25b 
: 16 +rpiArminius       [2329]  4.5  [2518] [ 249] -09b +24w -17b +16w -11b +22w =10b +20w -05w 
: 17 +RavenEngine       [2080]  4.5  [2287] [ 452] -10b -19b +25w -15b +24w =14b +23w +21w -09b
: 18 +Arminius          [2604]  4.0  [2630] [ 276] -03b =14b +20w -02b =12w +19b +13w -07b -10w 
: 19 +CoronaVirusChess  [2416]  4.0  [2535] [ 390] -08b +16w =10b -07b +20w -18w +26b =13w -12w 
: 20 +Myrddin           [2368]  4.0  [2234] [   0] -05b =23w -18b +26w -19b =21w +25b -15b +22w 
: 21 +CTdccc            [2302]  3.5  [2257] [  33] -13w +25w -11b +24w -17b =20b -12w -16b +26b 
: 22 +feligres          [2186]  3.5  [2183] [   0] -06w +26b -12b -14w +23b -15b =24w +25w -20b 
: 23 +Abbess            [2038]  3.0  [2148] [ 211] -17b =20b -07w +25b -22w +26w -16b =24w -14b 
: 24 +Skiull            [1982]  3.0  [2091] [ 232] -11w -15b +26w -21b -16b +25w =22b =23b -13w
: 25 -Skipper           [1629]  1.0  [1935] [ 455] -12w -21b -16b -23w +26b -24b -20w -22b -15w 
: 26 +atomSargonI       [2084]  0.0  [1856] [   0] -02b -22w -24b -20b -25w -23b -19w -14b -21w 
:
:     Average Rating    2551.2