Do you have a stats tool for determining (well, in quite restricted set up, find an engine with which yours is roughly equal at 50%) the average length of game wins? Or even the statistical spread?
On-line engine blitz tourney July
Moderators: hgm, Rebel, chrisw
-
- Posts: 4319
- Joined: Tue Apr 03, 2012 4:28 pm
Re: On-line engine blitz tourney July
-
- Posts: 6995
- Joined: Thu Aug 18, 2011 12:04 pm
Re: On-line engine blitz tourney July
I looked, but don't have.
90% of coding is debugging, the other 10% is writing bugs.
-
- Posts: 4319
- Joined: Tue Apr 03, 2012 4:28 pm
Re: On-line engine blitz tourney July
Useful? Get a bunch of engines, say around 3000, that are mostly equal in matches against each other.Rebel wrote: ↑Sun Jul 19, 2020 8:42 amI looked, but don't have.
Play tournament, let’s say 10000 games each at bullet.
You get the usual roughly 30/30/30 split WDL. Same random book for all, same end of game decision rules.
Take the wins for each engine and compute the mean length of game. On the basis that short wins are probably connected to mating attacks, you should be able to list the engines on an exciting/boring scale.
-
- Posts: 4319
- Joined: Tue Apr 03, 2012 4:28 pm
Re: On-line engine blitz tourney July
Rebel wrote: ↑Sun Jul 19, 2020 8:42 amI looked, but don't have.
Well, it kind of works. I grabbed two recent 10000 game test runs against a couple of equal-ish sparring partners, I think around 3000 CCRL Elo, won't say who thjey, because that is not the point, but one is in the Stockfish paradigm, and the other I don't know, save that Ed suggested it as an opponent because it plays interesting chess. Did the histograms of winning game length for mine paired with each of the other two.
Results are, (what I expected)
Mine, mean game length to win vs engine1: 97.6 ply
Engine1, mean game length to win vs mine: 111.1 ply
Mine, mean game length to win vs engine2: 100.0 ply
Engine2, mean game length to win vs mine: 111.8 ply
With a bit of refinement his might be a possible metric for play style. Fireworks engines should win their wins quickly. They'll probably go down in flames quickly too, of course.
Edit: median is useful. 50% of the wins are shorter than median value.
vs engine1, median is 91, opponent 104
vs engine2, 91, 108 (everything is half moves, btw)
-
- Posts: 6995
- Joined: Thu Aug 18, 2011 12:04 pm
Re: On-line engine blitz tourney July
Nice way to measure playing style. Is the tool available? Else I must write it myself and I am so busy.
Correction, that should read as lazy.
Correction, that should read as lazy.
90% of coding is debugging, the other 10% is writing bugs.
-
- Posts: 4319
- Joined: Tue Apr 03, 2012 4:28 pm
Re: On-line engine blitz tourney July
Sure, what do you want it to do? Ideally, give it paired PGN batches for a bunch of similar engines. Each PGN batch off same opening book, same ply depth, same termination rule, same time control (fast).
Then process the EPDs, report median and mean winning game lengths and save some pretty histograms? If you want to prepare masses of paired games, I'll construct the results analyser for you.
Correction, that should read as lazy.
-
- Posts: 1563
- Joined: Thu Jul 16, 2009 10:47 am
- Location: Almere, The Netherlands
Re: On-line engine blitz tourney July
Brian Richardson noticed me that there were some unfinished games in the final tournament grid.
I'm afraid that in the heat of the battle I made a mistake by posting the wrong grid.
Since the games are still available, I manually corrected the grid, standings only.
The correction has a big impact on the standing for Amoeba because he won the last game.
The corrected final standings for the July tourney are:
I'm afraid that in the heat of the battle I made a mistake by posting the wrong grid.
Since the games are still available, I manually corrected the grid, standings only.
The correction has a big impact on the standing for Amoeba because he won the last game.
The corrected final standings for the July tourney are:
Code: Select all
:Tourney Players: Round 9 of 9
:
: Name Rating Score Perfrm Upset Results
: ----------------- ------ ----- ------ ------ -------
: 1 +BelFish [2959] 7.0 [3036] [ 59] =14w +07b =02w +12b +09w =03b =04w +05b +06w
: 2 +NightmareX [2844] 6.5 [2975] [ 432] +26w =03b =01b +18w +08w =04b +05w =09b =11w
: 3 +LczTinker [3055] 6.0 [2979] [ 0] +18w =02w =06b +17w =05b =01w +11b =04w =07b
: 4 +Honey [2982] 6.0 [2955] [ 36] =07w =06b +14w +11w +13b =02w =01b =03b =08w
: 5 +Schooner [2926] 5.5 [2832] [ 68] +20w +17w =13b =08b =03w +07w -02b -01w +16b
: 6 +WaspX [2846] 5.5 [2871] [ 288] +22b =04w =03w -10b +14b +08w =09w +17b -01b
: 7 +ArasanX [2934] 5.5 [2839] [ 0] +19w +10w =09b =05w -02b -06b +07w +12b =03w
: 8 +LeelaRB [2902] 5.5 [2881] [ 16] +15w +11b =08w =13w -01b +17w =06b =02w =04b
: 9 +atomNightmare [2600] 5.5 [2720] [ 344] =04b -01w +23b +19w +10w -05b -08b +18w +17w
: 10 +Marvin [2749] 5.0 [2620] [ 97] +16w -08b =19w +06w -07b =12b =15w =11b +18b
: 11 +Goldbar [2736] 5.0 [2712] [ 189] +24b -09w +21w -04b +15w +13b -03w =10w =02b
: 12 +Amoeba [2739] 5.0 [2617] [ 0] +23w -05b +15w -03b +21w -09b +14w -06w +19b
: 13 +WaDuuttie [2635] 5.0 [2512] [ 57] +25b -13b +22w -01w =18b =10w +21b -08w +24b
: 14 -Nemorino [2865] 4.5 [2615] [ 48] +21b +12w =05w =09b -04w -11w -18b =19b +23w
: 15 +atomGoldbar [2541] 4.5 [2456] [ 240] =01b =18w -04b +22b -06w =16w -17b +26w +25b
: 16 +rpiArminius [2329] 4.5 [2518] [ 249] -09b +24w -17b +16w -11b +22w =10b +20w -05w
: 17 +RavenEngine [2080] 4.5 [2287] [ 452] -10b -19b +25w -15b +24w =14b +23w +21w -09b
: 18 +Arminius [2604] 4.0 [2630] [ 276] -03b =14b +20w -02b =12w +19b +13w -07b -10w
: 19 +CoronaVirusChess [2416] 4.0 [2535] [ 390] -08b +16w =10b -07b +20w -18w +26b =13w -12w
: 20 +Myrddin [2368] 4.0 [2234] [ 0] -05b =23w -18b +26w -19b =21w +25b -15b +22w
: 21 +CTdccc [2302] 3.5 [2257] [ 33] -13w +25w -11b +24w -17b =20b -12w -16b +26b
: 22 +feligres [2186] 3.5 [2183] [ 0] -06w +26b -12b -14w +23b -15b =24w +25w -20b
: 23 +Abbess [2038] 3.0 [2148] [ 211] -17b =20b -07w +25b -22w +26w -16b =24w -14b
: 24 +Skiull [1982] 3.0 [2091] [ 232] -11w -15b +26w -21b -16b +25w =22b =23b -13w
: 25 -Skipper [1629] 1.0 [1935] [ 455] -12w -21b -16b -23w +26b -24b -20w -22b -15w
: 26 +atomSargonI [2084] 0.0 [1856] [ 0] -02b -22w -24b -20b -25w -23b -19w -14b -21w
:
: Average Rating 2551.2