Alayan wrote: ↑
Sat Nov 21, 2020 4:41 pm
pohl4711 wrote: ↑
Sat Nov 21, 2020 7:06 am
My Unbalanced Human Openings worked extremly well! The overall draw-rate was only 50.9%. The other longtime-testruns, I did before (Stockfish vs. Lc0) with the same conditions, but played with the Noomen lowdraw openings, had an overall draw-rate around 66% (!). A draw-rate of only 50.9% is an extremly low value, when 3 top-engines play with such long thinking-time! Using any classical opening set should give a draw-rate somewhere above 70%-75% here (at least!).
The best measure is not draw rate but "share of games won without losing the reverse". I expect UHO would still dominate the Noomen openings and the classical opening set in this measure, but I'd be interested in the numbers with all three opening sets...
Thats because, in my testruns for my opening-sets compared to other sets, I measure the Elo-spreading of results. Because 1:1 pairs (both games of Engine A vs. Engine B with the same opening (repeated with reversed colors) are won for white (or black)) are as bad as 2 draws here for Elo-spreading (because both engines get 1 point of 2 (=50%), so the Elo-spreading is lowered). So, measuring the Elo-spreading is the real measuring the quality of openings, not the draw-rate.
In the UHO-download all played testgames are included. But I dont know, how to count 1:1 or double draw of one Engine-pairing playing one opening twice, automatically. But, if you want to look on these results - no problem. There are testgames of classical opening-sets, UHO sets-tests...
Overview possible 9 results of Engine A vs. Engine B, playing one opening in 2 games (repeated with reversed colors):
good result: Not a 50%-50% score (not 1-1 points): Increases Elo-spreading (or lower it, if the weaker engine scores more than 50%)
bad result: 50%-50% score (1-1 points): Always
1) 1-0, 1-0 : bad
2) 1-0, draw: good
3) 1-0, 0-1: good (very good! 2-0 for one Engine!)
4) draw, 1-0: good
5) draw, draw: bad
6) draw, 0-1: good
7) 0-1, 1-0: good (very good! 2-0 for one Engine!)
8) 0-1, draw: good
9) 0-1, 0-1: bad