I find Pohl’s UHO openings very useful for testing. But there are some questions I have, in particular this one:
Is a test between two engines using the most common uho book predictive of their performance using a balanced book? It seems intuitively correct, but I don’t recall anyone doing an experiment on this.
The trouble with UHO
Moderator: Ras
-
dkappe
- Posts: 1632
- Joined: Tue Aug 21, 2018 7:52 pm
- Full name: Dietrich Kappe
The trouble with UHO
Fat Titz by Stockfish, the engine with the bodaciously big net. Remember: size matters. If you want to learn more about this engine just google for "Fat Titz".
-
Graham Banks
- Posts: 45115
- Joined: Sun Feb 26, 2006 10:52 am
- Location: Auckland, NZ
Re: The trouble with UHO
First of all, I think we need to define what the evaluation parameters are for balanced and unbalanced books.
gbanksnz at gmail.com
-
Graham Banks
- Posts: 45115
- Joined: Sun Feb 26, 2006 10:52 am
- Location: Auckland, NZ
Re: The trouble with UHO
My conditions for a balanced opening line is that in the first 10 moves out of book, there should be at least one evaluation that is less than 0.70.
gbanksnz at gmail.com
-
Krzysztof Grzelak
- Posts: 1588
- Joined: Tue Jul 15, 2014 12:47 pm
Re: The trouble with UHO
I think between 0.30 - 0.40. Weaker engines at 0.70 might not pick up anymore.Graham Banks wrote: ↑Tue Dec 20, 2022 11:34 pm My conditions for a balanced opening line is that in the first 10 moves out of book, there should be at least one evaluation that is less than 0.70.
-
Graham Banks
- Posts: 45115
- Joined: Sun Feb 26, 2006 10:52 am
- Location: Auckland, NZ
Re: The trouble with UHO
Most lines in my balanced books would meet that, but there are a few outliers.Krzysztof Grzelak wrote: ↑Wed Dec 21, 2022 12:00 amI think between 0.30 - 0.40. Weaker engines at 0.70 might not pick up anymore.Graham Banks wrote: ↑Tue Dec 20, 2022 11:34 pm My conditions for a balanced opening line is that in the first 10 moves out of book, there should be at least one evaluation that is less than 0.70.
gbanksnz at gmail.com
-
Graham Banks
- Posts: 45115
- Joined: Sun Feb 26, 2006 10:52 am
- Location: Auckland, NZ
Re: The trouble with UHO
Wouldn't looking at the various rating lists give you that answer?
gbanksnz at gmail.com
-
dkappe
- Posts: 1632
- Joined: Tue Aug 21, 2018 7:52 pm
- Full name: Dietrich Kappe
Re: The trouble with UHO
Most of the uho tests are bullet/private.Graham Banks wrote: ↑Wed Dec 21, 2022 12:12 amWouldn't looking at the various rating lists give you that answer?
Fat Titz by Stockfish, the engine with the bodaciously big net. Remember: size matters. If you want to learn more about this engine just google for "Fat Titz".
-
lkaufman
- Posts: 6279
- Joined: Sun Jan 10, 2010 6:15 am
- Location: Maryland USA
- Full name: Larry Kaufman
Re: The trouble with UHO
The question needs clarification. It is extremely obvious that testing with uho books is not predictive of elo differences between unequal engines, it greatly exaggerates such differences (unless the differences are huge). I suppose you mean, is the engine that wins with UHO books also likely to win a long match against the same engine with balanced books, never mind the score? Based on all my testing, I would say the correlation (in this later sense) is high but not perfect. Especially with neural nets, some may be better trained on "normal" openings, and others on positions not likely to arise by choice. So search changes probably will benefit the same engine regardless of book, but net changes could well favor one book or the other. But then, you can also say that with long time controls and many cores and top engines, normal books will always show zero elo (roughly) since nearly every game will be a draw, in which case the question has no meaningful answer.dkappe wrote: ↑Tue Dec 20, 2022 11:20 pm I find Pohl’s UHO openings very useful for testing. But there are some questions I have, in particular this one:
Is a test between two engines using the most common uho book predictive of their performance using a balanced book? It seems intuitively correct, but I don’t recall anyone doing an experiment on this.
Komodo rules!
-
pohl4711
- Posts: 2845
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: The trouble with UHO
IMO, you said all, that can be said here. I dont think, that there is a need for more clarification. Especially because on my website, you find testruns with classical books compared with uho (and other unbalanced openings concepts) with many games and 2 different thinking-times,lkaufman wrote: ↑Wed Dec 21, 2022 1:30 amThe question needs clarification. It is extremely obvious that testing with uho books is not predictive of elo differences between unequal engines, it greatly exaggerates such differences (unless the differences are huge). I suppose you mean, is the engine that wins with UHO books also likely to win a long match against the same engine with balanced books, never mind the score? Based on all my testing, I would say the correlation (in this later sense) is high but not perfect. Especially with neural nets, some may be better trained on "normal" openings, and others on positions not likely to arise by choice. So search changes probably will benefit the same engine regardless of book, but net changes could well favor one book or the other. But then, you can also say that with long time controls and many cores and top engines, normal books will always show zero elo (roughly) since nearly every game will be a draw, in which case the question has no meaningful answer.dkappe wrote: ↑Tue Dec 20, 2022 11:20 pm I find Pohl’s UHO openings very useful for testing. But there are some questions I have, in particular this one:
Is a test between two engines using the most common uho book predictive of their performance using a balanced book? It seems intuitively correct, but I don’t recall anyone doing an experiment on this.
3min+1sec here: https://www.sp-cc.de/anti-draw-openings.htm
5min+3sec here: https://www.sp-cc.de/uho_2022.htm
If there is any doubt, because of engine with nnue nets trained with uho, my AntiDraw openings collection offer a lot more opening concepts, containing unbalanced openings (Chess324, Drawkiller, NBC, NBSC etc.), just download the whole package and make your choice!
-
pohl4711
- Posts: 2845
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: The trouble with UHO
Simple: Elo-spreading and draw-rate. What else? That were the parameters for me, when I built my unbalanced openings.Graham Banks wrote: ↑Tue Dec 20, 2022 11:27 pm First of all, I think we need to define what the evaluation parameters are for balanced and unbalanced books.