Modern Times wrote: ↑Tue Jun 08, 2021 5:38 am
AndrewGrant wrote: ↑Tue Jun 08, 2021 4:44 am
If you see any elo gain at all, then the Network is working as intended. Not having a working Network would be - hundreds of elo, since the NNUE would be spitting out random evals essentially.
OK great news, thanks for confirming.
I am indeed running the Ethereal-13.00-pext-avx2 executable.
Then I'de expect your test to eventually end in the 50-90 range. But YMMV, and the opponent pool plays a role in whether it hits the upper end or the lower end. Ethereal tends to lose to Stockfish and its derivatives (NNUE wise, or Fire / Houdini) more so than it loses to the rest of the AB field. That can be seen on CCRLs elo diffs on individual breakdowns quite broadly over the last few releases. So a pool with stockfishes will deflate the rating. A pool with only Komodo, Xiphos, Laser, Igel (original net version), will inflate the rating.
:shrug: Shall see. Not too concerned either way, as selfplay testing has proved itself reliable for years now
For sanity's sake, here is the regression test with the 8movesv3 book and LTC (60s here, but effectively 100s due to OpenBench worker speeds, scaled to Fishtest).
http://chess.grantnet.us/test/11256/ [ELO | 76.31 +- 6.21 (95%)] This method has been the predictor for Ethereal at CCRL for the last few years, and is generally in the ball park. Where as most other lists see variations based on their book preferences.