Add to that that I used adjudication in Cutechess-Cli, !!!chrisw wrote: ↑Sun Jul 19, 2020 5:30 pmThat's median plycount of 117 NN,125 SF and no wins at all before 88 ply. Of course, only 100 games. But.Laskos wrote: ↑Sun Jul 19, 2020 5:25 pmchrisw wrote: ↑Sun Jul 19, 2020 4:44 pmOk, thanks, first assessment is a bit disappointing. There's no winning game by either side game that's over before move 80, median game length is 111 ply for SF_NNUE and 118 ply for SF_dev, but actually it's longer than that because you're starting from FENs, and cutechess is counting from the FEN position, maybe those FENs are 16 ply in? So median game length is 125-135 ply. Took at look at the five shortest NNUE wins, they last for at least 82 ply and all were endings. No fireworks, nothing exciting.Laskos wrote: ↑Sun Jul 19, 2020 3:09 pm
Ok, just played a match in Cutechess-Cli, 100 games at 15'' + 0.25'' between SF NNUE GK and SF_dev
The result is here:
15'' + 0.25''
Score of SF_NNUE vs SF_dev: 31 - 17 - 52 [0.570] 100
... SF_NNUE playing White: 26 - 0 - 24 [0.760] 50
... SF_NNUE playing Black: 5 - 17 - 28 [0.380] 50
... White vs Black: 43 - 5 - 52 [0.690] 100
Elo difference: 49.0 +/- 47.4, LOS: 97.8 %, DrawRatio: 52.0 %
Finished match
The PGN is here:
http://s000.tinyupload.com/?file_id=838 ... 7353082500
Critically, I'm getting the initial impression the NNUE is not doing anything AZ-ish to Stockfish, and that implies simply the technique squashes Stockfish into a slightly (or markedly, who knows) more effective version of the same thing. Gradual grind out into the ending. Nothing new or superior, just more of the same. Disappointing, I was hoping the NN was going to have found some "new" knowledge, but, initially, looks like not.
Thanks Chris for the assessment. The openings are 6-pliers.
Code: Select all
-draw movenumber=60 movecount=3 score=20 -resign movecount=3 score=900 -tb C:\syzygy5