hgm wrote: ↑Fri Feb 22, 2019 5:25 pm
That it has never seen a 3-Pawn advantage early in the opening is not really relevant (because the NN has generalizing capability, and if it is trained to recognize that a 3-Pawn advantage in the late middle-game is winning, it would almost certainly think this is always the case, unless it was fed explicit training examples showing the contrary.
Maybe someone else will chime in and tell me I'm totally off-base, but I don't think that it generalizes in the same way that stockfish generalizes.
Stockfish thinks a knight is worth 300 (or whatever), so if it is up a knight, that's +300, up two knights is +600 and up five knights is +15000. For Leela, being up five knights is outside of the scope of what it has been trained on, so what it's eval returns is rather unpredictable. It's like doing linear regression where your inputs are almost all between -1 and 1, and then seeing what it predicts when you put in a 4. That's why it requires an order of magnitude more data - it doesn't generalize the same way, and the parts of the space that it hasn't been trained on can end up anywhere at all. And positions that seem similar to us or to Stockfish can totally baffle neural nets (and on the other hand, this is part of what makes them able to spot positional advantages that Stockfish is clueless about).
There are image recognition nets that can identify pictures of cats, but if you change a handful of pixels in an image, it starts calling them chairs, even though the two pictures look absolutely identical to a human.
Anyway, I don't have a horse in this race, so I'm going to try to stop arguing for the sake of arguing. I do think trying to get a good score conversion is a bit of trying to fit a square peg into a round hole.