That would be policy. At root, after a decent number of nodes, policy is dwarfed by evaluation. If your complaint is that policy is particularly refined in the opening, fair enough. Tournament directors should use a TC that gives Leela at least half a second to think in the opening, which puts aside concerns about policy dictating Leela's opening preferences. Are there any tournaments that don't do this?

Milos wrote: ↑Mon Sep 14, 2020 4:43 am
You, my friend, have a conflict with basic logic. The "fact" that a net trained with 18 pieces can't memorize the opening with 32 pieces does nothing to refute that a net trained with 32 pieces can memorize the opening with 32 pieces. You are just repeating your non sequitur argument, nothing else.
To simplify the argument so you can follow it: an NN is equal to book + evaluation. When you enter a 32-piece position into an NN trained only on 18-piece positions, the NN performs only eval. When you enter a 32-piece position that the net has actually been trained on, it outputs a book score adjusted by its eval.
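The "policy dwarfed by evaluation" claim above can be illustrated with the standard PUCT selection formula (a simplification of Lc0's actual search, shown here only as a sketch): the policy prior enters through the exploration term U, which shrinks roughly as 1/sqrt(N) relative to the accumulated Q evaluation as visits grow.

```python
import math

def puct_score(q, prior, parent_visits, child_visits, c_puct=1.5):
    """Q (averaged eval) term plus U (policy-driven exploration) term."""
    u = c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)
    return q + u

# Policy influence for a move with prior 0.4, as the search deepens:
for n_parent, n_child in [(1, 0), (100, 40), (10000, 4000)]:
    u = puct_score(0.0, 0.4, n_parent, n_child)
    print(f"parent={n_parent:>6} child={n_child:>5} policy term U={u:.4f}")
```

At one node the policy term dominates entirely; after a few thousand visits it contributes only hundredths of a pawn's worth of score, and Q takes over.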
Asking to people who believe Leela NN is a book, what they think about SF NN now?
- Posts: 144
- Joined: Sun Oct 14, 2018 8:21 pm
- Full name: JSmith
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
- Posts: 343
- Joined: Sun Aug 25, 2019 8:33 am
- Full name: .
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
Overfitting is basically memorization. But I don't think Lc0 overfits much. Using modern methods you can now train huge neural nets that don't overfit, and Lc0's nets aren't even large.

AndrewGrant wrote: ↑Sun Sep 13, 2020 11:00 pm
I'm behind NN research by four decades and even I'm aware that it's a well-documented phenomenon that NNs bake in a memorization of the dataset.

Ovyron wrote: ↑Sun Sep 13, 2020 10:13 pm
Whenever an NN checks the opening position, it's the first time it sees it; to bring 1.e4 and 1.d4 to the top it has to do so from scratch. It just does this extremely intelligently and fast: after a single generated position it recognizes the patterns it has learned, and that can suffice to rank them at the top. But this has nothing to do with the opening position specifically. If you switch the pieces around, it will also come up with a decent move after a single position, and the only reason it's not as good is that the person training it hasn't shown it the patterns that arise from the piece-switched position.

https://arxiv.org/pdf/1611.03530.pdf — read that, and if you still don't think it's possible for Leela to "memorize" a selection of openings, please email the authors at the top of that document; they will be far less generous than users here.
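The central point of that paper (that networks with more parameters than training samples can fit even labels carrying no signal at all) is easy to reproduce at toy scale. The sketch below is an illustration in the spirit of the paper, not its actual experiments: a one-hidden-layer random-features network with width far above the sample count fits completely random labels exactly.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, dim, width = 32, 10, 256   # width >> n_samples: overparameterized

X = rng.standard_normal((n_samples, dim))
y = rng.integers(0, 2, n_samples) * 2.0 - 1.0   # random +/-1 labels: nothing to "learn"

# Fixed random hidden layer; only the output weights are fitted.
W = rng.standard_normal((dim, width))
H = np.tanh(X @ W)                        # hidden activations, shape (32, 256)
v, *_ = np.linalg.lstsq(H, y, rcond=None) # least-squares output weights

preds = np.sign(H @ v)
print("train accuracy on random labels:", (preds == y).mean())  # 1.0: pure memorization
```

Since the 32 activation vectors are generically independent in 256 dimensions, the least-squares fit interpolates the labels exactly; the "accuracy" is 100% memorization, with zero generalizable signal by construction.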
- Posts: 89
- Joined: Sat Nov 09, 2019 3:24 pm
- Full name: .
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
"Overfitting is memorization" does not imply that a net which isn't overfitting cannot memorize too.
- Posts: 4190
- Joined: Wed Nov 25, 2009 1:47 am
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
That is a huge simplification, and a logically problematic one. While overfitting can be an indication of memorization, there is zero indication that nets which don't overfit have no memorization. Assuming that is plain wrong.

mmt wrote: ↑Mon Sep 14, 2020 7:29 am
Overfitting is basically memorization. But I don't think Lc0 overfits much. Using modern methods you can now train huge neural nets that don't overfit, and Lc0's nets aren't even large.

AndrewGrant wrote: ↑Sun Sep 13, 2020 11:00 pm
I'm behind NN research by four decades and even I'm aware that it's a well-documented phenomenon that NNs bake in a memorization of the dataset.

Ovyron wrote: ↑Sun Sep 13, 2020 10:13 pm
Whenever an NN checks the opening position, it's the first time it sees it; to bring 1.e4 and 1.d4 to the top it has to do so from scratch. It just does this extremely intelligently and fast: after a single generated position it recognizes the patterns it has learned, and that can suffice to rank them at the top. But this has nothing to do with the opening position specifically. If you switch the pieces around, it will also come up with a decent move after a single position, and the only reason it's not as good is that the person training it hasn't shown it the patterns that arise from the piece-switched position.

https://arxiv.org/pdf/1611.03530.pdf — read that, and if you still don't think it's possible for Leela to "memorize" a selection of openings, please email the authors at the top of that document; they will be far less generous than users here.
- Posts: 4190
- Joined: Wed Nov 25, 2009 1:47 am
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
The "book" (i.e. policy) information certainly helps Lc0 quite a lot, so if you want fair tournament conditions you either need to give A/B engines much more time in the opening than Leela gets, or give them access to an opening book. The whole discussion started from the point that current tournament conditions are not fair to A/B engines.

cucumber wrote: ↑Mon Sep 14, 2020 6:19 am
That would be policy. At root, after a decent number of nodes, policy is dwarfed by evaluation. If your complaint is that policy is particularly refined in the opening, fair enough. Tournament directors should use a TC that gives Leela at least half a second to think in the opening, which puts aside concerns about policy dictating Leela's opening preferences. Are there any tournaments that don't do this?

Milos wrote: ↑Mon Sep 14, 2020 4:43 am
You, my friend, have a conflict with basic logic. The "fact" that a net trained with 18 pieces can't memorize the opening with 32 pieces does nothing to refute that a net trained with 32 pieces can memorize the opening with 32 pieces. You are just repeating your non sequitur argument, nothing else.
To simplify the argument so you can follow it: an NN is equal to book + evaluation. When you enter a 32-piece position into an NN trained only on 18-piece positions, the NN performs only eval. When you enter a 32-piece position that the net has actually been trained on, it outputs a book score adjusted by its eval.
- Posts: 550
- Joined: Tue Nov 19, 2019 8:48 pm
- Full name: Alayan Feh
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
With a pure eval, and without something like Leela's policy head that is explicitly trained to suggest moves that were successful in training/tuning games (rather than merely keeping eval parameters that happen to produce successful moves overall), memorizing is harder.
Another element is that the parameter space of a classical eval is much smaller. The memorization capacity of NNs is related to the huge size of their parameter space, which in many instances far exceeds the size of the training dataset. The study linked by Andrew earlier shows that CNNs can be made to fit arbitrary datasets.
Nonetheless, you'd have a point if SPSA tuning were done from the start position. That would definitely cause some memorization, however minor and however hidden in normal-looking parameters it might be.
SF's SPSA tuning is done from a book containing tens of thousands of positions, however. That is much bigger than SF's eval parameter space.
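For readers unfamiliar with the tuning method being discussed, here is a minimal sketch of SPSA (simultaneous perturbation stochastic approximation): it estimates a gradient from just two noisy evaluations of the objective along a random ±1 perturbation of all parameters at once. The quadratic loss below is a toy stand-in for "engine strength as a function of eval parameters", not Stockfish's actual game-based objective.

```python
import numpy as np

def spsa(loss, theta, steps=2000, a=0.1, c=0.1, seed=0):
    """Minimal SPSA: two loss evaluations per step, whatever the dimension."""
    rng = np.random.default_rng(seed)
    for k in range(1, steps + 1):
        ak = a / k ** 0.602                      # standard SPSA gain schedules
        ck = c / k ** 0.101
        delta = rng.choice([-1.0, 1.0], size=theta.shape)
        # Per-component gradient estimate from a single pair of probes:
        g_hat = (loss(theta + ck * delta) - loss(theta - ck * delta)) / (2 * ck) / delta
        theta = theta - ak * g_hat
    return theta

target = np.array([1.0, -2.0, 0.5])              # "optimal" eval parameters (toy)
loss = lambda th: float(np.sum((th - target) ** 2))

theta = spsa(loss, np.zeros(3))
print(theta)  # converges close to [1.0, -2.0, 0.5]
```

The key property, relevant to the parameter-space argument above, is that the tuned parameters only absorb whatever improves the objective averaged over all test positions; with tens of thousands of starting positions and far fewer parameters, there is no room to store per-position answers.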
- Posts: 1631
- Joined: Tue Aug 21, 2018 7:52 pm
- Full name: Dietrich Kappe
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
Crickets. Just people who like to argue.

dkappe wrote: ↑Mon Sep 14, 2020 5:18 am
So your hypothesis is that a Leela-type network memorizes openings. How do we test this hypothesis? What evidence, for instance, would show it to be false? If there is no possible way for the hypothesis to be disproven, then it is vacuous.

Milos wrote: ↑Mon Sep 14, 2020 4:43 am
You, my friend, have a conflict with basic logic. The "fact" that a net trained with 18 pieces can't memorize the opening with 32 pieces does nothing to refute that a net trained with 32 pieces can memorize the opening with 32 pieces. You are just repeating your non sequitur argument, nothing else.
To simplify the argument so you can follow it: an NN is equal to book + evaluation. When you enter a 32-piece position into an NN trained only on 18-piece positions, the NN performs only eval. When you enter a 32-piece position that the net has actually been trained on, it outputs a book score adjusted by its eval.
So, how would one go about trying to disprove it?
Fat Titz by Stockfish, the engine with the bodaciously big net. Remember: size matters. If you want to learn more about this engine just google for "Fat Titz".
- Posts: 144
- Joined: Sun Oct 14, 2018 8:21 pm
- Full name: JSmith
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
I'm not convinced.

Alayan wrote: ↑Mon Sep 14, 2020 2:07 pm
With a pure eval, and without something like Leela's policy head that is explicitly trained to suggest moves that were successful in training/tuning games (rather than merely keeping eval parameters that happen to produce successful moves overall), memorizing is harder.
Another element is that the parameter space of a classical eval is much smaller. The memorization capacity of NNs is related to the huge size of their parameter space, which in many instances far exceeds the size of the training dataset. The study linked by Andrew earlier shows that CNNs can be made to fit arbitrary datasets.
Nonetheless, you'd have a point if SPSA tuning were done from the start position. That would definitely cause some memorization, however minor and however hidden in normal-looking parameters it might be.
SF's SPSA tuning is done from a book containing tens of thousands of positions, however. That is much bigger than SF's eval parameter space.
SPSA has done a ridiculous amount to teach SF theory. And SF's parameter space can capture more than enough for an optimizer to make it function as a highly compressed book.
Stockfish 070620 at depth 12: The PV in its entirety follows theory for 12 straight plies with 72,023 nodes.
NNUE at depth 12: PV follows theory for 9 straight plies with a mere 9,781 nodes.
Leela, with the latest T60 net (64988), follows theory for 4 plies with 10,381 nodes before playing weird moves.
Ethereal, with 161,564 nodes, is able to follow theory for four plies. Clearly, Ethereal demonstrates highly advanced opening-book tendencies just like Leela.
NNUE and classical eval both follow theory perfectly well at laughably small node counts, where other engines (Leela included) struggle tremendously. Even the classical eval follows some amount of theory at nearly any depth, regardless of node count.
Stockfish is the best book out there.
- Posts: 144
- Joined: Sun Oct 14, 2018 8:21 pm
- Full name: JSmith
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
SF12 knows opening theory better than Leela does. In fact, both SF11 and SF12 are far more efficient at rediscovering theory than either Leela or non-SPSA'd engines like Ethereal.

Milos wrote: ↑Mon Sep 14, 2020 11:43 am
The "book" (i.e. policy) information certainly helps Lc0 quite a lot, so if you want fair tournament conditions you either need to give A/B engines much more time in the opening than Leela gets, or give them access to an opening book. The whole discussion started from the point that current tournament conditions are not fair to A/B engines.

cucumber wrote: ↑Mon Sep 14, 2020 6:19 am
That would be policy. At root, after a decent number of nodes, policy is dwarfed by evaluation. If your complaint is that policy is particularly refined in the opening, fair enough. Tournament directors should use a TC that gives Leela at least half a second to think in the opening, which puts aside concerns about policy dictating Leela's opening preferences. Are there any tournaments that don't do this?

Milos wrote: ↑Mon Sep 14, 2020 4:43 am
You, my friend, have a conflict with basic logic. The "fact" that a net trained with 18 pieces can't memorize the opening with 32 pieces does nothing to refute that a net trained with 32 pieces can memorize the opening with 32 pieces. You are just repeating your non sequitur argument, nothing else.
To simplify the argument so you can follow it: an NN is equal to book + evaluation. When you enter a 32-piece position into an NN trained only on 18-piece positions, the NN performs only eval. When you enter a 32-piece position that the net has actually been trained on, it outputs a book score adjusted by its eval.
Do you think we should give other engines more time in the opening when playing against Stockfish as well, then? Do we only penalize engines with policy heads? If so, can large decision trees be used in place of a neural policy head? Is it just the neural network structure that's problematic? How many parameters can search and move ordering code have before we need to give other engines extra time?
Last edited by cucumber on Mon Sep 14, 2020 9:49 pm, edited 1 time in total.
- Posts: 546
- Joined: Sat Aug 17, 2013 12:36 am
Re: Asking to people who believe Leela NN is a book, what they think about SF NN now?
On 1 node (or equal node counts), or via search? Because I have a hard time believing the former.

cucumber wrote:
SF12 knows opening theory better than Leela does.