Tested several NNUE nets with SF12 and Minic (depth=1) and it seems to me there is a lot of variation between nets, even between the last 7 Sergio nets.
http://rebel13.nl/dump/sf12.html
http://rebel13.nl/dump/minic.html
Maybe this softens Andrew's pain a bit.
What worries me about neural nets (also Lc0) is that they change your engine's playing style without you being aware of it. Oh wait, that already happens when you only look at the cute-chess results without ever replaying a game.
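At its core, a similarity table like the ones linked above boils down to one number per engine pair: run both engines over the same set of positions and count how often they pick the same move. A minimal sketch of that metric (not Ed's actual tool; the move lists stand in for real engine output):

```python
def similarity(moves_a, moves_b):
    """Percentage of positions where both engines chose the same move."""
    if len(moves_a) != len(moves_b):
        raise ValueError("move lists must cover the same positions")
    same = sum(1 for x, y in zip(moves_a, moves_b) if x == y)
    return 100.0 * same / len(moves_a)

# Two engines agreeing on one of two positions score 50%:
# similarity(["e2e4", "d2d4"], ["e2e4", "g1f3"]) -> 50.0
```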
NNUE variation
Moderators: hgm, Rebel, chrisw
-
- Posts: 6994
- Joined: Thu Aug 18, 2011 12:04 pm
NNUE variation
90% of coding is debugging, the other 10% is writing bugs.
-
- Posts: 476
- Joined: Sun Mar 17, 2019 12:00 pm
- Full name: Henk Drost
Re: NNUE variation
If anything, that shows you can just take a network, make some very minor modifications, and pass the similarity test with ease.
Could be interesting to test Vondele's net since that's a Sergio net with the last layer SPSA tuned.
https://tests.stockfishchess.org/nns
-
- Posts: 4607
- Joined: Wed Oct 01, 2008 6:33 am
- Location: Regensburg, Germany
- Full name: Guenther Simon
Re: NNUE variation
Raphexon wrote: ↑Tue Sep 29, 2020 11:38 am
If anything that shows that you can just take a network, add some very minor modifications and pass the similarity test with ease.
Could be interesting to test Vondele's net since that's a Sergio net with the last layer SPSA tuned.
https://tests.stockfishchess.org/nns
Maybe depth=1 similarity simply doesn't work as well with nnue...
There should be more tests with various depths and times too, to get a clearer picture regarding the simtest.
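One way to organize the sweep suggested here is a flat run list mixing fixed-depth and fixed-movetime conditions, which a simtest driver can then walk through. A small sketch (the tuple format is my own assumption, not any existing tool's):

```python
def sim_schedule(depths, times_ms):
    """Flat list of (mode, value) runs for a similarity sweep:
    fixed depths first, then fixed move times in milliseconds."""
    return [("depth", d) for d in depths] + [("movetime", t) for t in times_ms]

# sim_schedule([1, 4], [100]) -> [("depth", 1), ("depth", 4), ("movetime", 100)]
```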
https://rwbc-chess.de
trollwatch:
Talkchess nowadays is a joke - it is full of trolls/idiots/people stuck in the pleistocene > 80% of the posts fall into this category...
-
- Posts: 6994
- Joined: Thu Aug 18, 2011 12:04 pm
Re: NNUE variation
Guenther wrote: ↑Tue Sep 29, 2020 12:03 pm
Maybe depth=1 similarity simply doesn't work as good with nnue... There should be more tests with various depths and times too for getting a clearer picture regarding the simtest.
What do you suggest?
While Fire 7.1 scores 78% at depth=1, it is below 60% at 100ms.
It isn't simple.
90% of coding is debugging, the other 10% is writing bugs.
-
- Posts: 4607
- Joined: Wed Oct 01, 2008 6:33 am
- Location: Regensburg, Germany
- Full name: Guenther Simon
Re: NNUE variation
Rebel wrote: ↑Tue Sep 29, 2020 8:21 pm
What do you suggest? While Fire 7.1 scores 78% at depth=1, it is below 60% at 100ms. It isn't simple.
? I did not know Fire 7.1 is nnue...
We are in an nnue thread, so what has Fire to do here?
Ed, please read again what I wrote, especially sentence one, which is the basis for sentence two.
https://rwbc-chess.de
trollwatch:
Talkchess nowadays is a joke - it is full of trolls/idiots/people stuck in the pleistocene > 80% of the posts fall into this category...
-
- Posts: 6994
- Joined: Thu Aug 18, 2011 12:04 pm
Re: NNUE variation
Guenther wrote: ↑Tue Sep 29, 2020 8:24 pm
? I did not know Fire 7.1 is nnue... We are in a nnue thread, so what has Fire to do here? Ed, please read again what I wrote, especially sentence one, which is the base for sentence two.
Ok, more clearly: when search moves in (note the Fire example), the same will happen with NNUE: huge swings in similarity, and the longer the time control, the more similar the engines become. But I already made a start with 100ms, 250ms, 500ms, 1000ms and maybe even 4000ms to see if my prediction also holds for NNUE.
90% of coding is debugging, the other 10% is writing bugs.
-
- Posts: 4607
- Joined: Wed Oct 01, 2008 6:33 am
- Location: Regensburg, Germany
- Full name: Guenther Simon
Re: NNUE variation
Rebel wrote: ↑Tue Sep 29, 2020 8:34 pm
Ok, more clear, when search moves in (note the Fire example) the same will happen with NNUE, huge swings in similarity and the longer the time control the more similar engines become. But I already made a start with 100ms, 250, 500, 1000ms and maybe even 4000ms to see if my prediction for NNUE is also true.
I wouldn't even test such long TCs, just a few clock cycles (e.g. 16ms rounded up), so 20, 40, 50, and whatever is slightly above N = X/16, and depths 2-12 or so (not every depth needed).
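The idea above, as I read it, is to pick move times sitting just above successive multiples of the ~16ms OS timer tick, so each step buys roughly one extra tick of search. A sketch under that reading (the 4ms margin is my own illustrative choice):

```python
def tick_aligned_tcs(n, tick_ms=16, margin_ms=4):
    """Move times slightly above successive multiples of a ~16ms timer tick."""
    return [k * tick_ms + margin_ms for k in range(1, n + 1)]

# tick_aligned_tcs(3) -> [20, 36, 52], close to the 20/40/50ms suggested above
```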
https://rwbc-chess.de
trollwatch:
Talkchess nowadays is a joke - it is full of trolls/idiots/people stuck in the pleistocene > 80% of the posts fall into this category...
-
- Posts: 216
- Joined: Sun Jan 22, 2017 8:30 pm
- Location: Russia
Re: NNUE variation
Ed, you wrote yourself in the 2019 similarity report:
All tested engines in this report are of the alpha-beta type, so our proposed baseline is an alpha-beta baseline. When we test as many neural net engines as possible for our next report, we may well discover a different baseline figure for move variance, since neural net engines anecdotally evaluate positions differently to alpha-beta handcrafted evaluation functions.
To police CPU NN origins, you'll need to lower the thresholds. From the user perspective, though, I'm just happy to see the bigger variety regardless of the baseline.
-
- Posts: 6994
- Joined: Thu Aug 18, 2011 12:04 pm
Re: NNUE variation
Here are some results:
http://rebel13.nl/dump/nnue-depth-1.html
http://rebel13.nl/dump/nnue-100ms.html
http://rebel13.nl/dump/nnue-250ms.html
http://rebel13.nl/dump/nnue-500ms.html
No serious similarity in sight, but it rises with longer time controls (more strength), the same pattern as with pure AB engines.
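The rising trend across time controls is easiest to see by condensing each linked table into one number, e.g. the mean off-diagonal entry of the pairwise similarity matrix. A sketch with illustrative values only (not the actual results from the tables):

```python
def mean_offdiag(sim):
    """Mean off-diagonal entry of a square pairwise similarity matrix (%)."""
    n = len(sim)
    vals = [sim[i][j] for i in range(n) for j in range(n) if i != j]
    return sum(vals) / len(vals)

# Hypothetical 2-engine matrices at two time controls, showing the rise:
# mean_offdiag([[100, 55], [55, 100]]) -> 55.0   (short TC)
# mean_offdiag([[100, 62], [62, 100]]) -> 62.0   (longer TC)
```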
90% of coding is debugging, the other 10% is writing bugs.
-
- Posts: 6994
- Joined: Thu Aug 18, 2011 12:04 pm
Re: NNUE variation
Added 1000ms and 2000ms:
http://rebel13.nl/dump/nnue-1000ms.html
http://rebel13.nl/dump/nnue-2000ms.html
Same pattern, nothing to worry about.
NNUE doesn't change the diversity.
90% of coding is debugging, the other 10% is writing bugs.