Do you know which change(s) give so many points?
AndrewGrant wrote: ↑Mon Sep 28, 2020 8:29 am
Nope. NNUE is a one-way ticket to kill the originality of the engine.
I'll stick to hand-picked terms, slightly adjusted by a minimal NN.
Thanks for the tests. +34 in your testing; +26 in our regression testing.
Good to see that the self-play gains are in line with the general gains.
SPCC: Testrun of Ethereal 12.62 finished
-
Vinvin
- Posts: 5223
- Joined: Thu Mar 09, 2006 9:40 am
- Full name: Vincent Lejeune
Re: SPCC: Testrun of Ethereal 12.62 finished
-
AndrewGrant
- Posts: 1660
- Joined: Tue Apr 19, 2016 6:08 am
- Location: U.S.A
- Full name: Andrew Grant
Re: SPCC: Testrun of Ethereal 12.62 finished
You could go through the repo history: https://github.com/AndyGrant/Ethereal/commits/master
Vinvin wrote: ↑Tue Sep 29, 2020 12:43 am
Do you know which change(s) give so many points?
AndrewGrant wrote: ↑Mon Sep 28, 2020 8:29 am
Nope. NNUE is a one-way ticket to kill the originality of the engine.
I'll stick to hand-picked terms, slightly adjusted by a minimal NN.
Thanks for the tests. +34 in your testing; +26 in our regression testing.
Good to see that the self-play gains are in line with the general gains.
There are 15 commits since Ethereal 12.50; each commit message shows the SPRT results for the content of that commit.
SPRT, especially with a broad opening book, generally overestimates the Elo gains.
However, you can gauge which patches had more weight than others.
But the answer is these 3:
https://github.com/AndyGrant/Ethereal/c ... f359399567
https://github.com/AndyGrant/Ethereal/c ... 5ccd38c171
https://github.com/AndyGrant/Ethereal/c ... 675669cecd
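For readers comparing the Elo figures quoted in this thread, here is a minimal sketch of how a match score maps to an Elo difference and back, assuming the usual logistic Elo model. The function names (`elo_from_score`, `score_from_elo`, `match_score`) are illustrative only and are not taken from Ethereal or from any testing framework.

```python
import math


def match_score(wins: int, draws: int, losses: int) -> float:
    """Fraction of points scored: a win counts 1, a draw counts 0.5."""
    games = wins + draws + losses
    if games == 0:
        raise ValueError("no games played")
    return (wins + 0.5 * draws) / games


def elo_from_score(score: float) -> float:
    """Elo difference implied by a score, under the logistic Elo model."""
    if not 0.0 < score < 1.0:
        raise ValueError("score must be strictly between 0 and 1")
    return -400.0 * math.log10(1.0 / score - 1.0)


def score_from_elo(elo: float) -> float:
    """Expected score for a given Elo advantage (inverse of elo_from_score)."""
    return 1.0 / (1.0 + 10.0 ** (-elo / 400.0))


# The two gains mentioned in the thread, expressed as expected scores.
for gain in (34.0, 26.0):
    print(f"+{gain:.0f} Elo -> expected score {score_from_elo(gain):.4f}")
```

Under this model the difference between +34 and +26 corresponds to a small change in expected score, so the two measurements can plausibly agree within normal error bars.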
Talkchess is dead without moderation. If you want my attention, contact me via andrew@grantnet.us
-
Rebel
- Posts: 6946
- Joined: Thu Aug 18, 2011 12:04 pm
Re: SPCC: Testrun of Ethereal 12.62 finished
I still have the data (one of my PCs gave up last week), but I am unable to reproduce the result because it's unclear which Sergio nn.bin I used for SF. I am sorry.
Rebel wrote: ↑Mon Sep 28, 2020 10:37 pm
Yes, I remember. I will look into it, hoping I still have the data.
xr_a_y wrote: ↑Mon Sep 28, 2020 8:32 pm
Didn't you show some weeks ago that minicnnue is 86% like sf while minic is only 36%? It was just for the opening position, maybe?
Rebel wrote: ↑Mon Sep 28, 2020 4:53 pm
I am not so sure that's true.
AndrewGrant wrote: ↑Mon Sep 28, 2020 8:29 am
Nope. NNUE is a one-way ticket to kill the originality of the engine.
Here is a sim-test at depth=1 with the current NNUE engines.
http://rebel13.nl/dump/mysim.html
I expected 70-80% but.....
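The percentages in this kind of sim-test can be read as move agreement between two engines over a set of positions. The sketch below is an assumption about how such a figure is computed, not the actual tool used for the linked results; `move_agreement`, the position ids, and the moves are all made up for illustration.

```python
def move_agreement(moves_a: dict[str, str], moves_b: dict[str, str]) -> float:
    """Percentage of shared positions where both engines chose the same move."""
    shared = moves_a.keys() & moves_b.keys()
    if not shared:
        raise ValueError("no positions in common")
    same = sum(1 for pos in shared if moves_a[pos] == moves_b[pos])
    return 100.0 * same / len(shared)


# Hypothetical data: position id -> move chosen at a fixed depth.
engine_a = {"pos1": "e2e4", "pos2": "g1f3", "pos3": "d2d4", "pos4": "c2c4"}
engine_b = {"pos1": "e2e4", "pos2": "g1f3", "pos3": "e2e4", "pos4": "c2c4"}

print(f"similarity: {move_agreement(engine_a, engine_b):.1f}%")
```

A real test would gather the moves by running each engine on the same positions at the same depth; only the counting step is shown here.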
90% of coding is debugging, the other 10% is writing bugs.
-
pohl4711
- Posts: 2390
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: SPCC: Testrun of Ethereal 12.62 finished
These results match perfectly. Note that in my test runs, Ethereal 12.50 was the popcount binary and Ethereal 12.62 was the new avx2 binary, which is more than 8% faster than the popcount compile. So my result for Ethereal 12.62 should be a little better (compared to Ethereal 12.50) than in your self-play testing. And that is exactly what we see...
AndrewGrant wrote: ↑Mon Sep 28, 2020 8:29 am
Thanks for the tests. +34 in your testing; +26 in our regression testing.
Good to see that the self-play gains are in line with the general gains.