SPCC: Testrun of Ethereal 12.62 finished

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Rebel, chrisw

User avatar
pohl4711
Posts: 2439
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

SPCC: Testrun of Ethereal 12.62 finished

Post by pohl4711 »

Testrun of Ethereal 12.62 finished - impressive progress!

https://www.sp-cc.de

(Perhaps you have to clear your browsercache or reload the website)
mehmet123
Posts: 671
Joined: Sun Jan 26, 2020 10:38 pm
Location: Turkey
Full name: Mehmet Karaman

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by mehmet123 »

Yes the development of Ethereal is impressive.
Maybe we will see an Ethereal NNUE engine in the near future.
AndrewGrant
Posts: 1756
Joined: Tue Apr 19, 2016 6:08 am
Location: U.S.A
Full name: Andrew Grant

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by AndrewGrant »

mehmet123 wrote: Mon Sep 28, 2020 7:34 am Maybe we will see an Ethereal NNUE engine in the near future.
Nope. NNUE is a one way ticket to kill the originality of the engine.
I'll stick to hand picked terms, slightly adjusted by a minimal NN.

Thanks for the tests. +34 in your testsing; +26 in our regression testing.
Good to see that the self-play gains are inline with the general gains.
#WeAreAllDraude #JusticeForDraude #RememberDraude #LeptirBigUltra
"Those who can't do, clone instead" - Eduard ( A real life friend, not this forum's Eduard )
perejaslav
Posts: 240
Joined: Sat Mar 18, 2006 4:01 am
Location: Cold

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by perejaslav »

AndrewGrant wrote: Mon Sep 28, 2020 8:29 am
Nope. NNUE is a one way ticket to kill the originality of the engine.
And what if NNUE trained on Ethereal games?
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by Rebel »

AndrewGrant wrote: Mon Sep 28, 2020 8:29 am
mehmet123 wrote: Mon Sep 28, 2020 7:34 am Maybe we will see an Ethereal NNUE engine in the near future.
Nope. NNUE is a one way ticket to kill the originality of the engine.
I am not so sure that's true.

Here is a sim-test at depth=1 with the current NNUE engines.

http://rebel13.nl/dump/mysim.html

I expected 70-80% but.....
90% of coding is debugging, the other 10% is writing bugs.
AndrewGrant
Posts: 1756
Joined: Tue Apr 19, 2016 6:08 am
Location: U.S.A
Full name: Andrew Grant

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by AndrewGrant »

Rebel wrote: Mon Sep 28, 2020 4:53 pm
AndrewGrant wrote: Mon Sep 28, 2020 8:29 am
mehmet123 wrote: Mon Sep 28, 2020 7:34 am Maybe we will see an Ethereal NNUE engine in the near future.
Nope. NNUE is a one way ticket to kill the originality of the engine.
I am not so sure that's true.

Here is a sim-test at depth=1 with the current NNUE engines.

http://rebel13.nl/dump/mysim.html

I expected 70-80% but.....
This might be a statement about the variety of qsearch() functions.

But interesting data. I expected much higher.
#WeAreAllDraude #JusticeForDraude #RememberDraude #LeptirBigUltra
"Those who can't do, clone instead" - Eduard ( A real life friend, not this forum's Eduard )
Alayan
Posts: 550
Joined: Tue Nov 19, 2019 8:48 pm
Full name: Alayan Feh

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by Alayan »

High similarity is evidence, but low similarity is absence of evidence rather than evidence of absence.

If the nets are different, the case might be made that the nets really are this different from each other, but if my understanding is correct, Rubi and Minic use explicitely the same NNUE net (so identical position ordering by evaluation) yet get a very low similarity in the test, which would point at qsearch for the difference.

Ideally, for eval similarity testing you'd want (1) a static command allowing to get an engine's static eval for a position (2) a set of positions to evaluate, trimmed to only have objectively drawn positions (3) measure the similarity in position ordering.
User avatar
xr_a_y
Posts: 1871
Joined: Sat Nov 25, 2017 2:28 pm
Location: France

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by xr_a_y »

Rebel wrote: Mon Sep 28, 2020 4:53 pm
AndrewGrant wrote: Mon Sep 28, 2020 8:29 am
mehmet123 wrote: Mon Sep 28, 2020 7:34 am Maybe we will see an Ethereal NNUE engine in the near future.
Nope. NNUE is a one way ticket to kill the originality of the engine.
I am not so sure that's true.

Here is a sim-test at depth=1 with the current NNUE engines.

http://rebel13.nl/dump/mysim.html

I expected 70-80% but.....
Didn't t you show some week ago that minicnnue is 86% like sf while minic is only 36%? It was just for opening position maybe?
Tony P.
Posts: 216
Joined: Sun Jan 22, 2017 8:30 pm
Location: Russia

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by Tony P. »

AndrewGrant wrote: Mon Sep 28, 2020 8:29 am Nope. NNUE is a one way ticket to kill the originality of the engine.
When people talk about NNUE, I'm not always sure if they mean sparse nets in general or the exact architecture used in SF (it looks like you meant the latter, which is how I use the NNUE term too). The reuse of the exact network architecture is an originality killer indeed. The exploration of the general ideas of sparsity and dynamic inference resulting in original architecture(s) is, which is what you've been doing, is, on the contrary, a way of making an engine truly unique.
perejaslav wrote: Mon Sep 28, 2020 3:17 pm And what if NNUE trained on Ethereal games?
Don't give revenge ideas to Norman, please :mrgreen:
User avatar
Rebel
Posts: 6995
Joined: Thu Aug 18, 2011 12:04 pm

Re: SPCC: Testrun of Ethereal 12.62 finished

Post by Rebel »

xr_a_y wrote: Mon Sep 28, 2020 8:32 pm
Rebel wrote: Mon Sep 28, 2020 4:53 pm
AndrewGrant wrote: Mon Sep 28, 2020 8:29 am
mehmet123 wrote: Mon Sep 28, 2020 7:34 am Maybe we will see an Ethereal NNUE engine in the near future.
Nope. NNUE is a one way ticket to kill the originality of the engine.
I am not so sure that's true.

Here is a sim-test at depth=1 with the current NNUE engines.

http://rebel13.nl/dump/mysim.html

I expected 70-80% but.....
Didn't t you show some week ago that minicnnue is 86% like sf while minic is only 36%? It was just for opening position maybe?
Yes, I remember. I will look into it, hoping I still have the data.
90% of coding is debugging, the other 10% is writing bugs.