Distilled Networks for Lc0

corres · Post by **corres** » Mon Jan 28, 2019 4:30 pm

dkappe wrote: ↑Mon Jan 28, 2019 3:13 pm
corres wrote: ↑Mon Jan 28, 2019 9:00 am OK, but these papers do not tell us your own works and the results depend on what you actually did. So I think an interpretation of your works would be needed.
Naturally LC0 has the both head. But there are NN with separated value and policy head and there are NN in which the both are in the same structure.
It is pity that although LC0 is an open project yet its developers give us a very desultory and defective write down about LC0. Maybe they follow the precedent of Google Team?
If you only "expect" something about network of LC0 who is the man who know what is the truth?
I used the following branch of the lczero training code.
https://github.com/Ttl/lczero-training/tree/distill
As far explaining the code or the network architecture (beyond what’s already been written by the developers), I’m not the man.

I see.
Thanks for the answers.

dkappe · Post by **dkappe** » Tue Jan 29, 2019 5:48 pm

The 24x3-se network has been released. See the distilled networks page. https://github.com/dkappe/leela-chess-w ... d-Networks

24x3-se is ~300 elo stronger than 16x2-se, just around winter in strength. If someone has a very slow machine, like a raspberry pi, this might be the one for you.

104x9 is cooking now.

Distilled Networks for Lc0

Re: Distilled Networks for Lc0

Re: Distilled Networks for Lc0