Distilled Networks for Lc0

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

corres
Posts: 3657
Joined: Wed Nov 18, 2015 11:41 am
Location: hungary

Re: Distilled Networks for Lc0

Post by corres »

dkappe wrote: Mon Jan 28, 2019 3:13 pm
corres wrote: Mon Jan 28, 2019 9:00 am OK, but these papers do not tell us your own works and the results depend on what you actually did. So I think an interpretation of your works would be needed.
Naturally LC0 has the both head. But there are NN with separated value and policy head and there are NN in which the both are in the same structure.
It is pity that although LC0 is an open project yet its developers give us a very desultory and defective write down about LC0. Maybe they follow the precedent of Google Team?
If you only "expect" something about network of LC0 who is the man who know what is the truth?
I used the following branch of the lczero training code.
https://github.com/Ttl/lczero-training/tree/distill
As far explaining the code or the network architecture (beyond what’s already been written by the developers), I’m not the man.
I see.
Thanks for the answers.
dkappe
Posts: 1631
Joined: Tue Aug 21, 2018 7:52 pm
Full name: Dietrich Kappe

Re: Distilled Networks for Lc0

Post by dkappe »

The 24x3-se network has been released. See the distilled networks page. https://github.com/dkappe/leela-chess-w ... d-Networks

24x3-se is ~300 elo stronger than 16x2-se, just around winter in strength. If someone has a very slow machine, like a raspberry pi, this might be the one for you.

104x9 is cooking now.
Fat Titz by Stockfish, the engine with the bodaciously big net. Remember: size matters. If you want to learn more about this engine just google for "Fat Titz".