Future plans of Leela,(reset of the current one)


Nay Lin Tun
Posts: 708
Joined: Mon Jan 16, 2012 6:34 am

Future plans of Leela,(reset of the current one)

Post by Nay Lin Tun »

(copy from Leela blog)

As planned, we concluded our test10 run, and now it is time for another one.
Test10 was undoubtedly a success, but it has reached its limit. The vote on Discord showed that the community wants the reset as soon as possible, and that's what we did. :)


We used to match network identifiers to test numbers (e.g. test5 had network ids 5xx), but test10 produced so many networks that its ids overflowed into 11xxx, so the next test is called test20.

At the current game rate, test20 is expected to take 6-7 weeks to become stronger than the latest networks from test10.

Changes

What didn't change

Before telling what's new in the next run, let me list what we promised but is not there:
Weights quantization is not enabled.
It is implemented, but we didn't test it enough to confirm that it doesn't lead to weaker nets.
SWA (Stochastic Weight Averaging).
The implementation turned out to be too slow; optimizations are needed.
Training multiple networks in parallel.
With the frequent training that we plan, the training pipeline won't be able to keep up.
There are plans to employ several GPUs during training, but that's not implemented yet.
It's not main2, but rather test20.
It's running on the test server, but at least we updated the server version.

What did change

And now, how test20 will be different from test10:
Cpuct will be equal to 5
That's the value that DeepMind used in AlphaGo (they did not mention values of Cpuct in the AlphaGo Zero and AlphaZero papers).
It is expected that this will make Leela better in tactics, and will add more variance to openings.
Rule50 bug fixed.
Leela will be able to use information about the number of moves made without captures or pawn moves.
Cache history bug fixed.
We recently found a bug: a different transposition of the same position could be taken from the NN cache, while in reality the NN can return different output depending on history. That was fixed.
Better resign threshold handling.
We'll watch for the eval value at which the probability that resigning is correct reaches 95%, and adjust the threshold dynamically.
Frequent network generation, ~40 networks per day.
Test10 started with only ~4 networks per day.
Larger batch size in training pipeline.
This is closer to what DeepMind did for AlphaZero and should reduce overfitting.
Ghost Batch Normalization from start
(I don't really know what it is). Also closer to what DeepMind did and also prevents overfitting.
En passant + threefold repetition bug is fixed.
This was a minor bug which probably won't have much effect. After a pawn moved two squares, the position was never counted towards threefold repetition.
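
As an illustration of the Cpuct change listed above, here is a minimal sketch of AlphaZero-style PUCT move selection in Python. It is not lc0's code: the function names and the toy numbers are made up for the example, and lc0's actual search may differ in details. It only shows why a larger Cpuct pushes the search towards less-visited moves.

```python
import math


def puct_scores(priors, visits, q_values, cpuct):
    """Return Q + U per child, with U = cpuct * P * sqrt(sum N) / (1 + N)."""
    total_visits = sum(visits)
    scores = []
    for p, n, q in zip(priors, visits, q_values):
        u = cpuct * p * math.sqrt(total_visits) / (1 + n)
        scores.append(q + u)
    return scores


def select_child(priors, visits, q_values, cpuct):
    """Pick the index of the child with the highest Q + U."""
    scores = puct_scores(priors, visits, q_values, cpuct)
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical node: move 0 is well explored and scores better so far,
# move 1 has fewer visits and a lower average value.
priors = [0.6, 0.4]
visits = [70, 30]
q_values = [0.30, 0.15]

low = select_child(priors, visits, q_values, cpuct=1.0)   # stays on move 0
high = select_child(priors, visits, q_values, cpuct=5.0)  # switches to move 1
print(low, high)
```

With these toy numbers the low Cpuct keeps exploiting the better-scoring move, while Cpuct = 5 makes the exploration term large enough to try the other one.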
Rubinus
Posts: 1161
Joined: Thu Jan 18, 2007 4:05 pm
Location: Prague
Full name: Pavel Háse

Re: Future plans of Leela,(reset of the current one)

Post by Rubinus »

AdminX
Posts: 6339
Joined: Mon Mar 13, 2006 2:34 pm
Location: Acworth, GA

Re: Future plans of Leela,(reset of the current one)

Post by AdminX »

Rubinus wrote: Fri Aug 31, 2018 10:49 am http://testserver.lczero.org/active_users
And a link to engine 0.18?
You will need to compile it.
Version 0.18 is still in development, but here is the link to the source (use the master branch): https://github.com/LeelaChessZero/lc0
"Good decisions come from experience, and experience comes from bad decisions."
__________________________________________________________________
Ted Summers
jp
Posts: 1470
Joined: Mon Apr 23, 2018 7:54 am

Re: Future plans of Leela,(reset of the current one)

Post by jp »

Leela blog wrote:Update3
test20 training is finally started! First network training from non-random self-play games will be id20058. Networks id20000–20056 were intermediate networks from initial training, and id20057 is the final seed network.
What exactly do they mean by "reset"? Using nothing at all from previous runs (no past weights, no past games) & everything from scratch from 20058 on?
"Non-random" games?
20057 "seed network", but no relation to 20058?

crem, can you explain?
crem
Posts: 177
Joined: Wed May 23, 2018 9:29 pm

Re: Future plans of Leela,(reset of the current one)

Post by crem »

jp wrote: Fri Aug 31, 2018 5:36 pm
Leela blog wrote:Update3
test20 training is finally started! First network training from non-random self-play games will be id20058. Networks id20000–20056 were intermediate networks from initial training, and id20057 is the final seed network.
What exactly do they mean by "reset"? Using nothing at all from previous runs (no past weights, no past games) & everything from scratch from 20058 on?
"Non-random" games?
20057 "seed network", but no relation to 20058?

crem, can you explain?
Random games were generated and used to train the neural network initially. The result of that process is id20057 (the networks before it are just intermediate steps).
After that, the usual training was started. Clients downloaded id20057 and generated self-play games, and from those games id20058 was generated (with id20057 as a base). And so on.

What used to be id1 is now id20057. What used to be id2 is now id20058.

No past games and no past weights from test10 or any other run were used. Even the random games are completely fresh.
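
To make the numbering concrete, here is a small Python sketch of the lineage as described above. It is a simplification for illustration only: `describe_lineage` is a hypothetical helper, not part of the lc0 training pipeline, and it ignores the intermediate networks id20000–20056.

```python
def describe_lineage(seed_id, count):
    """List (network_id, base_id, training_data) for a fresh run.

    The seed network has no base and is trained on random games; each
    later network uses the previous one as its base and is trained on
    self-play games produced with that previous network.
    """
    lineage = [(seed_id, None, "random games")]
    for net_id in range(seed_id + 1, seed_id + 1 + count):
        base_id = net_id - 1
        lineage.append((net_id, base_id, f"self-play games from id{base_id}"))
    return lineage


for net_id, base_id, data in describe_lineage(20057, 2):
    print(net_id, base_id, data)
```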
whereagles
Posts: 565
Joined: Thu Nov 13, 2014 12:03 pm

Re: Future plans of Leela,(reset of the current one)

Post by whereagles »

doing it all over again? what's the point?
Damir
Posts: 2801
Joined: Mon Feb 11, 2008 3:53 pm
Location: Denmark
Full name: Damir Desevac

Re: Future plans of Leela,(reset of the current one)

Post by Damir »

bug fixes and code cleanup needs a new start....
jp
Posts: 1470
Joined: Mon Apr 23, 2018 7:54 am

Re: Future plans of Leela,(reset of the current one)

Post by jp »

Damir wrote: Sat Sep 01, 2018 5:16 pm bug fixes and code cleanup needs a new start....
Bug fixes & better parameters like cpuct.
The real problem is how to know you've got rid of all the bugs and are using the best parameters for training.
Damir
Posts: 2801
Joined: Mon Feb 11, 2008 3:53 pm
Location: Denmark
Full name: Damir Desevac

Re: Future plans of Leela,(reset of the current one)

Post by Damir »

How would you know without trying it anyway?
jp
Posts: 1470
Joined: Mon Apr 23, 2018 7:54 am

Re: Future plans of Leela,(reset of the current one)

Post by jp »

Yep, and if it's close to flatlining might as well.