It seems that Deepmind has released the source code of a reimplementation of AlphaZero on github:
https://github.com/deepmind/open_spiel/ ... alpha_zero
Documentation: https://github.com/deepmind/open_spiel/ ... ha_zero.md
MCTS search: https://github.com/deepmind/open_spiel/ ... ms/mcts.cc
Thanks to Daniel Shawul for finding this information and sharing it with the Leela discord.
Deepmind releases AlphaZero code
Moderators: hgm, Rebel, chrisw
-
- Posts: 512
- Joined: Tue Sep 29, 2020 4:29 pm
- Location: Dublin, Ireland
- Full name: Madeleine Birchfield
-
- Posts: 89
- Joined: Sat Nov 09, 2019 3:24 pm
- Full name: .
Re: Deepmind releases AlphaZero code
So has anyone tested this yet?
-
- Posts: 708
- Joined: Mon Jan 16, 2012 6:34 am
Re: Deepmind releases AlphaZero code
1.It is not full details of codes
2. It is a very good documentary record on revolution of early neural networks
3. Current Lc0 running on TCEC is 5x times larger NN weights than A0 ( 30x386 vs 20x256) with much more advanced features than A0 ( e.g TB probe, TB scoring of value head, MLH etc) and likely minimal of +180 elo stronger than A0 now.
2. It is a very good documentary record on revolution of early neural networks
3. Current Lc0 running on TCEC is 5x times larger NN weights than A0 ( 30x386 vs 20x256) with much more advanced features than A0 ( e.g TB probe, TB scoring of value head, MLH etc) and likely minimal of +180 elo stronger than A0 now.
-
- Posts: 89
- Joined: Sat Nov 09, 2019 3:24 pm
- Full name: .
Re: Deepmind releases AlphaZero code
What does this mean? Does the github dump not compile and run?
-
- Posts: 708
- Joined: Mon Jan 16, 2012 6:34 am
Re: Deepmind releases AlphaZero code
It doesn't mention about all details of A0 training. You can get a running version of weak A0.Collingwood wrote: ↑Sun Jan 03, 2021 8:03 amWhat does this mean? Does the github dump not compile and run?
-
- Posts: 89
- Joined: Sat Nov 09, 2019 3:24 pm
- Full name: .
Re: Deepmind releases AlphaZero code
Okay. That's not very useful then, esp. since Lc0 passed full A0 long, long ago.Nay Lin Tun wrote: ↑Sun Jan 03, 2021 10:04 am It doesn't mention about all details of A0 training. You can get a running version of weak A0.
-
- Posts: 1242
- Joined: Sat Jul 05, 2014 7:54 am
- Location: Southwest USA
Re: Deepmind releases AlphaZero code
Well fellow Citizens...AlphaZero is now a "Chess Legend" and I am sure along with many others would like to see some 'working Windows compiles' produced for some "Testings"....Thx in Advance......ARNay Lin Tun wrote: ↑Sun Jan 03, 2021 10:04 amIt doesn't mention about all details of A0 training. You can get a running version of weak A0.Collingwood wrote: ↑Sun Jan 03, 2021 8:03 amWhat does this mean? Does the github dump not compile and run?
-
- Posts: 178
- Joined: Wed Nov 13, 2019 1:36 am
- Full name: Jonathan Kreuzer
Re: Deepmind releases AlphaZero code
I haven't tested it out yet, but looks like they made a very generalized framework for learning to play games
https://github.com/deepmind/open_spiel
Then they re-implemented some of the algorithms used by alpha zero in that framework.
I am curious about strength but I wouldn't expect too much (at least compared to the real alpha-zero or the strongest currently available programs). I will probably play around with the framework just to have an idea of what it is and how well it works.
I was actually working on something like this. I stalled a bit and went back to using chess to test when I realized how convenient having something like CuteChess-cli is, just auto-run a match on multiple cores with any opponent or options. I kept finding more and more features I'd need to add make working with my general framework nice. I had a cobbled together 8x8 Checkers, 8x8 Othello, and support for different board sizes (but not trained well like the 8x8 versions.)
DeepMind's is certainly much more general and much more complete than I had gotten. Not sure if I will feel like working on mine to releasable state, the number of things to do to make it good and general kept adding up. I am curious if anyone has tested Open Spiel performance/strength.
https://github.com/deepmind/open_spiel
Then they re-implemented some of the algorithms used by alpha zero in that framework.
I am curious about strength but I wouldn't expect too much (at least compared to the real alpha-zero or the strongest currently available programs). I will probably play around with the framework just to have an idea of what it is and how well it works.
I was actually working on something like this. I stalled a bit and went back to using chess to test when I realized how convenient having something like CuteChess-cli is, just auto-run a match on multiple cores with any opponent or options. I kept finding more and more features I'd need to add make working with my general framework nice. I had a cobbled together 8x8 Checkers, 8x8 Othello, and support for different board sizes (but not trained well like the 8x8 versions.)
DeepMind's is certainly much more general and much more complete than I had gotten. Not sure if I will feel like working on mine to releasable state, the number of things to do to make it good and general kept adding up. I am curious if anyone has tested Open Spiel performance/strength.