AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

Leo
Posts: 1080
Joined: Fri Sep 16, 2016 6:55 pm
Location: USA/Minnesota
Full name: Leo Anger

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by Leo »

Ras wrote:
Leo wrote:Was it to inconvenient to download the latest SF?
Because it isn't an official release and has not been tested as exhaustively. That's the point of releases. Just take a look at the TCEC what happened to Komodo when going with a dev version.

Besides, the release has been rated in many games while the "dev version of today" has not.
It is still unsatisfying to use that old of a SF.
Advanced Micro Devices fan.
User avatar
Leto
Posts: 2071
Joined: Thu May 04, 2006 3:40 am
Location: Dune

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by Leto »

Leo wrote:
Ras wrote:
Leo wrote:Was it to inconvenient to download the latest SF?
Because it isn't an official release and has not been tested as exhaustively. That's the point of releases. Just take a look at the TCEC what happened to Komodo when going with a dev version.

Besides, the release has been rated in many games while the "dev version of today" has not.
It is still unsatisfying to use that old of a SF.
Judging from the score of the match it wouldn't matter if the latest SF dev were used, AlphaZero should dominate it easily. And that was with just 4 hours of training. Imagine how much stronger AlphaZero is now.
Leo
Posts: 1080
Joined: Fri Sep 16, 2016 6:55 pm
Location: USA/Minnesota
Full name: Leo Anger

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by Leo »

Leto wrote:
Leo wrote:
Ras wrote:
Leo wrote:Was it to inconvenient to download the latest SF?
Because it isn't an official release and has not been tested as exhaustively. That's the point of releases. Just take a look at the TCEC what happened to Komodo when going with a dev version.

Besides, the release has been rated in many games while the "dev version of today" has not.
It is still unsatisfying to use that old of a SF.
Judging from the score of the match it wouldn't matter if the latest SF dev were used, AlphaZero should dominate it easily. And that was with just 4 hours of training. Imagine how much stronger AlphaZero is now.
When it solves a 10 man EGTB I will agree its impressive. Maybe it can do that in 4 hours too.
Advanced Micro Devices fan.
Ras
Posts: 2488
Joined: Tue Aug 30, 2016 8:19 pm
Full name: Rasmus Althoff

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by Ras »

Leo wrote:It is still unsatisfying to use that old of a SF.
It's really not Google's fault that the Stockfish team hasn't had enough confidence to release Stockfish 9 for more than a full year.
Leo
Posts: 1080
Joined: Fri Sep 16, 2016 6:55 pm
Location: USA/Minnesota
Full name: Leo Anger

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by Leo »

Ras wrote:
Leo wrote:It is still unsatisfying to use that old of a SF.
It's really not Google's fault that the Stockfish team hasn't had enough confidence to release Stockfish 9 for more than a full year.
I see your point. It does look like this is a chess computer revolution. Most of us new this was coming someday right?
Advanced Micro Devices fan.
Ras
Posts: 2488
Joined: Tue Aug 30, 2016 8:19 pm
Full name: Rasmus Althoff

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by Ras »

Leo wrote:It does look like this is a chess computer revolution.
I wouldn't limit this to chess. It seems that their framework is flexible enough to master chess, Go and other games, indicating that it might be suited to a wide range of tasks - by self-learning. Maybe, the AI spring is coming, after decades of winter. This could be a computer revolution.

For now, Google has a monopoly on these TPUs, and they will continue to enjoy this advantage for quite some time. That's why they don't sell them and rent them out via cloud. They're cashing in on being the first to the market.

However, market economy inevitably dictates what will happen: other big companies will jump the train, Intel and Nvidia in particular, aiming to sell that stuff to end customers. Similarly to what happened in the PC revolution.

Especially Intel actually HAS to think of a new strategy, given that their adventures to get hold in ARM's domain by and large have failed. They can't bet their whole company's future on x86. Plus that Intel has the money and the experts to achieve success, I think.
Uri Blass
Posts: 10309
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by Uri Blass »

Ras wrote:
Leo wrote:It is still unsatisfying to use that old of a SF.
It's really not Google's fault that the Stockfish team hasn't had enough confidence to release Stockfish 9 for more than a full year.
Stockfish release a new version every few days that everybody can download and I see no reason to care about the name of the new version.

Tests show maybe 40 elo advantage for latest version and certainly not 100 elo at long time control so alphago could beat also the new version of stockfish.
User avatar
MikeB
Posts: 4889
Joined: Thu Mar 09, 2006 6:34 am
Location: Pen Argyl, Pennsylvania

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by MikeB »

Uri Blass wrote:
Ras wrote:
Leo wrote:It is still unsatisfying to use that old of a SF.
It's really not Google's fault that the Stockfish team hasn't had enough confidence to release Stockfish 9 for more than a full year.
Stockfish release a new version every few days that everybody can download and I see no reason to care about the name of the new version.

Tests show maybe 40 elo advantage for latest version and certainly not 100 elo at long time control so alphago could beat also the new version of stockfish.
That's true but I believe Google purposely choose a weaker version for impact. This is not about chess, this about selling their AI packages to Cities and Governments, the more dominating the result , the bigger the headline - free advertising. Even the result is not as dominating as it sounds - as the relatively high draw rate keeps the ELO difference under 80 ELO, I added one win for SF otherwise Bayeselo does not compute the ELO difference correctly. Current SF would have lost but probably would have been within 30 or 40 ELO

Code: Select all

ResultSet-EloRating>x
ResultSet>reset
ResultSet>rp /Users/michaelbyrne/cluster.mfb/12052200.txt 
101 game(s) loaded
ResultSet>elo
ResultSet-EloRating>mm 
00:00:00,00
ResultSet-EloRating>confidence 0.95
0.9
ResultSet-EloRating>r
Rank Name                 Rating   Δ     +    -     #     Σ    Σ%     W    L    D   W%    =%   OppR 
---------------------------------------------------------------------------------------------------------
   1 Stronger Engine       3132   0.0   50   50   101   64.0  63.4   28    1   72  27.7  71.3  3068 
   2 Weaker Engine         3068  64.3   50   50   101   37.0  36.6    1   28   72   1.0  71.3  3132 
---------------------------------------------------------------------------------------------------------
jhellis3
Posts: 546
Joined: Sat Aug 17, 2013 12:36 am

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by jhellis3 »

I would say the result is much more dominating that the Elo difference would suggest. If one looks at the games, it becomes quite clear at how efficient it is at exploiting holes in conventional programs evaluate functions, especially toward the late midgame / early endgame.
User avatar
MikeB
Posts: 4889
Joined: Thu Mar 09, 2006 6:34 am
Location: Pen Argyl, Pennsylvania

Re: AlphaZero beats AlphaGo Zero, Stockfish, and Elmo

Post by MikeB »

jhellis3 wrote:I would say the result is much more dominating that the Elo difference would suggest. If one looks at the games, it becomes quite clear at how efficient it is at exploiting holes in conventional programs evaluate functions, especially toward the late midgame / early endgame.
I wrote that before going through the games and I only have gone through two of them, but as you suggest it does appear more dominating than the ELO would suggest.