Google's AlphaGo team has been working on chess

Daniel Shawul · Post by **Daniel Shawul** » Wed Dec 06, 2017 1:05 pm

mar wrote:While this is indeed incredible, show me how it beats SF dev with good book and syzygy on equal hardware in a 1000 game match.

Alternatively winning next TCEC should do

"equal hardware", "same book", "same tb" wasn't an issue for WCCC, why now ?

mar · Post by **mar** » Wed Dec 06, 2017 1:11 pm

Daniel Shawul wrote:"equal hardware", "same book", "same tb" wasn't an issue for WCCC, why now ?

They are scientists so it would be nice to compare apples to apples.

So far we know that AlphaZero is able to beat SF8 without book and tbs on their hardware in a 100-game match (while the result is significant, more games would be better).
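
To put a rough number on "more games would be better", here is a minimal sketch, assuming the usual logistic Elo model and a normal approximation for the mean score. The function names are made up for illustration, and the 28 wins / 72 draws / 0 losses used as input is the headline result as I recall it from the preprint, not a figure quoted in this thread.

```python
import math

def elo_from_score(score):
    """Elo difference implied by an expected score under the logistic model."""
    return -400.0 * math.log10(1.0 / score - 1.0)

def match_elo(wins, draws, losses, z=1.96):
    """Return (elo, lower, upper) for a match result.

    Uses a normal approximation on the mean score, so it is only valid
    when the interval stays strictly inside (0, 1).
    """
    n = wins + draws + losses
    score = (wins + 0.5 * draws) / n
    # Per-game variance of the score, with win=1, draw=0.5, loss=0.
    var = (wins * (1.0 - score) ** 2
           + draws * (0.5 - score) ** 2
           + losses * (0.0 - score) ** 2) / n
    se = math.sqrt(var / n)
    return (elo_from_score(score),
            elo_from_score(score - z * se),
            elo_from_score(score + z * se))

# Assumed input: 28 wins, 72 draws, 0 losses over 100 games.
elo, lo, hi = match_elo(28, 72, 0)
print(f"Elo difference: {elo:.0f} (95% interval {lo:.0f} to {hi:.0f})")
```

With those inputs the point estimate is about 100 Elo, with a 95% interval of very roughly 65 to 135. Since the standard error scales with one over the square root of the number of games, a 1000-game match would shrink that interval by about a factor of three.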

EvgeniyZh · Post by **EvgeniyZh** » Wed Dec 06, 2017 1:15 pm

mar wrote:While this is indeed incredible, show me how it beats SF dev with good book and syzygy on equal hardware in a 1000 game match.

Alternatively winning next TCEC should do

Are you proposing to run Stockfish on a GPU? :)

mar wrote:They are scientists so it would be nice to compare apples to apples.

AlphaZero didn't use a book or Syzygy, and neither did Stockfish. That sounds like apples to apples.

mar · Post by **mar** » Wed Dec 06, 2017 1:29 pm

EvgeniyZh wrote:
mar wrote:While this is indeed incredible, show me how it beats SF dev with good book and syzygy on equal hardware in a 1000 game match.

Alternatively winning next TCEC should do
Are you proposing to run Stockfish on a GPU? :)

mar wrote:They are scientists so it would be nice to compare apples to apples.
AlphaZero didn't use a book or Syzygy, and neither did Stockfish. That sounds like apples to apples.

Obviously I'd like to see AlphaZero running on a CPU (because running SF on a TPU won't happen) and still beating SF, while allowing SF to use every means to play the best chess it can, leaving zero doubt.

I wonder if they could do it, maybe not at the moment but probably soon.

Considering the hardware at their disposal, a 100-game match seems rather short.

I'm shocked at what they could accomplish without alpha-beta, though.

EvgeniyZh · Post by **EvgeniyZh** » Wed Dec 06, 2017 1:47 pm

mar wrote:
EvgeniyZh wrote:
mar wrote:While this is indeed incredible, show me how it beats SF dev with good book and syzygy on equal hardware in a 1000 game match.

Alternatively winning next TCEC should do
Are you proposing to run Stockfish on a GPU? :)

mar wrote:They are scientists so it would be nice to compare apples to apples.
AlphaZero didn't use a book or Syzygy, and neither did Stockfish. That sounds like apples to apples.
Obviously I'd like to see AlphaZero running on a CPU (because running SF on a TPU won't happen) and still beating SF, while allowing SF to use every means to play the best chess it can, leaving zero doubt.

I wonder if they could do it, maybe not at the moment but probably soon.

Considering the hardware at their disposal, a 100-game match seems rather short.

I'm shocked at what they could accomplish without alpha-beta, though.

Well, they probably should have given the same FLOPS budget to both; that seems like the fairest you can get, given the inefficiency of switching hardware for either side.

Winning against the latest Stockfish with an opening book and endgame tables would definitely be even more impressive.

jorose · Post by **jorose** » Wed Dec 06, 2017 1:57 pm

Very cool! I am especially surprised they still relied on an MCTS approach in chess. I don't think anybody can actually reproduce these results at the moment with hardware at home, but this certainly marks a significant shift in how computer chess will develop.

I am curious what kind of performance their program would be able to achieve on sub-2k off-the-shelf commercial hardware. Considering the power of their TPUs, I imagine the penalty would be pretty huge. Regardless, commercial hardware is a question of when, not if. Perhaps someone will improve their approach specifically for chess in some way?

I am curious whether the same number of people will work on the tinkering form of chess programming.

Rémi Coulom · Post by **Rémi Coulom** » Wed Dec 06, 2017 2:36 pm

xcombelle wrote:
Money would be a better measure.
The AlphaZero training system cost $4 million in hardware (figure given for AlphaGo Zero; I don't have the source at hand).

The paper says they use 5,000 first-generation TPUs, and 64 second-generation TPUs. Such hardware is not available for sale, but might be similar to a V100 in terms of computing power. A single PCI V100 costs about 10,000 Euros in Europe. But if you buy 5,000, you can certainly get a much cheaper price. Of course you also need the computers that host them, and the power supply (250W*5,000 = 1.25 MW).
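
The back-of-the-envelope figures above can be written out as a short sketch. It assumes, as the post does, that a V100 is a fair stand-in for a first-generation TPU; the variable names are only illustrative, and the total ignores any volume discount, the host machines, and the 64 second-generation TPUs.

```python
# Inputs taken from the post; the V100-for-TPU substitution is an assumption.
N_TPUS = 5000                 # first-generation TPUs cited from the paper
V100_LIST_PRICE_EUR = 10_000  # approximate single-card price in Europe
V100_POWER_W = 250            # per-card power draw used in the post

# Upper bound on card cost at single-unit list price.
list_price_total_eur = N_TPUS * V100_LIST_PRICE_EUR

# Power needed for the accelerators alone, in megawatts.
total_power_mw = N_TPUS * V100_POWER_W / 1e6

print(f"Cards at list price: {list_price_total_eur:,} EUR")
print(f"Accelerator power:   {total_power_mw:.2f} MW")
```

At list price that is 50 million Euros for the cards, which is why the volume discount matters so much when comparing against the $4 million figure.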

This being said, I would not be surprised if their trained network could still beat Stockfish on ordinary hardware. And I expect deep-learning hardware will become much cheaper and commonplace in the future. Even cell-phones are starting to have deep-learning hardware now.

A distributed open-source effort might be enough to produce a super-strong network in a few months. This is what Gian-Carlo has started with Leela in Go. Maybe he'll do it for chess and shogi, too.

mcostalba · Post by **mcostalba** » Wed Dec 06, 2017 2:50 pm

I have read the paper: the result is impressive!

Honestly, I didn't think it was possible, because my understanding was that chess is more "computer friendly" than Go... I was wrong.

It is true that SF is not meant to play at its best without a book, and especially that a fixed 1 minute per move cuts out time management entirely. It would be more natural to play under tournament conditions. Nevertheless, I think these are secondary aspects: what has been accomplished is huge.

Michel · Post by **Michel** » Wed Dec 06, 2017 3:43 pm

Rémi Coulom wrote:
xcombelle wrote:
Money would be a better measure.
The AlphaZero training system cost $4 million in hardware (figure given for AlphaGo Zero; I don't have the source at hand).
The paper says they use 5,000 first-generation TPUs, and 64 second-generation TPUs. Such hardware is not available for sale, but might be similar to a V100 in terms of computing power. A single PCI V100 costs about 10,000 Euros in Europe. But if you buy 5,000, you can certainly get a much cheaper price. Of course you also need the computers that host them, and the power supply (250W*5,000 = 1.25 MW).

This being said, I would not be surprised if their trained network could still beat Stockfish on ordinary hardware. And I expect deep-learning hardware will become much cheaper and commonplace in the future. Even cell-phones are starting to have deep-learning hardware now.

A distributed open-source effort might be enough to produce a super-strong network in a few months. This is what Gian-Carlo has started with Leela in Go. Maybe he'll do it for chess and shogi, too.

I have a question that perhaps you can answer right away.

Almost 1000 CPU-years have gone into tuning SF to date...

Would you say that the training of AlphaGo required fewer or more resources than this?

Rémi Coulom · Post by **Rémi Coulom** » Wed Dec 06, 2017 3:55 pm

Michel wrote:I have a question that perhaps you can answer right away.

Almost 1000 CPU-years have gone into tuning SF to date...

Would you say that the training of AlphaGo required fewer or more resources than this?

According to the paper, they trained for 9 hours on 5,000 TPUs.

5000 * 9 / 24 = 1875 TPU-days

A TPU is a bit like a super-powerful GPU. As a very rough estimate, 10 GTX 1080 Ti cards might match the power of one TPU, which makes the job about 18,750 GPU-days. So if you get 100 people volunteering their GPU full time, that would take about 6 months. That looks doable.
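
For anyone who wants to plug in their own guesses, the estimate can be sketched as below. The 10-GPUs-per-TPU ratio and the 100 volunteers are the rough assumptions from this post, not measured figures, and `volunteer_estimate` is just an illustrative helper name.

```python
def volunteer_estimate(n_tpus, train_hours, gpus_per_tpu, n_volunteers):
    """Return (tpu_days, days_needed) for a volunteer GPU effort.

    gpus_per_tpu is a guessed equivalence ratio, so the result is only
    as good as that guess.
    """
    tpu_days = n_tpus * train_hours / 24
    gpu_days = tpu_days * gpus_per_tpu
    return tpu_days, gpu_days / n_volunteers

# Figures from the post: 5000 TPUs, 9 hours, ~10 GTX 1080 Ti per TPU, 100 volunteers.
tpu_days, days_needed = volunteer_estimate(5000, 9, 10, 100)
print(f"{tpu_days:.0f} TPU-days, about {days_needed / 30:.1f} months with 100 GPUs")
```

Doubling the number of volunteers halves the time, so the estimate is most sensitive to the guessed TPU-to-GPU ratio.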
