Historic Milestone: AlphaZero

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

jhellis3
Posts: 546
Joined: Sat Aug 17, 2013 12:36 am

Re: Much weaker than Stockfish

Post by jhellis3 »

If K80 is 5k$, that would be like 100k$ minimum starting price.
It is $70,000 AFAIK.
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Much weaker than Stockfish

Post by Lyudmil Tsvetkov »

kranium wrote:
Lyudmil Tsvetkov wrote:
The training matches are different from the 100 games match with Stockfish.
It is not at all clear to me where were books used and where not.
12 openings with reversed colours don't square in any way with 100 played games, so did they actually left some openings played more than others, or did not they flip colours?
I'm sure opening books were not used...
In the early self-play games things like 1.a3, 1.a4, etc. were probably tried by AlphaZero...
eventually it learned that 1. e4 or 1. d4 had the highest success rates.
How can you be sure if they don't specify it?
And it learned wrong. But 1.Nf3?

Is this engine still based on random choices? What perfect engine we are talking about then?
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Much weaker than Stockfish

Post by Lyudmil Tsvetkov »

MikeGL wrote:
Lyudmil Tsvetkov wrote:The training matches are different from the 100 games match with Stockfish.
Yes, the plot on the diagram is the training game, but 100 games per openning was played. 50-50, and the score below the diagram is on AlphaZero perspective.
12 openings with reversed colours don't square in any way with 100 played games, so did they actually left some openings played more than others, or did not they flip colours?
12 opennings x 100 = 1,200 games total.
Before we were talking about 300 and 100, now 1200 suddenly appears...
The 64/36 score certainly comes from 100 games, unless they assigned random points for a win.
And in that sample, I see Alpha playing just 1.d4 and 1.Nf3.
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Historic Milestone: AlphaZero

Post by Lyudmil Tsvetkov »

JJJ wrote:
Lyudmil Tsvetkov wrote:
jdart wrote:Even Stockfish on a Raspberry Pi is a strong chess player, though. So AlphaZero has decent performance, we just need a better comparison. I don't think there is any fundamental reason a NN based system such as AlphaZero couldn't run on commodity hardware, so maybe that can happen.

--Jon
Well, probably lower than 3000 elos.
I don't think any engine below 3000 elo would crush Stockfish with a strong hardware.
This is a specifically-built engine running on a very specific hardware.

It is not that easy to find promising advances in chess evaluation, unless you get rid of all redundances and come up with completely new terms, which they have not done, so it is simply logically impossible to improve vastly in that way.

I can not believe something that is simply unachievable.

On a single core, if they would adapt their code that way, Alpha would certainly be weaker than 2900.
But actually, why don't they offer a single-core standard version to convince everyone?
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Historic Milestone: AlphaZero

Post by Lyudmil Tsvetkov »

One good thing Alpha learned is to develop its bishop on g2.
How did it learn that?
MikeGL
Posts: 1010
Joined: Thu Sep 01, 2011 2:49 pm

Re: Much weaker than Stockfish

Post by MikeGL »

Lyudmil Tsvetkov wrote:
MikeGL wrote:
Lyudmil Tsvetkov wrote:The training matches are different from the 100 games match with Stockfish.
Yes, the plot on the diagram is the training game, but 100 games per openning was played. 50-50, and the score below the diagram is on AlphaZero perspective.
12 openings with reversed colours don't square in any way with 100 played games, so did they actually left some openings played more than others, or did not they flip colours?
12 opennings x 100 = 1,200 games total.
Before we were talking about 300 and 100, now 1200 suddenly appears...
The 64/36 score certainly comes from 100 games, unless they assigned random points for a win.
And in that sample, I see Alpha playing just 1.d4 and 1.Nf3.
Read the series of posts properly. It is 100 games per openning, you clearly don't understand Table 2.

300 games because you were talking about 1.e4 earlier which appears in 6 diagrams.
How many is 50 x 6 ?
You were claiming that AlphaZero didn't play 1.e4, i told you it did! 300 times it did play 1.e4 against SF8.

See the total summation below: 1,200 games for all 12 opennings. Come on man, even this very
basic stuff we argue?
shrapnel
Posts: 1339
Joined: Fri Nov 02, 2012 9:43 am
Location: New Delhi, India

Re: Much weaker than Stockfish

Post by shrapnel »

MikeGL wrote:See the total summation below: 1,200 games for all 12 opennings. Come on man, even this very
basic stuff we argue?
I suppose when the Wheel was first invented, there must have been people like Lyudmil Tsvetkov around, who said that it would never work.
Throughout the History of Mankind there have always been people who flatly refused to believe in revolutionary discoveries and inventions.
I suppose this falls in the same Category.
i7 5960X @ 4.1 Ghz, 64 GB G.Skill RipJaws RAM, Twin Asus ROG Strix OC 11 GB Geforce 2080 Tis
smatovic
Posts: 2657
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Historic Milestone: AlphaZero

Post by smatovic »

Milos wrote:
smatovic wrote:
Since no one of us has access to TPU it's only fair to count in terms of what is available (for example if we had Alpha0 x64 binary compile and wanted to run it at home).
1TPU ~ 30xE5-2699v3 (18 cores machine).
4TPUs ~ 2000 Haswell cores
Apples and Bananas,

Stockfish is not able to make use of these TPUs,
and AlphaZero depends probably heavily on floating point operations (maybe half precision) to query the neural network.

So the question might be, if an stripped down x86-64 version of AlphaZero, with only some hundred or thousand of nps, is still able to beat Stockfish.....dunno.



--
Srdja
No original paper is comparing apples and bananas. SF is running on general purpose hardware. TPUs are not commercially available so running alpha0 on TPUs is giving it huge unfair advantage.
It would be like running SF on special hardware where search is happening on conventional CPU and all evaluation is handled with hundreds if not thousands of FPGAa, something like DeepBlue. Then we could say that comparison is fair.
Even in this setup, if it was the most recent version of Brainfish (so with opening book), and normal TC like 40/40 not 1move/min, Alpha0 would probably loose.
- i was wrong, it looks like they are doing 8 bit integer operations for inferencing

https://cloud.google.com/blog/big-data/ ... g-unit-tpu

- considering the raw 8 bit compute throughput you may be right with your math

https://arxiv.org/abs/1704.04760

--
Srdja
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Much weaker than Stockfish

Post by Lyudmil Tsvetkov »

MikeGL wrote:
Lyudmil Tsvetkov wrote:
MikeGL wrote:
Lyudmil Tsvetkov wrote:The training matches are different from the 100 games match with Stockfish.
Yes, the plot on the diagram is the training game, but 100 games per openning was played. 50-50, and the score below the diagram is on AlphaZero perspective.
12 openings with reversed colours don't square in any way with 100 played games, so did they actually left some openings played more than others, or did not they flip colours?
12 opennings x 100 = 1,200 games total.
Before we were talking about 300 and 100, now 1200 suddenly appears...
The 64/36 score certainly comes from 100 games, unless they assigned random points for a win.
And in that sample, I see Alpha playing just 1.d4 and 1.Nf3.
Read the series of posts properly. It is 100 games per openning, you clearly don't understand Table 2.

300 games because you were talking about 1.e4 earlier which appears in 6 diagrams.
How many is 50 x 6 ?
You were claiming that AlphaZero didn't play 1.e4, i told you it did! 300 times it did play 1.e4 against SF8.

See the total summation below: 1,200 games for all 12 opennings. Come on man, even this very
basic stuff we argue?
We are talking here of the 100 game match, for which we have the pgn.
Do you have the pgn for the training games, which, btw., are claimed to run into the thousands?

In the pgn available, I see only 1.d4 and 1.Nf3.
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Much weaker than Stockfish

Post by Lyudmil Tsvetkov »

shrapnel wrote:
MikeGL wrote:See the total summation below: 1,200 games for all 12 opennings. Come on man, even this very
basic stuff we argue?
I suppose when the Wheel was first invented, there must have been people like Lyudmil Tsvetkov around, who said that it would never work.
Throughout the History of Mankind there have always been people who flatly refused to believe in revolutionary discoveries and inventions.
I suppose this falls in the same Category.
That is the point, Anil, there is nothing revolutionary here, except the big hardware.
Deep Blue followed the same approach: big hardware.
Revolutionary discoveries require a lot of effort, while the Alpha team has put all the brunt on the hardware. That is why it can not expect any significant discoveries.

I am eagerly waiting for the time(which will never come), when they will release their single-core engine, around 2850, to see whether you are going to buy it.