Joost Buijs wrote:From the AlphaZero article it is not clear which TPU they used to play the games.
I didn't read the AlphaGo Zero article, and I don't get why the hardware of AlphaGo Zero and AlphaZero should be the same.
The AlphaZero paper only describes the differences with AlphaGo Zero, (and explicitly states that), so for everything that is not described there you will have to refer to the AlphaGo Zero paper.
In addition, it makes sense: the advantage of gen2 TPUs is that they can do floating point arithmetic, while the gen1 TPUs can only do 8 or (at half the speed) 16-bit multiplication. Floating-point arithmetic is only needed for the back-propagation during training of the network. To run the network on a position ('inference') the 8-bit precision is enough.
I think Stockfish was crippled because they deprived it from using an opening-book, let it run on a 32 core machine with 64 threads (hyper-threading), used a fixed time per move and deprived it from using an egtb.
An opening book is not Stockfish. This is like saying I am crippled when playing a Chess game because I am not allowed to have Kasparov make the first 25 moves for me. If that counts as 'crippled', then AlphaZero was just as much crippled. If your claim is that it is not proven that AlphaZero plays better opening lines that those known from centuries of collective effort of the entire human race, you might have a point. Not related to Stockfish, though. And no one was actually claiming that.
AlphaZero was not using EGTB (which do not provide measurable strength in a 100-game match anyway...). AlphaZero was using fixed time per move. That there are conditions under which Stockfish might perform better doesn't mean it is 'crippled'. If I play a 5-min blitz game, and lose, how do you think people would react in my Chess club if I kept complaining that I only lost because I was 'crippled' by having to complete the entire game in 5 min, while I play much stronger if I can think 5 min per move? I would be laughed out of the building...
As to the HT issue: it seems that Stockfish had a massive hardware advantage with 32 cores, while AlphaZero was only using 8 cores + 4 TPUs. So even if the hyer-threading was actually detrimental, how many threads non-HT cores would 64 HT be equivalent to? 24 cores? 16 cores? Still more than AlphaZero was using...
Every scientist knows..., knowing is something else than doing.
You are accusing them of plain, premeditated fraud. For virtually no gain, jeopardizing the great fame they achieved through their Go results (which cannot possibly have been forged). Does that sound likely to you? And yes, it does happen that millionaire movie stars are caught shop-lifting at the super-market. But so rarely that it makes international news, and they are considered mentally ill. How likely is it that a large team of people would want to ruin their future that way? I never heard of a gang of millionaires going around at night to burgle houses, which would roughly be the equivalent. Suspecting this sounds like complete paranoia.