Rebel wrote:hgm wrote:The 100 games all started from the normal start position.
Nothing of that in the document.
Well, it should have been if they started from non-standard positions. The 10 games they published from that match all started from the standard position.
But if so,then where are all the random openings coming from? Nothing in the paper. And I may assume no randomness in SF. We have only seen 10 games and I may hope no doubles with 1 min per move and then BOOM -> force move.
This is a good question, as in the games they did publish it is often Stockfish that deviated. So with 64 threads there seems to be a lot of randomness. AlphaZero also is see to play two different initial moves, and after the same first move two different second moves. So I guess there is a legitimate concern whether the games are sufficiently independent to claim that for calculating rating error bars they should indeed be counted as 100 games.
Instead we have:
Table 2 analyses the most common human openings (those played more than 100,000 times in an online database of human chess games (1)). Each of these openings is independently discovered and played frequently by AlphaZero during self-play training. When starting from each human opening, AlphaZero convincingly defeated Stockfish, suggesting that it has indeed mastered a wide spectrumof chess play.
This is no mystery. These are different matches, not starting from the standard opening position, each with their own result. And indeed they do give the starting position in this case. So there have been 1300 games actually, from 13 different starting positions, the 12 given popular openings, and the standard opeing position.