Yes, you rely on 7 games to make your determination, while I have 2783 games to make my determination.hgm wrote:Stupid, stupid, stupid... The Stockfish testing people want to reject 3-Elo regressions with confidence, not 300-Elo regressions. Hundred times smaller error bars requires 10,000 times as many games, because the error bars decrease as the square root of the number of games.Dann Corbit wrote:Well then, the Stockfish testing people can rejoice because now instead of running thousands of tests, they can stop at 7.
I really must advise you to educate yourself in the most elementary aspects of statistics, as anything you say here makes you look more and more ignorant....You seem to misunderstand the difference between an observation and the probability of that observation. But that is OK with me.
Just because it is unlikely does not mean you won't see it.
Maybe you are right. Maybe since TCEC 9 Jonny has not only picked up hundreds of Elo, with the others like Stockfish and Komodo not gaining any in the same time frame so that Jonny has caught up, but also Jonny has overturned Amdahl's law too.
I genuflect to your brilliance sir. Those seven games fill you with such confidence that I know you must really know what you are talking about and 7 is plenty, even the the 200 Elo error bars to know that for certain you are right.
No wonder you have so much confidence.
I guess maybe you are having a guilt complex over your role in the ICGA mess, but I think you should forgive yourself and just forget it.