H.G. Muller posted this very important formula in one thread and I just want to make sure I got it right. So let's take just one example.I think the rule-of-thumb Error = 40%/sqrt(numberOfGames) is accurate enough in practice, for scores in the 65%-35% range. (This is for the 1-sigma or 84% confidence level; for 95% confidence, double it.)
Match Result:
A - B: 460 - 440
Score percentage for A: 460 / 900 = 51.1%.
Error margin: 40% / sqrt(900) = 1.3%. Now where should I apply this error margin? Is it calculated directly for score percentage?
So the correct result is 51.1% +- 1.3% (with 84% confidence). Did I got this right?
Now if we are improving the engine through the self-play, the truly interesting question is
"With given match result 460-440 what is the confidence level that the correct score percentage is >=50%?". I know there can't be such an easy rule thumb formula here, but if someone has already figured out the more complicate one, please post it here
