True, but this is a problem when one has few widely ditributed on Elo scale participants. I would try to take these 1900, 2600, 2700 Elo participants to see what happens. I am also disturbed by that 1900 participant, as the trinomial distributions would look more like Poisson or Gamma, but I hope it doesn't matter, because it will fade away after several games. For now I have an empirical normalized distribution function of points for a participant which scores ProbabilityOfSuccess against its opponent (I fitted the normalized Gaussian to a binomial, not trinomial, but the errors are small)hgm wrote:The problem is that the engine you want to calculate the winning probability of has to be compared to the maximum of the score of its competitors. Ad there usually are only a small number of serious competitors, even in a large round-robin. The distribution of the maxium of a small number of normally distributed variables is not very Gaussian.
1/(sqrt(NumberOfGames) * 0.44*sqrt(2*pi)) * exp((X-ProbabilityOfSuccess * NumberOfGames)^2/(0.3872 * NumberOfGames))
where X is the number of points for which the distribution function gives the probability.
(your 0.40 formula for sigma becomes now 0.44 because of the binomial distribution taken, but it's a first try

It's very easy now (with Erf function, integrating the Gaussian on an interval ) to calculate the probability of a tourney win for two engines, I hope it's simple to generalize to arbitrary number of participants. But it will be a pretty cumbersome rule of thumb

Kai