I don't see why it's necessary to lose the % predictive power by not adjusting the scale. To have some "imponderable" 200 points difference in ratings without any meaning? Better adjust the scale and say what this 200 means as %. Setting "scale 1" I was getting results which were giving very good predictions for individual matches (in 200-75% sense), maybe you can explain why is that, taking into account DrawElo and EloAdvantage. I think Adam is wrong not to follow that "scale 1" procedure just because others either use the wrong EloStat or the default (wrong as % interpretation goes) Bayeselo. Ordo gives good predictions in the 200-75% sense, by the way.Daniel Shawul wrote:When you use scale = 1, you are not preserving the 200-75% assumption.Laskos wrote:I don't understand how it doesn't change anything else. If 200-75% is not preserved, what is the meaning of those numbers given as rating?Daniel Shawul wrote:Hello Adam
I didn't know you have that problem with the complete/pure list too. Well in that case I think using scale becomes even more necessary. For other models where the formula don't give you elos close to arpad's assumption scaling would be even more appropriate. Also Remi prefered use of scaling (made it default) so I would think using it would be safer. It doesn't change anything else other than the magnitude of the elo differences.
Danile
Bayeselo has eloDraw and eloAdvantage that you need to add to the eloDelta to see the 200-75% assumption. But people want to see that with just eloDelta (i.e when comparing two engines). That is why an arbitrary factor was needed to be applied but it doesn't need to be. It doesn't change anything else in the sense that the order and relative elo differences are preserved.Code: Select all
Rating = scale * Original values + offsetAs I explained above bayeselo uses logistic by adding two more parametersEven the transitivity is not preserved using arbitary scaling. I understand by Elo rating that if I pick from the rating list an engine rated 2900 and another rated 2700, then the prediction is that the engine rated 2900 will score 75% in a match against the engine rated 2700. Bayeselo default fails in its prediction or maybe there are secret tables to derive the predictions of which I am not aware. So, tell me, with default Bayeselo, what is the prediction in % for those 2900 and 2700 rated (by Bayeselo default) engines in a match?
KaiSo you need to add and decreas eloAdvantage and eloDraw to the differece to see the 200-75%. With another draw model that uses logistic like thisCode: Select all
logistic(-eloDelta - eloAdvantage + eloDraw);,the default values you get may be even more magnified. But it is better to ask Remi ,this is just my opinion from making elo ratings on ccrl 40/40 with this model. Remi had a reason to apply the scaling by default from what I read.Code: Select all
double f = thetaD * sqrt(logistic(eloDelta + eloAdvantage) * logistic(-eloDelta - eloAdvantage)); return logistic(eloDelta + eloHome) / (1 + f);
Daniel
Kai
