I am not sure I am interpreting your results correctly, as it was a long night last night traveling. But if you introduce an opponent with a small number of games and therefore high-uncertainty, doesn't that increase the uncertainty for ANY program that plays against them, in terms of the error bar?Adam Hair wrote:Interesting. When I use joint distribution on a smaller pgn, I get this:Daniel Shawul wrote:I used the default ofcourse since Remi suggests that to avoid ugly error margin reports in some cases as shown here. Elostat gives also the same error margins. It seems to me even in case we have one pool (two players) default gives better results. If I am given one result set with 200-200-100, then I can calculate by vand margin of error to be 27 which the default shows, but I get half of that with exact dist...With the larger number of games and all methods:Code: Select all
ResultSet>elo ResultSet-EloRating>mm 1 1 00:00:00,00 ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Player0 0 27 27 500 50% 0 20% 2 Player1 0 27 27 500 50% 0 20% ResultSet-EloRating>covariance ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Player0 0 14 14 500 50% 0 20% 2 Player1 0 14 14 500 50% 0 20%
Not exactly gettting more accurate,is it ? Maybe algorithm has problems with two players...Code: Select all
ResultSet-EloRating>mm 1 1 00:00:00,00 ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Player0 0 2 2 100000 50% 0 20% 2 Player1 0 2 2 100000 50% 0 20% ResultSet-EloRating>covariance ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Player0 0 1 1 100000 50% 0 20% 2 Player1 0 1 1 100000 50% 0 20% ResultSet-EloRating>exactdist 00:00:00,05 ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Player0 0 3 3 100000 50% 0 20% 2 Player1 0 3 3 100000 50% 0 20% ResultSet-EloRating>jointdist 00:00:08,15 ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Player0 0 -1439 -1439 100000 50% 0 20% 2 Player1 0 -1439 -1439 100000 50% 0 20% ResultSet-EloRating>los
When I increase the # of games, I get the same as you:Code: Select all
ResultSet-EloRating>jointdist 00:00:00,01 ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Engine_B 0 16 13 504 50% 0 25% 2 Engine_A 0 16 13 504 50% 0 25% ResultSet-EloRating>
Code: Select all
ResultSet-EloRating>jointdist 00:00:00,17 ResultSet-EloRating>ratings Rank Name Elo + - games score oppo. draws 1 Engine_D 0 -1499 -1499 22176 50% 0 33% 2 Engine_C 0 -1499 -1499 22176 50% 0 33% ResultSet-EloRating>
I'll try to re-read after I recover from traveling to see what I might have mis-read...