Ordo release (rating software, ELO-like)

Discussion of chess software programming and technical issues.

Moderators: hgm, Rebel, chrisw

User avatar
michiguel
Posts: 6401
Joined: Thu Mar 09, 2006 8:30 pm
Location: Chicago, Illinois, USA

Re: Ordo release (rating software, ELO-like)

Post by michiguel »

Vinvin wrote:
michiguel wrote:
michiguel wrote:
Vinvin wrote:I tested with a merge version of CCRL, CEGT and WBEC (I'll produce a list next week with bayeselo.exe) : a little bit more than 1 Millions games and 3500 different players.

The program crashed ... is it because numbers are too big ?

michiguel wrote:https://sites.google.com/site/gaviotachessengine/ordo

Based on a recent discussion on the IPON rankings, I decided to clean up the command line interface and release it. It may be an alternative to BayesELO and ELOSTAT.

Miguel
It could certainly be the problem. I hardwired a limit, but the program should warn about it. Whatever it is, it is my fault. I am sorry, I rushed to get this out, otherwise it would not have happened. I will check and fix it tonight. Crashing is unacceptable.

Miguel
I fixed the problem (for now).
https://sites.google.com/site/gaviotachessengine/ordo
3 million games are accepted. I will later allow a dynamic allocation, so the limit will be the memory of the computer.

Miguel
Thanks, no more crash :D

The run last more than 1 hour (I don't know exactly because I let the PC running and I leave).

There's smthg strange at the top of the list ...

Code: Select all

                        ENGINE:  RATING    POINTS  PLAYED    (%)
      Houdini 1.5a 64-bit 4CPU:  3317.0    2396.0    3336   71.8%
            King Of Kings 1.95:  3317.0      28.0      28   100.0%
       Houdini 2.0 64-bit 4CPU:  3317.0    1233.0    1668   73.9%
      Houdini 1.5a 64-bit 6CPU:  3317.0     438.0     600   73.0%
    Deep Rybka 4.1 64-bit 4CPU:  3291.5     646.5    1012   63.9%
...
Why 4 engines rated at 3317 ? :?
Vincent,

The program did not converge well. This is rare (I have seen it only once before), but it may happen when the data is difficult to chew (for instance, I see KoK with 100% performance, which is infinite rating). Still, I would like to deal with tough situations like this. Is there any chance you can send me the file you used?

Miguel
PS: I send you the email by PM
Vinvin
Posts: 5228
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: Ordo release (rating software, ELO-like)

Post by Vinvin »

michiguel wrote:
Vinvin wrote:
michiguel wrote:
michiguel wrote:
Vinvin wrote:I tested with a merge version of CCRL, CEGT and WBEC (I'll produce a list next week with bayeselo.exe) : a little bit more than 1 Millions games and 3500 different players.

The program crashed ... is it because numbers are too big ?

michiguel wrote:https://sites.google.com/site/gaviotachessengine/ordo

Based on a recent discussion on the IPON rankings, I decided to clean up the command line interface and release it. It may be an alternative to BayesELO and ELOSTAT.

Miguel
It could certainly be the problem. I hardwired a limit, but the program should warn about it. Whatever it is, it is my fault. I am sorry, I rushed to get this out, otherwise it would not have happened. I will check and fix it tonight. Crashing is unacceptable.

Miguel
I fixed the problem (for now).
https://sites.google.com/site/gaviotachessengine/ordo
3 million games are accepted. I will later allow a dynamic allocation, so the limit will be the memory of the computer.

Miguel
Thanks, no more crash :D

The run last more than 1 hour (I don't know exactly because I let the PC running and I leave).

There's smthg strange at the top of the list ...

Code: Select all

                        ENGINE:  RATING    POINTS  PLAYED    (%)
      Houdini 1.5a 64-bit 4CPU:  3317.0    2396.0    3336   71.8%
            King Of Kings 1.95:  3317.0      28.0      28   100.0%
       Houdini 2.0 64-bit 4CPU:  3317.0    1233.0    1668   73.9%
      Houdini 1.5a 64-bit 6CPU:  3317.0     438.0     600   73.0%
    Deep Rybka 4.1 64-bit 4CPU:  3291.5     646.5    1012   63.9%
...
Why 4 engines rated at 3317 ? :?
Vincent,

The program did not converge well. This is rare (I have seen it only once before), but it may happen when the data is difficult to chew (for instance, I see KoK with 100% performance, which is infinite rating). Still, I would like to deal with tough situations like this. Is there any chance you can send me the file you used?

Miguel
PS: I send you the email by PM
The file when compress with 7z weight 60 MB , probably too big to send by mail :-(

I reran Ordo during 30 min and I got this :

Code: Select all

...
****************************************   960k
****************************************  1000k
*************************************  total games: 1037253

set average rating = 2500.000000

phase iteration  deviation
  0       0        1.47822
  1       3        0.88921
  2       4        0.49703
  3      10        0.25402
  4       2        0.16557
  5       8        0.12398
  6      13        0.10929
  7      35        0.09854
  8       7        0.09775
  9       5        0.09758
 10     857        0.08366
 11      99        0.08339
 12    2515        0.07544
 13   10000        0.06067
...
This "10000" seems strange ... does that means "impossible to converge" ?
Rémi Coulom
Posts: 438
Joined: Mon Apr 24, 2006 8:06 pm

Re: Ordo release (rating software, ELO-like)

Post by Rémi Coulom »

Vinvin wrote:Thanks, no more crash :D

The run last more than 1 hour (I don't know exactly because I let the PC running and I leave).

There's smthg strange at the top of the list ...

Code: Select all

                        ENGINE:  RATING    POINTS  PLAYED    (%)
      Houdini 1.5a 64-bit 4CPU:  3317.0    2396.0    3336   71.8%
            King Of Kings 1.95:  3317.0      28.0      28   100.0%
       Houdini 2.0 64-bit 4CPU:  3317.0    1233.0    1668   73.9%
      Houdini 1.5a 64-bit 6CPU:  3317.0     438.0     600   73.0%
    Deep Rybka 4.1 64-bit 4CPU:  3291.5     646.5    1012   63.9%
...
Why 4 engines rated at 3317 ? :?
From Miguel's description, I expect that players with 100% win rate will cause problems, because no prior is used.

Rémi
Vinvin
Posts: 5228
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: Ordo release (rating software, ELO-like)

Post by Vinvin »

Vinvin wrote:
michiguel wrote:
Vinvin wrote:
michiguel wrote:
michiguel wrote:
Vinvin wrote:I tested with a merge version of CCRL, CEGT and WBEC (I'll produce a list next week with bayeselo.exe) : a little bit more than 1 Millions games and 3500 different players.

The program crashed ... is it because numbers are too big ?

michiguel wrote:https://sites.google.com/site/gaviotachessengine/ordo

Based on a recent discussion on the IPON rankings, I decided to clean up the command line interface and release it. It may be an alternative to BayesELO and ELOSTAT.

Miguel
It could certainly be the problem. I hardwired a limit, but the program should warn about it. Whatever it is, it is my fault. I am sorry, I rushed to get this out, otherwise it would not have happened. I will check and fix it tonight. Crashing is unacceptable.

Miguel
I fixed the problem (for now).
https://sites.google.com/site/gaviotachessengine/ordo
3 million games are accepted. I will later allow a dynamic allocation, so the limit will be the memory of the computer.

Miguel
Thanks, no more crash :D

The run last more than 1 hour (I don't know exactly because I let the PC running and I leave).

There's smthg strange at the top of the list ...

Code: Select all

                        ENGINE:  RATING    POINTS  PLAYED    (%)
      Houdini 1.5a 64-bit 4CPU:  3317.0    2396.0    3336   71.8%
            King Of Kings 1.95:  3317.0      28.0      28   100.0%
       Houdini 2.0 64-bit 4CPU:  3317.0    1233.0    1668   73.9%
      Houdini 1.5a 64-bit 6CPU:  3317.0     438.0     600   73.0%
    Deep Rybka 4.1 64-bit 4CPU:  3291.5     646.5    1012   63.9%
...
Why 4 engines rated at 3317 ? :?
Vincent,

The program did not converge well. This is rare (I have seen it only once before), but it may happen when the data is difficult to chew (for instance, I see KoK with 100% performance, which is infinite rating). Still, I would like to deal with tough situations like this. Is there any chance you can send me the file you used?

Miguel
PS: I send you the email by PM
The file when compress with 7z weight 60 MB , probably too big to send by mail :-(

I reran Ordo during 30 min and I got this :

Code: Select all

...
****************************************   960k
****************************************  1000k
*************************************  total games: 1037253

set average rating = 2500.000000

phase iteration  deviation
  0       0        1.47822
  1       3        0.88921
  2       4        0.49703
  3      10        0.25402
  4       2        0.16557
  5       8        0.12398
  6      13        0.10929
  7      35        0.09854
  8       7        0.09775
  9       5        0.09758
 10     857        0.08366
 11      99        0.08339
 12    2515        0.07544
 13   10000        0.06067
...
This "10000" seems strange ... does that means "impossible to converge" ?
I reran again but with output in a log file (command line : "ordo -a 2500 -p ccw.pgn -o rating.txt >ordotest.log")
after 2h30 :

Code: Select all

importing results (x1000): 
****************************************    40k
****************************************    80k
****************************************   120k
****************************************   160k
****************************************   200k
****************************************   240k
****************************************   280k
****************************************   320k
****************************************   360k
****************************************   400k
****************************************   440k
****************************************   480k
****************************************   520k
****************************************   560k
****************************************   600k
****************************************   640k
****************************************   680k
****************************************   720k
****************************************   760k
****************************************   800k
****************************************   840k
****************************************   880k
****************************************   920k
****************************************   960k
****************************************  1000k
*************************************  total games: 1037253 

set average rating = 2500.000000

phase iteration  deviation
  0       0        1.47822
  1       3        0.88921
  2       4        0.49703
  3      10        0.25402
  4       2        0.16557
  5       8        0.12398
  6      13        0.10929
  7      35        0.09854
  8       7        0.09775
  9       5        0.09758
 10     857        0.08366
 11      99        0.08339
 12    2515        0.07544
 13   10000        0.06067
 14   10000        0.05164
 15   10000        0.04733
 16   10000        0.04526
 17   10000        0.04425
 18   10000        0.04375
 19   10000        0.04351
done
User avatar
michiguel
Posts: 6401
Joined: Thu Mar 09, 2006 8:30 pm
Location: Chicago, Illinois, USA

Re: Ordo release (rating software, ELO-like)

Post by michiguel »

Vinvin wrote:
michiguel wrote:
Vinvin wrote:
michiguel wrote:
michiguel wrote:
Vinvin wrote:I tested with a merge version of CCRL, CEGT and WBEC (I'll produce a list next week with bayeselo.exe) : a little bit more than 1 Millions games and 3500 different players.

The program crashed ... is it because numbers are too big ?

michiguel wrote:https://sites.google.com/site/gaviotachessengine/ordo

Based on a recent discussion on the IPON rankings, I decided to clean up the command line interface and release it. It may be an alternative to BayesELO and ELOSTAT.

Miguel
It could certainly be the problem. I hardwired a limit, but the program should warn about it. Whatever it is, it is my fault. I am sorry, I rushed to get this out, otherwise it would not have happened. I will check and fix it tonight. Crashing is unacceptable.

Miguel
I fixed the problem (for now).
https://sites.google.com/site/gaviotachessengine/ordo
3 million games are accepted. I will later allow a dynamic allocation, so the limit will be the memory of the computer.

Miguel
Thanks, no more crash :D

The run last more than 1 hour (I don't know exactly because I let the PC running and I leave).

There's smthg strange at the top of the list ...

Code: Select all

                        ENGINE:  RATING    POINTS  PLAYED    (%)
      Houdini 1.5a 64-bit 4CPU:  3317.0    2396.0    3336   71.8%
            King Of Kings 1.95:  3317.0      28.0      28   100.0%
       Houdini 2.0 64-bit 4CPU:  3317.0    1233.0    1668   73.9%
      Houdini 1.5a 64-bit 6CPU:  3317.0     438.0     600   73.0%
    Deep Rybka 4.1 64-bit 4CPU:  3291.5     646.5    1012   63.9%
...
Why 4 engines rated at 3317 ? :?
Vincent,

The program did not converge well. This is rare (I have seen it only once before), but it may happen when the data is difficult to chew (for instance, I see KoK with 100% performance, which is infinite rating). Still, I would like to deal with tough situations like this. Is there any chance you can send me the file you used?

Miguel
PS: I send you the email by PM
The file when compress with 7z weight 60 MB , probably too big to send by mail :-(

I reran Ordo during 30 min and I got this :

Code: Select all

...
****************************************   960k
****************************************  1000k
*************************************  total games: 1037253

set average rating = 2500.000000

phase iteration  deviation
  0       0        1.47822
  1       3        0.88921
  2       4        0.49703
  3      10        0.25402
  4       2        0.16557
  5       8        0.12398
  6      13        0.10929
  7      35        0.09854
  8       7        0.09775
  9       5        0.09758
 10     857        0.08366
 11      99        0.08339
 12    2515        0.07544
 13   10000        0.06067
...
This "10000" seems strange ... does that means "impossible to converge" ?
It means it is struggling and moving too slow to find the optimum. I limit the number of iterations per phase to 10000, and then, it moves to the next one with a higher resolution (smaller increment). Sometimes, that helps.

I downloaded WBEC results from here http://wbec-ridderkerk.nl/html/WBEC1to10.rar
and excluded
"Xadreco 5.6",
"GreKo 2.37",
"King Of Kings 1.95",
"Neurosis 1.1",
because they either won all games, or lost all games.

Even after that, the results converge, but slowly and the results look odd (too many weak engines on top). This is a symptom that there are several pools that are either not connected, or loosely connected (for instance, strong engines did not play weak engines, except one fluke game that was lost). I will examine this more carefully, but it I confirm what I say, the data is very problematic. In addition, too many engines played too few games.

Miguel
User avatar
michiguel
Posts: 6401
Joined: Thu Mar 09, 2006 8:30 pm
Location: Chicago, Illinois, USA

Re: Ordo release (rating software, ELO-like)

Post by michiguel »

michiguel wrote:
Vinvin wrote:
michiguel wrote:
Vinvin wrote:
michiguel wrote:
michiguel wrote:
Vinvin wrote:I tested with a merge version of CCRL, CEGT and WBEC (I'll produce a list next week with bayeselo.exe) : a little bit more than 1 Millions games and 3500 different players.

The program crashed ... is it because numbers are too big ?

michiguel wrote:https://sites.google.com/site/gaviotachessengine/ordo

Based on a recent discussion on the IPON rankings, I decided to clean up the command line interface and release it. It may be an alternative to BayesELO and ELOSTAT.

Miguel
It could certainly be the problem. I hardwired a limit, but the program should warn about it. Whatever it is, it is my fault. I am sorry, I rushed to get this out, otherwise it would not have happened. I will check and fix it tonight. Crashing is unacceptable.

Miguel
I fixed the problem (for now).
https://sites.google.com/site/gaviotachessengine/ordo
3 million games are accepted. I will later allow a dynamic allocation, so the limit will be the memory of the computer.

Miguel
Thanks, no more crash :D

The run last more than 1 hour (I don't know exactly because I let the PC running and I leave).

There's smthg strange at the top of the list ...

Code: Select all

                        ENGINE:  RATING    POINTS  PLAYED    (%)
      Houdini 1.5a 64-bit 4CPU:  3317.0    2396.0    3336   71.8%
            King Of Kings 1.95:  3317.0      28.0      28   100.0%
       Houdini 2.0 64-bit 4CPU:  3317.0    1233.0    1668   73.9%
      Houdini 1.5a 64-bit 6CPU:  3317.0     438.0     600   73.0%
    Deep Rybka 4.1 64-bit 4CPU:  3291.5     646.5    1012   63.9%
...
Why 4 engines rated at 3317 ? :?
Vincent,

The program did not converge well. This is rare (I have seen it only once before), but it may happen when the data is difficult to chew (for instance, I see KoK with 100% performance, which is infinite rating). Still, I would like to deal with tough situations like this. Is there any chance you can send me the file you used?

Miguel
PS: I send you the email by PM
The file when compress with 7z weight 60 MB , probably too big to send by mail :-(

I reran Ordo during 30 min and I got this :

Code: Select all

...
****************************************   960k
****************************************  1000k
*************************************  total games: 1037253

set average rating = 2500.000000

phase iteration  deviation
  0       0        1.47822
  1       3        0.88921
  2       4        0.49703
  3      10        0.25402
  4       2        0.16557
  5       8        0.12398
  6      13        0.10929
  7      35        0.09854
  8       7        0.09775
  9       5        0.09758
 10     857        0.08366
 11      99        0.08339
 12    2515        0.07544
 13   10000        0.06067
...
This "10000" seems strange ... does that means "impossible to converge" ?
It means it is struggling and moving too slow to find the optimum. I limit the number of iterations per phase to 10000, and then, it moves to the next one with a higher resolution (smaller increment). Sometimes, that helps.

I downloaded WBEC results from here http://wbec-ridderkerk.nl/html/WBEC1to10.rar
and excluded
"Xadreco 5.6",
"GreKo 2.37",
"King Of Kings 1.95",
"Neurosis 1.1",
because they either won all games, or lost all games.

Even after that, the results converge, but slowly and the results look odd (too many weak engines on top). This is a symptom that there are several pools that are either not connected, or loosely connected (for instance, strong engines did not play weak engines, except one fluke game that was lost). I will examine this more carefully, but it I confirm what I say, the data is very problematic. In addition, too many engines played too few games.

Miguel
Actually, it wasn't that bad. I messed up the pgn results when I wanted to quickly remove the results of some engines :-(
After I removed
"Xadreco 5.6",
"GreKo 2.37",
"King Of Kings 1.95",
"Neurosis 1.1",
"Thor'sHammer 1.9",
"Pyotr Club Ed 1.0",
"MiniChessAI 1.02",
It converges, not super fast, but it does.
After you remove the results of an engine, all of the sudden, another shows up with a 0% or a 100% (because a victory or a defeat was removed). Maybe I should include a feature to automatically detect and remove the results of these engines.

These are the top engines after calculation

Code: Select all

                        ENGINE:  RATING    POINTS  PLAYED    (%)
                Shredder 9-x64:  3061.8      70.5      92   76.6%
                 Shredder 7.04:  2978.2     120.0     168   71.4%
                   Gandalf 6.0:  2953.8      62.0      92   67.4%
                  Ktulu 6.1 b3:  2949.6      23.5      32   73.4%
                 The King 3.33:  2948.6     171.5     260   66.0%
                 Ruffian 1.0.1:  2947.0      79.0     100   79.0%
                  Gandalf 6.01:  2931.4      55.5      92   60.3%
                   Pro Deo 1.1:  2927.5      55.0      92   59.8%
                      List 512:  2914.3     110.5     184   60.1%
                    Spike 0.9a:  2907.9      68.0      92   73.9%
                 The King 3.23:  2906.5      87.0     132   65.9%
                     Zappa 1.0:  2904.4      52.0      92   56.5%
                 Ruffian 2.1.0:  2900.7     195.5     324   60.3%
                 DeepSjeng 1.6:  2895.5      87.5     140   62.5%
                   Pro Deo 1.0:  2893.5     101.5     168   60.4%
                DeepSjeng 1.6a:  2891.1     104.5     184   56.8%
                     Ktulu 7.0:  2889.2      50.0      92   54.3%
                Aristarch 4.50:  2887.8     150.0     260   57.7%
                Pharaon 3.2-64:  2885.4      49.5      92   53.8%
                 DeepSjeng 1.5:  2872.6     163.5     236   69.3%
               Maestro 1.09uci:  2870.4      47.5      92   51.6%
                   Pseudo 0.7c:  2869.8      71.0      88   80.7%
               Crafty-19.13-64:  2869.7      51.5      92   56.0%
                  Ruffian 0.76:  2868.2      36.0      52   69.2%
                 Ruffian 1.0.5:  2868.1      55.5      84   66.1%
                     Fruit 2.0:  2866.6      47.0      92   51.1%
            SlowChess blitz WV:  2865.0      66.5     124   53.6%
                Aristarch 4.41:  2864.3      37.5      64   58.6%
                 Ruffian 2.0.0:  2862.2      41.5      68   61.0%
               Maestro 1.08uci:  2854.4      49.5      92   53.8%
                  Crafty 19.12:  2853.6      36.5      64   57.0%
                 DanChess CCT7:  2847.1     103.0     156   66.0%
                  Fruit X-0705:  2844.5      44.5      60   74.2%
                  Fruit X12-11:  2843.3      63.0      92   68.5%
          SmarThink 0.18a-r165:  2841.9     133.0     260   51.2%
               SmarThink 0.17a:  2841.8      39.5      68   58.1%
                      Naum 1.8:  2840.4      19.0      32   59.4%
It converges like this

Code: Select all

phase iteration  deviation
  0       9        0.36465
  1       1        0.18250
  2       1        0.10190
  3       2        0.06489
  4       4        0.04925
  5      14        0.04145
  6      12        0.03956
  7     152        0.03197
  8      16        0.03175
  9       4        0.03172
 10    3611        0.00702
 11    1098        0.00350
 12     994        0.00219
 13    1913        0.00111
 14    2163        0.00057
 15    2363        0.00029
 16    2250        0.00016
 17    3138        0.00008
 18    3162        0.00004
 19    2620        0.00002
User avatar
michiguel
Posts: 6401
Joined: Thu Mar 09, 2006 8:30 pm
Location: Chicago, Illinois, USA

Re: Ordo release (rating software, ELO-like)

Post by michiguel »

New version Ordo v0.4 and Ordoprep v0.4
https://sites.google.com/site/gaviotachessengine/ordo

Vincent, you can try to use Ordoprep to remove engines with 100% or 0% (and shrink the pgn files).

a fragment from the readme file

Code: Select all

SIMULATION
This version includes the swich -s that allows to perform simulations.
The program will will virtually 'replay' the games getting results 
based on the probabilities given by the ratings previously obtained.
After the simulations have been run a standard deviation is calculated.
The error matrix file will give the error between each of the pairs of
players. A bare minimum number will be -s 10, but a reliable number would be
at least -s 100. Take into account that this will take time. Each simulation
takes the same amount of time as each rating calculation.

The rest of the switches available are:

usage: ordo [-OPTION]
 -h        print this help
 -v        print version number and exit
 -L        display the license information
 -q        quiet (no screen progress updates)
 -a <avg>  set general rating average
 -p <file> input file in .pgn format
 -c <file> output file &#40;comma separated value format&#41;
 -o <file> output file &#40;text format&#41;, goes to the screen if not present
 -s  #     perform # simulations to calculate errors
 -e <file> saves an error matrix, if -s was used

ORDOPREP
A utility is included that will shrink the pgn file to "results only". 
In addition, it could discard players that won all games, or lost all
games &#40;which generally bring problems&#41;. A typical use will be

ordoprep -d -p raw.pgn -o shrunk.pgn

'-d' tells ordoprep to discard players with 100% or 0% performance.
After this, 'shrunk.pgn' could be used as input for 'ordo'
Miguel
Vinvin
Posts: 5228
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: Ordo release (rating software, ELO-like)

Post by Vinvin »

michiguel wrote:New version Ordo v0.4 and Ordoprep v0.4
https://sites.google.com/site/gaviotachessengine/ordo

Vincent, you can try to use Ordoprep to remove engines with 100% or 0% (and shrink the pgn files).

a fragment from the readme file

Code: Select all

SIMULATION
This version includes the swich -s that allows to perform simulations.
The program will will virtually 'replay' the games getting results 
based on the probabilities given by the ratings previously obtained.
After the simulations have been run a standard deviation is calculated.
The error matrix file will give the error between each of the pairs of
players. A bare minimum number will be -s 10, but a reliable number would be
at least -s 100. Take into account that this will take time. Each simulation
takes the same amount of time as each rating calculation.

The rest of the switches available are&#58;

usage&#58; ordo &#91;-OPTION&#93;
 -h        print this help
 -v        print version number and exit
 -L        display the license information
 -q        quiet &#40;no screen progress updates&#41;
 -a <avg>  set general rating average
 -p <file> input file in .pgn format
 -c <file> output file &#40;comma separated value format&#41;
 -o <file> output file &#40;text format&#41;, goes to the screen if not present
 -s  #     perform # simulations to calculate errors
 -e <file> saves an error matrix, if -s was used

ORDOPREP
A utility is included that will shrink the pgn file to "results only". 
In addition, it could discard players that won all games, or lost all
games &#40;which generally bring problems&#41;. A typical use will be

ordoprep -d -p raw.pgn -o shrunk.pgn

'-d' tells ordoprep to discard players with 100% or 0% performance.
After this, 'shrunk.pgn' could be used as input for 'ordo'
Miguel
Thanks, I've ordopreped my big files (removed all moves, 0% and 100%). Ordo still have troubles to converge, I sent you the files (2 MB now).
I've a request : please make an option to remove engines with <5% and >95% results ...

My best,
Vincent
Rémi Coulom
Posts: 438
Joined: Mon Apr 24, 2006 8:06 pm

Re: Ordo release (rating software, ELO-like)

Post by Rémi Coulom »

Vinvin wrote:Thanks, I've ordopreped my big files (removed all moves, 0% and 100%). Ordo still have troubles to converge, I sent you the files (2 MB now).
I've a request : please make an option to remove engines with <5% and >95% results ...
You can have all players at 50% and still have the algorithm diverge. What is necessary to ensure is that every player has at least a path made of wins to any other player, and a path made of losses (a draw can count as a win or as a loss, as needed). Otherwise, the algorithm cannot converge to finite values.

bayeselo has the "connect" command that does this. Pick a player, and it will keep only the other players that can be compared to it.

I am not sure if it is sufficient to help ordo to converge, but I am sure it is necessary.

Rémi
Vinvin
Posts: 5228
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: Ordo release (rating software, ELO-like)

Post by Vinvin »

Rémi Coulom wrote:
Vinvin wrote:Thanks, I've ordopreped my big files (removed all moves, 0% and 100%). Ordo still have troubles to converge, I sent you the files (2 MB now).
I've a request : please make an option to remove engines with <5% and >95% results ...
You can have all players at 50% and still have the algorithm diverge. What is necessary to ensure is that every player has at least a path made of wins to any other player, and a path made of losses (a draw can count as a win or as a loss, as needed). Otherwise, the algorithm cannot converge to finite values.

bayeselo has the "connect" command that does this. Pick a player, and it will keep only the other players that can be compared to it.

I am not sure if it is sufficient to help ordo to converge, but I am sure it is necessary.

Rémi
That was not my point :-)
I simply want to remove very low engines ...

Code: Select all

Marquis 0.1.3&#58;  1604.7       1.0      33    3.0%
Pyotr Club Ed 1.0&#58;  1604.7       1.0      32    3.1%
Pyotr Amateur v0.6&#58;  1604.7       1.0      35    2.9%
Geko 0.4.3&#58;  1604.7       3.5     105    3.3%
Xadreco 5.0&#58;  1604.7       0.5      29    1.7%
Numpty 0.22-pr&#58;  1604.7       0.5      81    0.6%