lkaufman wrote: ↑Sun Sep 26, 2021 4:35 pm
Rebel wrote: ↑Sun Sep 26, 2021 2:52 pm
Odds match minus pawn f2 - Dragon 2.5 vs an elo pool of 3364 engines
Odds match minus pawn f2 - Dragon 2.5 vs an elo pool of 3364 engines
Time control : 40/120
Games : 700
Results from file all.pgn:
No. Name                Win  Draw  Loss  Unf.  Score  Games      %
------------------------------------------------------------------
  1 Komodo-Dragon 2.5  +225  =306  -169    *0  378.0    700  54.0%
  2 Ethereal 12.75      +31   =50   -19    *0   56.0    100  56.0%
  3 Pedone 3.1          +21   =61   -18    *0   51.5    100  51.5%
  4 Komodo 12           +32   =34   -34    *0   49.0    100  49.0%
  5 Komodo 11           +31   =33   -36    *0   47.5    100  47.5%
  6 Stockfish 8         +25   =34   -41    *0   42.0    100  42.0%
  7 Igel 3.0.5          +13   =53   -34    *0   39.5    100  39.5%
  8 Igel 3.0.0          +16   =41   -43    *0   36.5    100  36.5%
These engines are all rated over 3400 on the CCRL blitz list (or nearly identical versions, like Komodo 11.01), quite a bit higher than your own list average of 3364. I wondered why this was so. On your main page for the gambit rating list you have a comparison with CCRL, but I think you are comparing your blitz ratings to their Rapid ratings; you should be comparing blitz for both. Since CCRL uses BayesElo, which contracts rating differences, I would expect ratings of engines near the top to be lower on their blitz list than on yours, but they are clearly higher! I'm trying to think of an explanation for this. Do you have any idea?
I wouldn't expect your choice of gambit openings to shrink rating differences; that is bizarre.
The level of the elo values in a rating list is defined by anchor engines; for the GRL I use 4 anchor engines to be more or less compatible with the CCRL values. Anchor engines are rock-solid engines that have played thousands of games and thus have a reliable elo. For instance, I use Critter 1.6a as an anchor engine with a fixed elo of 3150, which I borrowed from CCRL 40/15; it currently has 3157. I use Houdini 6 (derivatives come in handy) as an anchor engine at 3400 elo; it currently has 3394. Fruit 2.1 is anchored at 2700 and Nemo at 2850, also borrowed from CCRL 40/15. Now suppose I change the value of Houdini to 3500: the rating list values of the 3400+ engines will go up unrealistically, big time; lowering it to 3300 will have the opposite effect. Meaning, with anchor engines I could create a rating list with SF14 on top at 2000 elo, however... the order remains the same.
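To make the anchoring point concrete, here is a minimal Python sketch. It assumes the simplest possible scheme: the rating tool produces relative ratings where only the differences matter, and anchoring adds one constant offset so that the anchor engine lands on its fixed value. That is an assumption for illustration, not necessarily how the GRL or CCRL tooling handles several anchors at once. The relative numbers, "Engine A", "Engine B", and the helper names `anchor_ratings` and `ranking` are all hypothetical.

```python
def anchor_ratings(relative, anchors):
    """Shift relative ratings by one constant offset.

    The offset is the average gap between each anchor's fixed value and
    its relative rating (assumed scheme, for illustration only).
    """
    offset = sum(anchors[name] - relative[name] for name in anchors) / len(anchors)
    return {name: rating + offset for name, rating in relative.items()}


def ranking(ratings):
    """Return engine names ordered from strongest to weakest."""
    return sorted(ratings, key=ratings.get, reverse=True)


# Hypothetical relative ratings: only the differences carry information.
relative = {
    "Engine A": 250.0,
    "Houdini 6": 100.0,
    "Critter 1.6a": -150.0,
    "Engine B": -300.0,
}

# Anchor Houdini 6 at 3400, then pretend we raise the anchor to 3500.
base = anchor_ratings(relative, {"Houdini 6": 3400.0})
raised = anchor_ratings(relative, {"Houdini 6": 3500.0})

for name in ranking(base):
    print(f"{name:14s} {base[name]:7.1f} {raised[name]:7.1f}")

# Absolute values move with the anchor, the order does not.
print(ranking(base) == ranking(raised))
```

Under this scheme every engine moves by the same amount when the anchor value changes, so the absolute numbers are a convention while the differences and the ranking come from the games.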
Secondly, responding to the part I put in bold, have a look at my research
CCRL vs GRL - a comparison: gambit openings do make sense. If they did not, I would have stopped the GRL a long time ago.
90% of coding is debugging, the other 10% is writing bugs.