EPD : epd\lc1.epd
Time : 1000ms
    Engine     Points  Used Time   Found  Pos    Elo   Max Score  Score   Time ms  Hash Mb  Cpu  Errors
15  Wasp 4.00  299925  11:45:50.1  20727  40000  2999  400000     74.98%  1000     128      1    0
23  Wasp 3.75  293908  11:44:35.3  20203  40000  2939  400000     73.48%  1000     128      1    0
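For anyone who wants to check the Score column: it appears to be simply Points divided by Max Score. A minimal sketch, assuming that relationship (the function name is mine, not part of the tool):

```python
def score_percent(points: int, max_score: int) -> float:
    """Percentage of the maximum attainable points (assumed: Points / Max Score)."""
    return 100 * points / max_score

# Values copied from the two Wasp rows above.
rows = {
    "Wasp 4.00": (299925, 400000),
    "Wasp 3.75": (293908, 400000),
}

for engine, (points, max_score) in rows.items():
    print(f"{engine}: {score_percent(points, max_score):.2f}%")
```

Both rows reproduce the listed percentages (74.98% and 73.48%). The Max Score of 400000 over 40000 positions also suggests 10 points per position, but that is my inference from the numbers, not something stated in the output.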
It's interesting that the Elo increase prediction is about half of what I was seeing in test matches. But considering the already high position of SlowChess in the list, it looks more like a case of the older versions scoring higher than their match-play performance. My guess is that this type of testing may be more sensitive to eval changes than to search changes?
jonkr wrote: ↑Mon Jun 15, 2020 10:38 pm
It's interesting that the Elo increase prediction is about half of what I was seeing in test matches. But considering the already high position of SlowChess in the list, it looks more like a case of the older versions scoring higher than their match-play performance. My guess is that this type of testing may be more sensitive to eval changes than to search changes?
I am trying to find my way in this unexplored area, but looking at the 250ms | 500ms | 1000ms | 4000ms overview, I am inclined to believe search matters.
One way to look at this type of testing: 100,000 positions at 250ms take about 25 minutes (I am using 20 cores), and 100,000 positions correspond to 100,000 / 50 = 2000 games. Here 50 stands for the average number of moves that matter in a game, and 50 is probably too high already. So 200,000 positions, done within one hour, represent 4000 games.
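The arithmetic can be sketched in a few lines. This is only an illustration: the function names are made up, and the 50 moves per game is the rough figure from the paragraph above, not a measured value.

```python
def think_time_minutes(positions: int, ms_per_position: int, cores: int) -> float:
    """Pure thinking time in minutes, assuming positions are spread evenly over the cores."""
    total_seconds = positions * ms_per_position / 1000
    return total_seconds / cores / 60

def equivalent_games(positions: int, moves_per_game: int = 50) -> int:
    """Number of games the positions stand for, at an assumed count of relevant moves per game."""
    return positions // moves_per_game

print(round(think_time_minutes(100_000, 250, 20), 1))  # 20.8
print(equivalent_games(100_000))                       # 2000
print(equivalent_games(200_000))                       # 4000
```

The pure thinking time comes out at roughly 21 minutes; the gap to the "about 25 minutes" quoted above would presumably be engine start-up and other overhead, which this sketch does not model.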
In my self-tests, Igel 2.5.0 is around 70 Elo stronger at 10+0.1 and around 40+ Elo stronger at 60+0.6, so I am curious how this will hold up in rating lists as well as in the NICE framework predictions.
Looking more closely, I do notice the Elo range is compressed a bit compared with what I get in blitz gauntlets, so that's part of it and to be expected, but it does seem possible that evaluation changes matter more. The upgrade from SlowChess 1.9 to 2.0 was very much about evaluation compared to the later updates, and that is the highest/closest match in the NICE list to my estimates.