Total: 60000 W: 16017 L: 14694 D: 29289
Ptnml(0-2): 86, 6160, 16212, 7429, 113
nElo: 15.68 ± 2.8 (95%) PairsRatio: 1.21
Tested at 60+0.6, 1 thread. Surprising.
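For anyone wondering how the PairsRatio and nElo figures relate to the Ptnml line, here is a minimal sketch. It assumes the commonly used definition of normalized Elo (mean score advantage divided by the per-game standard deviation, scaled by 800/ln 10) and treats the five Ptnml buckets as game pairs scoring 0, 0.5, 1, 1.5 and 2 points. The function names are my own, not Fishtest's, and this is not taken from the Fishtest source. Under these assumptions it should reproduce the numbers in the post.

```python
import math

# Pentanomial counts copied from the Ptnml(0-2) line: number of game pairs
# scoring 0, 0.5, 1, 1.5 and 2 points.
ptnml = [86, 6160, 16212, 7429, 113]


def pairs_ratio(counts):
    """Won pairs (1.5 or 2 points) divided by lost pairs (0 or 0.5 points)."""
    return (counts[3] + counts[4]) / (counts[0] + counts[1])


def normalized_elo(counts):
    """Return (nElo, 95% half-width) under the assumed definition."""
    pairs = sum(counts)
    # Pair score rescaled to a per-game score between 0 and 1.
    scores = [0.0, 0.25, 0.5, 0.75, 1.0]
    mean = sum(s * c for s, c in zip(scores, counts)) / pairs
    var_pair = sum(c * (s - mean) ** 2 for s, c in zip(scores, counts)) / pairs
    # A pair is the average of two games, so the per-game variance is twice
    # the variance of the rescaled pair score.
    sigma_game = math.sqrt(2.0 * var_pair)
    scale = 800.0 / math.log(10.0)
    nelo = scale * (mean - 0.5) / sigma_game
    # Standard error of (mean - 0.5) / sigma_game is about 1 / sqrt(games).
    half_width = 1.96 * scale / math.sqrt(2 * pairs)
    return nelo, half_width


nelo, ci = normalized_elo(ptnml)
print(f"PairsRatio: {pairs_ratio(ptnml):.2f}")
print(f"nElo: {nelo:.2f} +/- {ci:.1f} (95%)")
```

Note that the error bar depends only on the number of games, which is why nElo is convenient for comparing runs with different draw rates.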

#  PLAYER                : RATING ERROR PLAYED     W     D     L   (%) CFS(%)
1  Stockfish 231222 avx2 :   3859    14   7500  6413   964   123  91.9    100
2  Stockfish 16 230630   :   3821  ----   7500  6230  1067   203  90.2    100
3  Torch 1 popavx2       :   3666    14   7500  5339  1517   644  81.3    100
4  KomodoDragon 3.3 avx2 :   3557    14   7500  4575  1849  1076  73.3    100
5  Berserk 12 avx2       :   3482    14   7500  3988  2069  1443  67.0    100
6  Ethereal 14.25 nnue   :   3343    14   7500  2710  2549  2241  53.1    100
7  Caissa 1.15 avx2      :   3276    14   7500  2094  2676  2730  45.8     82
8  RubiChess 230918 avx2 :   3272    14   7500  2063  2665  2772  45.3    100
9  CSTal 2.0 avx2        :   3200    14   7500  1437  2694  3369  37.1     56
10 Obsidian 9.0 avx2     :   3200    14   7500  1408  2740  3352  37.0    100
11 Clover 6.1 avx2       :   3177    14   7500  1280  2617  3603  34.5    100
12 Koivisto 9.2 avx2     :   3160    14   7500  1198  2495  3807  32.6    100
13 Rebel EAS avx2        :   3134    15   7500   986  2495  4019  29.8    100
14 Seer 2.7.0 avx2       :   3121    14   7500   872  2516  4112  28.4     67
15 RofChade 3.1 avx2     :   3119    14   7500   867  2494  4139  28.2    100
16 Uralochka 3.40a avx2  :   3084    14   7500   677  2319  4504  24.5    ---
-------------------------------------------------------------------
--- Number of all Gamepairs            : 60000
--- Number of drawn Gamepairs overall  : 17863 (= 29.77%)
--- Number of 1:1 drawn Gamepairs      :  8411 (= 14.02%)
--- Number of 2-draws drawn Gamepairs  :  9452 (= 15.75%)
-------------------------------------------------------------------
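To make the gamepair summary easier to follow, here is a small sketch of how a pair of games could be rescored into one result. It assumes a gamepair is two games from the same opening with colours reversed, that "1:1 drawn" means each engine won one game, and that "2-draws" means both games were drawn. The function name and labels are hypothetical, not taken from the rescoring tool actually used.

```python
def rescore_pair(first: float, second: float) -> str:
    """Collapse two game results (1, 0.5 or 0 from one engine's view) into one."""
    total = first + second
    if total > 1.0:
        return "win"
    if total < 1.0:
        return "loss"
    # Exactly one point out of two: either two draws or one win each.
    return "draw (2 draws)" if first == 0.5 else "draw (1:1)"


# Totals copied from the summary lines of the rating list.
all_pairs = 60000
drawn_one_each = 8411
drawn_two_draws = 9452
drawn_total = drawn_one_each + drawn_two_draws

print(rescore_pair(1.0, 0.5))   # one win and one draw counts as a won pair
print(rescore_pair(1.0, 0.0))   # one win each counts as a drawn pair
print(f"{drawn_total} drawn pairs = {100 * drawn_total / all_pairs:.2f}%")
```

Only about 30% of pairs end drawn under this scoring, even though a large share of the individual games are draws.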
As it gets harder and harder to achieve Elo gains at this level, perhaps they should make that policy +50 normalised Elo.
90% of the openings are sidelines and game-pairs consist of 500 ultra-fast games. I prefer NCM, which gives much more realistic progress tracking.
pohl4711 wrote: Sun Dec 31, 2023 10:47 am
Right now, the policy is to release a new full version of Stockfish if it beats the latest officially released SF by 100 normalized Elo in the Fishtest progression test run. Normalized Elo is a bit complicated, but Gamepair-Elo progress is quite similar to normalized Elo numbers...
And according to my Gamepair-rescored UHO-Top15 Ratinglist, Stockfish 231222 is +38 Gamepair-Elo stronger than SF 16.
So 1/3 of the way to SF 17 is done.
Head-to-Head Gamepair-result of Stockfish 231222 vs SF 16 is:
500 ( 138+, 273=, 89-), 54.9% : +38 (Gamepair-)Elo
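As a cross-check on the head-to-head line, here is a short sketch that turns the gamepair record into a score percentage and then into an Elo difference with the plain logistic formula. The function names are my own, and the logistic formula is an assumption about how one might convert the score, not necessarily how the list was computed. It gives roughly +34 for this single match, a little below the +38 quoted, which matches the rating-list gap of 3859 - 3821. Presumably the list ratings are fitted over all opponents, so the two figures need not agree exactly.

```python
import math


def score_fraction(wins: int, draws: int, losses: int) -> float:
    """Score as a fraction of the maximum, counting a draw as half a point."""
    return (wins + 0.5 * draws) / (wins + draws + losses)


def logistic_elo(score: float) -> float:
    """Elo difference implied by an expected score under the logistic model."""
    return -400.0 * math.log10(1.0 / score - 1.0)


# Gamepair record of Stockfish 231222 vs SF 16, copied from the line above.
score = score_fraction(138, 273, 89)
elo = logistic_elo(score)
print(f"score: {100 * score:.1f}%")
print(f"logistic Elo difference: {elo:+.1f}")
```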