Frank Quisinsky wrote: ↑Tue Sep 26, 2023 7:34 pm
I didn't want to rack my brain all over again, so for my running tournament I simply used my old FEOBOS book once more, without really putting fresh effort into it. With more activity we naturally gain more experience, but in the end we never stop learning and often throw older ways of thinking overboard. That happens to me constantly when I think about it, only then I get a headache, because too many of the adjusting screws I know about interlock and searching for solutions becomes too time-consuming.
Best
Frank
SPCC: Classical Ratinglist will be stopped
Moderator: Ras
-
chessica
- Posts: 1062
- Joined: Thu Aug 11, 2022 11:30 pm
- Full name: Esmeralda Pinto
Re: SPCC: Classical Ratinglist will be stopped
-
bastiball
- Posts: 5372
- Joined: Tue Oct 20, 2020 4:18 am
- Full name: Basti Dangca
Re: SPCC: Classical Ratinglist will be stopped
Good luck for the project. 
Basti Dangca
CCRL testing group
-
mehmet123
- Posts: 697
- Joined: Sun Jan 26, 2020 10:38 pm
- Location: Turkey
- Full name: Mehmet Karaman
Re: SPCC: Classical Ratinglist will be stopped
It's hard to understand why you have this view. I think the test results were very close to reality. There is 51 elo difference between Stockfish Dev and Dragon 3.2, and 50 elo difference between Dragon 3.2 and Berserk 230818. Then, since there are fewer goals in football matches, the home team should play with 11 players and the other team should play with 10 players.
-
Uri Blass
- Posts: 11165
- Joined: Thu Mar 09, 2006 12:37 am
- Location: Tel-Aviv Israel
Re: SPCC: Classical Ratinglist will be stopped
No
Even Stockfish, playing the stronger side, can lose from a biased opening.
See the following game
https://www.chess.com/computer-chess-ch ... s&game=163
That means it can also lose from normal openings, with either color.
-
Frank Quisinsky
- Posts: 7285
- Joined: Wed Nov 18, 2009 7:16 pm
- Location: Gutweiler, Germany
- Full name: Frank Quisinsky
Re: SPCC: Classical Ratinglist will be stopped
Uri, fewer drawn games!
If both engines play such openings with White and with Black, then at the end of the day the Elo is not measured accurately, because engines lose too many points through the unbalanced openings. I have an uneasy feeling about the exact Elo differences between the engines: the strongest engines have an advantage, because for them a draw from the "bad" side of an unbalanced opening is more probable.
But all in all, the differences between the engines (if all of them use the same set of unbalanced openings) are around 95% OK, I think. And with fewer draws and more interesting openings there are more possibilities to measure tactical skill.
That's what I mean ...
Nothing is perfect. Standard opening systems (if we use balanced opening systems from A00-E99) are often boring. Picking the best of A00-E99 can be a way, but it is complicated: it means working out a "best of A00-E99" from many engine games.
-
pohl4711
- Posts: 2918
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: SPCC: Classical Ratinglist will be stopped
mehmet123 wrote: ↑Wed Sep 27, 2023 2:06 pm
It's hard to understand why you have this view. I think the test results were very close to reality. There is 51 elo difference between Stockfish Dev and Dragon 3.2, and 50 elo difference between Dragon 3.2 and Berserk 230818. Then, since there are fewer goals in football matches, the home team should play with 11 players and the other team should play with 10 players.
Example from my SPCC-Ratinglist: 5 Berserk 230818 avx2 : 3725 15000 (+2276,=11426,-1298), 53.3 %
11426 draws by Berserk in 15000 games... more than 76% draws.
And the other top-10 engines have a similar draw ratio. This is simply bad, because it shrinks the Elo spread between these engines. Because other rating lists test in a very similar way, their Elo spreads are very similar. But that does not mean this is "close to reality". What is reality in Elo? When all rating lists run their tests with such a high draw ratio, the results are simply bad and the Elo spread shrinks. When this happens in all rating lists, it is still bad, even though it then seems to be normal, or the "reality". But that is definitely no reason for me not to do it better.
Using UHO openings (and choosing the right UHO set; I made a lot of different UHO sets with different advantages for White), the draw ratio is around 50%. That improves all statistics and results and normalizes the Elo spread, undoing the compression we see today at the top of the rating lists.
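For anyone who wants to check the arithmetic, here is a rough Python sketch. It recomputes the draw ratio and score percentage from the Berserk line quoted above, and converts a score into an Elo difference with the standard logistic formula. That formula is an assumption on my side; the rating list's own tooling may compute ratings differently. The function `score_with_draw_ratio` is purely hypothetical: it assumes the win:loss proportion stays the same when the draw share changes, which real engines need not obey. It only illustrates why a lower draw ratio stretches the Elo spread.

```python
import math


def match_stats(wins, draws, losses):
    """Return (games, draw_ratio, score) for a win/draw/loss record."""
    games = wins + draws + losses
    draw_ratio = draws / games
    score = (wins + 0.5 * draws) / games
    return games, draw_ratio, score


def elo_from_score(score):
    """Elo difference implied by a score fraction (standard logistic model).

    Assumption: this is the textbook formula, not necessarily the one
    used by any particular rating list.
    """
    return -400.0 * math.log10(1.0 / score - 1.0)


def score_with_draw_ratio(wins, losses, games, draw_ratio):
    """Hypothetical score if the draw share changed but the win:loss
    proportion among decisive games stayed the same (an assumption)."""
    decisive = games * (1.0 - draw_ratio)
    new_wins = decisive * wins / (wins + losses)
    return (new_wins + 0.5 * games * draw_ratio) / games


# Berserk 230818 avx2 record from the post: +2276 =11426 -1298
games, draw_ratio, score = match_stats(2276, 11426, 1298)
elo_actual = elo_from_score(score)

# Same win:loss proportion, but with a 50% draw ratio (hypothetical)
score_hyp = score_with_draw_ratio(2276, 1298, games, 0.50)
elo_hyp = elo_from_score(score_hyp)

print(f"games={games} draw_ratio={draw_ratio:.1%} score={score:.1%}")
print(f"Elo vs. opposition at actual draw ratio: {elo_actual:.1f}")
print(f"Elo vs. opposition at 50% draws (hypothetical): {elo_hyp:.1f}")
```

The first line of output reproduces the "more than 76% draws" and 53.3 % figures. The comparison of the two Elo values shows the compression effect under the stated assumption only.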
-
chessica
- Posts: 1062
- Joined: Thu Aug 11, 2022 11:30 pm
- Full name: Esmeralda Pinto
Re: SPCC: Classical Ratinglist will be stopped
Oh no!!!
Example: in the 100 m sprint, the time is measured on reaching the finish line, right?
And NOT:
the time measured between 30 m and 40 m?
Or running on one leg?