SPCC: Classical Ratinglist will be stopped

chessica
Posts: 1062
Joined: Thu Aug 11, 2022 11:30 pm
Full name: Esmeralda Pinto

Re: SPCC: Classical Ratinglist will be stopped

Post by chessica »

Frank Quisinsky wrote: Tue Sep 26, 2023 7:34 pm

I didn't want to rack my brain all over again, so for my current tournament I simply reused my old FEOBOS book without really putting in the effort a second time. With more activity we naturally gain more experience, but in the end we never stop learning and often throw older ways of thinking overboard. That happens to me all the time when I think about it, except that I then get a headache: too many adjusting screws that I know about interact with one another, and searching for solutions becomes too time-consuming.

Best
Frank
:) :) :)
bastiball
Posts: 5372
Joined: Tue Oct 20, 2020 4:18 am
Full name: Basti Dangca

Re: SPCC: Classical Ratinglist will be stopped

Post by bastiball »

Good luck for the project. :D
Basti Dangca
CCRL testing group
mehmet123
Posts: 697
Joined: Sun Jan 26, 2020 10:38 pm
Location: Turkey
Full name: Mehmet Karaman

Re: SPCC: Classical Ratinglist will be stopped

Post by mehmet123 »

pohl4711 wrote: Tue Sep 26, 2023 8:15 am Important News: I decided to stop doing my SPCC-Ratinglist, because the balanced openings, used for this ratinglist, are not working anymore in these days of superstrong engines....
It's hard to understand why you hold this view. I think the test results were very close to reality. There is a 51 Elo difference between Stockfish Dev and Dragon 3.2, and a 50 Elo difference between Dragon 3.2 and Berserk 230818. By that logic, since there are few goals in football matches, the home team should play with 11 players and the other team with 10.
Uri Blass
Posts: 11165
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: SPCC: Classical Ratinglist will be stopped

Post by Uri Blass »

Jouni wrote: Tue Sep 26, 2023 8:50 am Sad news :( . Chess is solved.
No.
Even Stockfish, playing the stronger side, can lose from a biased opening.

See the following game
https://www.chess.com/computer-chess-ch ... s&game=163

That means it can also lose from normal openings with either color.
Frank Quisinsky
Posts: 7285
Joined: Wed Nov 18, 2009 7:16 pm
Location: Gutweiler, Germany
Full name: Frank Quisinsky

Re: SPCC: Classical Ratinglist will be stopped

Post by Frank Quisinsky »

Uri, fewer drawn games!

If both engines play such openings with White and with Black, then at the end of the day the Elo is not measured accurately, because engines lose too many points in unbalanced openings. The Elo differences between the engines come with the uneasy gut feeling that the strongest engines have an advantage, because for them a draw from a "bad" unbalanced opening is more probable.

But all in all, the differences between the engines (if all of them use the same set of unbalanced openings) are around 95% OK, I think. And with fewer draws and interesting openings there are more possibilities to measure tactical skills.

That's what I mean ...
Nothing is perfect. Standard opening systems (if we use balanced opening systems from A00-E99) are often boring. Picking the best of A00-E99 could be a way, but it is complicated: it means working out a "best of A00-E99" from many engine games.
pohl4711
Posts: 2918
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

Re: SPCC: Classical Ratinglist will be stopped

Post by pohl4711 »

mehmet123 wrote: Wed Sep 27, 2023 2:06 pm
pohl4711 wrote: Tue Sep 26, 2023 8:15 am Important News: I decided to stop doing my SPCC-Ratinglist, because the balanced openings, used for this ratinglist, are not working anymore in these days of superstrong engines....
It's hard to understand why you hold this view. I think the test results were very close to reality. There is a 51 Elo difference between Stockfish Dev and Dragon 3.2, and a 50 Elo difference between Dragon 3.2 and Berserk 230818. By that logic, since there are few goals in football matches, the home team should play with 11 players and the other team with 10.
Example from my SPCC-Ratinglist: 5 Berserk 230818 avx2 : 3725 15000 (+2276,=11426,-1298), 53.3 %
11426 draws by Berserk in 15000 games... more than 76% draws.
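
The arithmetic behind those figures can be checked with a short sketch. It takes the win/draw/loss counts quoted above and derives the draw ratio and the score percentage. The Elo figure at the end uses the standard logistic Elo expectation formula as an assumption for illustration; it is not necessarily how the SPCC list computes its ratings, and the function name `summarize` is made up for this example.

```python
import math

def summarize(wins, draws, losses):
    """Return (draw_ratio, score, elo_diff) for a win/draw/loss record.

    elo_diff uses the textbook logistic model, score = 1 / (1 + 10^(-d/400)).
    That model is an assumption here, not the rating list's own method.
    """
    games = wins + draws + losses
    draw_ratio = draws / games
    score = (wins + 0.5 * draws) / games
    elo_diff = -400 * math.log10(1 / score - 1)
    return draw_ratio, score, elo_diff

# Berserk 230818 line quoted above: +2276 =11426 -1298 in 15000 games
draw_ratio, score, elo_diff = summarize(2276, 11426, 1298)
print(f"draws: {draw_ratio:.1%}  score: {score:.1%}  "
      f"implied Elo vs. average opposition: {elo_diff:+.1f}")
```

The draw ratio comes out a little above 76% and the score at 53.3%, matching the quoted line.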
And the other top-10 engines have a similar draw ratio. This is simply bad, because it shrinks the Elo spread between these engines. Because other rating lists test in a very similar way, their Elo spreads are very similar. But that does not mean this is "close to reality". What is reality in Elo? When all rating lists run their tests with such a high draw ratio, the results are still bad and the Elo spread still shrinks; it only looks normal, or like "reality", because it happens everywhere. That is definitely no reason for me not to do it better.
With UHO openings, and with the right UHO set (I made many different UHO sets with different advantages for White), the draw ratio is around 50%. That improves all the statistics and results and restores the Elo spread from the compression we see today at the top of the rating lists.
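
A rough sketch of why a lower draw ratio widens the measured Elo gap. It makes one strong, purely illustrative assumption: the split of decisive games (wins versus losses) stays the same while only the draw ratio changes. Real UHO results need not behave this way. The helper names are hypothetical, and the Elo conversion is the standard logistic formula, not the rating list's own calculation.

```python
import math

def elo_from_score(score):
    """Elo difference implied by a score fraction under the logistic model."""
    return -400 * math.log10(1 / score - 1)

def score_at_draw_ratio(decisive_win_share, draw_ratio):
    """Score fraction if draw_ratio of the games are drawn and the remaining
    decisive games are won with probability decisive_win_share (assumption)."""
    return 0.5 * draw_ratio + (1 - draw_ratio) * decisive_win_share

# Decisive-game split taken from the Berserk line above (+2276, -1298)
wins, losses = 2276, 1298
decisive_win_share = wins / (wins + losses)

for draw_ratio in (0.76, 0.50):
    s = score_at_draw_ratio(decisive_win_share, draw_ratio)
    print(f"draw ratio {draw_ratio:.0%}: score {s:.1%}, "
          f"implied Elo gap {elo_from_score(s):+.1f}")
```

Under that assumption, the same engine shows a larger implied Elo gap at a 50% draw ratio than at 76%, which is the "shrinking" effect described above.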
chessica
Posts: 1062
Joined: Thu Aug 11, 2022 11:30 pm
Full name: Esmeralda Pinto

Re: SPCC: Classical Ratinglist will be stopped

Post by chessica »

Oh no!!!

Take the 100m sprint: the time is measured when the runner reaches the finish line, right?

And NOT:

by measuring the time between 30m and 40m?

Or by running on one leg?