Stockfish has become the real world champion more than a dozen times. But it will indeed probably never become "champion" of a bogus competition with operators, books and an 8-game sample size.
ICGA WCCC and WCSC in Santiago de Compostela
Moderator: Ras
-
- Posts: 223
- Joined: Tue Apr 09, 2024 6:24 am
- Full name: Michael Chaly
Re: ICGA WCCC and WCSC in Santiago de Compostela
-
- Posts: 28315
- Joined: Fri Mar 10, 2006 10:06 am
- Location: Amsterdam
- Full name: H G Muller
Re: ICGA WCCC and WCSC in Santiago de Compostela
The point is that Stockfish is only strong on average, because that is what Elo is, an average. It is not reliably strong. Therefore it has a significant probability of losing over a small number of games. And of course the Elo advantage over its competitors is so small that you would need many thousands of games to see it as an average.
The Stockfish developers don't like that. So they only want to participate in tournaments that measure average performance over an insanely large number of games. Because they know they stand little chance of winning a tournament over 8 games.
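To put rough numbers on the sample-size point, here is a minimal sketch using the standard logistic Elo formula. The 10-Elo gap, the 60% draw rate and the helper names are assumptions chosen for illustration, not figures from this thread, and games are treated as independent.

```python
import math

def expected_score(elo_diff: float) -> float:
    """Expected score per game for the stronger side (logistic Elo model)."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))

def games_needed(elo_diff: float, draw_rate: float, sigmas: float = 2.0) -> int:
    """Games needed before the expected lead exceeds `sigmas` standard errors.

    Assumes independent games and a fixed draw rate (both simplifications).
    """
    mu = expected_score(elo_diff)
    # score = P(win) + 0.5 * P(draw), so P(win) = mu - draw_rate / 2
    p_win = mu - draw_rate / 2.0
    # variance of a single game's score, which takes values 0, 0.5 or 1
    var = p_win * 1.0 + draw_rate * 0.25 - mu ** 2
    margin = mu - 0.5
    return math.ceil(sigmas ** 2 * var / margin ** 2)

if __name__ == "__main__":
    # Hypothetical inputs: 10 Elo advantage, 60% draws.
    print(games_needed(10, 0.6))
```

Under those assumed inputs the result is on the order of two thousand games, against the 8 games of the tournament format being discussed.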
-
- Posts: 43483
- Joined: Sun Feb 26, 2006 10:52 am
- Location: Auckland, NZ
Re: ICGA WCCC and WCSC in Santiago de Compostela
hgm wrote: ↑Fri Apr 19, 2024 10:05 pm
"The point is that Stockfish is only strong on average, because that is what Elo is, an average. It is not reliably strong. Therefore it has a significant probability of losing over a small number of games. And of course the Elo advantage over its competitors is so small that you would need many thousands of games to see it as an average.
The Stockfish developers don't like that. So they only want to participate in tournaments that measure average performance over an insanely large number of games. Because they know they stand little chance of winning a tournament over 8 games."
I think that you're underestimating Stockfish in the general context, but you're right that in a tournament based on 8 games, there is a slim chance that it might not win.
gbanksnz at gmail.com
-
- Posts: 3675
- Joined: Thu Jun 07, 2012 11:02 pm
Re: ICGA WCCC and WCSC in Santiago de Compostela
hgm wrote: ↑Fri Apr 19, 2024 10:05 pm
"The point is that Stockfish is only strong on average, because that is what Elo is, an average. It is not reliably strong. Therefore it has a significant probability of losing over a small number of games. And of course the Elo advantage over its competitors is so small that you would need many thousands of games to see it as an average.
The Stockfish developers don't like that. So they only want to participate in tournaments that measure average performance over an insanely large number of games. Because they know they stand little chance of winning a tournament over 8 games."
I agree. They also like tightly controlled conditions like CCC and TCEC with same books for all engines (slightly unbalanced too), same hardware etc and a long run of games where they have a chance to get slightly more wins to tip the scales. It depends what other engines enter of course, but there is a very real chance Stockfish would not win. With freedom around books (which have a big impact) and hardware etc, they can't control the conditions and they don't like it. I get the impression that they don't enter because they are scared they will not win. Maybe I'm wrong on that but it is the feeling I get.
-
- Posts: 3675
- Joined: Thu Jun 07, 2012 11:02 pm
Re: ICGA WCCC and WCSC in Santiago de Compostela
Either that, the concern about the slightly random nature of the tournament, or else they just consider it completely irrelevant - like most of us.
-
- Posts: 30
- Joined: Wed Dec 01, 2021 12:23 pm
- Full name: Doruk Sekercioglu
Re: ICGA WCCC and WCSC in Santiago de Compostela
Modern Times wrote: ↑Fri Apr 19, 2024 10:52 pm
hgm wrote: ↑Fri Apr 19, 2024 10:05 pm
"The point is that Stockfish is only strong on average, because that is what Elo is, an average. It is not reliably strong. Therefore it has a significant probability of losing over a small number of games. And of course the Elo advantage over its competitors is so small that you would need many thousands of games to see it as an average.
The Stockfish developers don't like that. So they only want to participate in tournaments that measure average performance over an insanely large number of games. Because they know they stand little chance of winning a tournament over 8 games."
"I agree. They also like tightly controlled conditions like CCC and TCEC with same books for all engines (slightly unbalanced too), same hardware etc and a long run of games where they have a chance to get slightly more wins to tip the scales. It depends what other engines enter of course, but there is a very real chance Stockfish would not win. With freedom around books (which have a big impact) and hardware etc, they can't control the conditions and they don't like it. I get the impression that they don't enter because they are scared they will not win. Maybe I'm wrong on that but it is the feeling I get."
"with same books for all engines (slightly unbalanced too)"
This means fair conditions (game pairs)
"same hardware etc"
This means fair conditions
"and a long run of games"
This means more statistical significance
"With freedom around books (which have a big impact) and hardware etc"
Freedom around books means -> the largest book, made with the strongest engine and the most search, is the best -> you are likely still getting a draw fest.
Freedom around hardware -> this should simply not happen: a weaker engine running on 256 cores would be stronger than an otherwise stronger engine running on 1 core. It's the software competing, not the hardware. The one exception I can think of is cases like Leela, where hardware requirements are wildly different.
-
- Posts: 30
- Joined: Tue Mar 26, 2024 8:21 pm
- Full name: Lyndon S. Sears
Re: ICGA WCCC and WCSC in Santiago de Compostela
hgm wrote: ↑Fri Apr 19, 2024 10:05 pm
"The point is that Stockfish is only strong on average, because that is what Elo is, an average. It is not reliably strong. Therefore it has a significant probability of losing over a small number of games. And of course the Elo advantage over its competitors is so small that you would need many thousands of games to see it as an average.
The Stockfish developers don't like that. So they only want to participate in tournaments that measure average performance over an insanely large number of games. Because they know they stand little chance of winning a tournament over 8 games."
Then tell me what your 8-game tournament is supposed to measure? CPU noise?
-
- Posts: 223
- Joined: Tue Apr 09, 2024 6:24 am
- Full name: Michael Chaly
Re: ICGA WCCC and WCSC in Santiago de Compostela
Last year's software tournament was won by what, Ginkgo (Fritz)?
No offence to its author, he probably knows it himself, but this engine is nowhere near the caliber needed to win any engine tournament... Well, apart from the abomination called WCCC/WCSC. If you play 10k games of it vs Stockfish/Torch, I doubt it will win a single game pair on average, regardless of the book used.
And you want SF to participate in this garbage, making it look legitimate? "Hey, look, great tournament, even Stockfish participates there"?
How about no.
There is nothing bad in hanging out and drinking beer or whatever with a bunch of devs, but an idiotic format that is two decades out of date, with the pompous name "World tournament" while really being a garage insider competition, is not needed to do that.
-
- Posts: 28315
- Joined: Fri Mar 10, 2006 10:06 am
- Location: Amsterdam
- Full name: H G Muller
Re: ICGA WCCC and WCSC in Santiago de Compostela
Antihelion wrote: ↑Sat Apr 20, 2024 12:02 am
"Then tell me what your 8-game tournament is supposed to measure? CPU noise?"
How reliably the engine performs?
-
- Posts: 1945
- Joined: Tue Apr 19, 2016 6:08 am
- Location: U.S.A
- Full name: Andrew Grant
Re: ICGA WCCC and WCSC in Santiago de Compostela
hgm wrote: ↑Sat Apr 20, 2024 6:34 am
Antihelion wrote: ↑Sat Apr 20, 2024 12:02 am
"Then tell me what your 8-game tournament is supposed to measure? CPU noise?"
"How reliably the engine performs?"
Can you tell me how reliably my coin works when I flip it 8 times?
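The coin-flip analogy can be worked out exactly. This is a minimal sketch with the binomial distribution, assuming independent flips of a fair coin; the helper names are hypothetical.

```python
from math import comb

def binom_pmf(n: int, k: int, p: float) -> float:
    """Probability of exactly k heads in n independent flips with P(heads) = p."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

def prob_at_least(n: int, k: int, p: float) -> float:
    """Probability of k or more heads in n flips."""
    return sum(binom_pmf(n, i, p) for i in range(k, n + 1))

if __name__ == "__main__":
    one_sided = prob_at_least(8, 6, 0.5)  # 6, 7 or 8 heads
    two_sided = 2 * one_sided             # 6-2 or more lopsided, either direction
    print(f"{one_sided:.3f} {two_sided:.3f}")
```

For a fair coin, 6 or more heads in 8 flips has probability 37/256, about 14%, and a split of 6-2 or more lopsided in either direction has probability 74/256, about 29%. So 8 flips say very little about whether the coin is fair.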