The only way to scam this stuff more would be to test 50 cores vs 1, but this is a bit of an overkill I guess.

Moderator: Ras
1. Hit the report button if you feel insulted, we will look into it then.
On an unrelated note: I feel there is also an issue with engine naming itself, that goes two ways:
i agree with that a long time ago, all engines must have the same conditions and hardware, otherwise its not fair comprasion between the real software strength of the engines.Pali wrote: ↑Fri Apr 19, 2024 11:43 pm"with same books for all engines (slightly unbalanced too)"Modern Times wrote: ↑Fri Apr 19, 2024 10:52 pmI agree. They also like tightly controlled conditions like CCC and TCEC with same books for all engines (slightly unbalanced too), same hardware etc and a long run of games where they have a chance to get slightly more wins to tip the scales. It depends what other engines enter of course, but there is a very real chance Stockfish would not win. With freedom around books (which have a big impact) and hardware etc, they can't control the conditions and they don't like it. I get the impression that they don't enter because they are scared they will not win. Maybe I'm wrong on that but it is the feeling I get.hgm wrote: ↑Fri Apr 19, 2024 10:05 pm The point isthat Stockfish is only strong on average, because that is what Elo is, an average. It is not reliably strong. Therefore it has a significant probability to loose, in a small number of games. And of course the Elo advantage on the competitors is so small that you would need many thousands of games to see it as an average.
The Stockfish developers don't like that. So they only want to participate in tournaments that measures average performance over an insanely large number of games. Because they know they only stand little chance to win a tournament over 8 games.
This means fair conditions (game pairs)
"same hardware etc"
This means fair conditions
"and a long run of games"
This means more statistical significance
"With freedom around books (which have a big impact) and hardware etc"
Freedom around books means -> largest book made with the strongest engine with the most amount of search is the best -> you are likely still getting a draw fest.
Freedom around hardware -> this should simply not happen, a weaker engine running on 256 cores would be stronger than an otherwise stronger engine running on 1 core. It's the software competing, not the hardware. The one exception I can think of is cases like Leela where hardware requirements are wildly different.