SPCC: Testrun of Stockfish 250112 finished

pohl4711
Posts: 2647
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

SPCC: Testrun of Stockfish 250112 finished

Post by pohl4711 »

My UHO-Top15 Ratinglist is the world's first engine rating list to use UHO openings, and the world's first rating list to additionally offer gamepair statistics.

Testrun of Stockfish 250112 finished.

https://www.sp-cc.de

Also take a look at the EAS-Ratinglist, the world's first engine rating list that measures not the strength of engines but the engines' style of play:
https://www.sp-cc.de/eas-ratinglist.htm

(You may have to clear your browser cache with <Ctrl>+<Shift>+<Delete> to reload the graphics/diagrams on my website)
Jouni
Posts: 3533
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Re: SPCC: Testrun of Stockfish 250112 finished

Post by Jouni »

This was a third successive regression? What's happening?
Jouni
Michel
Posts: 2288
Joined: Mon Sep 29, 2008 1:50 am

Re: SPCC: Testrun of Stockfish 250112 finished

Post by Michel »

The Stockfish project has collapsed. The developers will be working on Shashchess in the future.
Ideas=science. Simplification=engineering.
Without ideas there is nothing to simplify.
Ciekce
Posts: 174
Joined: Sun Oct 30, 2022 5:26 pm
Full name: Conor Anstey

Re: SPCC: Testrun of Stockfish 250112 finished

Post by Ciekce »

Michel wrote: Fri Jan 17, 2025 3:02 pm The Stockfish project has collapsed. The developers will be working on Shashchess in the future.
sanest sf-related talkchess post
pohl4711
Posts: 2647
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

Re: SPCC: Testrun of Stockfish 250112 finished

Post by pohl4711 »

Jouni wrote: Fri Jan 17, 2025 9:04 am This was a third successive regression? What's happening?
Regression is a little bit harsh: -2 Celo in a test setup with an error bar of +/-4.
And when comparing 2 results, the combined error bar is sqrt((error1*error1)+(error2*error2))
= sqrt((4*4)+(4*4)) = 5.7 (for comparing 2 results in my test setup)

The best score of Stockfish in my tests was only 4 Celo better than the latest dev, so the regression we see right now is clearly inside the error bar.
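
The error bar arithmetic above can be checked with a minimal sketch. The function names `combined_error` and `within_error` are made up for illustration and are not part of any SPCC tooling; the only inputs taken from the post are the +/-4 error bar per result and the -2 and 4 Celo differences.

```python
import math


def combined_error(error1: float, error2: float) -> float:
    """Error bar for the difference of two results: sqrt(error1^2 + error2^2)."""
    return math.sqrt(error1 * error1 + error2 * error2)


def within_error(diff: float, error1: float, error2: float) -> bool:
    """True if a rating difference lies inside the combined error bar."""
    return abs(diff) <= combined_error(error1, error2)


# Values from the post: each result has an error bar of +/-4.
err = combined_error(4.0, 4.0)
print(round(err, 1))                  # 5.7

# -2 against the previous test run, 4 against the best Stockfish result so far.
print(within_error(-2.0, 4.0, 4.0))   # True
print(within_error(4.0, 4.0, 4.0))    # True
```

Both differences come out smaller than the combined error bar of about 5.7, which is the point made above: this test setup cannot tell these results apart.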