Stockfish dev-20240402-0716b845 short "test"

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

Jouni
Posts: 3324
Joined: Wed Mar 08, 2006 8:15 pm

Stockfish dev-20240402-0716b845 short "test"

Post by Jouni »

10% less nps than SF 16.1. Solves 70/110 in HTC110. SF 16.1 got 80. Lost 60 + 0.6 match vs SF 16.1. But good start for further development? I already removed from disc :) .
Jouni
Magnum
Posts: 191
Joined: Thu Feb 04, 2021 10:24 pm
Full name: Arnold Magnum

Re: Stockfish dev-20240402-0716b845 short "test"

Post by Magnum »

Jouni wrote: Tue Apr 02, 2024 7:41 pm 10% less nps than SF 16.1. Solves 70/110 in HTC110. SF 16.1 got 80. Lost 60 + 0.6 match vs SF 16.1. But good start for further development? I already removed from disc :) .
You are so funny, tested with HTC and already removed from disc :lol: :lol: :lol:

As Eduard already mentioned:
Top Chess Engines Testsuite 2024 v2
https://www.mediafire.com/file/cypaz2t0 ... 2.pgn/file
-Stockfish 16.1 (20%) 23/115
-Stockfish 02042024 (45%) 52/115

HTC is very very old and much much weaker.
It's like:
HTC for ~3000 elo engines
TCETv2 for ~4000 elo engines.
ImNotStockfish
Posts: 51
Joined: Tue Sep 14, 2021 12:29 am
Full name: .

Re: Stockfish dev-20240402-0716b845 short "test"

Post by ImNotStockfish »

For a dumb small sample test suite, the 2021 revision of Hard Talkchess is... ok. Top Chess Engines Testsuite v2 is not much better in comparison.
Just combine all (actually correct) positions and solutions from test suites into a big one with thousands of positions instead of a single small one and you guys might actually get usable results :D
Dann Corbit
Posts: 12615
Joined: Wed Mar 08, 2006 8:57 pm
Location: Redmond, WA USA

Re: Stockfish dev-20240402-0716b845 short "test"

Post by Dann Corbit »

Of course, tactical ability is not the same as game play ability.
I don't think it's possible to project one measure from the other accurately.
I will say that I think tactical ability is more important in correspondence chess.
That is because the quiet opening positions have been studied to death and the game actually begins after both engines are fully developed.
Taking ideas is not a vice, it is a virtue. We have another word for this. It is called learning.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.
Werewolf
Posts: 1848
Joined: Thu Sep 18, 2008 10:24 pm

Re: Stockfish dev-20240402-0716b845 short "test"

Post by Werewolf »

Its tactical ability is poor :(

EDIT: though some positions it gets much faster than previously...