It's used after every passed patch, yes: https://github.com/vondele/matetrack
There is also a zugzwang test suite for patches targeting those positions.
There is no utility in running other test suites: their goal is too similar to real play, so the result will just be noise. If talkchess can prove otherwise using valid statistical methods, it can be explored further. So far, they have simply shown a complete inability to understand sample size, and have posted positions in which multiple moves are winning.
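To make the sample-size point concrete, here is a minimal sketch (not anything taken from matetrack or fishtest) that puts a Wilson score interval around a test suite's solve rate. The suite size and solve counts are hypothetical numbers chosen only for illustration, and `wilson_interval` is a helper name made up for this example.

```python
import math


def wilson_interval(solved: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a solve rate (z = 1.96 gives roughly 95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = solved / total
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return centre - half, centre + half


# Hypothetical results: two engine versions on the same 100-position suite.
old = wilson_interval(60, 100)
new = wilson_interval(65, 100)

# Overlapping intervals are only a rough indication, not a formal test,
# but they show how little 5 extra solved positions out of 100 can tell us.
overlap = old[1] >= new[0] and new[1] >= old[0]
print(f"old: {old[0]:.3f}..{old[1]:.3f}")
print(f"new: {new[0]:.3f}..{new[1]:.3f}")
print("intervals overlap:", overlap)
```

With a suite this small the two intervals overlap heavily, so a handful of extra solved positions cannot be told apart from noise. Since both versions run on the same positions, a paired comparison would be more appropriate in practice; the interval is just the simplest way to show the scale of the uncertainty.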
Future of NNUE is Dimension 3072 network!
Moderator: Ras
-
- Posts: 57
- Joined: Fri Jun 18, 2021 7:54 pm
- Full name: Viren Peanut
-
- Posts: 31
- Joined: Sun Mar 10, 2024 1:56 pm
- Full name: Draude Stoalis
Re: Future of NNUE is Dimension 3072 network!
smatovic wrote: ↑Tue Mar 19, 2024 9:02 am
Viren wrote: ↑Tue Mar 19, 2024 8:53 am
@talkchess guys:
Maybe start to use your brain? We already have a test suite for mates that is used to reject patches:
https://github.com/official-stockfish/S ... 2002504682
Do you use this mate .epd regression test after every passed patch on fish-test?
Then be smart*, it's not meant to measure Elo gain, therefore you have your SPRT self-play.
*testsuites are a moving target.
--
Srdja
Yes, you are completely correct! Multiple times I have refused to merge SF patches into my engines because they perform very badly on my test suite!
See my latest post for the download.

-
- Posts: 3331
- Joined: Wed Mar 10, 2010 10:18 pm
- Location: Hamburg, Germany
- Full name: Srdja Matovic
Re: Future of NNUE is Dimension 3072 network!
Viren wrote: ↑Tue Mar 19, 2024 9:12 am
It's used after every passed patch, yes: https://github.com/vondele/matetrack
There is also a zugzwang test suite for patches targeting those positions.
There is no utility in running other test suites: their goal is too similar to real play, so the result will just be noise. If talkchess can prove otherwise using valid statistical methods, it can be explored further. So far, they have simply shown a complete inability to understand sample size, and have posted positions in which multiple moves are winning.
As you probably already know, creating sound test suites is an art in itself. There are people maintaining them, so maybe give those a try? IIRC there was, for example, STS 1-15 in different iterations. And, sure, you have to re-evaluate them periodically, both the positions and the best moves/scores. My point is, if you tune for Elo (e.g. aggressive pruning), you might lose in some other area, as you should be aware (SF derivatives): winning games, solving puzzles, playing in a particular style, and the like are different goals.
--
Srdja
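
For readers unfamiliar with the SPRT self-play testing mentioned earlier in the thread, the following is a rough sketch of a textbook Wald sequential probability ratio test on win/loss counts. It is not fishtest's actual implementation: draws are ignored, and the Elo bounds, error rates, and function names are assumptions chosen for illustration.

```python
import math


def elo_to_score(elo: float) -> float:
    """Expected score under the standard logistic Elo model."""
    return 1.0 / (1.0 + 10.0 ** (-elo / 400.0))


def sprt_llr(wins: int, losses: int, elo0: float, elo1: float) -> float:
    """Log-likelihood ratio of H1 (elo1) against H0 (elo0), draws ignored."""
    p0, p1 = elo_to_score(elo0), elo_to_score(elo1)
    return wins * math.log(p1 / p0) + losses * math.log((1 - p1) / (1 - p0))


def sprt_decision(llr: float, alpha: float = 0.05, beta: float = 0.05) -> str:
    """Compare the LLR against Wald's stopping bounds."""
    lower = math.log(beta / (1 - alpha))
    upper = math.log((1 - beta) / alpha)
    if llr >= upper:
        return "accept H1"
    if llr <= lower:
        return "accept H0"
    return "continue"


# Hypothetical running totals from a self-play match, testing 0 Elo vs 2 Elo.
llr = sprt_llr(wins=1000, losses=1000, elo0=0.0, elo1=2.0)
print(round(llr, 3), sprt_decision(llr))
```

The point of the sequential test is that it keeps playing games until the evidence crosses one of the two bounds, instead of fixing the number of games in advance.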