Thanks Pali for constructive ideas.Pali wrote: ↑Mon Apr 29, 2024 3:36 pm Perhaps not relevant to this specific subject but I suggested this on my chat with Rebel/Ed Schroder on discord:
- Having a pinned post on testing methodologies used by modern engines. This could link to OpenBench github, FishTest github, JW's SPRT github, Neural Network Trainers etc.
- Explicitly inform users that SF clones (and clones in general) are very likely to be either worse or neutral. It's really easy to contact SF maintainers, information on which SF clones serve a purpose (e.g. I see Huntsman being brought up for mate finding) and which clones do not. Keep in mind that SF is GPLv3, if a clone contributed Elo, the changes would be ported to SF as they have full legal rights to do it.
- Flag SSS testing? Perhaps this outside the scope of talkchess moderation but it's also supposed to be informing to the average reader. I saw someone post 14 wins, 11 losses, 47 draws as evidence that a version is better the other day. Someone who's not experienced could easily fall for this kind of statistic.
Keep in mind, I do not know the limitations of what can and can't be done on talkchess and as such my phrasing may be off or I may be suggesting something that may not be possible.
I am thinking of a sticky thread about chess programming from the ground up divided into 3 sections : newbie | advanced | expert. Something like -
Newbie -
1. I heard a lot good things about Vice.
2. Focus should be on bug-free.
3. Perft
4. Compiler choice
Advanced -
1. Introduction to cutechess
2. introduction to "how many games" to proof a change
Expert -
Introduction to SPSA
And of course an introduction to NNUE.
Can we count on the cooperation of Discord members as a mutual project with proper credit given to the contributors?
------
The other issues then, Talkchess since its existence is a place for every computer chess lover who wants to share and the only limitation is the charter they should keep and we don't want to change that or become the forum for discord. People who play only 100 games know very well that is not enough, they have been told dozen and dozen times during the years. They don't have the hardware you have on OpenBench or on Fishtest and we don't want to take away their pleasure because they have limited hardware.
Then we have the well known CCRL and CEGT rating lists since 2006, even they can't produce the needed games as everybody can see looking at the error bars. Not perfect, but I don't want to miss them. Same problem, not enough computer power like you have. They are very useful for starters since they test every engine.
Derivatives - I am not into the number of legal derivatives, but tell me, has there been one SF derivative SF profited from?