mar wrote: ↑Thu Apr 15, 2021 8:11 pm
I tried to replace my texel-tuned HCE with PeSTO and immediately lost ~200 elo, which means (considering my engine is slightly below PeSTO in CCRL) that PeSTO's search is at least 200 elo better than mine.
I don't know what HCE means.

Hand-Crafted-Evaluation? Highly-Complex-Evaluation? I suppose you mean you have a costly but precise evaluation that combines PSTs with other evaluation terms. For example it looks at pawn structure, mobility and just has a lot of "chess knowledge" implemented to evaluate the board?
Ronald wrote: ↑Thu Apr 15, 2021 9:00 pm
If I remember correctly I used the "quiet-labeled.epd" created by the Zurichess developper for the tuning proces.
Training only on quiet positions is really clever. I think I
found them and have started to use that data for my own training. However there are different versions! I assume you used the smallest one? Because it seems like I managed to improve the PeSTO tables by tuning them on the v7 set.
Ronald wrote: ↑Thu Apr 15, 2021 9:00 pm
The Texel tuning process will normally find a local minimum of the error, so there are multiple tuning results possible depending on initial values/sequence of PST values. These different results will normally also result in different "strength". So for getting "optimal" results, you will probably have to play with initial values of the PST's and or sequence of elements.
I'm currently trying to grow PSTs from the default values 100, 300, 300, 500, 900 for both midgame and endgame as initial values. There's no randomness involved but my tuner is semi-automatic: you can interact with it using a command line interface and tune the tables step by step. For example I can chose to only tune the MG or EG table (no interpolation) on a subset of the training set. Or only improve a value if the mean-squared-error changes beyond a specific threshold. I can also dump to a file edit per hand and reload. It gives me a lot of control but I don't know what I'm even doing

so there's a lot of trial and error and it takes a long time... but it's kinda fun. If I stumble upon noteworthy results I'll post them here!