Albert Silver wrote: ↑Thu May 10, 2018 6:52 pm
I have not used tactical tests in many years for engines. Admittedly, this is because they got too strong for them as a rule. I'll take a look at WAC and prune out all the multi-solutions, and see what is left. I'm guessing some 200 out of 300 will survive.
I normally don't use suites by myself neither, as I wrote already too, I prefer testing engine- strength position- dependent only always also, just because any statistical measurment itself depends on the single test- positions, and as for eng-eng-games it's just the same.
Letting engines play against each other from certain starting positions, even from very early ones, always tests the starting-positions too, the nearer to the single one initial chess position the test positions get, no matter if from sets or books, the nearer to a test of this single initial position and the bookless match it gets.
If you prune out all the solutions of any tactical test suite you want to make it more selective for your special measurement-aim, you'll get the more selective results, finally ending up at more and more positional- depending testing of course.
Thanks in advance, if you do this work for WAC, still it's the only one suite I know, easy enough for LC0.
As a matter of fact, I didn't do that work on my own as for more then only a few positions of it neither, because it's simply a very old classical one, of course discussed very much and often since quite a long time.
Let's then talk about this and that one position, you doubt or you are sure about, and see, if there won't me or anybody else still find some of them, in which LC0 maybe even "succeeds", but yet maybe for the wrong reasons, giving output- lines with incorrect moves in follow- up plies or quite wrong evals.
That's the way I like to discuss engines' achievements as for certain single positions compared to other engines, and then you can use pracitically any single position of interest, as for tactical or as for "positional" play, if any such differentiation is still any good for nowadays engines at all anyhow.
And then you don't need to stick to single best move positions of undoubtable unique "solution" at all, but then you yet have to define your standards and measurements as for each and any position in question yet still.
Overall playing strenght, tactical or "positional," is an Illusion anyhow in any kind of testing, rated in Elo (Celo) out of engine-engine-games or in numbers of positions of a suite solved or not solved.
I like to call it Elosion.

Peter.