Laskos wrote: There are still problems with suites and books. For example some suites could favour one engine, because it likes those openings.
This is very, very theorectical, or speculative, especially if the variations are repeated with switched sides. Most important is that the vast majority of positions where the variations end, is fairly balanced.
Sedat Canbaz offers five sets of different size (top-10 to top-200), and I think he has selected the variations by frequency. I think that is an
objective and independant criteria. So, even if a particular engine would "like" that set more than other sets, then it doesn't spoil the significance of a test IMO, because these are simply "realistic" openings. An engine which is good at that in the lab, will also be good in the wild.
http://www.sedatchess.com/download.html
(just as example; I don't doubt that other sets are also good and suitable)
Also, the conclusion that an engine "(dis-)likes" a particular book or opening suite, requires that a
trusted result or rating already exists, to compare with. But I have doubts in an approach which uses such a comparision for test design. If I finish this thought, it would mean to say, my tests are wrong as long as they produce other results

That is certainly wrong.
As for the openings in tests, I think variety is required (no sicilian thematic tournament), and a certain quality. Just nothing extreme. That should be sufficient. - Over time, I've changed my mind from (very) short to medium deep test books.
I dislike "no book" tests very much, except as experiment, but then one shouldn't draw conclusions about general playing strength, from such a test (at least not from the results only). I have also seen tests and even ratings being published, based on only one single opening variation which was used for all games. That is absurd.
I also doubt somewhat the idea of permanent book tuning,
if conclusions about engine strengths shall be drawn (not in general). The influence of such tuning is something "artificial" which does not exist in a normal chess players practice with an engine. Because he maybe tunes openings
for himself to use, not for the engine!
