amchess wrote: ↑Fri Sep 02, 2022 10:49 am
My idea is to test not only tactical positions (in terms of Shashin, Tal or Petrosian), but also strategic ones (Capablanca).
For this, I classified them.
So, in the case of your example, simply I have to correct in
bm Qd2 Qc1 (the moves with the max score)
Can you help me in finding all strategic positions of this type and do this type of correction?
You're welcome, Andrea.
As a matter of fact, I'm already working at at revisited version of good old STS by myself too, but problem is, even the points given by comments there at very many of the positions don't fit modern engines' evals anymore neither.
As for the first one position in question, I'd have .cbh- format only for using it, giving togehter with Qf2 the three other Queeen- moves too but with commenting these moves as sidelines with = als prefix (not a subsequent = as a move comment of other meaning in that GUI) of each of these about equivalent moves, (in Fritz- GUI by "RR-equivalent is") which would make them being counted in automatic suite as "solved" too.
Yet 3 is the utmost number of moves to be counted at all for my personal trials with "STS-revised" so far, and the postions must not be too easy to be solved at all, the more equal candidates makes positions even more easily solved in most cases, I hope to get a suite of about 1000 positions out of the 1500 of old STS in .cbh format to be used finally as a modern positional test suite only again.
That will take much hardware- time and manpower- time still, I've just started lately, maybe I'll make it till end of that year working on it on my own only so far
As for a tactical single best move the position in question simply is too easy as for same hardware- time of difficult tactical positions, I'd say.
I'm planning a positional suite (to be solved more out of "static eval" of engines than out of search, the way the plan of STS was way back then too) to run about 1-3 sinlge seconds/positions SMP only, tactical suites at least with 15", so I don't think, that should work in one suite together statistically meaingful.
But with Frank Schubert's great program EloStatTS you can let some different runs of different suites with different hardware- TC be evaluated together in one ranking- and rating- list too, pity suites have to be run in .cbh- format at all, other GUIs don't work with EloStatTS that way.
But you can have a look at such a list here e.g.
forum3/viewtopic.php?p=932939#p932939
Will come back here, when I've finished a first seeing- through of your suite with some more results of my own next days.
We can exchange results and positions of each others together by email of course too, if you like to.
Edit: had already first 3 runs of your suite at 30"/position, 30 threads of 16x3.5GHz CPU, 4G hash and 6men Syzygys, MultiPV=4 for all of these 3 SF- branches I had in the list of given link above already too, same setting for ShashChess (GoldDigger and MCTS single thread on, all other options default, Persited learning off too of course)
Code: Select all
Program Elo +/- Matches Score Av.Op. S.Pos. MST1 MST2 RIndex
1 CorChess3300522-Tactical-MV4 : 3508 15 364 51.7 % 3496 159/258 5.0s 14.6s 0.69
2 ShashChess24-GoldDigger-MV4 : 3503 15 376 50.6 % 3499 162/258 6.6s 15.3s 0.66
3 BlueMarlin15.3-avx2-MV4 : 3489 16 368 47.7 % 3505 148/258 6.1s 16.3s 0.67
MST1 : Mean solution time (solved positions only)
MST2 : Mean solution time (solved and unsolved positions)
RIndex: Score according to solution time ranking for each position
Best regards
Peter.