Code: Select all
300 positions, 5 seconds per position
Stockfish 281, Arasan 287, Komodo 298, Laser 294 on Dell Latitude 3450 4 GB RAM.
Stockfish 286, Arasan 293, Komodo 298, Laser 298 on Dell XPS 8960.Moderator: Ras
Code: Select all
300 positions, 5 seconds per position
Stockfish 281, Arasan 287, Komodo 298, Laser 294 on Dell Latitude 3450 4 GB RAM.
Stockfish 286, Arasan 293, Komodo 298, Laser 298 on Dell XPS 8960.Which doesn't mean that there isn't one of them better than all the other ones, MultiPV=2:
Yes, as well as any other GUI using .epd- strings with its regular syntax, as for .pgn, alternative solutions imported from such .epd- strings then become uncommented variants instead, also treated as equal solutions by GUIs adjudicating .pgn (Fritz- .cbh-) suites automatically.
Classical still is Eret, Arasan, even if both have got to be used with short to very short hardware- TC nowadays too, and even e.g. HTC and ACT needs much less time/pos. now then some years ago at being collected then. So tools to get more discrimination out of higher numbers of solutions like EloStatTS and MEA help a lot as for error bars of results, latter (MEA) lets suites of positions with multiple solution get useable too, regardsAlso, what are some other test suites with unique best moves and not just elementary puzzles? I have found github repositories with epd files, but it is unclear which ones have been updated more recently using stronger engines and longer time limits.
By MEA do you mean the program by Ferdy from https://github.com/fsmosca/Multiple-move-Epd-Analyzer? I tried using it once, but got the following error:peter wrote: ↑Tue Sep 17, 2024 11:30 am Classical still is Eret, Arasan, even if both have got to be used with short to very short hardware- TC nowadays too, and even e.g. HTC and ACT needs much less time/pos. now then some years ago at being collected then. So tools to get more discrimination out of higher numbers of solutions like EloStatTS and MEA help a lot as for error bars of results, latter (MEA) lets suites of positions with multiple solution get useable too, regards
Code: Select all
python3 mea.py -e /usr/local/games/sf17 -n "Stockfish 17" -i eret.epd -m 512 -a 10000 -t 2 -p uci
Problem reading c0 field in epd: r1bqk1r1/1p1p1n2/p1n2pN1/2p1b2Q/2P1Pp2/1PN5/PB4PP/R4RK1 w q - - bm Rxf4; id "ERET 001 - Relief";
This position is not included.
For MEA you need special syntax of .epd, the position of yours should look like this e.g.:chesskobra wrote: ↑Tue Sep 17, 2024 11:52 am By MEA do you mean the program by Ferdy from https://github.com/fsmosca/Multiple-move-Epd-Analyzer? I tried using it once, but got the following error:
But I would like a script like that, and plan to test it more.Code: Select all
python3 mea.py -e /usr/local/games/sf17 -n "Stockfish 17" -i eret.epd -m 512 -a 10000 -t 2 -p uci Problem reading c0 field in epd: r1bqk1r1/1p1p1n2/p1n2pN1/2p1b2Q/2P1Pp2/1PN5/PB4PP/R4RK1 w q - - bm Rxf4; id "ERET 001 - Relief"; This position is not included.
Pity I didn't notice in edit- time having missed the x at "Rxf4=75" for the rewarding- points, at bm it's ok, but Rf4=75 as mistyped then doesn't work correctly, even if you won't get any error message, points for correct solution just won't be counted for this one move.
Thank you for explaining. How are the numbers corresponding to different moves obtained? Is it by normalizing the evaluation of the top move to 100 and adjusting the evaluations of other moves in proportion?peter wrote: ↑Tue Sep 17, 2024 12:43 pm
1kr5/3n4/q3p2p/p2n2p1/PppB1P2/5BP1/1P2Q2P/3R2K1 w - - bm f5; id "STS(v1.0) Undermine.001"; c0 "f5=100, Bf2=68, fxg5=46, b3=39, Bg7=32, Bg4=22, Kh1=11, Be3=8, Bxd5=6, h3=5"; c7 "f5 Bf2 fxg5 b3 Bg7 Bg4 Kh1 Be3 Bxd5 h3"; c8 "100 68 46 39 32 22 11 8 6 5"; c9 "f4f5 d4f2 f4g5 b2b3 d4g7 f3g4 g1h1 d4e3 f3d5 h2h3";
Notice, the points here are meant for much shorter hardware- TC than I use to use for tactical single best move- positions normally. MEA- STS (up to 1500 positions) is run with e.g. 100msec/pos. by Schröder and Mosca.