MEA - Multiple move EPD Analyzer beta interface can be found here.
https://mea.bitballoon.com/
Tony's positional test suite
Moderator: Ras
-
- Posts: 4841
- Joined: Sun Aug 10, 2008 3:15 pm
- Location: Philippines
-
- Posts: 12717
- Joined: Wed Mar 08, 2006 8:57 pm
- Location: Redmond, WA USA
Re: Tony's positional test suite
Ferdy wrote: MEA - Multiple move EPD Analyzer beta interface can be found here.
https://mea.bitballoon.com/
Thanks
Taking ideas is not a vice, it is a virtue. We have another word for this. It is called learning.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.
-
- Posts: 7274
- Joined: Thu Aug 18, 2011 12:04 pm
- Full name: Ed Schröder
Re: Tony's positional test suite
Ferdy wrote: MEA - Multiple move EPD Analyzer beta interface can be found here.
https://mea.bitballoon.com/
Will try.
-
- Posts: 11
- Joined: Sat Jul 22, 2017 2:50 am
- Location: New Zealand
Re: Tony's positional test suite
If anyone has the time to re-calibrate / re-test all sixteen positions with today's engines that would be appreciated.
-
- Posts: 1
- Joined: Thu Oct 10, 2024 6:43 pm
- Full name: William Bryant
Re: Tony's positional test suite
Beginner Question:
I assume you add the score of the move found for each position to get an engine rating.
At what time control?
I understand you searched for 24 hours to get the results, but for the test, are there any specific time controls?
I am returning to chess programming after a long absence.
My old program currently runs on a G4 Mac laptop from 2004 while I work on rewriting it.
I have forgotten a lot and am essentially starting over.
Wm.
-
- Posts: 12717
- Joined: Wed Mar 08, 2006 8:57 pm
- Location: Redmond, WA USA
Re: Tony's positional test suite
The multi-answer type problems give a different score based upon the assumed correctness of the given move.
As programs get stronger and hardware gets more powerful, the old answers (which would have been more or less correct at the time, given hardware and software limitations) can be improved upon. Something that used to be a purely strategic concept, like undermining, king safety, or rooks on the 8th, chosen because no material win was seen on the horizon, becomes tactical because the engines can now see deeply enough to find a tactical shot. It is also possible that other positional factors become more important when combined with the search.
A test set like Tony's positional suite has a number of different possible answers, each scoring differently depending upon what your engine chooses.
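As a rough illustration of how such a multi-answer suite might be tallied, here is a minimal Python sketch. It assumes, as the question earlier in the thread does, that the points for the chosen move in each position are simply summed; the thread does not confirm this. The position ids, moves, point values, and the function name `suite_total` are all hypothetical.

```python
from typing import Dict


def suite_total(points: Dict[str, Dict[str, int]], chosen: Dict[str, str]) -> int:
    """Sum the points earned by the engine's chosen move in each position.

    points: position id -> {move: points awarded for that move}
    chosen: position id -> move the engine actually picked
    A move that is not listed for a position (or a missing choice) earns 0.
    """
    total = 0
    for position_id, move_points in points.items():
        move = chosen.get(position_id)
        total += move_points.get(move, 0)
    return total


# Hypothetical two-position suite, for illustration only.
points = {
    "pos01": {"Nf3": 10, "d4": 7, "c4": 3},
    "pos02": {"Rd1": 10, "h3": 5},
}
chosen = {"pos01": "d4", "pos02": "a3"}

print(suite_total(points, chosen))  # 7 points from pos01, 0 from pos02
```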
To update such a test suite, one might take several strong programs and run them on high-end hardware for four hours per position with MultiPV set to 10, so that the engine is forced to consider many alternatives. Suppose that the best move found by the engines has a score of 429. We divide this best score by 429 (that is, by itself) and multiply by ten, giving a score of ten.
Suppose that the next best solution has a score of 313. We divide 313 by 429 and multiply by 10, giving about 7.3, which rounds to seven, and so on through all moves until the calculated score is less than one. Then on to the next problem.
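The arithmetic above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `normalize_scores` and the move names are made up, the raw scores are taken to be positive engine evaluations for one position, and rounding to the nearest integer is assumed from the 313 -> 7 example.

```python
from typing import Dict


def normalize_scores(raw_scores: Dict[str, float], max_points: int = 10) -> Dict[str, int]:
    """Scale raw engine scores so that the best move gets max_points.

    Moves whose scaled score falls below one are dropped, following the
    "until the calculated score is less than one" rule described above.
    """
    best = max(raw_scores.values())
    if best <= 0:
        # The divide-by-best scheme only makes sense for a positive best score.
        raise ValueError("best score must be positive")
    points = {}
    for move, score in raw_scores.items():
        scaled = score / best * max_points
        if scaled >= 1:
            points[move] = round(scaled)  # assumed: round to nearest integer
    return points


# Scores 429 and 313 are from the post; the moves and the 30 are hypothetical.
print(normalize_scores({"Nf3": 429, "d4": 313, "c4": 30}))
# {'Nf3': 10, 'd4': 7}
```

The third move scales to roughly 0.7, which is below one, so it receives no points and is left out of the result.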