JacquesRW wrote: ↑Fri May 31, 2024 6:33 pm
chessica wrote: ↑Fri May 31, 2024 5:47 pm
JacquesRW wrote: ↑Fri May 31, 2024 4:08 pm
chessica wrote: ↑Fri May 31, 2024 3:24 pm
"works" ---> means that the engine does exactly what it is supposed to do, and does it correctly.
This is the emptiest statement ever made. You can't give an answer to a question like that. What exactly, in your view, is the MCTS engine meant to do?
No, no.
Here:
https://www.chessprogramming.org/Monte- ... ree_Search it says what MCTS should do.
There is no ambiguity.
Pure Monte-Carlo search with parameter T means that for each feasible move T random games are generated. The move with the best average score is played. A game is called “Monte Carlo perfect” when this procedure converges to perfect play for each position, when T goes to infinity. However, with limited time per move, increasing T does not guarantee to find a better move.
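To make the quoted definition concrete, here is a minimal Python sketch of pure Monte-Carlo search with parameter T. It is an illustration only: the toy game (a Nim variant where you take 1 to 3 stones and taking the last stone wins) and the names `legal_moves`, `random_playout` and `pure_monte_carlo` are assumptions made for this example, not taken from the wiki or from any engine.

```python
import random

def legal_moves(stones):
    # Hypothetical toy game (Nim): take 1, 2 or 3 stones; taking the last stone wins.
    return [m for m in (1, 2, 3) if m <= stones]

def random_playout(stones, rng):
    """Play uniformly random moves until the pile is empty.

    Returns +1 if the player to move at `stones` wins, -1 otherwise.
    """
    to_move = 1  # +1 = the player to move at the start of the playout
    while stones > 0:
        stones -= rng.choice(legal_moves(stones))
        if stones == 0:
            return to_move  # whoever just moved took the last stone
        to_move = -to_move
    return -1  # pile was already empty: the player to move has lost

def pure_monte_carlo(stones, T, seed=0):
    """For each legal move, generate T random games and average the results."""
    rng = random.Random(seed)
    averages = {}
    for move in legal_moves(stones):
        total = 0
        for _ in range(T):
            # After our move the opponent is to move, so negate the playout result.
            total += -random_playout(stones - move, rng)
        averages[move] = total / T
    best = max(averages, key=averages.get)  # move with the best average score
    return best, averages

best, averages = pure_monte_carlo(stones=3, T=200)
print(best, averages)
```

With 3 stones, taking all 3 wins immediately, so that move scores +1 in every playout regardless of the random seed. Note that there is no tree here: every playout is random from the first reply onwards, which is the difference from MCTS.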
But how do I check whether the engine is sticking to it?
You might want to read your source there, because that is referring to Monte-Carlo Search (MCS), not MCTS.
There absolutely is ambiguity in what you may mean by "works", because convergence properties have little practical relevance to a chess engine that has a decidedly finite amount of compute available. E.g. you can trivially edit a strong MCTS engine so that it no longer has these convergence properties, without any noticeable impact on playing strength. Would you say the search no longer works then?
The only way for you to verify that an MCTS engine "works" by your definition is to verify the code formally.
Oh, I'm just a user and not a programmer, so I can't check the code.
From the wiki:
Four Phases
MCTS consists of four strategic phases, repeated as long as there is time left [13]:
1.) In the Selection phase the tree is traversed from the root node until it selects a leaf node that is not added to the tree yet.
2.) The Expansion strategy adds the leaf node to the tree.
3.) --->>> The Simulation strategy plays moves in self-play until the end of the game. The result is either 1, 0, or -1.
4.) The Backpropagation strategy propagates the results through the tree.
Regarding 3.), my question was: can you watch the games that are played internally? And can you check whether all possible games were really played?
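As a rough illustration of the four phases quoted above, and of what "watching" the internal games could look like, here is a small Python sketch that records every simulated game in a log. Everything in it is an assumption made for the example: the toy game (Nim: take 1 to 3 stones, taking the last stone wins, so results are only 1 or -1 and never 0), the UCB1 selection rule with constant `c`, and the names `Node`, `simulate`, `mcts` and `playout_log`. It does not describe how any particular chess engine is implemented.

```python
import math
import random

def legal_moves(stones):
    # Hypothetical toy game (Nim): take 1-3 stones; taking the last stone wins.
    return [m for m in (1, 2, 3) if m <= stones]

class Node:
    def __init__(self, stones, parent=None, move=None):
        self.stones = stones
        self.parent = parent
        self.move = move                    # move that led from parent to this node
        self.children = []
        self.untried = legal_moves(stones)  # moves not yet added to the tree
        self.visits = 0
        self.total = 0.0  # summed results, seen by the player who moved INTO this node

def simulate(stones, rng, log):
    """Random playout; +1 if the player to move at `stones` wins, else -1."""
    moves, to_move = [], 1
    result = -1  # empty pile: the player to move has already lost
    while stones > 0:
        m = rng.choice(legal_moves(stones))
        moves.append(m)
        stones -= m
        if stones == 0:
            result = to_move
            break
        to_move = -to_move
    log.append((moves, result))  # keep the game so it can be inspected later
    return result

def mcts(root_stones, iterations, seed=0, c=1.4):
    rng = random.Random(seed)
    root = Node(root_stones)
    playout_log = []
    for _ in range(iterations):
        node = root
        # 1) Selection: follow UCB1 while the node is fully expanded.
        while not node.untried and node.children:
            parent = node
            node = max(parent.children,
                       key=lambda ch: ch.total / ch.visits
                       + c * math.sqrt(math.log(parent.visits) / ch.visits))
        # 2) Expansion: add one new leaf to the tree (if the node is not terminal).
        if node.untried:
            move = node.untried.pop()
            child = Node(node.stones - move, parent=node, move=move)
            node.children.append(child)
            node = child
        # 3) Simulation: one random game from the new leaf, recorded in the log.
        result = simulate(node.stones, rng, playout_log)
        # 4) Backpropagation: flip the sign at each level on the way to the root.
        value = -result  # view of the player who moved into `node`
        while node is not None:
            node.visits += 1
            node.total += value
            value = -value
            node = node.parent
    best = max(root.children, key=lambda ch: ch.visits)
    return best.move, playout_log

best_move, games = mcts(root_stones=3, iterations=200)
print("chosen move:", best_move)
for moves, result in games[:5]:
    print("simulated game:", moves, "result:", result)
```

In this sketch the log holds exactly one simulated game per iteration, so the internal games can be printed and inspected. It also shows that the search samples games: it plays as many as the iteration budget allows and does not enumerate all possible games. Whether a real engine offers any such log depends entirely on that engine.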