I'm working again on my unconventional chess engine and looking for new ways to improve the probability Bstar algorithm.I have tried to eliminate the double expansion of the nodes (real + optimism),so as to unify the two phases of the search.
This thinking has led me to an algorithm very similar to MCTS.
Let us examine this:(I know I've oversimplified all algorithms. This is only to highlight the similarities and essential differences.)
Code: Select all
MCTS algorithm:
===============
Selection step.
from root to a leaf, select the nodes according to heuristic UCT. (Upper Confidence Bound applied to tree).
Play out.
Random or pseudo-random self-play.
expansion.
add node to the tree.
Backpropagation.
Propagating the values back in the tree (negamax or minimax)
Code: Select all
PB* algorithm:
==============
TraceDown step.
from root to a leaf, select the nodes according to heuristic PB*.(alternate best move with best optimism probability).
Expand.
Alpha-beta, add node to the tree.
Backup.
Propagating the values back in the tree (negamax or minimax)
Code: Select all
How to name algorithm ?:
========================
Selection step.
from root to a leaf, select the nodes according to heuristic UCT.
Expand.
Alpha-beta, add node to the tree.
Backup.
Propagating the values back in the tree (negamax or minimax)
What name should have a MCTS algorithm with the "random play out" replaced by an "alpha-beta"?
Without randomness it looks like it no longer deserve the words "Monte Carlo".
regards, Antonio.