I guess you can say the same about modern AB searches with certain pruning/Werewolf wrote: ↑Sun Aug 30, 2020 6:03 pmI read the full article you posted which made my head spin, but it was helpful.smatovic wrote: ↑Sat Aug 15, 2020 6:55 pmhttps://www.chessprogramming.org/UCT#Ra ... an_et_al.Werewolf wrote: ↑Sat Aug 15, 2020 6:23 pm With Lc0 a lot of the attention went on the NN and how it was better than a hand-crafted Evaluation Function.
But I'm trying to understand the search side of things.
I have a basic, layman's, working knowledge of alpha-beta search in chess and virtually no real understanding of MCTS, which I think is used in Lc0 and Komodo MCTS?
Can I ask:
1) Where are details of how the search works, specific to chess?
2) What are the pros and cons of MCTS vs AB?
3) Does the Lc0 MCTS play the game out fully or just to a fixed point? If it is fixed, how deep?
I've heard MCTS is less able to handle tactics but I'm not sure why.
MCTS-UCT was used successfully in Go, MCTS-PUCT was used successfully in A0/LC0 with NN Policy/Value head.UCT has been shown to outperform traditional minimax based approaches in several challenging domains such as Go and KriegSpiel, although minimax search still prevails in other domains such as Chess. This work provides insights into the properties of adversarial search spaces that play a key role in the success or failure of UCT and similar sampling-based approaches. We show that certain "early loss" or "shallow trap" configurations, while unlikely in Go, occur surprisingly often in games like Chess (even in grandmaster games). We provide evidence that UCT, unlike minimax search, is unable to identify such traps in Chess and spends a great deal of time exploring much deeper game play than needed.
--
Srdja
So to sum up, Lc0 isn't doing full-game "rollouts", but rather it's building a kindof speculative, highly asymmetric game tree ?
reduction/razoring techniques, the question is why plain MCTS-UCT performs worse
for Chess than AB - it is about the nature of the game, Go compared to Chess, as
MCTS-UCT compared to AB, now with MCTS-PUCT, with P as Predicator, they fixed
one part of MCTS a bit, but imo still not enough to get en par with AB in Chess.
I am not sure how the playouts of Leela in Chess/Go work, but in AB for Chess we
have the QS, Quiescence-Search, to handle the horizon-effect, maybe this part is
still missing in MCTS-PUCT to handle end-game and tactic-sequences better?
--
Srdja