Before doing so I first wanted to quantify in the ELO metric the contribution of regular delta pruning. I didn't have this information yet because in general I don't use game playing to verify search changes but test sets. So I set up a match, 6000 opening positions (my own set) played from both sides, 120+1 time control, single core and no pondering. The result is less than spectacular:
Code: Select all
Rank Name Elo + - games score oppo. draws
1 rookie3.3 3 5 5 12000 51% -3 45% <-- delta pruning
2 rookie3.3.1 -3 5 5 12000 49% 3 45% <-- no delta pruningBut before I shelve the idea I would like to know if this is indeed what I can expect from delta pruning, or if others have observed a larger difference. (There is one implementation detail related to lazy evaluation that might be a troublemaker in my case.)
Any feedback is appreciated.