This is why it is far better to directly measure the effect of a certain caharacteristic of a position, by determining it effect on winning rate when it occurs, rather than inirectly, by changing the weight of the evaluation term corresponding to his characteristic, and hope it will make the characteristic occur at a different frequency.Dann Corbit wrote:I think there are many reasons why these changes are hard to measure.
Another obvious one is that the positions that contain the evaluation terms only arise in rare circumstances. In your example of wrong bishop with a rook pawn, how many games will we have to examine before we will find such a game?
You have to be careful, though, to choose positions where the characteristic occurs as a strategic (long term) one. For piece values this means you should exclude tactical positions with a material imbalance, as the true imbalance there is not what you think. Only in quiet positions the material imbalance is a strategic feature.
Similary, giving one side a corner Knight in anotherwise symmetric middle-game position would be pointless if the Knight can move out of the corner in a single move. Ony if the Knight is tied to the corner, e.g. beause it has to keep defending a Pawn that can only be defended in other ways with great difficulty, the cornered Knight becomes a strategic feature, and can be expected to affect the winning rate. (If it was not a strategic featre it wold also not affect the score or move choice a lot, no matter how you evaluate it, as the search would simply makes the feature disapper at the expense of a tempo just before the leaves.)