Well, as I demonstrated there will not be any local minima other than the global minimum, if the test set is not obviously flawed by failing to provide WDL info for some large range of scores.
Some examples: running a simplistic optimizer (just randomly increasing / decreasing the eval parameters by 1cP, and trying that up to 1000 times to get a better MSE) on a synthetic test set of random positions, for various sizes of the set, fitting the 5 piece values, gave the following results:
Code: Select all
100000, start at 0: 92 304 327 468 889 5396.180698
100000, start at exact: 93 305 328 470 892 5396.172637
10000, start at 0: 91 303 328 486 897 528.024707
10000, start at exact: 92 304 329 487 900 528.024649
1000, start at 0: 76 312 339 499 895 51.970099
1000, start at exact: 77 313 340 501 899 51.969961
It doesn't matter whether I start all piece values at 0, or whether I start with the values that were used to generate the results for the set (100/325/350/500/950). It always converges to the same minimum. That the best values are different for the different sets is just an artifact of the particular set, and the probability for accidentally implying something weird in a set diminishes as the set gets larger.
Interestingly, it thinks it can improve on the original values, even in asymptotically large sets. This is caused by the (simulated) positional term that helped determine the game results (but could not be fitted by the evaluation that was tuned). Since I used a symmetric distribution for that term, it has the effect of shifting the result towards a draw. Because if the material advantage is already large, getting some extra positional advantage on top of it improves the average result less than an equal positional disadvantage would reduce it. (The positional term I applied could ranges from -330 to +330 cP, but would be between -20 and +20 cP in 50% of the cases.)
Anyway, it seems that this talk about local minima is just a red herring, unless you would be fitting highly non-linear terms such as are sometimes used in calculating King safety. And it remains to be seen whether any of the local minima would be any good at all, both MSE and Elo-wise.