I do not think that it is the same test.LudiBuda wrote:What are you talking about? Did you read my post at all?
What I am suggesting is to take Ivanhoe, run the test games against 10 opponents, then put random weight between lets say 0 and 20 for each eval term and run the test again.
Your experiment is bogus. Try doing the same test for the search. Have a brute force search with the state of the art eval and see what you get.
having random weight to evaluation terms still have the relevant evaluation terms in the program.
If your point is that changes in the search helped chess programs in the last years more than changes in the evaluation then I agree about this opinion but I think that the test that you suggest does not prove it.
Note that I disagree with the following claim that you made:
"Evaluation of the engine is of almost no importance for the ELO strength."
I even disagree that changes in the evaluation are of almost no importance for the ELO strength.
My guess is something like the following:
"100 of the last 300 elo that the program gained is thanks to changes in the evaluation and if Komodo keep the same evaluation that it had when it was 300 elo weaker than it could be 200 elo stronger instead of 300 elo stronger"
Of course these are not exact numbers and it is only the idea.
