Thanks for your interest and comments regarding my tests.
Compared to my expectations before starting these tests I find the following Top 3 to be the most surprising (positive or negative) so far:
1. Spike turned out to be the weakest engine in the gambittests and more than 100 ratingpoints below the expected (compared to CEGT 40/4).
2. I have always considered Naum to be a solid, positional engine but it turned out to be considerable stronger in the gambits.
3. The same can be said about Rybka. It made (as expected) a strong performance in the positional games but Rybka was even stronger in the gambits. It should also be mentioned that Deep Fritz 10 performed very well in both testsets.
Just like you I wonder why Spike did so bad in the gambitgames. I have tested this fine engine for a long time (when I was testing for the CSS SMP Ratinglist) and I never noticed any tactical weaknesses. You could be right Mike when you suggest that engines like Spike have "problems to evaluate and/or use "dynamical" compensation for material".
I would like to present my testresults in the CSS Forum as well. However my german isn't too good (I have made some posts in this forum but only rather short ones). I would be grateful if you would make a post in the CSS Forum, just a short summary one (or a longer if you wish) and with a link to this thread. Das wäre toll
Best regards
Per

