felix wrote:Hi Isaak
The result is ok. Critter is much better than Protector.
BUT seeing your parameters your resign = 400 is way too strict.
Why don't you try with Arena to tune your parameters? This way you can see how the engines evaluate and enjoy the games.
Some time ago i used resign = -650 but surprise! some games with that score were draw and was a bad adjudication!! (Some engines tends to be over optimistic or over pesimistic)
Now i am again on resign = -900 and everything is better
Ola Félix,
I am not so sure that the result is ok, because of the "short time control" CCRL rating list: Protector 1.5.0 has a rating of 3090 and critter has a rating of 3228. The difference is less than 150 bayeselo. Hmm I don't know how this would translate in elo...
And I am using a stronger version than the "1.5.0" so I'd expect a closer rating difference. But I get 279 elo difference!
Resign at 400 cp means that both engines must evaluate the position as at least +4 for 3 consecutive moves. Of course sometimes they are both wrong and the game is wrongly adjudicated as a win instead of a draw, but this is rare and overall should not modify the rating difference noticeably. For instance, they use the exact same criteria for adjudication in the fishtest (for Stockfish improvement) and the system seems to work fine. Out of the 45 games that I've uploaded played in that match, I am sure you will find 0 of such wrongly adjudicated games.
And why I don't use Arena? Because I've read that it's not good to test engine vs engine, especially at fast time controls (my time control is very fast), because engines can lose on time and there are other problems as well I believe.
I've heard that cutechess is the best way to test engine vs engine. I think they use cutechess also in the TCEC tournament.
I can still enjoy to replay the games once they are finished, although I don't see what the engines are thinking. (I watch the TCEC not to get bored

).