60 games Komodo 5 against Top4 at 120m+3s

Houdini · Post by **Houdini** » Tue Jul 24, 2012 12:46 am

Sedat Canbaz wrote:And in the previous match,we noticed not very satisfied results by the performance of Komodo 5 (against Houdini 2.0c)

Komodo had a bad start against all 4 engines (Critter, Stockfish, Rybka and Houdini).
Komodo's poor performance was most likely just a string of bad luck. Things will even out with more games - and in fact they already have partially.

There is no need for attributing this to opening choice, to the time control, to the sudden death, or to anything else.
Just plain variability of engine matches will do.

Robert

lkaufman · Post by **lkaufman** » Tue Jul 24, 2012 12:55 am

Quite frankly, we are puzzled. Our own distributed tester (mostly independent testers) show Komodo 5 to be 15 elo ahead of Houdini 1.5, after many thousand games. None of the testing agencies have confirmed this though, and we need to find out why. Perhaps it's something about the opening books, or the time controls, or AMD. Only by putting forth these various hypothesis can we expect to find out which is correct. Once we do, we can either
1. modify our tester to better predict what others will get.
or
2. Modify Komodo to fix whatever the problem may be, if it is easily fixable.

or both.

TimoK · Post by **TimoK** » Tue Jul 24, 2012 12:58 am

And this an interesting game with a very rare constellation: black queen on h8 forces white King on b1 to move his queen to a1! Wow, never seen anything like that.

Great game: After the opening (that gave good chances to both sides) first Houdini played more actively and gathered its pieces for attack. But Komodo defended ingenious and moved its pieces for a strong counter-attack. Best move of this game for me was:

[d]3r2k1/2r2pp1/3b4/3P1Q1q/1pB4P/pP6/P1P2R2/1K5R b - - 0 33
33. ..Qh8!

Enjoy the game:

[Event "(Kom5 vs. Hou2)-Match with Perfect 2012"]
[Site "?"]
[Date "2012.07.23"]
[Round "8.1"]
[White "Houdini 2.0c Pro x64"]
[Black "Komodo 5 64-bit"]
[Result "0-1"]
[ECO "B90"]
[Annotator "0.14;0.14"]
[PlyCount "162"]
[EventDate "2012.07.22"]
[TimeControl "7200+3"]

{AMD Phenom(tm) II X4 910e Processor 2999 MHz W=26.2 plies; 1.806kN/s; 1.124
TBAs; Houdini.ctg B=22.4 plies; 1.270kN/s; Komodo.ctg} 1. Nc3 {0.00/0 0} c5 {
0.00/0 0} 2. e4 {0.00/0 0} d6 {0.00/0 0} 3. Nf3 {0.00/0 0} Nf6 {0.00/0 0} 4. d4
{0.00/0 0} cxd4 {0.00/0 0} 5. Nxd4 {0.00/0 0} a6 {0.00/0 0} 6. f3 {0.00/0 0} e5
{0.00/0 0} 7. Nb3 {0.00/0 0} Be6 {0.00/0 0} 8. Be3 {0.00/0 0} Nbd7 {0.00/0 0}
9. Qd2 {0.14/23 189} b5 {0.14/23 95 (Be7)} 10. O-O-O {0.05/22 144} Qc7 {0.14/
24 96 (Be7)} 11. g4 {0.25/22 146} h6 {0.26/25 438} 12. Nd5 {0.15/24 320 (a3)}
Bxd5 {0.25/22 29} 13. exd5 {0.21/22 17} Be7 {0.24/22 17} 14. Kb1 {0.20/23 158
(a3)} Nb6 {0.24/21 62} 15. Bxb6 {0.23/22 0} Qxb6 {0.21/21 108} 16. h4 {0.23/23
102 (a3)} a5 {0.26/22 238 (Ra7)} 17. Qe2 {0.21/23 292 (a3)} Rb8 {0.33/22 124}
18. f4 {0.21/23 86} a4 {0.17/22 14} 19. fxe5 {0.27/22 154} dxe5 {0.23/23 240}
20. Nd2 {0.26/24 0 (Nc1)} O-O {0.28/22 314} 21. Ne4 {0.16/23 112 (Bg2)} Rfd8 {
0.29/22 180} 22. Nxf6+ {0.29/22 183} Qxf6 {0.28/23 1} 23. g5 {0.28/22 128} Qd6
{0.36/24 505} 24. gxh6 {0.24/23 0} Qxh6 {0.41/20 35} 25. Qxe5 {0.24/23 94} Bf6
{0.40/23 196} 26. Qg3 {0.30/22 0 (Qf5)} Qh5 {0.22/24 306 (b4)} 27. Rd2 {0.29/
21 352 (Bg2)} b4 {0.23/22 150 (Re8)} 28. Bc4 {0.39/21 120 (Be2)} Rbc8 {0.18/23
111 (Rb6)} 29. Qf4 {0.35/21 106} Be5 {0.12/22 17} 30. Qe4 {0.35/21 0} a3 {0.12/
22 530 (Bf6)} 31. b3 {0.31/22 110} Bd6 {0.12/24 63} 32. Rf2 {0.27/24 58 (Re2)}
Rc7 {0.12/23 81 (Re8)} 33. Qf5 {0.27/23 129} Qh8 {0.05/25 51 (Qxf5)} 34. h5 {
0.15/23 158} Re8 {0.01/25 291 (Re7)} 35. Qg5 {-0.02/21 167 (Qf3)} Rce7 {-0.36/
22 105} 36. Rff1 {-0.18/21 22} Re5 {-0.39/23 141} 37. Qf4 {-0.41/23 520} R8e7 {
-0.62/25 203} 38. Rc1 {-0.36/23 139 (Rd1)} g6 {-0.55/25 83} 39. Qd4 {-0.48/24
54} Bc5 {-0.81/24 177 (Rxh5)} 40. Qa1 {-0.45/24 188} Qh6 {-0.74/26 191 (Rxh5)}
41. Rhf1 {-0.36/20 99 (Rcd1)} Qxh5 {-1.32/19 17 (Rxh5)} 42. d6 {-1.03/22 183}
Rd7 {-1.41/26 85} 43. Rfe1 {-1.10/22 37} Rxe1 {-1.46/22 2} 44. Rxe1 {-0.99/22 0
} Bxd6 {-1.53/21 12} 45. Qd4 {-1.00/23 51} Qh2 {-1.52/24 28} 46. Qd3 {-1.06/23
81 (Rd1)} Kf8 {-1.48/24 76} 47. Qd5 {-1.12/23 0 (Qd4)} Qh3 {-1.97/22 43 (Qg3)}
48. Bd3 {-1.24/23 105} Qg3 {-2.46/24 35 (Bg3)} 49. Qh1 {-1.76/22 87 (Qa8+)} Be5
{-2.54/22 45} 50. Qh6+ {-1.88/23 26} Kg8 {-2.76/27 102} 51. Qe3 {-2.46/26 246}
Qxe3 {-2.89/31 80} 52. Rxe3 {-2.22/26 0} Bc3 {-2.90/26 23} 53. Kc1 {-2.19/26
57 (Re8+)} f5 {-3.06/24 13} 54. Kd1 {-2.30/26 133} Rh7 {-3.06/29 35} 55. Bf1 {
-2.30/26 13} Rh1 {-3.07/26 14} 56. Rf3 {-2.50/26 99} Rh2 {-3.39/27 33} 57. Rg3
{-2.66/26 49} Kg7 {-3.44/28 10 (Kf7)} 58. Rg2 {-3.71/25 311} Rh8 {-3.76/32 100
(Rh1)} 59. Rg1 {-3.02/22 70 (Rf2)} Kf6 {-4.15/25 16} 60. Bd3 {-3.36/23 31} Re8
{-4.31/28 9 (Rh2)} 61. Rf1 {-3.31/23 46} Re5 {-6.72/25 18 (Re3)} 62. Be2 {-3.
37/27 38 (Bc4)} g5 {-7.70/22 13} 63. Bf3 {-5.28/25 217 (Bh5)} Ke6 {-6.79/22 19
(Kg6)} 64. Rh1 {-5.28/22 38 (Bh5)} g4 {-8.89/25 15 (Re3)} 65. Be2 {-5.98/26
189 (Bb7)} Kf6 {-12.35/21 14 (Re3)} 66. Rg1 {-6.41/24 23 (Rf1)} Re3 {-12.61/23
28} 67. Rf1 {-8.25/25 127} Kg5 {-14.85/23 6 (g3)} 68. Rh1 {-9.24/22 21 (Rg1)}
f4 {-#18/19 73 (g3)} 69. Rg1 {-20.06/23 131} f3 {-#15/20 210 (g3)} 70. Bxf3 {
-20.79/19 7} Rxf3 {-#12/20 10} 71. Ke2 {-#11/23 21} Kf4 {-#11/20 65} 72. Rd1 {
-#10/35 0} g3 {-#10/17 6} 73. Rd8 {-#9/35 7 (Rd7)} Rf2+ {-#9/15 5} 74. Kd3 {
-#8/36 6} Rd2+ {-#8/15 9} 75. Kc4 {-#7/40 0} Rxd8 {-#7/16 16} 76. Kc5 {-#6/57
0 (Kb5)} g2 {-#6/15 9} 77. Kb5 {-#5/75 1 (Kc6)} g1=Q {-#5/13 4} 78. Kc6 {-#4/
75 1} Qg6+ {-#4/13 7 (Qg7)} 79. Kc5 {-#3/16 1} Rc8+ {-#3/12 4 (Rd1)} 80. Kb5 {
-#2/14 0} Qc6+ {-#2/13 9} 81. Ka5 {-#1/75 2} Ra8# {-#1/14 8} 0-1

Computer chess at its best!

Best regards
Timo

Sedat Canbaz · Post by **Sedat Canbaz** » Tue Jul 24, 2012 1:13 am

Houdini wrote:
Sedat Canbaz wrote:And in the previous match,we noticed not very satisfied results by the performance of Komodo 5 (against Houdini 2.0c)
Komodo had a bad start against all 4 engines (Critter, Stockfish, Rybka and Houdini).

As far as i know, in the previous matches too...Komodo used different openings (Non-Perfect 2012 book)

Houdini wrote: There is no need for attributing this to opening choice, to the time control, to the sudden death, or to anything else.

Robert

Its too early for any conclusions...

But however i strongly believe in that:
-Any Engine Elo performance is highly depending on what kind of opening book usage

Btw, i hope Timo will repeat the same matches (by using Perfect 2012 ) of Komodo 5 against Critter, Stockfish, Rybka

And then i wonder a lot about what will be new performance of Komodo 5

Best,
Sedat

Houdini · Post by **Houdini** » Tue Jul 24, 2012 1:21 am

Sedat Canbaz wrote:Its too early for any conclusions...

Exactly. That's why it's surprising that you already attributed the poor results to the opening positions ("I think Komodo's performance suffers due to critical openings (especially with Blacks)"), instead of simply saying: "bad luck, but results should improve".

Sedat Canbaz · Post by **Sedat Canbaz** » Tue Jul 24, 2012 1:34 am

Houdini wrote:
Sedat Canbaz wrote:Its too early for any conclusions...
Exactly. That's why it's surprising that you already attributed the poor results to the opening positions ("I think Komodo's performance suffers due to critical openings (especially with Blacks)"), instead of simply saying: "bad luck, but results should improve".

Dear Robert,

No...its not bad luck !

I still believe that Komodo 5's performance was not so good, due to Komodo is played with different openings

Really i dont want to critic Jeroen's work...

Once more i'd like to mention that Jeroen Noomen is really GREAT Master in Book Making

But anyway, it seems Komodo 5's playing strenght suffers (under these conditions) with Noomen Testsuite 2012's openings

Best Regards,
Sedat

Houdini · Post by **Houdini** » Tue Jul 24, 2012 2:00 am

Thanks for the clarifications, I now understand that you really suggest that Komodo doesn't play very well the positions from the Noomen Suite.
Let's wait and see what happens in the rest of the 4 matches.

Sedat Canbaz · Post by **Sedat Canbaz** » Tue Jul 24, 2012 2:30 am

Houdini wrote:Thanks for the clarifications, I now understand that you really suggest that Komodo doesn't play very well the positions from the Noomen Suite.
Let's wait and see what happens in the rest of the 4 matches.

Not at all...

Of course without final results...we can not be sure 100 % that Komodo will do better with Perfect 2012 book

Actually i have much more improved version-Perfect 2012b book, but i think its now too late to resume the current match

Yes,lets wait and see...the time will tell

Best,
Sedat

Rebel · Post by **Rebel** » Tue Jul 24, 2012 8:44 am

lkaufman wrote:Quite frankly, we are puzzled. Our own distributed tester (mostly independent testers) show Komodo 5 to be 15 elo ahead of Houdini 1.5, after many thousand games.

You know how it is and when the randomness monster strikes then it is as Robert said, your turn of being the unlucky one.

Just recently I had such an unlucky experience, an extreme one I must say as I (till now) haven't seen before. I removed a part from King Safety in order to measure its impact either good or bad. And the removal gave a 15 elo improvement after 4000 games. I could not believe it and started testing the removal again but now changing the eval values of King Safety with +1% and -1%, 2 peanuts changes with hardly any effect elo wise. 2 x 4000 games again and both matches showed the wrong of the initial change, instead of a 52.x% score I then got the opposite: two 47-48% results. And thus I almost removed well working code.

Sedat Canbaz · Post by **Sedat Canbaz** » Tue Jul 24, 2012 10:48 am

Rebel wrote:
lkaufman wrote:Quite frankly, we are puzzled. Our own distributed tester (mostly independent testers) show Komodo 5 to be 15 elo ahead of Houdini 1.5, after many thousand games.
You know how it is and when the randomness monster strikes then it is as Robert said, your turn of being the unlucky one.

Just recently I had such an unlucky experience, an extreme one I must say as I (till now) haven't seen before. I removed a part from King Safety in order to measure its impact either good or bad. And the removal gave a 15 elo improvement after 4000 games. I could not believe it and started testing the removal again but now changing the eval values of King Safety with +1% and -1%, 2 peanuts changes with hardly any effect elo wise. 2 x 4000 games again and both matches showed the wrong of the initial change, instead of a 52.x% score I then got the opposite: two 47-48% results. And thus I almost removed well working code.

Hello dear Ed,

Yes...i am not a programmer and you can be right that with changing some values...your engine can start to play with 15 elo improvement

And from my experience i can say,
Depending on what kind of opening usage,any Engine can be performed between 0-200 Elo difference or sometimes even more

Btw (about the current match/opening issue),
Timo changed only the openings and now we see completely different results by Komodo 5:
http://www.team-oh.de/livegames6/

One thing more,
There is no guarantee to see same Komodo 5's results e.g under different conditions (i mean if we change the hardware,the time control...)

Greetings,
Sedat

60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s

Re: 60 games Komodo 5 against Top4 at 120m+3s