60 games Komodo 5 against Top4 at 120m+3s

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

lucasart
Posts: 3241
Joined: Mon May 31, 2010 1:29 pm
Full name: lucasart

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by lucasart »

Sedat Canbaz wrote:
Laskos wrote:
Sedat Canbaz wrote:One thing more (for those who does no agree with me),
I just wonder, what is wrong with the current my below statement ???

We need more games...,but as i expected,Komodo is started to show its real power


Really i dont want to loose more time over this issue...

Best,
Sedat
You said that based on the score 3.5:2.5 between two engines separated by no more than 50 Elo points.
I never said that 10 games are enough data or even 300-500 games are not enough data to show the engines real strenght

And i strongly believe in that:
-Minimum 1.000 games per player is required for reliable rating
After I gave you examples of what to do with these numbers, you are coming again with some pretty silly statements.

Kai

Its ok...my statements can be silly for you,no problem !

As far as i remember,(in the past) you was also against my views...it seems you did not changed a lot

And i strongly believe that Perfect 2012 book series are very well optimized, safe and i recommend to be used by Top Chess Programs

Actually i was just trying to explain that WE NEED MORE GAMES and the openings can be serious factor of any engine playing strenght

If you still have different view than mine,then i challenge you to register in Testo Third Book League

Who knows, maybe you are very good Book Maker...and one day i can see you in SCCT Super League Tournament


Best,
Sedat
Sedat,

I'm sure your book is good, and you put a lot of effort into it. I personally use it, as well as the Stockfish book from Silvo Spitaleri. As for your previous disagreements with Kari, I don't know (or care).

But what Kari is trying to say is that it would really be beneficial to you testers to have some background in probability and statistics to avoid drawing hasty conclusions.

Let X(i) be the random variable equal to the score of engine A versus B in the i-th game (values 0, 0.5 or 1). And let S(n)=(X(1)+...X(n))/n.

Let's assume that both engines are equal, in other words E(X(i))=0.5. If you stop the experiment whenever you see a value S(i)>0.5, you're in for some serious problems... It can be shown that the probability of the event {there exists an i such that S(i)>0.5} is 1. In other words you're almost surely going to conclude that A is better (the same could be done to conclude that B is better). Intuitively an unbiaised random walk will almost surely cross the X-axis (in fact it will aolmost surely cross it an infinite number of times).

There are different problems:

1/ fixed sample testing: you decide the number of games to be played before the experiment and tou stick to it no matter what scores you see! This problem is simple, as the central limit theorem provides a good approximation of the confidence interval at the end of the experiment, providing N is "sufficiently large"

2/ You play on, and only decide to stop by following a predetermined stopping algorithm. This problem is far more complicated, and as far as I know the most efficient and practical stopping rule is the sequential Wald test (or Sequential Probability Ratio Test). Even this test however has type I and type II error, so it shouldn't be used unless you understand fully how it works. There are also algorithms w/o type II error. I remember the empirical bernstein stopping rule for example. But those take considerably longer to terminate (they terminate almost surely if E(X)!=0.5)

I suggest you only focus on 1/, as 2/ requires some deeper knowledge and understanding of probabiity/statistics. But you have to follow the recipie rigourously, or you'll end up making erroneous conclusions.

Let's say (in the context of 1/) you're doing N=10 games (you decide on that and stick to it no matter what the intermediate scores are). Then a score of 9-1 is significant under 95% confidence level. If we remove the possibility of draws to simplify the problem, this can be shown by an elementary calculation (Binomial law, a classic from high school that everyone should know). It can also be shown in the real world by reducing the problem to one dimension and using a formula where P(win) and P(draw) are functions of E(X). Remi Coulom made an estimation of such a function for Bayeselo.

**But** do not go and think now that everytime you see 9-1 you ca nstop the experiment. If you decided on N=100 at the beginning, and you see 9-1, you shouldn't stop! With N=100, and stopping anytime you see that A has 8 points more than B, for example, would lead to a very high risk of error (again the error probability tends to 1 if N goes to infinity).

No rule can be given in terms of absolute number of games either. If you use N=100, and you see 501-499, no conclusion can be drawn, obviously. Whilst if you see 600-400, then you can easily say that A is stronger than B.

In the context of 2/, the stopping time will on average trigger earlier as E(X) is far from 0.5 (and will almost surely not terminate if E(X)=0.5). So even in the context of 2/, you cannot say that even 1000 games is enough. Maybe you reached such a score that 500 was enough, and maybe the stopping rule will only be triggered after 20,000 games...

Anyway, all that matters is understanding what you do, and never following rules, or what people say (especially me).
Sedat Canbaz
Posts: 3018
Joined: Thu Mar 09, 2006 11:58 am
Location: Antalya/Turkey

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by Sedat Canbaz »

My finals words over this issue

First of all,
Many Thanks to Timo Klaustermeyer,who kindly accepted my demand of testing/canceling my Perfect 2012 book

Second,
I am not surprised,even i am already accustomed about my ComputerChess activities to be always criticized
Even,in the latest comments,i noticed that some chess friends used words 'silly' for my activities/knowledge
Its ok...i will take them as complement or as critics...but however i can be wrong,so i want to say sorry,if anybody is offended
Its true also that the name of my books is 'Perfect', but unfortunately my work is NOT 'Perfect',still i need to improve and learn many things ...

Third,
If,Komodo 5's performance was not so good in the previous matches,it does not mean that Jeroen Noomen is responsible
Still i believe in that (for Timo test's conditions):
-Probably the current used Jeroen's openings do not suit a lot to Komodo 5's playing style
Plus only with 21 games (played by Perfect 2012 book),it does not mean that my openings are better than Jeroen's openings
Once more i'd like to mention that i feel Jeroen Noomen like my teacher-he is one of the Greatest Book Makers
I suggest: the new beginners in Book Making should take as example and ask for a help from Jeroen Noomen's ideas and knowledge!

Finely,
Its interesting to note :
- Over more 10 years,some people never use the name 'Sedat' or 'SCCT',e.g some people never use 'Perfect books' and i really wonder why ???
I just need your feedback,in this way i can improve my work better....thanks in advance
And there is one true:
- I need to drop my ComputerChess activities,otherwise those critics will be never ended over my work !


Best Regards,
Sedat Canbaz
lkaufman
Posts: 6258
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by lkaufman »

Sedat Canbaz wrote:My finals words over this issue

First of all,
Many Thanks to Timo Klaustermeyer,who kindly accepted my demand of testing/canceling my Perfect 2012 book

Second,
I am not surprised,even i am already accustomed about my ComputerChess activities to be always criticized
Even,in the latest comments,i noticed that some chess friends used words 'silly' for my activities/knowledge
Its ok...i will take them as complement or as critics...but however i can be wrong,so i want to say sorry,if anybody is offended
Its true also that the name of my books is 'Perfect', but unfortunately my work is NOT 'Perfect',still i need to improve and learn many things ...

Third,
If,Komodo 5's performance was not so good in the previous matches,it does not mean that Jeroen Noomen is responsible
Still i believe in that (for Timo test's conditions):
-Probably the current used Jeroen's openings do not suit a lot to Komodo 5's playing style
Plus only with 21 games (played by Perfect 2012 book),it does not mean that my openings are better than Jeroen's openings
Once more i'd like to mention that i feel Jeroen Noomen like my teacher-he is one of the Greatest Book Makers
I suggest: the new beginners in Book Making should take as example and ask for a help from Jeroen Noomen's ideas and knowledge!

Finely,
Its interesting to note :
- Over more 10 years,some people never use the name 'Sedat' or 'SCCT',e.g some people never use 'Perfect books' and i really wonder why ???
I just need your feedback,in this way i can improve my work better....thanks in advance
And there is one true:
- I need to drop my ComputerChess activities,otherwise those critics will be never ended over my work !


Best Regards,
Sedat Canbaz
I hope you continue your work. Your testing hasn't been relevant for us since we haven't yet released MP, but when we do we will of course hope to do well in your testing.
In general people who disagree with you are more likely to comment than those who agree, so don't think the comments are a representative sample.
The one thing I don't understand about your work is why you feel that it is important for reversal testing to use a "perfect" book in terms of the normal White advantage. Obviously a very good book is vital for competitive play, but for engine testing we need huge samples to overcome the error bars, and if the price we must pay is using somewhat imperfect books, that is a necessary price in my opinion.

Best regards,
Larry
User avatar
Houdini
Posts: 1471
Joined: Tue Mar 16, 2010 12:00 am

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by Houdini »

Game 44 was a complete crash by Houdini.

The opening position after 1. d4 Nf6 2. c4 e6 3. Nf3 b6 4. g3 Ba6 5. Qa4 Be7 6. Bg2 c6 7. Nc3 O-O 8. O-O d5 9. Ne5 Qe8:

[d] rn2qrk1/p3bppp/bpp1pn2/3pN3/Q1PP4/2N3P1/PP2PPBP/R1B2RK1 w - - 2 10
Houdini is White and plays 10.b3 (?!) and 11.a3?? and is lost. The white queen is trapped.

Code: Select all

[Event "Komodo5-Houdini"]
[Site "?"]
[Date "2012.07.25"]
[Round "44.1"]
[White "Houdini 2.0c Pro x64"]
[Black "Komodo 5 64-bit"]
[Result "0-1"]
[ECO "E15"]
[Annotator "0.14;0.14"]
[PlyCount "102"]
[EventDate "2012.07.19"]
[EventType "tourn"]

{AMD Phenom(tm) II X6 1045T Processor 2999 MHz  W=23.3 plies; 1.986kN/s  B=24.
5 plies; 1.243kN/s} 1. d4 Nf6 2. c4 e6 3. Nf3 b6 4. g3 Ba6 5. Qa4 Be7 6. Bg2 c6
7. Nc3 O-O 8. O-O d5 9. Ne5 Qe8 10. b3 {[%eval 14,21] [%emt 0:02:29]} Bb7 {
[%eval 14,21] [%emt 0:01:30] (b5)} 11. a3 {[%eval 32,18] [%emt 0:03:15] (Ba3)}
a5 {[%eval -87,19] [%emt 0:01:19] (Qc8)} 12. Bg5 {[%eval -22,19] [%emt 0:02:20]
} Ne4 {[%eval -96,21] [%emt 0:00:21] (Nfd7)} 13. Bxe4 {[%eval -42,21] [%emt 0:
05:49]} Bxg5 {[%eval -105,25] [%emt 0:04:05]} 14. Bd3 {[%eval -34,22] [%emt 0:
01:55]} Be7 {[%eval -100,23] [%emt 0:02:53] (Bf6)} 15. Kg2 {[%eval -40,18] 
[%emt 0:02:54] (Rfc1)} f6 {[%eval -157,23] [%emt 0:02:12] (Bf6)} 16. Nf3 {
[%eval -127,17] [%emt 0:02:12]} Nd7 {[%eval -156,26] [%emt 0:00:48] (Qh5)} 17.
cxd5 {[%eval -208,21] [%emt 0:02:50]} b5 {[%eval -168,28] [%emt 0:00:43]} 18.
Bxb5 {[%eval -205,22] [%emt 0:00:00]} Nb6 {[%eval -173,25] [%emt 0:00:54]} 19.
Bxc6 {[%eval -229,24] [%emt 0:03:58]} Nxa4 {[%eval -173,29] [%emt 0:01:51]} 20.
Bxe8 {[%eval -214,25] [%emt 0:00:41]} Nxc3 {[%eval -172,27] [%emt 0:00:27]} 21.
Bc6 {[%eval -189,26] [%emt 0:02:15]} Bxc6 {[%eval -169,28] [%emt 0:02:19]} 22.
dxc6 {[%eval -192,25] [%emt 0:02:01]} Rfc8 {[%eval -170,27] [%emt 0:00:27]} 23.
Rfc1 {[%eval -192,26] [%emt 0:02:33]} Rxc6 {[%eval -168,29] [%emt 0:02:09]} 24.
e3 {[%eval -195,26] [%emt 0:00:07] (Nd2)} Rac8 {[%eval -171,26] [%emt 0:01:04]}
25. Nd2 {[%eval -199,25] [%emt 0:00:54]} Nd5 {[%eval -172,27] [%emt 0:03:03] 
(a4)} 26. Rc4 {[%eval -193,25] [%emt 0:03:37]} Nb6 {[%eval -172,28] [%emt 0:01:
53]} 27. Rxc6 {[%eval -198,26] [%emt 0:01:27]} Rxc6 {[%eval -169,25] [%emt 0:
01:31]} 28. Kf1 {[%eval -181,27] [%emt 0:02:05]} Rc3 {[%eval -170,25] [%emt 0:
01:38] (Kf7)} 29. Ke2 {[%eval -168,24] [%emt 0:03:07] (b4)} Nd5 {[%eval -182,
23] [%emt 0:01:12] (Kf7)} 30. h4 {[%eval -186,24] [%emt 0:02:06] (e4)} f5 {
[%eval -203,23] [%emt 0:01:19]} 31. Kf3 {[%eval -196,26] [%emt 0:00:49]} Rc2 {
[%eval -206,25] [%emt 0:00:47] (Kf7)} 32. Ke2 {[%eval -196,27] [%emt 0:01:38]}
Kf7 {[%eval -200,24] [%emt 0:00:32]} 33. Kd1 {[%eval -193,27] [%emt 0:02:07]}
Rc3 {[%eval -212,24] [%emt 0:03:48]} 34. Ke2 {[%eval -195,27] [%emt 0:00:00]}
Kf6 {[%eval -208,22] [%emt 0:00:56] (Nf6)} 35. Ke1 {[%eval -193,25] [%emt 0:01:
36] (Kd1)} Kg6 {[%eval -221,23] [%emt 0:01:16] (h6)} 36. Kf1 {[%eval -193,25] 
[%emt 0:01:22]} Kh5 {[%eval -220,26] [%emt 0:03:30]} 37. Ke2 {[%eval -204,26] 
[%emt 0:05:53] (Ke1)} Kg4 {[%eval -230,22] [%emt 0:00:27] (Bf6)} 38. Kd1 {
[%eval -226,24] [%emt 0:02:43]} Kh3 {[%eval -283,25] [%emt 0:01:17] (Bf6)} 39.
Ke2 {[%eval -353,22] [%emt 0:04:04]} Kg2 {[%eval -325,24] [%emt 0:01:23]} 40.
Nf3 {[%eval -334,23] [%emt 0:00:00]} Rxb3 {[%eval -338,22] [%emt 0:03:38]} 41.
Rg1+ {[%eval -379,24] [%emt 0:05:15]} Kh3 {[%eval -347,25] [%emt 0:01:27]} 42.
Rh1+ {[%eval -395,22] [%emt 0:01:48]} Kg4 {[%eval -343,25] [%emt 0:03:12]} 43.
Ne5+ {[%eval -379,24] [%emt 0:00:00]} Kh5 {[%eval -342,23] [%emt 0:00:19]} 44.
a4 {[%eval -409,22] [%emt 0:01:28]} Bf6 {[%eval -343,25] [%emt 0:01:52]} 45.
Nc4 {[%eval -409,22] [%emt 0:00:44]} Kg4 {[%eval -351,23] [%emt 0:00:44]} 46.
Ra1 {[%eval -417,22] [%emt 0:00:37]} Rc3 {[%eval -366,26] [%emt 0:03:42]} 47.
Nd2 {[%eval -449,24] [%emt 0:00:00]} Rc2 {[%eval -362,24] [%emt 0:00:53]} 48.
Kd1 {[%eval -450,21] [%emt 0:00:08]} Rb2 {[%eval -368,23] [%emt 0:00:35] (Nb4)}
49. Kc1 {[%eval -435,20] [%emt 0:01:12]} Rb4 {[%eval -380,23] [%emt 0:00:31]}
50. Ra3 {[%eval -490,22] [%emt 0:05:12] (Kd1)} Be7 {[%eval -434,22] [%emt 0:00:
53] (e5)} 51. Ra2 {[%eval -500,20] [%emt 0:02:13]} Kh3 {[%eval -486,25] [%emt
0:04:10] (f4)} 0-1
Probably the worst opening handling I've ever seen from Houdini.
User avatar
Dr.Wael Deeb
Posts: 9773
Joined: Wed Mar 08, 2006 8:44 pm
Location: Amman,Jordan

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by Dr.Wael Deeb »

Houdini wrote:Game 44 was a complete crash by Houdini.

The opening position after 1. d4 Nf6 2. c4 e6 3. Nf3 b6 4. g3 Ba6 5. Qa4 Be7 6. Bg2 c6 7. Nc3 O-O 8. O-O d5 9. Ne5 Qe8:

[d] rn2qrk1/p3bppp/bpp1pn2/3pN3/Q1PP4/2N3P1/PP2PPBP/R1B2RK1 w - - 2 10
Houdini is White and plays 10.b3 (?!) and 11.a3?? and is lost. The white queen is trapped.

Code: Select all

[Event "Komodo5-Houdini"]
[Site "?"]
[Date "2012.07.25"]
[Round "44.1"]
[White "Houdini 2.0c Pro x64"]
[Black "Komodo 5 64-bit"]
[Result "0-1"]
[ECO "E15"]
[Annotator "0.14;0.14"]
[PlyCount "102"]
[EventDate "2012.07.19"]
[EventType "tourn"]

{AMD Phenom(tm) II X6 1045T Processor 2999 MHz  W=23.3 plies; 1.986kN/s  B=24.
5 plies; 1.243kN/s} 1. d4 Nf6 2. c4 e6 3. Nf3 b6 4. g3 Ba6 5. Qa4 Be7 6. Bg2 c6
7. Nc3 O-O 8. O-O d5 9. Ne5 Qe8 10. b3 {[%eval 14,21] [%emt 0:02:29]} Bb7 {
[%eval 14,21] [%emt 0:01:30] (b5)} 11. a3 {[%eval 32,18] [%emt 0:03:15] (Ba3)}
a5 {[%eval -87,19] [%emt 0:01:19] (Qc8)} 12. Bg5 {[%eval -22,19] [%emt 0:02:20]
} Ne4 {[%eval -96,21] [%emt 0:00:21] (Nfd7)} 13. Bxe4 {[%eval -42,21] [%emt 0:
05:49]} Bxg5 {[%eval -105,25] [%emt 0:04:05]} 14. Bd3 {[%eval -34,22] [%emt 0:
01:55]} Be7 {[%eval -100,23] [%emt 0:02:53] (Bf6)} 15. Kg2 {[%eval -40,18] 
[%emt 0:02:54] (Rfc1)} f6 {[%eval -157,23] [%emt 0:02:12] (Bf6)} 16. Nf3 {
[%eval -127,17] [%emt 0:02:12]} Nd7 {[%eval -156,26] [%emt 0:00:48] (Qh5)} 17.
cxd5 {[%eval -208,21] [%emt 0:02:50]} b5 {[%eval -168,28] [%emt 0:00:43]} 18.
Bxb5 {[%eval -205,22] [%emt 0:00:00]} Nb6 {[%eval -173,25] [%emt 0:00:54]} 19.
Bxc6 {[%eval -229,24] [%emt 0:03:58]} Nxa4 {[%eval -173,29] [%emt 0:01:51]} 20.
Bxe8 {[%eval -214,25] [%emt 0:00:41]} Nxc3 {[%eval -172,27] [%emt 0:00:27]} 21.
Bc6 {[%eval -189,26] [%emt 0:02:15]} Bxc6 {[%eval -169,28] [%emt 0:02:19]} 22.
dxc6 {[%eval -192,25] [%emt 0:02:01]} Rfc8 {[%eval -170,27] [%emt 0:00:27]} 23.
Rfc1 {[%eval -192,26] [%emt 0:02:33]} Rxc6 {[%eval -168,29] [%emt 0:02:09]} 24.
e3 {[%eval -195,26] [%emt 0:00:07] (Nd2)} Rac8 {[%eval -171,26] [%emt 0:01:04]}
25. Nd2 {[%eval -199,25] [%emt 0:00:54]} Nd5 {[%eval -172,27] [%emt 0:03:03] 
(a4)} 26. Rc4 {[%eval -193,25] [%emt 0:03:37]} Nb6 {[%eval -172,28] [%emt 0:01:
53]} 27. Rxc6 {[%eval -198,26] [%emt 0:01:27]} Rxc6 {[%eval -169,25] [%emt 0:
01:31]} 28. Kf1 {[%eval -181,27] [%emt 0:02:05]} Rc3 {[%eval -170,25] [%emt 0:
01:38] (Kf7)} 29. Ke2 {[%eval -168,24] [%emt 0:03:07] (b4)} Nd5 {[%eval -182,
23] [%emt 0:01:12] (Kf7)} 30. h4 {[%eval -186,24] [%emt 0:02:06] (e4)} f5 {
[%eval -203,23] [%emt 0:01:19]} 31. Kf3 {[%eval -196,26] [%emt 0:00:49]} Rc2 {
[%eval -206,25] [%emt 0:00:47] (Kf7)} 32. Ke2 {[%eval -196,27] [%emt 0:01:38]}
Kf7 {[%eval -200,24] [%emt 0:00:32]} 33. Kd1 {[%eval -193,27] [%emt 0:02:07]}
Rc3 {[%eval -212,24] [%emt 0:03:48]} 34. Ke2 {[%eval -195,27] [%emt 0:00:00]}
Kf6 {[%eval -208,22] [%emt 0:00:56] (Nf6)} 35. Ke1 {[%eval -193,25] [%emt 0:01:
36] (Kd1)} Kg6 {[%eval -221,23] [%emt 0:01:16] (h6)} 36. Kf1 {[%eval -193,25] 
[%emt 0:01:22]} Kh5 {[%eval -220,26] [%emt 0:03:30]} 37. Ke2 {[%eval -204,26] 
[%emt 0:05:53] (Ke1)} Kg4 {[%eval -230,22] [%emt 0:00:27] (Bf6)} 38. Kd1 {
[%eval -226,24] [%emt 0:02:43]} Kh3 {[%eval -283,25] [%emt 0:01:17] (Bf6)} 39.
Ke2 {[%eval -353,22] [%emt 0:04:04]} Kg2 {[%eval -325,24] [%emt 0:01:23]} 40.
Nf3 {[%eval -334,23] [%emt 0:00:00]} Rxb3 {[%eval -338,22] [%emt 0:03:38]} 41.
Rg1+ {[%eval -379,24] [%emt 0:05:15]} Kh3 {[%eval -347,25] [%emt 0:01:27]} 42.
Rh1+ {[%eval -395,22] [%emt 0:01:48]} Kg4 {[%eval -343,25] [%emt 0:03:12]} 43.
Ne5+ {[%eval -379,24] [%emt 0:00:00]} Kh5 {[%eval -342,23] [%emt 0:00:19]} 44.
a4 {[%eval -409,22] [%emt 0:01:28]} Bf6 {[%eval -343,25] [%emt 0:01:52]} 45.
Nc4 {[%eval -409,22] [%emt 0:00:44]} Kg4 {[%eval -351,23] [%emt 0:00:44]} 46.
Ra1 {[%eval -417,22] [%emt 0:00:37]} Rc3 {[%eval -366,26] [%emt 0:03:42]} 47.
Nd2 {[%eval -449,24] [%emt 0:00:00]} Rc2 {[%eval -362,24] [%emt 0:00:53]} 48.
Kd1 {[%eval -450,21] [%emt 0:00:08]} Rb2 {[%eval -368,23] [%emt 0:00:35] (Nb4)}
49. Kc1 {[%eval -435,20] [%emt 0:01:12]} Rb4 {[%eval -380,23] [%emt 0:00:31]}
50. Ra3 {[%eval -490,22] [%emt 0:05:12] (Kd1)} Be7 {[%eval -434,22] [%emt 0:00:
53] (e5)} 51. Ra2 {[%eval -500,20] [%emt 0:02:13]} Kh3 {[%eval -486,25] [%emt
0:04:10] (f4)} 0-1
Probably the worst opening handling I've ever seen from Houdini.
Go fix the bug then....................
_No one can hit as hard as life.But it ain’t about how hard you can hit.It’s about how hard you can get hit and keep moving forward.How much you can take and keep moving forward….
lech
Posts: 1169
Joined: Sun Feb 14, 2010 10:02 pm

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by lech »

Houdini wrote:Game 44 was a complete crash by Houdini.
The white queen is trapped.
Probably the worst opening handling I've ever seen from Houdini.
A similar fatal situation was in the game Deep Junior vs Houdini (8 revving tournament). A black Bishop was trapped in opening. Again longer time time control! It is a nasty side effect of Houdini's chess strategy. :lol:
Maybe, I can't be friendly, but let me be useful.
Waschbaer
Posts: 68
Joined: Mon Dec 12, 2011 11:27 pm

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by Waschbaer »

Strange,

after b3 and Bb7 Houdini doesn't take an interrest in a3 on my system

(Contempt = 0)


[+0.05] d=26 11.Ba3 a5 12.Rac1 Ne4 13.Bxe7 Qxe7 14.Nb1 Re8 15.Rfe1 f6 16.Nd3 Nd7 17.Qa3 Qxa3 18.Nxa3 Nd6 19.c5 bxc5 20.Nxc5 Nxc5 21.Rxc5 g6 22.Nc2 Kg7 23.e4 a4 24.b4 Rab8 25.e5 fxe5 26.Rxe5 Nc4 (0:12:11) 7548790kN
[+0.08] d=25 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 a6 17.e3 h5 18.h4 Rfd8 19.a3 Qd6 20.Qc2 Re8 21.Rfd1 Rab8 22.Na4 Qc7 23.Qe2 Bc8 24.Nc3 Bg4 (0:05:56) 3588249kN
[+0.07] d=24 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 a6 17.e3 h5 18.h4 Rfe8 19.a3 Ng4 20.Qb3 Nf6 21.Qb1 Rab8 22.Na4 Qc7 23.Qd3 Ne4 24.Bxe4 dxe4 (0:03:44) 2217073kN
[+0.05] d=23 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 h6 17.e3 Rfc8 18.Rc2 Qd6 19.Rfc1 Re8 20.Qb3 Qd7 21.b5 c5 22.dxc5 bxc5 23.Rd1 c4 24.Qb4 Qf5 (0:02:22) 1362361kN
[+0.07] d=22 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 h6 17.e3 Rfc8 18.Rc2 a6 19.Qb3 Re8 20.Na4 Qc7 21.Qd3 Bc8 22.Rfc1 Bd7 23.Nc3 Qd6 (0:01:45) 1024406kN
[+0.04] d=21 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 h6 17.e3 Qd7 18.Rfd1 Rfc8 19.Rd2 a5 20.bxa5 Rxa5 21.Qb3 Qd8 22.Rb2 Ra6 23.Bf1 (0:01:25) 828063kN
[+0.04] d=20 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 h6 17.e3 Qd7 18.Rfd1 Rfc8 19.Rd2 a5 20.bxa5 Rxa5 21.Qb3 Qd8 22.Rb2 Ra6 23.Bf1 (0:01:09) 663253kN
[+0.05] d=19 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 h6 17.b5 c5 18.dxc5 bxc5 19.e3 c4 20.Rfd1 Rfd8 21.Qc2 Qe6 22.Rb1 Ne4 23.Bxe4 dxe4 (0:01:02) 591973kN
[+0.04] d=18 11.Ba3 Nfd7 12.cxd5 exd5 13.Bxe7 Qxe7 14.Nxd7 Nxd7 15.b4 Nf6 16.Rac1 h6 17.b5 c5 18.dxc5 bxc5 19.e3 c4 20.Rfd1 Qc5 21.Qc2 Rae8 22.Qf5 (0:00:50) 476732kN
[+0.08] d=17 11.Ba3 a5 12.Rfc1 Nfd7 13.Bxe7 Qxe7 14.cxd5 Nxe5 15.dxe5 exd5 16.e4 dxe4 17.Qxe4 Nd7 18.f4 Nc5 19.Qc2 Ne6 20.Rd1 Qc5+ 21.Kh1 Rad8 22.Rac1 h6 (0:00:36) 346136kN
[-0.06] d=17 11.Bf4 Na6 (0:00:26) 243718kN
[+0.23] d=17 11.Bf4 Na6 (0:00:10) 94959kN
[+0.37] d=17 11.Bf4 Na6 (0:00:04) 34598kN
[+0.43] d=16 11.Bf4 Na6 12.Rac1 c5 13.Qxe8 Rfxe8 14.dxc5 Nxc5 15.Rfd1 Nce4 16.Nb5 dxc4 17.Nxc4 Bc5 18.e3 Red8 19.Rxd8+ Rxd8 20.Nxa7 Ng4 (0:00:01) 11452kN
[+0.43] d=15 11.Bf4 Na6 12.Rac1 c5 13.Qxe8 Rfxe8 14.dxc5 Nxc5 15.Rfd1 Nce4 16.Nb5 dxc4 17.Nxc4 Bc5 18.e3 Red8 19.Rxd8+ Rxd8 20.Nxa7 Ng4 (0:00:01) 8273kN
[+0.43] d=14 11.Bf4 Na6 12.Rac1 c5 13.Qxe8 Rfxe8 14.Be3 cxd4 15.Bxd4 Nb4 16.a3 Nc6 17.Nxc6 Bxc6 18.Bxf6 Bxf6 19.cxd5 exd5 20.Nxd5 (0:00:00) 3125kN
[+0.46] d=13 11.Bf4 Na6 12.Rac1 c5 13.Qxe8 Rfxe8 14.dxc5 Nxc5 15.Rfd1 dxc4 16.Bxb7 Nxb7 17.Nxc4 Red8 18.Kg2 Rac8 19.Nb5 (0:00:00) 2751kN
[+0.43] d=13 11.Bf4 Na6 (0:00:00) 1316kN
[+0.50] d=12 11.Bf4 Na6 12.Rac1 Nc7 13.b4 h6 14.b5 cxb5 15.cxb5 a6 16.bxa6 Nxa6 17.Qb5 Nb4 18.Qxb6 (0:00:00) 634kN
[+0.50] d=11 11.Bf4 Na6 12.Rac1 Nc7 13.b4 h6 14.b5 cxb5 15.cxb5 a6 16.bxa6 Nxa6 17.Qb5 Nb4 18.Qxb6 (0:00:00) 423kN

Edit:
AMD Phenom II X6 3.2 GHz
lech
Posts: 1169
Joined: Sun Feb 14, 2010 10:02 pm

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by lech »

Waschbaer wrote:Strange,

after b3 and Bb7 Houdini doesn't take an interrest in a3 on my system

(Contempt = 0)
It means two brains in one head.
First to play, second to analyze. :?
BTW. Maybe you use a shorter "the rule 50"? :lol:
Maybe, I can't be friendly, but let me be useful.
User avatar
Houdini
Posts: 1471
Joined: Tue Mar 16, 2010 12:00 am

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by Houdini »

After some of the worst, also some of the very best.
Game 47, King's Indian Defense.
The key position occurs after White's move 29.Qd2.

[d]1rq5/2p1nrk1/2Pp1np1/B2Ppb2/R3p3/2N3P1/3QBP2/4R1K1 b - - 2 29

Houdini uses nearly 12 minutes to play the sacrifice 29...Qh8!! and after 30.Bxc7 the truly outstanding 30..Kg8!! with a winning attack (Houdini eval -3.45).

Code: Select all

[Event "Komodo5-Houdini"]
[Site "?"]
[Date "2012.07.26"]
[Round "47.1"]
[White "Komodo 5 64-bit"]
[Black "Houdini 2.0c Pro x64"]
[Result "0-1"]
[ECO "E97"]
[Annotator "0.19;-0.02"]
[PlyCount "74"]
[EventDate "2012.07.19"]
[EventType "tourn"]

{AMD Phenom(tm) II X6 1045T Processor 2999 MHz  W=21.4 plies; 1.055kN/s  B=21.
6 plies; 1.833kN/s} 1. d4 Nf6 2. c4 g6 3. Nc3 Bg7 4. e4 d6 5. Nf3 O-O 6. Be2 e5
7. O-O Nc6 8. d5 Ne7 9. b4 a5 10. Ba3 b6 11. bxa5 Nh5 12. g3 {[%eval 19,22] 
[%emt 0:01:48]} Rxa5 {[%eval -2,21] [%emt 0:02:43] (f5)} 13. Bb4 {[%eval 2,20]
[%emt 0:00:32]} Ra6 {[%eval 3,23] [%emt 0:02:49]} 14. a4 {[%eval -1,23] [%emt
0:13:54]} f5 {[%eval 0,24] [%emt 0:00:00]} 15. a5 {[%eval 3,23] [%emt 0:03:42]
(Ng5)} Nf6 {[%eval 4,22] [%emt 0:04:59]} 16. axb6 {[%eval 11,25] [%emt 0:06:51]
} Rxb6 {[%eval 2,24] [%emt 0:00:00]} 17. Ba5 {[%eval 12,22] [%emt 0:02:15]} Rb7
{[%eval -4,23] [%emt 0:04:27] (fxe4)} 18. Nd2 {[%eval 5,23] [%emt 0:06:37]} Bh6
{[%eval 0,23] [%emt 0:00:40] (Nxe4)} 19. c5 {[%eval 31,21] [%emt 0:01:38]} Bxd2
{[%eval 8,22] [%emt 0:01:16]} 20. Qxd2 {[%eval 40,21] [%emt 0:00:07]} fxe4 {
[%eval 4,22] [%emt 0:03:01]} 21. c6 {[%eval 27,22] [%emt 0:00:49] (cxd6)} Rb8 {
[%eval -5,22] [%emt 0:02:18]} 22. Ra4 {[%eval 23,22] [%emt 0:01:25]} Bf5 {
[%eval -9,23] [%emt 0:06:38] (Kg7)} 23. Rfa1 {[%eval 17,21] [%emt 0:04:27]} Rf7
{[%eval -5,23] [%emt 0:00:00]} 24. Qa2 {[%eval 15,21] [%emt 0:03:45] (Rd1)} Kg7
{[%eval -4,22] [%emt 0:03:11]} 25. Re1 {[%eval 15,22] [%emt 0:00:26] (Rd1)} h5
{[%eval -17,22] [%emt 0:01:49] (g5)} 26. Rd1 {[%eval -13,21] [%emt 0:05:30] 
(h4)} h4 {[%eval -34,21] [%emt 0:04:10] (Ng4)} 27. Re1 {[%eval -14,20] [%emt 0:
02:40]} hxg3 {[%eval -56,20] [%emt 0:00:40]} 28. hxg3 {[%eval -64,22] [%emt 0:
06:06] (fxg3)} Qc8 {[%eval -57,20] [%emt 0:02:09]} 29. Qd2 {[%eval -69,21] 
[%emt 0:01:32]} Qh8 {[%eval -147,22] [%emt 0:11:55] (Bg4)} 30. Bxc7 {[%eval
-59,20] [%emt 0:01:02]} Kg8 {[%eval -345,18] [%emt 0:00:00] (e3)} 31. Bf1 {
[%eval -204,20] [%emt 0:04:02]} Ng4 {[%eval -422,23] [%emt 0:00:00]} 32. f3 {
[%eval -228,20] [%emt 0:00:49] (f4)} exf3 {[%eval -427,19] [%emt 0:00:26]} 33.
Rxg4 {[%eval -218,23] [%emt 0:02:59]} f2+ {[%eval -473,23] [%emt 0:08:26] 
(Bxg4)} 34. Qxf2 {[%eval -190,19] [%emt 0:00:55]} Bxg4 {[%eval -618,19] [%emt
0:00:00]} 35. Qd2 {[%eval -240,21] [%emt 0:04:23] (Qh2)} Rbf8 {[%eval -686,19]
[%emt 0:00:41] (Bf3)} 36. Bg2 {[%eval -512,20] [%emt 0:06:54]} Qh5 {[%eval
-783,22] [%emt 0:00:00]} 37. Bxd6 {[%eval -567,21] [%emt 0:07:13] (Be4)} Rh7 {
[%eval -1270,19] [%emt 0:00:31]} 0-1
Sedat Canbaz
Posts: 3018
Joined: Thu Mar 09, 2006 11:58 am
Location: Antalya/Turkey

Re: 60 games Komodo 5 against Top4 at 120m+3s

Post by Sedat Canbaz »

lkaufman wrote: I hope you continue your work. Your testing hasn't been relevant for us since we haven't yet released MP, but when we do we will of course hope to do well in your testing.
In general people who disagree with you are more likely to comment than those who agree, so don't think the comments are a representative sample.
The one thing I don't understand about your work is why you feel that it is important for reversal testing to use a "perfect" book in terms of the normal White advantage. Obviously a very good book is vital for competitive play, but for engine testing we need huge samples to overcome the error bars, and if the price we must pay is using somewhat imperfect books, that is a necessary price in my opinion.

Best regards,
Larry

Dear Larry,

Thank you...

Be sure,i will have a BIG pleasure, when i will start testing Komodo MP in SCCT

Yes,i hope to continue my work,but at least i need to pause some of my activities

And still i am not sure which one to pause:
-Chess Benchmarks by Houdini/Fritz ?
-Working over Perfect Book Series ?
-SCCT Rating list (testing the Top Chess Engines) ?
-SCCT Super Book League (based on Top Book Makers)
vs...

It seems those critics are appearing, due to i have lot of activities-that's my mistake or weakness

And for solution (for less critics and for more rest),i think the right choice will be:
-I need to pause some of them...,but the biggest problem is that: ''i love a lot of ComputerChess''

Let's wait and see...the time will tell :)

About the openings issue,
As far as understand, you are looking for varied book,thats ok and i give you right, because you an engine developer
Its a very good idea that you need to test Komodo in various positions,before final release,actually if i was in your shoes,probably i would do the same as you...

But however,(during my testings) i did not satisfied by the performance of some tested opening lines
Of course,i wish i could test the engines with all grandmaster openings,
but it seems,some Top engines performance suffer,in case if we test the engines with various openings ...
For me its more important the performance of the engines than to use various openings...
That's why, as far as possible i prefer the strongest openings lines for both sides White and Black (as far as possible i keep the winning lines:Withes 55 % and Blacks 45 %)
I mean,its something like:i am optimizing my opening lines for serious official tournament

For example,the below positions are based on books,based some SCCT games,
And we see the winning percentage of some popular lines suffer...(i mean less than 40 % winning percentage with Blacks)

I'd like to point out too,(in my latest neutral short books) one of the main first moves is 'Nc3'
Of course, (for official competitions or for Playchess server),i dont suggest to be used this move as main
But if the engines are using same Perfect 2012b book (for both sides) then 'Nc3' is highly recommended
The main Nc3 move increases the possibility to be played the strongest Sicilian openings



Image
Image
Image


Best Wishes,
Sedat