Baffling test position

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

F. Bluemers
Posts: 880
Joined: Thu Mar 09, 2006 11:21 pm
Location: Nederland

Re: Baffling test position

Post by F. Bluemers »

Seems to work for stock dirty too,with the "enough knights" version :wink:

Code: Select all

[Event "Computer Chess Game"]
[Site "BASEMENT_PC"]
[Date "2010.03.21"]
[Round "-"]
[White "Dirty Mar 14 2010"]
[Black "Dirty Mar 14 2010"]
[Result "1-0"]
[TimeControl "40/300"]
[FEN "1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/NNNNKNNN w - - 0 1"]
[SetUp "1"]

{--------------
. q . q k . q .
p p p p p p p p
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
P P P P P P P P
N N N N K N N N
white to play
--------------}
1. Nab3 {-9.66/11 7} d5 {+9.60/11 7} 2. Nf3 {-9.61/12 7} b6 {+9.78/12 7} 3.
d4 {-9.49/14 7} c5 {+9.49/13 7} 4. Nhg3 {-9.57/13 7} c4 {+9.41/13 7} 5.
Nb3d2 Kf8 {+9.30/13 7} 6. Nbc3 {-9.30/13 7} Qbc8 {+9.22/13 7} 7. Nfe3
{-9.30/12 7} e6 {+9.44/14 7} 8. h3 {-9.54/13 7} g6 {+9.37/13 7} 9. Kf1
{-9.77/12 7} f5 {+9.89/13 7} 10. Nh1 {-9.97/14 7} g5 {+10.19/14 7} 11. Kg1
{-10.09/12 7} h5 {+10.08/14 7} 12. Nef1 {-9.80/14 7} Qh7 {+9.90/14 7} 13.
Ne5 Kg8 {+9.89/13 7} 14. e3 {-9.62/12 7} Qcb7 {+9.66/11 7} 15. f4
{-9.65/13 7} g4 {+9.56/14 7} 16. Nhf2 {-9.59/14 7} b5 {+9.56/14 7} 17. N1e2
{-9.69/13 7} Qe8 {+9.83/13 7} 18. Nc1 {-9.93/12 7} a5 {+9.93/11 7} 19. N3e2
{-9.92/12 7} Kh8 20. c3 {-9.97/13 7} Qhg7 {+10.04/13 7} 21. b3
{-10.08/13 7} a4 {+10.10/11 7} 22. b4 {-9.94/12 7} a3 {+10.03/14 7} 23. Kh2
{-10.04/13 7} Qeg8 {+10.03/15 7} 24. h4 {-10.07/15 7} Qbc7 {+10.02/14 7}
25. g3 {-10.05/16 7} Qd6 {+10.02/17 7} 26. Kh1 Qa7 27. Kh2 {-10.05/17 7}
Qgd8 {+10.02/18 7} 28. Kg1 {-10.05/18 7} Qg7 {+10.02/18 7} 29. Kh2
{-10.05/17 7} Q6b6 {+10.02/17 7} 30. Kh1 {-10.05/15 7} Qa6 {+10.02/16 7}
31. Kg1 {-10.05/16 7} Qdc7 {+10.02/18 7} 32. Kh1 {-10.05/16 7} Qaa7
{+10.02/17 7} 33. Kh2 {-10.05/15 7} Qd6 34. e4 {-9.39/14 7} dxe4
{+9.10/14 7} 35. Nfe3 {-9.18/13 7} Qdb6 {+9.84/14 7} 36. Ndxc4 {-9.91/13 7}
bxc4 {+9.71/15 7} 37. N3xc4 {-9.71/15 7} Qb5 {+9.65/15 7} 38. Nde3
{-9.69/14 7} Kg8 {+9.62/14 7} 39. Nb3 {-9.23/14 7} Qe8 {+8.96/14 7} 40. Nc5
{-9.28/13 7} Qac7 {+8.23/13 7} 41. d5 {-6.65/13 7} exd5 {+6.30/14 7} 42.
Nxf5 {-6.17/15 7} Qf6 {+6.19/13 7} 43. Nce3 {-6.00/14 7} Qcd8 {+6.06/13 7}
44. Ned4 {-3.50/14 7} Qa8 {+3.23/13 7} 45. c4 {-2.61/12 7} Qac8
{+2.51/11 7} 46. cxd5 {+0.04/13 7} Kh8 {-0.60/12 7} 47. Nfxe4 {+0.85/13 7}
Qb6 {-0.93/14 7} 48. Ndc2 {+1.01/13 7} Qa8 {-1.07/12 7} 49. Nfd6
{+1.33/13 7} Qg8 {-1.81/12 7} 50. Ndf7+ {+1.62/14 7} Qxf7 {-2.30/16 7} 51.
Nxf7+ {+1.65/15 7} Kg7 {-2.58/16 7} 52. Nc4 Qba7 {-2.26/15 7} 53. Nfg5
{+3.53/15 7} Qxd5 {-3.31/14 7} 54. Nce6+ {+3.32/16 7} Kg8 55. Nf6+
{+3.06/17 7} Kh8 {-3.05/18 7} 56. Nf7+ {+3.05/17 7} Qxf7 57. Nxd5
{+2.85/17 7} Qxe6 58. Nde3 {+2.85/16 7} Qc6 59. N4xa3 {+2.74/14 7} Qf3
{-2.32/15 7} 60. Ng2 {+2.55/15 7} Qf2 {-2.67/17 7} 61. b5 {+2.54/15 7} Kg7
62. Ne3 {+2.78/16 7} Qb2 {-2.84/16 7} 63. Nec4 {+2.86/15 7} Qxa2 64. b6
{+2.80/15 7} Qb3 {-2.98/16 7} 65. Nge3 Qa2+ {-3.04/14 7} 66. Nac2
{+2.98/13 7} Kf6 {-9.61/14 7} 67. b7 {+9.50/13 7} Qa7 {-9.94/15 7} 68. Na5
{+9.81/14 7} Kg6 69. Ng2 {+10.61/13 7} Qxa5 {-10.87/13 7} 70. b8=Q
{+12.12/13 7} Kf7 {-12.15/12 7} 71. Qh8 {+12.12/12 7} Ke7 {-12.07/11 7} 72.
Nce3 {+12.03/11 7} Qa2 {-17.53/12 7} 73. f5 {+17.53/12 7} Qf7 {-18.99/13 7}
74. Nd5+ {+18.84/12 7} Qxd5 {-19.16/13 7} 75. f6+ Kd6 {-19.32/13 7} 76. Qe8
{+19.24/13 7} Qf3 {-19.43/12 7} 77. f7 Kd5 {-319.82/12 7} 78. Nf4+
{+26.07/10 0.6} Kc4 {-319.84/127 0.6} 79. Qa4+ {+319.85/127 0.5} Kc3
{-319.86/127 10} 80. Qa2 {+319.87/126 0.3} Qd1 {-319.88/127 0.5} 81. f8=Q
{+319.89/127 0.4} Qc2+ {-319.90/127 0.4} 82. Ne2+ {+319.91/127 0.4} Kd2
{-319.92/127 0.4} 83. Qd6+ {+319.93/127 0.4} Ke1 {-319.94/127 0.5} 84. Qxc2
{+319.95/127 0.5} Kf1 {-319.96/127 0.4} 85. Qf4+ {+319.97/127 0.2} Ke1 86.
Qcd2# {+319.99/127 0.2}
{White mates} 1-0
Image
User avatar
David Dahlem
Posts: 900
Joined: Wed Mar 08, 2006 9:06 pm

Re: Baffling test position

Post by David Dahlem »

[d]1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/NNNNKNNN w - - 0 1

Naum 4.2 resigns at move 5! :lol:

[Event "Arena Chess"]
[Site "?"]
[Date "2010.03.21"]
[Round "?"]
[White "Naum 4.2"]
[Black "Naum 4.2"]
[Result "0-1"]
[TimeControl "40/300:40/300:40/300"]
[SetUp "1"]
[FEN "1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/NNNNKNNN w - - 0 1"]
[PlyCount "9"]

1.Nf3 {-9.49/10} g5 {+9.63/10} 2.Nd3 {-9.56/21} g4 {+9.56/20} 3.Nd4 {-9.56/20} Qg5 {+9.56/19}
4.Nde3 {-9.56/20} c5 {+9.56/19} 5.Ndb3 {-9.56/19} {White resigns} 0-1
User avatar
David Dahlem
Posts: 900
Joined: Wed Mar 08, 2006 9:06 pm

Re: Baffling test position

Post by David Dahlem »

[d]1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/BBBBKBBB w - - 0 1

[Event "Arena Chess"]
[Site "?"]
[Date "2010.03.21"]
[Round "?"]
[White "Protector 1.3.4"]
[Black "Protector 1.3.4"]
[Result "0-1"]
[TimeControl "40/300:40/300:40/300"]
[SetUp "1"]
[FEN "1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/BBBBKBBB w - - 0 1 "]
[PlyCount "117"]

1.e3 {-6.05/14} c6 {+5.99/13} 2.b3 {-6.00/14} e5 {+5.98/13} 3.c4 {-6.00/13} g6
{+6.05/12} 4.Bc3 {-5.87/13} d6 {+5.90/13} 5.g3 {-5.75/13} a5 {+5.75/12} 6.B1b2 {-5.75/12}
Qa7 {+5.69/12} 7.a3 {-5.60/13} f5 {+5.58/13} 8.b4 {-5.61/13} axb4 {+5.62/13} 9.axb4
{-5.62/13} Qe6 {+5.61/13} 10.h4 {-5.50/12} h5 {+5.53/12} 11.b5 {-5.35/12} Qc7 {+5.31/12}
12.Bbc2 {-5.22/12} Kf8 {+5.19/12} 13.Bhg2 {-5.08/12} Qab6 {+4.99/12} 14.Bca4 {-5.18/11}
c5 {+4.20/13} 15.Bd5 {-3.56/15} Qed7 {+3.41/16} 16.Bfg2 {-3.46/16} Qcd8 {+3.22/16}
17.Bdxb7 {-3.17/14} Kg8 {+3.21/15} 18.Bgd5+ {-3.14/15} Kh7 {+3.15/16} 19.f4 {-3.19/16}
Qbxb7 {+3.21/15} 20.b6 {-3.97/12} Qd7e7 {+5.49/12} 21.fxe5 {-4.12/13} Qbxb6
{+4.46/12} 22.e6 {-4.20/14} Qef8 {+4.29/13} 23.Bdc2 {-3.42/13} Qdb8 {+3.80/13} 24.Bb5
{-3.77/13} Q8a7 {+4.05/12} 25.Bca4 {-3.73/12} Qbd8 {+3.79/12} 26.Bf2 {-3.87/12} Qfe7
{+4.07/12} 27.Bg1 {-3.97/13} Qab6 {+3.93/13} 28.Bd1 {-3.90/12} Qdf8 {+4.05/13} 29.Bda4
{-4.08/12} Qbb8 {+4.02/12} 30.Bbd7 {-3.83/11} g5 {+4.28/11} 31.Bf2 {-3.36/11} Kg6
{+3.51/11} 32.Bc2 {-2.27/10} Kh6 {+3.91/10} 33.Bd1 {-3.86/8} gxh4 {+3.50/11} 34.gxh4
{-3.87/11} Qa7 {+3.83/11} 35.Bg3 {-3.83/11} Kg6 {+3.80/11} 36.Bc2 {-3.77/10} Qa2
{+4.05/10} 37.Bde4 {-3.55/10} Kh7 {+3.09/9} 38.Bexf5+ {-3.09/9} Kg8 {+2.48/11} 39.Bcf6
{-3.67/2} Qexf6 {+4.26/9} 40.Bxf6 {-4.15/6} Qxf6 {+4.18/10} 41.Bb5 {-5.17/13} Qaa1+
{+5.37/14} 42.Ke2 {-5.67/16} Qg1 {+5.73/16} 43.Bf2 {-5.90/16} Qh2 {+5.92/16} 44.Ke1
{-6.01/16} Qa1+ {+6.08/16} 45.Bcb1 {-6.24/18} Kf8 {+6.53/18} 46.Bc6 {-6.63/18} Qb2
{+6.63/17} 47.Bbc2 {-6.65/17} Qc1+ {+6.67/16} 48.Bd1 {-6.79/18} Qxc4 {+6.75/18} 49.Bfe4
{-6.80/18} Qc1 {+6.88/17} 50.Ke2 {-6.92/17} c4 {+7.07/14} 51.Bcd5 {-7.10/15} Ke7
{+7.21/16} 52.Bec2 {-6.98/16} Qb2 {+7.94/16} 53.Kf1 {-7.83/14} Qb4 {+7.92/15} 54.Be1
{-8.01/15} c3 {+8.01/15} 55.d3 {-9.84/15} Qc5 {+11.27/16} 56.Bg2 {-11.40/17} Qxe3
{+14.35/16} 57.Bf2 {-18.81/16} Qd2 {+20.32/16} 58.d4 {-M12/15} Kxe6 {+M12/15} 59.Bcb3+
{-M11/15} {White resigns} 0-1
User avatar
David Dahlem
Posts: 900
Joined: Wed Mar 08, 2006 9:06 pm

Re: Baffling test position

Post by David Dahlem »

[d]qnnnknnq/pppppppp/8/8/8/8/PPPPPPPP/RRRRKRRR w - - 0 1

[Event "Arena Chess"]
[Site "?"]
[Date "2010.03.21"]
[Round "?"]
[White "Stockfish 1.6.3s"]
[Black "Stockfish 1.6.3s"]
[Result "0-1"]
[TimeControl "40/300:40/300:40/300"]
[SetUp "1"]
[FEN "qnnnknnq/pppppppp/8/8/8/8/PPPPPPPP/RRRRKRRR w - - 0 1 "]
[Termination "normal"]
[PlyCount "59"]

1.f3 {-4.24/16} Nd6 {+4.28/16} 2.c4 {-4.32/16} b6 {+4.32/16} 3.d4 {-4.36/15} h5
{+4.24/16} 4.Rf2 {-4.32/14} c5 {+4.24/14} 5.dxc5 {-4.48/16} bxc5 {+4.80/15} 6.h4
{-5.05/16} Nbc6 {+5.09/15} 7.g4 {-5.33/16} Nfe6 {+5.09/16} 8.gxh5 {-5.61/15} Nf6
{+5.77/14} 9.Rc3 {-5.73/14} Qh6 {+5.93/15} 10.Rh3 {-5.69/14} Ne5 {+6.10/15} 11.Rbc1
{-6.22/14} Nf4 {+6.70/15} 12.Rhh2 {-6.30/14} Nde6 {+6.50/15} 13.Rab1 {-6.62/13} Nf5
{+6.82/15} 14.Rd2 {-6.42/13} Nfd4 {+6.62/13} 15.a3 {-6.66/14} Kf8 {+6.50/12} 16.Rba1
{-7.19/13} a5 {+7.31/13} 17.a4 {-7.39/13} Qxh5 {+7.31/13} 18.Raa3 {-7.15/14} d6 {+7.39/14}
19.Kf1 {-7.35/15} Nfg6 {+7.43/14} 20.Rgh1 {-7.51/13} Nf5 {+7.71/13} 21.Rg1 {-7.71/13}
Ngxh4 {+7.91/13} 22.Rcd1 {-7.95/13} g5 {+7.91/14} 23.Rhh1 {-8.24/12} Qa6 {+8.80/12}
24.Rdc1 {-8.96/13} g4 {+8.84/13} 25.e4 {-8.24/12} Nfd4 {+9.29/12} 26.f4 {-9.45/12} Nxc4
{+9.29/13} 27.R3xc4 {-9.57/9} Nxe4 {+10.18/12} 28.Ke1 {-10.10/11} Nxf2 {+10.42/12} 29.Kxf2
{-10.22/12} Nxf4 {+10.30/13} 30.Rgg3 {-10.78/11} {White resigns} 0-1
User avatar
David Dahlem
Posts: 900
Joined: Wed Mar 08, 2006 9:06 pm

Re: Baffling test position

Post by David Dahlem »

[d]bbbbkbbb/pppppppp/8/8/8/8/PPPPPPPP/NNNNKNNN w - - 0 1

Bishops are better than Knights!! :wink: :lol:

[Event "Arena Chess"]
[Site "?"]
[Date "2010.03.21"]
[Round "?"]
[White "Stockfish 1.6.3s"]
[Black "Stockfish 1.6.3s"]
[Result "0-1"]
[TimeControl "40/300:40/300:40/300"]
[SetUp "1"]
[FEN "bbbbkbbb/pppppppp/8/8/8/8/PPPPPPPP/NNNNKNNN w - - 0 1"]
[PlyCount "91"]

1.Nhg3 {-1.29/15} g6 {+1.61/16} 2.Nbc3 {-1.81/16} f5 {+2.06/17} 3.Nf3 {-2.22/16}
e5 {+2.46/17} 4.d3 {-2.42/16} d5 {+2.42/16} 5.e3 {-2.66/16} c5 {+2.70/16} 6.Nab3
{-2.78/15} b6 {+2.90/16} 7.Nbd2 {-3.23/16} g5 {+3.15/16} 8.e4 {-3.31/17} fxe4 {+3.31/18}
9.Ndxe4 {-3.47/16} dxe4 {+3.47/17} 10.Ncxe4 {-3.43/17} c4 {+3.59/17} 11.dxc4 {-3.67/16}
Bxc4 {+3.67/16} 12.Nfe3 {-3.79/17} Bf7 {+3.95/17} 13.Nfxg5 {-3.83/16} Bg6 {+4.04/18}
14.f3 {-4.12/17} Bh6 {+4.24/17} 15.h4 {-4.20/16} Bhf6 {+4.32/16} 16.Nce2 {-4.20/16}
Bfxg5 {+4.36/17} 17.hxg5 {-4.40/17} Bdxg5 {+4.56/17} 18.Kd2 {-4.64/18} Bcb7
{+4.68/18} 19.Kd3 {-4.64/17} Ba6+ {+4.72/17} 20.Kd2 {-4.76/17} Kf7 {+4.80/17} 21.Ne2c3
{-4.80/15} Bh4 {+4.72/16} 22.Nce2 {-4.88/15} Bc6 {+4.84/15} 23.Ndf2 {-4.76/16} Ke6
{+4.80/15} 24.b3 {-4.92/15} Bd6 {+5.33/15} 25.c4 {-5.37/15} Bb4+ {+5.49/15} 26.Kd3
{-5.77/15} Be1 {+5.93/14} 27.Ngh1 {-6.46/15} b5 {+6.94/16} 28.g3 {-6.62/16} B4g5
{+6.94/16} 29.Ned1 {-7.15/15} bxc4+ {+7.07/16} 30.bxc4 {-6.82/10} Bd5 {+6.90/17} 31.Nb2
{-7.35/17} Bgc1 {+7.35/17} 32.Nfd1 {-7.19/18} Bh5 {+7.63/16} 33.Nxc1 {-7.63/15} Bxe4+
{+7.75/18} 34.fxe4 {-7.79/18} Bxc1 {+8.04/19} 35.g4 {-8.24/19} Bxg4 {+8.28/18} 36.Nhf2
{-8.32/16} h5 {+8.48/17} 37.Na4 {-9.21/17} Kd6 {+9.13/19} 38.Nac3 {-9.41/18} Bxd1
{+9.69/19} 39.Ncxd1 {-9.81/19} Kc5 {+9.81/20} 40.Nh3 {-9.73/17} Bxc4+ {+10.42/18} 41.Kc2
{-8.76/6} Ba3 {+10.74/20} 42.Ne3 {-10.94/21} Bxa2 {+11.39/21} 43.Ng5 {-11.35/20} Bg3
{+12.20/21} 44.Nf3 {-12.60/20} a5 {+13.13/20} 45.Nd1 {-14.06/22} h4 {+14.26/20} 46.Nc3
{-15.31/22} {White resigns} 0-1
User avatar
hgm
Posts: 28393
Joined: Fri Mar 10, 2006 10:06 am
Location: Amsterdam
Full name: H G Muller

Re: Baffling test position

Post by hgm »

Indeed, Bishops seem to be significantly less good at this than Knights. I guess this should not surprise us: the Bishops have difficulty cooperating, because they are divided in two groups that cannot defend each other.

But to get back to the Knights: Stockfish does not seem very good at this, because it consistently loses when playing the seven Knights, where my new engine consistently wins with the seven Knights against the same opponent (namely Stockfish). So I don't believe it is very significant that the Knights lose in Stockfish self-play. It is much more significant that they win in self-play of my new engine. So I think the 7 Knights are really much stronger than the the 3 Queens.
LiquidNitrogenOverclocker

Re: Baffling test position

Post by LiquidNitrogenOverclocker »

hgm wrote:So I think the 7 Knights are really much stronger than the the 3 Queens.
Try to keep in mind, the "material distribution" that is "normal" in chess allows for meaningful trades throughout the course of the game. The trades are usually with like-valued material, and if pawn captures are involved during the combination, some positional factors must be considered carefully. Sometimes a positional "weakness" that is felt by one side is only temporary, other times it is lasting and contributes to a bad position becoming worse.

In the Knights vs. Queens test that was designed, the game play has been biased heavily towards the tactical evaluations. The deeper searching program will win.

The Queens need to win material outright, since trading any Queen for a few knights will only win if sufficient mating force remains on the board.

3Q vs. 7N reduced to 2Q vs. 6N would be making it more difficult to tactically "win a knight". And, the side with the 6N might not like the 18+ points for the 2Q, so we could potentially have the case where NEITHER side wants to simplify, potentially!
User avatar
hgm
Posts: 28393
Joined: Fri Mar 10, 2006 10:06 am
Location: Amsterdam
Full name: H G Muller

Re: Baffling test position

Post by hgm »

LiquidNitrogenOverclocker wrote:In the Knights vs. Queens test that was designed, the game play has been biased heavily towards the tactical evaluations. The deeper searching program will win.
I am not so sure about this. Deeper search only helps if you employ correct criteria for what to search for. Otherwise it will just be extremely smart in doing itself in. Stockfish does not search any less deep than my new engine, and yet is slaughtered by the latter when it has to play the Queens.

What seems the main factor that determines how easily the Knights win this, is the tendency of the engine to prematurely swap two Knights for a Queen. Being too eager for such swaps turns out to be a losing strategy. The best way to handle this position for white is to avoid any Q vs 2N trading, and use your numeric advantage in pieces to go for the opponent's Pawns. Once the engine playing the Knights realizes that Q for 2N is a bad trade, and it should better wait for Q+P vs 2N, I don't think it will need a very deep search to win.

I tried to test this in the following experiment: I give my new engine, prorammed to value N=500, Q=950, playing the Knights, a factor 5 time odds. So Stockfish gets 10 min, my engine only 2 min for 40 moves. Under these conditions Stockfish searches about 3 ply deeper than my engine. (Typical search depths 15 vs 12 ply.)

Yet Stockfish is totally crushed:

Code: Select all

[Event "Computer Chess Game"]
[Site "SCHAAK_PC"]
[Date "2010.03.22"]
[Round "-"]
[White "NewEng 0.16"]
[Black "Stockfish 1.6 JA"]
[Result "*"]
[TimeControl "40/600"]
[FEN "1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/NNNNKNNN w - - 0 1"]
[SetUp "1"]

{--------------
. q . q k . q .
p p p p p p p p
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
P P P P P P P P
N N N N K N N N
white to play
--------------}
1. Nhg3 {+3.72/12 8:01} c5 {+9.53/16 28} 2. e4 {+3.96/13 2.5} e6
{+9.49/15 21} 3. d3 {+3.82/13 2.7} d5 {+10.06/14 19} 4. Ng1e2
{+3.88/12 1.8} g6 {+10.46/15 20} 5. Nfd2 {+3.85/12 1.5} f5 {+10.14/16 55}
6. f4 {+3.88/12 1.5} fxe4 {+10.78/16 46} 7. dxe4 {+4.16/13 1.6} g5
{+10.78/15 19} 8. fxg5 {+4.53/13 3} Qdxg5 {+10.74/15 19} 9. Nab3
{+4.68/13 1.7} Qe7 {+10.74/14 24} 10. c4 {+4.50/13 3} dxc4 {+11.39/15 28}
11. Nxc4 {+4.76/13 1.3} b5 {+10.82/15 15} 12. Nce3 {+4.85/14 3} h5
{+11.07/14 14} 13. e5 {+4.94/13 4} Qgh7 {+10.62/15 40} 14. Nbc3
{+5.41/13 2.7} Qxe5 {+10.22/15 36} 15. Nxb5 {+5.52/13 2.0} h4 {+9.97/14 23}
16. Ngf1 {+5.65/13 2.3} c4 {+10.50/13 19} 17. N3d4 {+5.89/12 2.1} Q7c5
{+9.65/12 10} 18. Nd2 {+6.13/11 1.5} Kd7 {+8.80/12 7} 19. N4f3
{+6.93/12 1.0} Qeh5 {+7.71/13 19} 20. Nbc3 {+7.27/12 3} Qf8 {+7.11/14 24}
21. Ndxc4 {+7.42/11 1.7} Qhg8 {+6.74/13 17} 22. Nd3 {+8.05/13 2.7} h3
{+6.14/13 9} 23. Nef4 {+8.54/12 3} Qhh7 {+4.52/14 14} 24. Nde5+
{+8.76/12 2.2} Kc8 {+5.77/12 3} 25. Nxh3 {+8.92/12 1.6} Qb7 {+5.49/12 1.6}
26. Nhg5 {+9.26/13 2.0} Kb8 {+5.01/13 7} 27. Nce4 {+9.68/12 1.7} Ka8
{+4.32/13 7} 28. Ndc3 {+9.96/12 1.8} Qbe7 {+4.00/13 5} 29. g3
{+10.31/12 2.0} Qh6 {+2.54/14 9} 30. h3 {+10.20/12 2.9} Qb7 {+3.11/12 5}
31. h4 {+10.60/12 1.0} a5 {+1.17/12 4} 32. Nd7 {+11.01/12 2.0} Qd8
{-0.64/12 2.3} 33. Nec5 {+13.07/13 3} Qbc7 {-6.10/13 4} 34. Nfe5
{+14.23/13 2.9} a4 {-8.28/12 2.3} 35. N3g4 {+16.96/13 2.9} Qg6
{-9.61/10 1.5} 36. Nxg6 {+18.86/13 4} Qxg3+ {-11.27/10 1.0} 37. Nf2
{+18.72/15 10} a3 {-11.39/10} 38. Ke2 {+18.89/14 3} axb2 {-11.35/9 0.2} 39.
Ncb6+ {+18.72/13 3} Qxb6 {-11.15/8} 40. Nxb6+ {+18.76/13 2.4} Kb8
{-11.71/10 1.1} 41. Nc5e4 {+19.73/14 3}
*
For some reason the game stops here, without a proper result message. (I still have to debug this; I suspect that Stockfish resigned, but that due to a WinBoard problem this is not properly handled. The WinBoard clocks are stopped and reste to 10 & 2 min, so WB must think the game is over.) But is is clear that Stockfish is badly losing, its score having dropped from the initial +10 t0 -11, and two of its Queens exterminated at the expense of only a single Knight.

[d]1k6/8/1N2p1N1/6N1/4N2P/2N3q1/Pp2KN2/8 b - -
User avatar
hgm
Posts: 28393
Joined: Fri Mar 10, 2006 10:06 am
Location: Amsterdam
Full name: H G Muller

Re: Baffling test position

Post by hgm »

An even nicer one: This time it is only six Knights against three Queens. Again Stockfish, playing the Queens gets a time-odds advantage of a factor 5 (10 min vs 2 min/40moves for the Knights).

Initially, Stockfish evaluates itself at +13.5. So this must be a new World record: making Stockfish lose from a +13.5 position, facing time odds as well! :lol: :lol: :lol:

Code: Select all

[Event "Computer Chess Game"]
[Site "SCHAAK_PC"]
[Date "2010.03.22"]
[Round "-"]
[White "NewEng 0.16"]
[Black "Stockfish 1.6 JA"]
[Result "1-0"]
[TimeControl "40/600"]
[FEN "1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/NNN1KNNN w - - 0 1"]
[SetUp "1"]

{--------------
. q . q k . q .
p p p p p p p p
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
P P P P P P P P
N N N . K N N N
white to play
--------------}
1. Nab3 {-0.98/12 1.0} f5 {+13.57/16 30} 2. d4 {-0.68/12 1.3} b6
{+13.65/16 29} 3. e3 {-0.62/12 1.4} g5 {+14.02/15 25} 4. Nd3 {-0.38/12 1.6}
a5 {+14.18/16 51} 5. Nhg3 {-0.26/13 2.9} Qb7 {+14.30/15 21} 6. f3
{+0.06/14 4} c5 {+14.38/15 37} 7. dxc5 {+0.01/13 1.0} a4 {+14.34/15 16} 8.
N3d2 {-0.29/13 1.4} bxc5 {+14.94/16 22} 9. a3 {-0.24/13 2.9} f4
{+14.46/15 18} 10. N3e2 {-0.18/13 1.5} c4 {+14.66/15 17} 11. Nb4
{+0.07/14 2.7} Qa7 {+14.22/15 13} 12. e4 {+0.06/15 2.7} g4 {+14.30/15 16}
13. g3 {+0.62/13 1.5} fxg3 {+14.58/15 14} 14. hxg3 {+0.80/13 3} Qdb6
{+14.10/15 42} 15. Nbc3 {+0.68/12 1.5} Qf2+ {+14.02/12 12} 16. Kd1
{+0.48/13 1.7} Qg6 {+13.45/14 30} 17. Nbd5 {+1.22/12 1.9} Qb7
{+13.29/13 19} 18. Nfe3 {+0.67/12 2.4} gxf3 {+15.51/14 17} 19. Ngxf3
{+0.39/13 1.3} Qxb2 {+14.94/14 6} 20. Ndxc4 {+0.22/13 2.2} Qa1+
{+15.39/13 1.9} 21. Kd2 {-2.13/15 2.3} Kf8 {+15.19/14 18} 22. Nce5
{+1.90/13 5} Qa6 {+14.94/15 11} 23. Nxd7+ {+2.55/12 2.5} Kg8 {+14.46/16 13}
24. Nde5 {+2.08/13 4} Qxa3 {+14.42/13 6} 25. Nf5 {+2.49/13 4} Qa1
{+14.18/12 5} 26. Ndxe7+ {+2.43/12 1.8} Kf8 {+14.06/10} 27. N7c6
{+2.41/13 2.2} Qb7 {+13.45/12 6} 28. Ncd4 {+2.77/11 1.7} Qh1 {+8.12/13 13}
29. Nxa4 {+3.81/12 4} Qa6 {+10.86/14 19} 30. Nc3 {+3.79/12 2.2} Qff1
{+10.50/13 13} 31. g4 {+4.01/12 4} Qfa1 {+7.19/13 9} 32. Nd5 {+4.27/11 1.8}
Q1a4 {+10.38/11 7} 33. g5 {+4.47/12 2.1} Qa3 {+6.70/11 3} 34. c3
{+4.49/12 4} Qb2+ {+6.54/12 3} 35. Ke3 {+4.58/12 1.4} Qf1 {+5.73/13 5} 36.
c4 {+4.73/12 2.1} h6 {+5.09/10 1.9} 37. g6 {+8.09/12 3} Qbb7 {+1.21/11 3}
38. g7+ {+8.34/12 2.0} Kg8 {+0.92/11 1.7} 39. c5 {+9.21/13 4} Qb8
{+0.64/10 1.6} 40. c6 {+12.74/12 2.6} Qd8 {-8.20/10 2.7} 41. c7
{+15.78/13 3} Qda8 {-12.56/16 59} 42. Nde7+ {+16.82/13 3} Kh7 {-8.00/6} 43.
g8=Q+ {+16.83/13 1.0} Qxg8 {-8.20/6} 44. Nxg8 {+16.50/15 2.0} Kxg8
{-13.37/19 58} 45. c8=Q+ {+16.53/14 1.5} Qxc8 {-14.38/15} 46. Ne7+
{+17.02/15 2.1} Kf8 {-22.59/20 41} 47. Nxc8 {+17.23/15 6} Kg7
{-29.57/18 26} 48. Nf5+ {+17.39/14 1.9} Kg8 {-30.26/18 21} 49. Nxh6+
{+17.98/13 1.0} Kf8 {-77.05/22 48} 50. Nf5 {+18.01/14 2.3} Qa1
{-21.48/19 34} 51. Ncd6 {+18.05/15 4} Kg8 {-105.75/17 19} 52. Nec4
{+19.64/15 6} Qa7+ {-92.46/17 39} 53. Ned4 {+25.10/16 1.7} Qa1
{-105.75/16 12} 54. e5 {+25.20/15 2.4} Qc1+ {-101.72/16 12} 55. Ke4
{+25.51/16 1.0} Qb1+ {-99.78/14 2.3} 56. Kd5 {+25.90/17 2.4} Qb4
{-99.80/13 1.0} 57. e6 {+25.87/14 1.3} Kh7 {-99.84/12} 58. Ng5+
{+319.91/14 1.4} Kg6 {-109.92/14 11} 59. Nge4 {+319.92/15 1.7} Kh5
{-99.86/13 2.8} 60. Nf6+ {+319.93/16 3} Kg5 {-99.88/8} 61. Nde4+
{+319.94/17 1.9} Kf4 {-99.90/12} 62. Ne2+ {+319.95/19 1.8} Kxf5 {-99.92/6}
63. Ne3+ {+319.96/22 2.0} Kg6 {-99.94/6} 64. Nf4+ {+319.97/26 2.0} Kg7
{-99.96/6} 65. Nf5+ {+319.98/28 3} Kh8 {-99.98/6} 66. Ng6# {+319.99/31 2.0}
{White mates} 1-0
[d]7k/8/4PNN1/3K1N2/1q2N3/8/8/8 b - - 0 66
final position

I think I might revise my position: even facing only six Knights, the position is badly lost for three Queens. I cannot exclude that Stockfish is badly broken for this number of Knights, though, so I will have to do some tests with other engines as well.
User avatar
Eelco de Groot
Posts: 4673
Joined: Sun Mar 12, 2006 2:40 am
Full name:   Eelco de Groot

Re: Baffling test position

Post by Eelco de Groot »

hgm wrote:An even nicer one: This time it is only six Knights against three Queens. Again Stockfish, playing the Queens gets a time-odds advantage of a factor 5 (10 min vs 2 min/40moves for the Knights).

Initially, Stockfish evaluates itself at +13.5. So this must be a new World record: making Stockfish lose from a +13.5 position, facing time odds as well! :lol: :lol: :lol:

Code: Select all

[Event "Computer Chess Game"]
[Site "SCHAAK_PC"]
[Date "2010.03.22"]
[Round "-"]
[White "NewEng 0.16"]
[Black "Stockfish 1.6 JA"]
[Result "1-0"]
[TimeControl "40/600"]
[FEN "1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/NNN1KNNN w - - 0 1"]
[SetUp "1"]

{--------------
. q . q k . q .
p p p p p p p p
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
P P P P P P P P
N N N . K N N N
white to play
--------------}
1. Nab3 {-0.98/12 1.0} f5 {+13.57/16 30} 2. d4 {-0.68/12 1.3} b6
{+13.65/16 29} 3. e3 {-0.62/12 1.4} g5 {+14.02/15 25} 4. Nd3 {-0.38/12 1.6}
a5 {+14.18/16 51} 5. Nhg3 {-0.26/13 2.9} Qb7 {+14.30/15 21} 6. f3
{+0.06/14 4} c5 {+14.38/15 37} 7. dxc5 {+0.01/13 1.0} a4 {+14.34/15 16} 8.
N3d2 {-0.29/13 1.4} bxc5 {+14.94/16 22} 9. a3 {-0.24/13 2.9} f4
{+14.46/15 18} 10. N3e2 {-0.18/13 1.5} c4 {+14.66/15 17} 11. Nb4
{+0.07/14 2.7} Qa7 {+14.22/15 13} 12. e4 {+0.06/15 2.7} g4 {+14.30/15 16}
13. g3 {+0.62/13 1.5} fxg3 {+14.58/15 14} 14. hxg3 {+0.80/13 3} Qdb6
{+14.10/15 42} 15. Nbc3 {+0.68/12 1.5} Qf2+ {+14.02/12 12} 16. Kd1
{+0.48/13 1.7} Qg6 {+13.45/14 30} 17. Nbd5 {+1.22/12 1.9} Qb7
{+13.29/13 19} 18. Nfe3 {+0.67/12 2.4} gxf3 {+15.51/14 17} 19. Ngxf3
{+0.39/13 1.3} Qxb2 {+14.94/14 6} 20. Ndxc4 {+0.22/13 2.2} Qa1+
{+15.39/13 1.9} 21. Kd2 {-2.13/15 2.3} Kf8 {+15.19/14 18} 22. Nce5
{+1.90/13 5} Qa6 {+14.94/15 11} 23. Nxd7+ {+2.55/12 2.5} Kg8 {+14.46/16 13}
24. Nde5 {+2.08/13 4} Qxa3 {+14.42/13 6} 25. Nf5 {+2.49/13 4} Qa1
{+14.18/12 5} 26. Ndxe7+ {+2.43/12 1.8} Kf8 {+14.06/10} 27. N7c6
{+2.41/13 2.2} Qb7 {+13.45/12 6} 28. Ncd4 {+2.77/11 1.7} Qh1 {+8.12/13 13}
29. Nxa4 {+3.81/12 4} Qa6 {+10.86/14 19} 30. Nc3 {+3.79/12 2.2} Qff1
{+10.50/13 13} 31. g4 {+4.01/12 4} Qfa1 {+7.19/13 9} 32. Nd5 {+4.27/11 1.8}
Q1a4 {+10.38/11 7} 33. g5 {+4.47/12 2.1} Qa3 {+6.70/11 3} 34. c3
{+4.49/12 4} Qb2+ {+6.54/12 3} 35. Ke3 {+4.58/12 1.4} Qf1 {+5.73/13 5} 36.
c4 {+4.73/12 2.1} h6 {+5.09/10 1.9} 37. g6 {+8.09/12 3} Qbb7 {+1.21/11 3}
38. g7+ {+8.34/12 2.0} Kg8 {+0.92/11 1.7} 39. c5 {+9.21/13 4} Qb8
{+0.64/10 1.6} 40. c6 {+12.74/12 2.6} Qd8 {-8.20/10 2.7} 41. c7
{+15.78/13 3} Qda8 {-12.56/16 59} 42. Nde7+ {+16.82/13 3} Kh7 {-8.00/6} 43.
g8=Q+ {+16.83/13 1.0} Qxg8 {-8.20/6} 44. Nxg8 {+16.50/15 2.0} Kxg8
{-13.37/19 58} 45. c8=Q+ {+16.53/14 1.5} Qxc8 {-14.38/15} 46. Ne7+
{+17.02/15 2.1} Kf8 {-22.59/20 41} 47. Nxc8 {+17.23/15 6} Kg7
{-29.57/18 26} 48. Nf5+ {+17.39/14 1.9} Kg8 {-30.26/18 21} 49. Nxh6+
{+17.98/13 1.0} Kf8 {-77.05/22 48} 50. Nf5 {+18.01/14 2.3} Qa1
{-21.48/19 34} 51. Ncd6 {+18.05/15 4} Kg8 {-105.75/17 19} 52. Nec4
{+19.64/15 6} Qa7+ {-92.46/17 39} 53. Ned4 {+25.10/16 1.7} Qa1
{-105.75/16 12} 54. e5 {+25.20/15 2.4} Qc1+ {-101.72/16 12} 55. Ke4
{+25.51/16 1.0} Qb1+ {-99.78/14 2.3} 56. Kd5 {+25.90/17 2.4} Qb4
{-99.80/13 1.0} 57. e6 {+25.87/14 1.3} Kh7 {-99.84/12} 58. Ng5+
{+319.91/14 1.4} Kg6 {-109.92/14 11} 59. Nge4 {+319.92/15 1.7} Kh5
{-99.86/13 2.8} 60. Nf6+ {+319.93/16 3} Kg5 {-99.88/8} 61. Nde4+
{+319.94/17 1.9} Kf4 {-99.90/12} 62. Ne2+ {+319.95/19 1.8} Kxf5 {-99.92/6}
63. Ne3+ {+319.96/22 2.0} Kg6 {-99.94/6} 64. Nf4+ {+319.97/26 2.0} Kg7
{-99.96/6} 65. Nf5+ {+319.98/28 3} Kh8 {-99.98/6} 66. Ng6# {+319.99/31 2.0}
{White mates} 1-0
[d]7k/8/4PNN1/3K1N2/1q2N3/8/8/8 b - - 0 66
final position

I think I might revise my position: even facing only six Knights, the position is badly lost for three Queens. I cannot exclude that Stockfish is badly broken for this number of Knights, though, so I will have to do some tests with other engines as well.
Yes I think that may be true Harm, the material imbalance table is working against Stockfish here if it favours the two Knights for a Queen trade which it probably does in all possible cases. There are all other sorts of interactions computed for Knights vs Pawns, Knights vs. Queens etc. in the table and they all may be off if there is such a big number of Knights and/or Queens. But the table will probably always trade the two Knights for a Queen. I'm not sure if you should also correct here on the Black side for redundancy of major pieces, having three Queens, Larry Kaufman suggests in his paper in case you have more than one Rook or at least one Rook and Queen it may be favourable to trade off one of the majors, I'm not sure of the exact rule from Larry but the rule in Stockfish would then trade the Rook if possible. The question is if three Queens vs two would be less good then two against one, and trading Queens is advised. There is not yet such a Queen redundancy rule in Stockfish only in case there are also Rooks, and I'm not sure about it. But it does not seem an implausible rule because it would also help make the pawns for the stronger more valuable when major pieces disappear.

I could suggest some quick changes to the material imbalance table that maybe would make Stockfish play this game a little better. Just in case you'd want to compile a new version of it yourself Harm, or anyone else? But the rules are not yet tested. I'm for the moment assuming that 6 Knights would be roughly equivalent with three Queens and you should not trade Queens with 6 Knights. I had to introduce no less than a fourth order correction factor to make this jump between 4 Knights and 6 Knights large enough 8-) I could just have used a larger constant factor with maybe and a third degree correction that would be less extravagant, but for the moment I'll stick with this.

In Rainbow Serpent's material.cpp that now looks like this:

Code: Select all

// Polynomial material balance parameters
const Value RedundantQueenPenalty = Value(320);
const Value RedundantRookPenalty  = Value(550);
const Value RedundantKnightPenalty  = -Value(50); // NEW

Code: Select all

        // Redundancy of major pieces, formula based on Kaufman's paper
        // "The Evaluation of Material Imbalances in Chess"
        // http://mywebpages.comcast.net/danheisman/Articles/evaluation_of_material_imbalance.htm
        if (pieceCount[c][ROOK] >= 1)
            matValue -= sign * ((pieceCount[c][ROOK] - 1) * RedundantRookPenalty + pieceCount[c][QUEEN] * RedundantQueenPenalty);
        else if (pieceCount[c][QUEEN] > 1)
            matValue -= sign * ((pieceCount[c][QUEEN] - 1) * Value(50) + (pieceCount[c][QUEEN] - 2) * Value(75)); // NEW
        if (pieceCount[c][KNIGHT] >= 4)
            matValue -= sign * ((pieceCount[c][KNIGHT]) * (pieceCount[c][KNIGHT] - 1) * (pieceCount[c][KNIGHT] - 2) * (pieceCount[c][KNIGHT] - 3) * RedundantKnightPenalty); // NEW 4th-degree correction

        them = opposite_color(c);
        v = 0;

        // Second-degree polynomial material imbalance by Tord Romstad
        //
        // We use NO_PIECE_TYPE as a place holder for the bishop pair "extended piece",
        // this allow us to be more flexible in defining bishop pair bonuses.

The correction factor with different number of Knights goes up very steeply and I think just enough between 4 and 6 Knights

N=7 840 * RedundantKnightPenalty
N=6 360 * RedundantKnightPenalty
N=5 120 * RedundantKnightPenalty
N=4 24 * RedundantKnightPenalty
N=3 0


With the above code the position with seven Knights is now favoured heavily for the Knight side :) I did not yet check the 6 Knights case, from the game Stockfish against New Eng.

[d]1q1qk1q1/pppppppp/8/8/8/8/PPPPPPPP/NNNNKNNN w - - 0 1

Code: Select all

00:00:00.2	3,87	1	28	Nhg3 
00:00:00.2	3,79	2	83	Nab3 c6 
00:00:00.2	4,36	3	10501	Nab3 c6 Nhg3 d5 Nd4 
00:00:00.2	4,44	4	13968	Nab3 c6 Nhg3 d5 Nd4 
00:00:00.3	4,84	5	60121	Nab3 c6 Nhg3 Qd6 Nf3 
00:00:01.0	4,84	6	495382	Nab3 c6 Nhg3 Qd6 Nf3 Qb6 Nbc3 
00:00:01.9	4,44	7	971729	Nab3 c5 Nhg3 h5 Nxc5 Qdc7 N5d3 h4 Ne4 Qxc2 Nbc3 f5 Ng5 
00:00:03.4	4,48	8	1727171	Nab3 c5 Nhg3 h5 Ne4 Qh7 Nexc5 Qxc2 Na3 Qf5 
00:00:08.0	3,91	9	4152502	Nab3 c5 Nhg3 h5 Ne4 b6 Na1 d5 Neg3 Kd7 Nab3 
00:00:11.6	4,00	9	6039176	Nbc3 d5 e3 c5 Nhg3 e5 Nge2 f5 Nf3 
00:00:15.3	4,04	9	7892403	Nhg3 c5 e3 f5 Nbc3 f4 Nh5 fxe3 Ndxe3 b5 
00:00:27.6	4,20	10	14276601	Nhg3 c5 e3 Qbc7 Nf3 g5 d3 g4 Nfd2 b5 e4 c4 Nfe3 f6 Na3 cxd3 Nxd3 Qxa2 
00:01:03.3	4,24	11	32657821	Nhg3 c5 e3 e6 Nf3 g5 Nbc3 d5 Ng1 Kf8 Nge2 d4 Ne4 dxe3 Ndxe3 c4 
00:01:24.0	4,16	12	43261422	Nhg3 c5 e3 f5 Nbc3 d5 Nce2 d4 Nf3 dxe3 Nxf5 exf2+ Nxf2 g6 N5e3 Qf7 Nfg3 
00:04:54.4	4,08	13	152023431	Nhg3 c5 Nf3 g5 h3 d5 Na3 Qg6 e3 h5 Nd3 g4 Nh4 
00:07:50.1	4,08	14	241856532	Nhg3 c5 Nf3 g5 h3 d5 Nbc3 b5 e3 f5 Nxf5 b4 Nb1 g4 hxg4 Qxg4 N5g3 Qb5 Nce2 Kf8 
00:20:08.9	4,16	15	624518407	Nhg3 d5 Nab3 e5 e3 c5 Nxc5 h5 N5b3 Qh7 Na3 Kf8 N3e2 b5 d3 b4 Nb1 Qf6 
00:31:19.2	4,16	16	963669045	Nhg3 d5 Nd3 h5 Nxh5 c5 Nhg3 Qb6 Nf3 g5 Ng1 d4 a3 Qc6 Nf3 
00:55:05.0	3,96	17	1698877623	Nhg3 c5 d3 d5 Nbd2 Qf8 Nab3 f5 Ngf3 f4 Nh1 c4 dxc4 dxc4 Nxc4 Qbc7 
[Engine: Rainbow Serpent 1.6.3s(dc) Build 221 (Athlon 2009 MHz, 128 MB)
by Tord Romstad, Marco Costalba, Joona Kiiski Modifications: Dann Corbit]
For some reason the game stops here, without a proper result message. (I still have to debug this; I suspect that Stockfish resigned, but that due to a WinBoard problem this is not properly handled. The WinBoard clocks are stopped and reste to 10 & 2 min, so WB must think the game is over.) But is is clear that Stockfish is badly losing, its score having dropped from the initial +10 t0 -11, and two of its Queens exterminated at the expense of only a single Knight.
Stockfish losing on time may be built in and not a Winboard bug Harm, under certain circumstances Stockfish just lets the clock run out, rather than resign (is it possible with UCI to resign?). I think it is a rather annoying tactic but maybe designed to annoy the Rybka clones 8-)

Eelco
Debugging is twice as hard as writing the code in the first
place. Therefore, if you write the code as cleverly as possible, you
are, by definition, not smart enough to debug it.
-- Brian W. Kernighan