I doubt if top programs can draw one game in ten thousand at knight odds.lkaufman wrote: ↑Wed May 12, 2021 8:08 pmAsking for 100% wins is a bit unreasonable in my opinion. Even Dragon vs Dragon or Stockfish vs Stockfish might draw one game in ten thousand or so at knight odds due to failure to recognize obscure fortress draws for example, and who has computer time to devote to play 10,000 Rapid games like this? For me the interesting question is the point at which the weaker engine crosses 50% outright wins, meaning it would win a match at knight odds with Armageddon scoring of draws counting as wins for the odds-giver. Benjamin is already above that point in Rapid games with Dragon 2 on one thread, it's about even that way if Dragon uses four threads.Uri Blass wrote: ↑Wed May 12, 2021 7:50 pm Benjamin still did not score 100% against Dragon and the question is what is the minimal level that can score 100%
when it play against top programs.
Rybka2.3.2a is a candidate and in my test it won against latest stockfish that for some reason allowed trading pieces(maybe because it calculated that other options are worse so I do not know if it was a mistake)
This time I gave stockfish 7 cores and contempt 100 against 1 core of Rybka.
[pgn][Event "?"]
[Site "?"]
[Date "2021.05.12"]
[Round "?"]
[White "stockfish_21051119_x64_avx2"]
[Black "Rybkav2.3.2a.mp.x64"]
[Result "0-1"]
[FEN "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKB1R w KQkq - 0 1"]
[GameDuration "00:40:10"]
[GameEndTime "2021-05-12T20:34:12.887 שעון קיץ ירושלים"]
[GameStartTime "2021-05-12T19:54:02.631 שעון קיץ ירושלים"]
[PlyCount "136"]
[SetUp "1"]
[TimeControl "40/780"]
1. d4 {-4.95/35 46s} Nf6 {+1.82/17 29s} 2. g3 {-4.92/30 9.6s} d6 {+1.82/16 19s}
3. Bg2 {-4.62/29 8.0s} g6 {+1.85/17 18s} 4. e4 {-4.49/30 28s} Bg7 {+1.85/16 26s}
5. O-O {-4.81/33 9.6s} O-O {+1.91/16 27s} 6. e5 {-4.87/34 48s}
dxe5 {+2.31/18 20s} 7. dxe5 {-5.00/31 12s} Ng4 {+2.36/18 7.1s}
8. f4 {-4.87/29 7.2s} Nc6 {+2.13/17 28s} 9. Na3 {-4.94/35 78s}
Be6 {+2.42/16 6.7s} 10. h3 {-5.09/32 29s} Nh6 {+2.42/15 2.8s}
11. c3 {-5.17/34 40s} Qxd1 {+2.45/15 8.1s} 12. Rxd1 {-5.05/29 4.7s}
a6 {+2.54/15 3.2s} 13. Bd2 {-4.65/33 36s} Rfd8 {+2.62/15 10s}
14. Be3 {-4.87/31 6.8s} Rxd1+ {+2.75/15 16s} 15. Rxd1 {-4.71/32 6.3s}
f6 {+2.89/16 4.3s} 16. exf6 {-4.94/34 11s} exf6 {+2.96/17 12s}
17. b3 {-4.94/36 48s} Nf5 {+3.15/17 3.3s} 18. Bf2 {-4.69/32 7.3s}
Rd8 {+3.17/16 4.1s} 19. Rxd8+ {-4.87/32 9.2s} Nxd8 {+2.91/15 0s}
20. Kh2 {-5.09/36 67s} Nd6 {+3.46/18 7.5s} 21. c4 {-5.04/30 10s}
Bf5 {+3.58/17 20s} 22. c5 {-5.17/30 9.7s} Ne4 {+3.70/18 3.0s}
23. g4 {-4.67/31 7.0s} Nxf2 {+4.65/17 2.9s} 24. gxf5 {-4.86/33 11s}
gxf5 {+4.92/18 16s} 25. Nc2 {-5.43/38 55s} Ne4 {+5.01/17 12s}
26. Nd4 {-5.66/38 42s} Bh6 {+4.48/15 0s} 27. Bf1 {-5.61/39 32s}
Bxf4+ {+5.36/15 11s} 28. Kg2 {-5.94/30 6.9s} Nxc5 {+5.46/16 17s}
29. b4 {-5.94/36 28s} Nce6 {+5.69/15 26s} 30. Bc4 {-6.12/30 3.2s}
Kf7 {+4.62/13 0s} 31. Nxf5 {-6.03/30 3.1s} Kg6 {+5.90/15 13s}
32. Bd3 {-6.29/32 22s} Bd2 {+6.03/16 22s} 33. Bb1 {-6.40/30 11s}
Bxb4 {+6.45/13 17s} 34. Ne3+ {-6.47/32 12s} Kg7 {+6.63/15 44s}
35. Nf5+ {-6.55/31 1.5s} Kf7 {+6.63/14 26s} 36. Kf3 {-6.53/30 8.2s}
Nc6 {+6.68/13 20s} 37. h4 {-6.39/27 3.7s} Ne5+ {+6.85/14 25s}
38. Ke3 {-6.40/23 0.58s} Bc5+ {+6.99/16 35s} 39. Kd2 {-6.77/28 1.9s}
b5 {+7.01/16 28s} 40. a4 {-6.54/19 0.39s} bxa4 {+7.70/13 20s}
41. h5 {-10.34/38 78s} a3 {+7.94/15 16s} 42. Ba2 {-10.48/39 12s}
Nf3+ {+8.28/13 4.9s} 43. Kc3 {-12.23/40 115s} Nd4 {+8.82/17 37s}
44. Nh6+ {-13.40/40 127s} Kg7 {+8.62/13 3.8s} 45. Ng4 {-13.54/29 7.7s}
Nf4 {+8.82/17 41s} 46. h6+ {-12.24/33 28s} Kg6 {+9.11/17 26s}
47. Kc4 {-13.37/32 21s} Ba7 {+9.08/15 13s} 48. Kc3 {-12.69/36 31s}
Nf3 {+9.99/14 9.8s} 49. Kb4 {-21.85/31 87s} f5 {+11.08/16 9.7s}
50. Nh2 {-24.32/30 67s} Nxh2 {+11.65/14 7.9s} 51. Bb3 {-29.63/28 52s}
Nh5 {+12.53/14 14s} 52. Kxa3 {-73.20/35 22s} f4 {+12.50/13 7.6s}
53. Kb4 {-113.70/38 13s} f3 {+13.12/14 14s} 54. Bc4 {-M32/69 8.4s}
f2 {+12.93/11 3.9s} 55. Be2 {-M30/59 2.7s} c5+ {+16.50/14 25s}
56. Kc3 {-M26/64 2.8s} f1=Q {+19.80/14 10s} 57. Bd3+ {-M26/58 3.0s}
Qxd3+ {+14.82/12 41s} 58. Kxd3 {-M24/59 2.9s} Nf4+ {+22.94/14 28s}
59. Kd2 {-M22/58 3.4s} Nf1+ {+M35/13 6.4s} 60. Kc2 {-M18/65 7.0s}
c4 {+M25/12 1.1s} 61. Kb1 {-M16/67 2.2s} Bd4 {+M19/7 0.028s}
62. Kc2 {-M14/82 3.2s} Ne3+ {+M17/4 0.004s} 63. Kc1 {-M12/132 3.1s}
c3 {+M11/3 0s} 64. Kb1 {-M10/212 0.50s} c2+ {+M9/3 0.001s}
65. Kc1 {-M8/245 0.049s} Ne2+ {+M7/3 0.001s} 66. Kd2 {-M6/245 0.008s}
c1=Q+ {+M5/3 0s} 67. Kxe2 {-M4/245 0.006s} Qf1+ {+M3/3 0.001s}
68. Kd2 {-M2/245 0.003s} Qd1# {+M1/3 0.001s, Black mates} 0-1
[/pgn]
Failing to recognize some fortress is not enough and the engine need also not to find something better because if the engine evaluate the fortress as +3 and evaluate another line as more than +3 then it is not going to fall into the fortress but you practically do not need to play 10,000 rapid games and even a score like 100-0 may be interesting.