Discussion of computer chess matches and engine tournaments.
Moderator: Ras
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Sun May 16, 2021 3:25 pm
Komodo Dragon : Knight Odds Handicap Match
http://rebel13.nl/pgn4web-3.05/live-test.html
Code: Select all
. Tournament : Gauntlet
. Time Control : 40 moves in 10 minutes repeating
. Games : 7 games each engine
. Opponents : elo range 2534-2699
1. Benjamin 2699
2. Gaviota 0.85.1 2698
3. Fruit 2.1 2695
4. Fridolin 3.10 2683
5. Velvet 1.1.0 2685
6. Arasan 14.1 2652
7. Supernova 2.3 2652
8. Galjoen 0.41.1 ~2589
9. Maverick 1.5 2537
10 Marvin 2.0.0 2534
. Software : Arena 3.0
. Hash table : Default setting engines
. Openings : dragon.epd
. rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/R1BQKBNR w KQkq - c0 "-wnb1";
. rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKB1R w KQkq - c0 "-wng1";
Komodo Dragon II will follow later.
90% of coding is debugging, the other 10% is writing bugs.
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Sun May 16, 2021 6:40 pm
-Nc3 finished : 3.5 / 10
Excellent result for Komodo Dragon.
-Nf3 running now.
One impressive game by the Dragon, strangling the opponent in Nimzowitch style.
It starts with 16.Rxc6!
[pgn][Event "Dragon-handicap"]
[Site "Deventer"]
[Date "2021.05.16"]
[Round "1"]
[White "Komodo Dragon"]
[Black "Benjamin"]
[Result "1-0"]
[BlackElo "2200"]
[Time "15:26:34"]
[WhiteElo "2200"]
[TimeControl "40/600:40/600:40/600"]
[SetUp "1"]
[FEN "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/R1BQKBNR w KQkq - 0 1"]
[Termination "normal"]
[PlyCount "104"]
[WhiteType "program"]
[BlackType "program"]
[Comment "c0 \"-wnb1\";"]
1. Nf3 {-4.09/23 21} Nf6 {+3.10/14 14} 2. g3 {-4.06/23 24} d5 {+3.24/14 9}
3. Bg2 {-4.21/20 6} Nc6 {+3.36/15 12} 4. d4 {-4.20/22 12} Bf5 {+3.37/15 18}
5. O-O {-4.18/25 19} e6 {+3.47/15 15} 6. Nh4 {-4.14/20 6} Bg4 {+3.43/14 11}
7. c4 {-4.11/22 15} dxc4 {+3.79/16 21} 8. h3 {-4.21/23 17} Bh5 {+3.77/16
12} 9. Bg5 {-4.41/24 16} Qd7 {+3.89/15 13} 10. Rc1 {-4.13/21 9} Rd8
{+3.92/16 9} 11. g4 {-4.20/21 6} Bg6 {+4.29/16 11} 12. Rxc4 {-4.30/22 19}
Nxd4 {+4.04/17 15} 13. Nxg6 {-3.81/22 7} hxg6 {+3.68/14 3} 14. e3 {-3.76/23
13} Nc6 {+3.87/17 14} 15. Qc1 {-3.86/22 8} Qd3 {+3.99/15 13} 16. Rxc6
{-2.91/24 13} bxc6 {+3.64/13 1} 17. Bxc6+ {-3.05/23 12} Ke7 {+2.90/17 13}
18. Qc5+ {-2.71/25 22} Qd6 {+4.62/17 13} 19. Qb5 {-3.18/24 7} a6 {+4.76/17
14} 20. Qb7 {-2.68/24 8} Rxh3 {+4.74/16 16} 21. Rc1 {-2.85/24 9} Rh8
{+4.68/15 28} 22. Kg2 {-2.86/23 7} a5 {+5.43/15 19} 23. Qa7 {-2.85/24 34}
a4 {+5.35/14 12} 24. Qa5 {-1.41/24 18} Rh7 {+3.84/14 12} 25. e4 {0.00/27
13} Rc8 {+1.41/14 11} 26. e5 {0.00/25 13} Qd4 {+1.44/16 11} 27. Bf3
{+0.09/28 15} Ke8 {+1.10/15 13} 28. Be3 {0.00/26 10} Qd8 {+0.20/15 24} 29.
Bc6+ {+3.68/25 21} Nd7 {+0.04/17 12} 30. Qxa4 {+4.01/25 9} Qe7 {0.00/17 17}
31. Rd1 {+3.77/29 14} Rd8 {0.00/21 16} 32. f3 {+4.00/28 12} Rh8 {0.00/18
11} 33. Bxd7+ {+8.03/27 12} Qxd7 {-1.94/15 2} 34. Rxd7 {+8.49/30 13} Rxd7
{-3.46/18 29} 35. Qa8+ {+8.98/29 17} Rd8 {-4.08/18 34} 36. Qc6+ {+8.96/31
16} Rd7 {-4.13/17 13} 37. a4 {+9.29/30 19} Bb4 {-4.33/18 20} 38. a5
{+9.38/29 10} Ke7 {-4.40/18 17} 39. a6 {+9.41/26 12} f6 {-5.01/17 9} 40.
Qe4 {+9.74/23 12} Kf7 {-4.69/16 11} 41. Qxb4 {+10.08/26 16} c5 {-5.49/15
18} 42. Qxc5 {+12.16/26 39} g5 {-7.85/16 10} 43. exf6 {+13.21/25 19} gxf6
{-7.18/15 7} 44. Qc6 {+13.95/24 28} Rdd8 {-8.12/17 26} 45. a7 {+14.26/23
16} Rc8 {-7.84/16 12} 46. Qb7+ {+15.37/24 14} Kg6 {-7.84/16 13} 47. Qe4+
{+15.65/23 17} Kf7 {-7.83/17 10} 48. b4 {+16.39/24 14} Rc7 {-9.77/17 10}
49. b5 {+16.63/22 61} Rc3 {-10.03/19 33} 50. b6 {+M16/17 5} Ra3 {-16.99/17
13} 51. b7 {+M12/18 1} Rb3 {-20.94/17 14} 52. a8=Q {+M8/21 1} Rb2+ {-M8/14
10 Black resigns} 1-0
[/pgn]
90% of coding is debugging, the other 10% is writing bugs.
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Sun May 16, 2021 10:07 pm
-Ng1 run : 4-6, 40%.
Again, a lot better than SF13, 0%.
PGN at -
http://rebel13.nl/dump/dragon-1-handicap.pgn
Will repeat with Dragon 2, thereafter SF13 with this engine pool.
90% of coding is debugging, the other 10% is writing bugs.
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Mon May 17, 2021 9:37 am
-Nb1 : 3.5 - 6.5, 35% for Dragon II.
Code: Select all
Name Gms : Win : Draw : Lose : Pts : S-B
Komodo-dragon-2 10 : 3+ : 1= : 6- : 3.5 : 0.25
Arasan 14.1 1 : 1+ : 0= : 0- : 1.0 : 3.50
Benjamin 1 : 1+ : 0= : 0- : 1.0 : 3.50
Galjoen 0.41.1 1 : 1+ : 0= : 0- : 1.0 : 3.50
Gaviota-0.85.1 1 : 1+ : 0= : 0- : 1.0 : 3.50
Marvin 2.0.0 1 : 1+ : 0= : 0- : 1.0 : 3.50
Supernova 2.3 1 : 1+ : 0= : 0- : 1.0 : 3.50
Maverick 1.5 1 : 0+ : 1= : 0- : 0.5 : 1.75
Fridolin 3.10 1 : 0+ : 0= : 1- : 0.0 : 0.00
Fruit 2.1 1 : 0+ : 0= : 1- : 0.0 : 0.00
Velvet 1.1.0 1 : 0+ : 0= : 1- : 0.0 : 0.00
Games played 10, games to go 0
Tournament finished
-Ng1 now...
90% of coding is debugging, the other 10% is writing bugs.
lkaufman
Posts: 6284 Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman
Post
by lkaufman » Mon May 17, 2021 4:26 pm
Rebel wrote: ↑ Mon May 17, 2021 9:37 am
-Nb1 : 3.5 - 6.5, 35% for Dragon II.
Code: Select all
Name Gms : Win : Draw : Lose : Pts : S-B
Komodo-dragon-2 10 : 3+ : 1= : 6- : 3.5 : 0.25
Arasan 14.1 1 : 1+ : 0= : 0- : 1.0 : 3.50
Benjamin 1 : 1+ : 0= : 0- : 1.0 : 3.50
Galjoen 0.41.1 1 : 1+ : 0= : 0- : 1.0 : 3.50
Gaviota-0.85.1 1 : 1+ : 0= : 0- : 1.0 : 3.50
Marvin 2.0.0 1 : 1+ : 0= : 0- : 1.0 : 3.50
Supernova 2.3 1 : 1+ : 0= : 0- : 1.0 : 3.50
Maverick 1.5 1 : 0+ : 1= : 0- : 0.5 : 1.75
Fridolin 3.10 1 : 0+ : 0= : 1- : 0.0 : 0.00
Fruit 2.1 1 : 0+ : 0= : 1- : 0.0 : 0.00
Velvet 1.1.0 1 : 0+ : 0= : 1- : 0.0 : 0.00
Games played 10, games to go 0
Tournament finished
-Ng1 now...
I finished my own knight odds blitz test, in which Dragon 2 and Stockfish 13 (each with Contempt 100) give knight odds (using the middle 200 positions from ChrisW opening set for the odds) at 2' + 1" to four opponents: Benjamin, Wasp 1.01, Fruit 2.2.1, and Naum 4. All engines one thread, all matches 200 games with the same openings. CCRL blitz ratings used for stats. Dragon 2 performed about a hundred elo better than SF 13 against the first three opponents, while SF 13 performed over a hundred elo better against Naum 4. I don't know the reason for this. Overall Dragon 2 performance rating was 2762, SF 13 was 2731. Of course you will get significantly lower performance ratings at your much longer time control since the odds-giver needs serious mistakes by the opponent to win or draw. Whether Dragon will outperform SF in your test, and if so by more or less than my 31 elo margin, will be interesting. Regarding the performance of the four engines receiving the handicap, Benjamin was outstanding, performing about a class (200 elo) better relative to its rating than Wasp and Fruit did, while Naum performed over a class worse relative to its rating than Wasp and Fruit did. Note that the average rating of Dragon 2 and Stockfish 13 is 3610, so this test gives a value of knight odds of 864 elo, but this is using the Bayes-Elo contracted ratings of CCRL. With normal Elo (Ordo), I think that the calculation would have come out right about my 1000 estimate for knight odds. I redid the math using CEGT ratings, which do use Ordo, and I got that knight odds = 984 elo!
Komodo rules!
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Mon May 17, 2021 5:42 pm
- Ng1 : 2-8, 20%, making it 5.5 / 20.
PGN -
http://rebel13.nl/dump/dragon-2-handicap.pgn
Will run SF13 and thereafter Lc0
Current standing
world championship Knight odds
Code: Select all
Komodo Dragon 1 7.5 points
Komodo Dragon 2 5.5 points
Stockfish 13 ..........
Lc0 ..........
90% of coding is debugging, the other 10% is writing bugs.
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Tue May 18, 2021 8:00 am
Knight odds -Nb1 and Ng1
Code: Select all
Engine Points Games
1. Komodo Dragon 1 7.5 20
2. Stockfish 13 6.0 20
3. Komodo Dragon 2 5.5 20
4. Lc0 ... 20
Lc0 running now...
http://rebel13.nl/pgn4web-3.05/live-test.html
90% of coding is debugging, the other 10% is writing bugs.
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Tue May 18, 2021 8:28 am
[pgn][Event "Lc0-handicap-nb1"]
[Site "Deventer"]
[Date "2021.05.18"]
[Round "1"]
[White "Lc0 v27"]
[Black "Fruit 2.1"]
[Result "0-1"]
[BlackElo "2200"]
[Time "08:18:49"]
[WhiteElo "2200"]
[TimeControl "40/600:40/600:40/600"]
[SetUp "1"]
[FEN "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/R1BQKBNR w KQkq - 0 1"]
[Termination "adjudication"]
[PlyCount "17"]
[WhiteType "program"]
[BlackType "program"]
[Comment "c0 \"-wnb1\";"]
1. a4 {-5.40/7 53} Nc6 {+3.13/15 18} 2. Ra3 {-4.06/6 19} e5 {+3.99/14 21}
3. Nh3 {-6.96/6 14} Bxa3 {+5.93/14 10} 4. a5 {-9.52/6 15} Be7 {+8.75/14 13}
5. Ng5 {-8.07/5 14} Bxg5 {+11.48/13 8} 6. a6 {-16.10/6 12} b5 {+11.65/14
12} 7. d4 {-16.77/5 13} exd4 {+12.50/15 16} 8. Bxg5 {-30.59/5 12} Qxg5
{+13.01/12 10} 9. Qd2 {-39.71/5 11 Arena Adjudication} 0-1
[/pgn]
Laugh or cry ?
Untrained pattern?
Will try a 384 net.
* Edit - makes nornal moves now.
90% of coding is debugging, the other 10% is writing bugs.
Rebel
Posts: 7515 Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder
Post
by Rebel » Tue May 18, 2021 9:27 am
Given up on Lc0. While Arena dictates a -99.99 score for resignation the engine apparently resigns by itself.
90% of coding is debugging, the other 10% is writing bugs.
lkaufman
Posts: 6284 Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman
Post
by lkaufman » Tue May 18, 2021 4:11 pm
Rebel wrote: ↑ Tue May 18, 2021 9:27 am
Given up on Lc0. While Arena dictates a -99.99 score for resignation the engine apparently resigns by itself.
That's strange; I found that recent Lc0 (largest net) was quite strong giving knight odds, either to me or to engines. I didn't see stupid moves. But I don't use Arena, perhaps there is some hidden problem there?
Komodo rules!