Komodo-Dragon-2 vs Stockfish 14 at knight odss

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

Uri Blass
Posts: 11150
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by Uri Blass »

Rebel wrote: Fri Sep 24, 2021 10:30 am Final results Komodo vs Stockfish

Code: Select all

Knight odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  55.6    73.8    89.9
Stockfish 14     28.5    47.2    70.1

Bishop odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  47.1    67.6    81.8
Stockfish 14     14.5    31.3    51.2

Rook odds        Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  25.5    52.9    73.2
Stockfish 14     18.0    41.1    64.0

Queen odds       Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  1.0%    3.6%    9.4%
Stockfish 14     0.0%    0.2%    0.5%
Komodo wins at every odds. I am pretty sure Komodo playing GM's at odds has resulted in program changes when down in material, Larry might comment on that one :D

Details at - https://prodeo.actieforum.com/t543-knig ... ds-results

Download games - http://rebel13.nl/odds.zip

------------------------------------

What's next ?

I see 5 options.

1. Play queen-odds matches 2000 / 1500 /1000 elo until finally Komodo and/or Stockfish start to win, >50%

2. Look above, instead of 2700 engines test also 2800, 2900, 3000 elo pools.

3. Invite a third engine and repeat the 2700, 2500, 2300 elo cycle. Suggest an engine that does better than SF14, but make a reasonable case for it.

4. Suggest something interesting else.

5. Stop, it's enough.

Pick your preference.
I think that invite a third engine is best.
I believe many engines are going to do better than stockfish14 at least with queen odds including Wasp and RubiChess
It may be interesting also to test Komodo Dragon2.5(with the contempt setting Larry suggest) and Stockfish13 (with maximal possible contempt).
Uri Blass
Posts: 11150
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by Uri Blass »

I think that for a different version of stockfish is may be better to test because I am not sure stockfish13 is best for queen odds and I found that the evaluation of stockfish13 is also stupid in this case so it is better to test different engines.

I suggest to test the strongest non stockfish Dragon engine that shows improvement when I search deeper with queen odds.
When I use Wasp I can see that the evaluation at depth d+10 is always worse than the evaluation at depth d and this is the reason I like it.
I dislike engines that show no improvement in the evaluation regardless of search depths so it is better not to choose them as candidate for queen odds because they do not know how black can improves the position and seem to understand nothing.



C:\Users\àåøé\Downloads\Adams\uri.epd Position -1 / 1
FEN: rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNB1KBNR w KQkq - 0 1

Wasp450-x64-modern:
setting search and eval params...
2/3 00:00 43 0 -11.21 Ng1-f3 d7-d6
3/4 00:00 184 0 -11.01 Ng1-f3 e7-e6 e2-e3
4/6 00:00 448 0 -11.24 Ng1-f3 e7-e6 e2-e4 Nb8-c6
5/7 00:00 1k 0 -10.96 Ng1-f3 e7-e6 Nb1-c3 Nb8-c6 e2-e3
6/8 00:00 2k 0 -11.20 Ng1-f3 e7-e6 Nb1-c3 Ng8-f6 e2-e3 Nb8-c6
7/11 00:00 9k 0 -11.10 Ng1-f3 Ng8-f6 Nb1-c3 d7-d5 d2-d3 d5-d4 Nc3-e4 Nf6xe4 d3xe4
8/14 00:00 35k 0 -11.30 Ng1-f3 Nb8-c6 d2-d4 e7-e6 e2-e3 Ng8-f6 Nb1-c3 Nc6-b4
9/15 00:00 69k 3,799k -11.29 Ng1-f3 d7-d5 e2-e3 Nb8-c6 Bf1-b5 Bc8-d7 Nb1-c3 e7-e6 d2-d4
10/15 00:00 84k 4,633k -11.40 Ng1-f3 Nb8-c6 d2-d4 d7-d5 e2-e3 Ng8-f6 a2-a3 Bc8-f5 Bf1-b5 Bf5xc2
11/17 00:00 143k 7,852k -11.37 Ng1-f3 Nb8-c6 d2-d4 Ng8-f6 e2-e3 d7-d5 Bf1-b5 a7-a6 Bb5-d3 e7-e6 O-O
12/20 00:00 370k 6,140k -11.42 Ng1-f3 d7-d5 e2-e3 Ng8-f6 Nb1-c3 Nb8-c6 Bf1-b5 Bc8-d7 d2-d4 Nc6-b4 Bb5-d3 Nb4xd3+ c2xd3
13/20 00:00 578k 6,430k -11.41 Ng1-f3 d7-d5 d2-d4 Bc8-f5 c2-c3 e7-e6 Bc1-f4 h7-h6 h2-h3 Ng8-f6 e2-e3 Nb8-c6 Bf1-b5
14/22 00:00 1,026k 7,730k -11.44 Ng1-f3 d7-d5 d2-d4 Bc8-f5 c2-c3 e7-e6 Bc1-f4 h7-h6 h2-h3 Ng8-f6 e2-e3 Nb8-c6 Nb1-d2 Bf5-h7
15/22 00:00 1,966k 8,753k -11.54 Ng1-f3 Ng8-f6 d2-d3 Nb8-c6 e2-e4 e7-e5 Nb1-c3 d7-d5 Bc1-g5 Bf8-b4 O-O-O Bb4xc3 b2xc3 Qd8-d6 Bg5xf6 g7xf6 e4xd5 Qd6xd5
16/24 00:00 2,748k 9,152k -11.56 Ng1-f3 Ng8-f6 d2-d4 Nb8-c6 Nb1-c3 Nc6-b4 Ke1-d1 d7-d5 a2-a3 Nf6-g4 Bc1-e3 Nb4-c6 g2-g3 Ng4xe3+ f2xe3 Bc8-f5 Nf3-h4
17/26 00:00 5,095k 9,135k -11.58 Ng1-f3 Ng8-f6 d2-d4 Nb8-c6 e2-e3 d7-d6 Bc1-d2 e7-e5 d4xe5 d6xe5 Nb1-c3 e5-e4 Nf3-g5 Bc8-f5 O-O-O h7-h6 Bd2-e1
18/28 00:00 9,104k 9,668k -11.47 Ng1-f3 Nb8-c6 d2-d4 Ng8-f6 a2-a3 d7-d6 Nb1-c3 Bc8-f5 e2-e3 e7-e5 Bf1-b5 e5-e4 d4-d5 Nf6xd5 Nc3xd5 e4xf3 g2xf3 Bf5xc2
19/28 00:01 15,947k 9,866k -11.60 Ng1-f3 d7-d5 d2-d3 Nb8-c6 e2-e4 e7-e5 e4xd5 Nc6-d4 Nf3xd4 e5xd4 Bf1-e2 Ng8-f6 c2-c4 d4xc3/ep Nb1xc3 Nf6xd5 O-O c7-c6 Nc3-e4
20/30 00:03 35,994k 10,120k -11.68 Ng1-f3 d7-d5 d2-d3 Ng8-f6 Bc1-f4 c7-c5 h2-h3 e7-e6 Nb1-d2 Bf8-d6 Bf4xd6 Qd8xd6 e2-e3 O-O Bf1-e2 Nb8-c6 O-O b7-b6 c2-c4 h7-h6 c4xd5 e6xd5
21/32 00:05 53,827k 10,223k -11.70 Ng1-f3 d7-d5 c2-c3 Nb8-c6 d2-d4 Bc8-f5 Bc1-f4 e7-e6 e2-e3 Ng8-f6 h2-h3 Bf8-d6 Nb1-d2 Nf6-e4 Bf4xd6 c7xd6 Bf1-e2 O-O Ra1-c1 Ne4xd2 Nf3xd2
22/32 00:12 134,040k 10,364k -11.78 Ng1-f3 d7-d5 c2-c3 Nb8-c6 d2-d4 Bc8-f5 Bc1-g5 Ng8-f6 Nb1-d2 h7-h6 Bg5-h4 e7-e6 e2-e3 Bf8-d6 Bf1-e2 O-O O-O a7-a6 Bh4-g3 Bd6xg3 h2xg3 Nf6-e4
22/32 00:13 142,263k 10,358k -11.67 e2-e4 e7-e5 Ng1-f3 Ng8-f6 d2-d3 d7-d5 e4xd5 Nf6xd5 Nb1-c3 Nb8-c6 Bf1-e2 Nc6-d4 Be2-d1 Bf8-b4 Bc1-d2 O-O O-O f7-f6 a2-a3 Bb4-e7 Nf3xd4 Nd5xc3
23/33 00:17 182,124k 10,371k -11.70 e2-e4 e7-e5 Ng1-f3 Ng8-f6 d2-d3 d7-d5 Nb1-d2 Nb8-c6 c2-c3 h7-h6 h2-h3 Bc8-e6 Bf1-e2 Bf8-d6 O-O O-O e4xd5 Be6xd5 b2-b4 a7-a6 a2-a4 e5-e4 d3xe4 Nf6xe4 Nd2xe4 Bd5xe4
23/33 00:19 203,536k 10,374k -11.69 d2-d3 e7-e5 e2-e4 d7-d5 Ng1-f3 Ng8-f6 Nb1-d2 Nb8-c6 c2-c3 Bf8-d6 Bf1-e2 h7-h6 e4xd5 Nf6xd5 Nd2-c4 Bc8-e6 Nc4xd6+ c7xd6 O-O O-O h2-h3 b7-b5 a2-a4 a7-a6
24/34 00:22 233,341k 10,388k -11.72 d2-d3 e7-e5 e2-e4 d7-d5 Ng1-f3 Ng8-f6 Nb1-d2 Nb8-c6 c2-c3 Bf8-d6 Bf1-e2 h7-h6 e4xd5 Nf6xd5 Nd2-c4 Bc8-e6 Nc4xd6+ c7xd6 O-O O-O h2-h3 b7-b5 a2-a4 a7-a6 c3-c4 b5xc4 d3xc4
24/34 00:29 302,018k 10,226k -11.68 e2-e3 d7-d5 d2-d4 Nb8-c6 Ng1-f3 Bc8-f5 c2-c3 e7-e6 Bf1-e2 Ng8-f6 O-O Nf6-e4 Nb1-d2 Bf8-e7 Nd2xe4 Bf5xe4 a2-a4 O-O b2-b4 a7-a6 h2-h3 h7-h6 a4-a5 Be7-g5
25/37 00:37 373,638k 10,092k -11.75 e2-e3 d7-d5 d2-d4 Nb8-c6 Ng1-f3 Nc6-b4 Ke1-d1 Bc8-f5 Nf3-e1 Nb4-c6 Ne1-d3 e7-e6 Nb1-c3 Ng8-f6 f2-f3 h7-h5 h2-h3 Bf8-d6 a2-a3 Bd6-g3 Nd3-c5 Nf6-d7 Nc3-e2 Nd7xc5 Ne2xg3
25/37 00:41 419,115k 10,034k -11.67 e2-e4 e7-e5 Ng1-f3 Ng8-f6 d2-d3 Nb8-c6 Bf1-e2 d7-d5 e4xd5 Nf6xd5 O-O Bf8-c5 Nb1-c3 Nd5xc3 b2xc3 O-O Ra1-b1 Qd8-e7 Nf3-d2 f7-f6 Nd2-c4 a7-a6 Bc1-e3 Bc5xe3 Nc4xe3
26/37 00:47 471,179k 9,980k -11.74 e2-e4 e7-e5 Ng1-f3 Ng8-f6 d2-d3 Nb8-c6 c2-c3 d7-d5 Nb1-d2 Bf8-d6 Bf1-e2 h7-h6 O-O O-O h2-h3 Rf8-e8 Rf1-e1 Bc8-e6 Be2-d1 b7-b5 Bd1-c2 a7-a5 e4xd5 Nf6xd5 a2-a4 b5-b4
27/37 01:27 851,635k 9,717k -11.88 e2-e4 e7-e5 Ng1-f3 d7-d5 d2-d3 Nb8-c6 Bf1-e2 Ng8-f6 e4xd5 Nf6xd5 O-O Bf8-c5 Nb1-c3 O-O Nc3xd5 Qd8xd5 a2-a3 f7-f5 Bc1-d2 e5-e4 b2-b4 Bc5-d4 Nf3xd4 Nc6xd4 Be2-d1 e4xd3 c2xd3
27/39 02:23 1,377,366k 9,581k -11.83 Ng1-f3 d7-d5 c2-c3 Ng8-f6 d2-d3 Nb8-c6 Nb1-d2 Bc8-f5 Nf3-h4 Bf5-d7 Nh4-f3 e7-e6 g2-g3 e6-e5 e2-e4 d5xe4 d3xe4 Bf8-d6 Bf1-g2 O-O O-O h7-h6 Nd2-c4 a7-a6 Rf1-e1 Bd7-e6 Nc4xd6 c7xd6
Uri Blass
Posts: 11150
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by Uri Blass »

RubiChess2.2 is also a good candidate for queen odds because it does not show the stupid behaviour of stockfish and I see the evaluation goes down when I search deeper(I used 7 cores but I do not see with stockfish evaluations go down from my testing with the same conditions)

C:\Users\àåøé\Downloads\Adams\uri.epd Position 1 / 1
FEN: rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNB1KBNR w KQkq - 0 1

RubiChess-2.2_x86-64-modern:
1/2 00:00 21 0 -8.49 d2-d3
2/3 00:00 58 0 -8.76 d2-d3 Nb8-c6
3/4 00:00 214 0 -8.58 e2-e3 Ng8-f6 b2-b3
4/5 00:00 446 0 -8.80 e2-e3 Ng8-f6 b2-b3 Nb8-c6
5/6 00:00 1k 1,510k -9.20 b2-b3 Ng8-f6 d2-d3 Nb8-c6
6/7 00:00 8k 5,265k -9.22 g2-g3 d7-d6 d2-d3 Nb8-c6 Bf1-g2 e7-e6
7/7 00:00 9k 5,857k -9.22 g2-g3
8/8 00:00 9k 5,750k -9.38 g2-g3
9/9 00:00 12k 6,733k -9.43 g2-g3 Nb8-c6 d2-d3 d7-d6 Ng1-f3 e7-e5 Bf1-g2 Nc6-b4
10/10 00:00 20k 7,394k -9.47 g2-g3 Nb8-c6 d2-d3 d7-d6 Ng1-f3 e7-e5 Bf1-g2 Ng8-f6
11/11 00:00 75k 9,097k -9.73 Ng1-f3 e7-e6 d2-d3 Nb8-c6 g2-g3 d7-d5 Bc1-f4 Ng8-f6 Bf1-g2
12/14 00:00 183k 9,471k -10.03 Ng1-f3 Nb8-c6 d2-d4 Ng8-f6 c2-c3 d7-d5 g2-g3 Bc8-f5 Bf1-g2 e7-e6 O-O Bf8-e7
13/13 00:00 290k 9,635k -10.01 Ng1-f3 Nb8-c6 d2-d4 Ng8-f6 c2-c3 d7-d6 Nb1-d2 Bc8-f5 g2-g3 e7-e5 Bf1-g2
14/15 00:00 300k 9,562k -10.08 Ng1-f3 Nb8-c6 d2-d4 Ng8-f6 c2-c3 d7-d6 Nb1-d2 Bc8-f5 g2-g3 e7-e6 Bf1-g2 Bf8-e7 O-O Nf6-e4
15/19 00:00 933k 10,905k -10.29 Ng1-f3 Nb8-c6 d2-d4 d7-d5 g2-g3 Bc8-f5 c2-c3 e7-e6 Bf1-g2 Ng8-f6 Bc1-f4 Nf6-e4 O-O
16/16 00:00 955k 10,949k -10.16 Ng1-f3 Nb8-c6 d2-d4 d7-d5 g2-g3 Bc8-f5 c2-c3 Ng8-f6 Bf1-g2 Nf6-e4 Bc1-f4 e7-e6 O-O Bf8-d6
17/18 00:00 1,384k 11,336k -10.18 Ng1-f3 Nb8-c6 d2-d4 Ng8-f6 c2-c3 d7-d6 Nb1-d2 Bc8-f5 g2-g3 e7-e5 Bf1-g2 e5xd4 c3xd4 d6-d5 a2-a3
18/20 00:00 1,566k 11,419k -10.14 Ng1-f3 Nb8-c6 d2-d4 Ng8-f6 g2-g3 d7-d6 Bc1-e3 e7-e5 Nb1-c3 e5-e4 Nf3-h4 d6-d5 Bf1-g2 h7-h6 O-O Bf8-e7 a2-a3
19/25- 00:00 5,012k 12,044k -10.42 Ng1-f3 Nb8-c6
19/22 00:00 5,919k 12,112k -10.46 Ng1-f3 Nb8-c6 e2-e4 d7-d5 e4xd5 Qd8xd5 Nb1-c3 Qd5-e6+
20/20 00:00 6,481k 12,168k -10.45 Ng1-f3 Nb8-c6 e2-e4 d7-d5 e4xd5 Qd8xd5 Nb1-c3 Qd5-e6+ Bf1-e2 Nc6-b4 O-O Nb4xc2 Ra1-b1 g7-g6 Be2-d1 Qe6-f5 d2-d3 Nc2-b4 Bd1-a4+ c7-c6
21/25 00:00 10,039k 11,876k -10.48 Ng1-f3 Nb8-c6 e2-e4 d7-d5 e4xd5 Qd8xd5 Nb1-c3 Qd5-e6+ Bf1-e2 Nc6-b4 O-O Nb4xc2 Ra1-b1 g7-g6 Be2-d1 Qe6-f5 d2-d3 Nc2-b4 Bd1-a4+ Bc8-d7 Nf3-d4 Qf5xd3
22/27 00:01 16,514k 12,172k -10.53 Ng1-f3 d7-d5 g2-g3 Nb8-c6 d2-d4 Bc8-f5 c2-c3 e7-e6 Bf1-g2 Ng8-f6 Bc1-e3 Bf8-e7 O-O h7-h6 Nb1-d2 O-O Nd2-b3 b7-b6 Ra1-d1 Nf6-e4 Nf3-e5 Nc6xe5 d4xe5
23/26 00:01 17,560k 12,091k -10.52 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Bf1-g2 Ng8-f6 Nb1-d2 h7-h6 O-O Bf8-d6 Nd2-b3 O-O h2-h3 Nf6-e4 Bc1-e3 Bd6-e7 Nb3-c5 Ne4xc5
24/32 00:01 19,106k 12,114k -10.52 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Nf3-h4 Bf5-e4 Nh4-f3 h7-h6 Bf1-g2 Ng8-f6 O-O Bf8-d6 Nb1-d2 Be4-f5 Nd2-b3 O-O h2-h3 Nf6-e4 Bc1-e3
25/29 00:02 26,543k 11,986k -10.50 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Bc1-e3 Bf8-e7 Nb1-d2 h7-h6 Bf1-g2 Ng8-f6 O-O O-O Nf3-h4 Bf5-h7 Nh4-f3 Nf6-e4 Ra1-d1 a7-a5 Nd2xe4 Bh7xe4 a2-a4 Be4xf3 Bg2xf3
26/31 00:02 32,431k 11,914k -10.54 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 h7-h6 Bf1-g2 e7-e6 O-O Bf8-e7 Bc1-f4 Ng8-f6 Nb1-d2 Nf6-e4 Nf3-e1 Ne4xd2 Bf4xd2 O-O Ne1-d3 Bf5-e4 Nd3-c5
27/34+ 00:05 60,715k 11,858k -10.46 Ng1-f3
27/35- 00:07 83,553k 11,835k -10.62 Ng1-f3 Nb8-c6
27/30 00:07 85,133k 11,822k -10.62 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Bf1-g2 Ng8-f6 Nf3-h4 Bf5-e4 Nh4-f3 h7-h6 O-O Be4-f5 Bc1-e3 Bf8-d6 h2-h3 O-O Nb1-d2 Nf6-e4 Nd2-b3 Bd6-e7 Nb3-c5 Ne4xc5 d4xc5
28/29 00:07 85,673k 11,810k -10.61 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Bf1-g2 Ng8-f6 Nf3-h4 Bf5-e4 Nh4-f3 h7-h6 O-O Be4-f5 Bc1-e3 Bf8-d6 h2-h3 a7-a5 Nb1-d2 O-O a2-a3 Nf6-e4 Nd2xe4 Bf5xe4 Ra1-d1 e6-e5
29/30 00:07 88,068k 11,811k -10.61 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 h7-h6 Bf1-g2 e7-e6 a2-a4 Ng8-f6 O-O Bf8-e7 Bc1-e3 a7-a6 a4-a5 Qd8-d7 h2-h3 Nf6-e4 Nb1-d2 O-O-O Nd2-b3 Bf5-h7 Nb3-c5 Ne4xc5 d4xc5 e6-e5 Rf1-d1
30/30 00:07 94,197k 11,789k -10.61 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 h7-h6 Bf1-g2 e7-e6 a2-a4 Ng8-f6 O-O Bf8-e7 Bc1-e3 a7-a6 a4-a5 Qd8-d7 h2-h3 Nf6-e4 Nb1-d2 O-O-O Nd2-b3 Bf5-h7 Nb3-c5 Ne4xc5 d4xc5 e6-e5 Rf1-d1
31/37 00:15 176,352k 11,720k -10.67 Ng1-f3 Nb8-c6 d2-d3 e7-e5 e2-e4 Ng8-f6 Bf1-e2 d7-d5 Nb1-d2 Bf8-e7 O-O O-O Rf1-e1 Bc8-e6 a2-a3 Be7-d6 b2-b4 a7-a5 b4-b5 Nc6-d4 Nf3xd4 e5xd4 e4xd5 Nf6xd5 Bc1-b2 Bd6-e5 Nd2-f3 Qd8-f6 Nf3xe5 Qf6xe5
32/40 00:21 244,044k 11,461k -10.69 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Bf1-g2 Ng8-f6 a2-a4 Bf8-e7 O-O O-O Nb1-d2 a7-a5 Nd2-b1 Bf5-c2 Bc1-f4 h7-h6 Bf4-c1 Be7-d6 Bc1-e3 Bc2-e4 h2-h3 Bd6-e7 Nb1-d2 Be4-f5 Ra1-d1 Nf6-e4 Nf3-e5 Nc6xe5 d4xe5 Ne4xd2 Rd1xd2
33/36 00:22 257,144k 11,414k -10.67 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Nf3-h4 Bf5-c2 Bf1-h3 Bf8-e7 Nh4-g2 g7-g5 Nb1-d2 h7-h5 Ng2-e3 Bc2-g6 Bh3-g2 h5-h4 g3xh4 Rh8xh4 Nd2-f3 Rh4-h5 h2-h4 g5xh4 b2-b4 Ng8-h6 b4-b5 Nc6-a5 Nf3-e5 Nh6-f5 Ne5xg6 f7xg6
34/39 00:36 403,261k 11,026k -10.70 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 g2-g3 e7-e6 Nf3-h4 Bf5-c2 Bf1-g2 Bf8-e7 Nh4-f3 Ng8-f6 Bc1-f4 O-O O-O Bc2-f5 Nb1-d2 Nf6-e4 Nf3-e1 Ne4xd2 Bf4xd2 Bf5-e4 f2-f3 Be4-g6 Ne1-d3 Bg6xd3 e2xd3 a7-a5 a2-a4 h7-h5 Bd2-e3 h5-h4
35/40 00:41 452,214k 10,979k -10.67 Ng1-f3 Nb8-c6 d2-d4 d7-d5 g2-g3 Bc8-f5 c2-c3 e7-e6 Bf1-g2 Ng8-f6 O-O Bf8-e7 Nb1-d2 O-O a2-a4 a7-a5 Nd2-b1 Bf5-c2 Bc1-f4 h7-h6 Bf4-e3 Nf6-g4 Be3-c1 Qd8-d7 h2-h3 Ng4-f6 Bc1-e3 b7-b6 Be3-f4 Bc2-e4 Nb1-d2 Be4-f5 Nf3-h4 Bf5-h7 Nh4-f3 Nf6-e4
36/41 00:47 517,566k 10,885k -10.69 Ng1-f3 Nb8-c6 d2-d4 d7-d5 g2-g3 Bc8-f5 c2-c3 h7-h6 Bf1-g2 e7-e6 Nb1-d2 Ng8-f6 O-O Bf8-d6 b2-b4 O-O a2-a4 Nf6-e4 Bc1-b2 a7-a6 Ra1-d1 Qd8-d7 Nd2-b3 b7-b6 Nf3-d2 Bd6-e7 h2-h3 Ra8-d8 Rd1-a1 Ne4-d6 Bb2-a3 Nd6-e4 b4-b5 Be7xa3 Ra1xa3
37/43- 01:26 921,071k 10,640k -10.77 Ng1-f3 Nb8-c6
37/48- 03:01 1,879,765k 10,367k -10.85 Ng1-f3 Nb8-c6
37/47 04:50 2,962,519k 10,199k -10.87 Ng1-f3 d7-d5 h2-h3 Nb8-c6 d2-d4 Bc8-f5 c2-c3 e7-e6 g2-g3 h7-h6 Bf1-g2 Ng8-f6 O-O Bf8-d6 Bc1-e3 O-O Nb1-d2 a7-a5 a2-a4 Nf6-e4 Rf1-d1 Ne4xd2 Nf3xd2 b7-b6 Rd1-c1 Qd8-f6 Kg1-h2 Qf6-g6 Kh2-g1 Rf8-d8 Nd2-f3 Bf5-e4 Nf3-h4 Qg6-h5 Nh4-f3 Bd6-e7 Rc1-f1 Be4xf3 e2xf3 Qh5-g6
38/46 06:05 3,694,073k 10,101k -10.90 Ng1-f3 d7-d5 h2-h3 Nb8-c6 d2-d4 Bc8-f5 c2-c3 e7-e6 g2-g3 h7-h6 Bf1-g2 Ng8-f6 O-O Bf8-d6 Bc1-e3 O-O Nb1-d2 a7-a5 a2-a4 Nf6-e4 Rf1-d1 Ne4xd2 Nf3xd2 b7-b6 b2-b3 Bf5-h7 h3-h4 Nc6-e7 Nd2-f3 c7-c5 c3-c4 Bh7-e4 Rd1-c1 Qd8-c7 c4xd5 e6xd5 h4-h5
39/46 06:30 3,932,482k 10,077k -10.90 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 Nb1-d2 e7-e6 a2-a4 Ng8-f6 a4-a5 a7-a6 Nf3-h4 Bf5-c2 g2-g3 h7-h6 Bf1-g2 Qd8-d7 Nd2-f1 O-O-O Nf1-e3 Bc2-h7 O-O Kc8-b8 Nh4-f3 Bf8-d6 Rf1-d1 Rh8-e8 Nf3-e1 Bh7-e4 f2-f3 Be4-h7 Ne1-d3 Bh7xd3 Rd1xd3 e6-e5 d4xe5 Nc6xe5
40/49 11:21 6,727,744k 9,867k -10.95 Ng1-f3 Nb8-c6 d2-d4 d7-d5 c2-c3 Bc8-f5 Bc1-e3 e7-e6 g2-g3 h7-h6 Bf1-g2 Ng8-f6 Nb1-d2 Nf6-g4 Nd2-f1 Bf8-d6 Be3-d2 O-O h2-h3 Ng4-f6 Nf1-e3 Bf5-h7 Ra1-d1 Nf6-e4 Bd2-c1 Nc6-e7 O-O c7-c5 c3-c4 Ra8-c8 c4xd5 Ne7xd5 Ne3xd5 e6xd5 d4xc5 Bd6xc5 e2-e3 Qd8-e7 Rd1xd5 Rf8-d8 Rd5-e5 Qe7-d7
lkaufman
Posts: 6283
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by lkaufman »

Rebel wrote: Fri Sep 24, 2021 10:30 am Final results Komodo vs Stockfish

Code: Select all

Knight odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  55.6    73.8    89.9
Stockfish 14     28.5    47.2    70.1

Bishop odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  47.1    67.6    81.8
Stockfish 14     14.5    31.3    51.2

Rook odds        Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  25.5    52.9    73.2
Stockfish 14     18.0    41.1    64.0

Queen odds       Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  1.0%    3.6%    9.4%
Stockfish 14     0.0%    0.2%    0.5%
Komodo wins at every odds. I am pretty sure Komodo playing GM's at odds has resulted in program changes when down in material, Larry might comment on that one :D

Details at - https://prodeo.actieforum.com/t543-knig ... ds-results

Download games - http://rebel13.nl/odds.zip

------------------------------------

What's next ?

I see 5 options.

1. Play queen-odds matches 2000 / 1500 /1000 elo until finally Komodo and/or Stockfish start to win, >50%

2. Look above, instead of 2700 engines test also 2800, 2900, 3000 elo pools.

3. Invite a third engine and repeat the 2700, 2500, 2300 elo cycle. Suggest an engine that does better than SF14, but make a reasonable case for it.

4. Suggest something interesting else.

5. Stop, it's enough.

Pick your preference.
Yes, we do small things to improve odds play, and include some odds positions in training the nets. Stockfish does strange things that hurt odds play I think. We can send you the new Dragon 2.5, it should be even better than Dragon 2 at odds play; it is 100 elo stronger in FRC blitz play!
Komodo rules!
User avatar
Rebel
Posts: 7474
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by Rebel »

lkaufman wrote: Fri Sep 24, 2021 4:25 pm
Rebel wrote: Fri Sep 24, 2021 10:30 am Final results Komodo vs Stockfish

Code: Select all

Knight odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  55.6    73.8    89.9
Stockfish 14     28.5    47.2    70.1

Bishop odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  47.1    67.6    81.8
Stockfish 14     14.5    31.3    51.2

Rook odds        Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  25.5    52.9    73.2
Stockfish 14     18.0    41.1    64.0

Queen odds       Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  1.0%    3.6%    9.4%
Stockfish 14     0.0%    0.2%    0.5%
Komodo wins at every odds. I am pretty sure Komodo playing GM's at odds has resulted in program changes when down in material, Larry might comment on that one :D

Details at - https://prodeo.actieforum.com/t543-knig ... ds-results

Download games - http://rebel13.nl/odds.zip

------------------------------------

What's next ?

I see 5 options.

1. Play queen-odds matches 2000 / 1500 /1000 elo until finally Komodo and/or Stockfish start to win, >50%

2. Look above, instead of 2700 engines test also 2800, 2900, 3000 elo pools.

3. Invite a third engine and repeat the 2700, 2500, 2300 elo cycle. Suggest an engine that does better than SF14, but make a reasonable case for it.

4. Suggest something interesting else.

5. Stop, it's enough.

Pick your preference.
Yes, we do small things to improve odds play, and include some odds positions in training the nets. Stockfish does strange things that hurt odds play I think. We can send you the new Dragon 2.5, it should be even better than Dragon 2 at odds play; it is 100 elo stronger in FRC blitz play!
Thank you for your kind offer, I accept of course! I like Uri's idea to pitch Dragon 2.5 vs SF13 with equal contempt. But what should be the best contempt value?

And of course Dragon 2.5 will be tested for the GRL on my other PC.
90% of coding is debugging, the other 10% is writing bugs.
lkaufman
Posts: 6283
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by lkaufman »

Rebel wrote: Fri Sep 24, 2021 4:42 pm
lkaufman wrote: Fri Sep 24, 2021 4:25 pm
Rebel wrote: Fri Sep 24, 2021 10:30 am Final results Komodo vs Stockfish

Code: Select all

Knight odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  55.6    73.8    89.9
Stockfish 14     28.5    47.2    70.1

Bishop odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  47.1    67.6    81.8
Stockfish 14     14.5    31.3    51.2

Rook odds        Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  25.5    52.9    73.2
Stockfish 14     18.0    41.1    64.0

Queen odds       Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  1.0%    3.6%    9.4%
Stockfish 14     0.0%    0.2%    0.5%
Komodo wins at every odds. I am pretty sure Komodo playing GM's at odds has resulted in program changes when down in material, Larry might comment on that one :D

Details at - https://prodeo.actieforum.com/t543-knig ... ds-results

Download games - http://rebel13.nl/odds.zip

------------------------------------

What's next ?

I see 5 options.

1. Play queen-odds matches 2000 / 1500 /1000 elo until finally Komodo and/or Stockfish start to win, >50%

2. Look above, instead of 2700 engines test also 2800, 2900, 3000 elo pools.

3. Invite a third engine and repeat the 2700, 2500, 2300 elo cycle. Suggest an engine that does better than SF14, but make a reasonable case for it.

4. Suggest something interesting else.

5. Stop, it's enough.

Pick your preference.
Yes, we do small things to improve odds play, and include some odds positions in training the nets. Stockfish does strange things that hurt odds play I think. We can send you the new Dragon 2.5, it should be even better than Dragon 2 at odds play; it is 100 elo stronger in FRC blitz play!
Thank you for your kind offer, I accept of course! I like Uri's idea to pitch Dragon 2.5 vs SF13 with equal contempt. But what should be the best contempt value?

And of course Dragon 2.5 will be tested for the GRL on my other PC.
I would recommend for Dragon Contempt 100 for knight odds, 125 for rook odds, and 175 for queen odds. But I think Stockfish versions that had Contempt limited it to 100, so if you want to use the same value then it has to be 100 for all handicaps. The definition of Contempt isn't the same in the two engines, but I think it is similar enough for your purposes.
Komodo rules!
lkaufman
Posts: 6283
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by lkaufman »

Rebel wrote: Fri Sep 24, 2021 10:30 am Final results Komodo vs Stockfish

Code: Select all

Knight odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  55.6    73.8    89.9
Stockfish 14     28.5    47.2    70.1

Bishop odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  47.1    67.6    81.8
Stockfish 14     14.5    31.3    51.2

Rook odds        Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  25.5    52.9    73.2
Stockfish 14     18.0    41.1    64.0

Queen odds       Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  1.0%    3.6%    9.4%
Stockfish 14     0.0%    0.2%    0.5%
Komodo wins at every odds.
So here are the approximate Dragon performance ratings against the closest rating pool (closest to 50%): Knight odds 2739, Bishop odds 2680, Rook odds 2520, queen odds 1906. I expected the rook odds vs knight odds to be about 200 more, it was 219 more. Queen odds perf. was a bit higher than I expected. The bishop vs knight difference was a bit more than I expected for the opening position, where the large number of pawns should help the knights somewhat. If we subtract 120 elo estimated for the difference between using the first 100 positions on the list vs. average/middle positions or pure odds without a book then a fair opponent for Dragon at this blitz time control is still about 2620 at knight odds, about 2560 for bishop odds, 2400 for rook odds, and a bit below 1800 for queen odds. In general, I think that strong engines play blitz roughly at the same strength as humans of the same rating (comparing CCRL blitz to Human FIDE) play Rapid, but these are bullet games, not blitz games, so perhaps another 150 elo or so needs to be deducted to predict human performance at Rapid. That gives about 2470 for human knight odds, 2410 for human bishop odds, 2250 for rook odds, and about 1635 for queen odds. The knight odds figure agrees almost perfectly with our results in 17 Rapid human games, the rook and queen odds figures seem a bit too high.
Komodo rules!
User avatar
Rebel
Posts: 7474
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by Rebel »

lkaufman wrote: Fri Sep 24, 2021 6:36 pm
Rebel wrote: Fri Sep 24, 2021 10:30 am Final results Komodo vs Stockfish

Code: Select all

Knight odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  55.6    73.8    89.9
Stockfish 14     28.5    47.2    70.1

Bishop odds      Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  47.1    67.6    81.8
Stockfish 14     14.5    31.3    51.2

Rook odds        Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  25.5    52.9    73.2
Stockfish 14     18.0    41.1    64.0

Queen odds       Pool    Pool    Pool
Engine           2700    2500    2300
Komodo Dragon 2  1.0%    3.6%    9.4%
Stockfish 14     0.0%    0.2%    0.5%
Komodo wins at every odds.
So here are the approximate Dragon performance ratings against the closest rating pool (closest to 50%): Knight odds 2739, Bishop odds 2680, Rook odds 2520, queen odds 1906. I expected the rook odds vs knight odds to be about 200 more, it was 219 more. Queen odds perf. was a bit higher than I expected. The bishop vs knight difference was a bit more than I expected for the opening position, where the large number of pawns should help the knights somewhat. If we subtract 120 elo estimated for the difference between using the first 100 positions on the list vs. average/middle positions or pure odds without a book then a fair opponent for Dragon at this blitz time control is still about 2620 at knight odds, about 2560 for bishop odds, 2400 for rook odds, and a bit below 1800 for queen odds. In general, I think that strong engines play blitz roughly at the same strength as humans of the same rating (comparing CCRL blitz to Human FIDE) play Rapid, but these are bullet games, not blitz games, so perhaps another 150 elo or so needs to be deducted to predict human performance at Rapid. That gives about 2470 for human knight odds, 2410 for human bishop odds, 2250 for rook odds, and about 1635 for queen odds. The knight odds figure agrees almost perfectly with our results in 17 Rapid human games, the rook and queen odds figures seem a bit too high.
I think it's reasonable assume if we run the 2700 pool at 40/120 (so factor 3 more time) the results will favor the 2700 engines a bit.
90% of coding is debugging, the other 10% is writing bugs.
User avatar
Rebel
Posts: 7474
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by Rebel »

lkaufman wrote: Fri Sep 24, 2021 5:36 pm I would recommend for Dragon Contempt 100 for knight odds, 125 for rook odds, and 175 for queen odds. But I think Stockfish versions that had Contempt limited it to 100, so if you want to use the same value then it has to be 100 for all handicaps. The definition of Contempt isn't the same in the two engines, but I think it is similar enough for your purposes.
Started the first match with Dragon 2.5 and contempt of 100 vs the 2700 pool.

I assume the 2700 pool is the one you are most interested In ?

http://rebel13.nl/a/grl.htm
90% of coding is debugging, the other 10% is writing bugs.
lkaufman
Posts: 6283
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Komodo-Dragon-2 vs Stockfish 14 at knight odss

Post by lkaufman »

Rebel wrote: Fri Sep 24, 2021 7:10 pm
lkaufman wrote: Fri Sep 24, 2021 5:36 pm I would recommend for Dragon Contempt 100 for knight odds, 125 for rook odds, and 175 for queen odds. But I think Stockfish versions that had Contempt limited it to 100, so if you want to use the same value then it has to be 100 for all handicaps. The definition of Contempt isn't the same in the two engines, but I think it is similar enough for your purposes.
Started the first match with Dragon 2.5 and contempt of 100 vs the 2700 pool.

I assume the 2700 pool is the one you are most interested In ?

http://rebel13.nl/a/grl.htm
Yes, for knight and bishop odds anyway. Ideally we should pick a pool where we score close to 50%. Dragon 2.5 and Contempt 100 should both help, but probably not dramatically, since "better" chess is not necessarily better at giving big odds. If you switched to using the middle positions from the ChrisW list this would probably lower the performance more than the new version and Contempt would raise it, but perhaps you prefer consistency.
Komodo rules!