Status Quo 2016-1 , Stockfish - Komodo

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Dann Corbit, Harvey Williamson

Jouni
Posts: 2388
Joined: Wed Mar 08, 2006 7:15 pm

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by Jouni » Thu Jan 07, 2016 5:12 pm

There is not much difference in nps, depth or evaluation between the engines, so no wonder the score is 50% now :). But why does Komodo display 66,1% hash usage the whole time?
Jouni

Dan Cooper
Posts: 184
Joined: Sun Nov 01, 2015 2:15 am

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by Dan Cooper » Thu Jan 07, 2016 5:13 pm

Very nice setup, I'm jealous. Except for the hash, this is basically a replay of the TCEC Season 8 Super Final.

mjlef
Posts: 1467
Joined: Thu Mar 30, 2006 12:08 pm

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by mjlef » Thu Jan 07, 2016 9:01 pm

Jouni wrote:There is not much difference in nps, depth or evaluation between the engines, so no wonder the score is 50% now :). But why does Komodo display 66,1% hash usage the whole time?
Komodo only resets the hashfull count when the hash is cleared or resized. This is how Don programmed it, although I know most programs reset it for each new search.
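
To make the two reporting policies concrete, here is a minimal toy sketch. It is not Komodo's (or any engine's) actual code: the class `ToyHashTable`, its methods and the `reset_each_search` flag are hypothetical names made up for illustration, and the table is a plain Python list. It only shows why a fill counter that is reset solely on clear/resize stays at its earlier level, while a per-search counter drops back at the start of each search.

```python
class ToyHashTable:
    """Toy table (hypothetical, for illustration) with a permille fill counter."""

    def __init__(self, size, reset_each_search):
        self.size = size
        self.reset_each_search = reset_each_search
        self.clear()

    def clear(self):
        # Clearing (or resizing) always resets the counter under both policies.
        self.slots = [None] * self.size
        self.counted = [False] * self.size
        self.used = 0

    def new_search(self):
        # Stored entries are kept for reuse; only the counter policy differs.
        if self.reset_each_search:
            self.counted = [False] * self.size
            self.used = 0

    def store(self, key, value):
        index = key % self.size
        if not self.counted[index]:
            self.counted[index] = True
            self.used += 1
        self.slots[index] = (key, value)

    def hashfull(self):
        # Fill level in permille, the unit used by the UCI "hashfull" info field.
        return 1000 * self.used // self.size


persistent = ToyHashTable(1000, reset_each_search=False)
per_search = ToyHashTable(1000, reset_each_search=True)

for table in (persistent, per_search):
    table.new_search()
    for key in range(600):   # first search touches 600 slots
        table.store(key, "entry")
    table.new_search()
    for key in range(100):   # second search touches only 100 slots
        table.store(key, "entry")

print(persistent.hashfull())  # counter kept across searches
print(per_search.hashfull())  # counter restarted for the second search
```

With the persistent policy the reported value can only grow until the table is cleared, which fits a display that no longer changes from move to move.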

Jouni
Posts: 2388
Joined: Wed Mar 08, 2006 7:15 pm

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by Jouni » Thu Jan 07, 2016 9:26 pm

Yes, I know. But why is it exactly 66,1%?
Jouni

Graham Banks
Posts: 35073
Joined: Sun Feb 26, 2006 9:52 am
Location: Auckland, NZ

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by Graham Banks » Thu Jan 07, 2016 9:34 pm

Hugo wrote:Hi engine lovers, and a happy new year 2016 :)

I have started an engine match between Komodo and Stockfish.

Engines use 24 CPUs, no hyperthreading, ponder OFF
8 GB of hash tables
Syzygy 3-4-5-6
100 games, 60 + 0 timer
using a set of 50 start positions, 2-8 moves
Komodo contempt = 0

I will post games and a crosstable later.
Interested observers can watch the games with a 15 sec update frequency:
http://clemens-keck.de/screen/

regards, Clemens Keck
Brilliant! :)
Looking forward to some high quality games.
Thanks Clemens.
gbanksnz at gmail.com

Ajedrecista
Posts: 1537
Joined: Wed Jul 13, 2011 7:04 pm
Location: Madrid, Spain.

Re: Status Quo 2016-1, Stockfish - Komodo.

Post by Ajedrecista » Fri Jan 08, 2016 12:01 pm

Hello:
Hugo wrote:Hi engine lovers, and a happy new year 2016 :)

I have started an engine match between Komodo and Stockfish.

Engines use 24 CPUs, no hyperthreading, ponder OFF
8 GB of hash tables
Syzygy 3-4-5-6
100 games, 60 + 0 timer
using a set of 50 start positions, 2-8 moves
Komodo contempt = 0

I will post games and a crosstable later.
Interested observers can watch the games with a 15 sec update frequency:
http://clemens-keck.de/screen/

regards, Clemens Keck
Excellent! Thanks for sharing.

Regarding the asymmetric confidence intervals of Fritz GUI (I suppose it is Fritz GUI), I managed to get most of the numbers. At this moment:

Code:

Komodo 9.3 64-bit x24 - Stockfish 030116 64 POPCNT; 39 Games, +5 =30 -4, 51.3%, TP=+9 Elo, 68%->[-11,+65],
                        95%->[-31,+124], 99.7%->[-51,+191], predicted moves=63.6%
After many trials I arrived at the following equations for the score bounds. I hope there are no typos:

Code:

µ = (wins + 0.5*draws)/games

Assuming that Fritz GUI reports 1-sigma, 2-sigma and 3-sigma confidence intervals, for z-sigma confidence interval:

(Lower score bound) = µ + z·ln(2·µ) - (2·z/3)·sqrt[µ·(1 - µ)/(games - 1)]
(Upper score bound) = µ + z·ln(2·µ) + (2·z/3)·sqrt[µ·(1 - µ)/(games - 1)]
And then convert these scores into Elo differences with the well-known 400·log10[score/(1 - score)]. Here is what I obtained with Excel after rounding to the nearest integers:

Code:

1-sigma: [-11, +65]
2-sigma: [-31, +124]
3-sigma: [-51, +192]
The only difference is in the upper Elo difference of 3-sigma (+191 versus +192). The bias of the centre of each interval is z·ln(2·µ) above the score.
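
For anyone who wants to re-check these numbers without Excel, here is a minimal Python sketch of the same calculation. It simply evaluates the fitted equations given above for the +5 =30 -4 result and is not a claim about how Fritz GUI computes its intervals. The function names `score_to_elo` and `fitted_elo_interval` are made up for this example.

```python
import math


def score_to_elo(score):
    """Convert a score in (0, 1) to an Elo difference: 400*log10[score/(1 - score)]."""
    return 400.0 * math.log10(score / (1.0 - score))


def fitted_elo_interval(wins, draws, losses, z):
    """Evaluate the empirically fitted z-sigma bounds and return them as rounded Elo."""
    games = wins + draws + losses
    mu = (wins + 0.5 * draws) / games
    # Centre of the interval: the score shifted upwards by z*ln(2*mu).
    centre = mu + z * math.log(2.0 * mu)
    # Half-width: (2*z/3) * sqrt[mu*(1 - mu)/(games - 1)].
    half_width = (2.0 * z / 3.0) * math.sqrt(mu * (1.0 - mu) / (games - 1))
    lower = score_to_elo(centre - half_width)
    upper = score_to_elo(centre + half_width)
    return round(lower), round(upper)


# Match state quoted above: 39 games, +5 =30 -4.
for z in (1, 2, 3):
    print(f"{z}-sigma:", fitted_elo_interval(5, 30, 4, z))
# 1-sigma: (-11, 65)
# 2-sigma: (-31, 124)
# 3-sigma: (-51, 192)
```

The conversion to Elo is undefined when a bound reaches 0 or 1, so this sketch is only meant for results close to 50%, such as this match.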

I do not claim it is the definitive formula for Fritz GUI. Please see:

Stockfish 070114 Vs Stockfish 070114 8 logical cores vs 4

There I replace µ + z·ln(2·µ) with 0.5 + (z/3)·ln(2·µ) and get somewhat similar 3-sigma confidence intervals, but the fit is not as close as the one for Clemens' current match with the first formula.

Is there any known distribution that matches my first formula, or is it just a lucky fit for this sample? More samples are needed to test the formula, of course.

Regards from Spain.

Ajedrecista.

Dan Cooper
Posts: 184
Joined: Sun Nov 01, 2015 2:15 am

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by Dan Cooper » Fri Jan 08, 2016 5:34 pm

What are the rules for adjudicating these games?

bob
Posts: 20923
Joined: Mon Feb 27, 2006 6:30 pm
Location: Birmingham, AL

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by bob » Fri Jan 08, 2016 6:45 pm

Jouni wrote:Yes, I know. But why is it exactly 66,1%?
If you only show a "peak" value, you would expect it to reach some "peak" and stick there. I have a "data" percentage for the parallel search data used in Crafty. I keep a peak value for it and display that "peak usage" after each search. Not an average or whatever, as I want to know if it ever reaches 100%, which would indicate a need to change the size.

Hugo
Posts: 782
Joined: Tue Dec 01, 2009 10:10 am

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by Hugo » Fri Jan 08, 2016 11:09 pm

Dan Cooper wrote:What are the rules for adjudicating these games?
Huh, I must say I don't know. The Fritz 15 GUI is on its basic (default) settings.
I think draws come from move repetition or the 50-move rule?

C.K.

bnculp
Posts: 67
Joined: Wed Mar 08, 2006 7:19 pm

Re: Status Quo 2016-1 , Stockfish - Komodo

Post by bnculp » Sat Jan 09, 2016 12:27 pm

The Fritz 15 GUI adjudication options are located in the File-Options-Game tab. If you read the help file located at:

http://help.chessbase.com/Fritz/15/Eng/ ... 000065.htm

you can see some of the specifics. Unfortunately I do not see any details on the Draw option.
