Komodo 9.1 vs Stockfish 15061716 - 16 threads

fastgm · Post by **fastgm** » Wed Jul 01, 2015 9:17 pm

Dual AMD Opteron 6376

Komodo 9.1 vs Stockfish 15061716 - 16 threads
2048 MB Hash
TC = 20 minutes + 12 seconds

 
    Program                   Elo    +   -   Games   Score   Av.Op.  Draws
 ---------------------------------------------------------------------------
  1 Komodo 9.1 T16          &#58; 3018   20  19   326    55.2 %   2982   72.4 %
  2 Stockfish 15061716 T16  &#58; 2982   19  20   326    44.8 %   3018   72.4 %


Wins   = 62
Draws  = 236
Losses = 28
Av.Op. Elo = 3000
 
Result     &#58; 180.0/326 (+62,=236,-28&#41;
Perf.      &#58; 55.2 %
Margins    &#58;
 68 %      &#58; (+  1.4,-  1.4 %) -> &#91; 53.8, 56.6 %&#93;
 95 %      &#58; (+  2.8,-  2.8 %) -> &#91; 52.4, 58.0 %&#93;
 99.7 %    &#58; (+  4.3,-  4.2 %) -> &#91; 51.0, 59.5 %&#93;
 
Elo        &#58; 3036
Margins    &#58;
 68 %      &#58; (+ 10,- 10&#41; -> &#91;3026,3046&#93;
 95 %      &#58; (+ 20,- 19&#41; -> &#91;3017,3056&#93;
 99.7 %    &#58; (+ 30,- 29&#41; -> &#91;3007,3067&#93;


Games        &#58;    326 &#40;finished&#41;
 
White Wins   &#58;     67 &#40;20.6 %)
Black Wins   &#58;     23 ( 7.1 %)
Draws        &#58;    236 &#40;72.4 %)
 
White Perf.  &#58; 56.7 %
Black Perf.  &#58; 43.3 %
 

Individual statistics&#58;
 
1 Komodo 9.1 T16          &#58; 3018  326 (+ 62,=236,- 28&#41;, 55.2 %
2 Stockfish 15061716 T16  &#58; 2982  326 (+ 28,=236,- 62&#41;, 44.8 %

Games:
http://www.fastgm.de/schach/results-K91 ... 200+12.zip

cdani · Post by **cdani** » Wed Jul 01, 2015 9:58 pm

Very interesting!! Thanks!

Adam Hair · Post by **Adam Hair** » Wed Jul 01, 2015 11:32 pm

Thanks for sharing this, Andreas.

JJJ · Post by **JJJ** » Thu Jul 02, 2015 7:38 am

+36 ELo vs one of the lastest Stockfish Dev at this time control and number of CPU. I m not surprised at all.

Laskos · Post by **Laskos** » Thu Jul 02, 2015 5:33 pm

Thanks Andreas, excellent test as usual from you. The result is conclusive: Komodo 9.1 is the strongest engine by a significant margin on many cores and larger than blitz time controls.

syzygy · Post by **syzygy** » Sun Jul 05, 2015 1:39 pm

I would be curious to see the results under these conditions between Stockfish with 16 threads and Stockfish with 8 threads.

lucasart · Post by **lucasart** » Sun Jul 05, 2015 2:12 pm

Laskos wrote:Thanks Andreas, excellent test as usual from you. The result is conclusive: Komodo 9.1 is the strongest engine by a significant margin on many cores and larger than blitz time controls.

Yes, seems conclusive indeed. I'm expecting TCEC will be just like the previous one: SF vs. K in the final, and K wins (with > 80% draw rate).

But is it SMP scaling ? TC scaling ? or both ?

TC scaling: I think it is well proven by now that K scales better than SF, as TC increases, SMP aside (1 core only).
SMP scaling? Even if SMP scaling was the same, increasing the number of cores for equal TC is equivalent to increasing the TC. You always measure SMP+TC scaling, never SMP scaling separetely. So it's hard to conclude whether K's SMP scaling is really better, or if it's a side effect of TC scaling.

Unfortunately, none of that brings us any closer to writing SF patches that improve either TC scaling or SMP scaling

Laskos · Post by **Laskos** » Thu Jul 16, 2015 8:46 pm

lucasart wrote:
Laskos wrote:Thanks Andreas, excellent test as usual from you. The result is conclusive: Komodo 9.1 is the strongest engine by a significant margin on many cores and larger than blitz time controls.
Yes, seems conclusive indeed. I'm expecting TCEC will be just like the previous one: SF vs. K in the final, and K wins (with > 80% draw rate).

But is it SMP scaling ? TC scaling ? or both ?

TC scaling: I think it is well proven by now that K scales better than SF, as TC increases, SMP aside (1 core only).

SMP scaling? Even if SMP scaling was the same, increasing the number of cores for equal TC is equivalent to increasing the TC. You always measure SMP+TC scaling, never SMP scaling separetely. So it's hard to conclude whether K's SMP scaling is really better, or if it's a side effect of TC scaling.

Unfortunately, none of that brings us any closer to writing SF patches that improve either TC scaling or SMP scaling

My impression (just an impression) is that it's both, mildly. A bit better SMP scaling and a bit better TC scaling. Also, from the tests of Andreas Strangmüller, Leto Atreides and Andrey Chilantiev, it seems that the expected score in 64 TCEC games in superfinal with 80% draw rate is +9 -4 for Komodo, with winning probability somewhere around 90%. I don't think SF can catch up until August TCEC.

JJJ · Post by **JJJ** » Thu Jul 16, 2015 10:59 pm

That's nice to read when you read before than Komdo team wouldn't catch up Stockfish team

Laskos · Post by **Laskos** » Fri Jul 17, 2015 10:34 am

JJJ wrote:That's nice to read when you read before than Komdo team wouldn't catch up Stockfish team

Well, there were some people stating that, and who would have expected that Komodo team comes in less than two months after the release of Komodo 9.0 with a well scaling on multiple cores and long TC improvement of some 40 ELO points? This is pretty much a killer in TCEC conditions. It would have been much "easier" to have 40 ELO points improvement on 1 core at ultra-bullet, scaling to some 15 points at longer TC and multicore.

Komodo 9.1 vs Stockfish 15061716 - 16 threads

Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads

Re: Komodo 9.1 vs Stockfish 15061716 - 16 threads