Testing Stockfish 11-03-13. 480 Games.

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

User avatar
Ozymandias
Posts: 1537
Joined: Sun Oct 25, 2009 2:30 am

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Ozymandias »

That lead is misleading, it's based on two wins against Protector, which is a "familiar" engine.
lucasart
Posts: 3242
Joined: Mon May 31, 2010 1:29 pm
Full name: lucasart

Re: Testing Stockfish 11-03-13. 480 Games.

Post by lucasart »

kranium wrote: Ok Tom, I give up...
:shock:
Honestly: :roll:

The CCC community needs to ban Stockfish now, so that other efforts might have at least a small chance of being competitive!
Sheesh...it's almost becoming absurd
:lol:

PS congrats to all those working on fishtest!
If you can't beat us, join us! We need competent engine developers, like you. You can just experiment with stuff and have access to monstrous testing resources. Plus it's more fun to be part of a community than just developping on your own (which is 1% dev and 99% waiting for test results to finish).
Theory and practice sometimes clash. And when that happens, theory loses. Every single time.
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Tomcass »

Ok Tom, I give up...
:shock:
Honestly: :roll:

The CCC community needs to ban Stockfish now, so that other efforts might have at least a small chance of being competitive!
Sheesh...it's almost becoming absurd
:lol:

PS congrats to all those working on fishtest![/quote]

Hi Norman.

I was ready to suggest you to join the powerful and well organized SF team... but Lucas has been faster than me. You have proven to be an extremely talented man and you would be able to add lots of value to the SF Team. :wink:

Kind regards from Barcelona.

Tom.
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Tomcass »

TESTING STOCKFISH 061114MZ: 480 Games

i7 980 3.33 Ghz.
6 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2014t – Limit 8 moves. Sedat –
No tablebases. No RTB used.
Hash 512
Relative Speed: 29.54
Knodes per second: 14.177

Time Control = 4+0

SF111114MZ x64 POPCNT_ - Houdini 4 x64_st_X6_CT0 22.5 - 17.5 +10/=25/-5 56.25%
SF111114MZ x64 POPCNT_ - Komodo 8 64-bit_6_NOB 21.5 - 18.5 +10/=23/-7 53.75%
SF111114MZ x64 POPCNT_ - Gull 3 x64 XP 24.5 - 15.5 +12/=25/-3 61.25%

Time Control= 2+2

SF111114MZ x64 POPCNT_ - Houdini 4 x64_st_X6_CT0 18.0 - 22.0 +7/=22/-11 45.00%
SF111114MZ x64 POPCNT_ - Komodo 8 64-bit_6_NOB 22.0 - 18.0 +10/=24/-6 55.00%
SF111114MZ x64 POPCNT_ - Gull 3 x64 XP 27.0 - 13.0 +15/=24/-1 67.50%

Score using 6 cores: 135.5 – 104.5 = 56.46%

240 Games =
http://www.mediafire.com/view/oyenpvw29 ... 0Games.pgn


i7 975 3.33 Ghz.
4 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2012t –Limit 8 moves. Sedat-
No tablebases. No RTB used.
Hash 256
Relative Speed: 20.62
Knodes per second: 9.899

Time Control: 4+0

SF111114MZ x64 POPCNT_ - Houdini 4 Pro x64-A_OK 24.5 - 15.5 +14/=21/-5 61.25%
SF111114MZ x64 POPCNT_ - Komodo 8 64-bit_4_NOB 25.0 - 15.0 +14/=22/-4 62.50%
SF111114MZ x64 POPCNT_ - Gull 3 x64 XP 28.5 - 11.5 +17/=23/-0 71.25%

Time Control= 2+2

SF111114MZ x64 POPCNT_ - Houdini 4 Pro x64-A_OK 23.5 - 16.5 +13/=21/-6 58.75%
SF111114MZ x64 POPCNT_ - Komodo 8 64-bit_4_NOB 21.0 - 19.0 +9/=24/-7 52.50%
SF111114MZ x64 POPCNT_ - Gull 3 x64 XP 26.5 - 13.5 +17/=19/-4 66.25%
Score using 4 cores: 149.0 – 91.0 = 62.08%
240 Games:
http://www.mediafire.com/view/n1kt5055p ... 0games.pgn

Segmenting by Time Control:

Fixed TC = 146.5 – 93.5 = 61.04%
Incremental TC = 138.0 – 102.0 = 57.50%

GLOBAL SCORE: 284.5 – 195.5 = 59.27%

Against : Houdini 4.0 St. Ct0 (3227) = 55.31% ; Komodo 8 (3266) = 55.92%, Gull 3 XP (3199) = 66.56%

Average Estimated Elo Opponents = 3231
Estimated Elo Performance= 3296


Error bars= +/- 23 EEP

This is a strong compile. Just 5 Elo points below the best SF Development –within error bars-.

Regards,

Tom.
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Tomcass »

TESTING STOCKFISH 151114 IPMAN: 480 Games

i7 980 3.33 Ghz.
6 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2014t – Limit 8 moves. Sedat –
No tablebases. No RTB used.
Hash 512
Relative Speed: 29.54
Knodes per second: 14.177

Time Control = 4+0

SF 151114IP 64 POPCNT_ - Houdini 4 x64_st_X6_CT0 23.5 - 16.5 +13/=21/-6 58.75%
SF 151114IP 64 POPCNT_ - Komodo 8 64-bit_6_NOB 19.5 - 20.5 +5/=29/-6 48.75%
SF 151114IP 64 POPCNT_ - Gull 3 x64 XP 22.5 - 17.5 +10/=25/-5 56.25%

Time Control= 2+2

SF 151114IP 64 POPCNT_ - Houdini 4 x64_st_X6_CT0 25.0 - 15.0 +12/=26/-2 62.50%
SF 151114IP 64 POPCNT_ - Komodo 8 64-bit_6_NOB 19.0 - 21.0 +5/=28/-7 47.50%
SF 151114IP 64 POPCNT_ - Gull 3 x64 XP 26.5 - 13.5 +15/=23/-2 66.25%

Score using 6 cores: 136.0 – 104.0 = 56.67%

240 games= http://www.mediafire.com/view/i3abd82sk ... 0Games.pgn


i7 975 3.33 Ghz.
4 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2012t –Limit 8 moves. Sedat-
No tablebases. No RTB used.
Hash 256
Relative Speed: 20.62
Knodes per second: 9.899

Time Control: 4+0

SF 151114IP 64 POPCNT_ - Houdini 4 Pro x64-A_OK 25.0 - 15.0 +14/=22/-4 62.50%
SF 151114IP 64 POPCNT_ - Komodo 8 64-bit_4_NOB 20.5 - 19.5 +7/=27/-6 51.25%
SF 151114IP 64 POPCNT_ - Gull 3 x64 XP 27.5 - 12.5 +18/=19/-3 68.75%

Time Control= 2+2

SF 151114IP 64 POPCNT_ - Houdini 4 Pro x64-A_OK 22.5 - 17.5 +9/=27/-4 56.25%
SF 151114IP 64 POPCNT_ - Komodo 8 64-bit_4_NOB 23.0 - 17.0 +12/=22/-6 57.50%
SF 151114IP 64 POPCNT_ - Gull 3 x64 XP 27.5 - 12.5 +15/=25/-0 68.75%
Score using 4 cores: 146.0 – 94.0 = 60.83%

240 games= http://www.mediafire.com/view/xphzemx9d ... 0games.pgn

Segmenting by Time Control:

Fixed TC = 138.5 – 101.5 = 57,71%
Incremental TC = 143.5 – 96.5 = 59.79%

Global Score: 282.0 - 198.0 = 58.75%

Against : Houdini 4.0 St. Ct0 (3227) = 60.00% ; Komodo 8 (3266) = 51.25%, Gull 3 XP (3199) = 65.00%

Average Estimated Elo Opponents = 3231
Estimated Elo Performance= 3292


Error bars= +/- 23 EEP

Regards,

Tom.
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Tomcass »

TESTING STOCKFISH DEVELOPMENT 171114: 1440 GAMES. (THREE 480 GAMES TESTS IN ONE).

Timestamp: 1416181833 Bench: 7694316

i7 980 3.33 Ghz.
6 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2014t – Limit 8 moves. Sedat –
No tablebases. No RTB used.
Hash 512
Relative Speed: 29.54
Knodes per second: 14.177

Time Control = 4+0

Stockfish 171114 64 POPCNT_ - Houdini 4 x64_st_X6_CT0 69.0 - 51.0 +31/=76/-13 57.50%
Stockfish 171114 64 POPCNT_ - Komodo 8 64-bit_6_NOB 64.0 - 56.0 +25/=78/-17 53.33%
Stockfish 171114 64 POPCNT_ - Gull 3 x64 XP 77.0 - 43.0 +40/=74/-6 64.17%

Time Control= 2+2

Stockfish 171114 64 POPCNT_ - Houdini 4 x64_st_X6_CT0 69.0 - 51.0 +31/=76/-13 57.50%
Stockfish 171114 64 POPCNT_ - Komodo 8 64-bit_6_NOB 71.0 - 49.0 +29/=84/-7 59.17%
Stockfish 171114 64 POPCNT_ - Gull 3 x64 XP 72.0 - 48.0 +32/=80/-8 60.00%

Score using 6 cores: 422.0 – 298.0 = 58.61%

720 Games
http://www.mediafire.com/download/9gbxq ... 0Games.pgn

i7 975 3.33 Ghz.
4 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2012t –Limit 8 moves. Sedat-
No tablebases. No RTB used.
Hash 256
Relative Speed: 20.62
Knodes per second: 9.899

Time Control: 4+0

Stockfish 171114 64 POPCNT_ - Houdini 4 Pro x64-A_OK 73.5 - 46.5 +39/=69/-12 61.25%
Stockfish 171114 64 POPCNT_ - Komodo 8 64-bit_4_NOB 69.5 - 50.5 +32/=75/-13 57.92%
Stockfish 171114 64 POPCNT_ - Gull 3 x64 XP 75.5 - 44.5 +39/=73/-8 62.92%

Time Control= 2+2
Stockfish 171114 64 POPCNT_ - Houdini 4 Pro x64-A_OK 72.0 - 48.0 +35/=74/-11 60.00%
Stockfish 171114 64 POPCNT_ - Komodo 8 64-bit_4_NOB 68.0 - 52.0 +25/=86/-9 56.67%
Stockfish 171114 64 POPCNT_ - Gull 3 x64 XP 73.0 - 47.0 +35/=76/-9 60.83%
Score using 4 cores: 431.5 – 288.5= 59.93%

720 Games=
http://www.mediafire.com/download/ojq9l ... 0games.pgn

Segmenting by Time Control:

Fixed TC = 428.5 – 291.5= 59.51%
Incremental TC = 425.0 – 295.0= 59.03%

GLOBAL SCORE: 853.5 – 586.5 = 59.27%

Against : Houdini 4.0 St. Ct0 (3227) = 59.06% ; Komodo 8 (3266) = 56.77%, Gull 3 XP (3199) = 61.98%

Average Estimated Elo Opponents = 3231
Estimated Elo Performance= 3296

Error bars= +/- 13 EEP


Although 5 Elo points below the leading SF091114 DEV, IMHO fhis test confirms the big step forward made by Stockfish in the last weeks. Congratulations!.

Regards,

Tom.
User avatar
Ozymandias
Posts: 1537
Joined: Sun Oct 25, 2009 2:30 am

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Ozymandias »

Link says 720 games, file says 240 :wink:
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Tomcass »

Ozymandias wrote:Link says 720 games, file says 240 :wink:
Thanks for your comment Juan. The 240 figure in the name of the file is wrong. There are 720 games in each file, :-)



And I take advantage from this post to give a

BIG THANKS

to all TalkChess members for this amazing figure of more than 100,000 views of this thread. For me this is one more reason to keep testing Stockfish. :wink:

Kind and smiling regards to all the TalkChess community from Barcelona.

Tom.
User avatar
Ozymandias
Posts: 1537
Joined: Sun Oct 25, 2009 2:30 am

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Ozymandias »

I opened the quad games with chessx, and it shows 240.

Congratulations on your landmark, well deserved.
User avatar
Ozymandias
Posts: 1537
Joined: Sun Oct 25, 2009 2:30 am

Re: Testing Stockfish 11-03-13. 480 Games.

Post by Ozymandias »

Links are ok now, maybe I clicked somewhere else? :roll: