Testing Stockfish 11-03-13. 480 Games.

ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: no more tests?

Post by ouachita »

good stuff - I assume Houdini 4.0 Standard, Contempt 0 just means the contempt box is not checked?
SIM, PhD, MBA, PE
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: no more tests?

Post by Tomcass »

Hi, Bobby.

Contempt is set to 1 by default in Houdini. Following the suggestion of two experienced tester friends of mine, I found Contempt 0 to be slightly better than Contempt 1 on my computers, at least when playing against top engines. The only change I made was setting Contempt 0 instead of Contempt 1 in the parameters.

Regards,

Tom.
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: no more tests?

Post by Tomcass »

FOUR STOCKFISH AGAINST HOUDINI 4.0 AND KOMODO TCEC.

Second leg: Incremental Time Control 2min. + 2 sec. = 600 Games.


i7 980 3.33 Ghz.
6 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2012c
No tablebases. No RTB used.
Hash 512
Relative Speed: 28.66
Knodes per second: 13.759

Time Control= 2+2.

201312Sixstarstourn_2+2 2013


1 Houdini 4 x64_st_X6_CT0 11.5 - 8.5 10.0 - 10.0 9.5 - 10.5 12.0 - 8.0 12.5 - 7.5 55.5/100
2 Stockfish 041213 8.5 - 11.5 11.0 - 9.0 9.5 - 10.5 10.5 - 9.5 12.5 - 7.5 52.0/100
3 StockWood 031213 10.0 - 10.0 9.0 - 11.0 9.5 - 10.5 10.5 - 9.5 12.5 - 7.5 51.5/100
4 Stockfish DD 64 10.5 - 9.5 10.5 - 9.5 10.5 - 9.5 9.0 - 11.0 10.5 - 9.5 51.0/100
5 Stockfish 111113SL 8.0 - 12.0 9.5 - 10.5 9.5 - 10.5 11.0 - 9.0 12.0 - 8.0 50.0/100
6 Komodo TCEC 64-bitx6NOB 7.5 - 12.5 7.5 - 12.5 7.5 - 12.5 9.5 - 10.5 8.0 - 12.0 40.0/100


300 Games=
http://www.mediafire.com/download/30x5n7aznfs9kdi/

i7 975 3.33 Ghz.
4 real cores
Ponder: Off.
GUI: Fritz 12
Book: Perfect 2012c
No tablebases. No RTB used.
Hash 256
Relative Speed: 20.62
Knodes per second: 9.899

Time Control= 2+2

1 Houdini 4 x64xCT0 12.5 - 7.5 11.0 - 9.0 11.5 - 8.5 12.0 - 8.0 12.5 - 7.5 59.5/100
2 StockWood 031213 7.5 - 12.5 10.5 - 9.5 10.0 - 10.0 10.5 - 9.5 11.0 - 9.0 49.5/100
3 Stockfish 0412134.2x 9.0 - 11.0 9.5 - 10.5 9.5 - 10.5 10.0 - 10.0 11.5 - 8.5 49.5/100
4 Stockfish 111113SL 8.5 - 11.5 10.0 - 10.0 10.5 - 9.5 10.0 - 10.0 9.0 - 11.0 48.0/100
5 Komodo TCEC 64-bitx4 8.0 - 12.0 9.5 - 10.5 10.0 - 10.0 10.0 - 10.0 10.5 - 9.5 48.0/100
6 Stockfish DD 64 SSE4.2x 7.5 - 12.5 9.0 - 11.0 8.5 - 11.5 11.0 - 9.0 9.5 - 10.5 45.5/100

300 Games=
http://www.mediafire.com/view/7jlczu...2_300Games.pgn

STANDINGS FOR INCREMENTAL TIME CONTROL AFTER 600 GAMES ( Second half of this Tournament):

1.- Houdini 4x64xCT0 115.0/200
2.- Stockfish 041213 101.5/200
3.- Stockwood 031213 101.0/200
4.- Stockfish 111113SL 98.0/200
5.- Stockfish DD 96.5/200
6.- Komodo TCEC 64 88.0/200


FINAL STANDINGS AFTER 1.200 GAMES

1.- Houdini 4x64xCT0 220.5/400 55.125%
2.- Stockfish 041213 203.5/400 50.875%
3.- Stockwood 031213 203.5/400 50.875%
4.- Stockfish DD 199.0/400 49.750%
5.- Stockfish 111113SL 196.5/400 49.125%
6.- Komodo TCEC 64 177.0/400 44.250%
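The percentages in the final table can be rechecked with a few lines of Python. This is only a sketch: the points are copied from the table above, while the dictionary and function names are made up for the illustration. It also uses the fact that every game hands out exactly one point, so the points must add up to the number of games played.

```python
# Final points per engine, copied from the standings above (400 games each).
final_points = {
    "Houdini 4x64xCT0": 220.5,
    "Stockfish 041213": 203.5,
    "Stockwood 031213": 203.5,
    "Stockfish DD": 199.0,
    "Stockfish 111113SL": 196.5,
    "Komodo TCEC 64": 177.0,
}
GAMES_PER_ENGINE = 400


def percentages(points, games):
    """Return each engine's score as a percentage of the games it played."""
    return {name: 100.0 * scored / games for name, scored in points.items()}


def total_games(points):
    """Each game distributes exactly one point, so the sum is the game count."""
    return sum(points.values())


pct = percentages(final_points, GAMES_PER_ENGINE)
for name, value in pct.items():
    print(f"{name:<20} {value:.3f}%")
print("games played:", total_games(final_points))
```

The sum coming out at the advertised game count is a quick way to catch a typing error in a crosstable.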


Although the number of games is not big enough, my preliminary conclusions are:

- What an impressive win for Houdini 4.0! On my computers, at the mentioned time controls and against these rivals, the Standard version with Contempt ‘0’ is stronger than the Pro version with Contempt 1.
- We are lucky to see the fast pace of improvement in Stockfish: the previous leader of my ranking, Stockfish 111113SL (Ipman’s compile with Large Pages), has been surpassed by all other Stockfish in this test.
- Stockwood has proven to be very strong. I will follow its further development closely.
- The performance of Komodo TCEC, the current TCEC world champion, has been disappointing in this tournament.
- The efficient and creative Stockfish team still has hard work ahead to reach the level of Houdini 4.0 Standard Contempt ‘0’. Go, Stockfish!
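To put a rough number on "not big enough", here is a small sketch of the statistical uncertainty on a 400-game score. It assumes the games are independent and ignores that the opponents differ in strength, so take it as an order-of-magnitude estimate only. Since the draw rate is not given here, it uses the bound p(1-p)/n on the variance of the mean score, which holds for any mix of wins, draws and losses (draws can only make the real error smaller). The function name is made up for the illustration.

```python
import math


def score_error_upper_bound(score_fraction, games):
    """Upper bound on the 1-sigma error of a mean score per game.

    A game result is 0, 0.5 or 1, so its variance is at most p * (1 - p),
    where p is the mean score. Draws only lower the true value.
    """
    return math.sqrt(score_fraction * (1.0 - score_fraction) / games)


# Houdini's final result from the standings above: 220.5 points in 400 games.
houdini = 220.5 / 400
se = score_error_upper_bound(houdini, 400)
print(f"score {houdini:.3%}, 1-sigma error at most {se:.3%}")
```

With a real draw rate from the PGN files the error would come out smaller, but the gaps between the Stockfish versions are well under one percentage point, so they are clearly inside the noise.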

Best regards from Barcelona.

Tom.
ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: no more tests?

Post by ouachita »

Nice piece of work Tom. Very surprising to see 6 month old versions of SF ahead of the latest. This is the 3rd or 4th time I said here, "yet another confirmation of H4's dominance at blitz or STC."
SIM, PhD, MBA, PE
Tomcass
Posts: 786
Joined: Sun Apr 16, 2006 9:09 pm

Re: no more tests?

Post by Tomcass »

ouachita wrote:Nice piece of work Tom. Very surprising to see 6 month old versions of SF ahead of the latest.
Thanks for your words, Bobby. All the Stockfish versions or compiles used in this test are less than one month old. I am not sure where you found that some of them are 6 months old. :roll:

Kind regards,

Tom.
ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: no more tests?

Post by ouachita »

We have/had an ID issue. Stockfish 041213 = April 12, 2013 to me. Best format is US military, 04Dec13.

In any case, my bad.
SIM, PhD, MBA, PE
Dr.Wael Deeb
Posts: 9773
Joined: Wed Mar 08, 2006 8:44 pm
Location: Amman,Jordan

Re: no more tests?

Post by Dr.Wael Deeb »

ouachita wrote:We have/had an ID issue. Stockfish 041213 = April 12, 2013 to me. Best format is US military, 04Dec13.

In any case, my bad.
Conclusion:

Tom must stick to the military framework in his tests :mrgreen:



:wink:
_No one can hit as hard as life.But it ain’t about how hard you can hit.It’s about how hard you can get hit and keep moving forward.How much you can take and keep moving forward….
ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: no more tests?

Post by ouachita »

Take the current version name, for example: 13120712. On its face, this designation causes great doubt as to the date stamp. Agree?

Tom uses a Stockfish 041213, presumably 04Dec13. However, referring to my example above, this SF version could be 13Dec07 or 2013Dec07 or ? LOL.
SIM, PhD, MBA, PE
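The date confusion in this thread is easy to reproduce. The sketch below tries three common layouts on a six-digit tag such as 041213 and prints every reading that is a valid calendar date, then writes each one in the 04Dec13 style suggested above. The layout labels and function names are made up for the illustration; nothing here says which layout a given compile actually uses.

```python
from datetime import datetime

# Candidate layouts for a six-digit tag; the labels are this sketch's own.
LAYOUTS = {
    "DDMMYY": "%d%m%y",
    "MMDDYY": "%m%d%y",
    "YYMMDD": "%y%m%d",
}

MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def possible_dates(tag):
    """Return every layout under which the tag parses as a real date."""
    readings = {}
    for label, fmt in LAYOUTS.items():
        try:
            readings[label] = datetime.strptime(tag, fmt).date()
        except ValueError:
            pass  # not a valid date under this layout
    return readings


def military(day):
    """Format a date as DDMonYY, e.g. 04Dec13, which cannot be misread."""
    return f"{day.day:02d}{MONTHS[day.month - 1]}{day.year % 100:02d}"


readings = possible_dates("041213")
for label, day in readings.items():
    print(f"{label}: {day.isoformat()} -> {military(day)}")
```

For 041213 all three layouts give a valid date, which is exactly why the tag was read as April 12 instead of December 4.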
Dr.Wael Deeb
Posts: 9773
Joined: Wed Mar 08, 2006 8:44 pm
Location: Amman,Jordan

Re: no more tests?

Post by Dr.Wael Deeb »

ouachita wrote:take the current version name for example: 13120712. On its face, this designation causes great doubt as to the date stamp. Agree?

Tom uses a Stockfish 041213, presumably 04Dec13. However, referring to my example above, this SF version could be 13Dec07 or 2013Dec07 or ? LOL.
Indeed, a big confusion, but not from Tom's side of the fence....

The official Stockfish page uses the exact format you are against :wink:
Dr.D
_No one can hit as hard as life.But it ain’t about how hard you can hit.It’s about how hard you can get hit and keep moving forward.How much you can take and keep moving forward….
ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: no more tests?

Post by ouachita »

Ok, I'll move forward with the current version having a date of 13Dec13, or 07Dec13, whatever!
SIM, PhD, MBA, PE