Stockfish 1.3 2 CPU vs Stockfish 1.3 2 CPU

mcostalba
Posts: 2684
Joined: Sat Jun 14, 2008 9:17 pm

Stockfish 1.3 2 CPU vs Stockfish 1.3 2 CPU

Post by mcostalba »

I ran an internal test aimed at detecting a possible SMP regression in the 1.3 release.

CPU: my Intel Core 2 Duo T5250, 1.50 GHz, 2 CPUs
GUI: Fritz 11 GUI
Book: hs-book.ctg
hash: 128 MB
Time control: 1' + 0" (999 games)

1   Stockfish 1.3 JA 2 +308 =467 -224 54.20%  541.5/999 +29 ELO
2   Stockfish 1.2 (default) JA +224 =467 -308 45.80%  457.5/999 -29 ELO

So the regression, if any, shows either only at longer time controls or only against other engines... but it does not show in the 1 CPU case, where we are instead stronger than 1.2 against other engines too. :shock:

I am puzzled by these numbers and would like to ask the testing experts for an opinion on them and on the corresponding CEGT results, which instead show a regression starting from the 2 CPU case.
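
For anyone who wants to check the figures in the table above, here is a minimal sketch of how the score percentage and Elo difference follow from the win/draw/loss counts. It assumes the usual logistic Elo formula and a normal approximation for the 95% error margin; the function name `match_stats` and the dictionary keys are made up for illustration and are not part of the Fritz GUI or of any CEGT tooling.

```python
import math


def match_stats(wins, draws, losses, z=1.96):
    """Score, Elo difference and an approximate 95% Elo interval.

    Assumptions (not taken from the original post): logistic Elo model,
    normal approximation for the error margin, games treated as independent.
    """
    games = wins + draws + losses
    points = wins + 0.5 * draws
    score = points / games

    def to_elo(s):
        # Logistic Elo model: expected score s <-> rating difference
        return 400.0 * math.log10(s / (1.0 - s))

    # Variance of a single game result (1, 0.5 or 0) around the mean score
    variance = (
        wins * (1.0 - score) ** 2
        + draws * (0.5 - score) ** 2
        + losses * (0.0 - score) ** 2
    ) / games
    std_err = math.sqrt(variance / games)

    return {
        "points": points,
        "score": score,
        "elo": to_elo(score),
        "elo_low": to_elo(score - z * std_err),
        "elo_high": to_elo(score + z * std_err),
    }


# Stockfish 1.3 result from the match above: +308 =467 -224
stats = match_stats(308, 467, 224)
print(f"{stats['points']}/999  {stats['score']:.2%}  "
      f"Elo {stats['elo']:+.0f} "
      f"[{stats['elo_low']:+.0f}, {stats['elo_high']:+.0f}]")
```

Under these assumptions the lower end of the interval stays above zero for this match, so the result is hard to square with a regression at this time control. It says nothing about longer time controls or other opponents.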

Thanks
Marco
swami
Posts: 6662
Joined: Thu Mar 09, 2006 4:21 am

Re: Stockfish 1.3 2 CPU vs Stockfish 1.3 2 CPU

Post by swami »

It doesn't make sense to run matches between two versions of an engine, one with a single CPU and the other with 2 CPUs. Better to try Stockfish 1 CPU vs Fruit and Stockfish 2 CPU vs Fruit and compare the difference in results.
mcostalba
Posts: 2684
Joined: Sat Jun 14, 2008 9:17 pm

Re: Stockfish 1.3 2 CPU vs Stockfish 1.2 2 CPU

Post by mcostalba »

swami wrote: It doesn't make sense to run matches between two versions of an engine, one with a single CPU and the other with 2 CPUs. Better to try Stockfish 1 CPU vs Fruit and Stockfish 2 CPU vs Fruit and compare the difference in results.
Perhaps I wasn't clear enough in my post. The title is actually wrong; it should have been:

Stockfish 1.3 2 CPU vs Stockfish 1.2 2 CPU


The match was between the 2 CPU versions of both engines, 1.2 and 1.3, run on an Intel dual core.

I agree that it makes more sense to test against different engines, but the point is that in the 1 CPU case CEGT _seems_ to show an improvement against other engines too, while in the 2 CPU case CEGT shows a regression not only against other engines _but_ also against version 1.2. That's why I tested directly against 1.2 in the 2 CPU case.
mcostalba
Posts: 2684
Joined: Sat Jun 14, 2008 9:17 pm

Re: Stockfish 1.3 2 CPU vs Stockfish 1.2 2 CPU

Post by mcostalba »

Just to be clearer, my unproven and possibly very wrong feeling is that Stockfish 1.2 is _underestimated_ in the 1 CPU case (where the difference from Glaurung 2.2 is too small) and more or less correctly estimated, or even overestimated, in the SMP case... 1.3 is perhaps a little underestimated in SMP (because its ELO is lower than that of 1.2, while it is very probably stronger in SMP too) and _hopefully_ :-) correctly estimated in the 1 CPU case.

This is of course just a very, very weak idea, and I trust the CEGT results much more than my "ideas".
Uri Blass
Posts: 10899
Joined: Thu Mar 09, 2006 12:37 am
Location: Tel-Aviv Israel

Re: Stockfish 1.3 2 CPU vs Stockfish 1.2 2 CPU

Post by Uri Blass »

mcostalba wrote:
swami wrote: It doesn't make sense to run matches between two versions of an engine, one with a single CPU and the other with 2 CPUs. Better to try Stockfish 1 CPU vs Fruit and Stockfish 2 CPU vs Fruit and compare the difference in results.
Perhaps I wasn't clear enough in my post. The title is actually wrong; it should have been:

Stockfish 1.3 2 CPU vs Stockfish 1.2 2 CPU


The match was between the 2 CPU versions of both engines, 1.2 and 1.3, run on an Intel dual core.

I agree that it makes more sense to test against different engines, but the point is that in the 1 CPU case CEGT _seems_ to show an improvement against other engines too, while in the 2 CPU case CEGT shows a regression not only against other engines _but_ also against version 1.2. That's why I tested directly against 1.2 in the 2 CPU case.
I do not see where you find that CEGT shows a regression against version 1.2, because I see no match of 1.3 against 1.2 in CEGT.

The only result that I can find of Stockfish against a previous version is

Stockfish 1.3 4 CPU vs Glaurung 2.2 4 CPU: +16 =30 -4

see
http://www.husvankempen.de/nunn/40_40%2 ... ns/65.html

Stockfish 1.3 2 CPU and Stockfish 1.3 1 CPU have no games against previous versions of Stockfish, even when I count Glaurung as a previous version.

see http://www.husvankempen.de/nunn/40_40%2 ... s/106.html

http://www.husvankempen.de/nunn/40_40%2 ... s/137.html

Uri