Eelco de Groot wrote:Thanks Swami and Dann for part two of your testsuite!
You're welcome, Eelco.
Eelco de Groot wrote:
A lot of work has gone into it I can see. I think this suite may be a little less hard to solve reasonably well than your first one?
Could be. But from my tests, It seems that some engines did better in undermining than in open files and diagonals. For example, Booot, Bright. I need to re-run the same engines in the undermining suite again to find out the differences. I have conducted this (open files and diagonals) test on nearly half of the engines available. I'd post the results soon.
Eelco de Groot wrote:At least I got some better results with 'Open files and diagonals', but for the first test they were very poor. Now I got just over 50% with Blueberry on the Athlon
At the same time the antivirus program decided it was a good time to start downloading and installing new virusdefinitions so I think Blueberry might have gotten one more solution right with some more time!
Well,considering it was run on Athlon, I'd think it's indeed a good result. These test suites are mainly designed for the engines <2700. Authors who are developing their engine from initial 1800 rating would find this useful. They could make tweaks, implement understanding etc in an attempt to improve its knowledge in specific test suites. I'd hope they could gain more elos via tweaks.
Strategy and understanding of positional patterns play the most important role in computer chess. For humans, it got to be undoubtedly tactics. Engines are generally considered good tacticians, what they lack is deep positional understanding. Some lack one concept of strategy, while they play better in others.
Eelco de Groot wrote:
Code: Select all
Engine: Blueberry Beta 4 DM70 Build 421 (Athlon 2009 MHz, 128 MB)
by F. Letouzey, T. Gaksch, E. de Groot
Right until now: 52 of 100 ; 20:51m
20 seconds per position, fixed time. Only first solutions counted.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
-------------------------------------------------------------------------------------
0 | - 0 1 - - - - - - - - 10 - - - 16 - 4 - -
20 | - - 0 3 2 - 11 2 - 0 - - 0 1 - 0 17 - 16 0
40 | 18 1 2 - 9 0 0 1 6 - - - - 3 - 0 - - - 4
60 | - 1 - 7 - - 15 - 3 0 14 14 - 3 3 0 0 - 2 -
80 | 0 - - - 1 - 6 - 17 7 - 17 3 - 9 0 5 14 - 0
1 sec -> 15/100
2 sec -> 21/100
3 sec -> 25/100
4 sec -> 31/100
5 sec -> 33/100
6 sec -> 34/100
7 sec -> 36/100
8 sec -> 38/100
9 sec -> 38/100
10 sec -> 40/100
11 sec -> 41/100
12 sec -> 42/100
13 sec -> 42/100
14 sec -> 42/100
15 sec -> 45/100
20 sec -> 52/100
n/s: 554.997
TotTime: 33:29m SolTime: 20:51m
Eelco
This is very interesting test, from 1sec to 20 sec, It does highly leap forward when given more time. I thought of running this kind of test whenever the update announcement of an engine is posted onto the forum. Work on the third test suite is underway. Perhaps we could release it in 2 months time.