I was reading the chessprogramming wiki, and read about the "simplified eval" -
http://chessprogramming.wikispaces.com/ ... n+function
and Tomasz Michniewski's idea of doing a "unified eval" test tournament, where every participant would use the same basic eval detailed on that page.
I find the idea really interesting, and seems to make sense, too.
Did that tournament actually happen? I can find no documentation of it on Google.
If not, is anyone interested in doing something like that?
The simplified eval is really simple, and should not take more than 15 minutes or so for engine authors to implement.
Unified eval tournament?
Moderator: Ras
-
- Posts: 1808
- Joined: Wed Mar 08, 2006 9:19 pm
- Location: Oslo, Norway
Re: Unified eval tournament?
It did happen, but unfortunately only six programs participated. You can see the results and download the games here.cyberfish wrote:I was reading the chessprogramming wiki, and read about the "simplified eval" -
http://chessprogramming.wikispaces.com/ ... n+function
and Tomasz Michniewski's idea of doing a "unified eval" test tournament, where every participant would use the same basic eval detailed on that page.
I find the idea really interesting, and seems to make sense, too.
Did that tournament actually happen? I can find no documentation of it on Google.
The tournament was the first ever appearance of Glaurung 2, which I had started working on about one month earlier. Unlike the other programs, Glaurung was playing without a book, which explains the numerous weird opening lines.
I would certainly be interested.If not, is anyone interested in doing something like that?
Tord
Re: Unified eval tournament?
Ah thanks!
We just need to get a few more people now...
I just implemented the simplified eval in my engine, and in ~2 seconds (limited depth) games, it's 52-72 elo points weaker.
I was worrying about what am I going to do if the "simplified" eval turns out to be better than my own
. Fortunately that didn't happen.
We just need to get a few more people now...
I just implemented the simplified eval in my engine, and in ~2 seconds (limited depth) games, it's 52-72 elo points weaker.
Code: Select all
Rank Name Elo + - games score oppo. draws
1 Brainless 09-20 64-bit 31 5 5 6536 58% -31 9%
2 Brainless 09-SIM 64-bit -31 5 5 6536 42% 31 9%

-
- Posts: 908
- Joined: Mon Jan 15, 2007 11:23 am
- Location: Warsza
Re: Unified eval tournament?
Actually, set of pcsq tables produced by Tomasz Michniewski is an impressive achievement - it allows a program to play decent game, though it hates flank openings and displays ludicrously high scores.
Perhaps the latter can be helped by constructing tables where
score[sq] = ( score[sq] * 4 ) / 5
or even score[sq] = ( score[sq] * 3 ) / 5
can anyone test it?
I'd like to participate as well (the early version of Hopeless lost the first UFO tournament quite badly), but I'm not likely to do *any* programming before March, since I'm busy with a grant for cataloguing pre-1600 manuscripts in Kraków, and I'm far away from any MSVC compiler.
regards
Pawel Koziol
Perhaps the latter can be helped by constructing tables where
score[sq] = ( score[sq] * 4 ) / 5
or even score[sq] = ( score[sq] * 3 ) / 5
can anyone test it?
I'd like to participate as well (the early version of Hopeless lost the first UFO tournament quite badly), but I'm not likely to do *any* programming before March, since I'm busy with a grant for cataloguing pre-1600 manuscripts in Kraków, and I'm far away from any MSVC compiler.
regards
Pawel Koziol
Pawel Koziol
http://www.pkoziol.cal24.pl/rodent/rodent.htm
http://www.pkoziol.cal24.pl/rodent/rodent.htm
Re: Unified eval tournament?
Just multiplying all the table by 4/5 and 3/5? I can certainly test that.
-
- Posts: 10892
- Joined: Thu Mar 09, 2006 12:37 am
- Location: Tel-Aviv Israel
Re: Unified eval tournament?
I think that it may be also interesting to do also simplified search tournament.cyberfish wrote:I was reading the chessprogramming wiki, and read about the "simplified eval" -
http://chessprogramming.wikispaces.com/ ... n+function
and Tomasz Michniewski's idea of doing a "unified eval" test tournament, where every participant would use the same basic eval detailed on that page.
I find the idea really interesting, and seems to make sense, too.
Did that tournament actually happen? I can find no documentation of it on Google.
If not, is anyone interested in doing something like that?
The simplified eval is really simple, and should not take more than 15 minutes or so for engine authors to implement.
The idea is that everybody use the same simple search function but people are free to change the evaluation.
Uri
-
- Posts: 28388
- Joined: Fri Mar 10, 2006 10:06 am
- Location: Amsterdam
- Full name: H G Muller
Re: Unified eval tournament?
This 'simplified' eval is in some respects even more complicated than micro-Max' eval, as micro-Max does not use piece-square tables, but only a single centralization table shared by Pawns, minor pieces and King. (Rooks and Queen move neutrally over the board.) This table is
Pawns get a bonus for being pushed, and for reaching 6th and 7th rank, though, which could have been implemented as a piece-square table. But the bonuses for Pawns on the 6th and 7th rank proposed on the Wiki page seem way too low. What micro-Max uses, translated to the Wiki piece values, would be more like 100 on 6th rank and 180 on the 7th, rather than ~20 and 50.
Aggressive handling of Pawns is the main source of strength for micro-Max, and allows it to regularly beat engines that outseaarch it and out-evaluate it in all other respects. This includes a very keen sense for when to start pushing its Pawns, which is implemented by making the Pawn-push bonus dependent on the amount of non-pawn material on the board. The evaluation proposed in the Wiki does allow for the principle of using a different piece-square table in a different game stage (used for King), and I think it would greatly add to the strength if it also did this for Pawns. The currently proposed PST is good for the middle game, but the effective freezing of Pawns on a2, b2 and g2, h2 is really very detrimental in the end-game. Moves like b2-b4 should get +10 there (which seems even low in case this was a passer), not -10.
Code: Select all
0 5 12 15 16 15 12 5
6 11 18 21 22 21 18 11
10 15 22 25 26 25 22 15
12 17 24 27 28 27 24 17
12 17 24 27 28 27 24 17
10 15 22 25 26 25 22 15
6 11 18 21 22 21 18 11
0 5 12 15 16 15 12 5
Aggressive handling of Pawns is the main source of strength for micro-Max, and allows it to regularly beat engines that outseaarch it and out-evaluate it in all other respects. This includes a very keen sense for when to start pushing its Pawns, which is implemented by making the Pawn-push bonus dependent on the amount of non-pawn material on the board. The evaluation proposed in the Wiki does allow for the principle of using a different piece-square table in a different game stage (used for King), and I think it would greatly add to the strength if it also did this for Pawns. The currently proposed PST is good for the middle game, but the effective freezing of Pawns on a2, b2 and g2, h2 is really very detrimental in the end-game. Moves like b2-b4 should get +10 there (which seems even low in case this was a passer), not -10.
-
- Posts: 1808
- Joined: Wed Mar 08, 2006 9:19 pm
- Location: Oslo, Norway
Re: Unified eval tournament?
That's far less than I would have thought. What does your evaluation contain, apart from material and piece square tables?cyberfish wrote:Ah thanks!
We just need to get a few more people now...
I just implemented the simplified eval in my engine, and in ~2 seconds (limited depth) games, it's 52-72 elo points weaker.
I just finished a quick Silver match between the normal version of my program and an otherwise identical version with the evaluation function replaced by Toasz Michniewski's piece square table evaluation:
Code: Select all
Glaurung 090122: 86.5 (+81,=11,-8)
Glaurung UFO 090122: 13.5 (+8,=11,-81)
-
- Posts: 10892
- Joined: Thu Mar 09, 2006 12:37 am
- Location: Tel-Aviv Israel
Re: Unified eval tournament?
I think that your simple evaluation for micromax is not simple in terms of time of implementing it in a new program because the evaluation is not dependent only on the position on the board but also on what happened earlier.hgm wrote:This 'simplified' eval is in some respects even more complicated than micro-Max' eval, as micro-Max does not use piece-square tables, but only a single centralization table shared by Pawns, minor pieces and King. (Rooks and Queen move neutrally over the board.) This table is
Pawns get a bonus for being pushed, and for reaching 6th and 7th rank, though, which could have been implemented as a piece-square table. But the bonuses for Pawns on the 6th and 7th rank proposed on the Wiki page seem way too low. What micro-Max uses, translated to the Wiki piece values, would be more like 100 on 6th rank and 180 on the 7th, rather than ~20 and 50.Code: Select all
0 5 12 15 16 15 12 5 6 11 18 21 22 21 18 11 10 15 22 25 26 25 22 15 12 17 24 27 28 27 24 17 12 17 24 27 28 27 24 17 10 15 22 25 26 25 22 15 6 11 18 21 22 21 18 11 0 5 12 15 16 15 12 5
Aggressive handling of Pawns is the main source of strength for micro-Max, and allows it to regularly beat engines that outseaarch it and out-evaluate it in all other respects. This includes a very keen sense for when to start pushing its Pawns, which is implemented by making the Pawn-push bonus dependent on the amount of non-pawn material on the board. The evaluation proposed in the Wiki does allow for the principle of using a different piece-square table in a different game stage (used for King), and I think it would greatly add to the strength if it also did this for Pawns. The currently proposed PST is good for the middle game, but the effective freezing of Pawns on a2, b2 and g2, h2 is really very detrimental in the end-game. Moves like b2-b4 should get +10 there (which seems even low in case this was a passer), not -10.
Uri
Re: Unified eval tournament?
Sure, entertaining concept and I'd be interested.
Such a tournament could be played online (I guess most authors have an ICC account) or simply in one's own basement.
I'll upload my version with it somewhere once I'm done.
Stan
Such a tournament could be played online (I guess most authors have an ICC account) or simply in one's own basement.
I'll upload my version with it somewhere once I'm done.
Stan