In order to achieve fairness in engine testing, I believe that a GUI would have to duplicate the exact same opening book position when colors are reversed.
Otherwise it can happen that A, playing white, gets a superior opening position from the the book, against B who is playing black. When the colors are reversed with B playing white, it could happen that the new opening position is neutral. In this scenario A has a distinct advantage over B in the 2 game test.
Is there a GUI out there that can remember opening book moves so that when the colors are reversed, the 2nd engine starts off in the 2nd game where the 1st engine did in the 1st game? If not, I hope to see this feature in the future.
Opening book fairness in testing
Moderators: hgm, Rebel, chrisw
-
- Posts: 1056
- Joined: Thu Mar 09, 2006 4:15 pm
- Location: Long Island, NY, USA
-
- Posts: 9773
- Joined: Wed Mar 08, 2006 8:44 pm
- Location: Amman,Jordan
Re: Opening book fairness in testing
The ChessBase GUI supports this feature....
_No one can hit as hard as life.But it ain’t about how hard you can hit.It’s about how hard you can get hit and keep moving forward.How much you can take and keep moving forward….
-
- Posts: 27809
- Joined: Fri Mar 10, 2006 10:06 am
- Location: Amsterdam
- Full name: H G Muller
Re: Opening book fairness in testing
The way people usually do this is by first generating the opening lines (e.g. by playing two book-engines against each other) and writing them to file (PGN). Then you can do symmetric testing from that file, asking the GUI or Tournament Manager to play each opening twice, with reversed colors. The advantage of this method is that you later can use the same collection of lines for other engines, that you wanted to compare to the current ones as well. (E.g. a modified version of one of the two against the same opponent.)
-
- Posts: 1154
- Joined: Fri Jun 23, 2006 5:18 am
Re: Opening book fairness in testing
I do this in Arena by using the starting positions as the end of "book".Norm Pollock wrote:In order to achieve fairness in engine testing, I believe that a GUI would have to duplicate the exact same opening book position when colors are reversed.
Otherwise it can happen that A, playing white, gets a superior opening position from the the book, against B who is playing black. When the colors are reversed with B playing white, it could happen that the new opening position is neutral. In this scenario A has a distinct advantage over B in the 2 game test.
Is there a GUI out there that can remember opening book moves so that when the colors are reversed, the 2nd engine starts off in the 2nd game where the 1st engine did in the 1st game? If not, I hope to see this feature in the future.
-Sam
-
- Posts: 6401
- Joined: Thu Mar 09, 2006 8:30 pm
- Location: Chicago, Illinois, USA
Re: Opening book fairness in testing
I agree!SzG wrote:I believe that is a mistaken conception. What if the position favours the style and understanding of one of the opponents? The colours may be reversed but the advantage remains...Norm Pollock wrote:In order to achieve fairness in engine testing, I believe that a GUI would have to duplicate the exact same opening book position when colors are reversed.
Otherwise it can happen that A, playing white, gets a superior opening position from the the book, against B who is playing black. When the colors are reversed with B playing white, it could happen that the new opening position is neutral. In this scenario A has a distinct advantage over B in the 2 game test.
I think the most fair procedure is to have them play totally random lines in each game as long as the number of games is large enough. Then luck will even out.
If the choice of openings is as random as possible (whatever random means in this case), reversing the colors should not be needed and should be discouraged. In fact, playing the same position with white and black may generate a non-independent result between those two games. Any statistical analysis is valid if the games are as independent as possible.
The easiest way to see this, as you mentioned, is a position that benefits the style of one engine. I think I am experiencing that with the current tests I am running with my engine Gaviota.
On the other hand, reversing the colors may not be a bad idea if you want to detect weaknesses in your engine, running always the same starting positions. With one of the nunn positions, Gaviota lost almost every single time with black and white. I found that one evaluation term was badly tuned. After removing it, this behavior was fixed for this particular position (it does not improve the overall strength, because the real solution may be to tune the parameter rather than remove it). This is just to show an extreme case of this behavior.
Miguel
-
- Posts: 4610
- Joined: Wed Oct 01, 2008 6:33 am
- Location: Regensburg, Germany
- Full name: Guenther Simon
Re: Opening book fairness in testing
There is still a statistical error and no fairness, if a bad opening is repeated twice, because this only helps the weaker program.Norm Pollock wrote:In order to achieve fairness in engine testing, I believe that a GUI would have to duplicate the exact same opening book position when colors are reversed.
Otherwise it can happen that A, playing white, gets a superior opening position from the the book, against B who is playing black. When the colors are reversed with B playing white, it could happen that the new opening position is neutral. In this scenario A has a distinct advantage over B in the 2 game test.
Is there a GUI out there that can remember opening book moves so that when the colors are reversed, the 2nd engine starts off in the 2nd game where the 1st engine did in the 1st game? If not, I hope to see this feature in the future.
Instead of the 'normal' 0:2 or 0.5:1.5 now it gets probably a 1:1 result, just because of a repeated bad line.
Guenther
-
- Posts: 41455
- Joined: Sun Feb 26, 2006 10:52 am
- Location: Auckland, NZ
Re: Opening book fairness in testing
ChessGUI also supports it.Dr.Wael Deeb wrote:The ChessBase GUI supports this feature....
gbanksnz at gmail.com
Re: Opening book fairness in testing
And assuming I've understood correctly, so does Shredder GUI - come to think of it, is there any GUI that *doesn't* support this ?Graham Banks wrote:ChessGUI also supports it.Dr.Wael Deeb wrote:The ChessBase GUI supports this feature....