Kempelen wrote:Rebel wrote:I like to present a page for starters how to test a chess engine with limited hardware. I am interested in some feedback for further improvement.
http://www.top-5000.nl/tuning.htm
Hi Ed,
Very valuable an nice insides in your page. Thanks you very much.
Reading it without going into deep think, I came with a few questions for you?
Hi Fermin,
a) At the end of the page you provide a set of pgns that all sum 6060 different possitions. I suppose you dont play all positions against all opponents and two side, beacuase it will be, assuming 6 opponent, 72720 games, and that are a lot which I suspect you dont play them. So, do you choose them randomly?? or what exactly are you doing?
95% of the time I use the 8 x 500 sets. The others (endgame) related are meant for specific endgame changes. For instance, when you make a change in your (say) rook-ending code the effect will be hardly measurable playing from middle game positions, 95% (or so) is just noise. So I will use the ROOK PGN's as a foretaste. If results are good a normal match should confirm that.
b) You mention the disadventage of playing eng-eng matches, being you loose "control" over the style of the program. This is one aspect of my engine I have always missed and would like to improve. One way is "manally one by one" see games and tune by hand, but do you think is there a mix method, or an authomatic one that could take also eng-eng result to improve result and style in the same time? I said this because I suspect only hand tune is not enought to make a reasonable strong engine even if it has a fantastic style.
That's an excellent question. There is this myth: the playing style of a chess program reflects the playing style of the programmer. A myth with probably some truth. If you as a human chess player like playing king attacks (who doesn't?) **and** with a little gambling you will tend to program your evaluation that way with large code and high values. Personally I like the Karpov balanced playing style and I feel my engine plays that way. From the programs I know from the past Chess System Tal was a reckless king attacker resulting in fascinating games with high peeks but also with deep falls. Another example is "The King", its specialty: sound R vs BP (or R vs NP) sacrifices.
As to your second question I have an example to offer. Before I got interested again in chess programming last year and started to join the circus of playing thousands of games my King Safety base parameter was set to 150%. This since the late 90's. I like my King Safety this way because it has little problems with manoeuvres like 1.Rf3 2.Rh3 3.Qh5 all very natural. However tuning this parameter last year a value of 75 performs clearly better in eng-eng matches, that's a division by 2 (!!). Manoeuvres like 1.Rf3 2.Rh3 3.Qh5 need much more depth to find now and speculative (but sound 90% of the cases) sacrifices are almost gone. This clearly is a loss in attractiveness.