Robert Hyatt
Joined: 27 Feb 2006 Posts: 15820 Location: Birmingham, AL
|
Post subject: Re: What your opinion about this testing methodology? Posted: Wed Apr 18, 2012 7:28 pm |
|
|
| Sven Schüle wrote: |
| Kempelen wrote: |
| Ferdy wrote: |
Your methodology is favorable considering your goal. There is no point testing 1. b3 e5 positions if it is not in your engines repertoire. Only use test positions where you want your engine to be. Of course there are drawbacks, but that can be overcomed as you say using large number of selected positions.
Perhaps start from smaller number of positions, and as your engine able to improve from it, then add other positions, to be considered in its repertoire.
But I have a bad feeling about it, to me the engine should be able to handle all positions, it can be blocked, open, full of pinned pieces, etc. |
I think you misunderstand my idea. The goal is not test only a limited set of opening positions, but a large and varied set of starting middle-game positions. The point is repeating always the same games with the same positions, but enought positions to say the engine is played a varied. |
The point is, the positions are selected once by random but then always the same positions are used for testing. That's exactly what Bob is doing for a long while now, and also lots of other people, so it is not a new method but kind of "de facto standard". I recall there were long discussions about the details few years ago. Doing it that way instead of newly choosing different positions by random each time has been found to result in lower error bars as far as I remember. I guess Bob and the other experts in statistics can explain the exact reasons.
Sven |
It really doesn't influence the BayesElo error bar, obviously, but if you choose random openings, you introduce one more variable, as one run might choose openings you play well, and the next uses openings you play poorly, producing a false impression. |
|