Daniel, I am not afraid of _anything_. So please get that chip off your shoulder, grow up a bit, and stop this nonsense. If, you had read Beal's paper, before diving into a discussion you knew _absolutely_ nothing about, most of this would not be taking place. But since you did no research, just formed an uninformed opinion and dove in, you are going to make a ton of mistakes.Daniel Shawul wrote:Oh so now you are afraid of the result so you start complaining it should be 30 min per game, so that it would take decades. People are not stupid and will see the light at the end of the tunnel !
Just specify your conditions here so that we will get this crap done and dusted once and for all.
I am reporting what Beal found. I have also found the same thing. And others that are using Crafty skill=1 for their rating lists are finding the _same_ thing. So lose the chip, pay attention, do the research, and participate usefully. I doubt, at the moment, that you could even recognize the tunnel, much less the light at the end of it, since you have done no research at all.
My conditions are
(a) purely random eval, exactly as is done in Crafty, except that I only go down to 1% real 99% random, while in recent tests I have been using 100% random.
(b) reasonable games, not game in one second or 5 seconds. This effect does depend on reasonable depth, as Beal reported and I have mentioned several times now. I have even explained that my "best fix" so far has been to add a cpu-burning loop in evaluate to slow the search down, which reduces the depth. Because less depth reduces the "Beal effect."
Given those things, you ought to be able to make this work, and run a real test. Once you figure out how many games you need. You complain about 67-0 which is an expected result, in fact. So until you have some idea of what this is all about, there's no need to run any tests or anything else, except to study Beal's paper, the previous discussions in the general forum, and then set about testing once you understand how to actually run a meaningful test. Hint: 67 games is _not_ meaningful between two opponents this different in strength. But most of us understand that already and don't jump into a discussion half-cocked and not knowing what to expect.