Milos wrote:CRoberson wrote: I've run a two program match of 100 games and the first program wins all of the first 10 games. The next day when the games are done, the score is 80 to 20 in favor of the 2nd program. The first only won 20 and half of them were the first 10 games.
Again you prove you do not know the elementary statistics.
Assuming that the final result is 80:20 for engine B against engine A, this means chance for engine A to win against engine B is 20%.
Chance that engine A wins any 10 games in a row is 0.2^10=0.00001%. Chance to win
first 10 games of the match is even lower.
In other words you will not see it in your life, even if you spend it whole testing just these 2 engines.
First, I said the score is 80 to 20. That doesn't mean that the other 10 games were won. Second, the fact that the score is 80 to 20 in 100
games does not mean that is an exact reflection of the the two programs relative strengths.
Now, for your misunderstanding of your math. Here is a real life true example. I have 2 children and only 2. They were born on the same
day exactly 2 years apart. By your logic, I would have to have almost 365 children for that to happen, because the odds of 2
siblings being born on the same day 2 years apart are 1 in 365 (not 1 in 365^2). I only have 2 children and they were born on the same day exactly 2 years apart.
You claim the odds are 1 in 100,000 (0.00001%) to see an example of what I claim. Statistics like that only tell how often it happens, not when something is
going to happen. As in the example of my children's births, it is a statistical fallacy to assume that you have to go through the full
range of events before the unlikely event happens.
Also, you have not made any effort to find out my background in computer chess before you called me a liar. So, you
don't know how long I've been doing it or what are the odds that I have seen it. You are just talking about a single instance random chance. To calculate the odds that I have seen it, requires knowledge of how often I test programs and for how long I have
been doing it. You must factor all of that into your calculations before using the math to call me a liar.
I have been doing work computer games for around 18 years. I have posts in rec.games.chess dating back to somewhere between
1991 and 1993. In 18 years, that comes to an average of 15 tests of 100 games per day to have completed 100,000 tests in 18 years.
That is to run 100,000 tests. As I pointed out, you don't always need to run 100,000 tests to see the 1 test in 100,000. But, lets say
that on average I tested once per day at 100 games per test. That puts the odds at 1 in 5 (20%) that I have seen it. This is far better
odds than you quoted. So, it is theoretically possible. One in five are not terrible odds for something like this.
I notice that you did not ask what TC I use for testing before you did some superficial math.
Now, to the facts:
Given that I can run 400 games in about 8 hours on just one of my computers, that means I can easily run 400 games per day or up to
1200 games per day on that computer. (Yes, I have more than one. If you don't believe that, just ask anybody that has attended the
Pan American Computer Chess Championships in the last 3 years. I can name names if you like - they are all members of this forum. If you don't believe them, I have pictures on the web.) At 1200 games per day, that is 12 tests of 100 per day. At that rate, I
can complete 4,380 tests per year on 1 machine. That would be about 10,000 tests in the last two years. That would mean on something
where the odds are 1 in 100,000 that there is a 1 in 10 chance that I have seen it. This is far better than the stats that you did and did, by
far, too quickly and with too little info. How did I come up with 1 in 10 odds. Well, 10,000 tests times the probability (as you stated)
of 1 in 100,000 gives odds of 10,000/100,000 = 1/10. Also, I have more than one computer. Sometimes, I have run all of them for
weeks at a time to test something. Even with just 2 computers, the odds are doubled that I have seen it which puts it to 2 x 1/10 = 1/5.
You must admit, 1 in 5 odds are not very bad for this. The odds are better than that as I have more than 2 computers. Again,
provable via personal references or pictures that already exist on the web.
So, I have a question for you. Do you still contend that I am lying about this?