10 min + 5 sec engines ratinglist

Discussion of computer chess matches and engine tournaments.

Moderator: Ras

kranium
Posts: 2130
Joined: Thu May 29, 2008 10:43 am

Re: 10 min. + 5 sec. engines rating list.

Post by kranium »

lkaufman wrote: In the past, when I've noticed that someone running tests on AMD machines (for example Ingo/Ipon) gets worse ratings for Komodo relative to Houdini than on either my Intel tests or others (CEGT at same time limit as IPON or myself), we've checked relative nodes per second and invariably the ratio in favor of Houdini is significantly higher (ballpark of ten percent) on AMD. So roughly running on AMD costs Komodo about ten elo vs. Houdini compared to running on Intel.
lkaufman wrote: I can't prove that it is unwise to punch a gorilla, but it is a pretty safe bet. I have never seen any test results that make me doubt this, and I've seen quite a lot. it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
Larry-
yes of course I doubt it...NPS and ELO are not directly proportional

for ex:
Komodo TCEC x64 NPS increases approx. 340% from 1 CPU to 4 CPU
http://www.fastgm.de/schach/table-facto ... hreads.png

according to your logic then, it should be 340 ELO stronger?!
User avatar
michiguel
Posts: 6401
Joined: Thu Mar 09, 2006 8:30 pm
Location: Chicago, Illinois, USA

Re: 10 min. + 5 sec. engines rating list.

Post by michiguel »

kranium wrote:
lkaufman wrote: In the past, when I've noticed that someone running tests on AMD machines (for example Ingo/Ipon) gets worse ratings for Komodo relative to Houdini than on either my Intel tests or others (CEGT at same time limit as IPON or myself), we've checked relative nodes per second and invariably the ratio in favor of Houdini is significantly higher (ballpark of ten percent) on AMD. So roughly running on AMD costs Komodo about ten elo vs. Houdini compared to running on Intel.
lkaufman wrote: I can't prove that it is unwise to punch a gorilla, but it is a pretty safe bet. I have never seen any test results that make me doubt this, and I've seen quite a lot. it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
Larry-
yes of course I doubt it...NPS and ELO are not directly proportional

for ex:
Komodo TCEC x64 NPS increases approx. 340% from 1 CPU to 4 CPU
http://www.fastgm.de/schach/table-facto ... hreads.png

according to your logic then, it should be 340 ELO stronger?!
Yes, they are directly proportional for small values.

A good experimental approximation to delta elo (dE) vs speed is
dE = a * ln (S1/S0)

in general, a ~ 100 but it could change in different conditions
where S1 = S0 + dS, then

dE = a * ln ((S0+dS)/S0)
dE = a * ln (1 + dS/S0)

where dS/S0 is the relative increase in speed = dRI

dE = a * ln (1 + dRI)

it could be demonstrated that ln (1 + x) ~ x, as long as x is much smaller than 1. (see http://en.wikipedia.org/wiki/Mercator_series)

Then

dE ~ a * dRI

Which is valid as smaller dRI are used
if a = 100, then 1% of speed (dRI = 0.01) equals 1 extra elo, and that is proportional for small values.

Miguel
CRoberson
Posts: 2094
Joined: Mon Mar 13, 2006 2:31 am
Location: North Carolina, USA

Re: 10 min. + 5 sec. engines rating list.

Post by CRoberson »

Modern Times wrote:
lkaufman wrote: it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
That simply doesn't follow.

But your argument is the reverse. You are saying say 100 Elo for 100% speedup, so therefore 10% speedup must be 10% Elo. I very much doubt that is the case. 10% speedup could just be 1 Elo for example. To me it is probably a stepped line, the Elo increase corresponding with the point at which the engine starts to reach higher depths as a result.

There is zero evidence that Elo scales in a perfectly linear fashion with speedup increases. I've never seen any proper tests of it.
Over the decades, there has been quite a bit of evidence to that effect. It is super easy to test. Take two engines A and B and test them for 100 games at 40/1. Now, test them again with A using 40/1 and B using 40/2. Now, test again with A at 40/1 and B using 40/4. Cutechess has the ability to do these matches. Also, pick A and B such that Elo(A) ~= Elo(B).

Also, the line may be different if you pick two engines at 2000 Elo vs two engines at 3000 Elo.
Modern Times
Posts: 3806
Joined: Thu Jun 07, 2012 11:02 pm

Re: 10 min. + 5 sec. engines rating list.

Post by Modern Times »

That would be a useful test yes, but 100 games doesn't give any confidence to me. Maybe 1,000 or much more and then we might have something to talk about. If I knew how to use cutechess I'd run such a test, but I've never used it.
Also, the line may be different if you pick two engines at 2000 Elo vs two engines at 3000 Elo.
The you are disagreeing with Larry then. The way he talks, it is the same straight line connecting the two end points for every engine.
CRoberson
Posts: 2094
Joined: Mon Mar 13, 2006 2:31 am
Location: North Carolina, USA

Re: 10 min. + 5 sec. engines rating list.

Post by CRoberson »

Modern Times wrote:That would be a useful test yes, but 100 games doesn't give any confidence to me. Maybe 1,000 or much more and then we might have something to talk about. If I knew how to use cutechess I'd run such a test, but I've never used it.
Also, the line may be different if you pick two engines at 2000 Elo vs two engines at 3000 Elo.
The you are disagreeing with Larry then. The way he talks, it is the same straight line connecting the two end points for every engine.
No. I am not disagreeing with him. I am just giving you a way to test the idea.
lkaufman
Posts: 6284
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: 10 min. + 5 sec. engines rating list.

Post by lkaufman »

kranium wrote:
lkaufman wrote: In the past, when I've noticed that someone running tests on AMD machines (for example Ingo/Ipon) gets worse ratings for Komodo relative to Houdini than on either my Intel tests or others (CEGT at same time limit as IPON or myself), we've checked relative nodes per second and invariably the ratio in favor of Houdini is significantly higher (ballpark of ten percent) on AMD. So roughly running on AMD costs Komodo about ten elo vs. Houdini compared to running on Intel.
lkaufman wrote: I can't prove that it is unwise to punch a gorilla, but it is a pretty safe bet. I have never seen any test results that make me doubt this, and I've seen quite a lot. it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
Larry-
yes of course I doubt it...NPS and ELO are not directly proportional

for ex:
Komodo TCEC x64 NPS increases approx. 340% from 1 CPU to 4 CPU
http://www.fastgm.de/schach/table-facto ... hreads.png

according to your logic then, it should be 340 ELO stronger?!
I said to use the log (natural) of the speed ratio, not the ratio itself. See Miguel's respons. Also it doesn't work like this for MP, since many nodes are redundantly searched. Speedup must be one that doesn't change the search, like faster hardware or better coding, for this to apply.
kranium
Posts: 2130
Joined: Thu May 29, 2008 10:43 am

Re: 10 min. + 5 sec. engines rating list.

Post by kranium »

michiguel wrote:
kranium wrote:
lkaufman wrote: In the past, when I've noticed that someone running tests on AMD machines (for example Ingo/Ipon) gets worse ratings for Komodo relative to Houdini than on either my Intel tests or others (CEGT at same time limit as IPON or myself), we've checked relative nodes per second and invariably the ratio in favor of Houdini is significantly higher (ballpark of ten percent) on AMD. So roughly running on AMD costs Komodo about ten elo vs. Houdini compared to running on Intel.
lkaufman wrote: I can't prove that it is unwise to punch a gorilla, but it is a pretty safe bet. I have never seen any test results that make me doubt this, and I've seen quite a lot. it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
Larry-
yes of course I doubt it...NPS and ELO are not directly proportional

for ex:
Komodo TCEC x64 NPS increases approx. 340% from 1 CPU to 4 CPU
http://www.fastgm.de/schach/table-facto ... hreads.png

according to your logic then, it should be 340 ELO stronger?!
Yes, they are directly proportional for small values.

A good experimental approximation to delta elo (dE) vs speed is
dE = a * ln (S1/S0)

in general, a ~ 100 but it could change in different conditions
where S1 = S0 + dS, then

dE = a * ln ((S0+dS)/S0)
dE = a * ln (1 + dS/S0)

where dS/S0 is the relative increase in speed = dRI

dE = a * ln (1 + dRI)

it could be demonstrated that ln (1 + x) ~ x, as long as x is much smaller than 1. (see http://en.wikipedia.org/wiki/Mercator_series)

Then

dE ~ a * dRI

Which is valid as smaller dRI are used
if a = 100, then 1% of speed (dRI = 0.01) equals 1 extra elo, and that is proportional for small values.

Miguel
ln (1 + x) ~ x, as long as x is much smaller than 1
using extremely small values (for ex: < 1) to approach results that approximate equivalence does not mean the relationship is directly proportional
that's cherry picking

the natural logarithm function is a curve
http://en.wikipedia.org/wiki/Natural_logarithm

any graph of a function that is directly proportional is a straight line (except inverse proportion of course)
Last edited by kranium on Mon May 26, 2014 2:15 pm, edited 1 time in total.
kranium
Posts: 2130
Joined: Thu May 29, 2008 10:43 am

Re: 10 min. + 5 sec. engines rating list.

Post by kranium »

lkaufman wrote:
kranium wrote:
lkaufman wrote: In the past, when I've noticed that someone running tests on AMD machines (for example Ingo/Ipon) gets worse ratings for Komodo relative to Houdini than on either my Intel tests or others (CEGT at same time limit as IPON or myself), we've checked relative nodes per second and invariably the ratio in favor of Houdini is significantly higher (ballpark of ten percent) on AMD. So roughly running on AMD costs Komodo about ten elo vs. Houdini compared to running on Intel.
lkaufman wrote: I can't prove that it is unwise to punch a gorilla, but it is a pretty safe bet. I have never seen any test results that make me doubt this, and I've seen quite a lot. it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
Larry-
yes of course I doubt it...NPS and ELO are not directly proportional

for ex:
Komodo TCEC x64 NPS increases approx. 340% from 1 CPU to 4 CPU
http://www.fastgm.de/schach/table-facto ... hreads.png

according to your logic then, it should be 340 ELO stronger?!
I said to use the log (natural) of the speed ratio, not the ratio itself. See Miguel's respons. Also it doesn't work like this for MP, since many nodes are redundantly searched. Speedup must be one that doesn't change the search, like faster hardware or better coding, for this to apply.

how do you arrive at 10% = 10 ELO, as per in your quotes above
the natural logarithm of 10 is 2.30
lkaufman
Posts: 6284
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: 10 min. + 5 sec. engines rating list.

Post by lkaufman »

kranium wrote:
lkaufman wrote:
kranium wrote:
lkaufman wrote: In the past, when I've noticed that someone running tests on AMD machines (for example Ingo/Ipon) gets worse ratings for Komodo relative to Houdini than on either my Intel tests or others (CEGT at same time limit as IPON or myself), we've checked relative nodes per second and invariably the ratio in favor of Houdini is significantly higher (ballpark of ten percent) on AMD. So roughly running on AMD costs Komodo about ten elo vs. Houdini compared to running on Intel.
lkaufman wrote: I can't prove that it is unwise to punch a gorilla, but it is a pretty safe bet. I have never seen any test results that make me doubt this, and I've seen quite a lot. it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
Larry-
yes of course I doubt it...NPS and ELO are not directly proportional

for ex:
Komodo TCEC x64 NPS increases approx. 340% from 1 CPU to 4 CPU
http://www.fastgm.de/schach/table-facto ... hreads.png

according to your logic then, it should be 340 ELO stronger?!
I said to use the log (natural) of the speed ratio, not the ratio itself. See Miguel's respons. Also it doesn't work like this for MP, since many nodes are redundantly searched. Speedup must be one that doesn't change the search, like faster hardware or better coding, for this to apply.

how do you arrive at 10% = 10 ELO, as per in your quotes above
the natural logarithm of 10 is 2.30
Please read more carefully. I said the natural log of the ratio, which in this case is 1.1. Then multiply by 100, since the assumption is that 1% = one elo. What do you get? Then do it for a ratio of 0.9. Take the average. This is the value of a ten percent speedup averaging the two possible meanings of a 10% speedup. For example, if we run twice as fast, is that a 100% speedup or a 50% reduction in time?
kranium
Posts: 2130
Joined: Thu May 29, 2008 10:43 am

Re: 10 min. + 5 sec. engines rating list.

Post by kranium »

lkaufman wrote:
kranium wrote:
lkaufman wrote:
kranium wrote:
lkaufman wrote: In the past, when I've noticed that someone running tests on AMD machines (for example Ingo/Ipon) gets worse ratings for Komodo relative to Houdini than on either my Intel tests or others (CEGT at same time limit as IPON or myself), we've checked relative nodes per second and invariably the ratio in favor of Houdini is significantly higher (ballpark of ten percent) on AMD. So roughly running on AMD costs Komodo about ten elo vs. Houdini compared to running on Intel.
lkaufman wrote: I can't prove that it is unwise to punch a gorilla, but it is a pretty safe bet. I have never seen any test results that make me doubt this, and I've seen quite a lot. it's really just common sense. If a one percent speedup is worth one elo point, then ten of them should be worth ten elo. The assumption is that elo gains are additive, which seems to be universally true for engines. Do you doubt this?
Larry-
yes of course I doubt it...NPS and ELO are not directly proportional

for ex:
Komodo TCEC x64 NPS increases approx. 340% from 1 CPU to 4 CPU
http://www.fastgm.de/schach/table-facto ... hreads.png

according to your logic then, it should be 340 ELO stronger?!
I said to use the log (natural) of the speed ratio, not the ratio itself. See Miguel's respons. Also it doesn't work like this for MP, since many nodes are redundantly searched. Speedup must be one that doesn't change the search, like faster hardware or better coding, for this to apply.

how do you arrive at 10% = 10 ELO, as per in your quotes above
the natural logarithm of 10 is 2.30
Please read more carefully. I said the natural log of the ratio, which in this case is 1.1. Then multiply by 100, since the assumption is that 1% = one elo. What do you get? Then do it for a ratio of 0.9. Take the average. This is the value of a ten percent speedup averaging the two possible meanings of a 10% speedup. For example, if we run twice as fast, is that a 100% speedup or a 50% reduction in time?
ok now I understand how you arrived at 10 ELO...

of course any assumptions in your method may add error
for ex: Ray says he measured it here and got only 2%
Modern Times wrote:Well, I have done the nps comparison here on an Intel Sandybridge, and an AMD Phenom II X6.
The 10% difference doesn't exist here. It is 2%. So that dispels the theory about Komodo being disadvantaged on AMD anyway.
But in any event, thx much for explaining it

my apologies to Andrev if our discussion took his thread off-topic