Another attempt at comparing Evals ELO-wise

Lyudmil Tsvetkov · Post by **Lyudmil Tsvetkov** » Thu May 25, 2017 9:48 am

Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.

Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.

Your memory is good.

a single ply search is also a search.

ever seen an engine playing with just assessing the root position?

Rebel · Post by **Rebel** » Thu May 25, 2017 11:07 am

Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.

Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.

Your memory is good.
a single ply search is also a search.

ever seen an engine playing with just assessing the root position?

It seems to me you are still under the impression that the fun you made out of 2 positions you posted are the result of a bad full evaluation, isn't it? If so you are wrong.

Lyudmil Tsvetkov · Post by **Lyudmil Tsvetkov** » Thu May 25, 2017 2:15 pm

elpapa wrote:
Laskos wrote:
SzG wrote:Please! Elo!
Well, I understand that Arpad Elo is a Hungarian
Let's just be glad his last name wasn't Oberknezsevics.

I would be happy, if someone offers me for free 10 000 Euro, but I also do not mind an offer of 10 000 Euros, 10 000 euros, or 10 000 EUROS.

Lyudmil Tsvetkov · Post by **Lyudmil Tsvetkov** » Thu May 25, 2017 2:17 pm

Rebel wrote:
Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.

Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.

Your memory is good.
a single ply search is also a search.

ever seen an engine playing with just assessing the root position?
It seems to me you are still under the impression that the fun you made out of 2 positions you posted are the result of a bad full evaluation, isn't it? If so you are wrong.

what do you mean full?
lazy eval, or pruning at root?

Laskos · Post by **Laskos** » Thu May 25, 2017 3:06 pm

A similar approach is to use very short "nodestime", time (of order of ms) to equal desired number of nodes, proportional to engine's speed. I used that on STS 1500 testsuite:

Code: Select all

#  PLAYER             STS  nodes

1. Komodo 11.01       865   5911
2. Deep Shredder 13   848   6446
3. Stockfish dev      815   6421
4. Andscacs 0.91      799   5413
5. Houdini 5          730   8901
6. Fruit 2.1          643   5203

This result is very similar to OP result.

Isaac · Post by **Isaac** » Thu May 25, 2017 3:27 pm

Hello Tsvetkov, I think it doesn't make much sense to compare an engine X to an engine Y at fixed depth like you did. That's because the definition of depth differs between engines (it is an iteration of something different according to which engine we consider). There are other more meaningful ways to compare engines.
Some programmer guru correct me if I'm wrong.

Rebel · Post by **Rebel** » Thu May 25, 2017 6:50 pm

Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.

Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.

Your memory is good.
a single ply search is also a search.

ever seen an engine playing with just assessing the root position?
It seems to me you are still under the impression that the fun you made out of 2 positions you posted are the result of a bad full evaluation, isn't it? If so you are wrong.
what do you mean full?
lazy eval, or pruning at root?

You are getting close, there are more explanations.

abulmo2 · Post by **abulmo2** » Thu May 25, 2017 7:48 pm

Laskos wrote:A similar approach is to use very short "nodestime", time (of order of ms) to equal desired number of nodes, proportional to engine's speed. I used that on STS 1500 testsuite:
Code: Select all
#  PLAYER             STS  nodes

1. Komodo 11.01       865   5911
2. Deep Shredder 13   848   6446
3. Stockfish dev      815   6421
4. Andscacs 0.91      799   5413
5. Houdini 5          730   8901
6. Fruit 2.1          643   5203
This result is very similar to OP result.

Are the nodes count for the 1500 positions, or an average for a single position?

Laskos · Post by **Laskos** » Thu May 25, 2017 8:10 pm

abulmo2 wrote:
Laskos wrote:A similar approach is to use very short "nodestime", time (of order of ms) to equal desired number of nodes, proportional to engine's speed. I used that on STS 1500 testsuite:
Code: Select all
#  PLAYER             STS  nodes

1. Komodo 11.01       865   5911
2. Deep Shredder 13   848   6446
3. Stockfish dev      815   6421
4. Andscacs 0.91      799   5413
5. Houdini 5          730   8901
6. Fruit 2.1          643   5203
This result is very similar to OP result.
Are the nodes count for the 1500 positions, or an average for a single position?

Average for one position on 1500 positions.

Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise

Re: Another attempt at comparing Evals ELO-wise