Another attempt at comparing Evals ELO-wise

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Another attempt at comparing Evals ELO-wise

Post by Lyudmil Tsvetkov »

Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.
Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.
:lol:

Your memory is good.
a single ply search is also a search.

ever seen an engine playing with just assessing the root position?
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: Another attempt at comparing Evals ELO-wise

Post by Rebel »

Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.
Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.
:lol:

Your memory is good.
a single ply search is also a search.

ever seen an engine playing with just assessing the root position?
It seems to me you are still under the impression that the fun you made out of 2 positions you posted are the result of a bad full evaluation, isn't it? If so you are wrong.
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Another attempt at comparing Evals ELO-wise

Post by Lyudmil Tsvetkov »

elpapa wrote:
Laskos wrote:
SzG wrote:Please! Elo!
Well, I understand that Arpad Elo is a Hungarian
Let's just be glad his last name wasn't Oberknezsevics.
I would be happy, if someone offers me for free 10 000 Euro, but I also do not mind an offer of 10 000 Euros, 10 000 euros, or 10 000 EUROS.
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

Re: Another attempt at comparing Evals ELO-wise

Post by Lyudmil Tsvetkov »

Rebel wrote:
Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.
Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.
:lol:

Your memory is good.
a single ply search is also a search.

ever seen an engine playing with just assessing the root position?
It seems to me you are still under the impression that the fun you made out of 2 positions you posted are the result of a bad full evaluation, isn't it? If so you are wrong.
what do you mean full?
lazy eval, or pruning at root?
User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: Another attempt at comparing Evals ELO-wise

Post by Laskos »

A similar approach is to use very short "nodestime", time (of order of ms) to equal desired number of nodes, proportional to engine's speed. I used that on STS 1500 testsuite:

Code: Select all

#  PLAYER             STS  nodes

1. Komodo 11.01       865   5911
2. Deep Shredder 13   848   6446
3. Stockfish dev      815   6421
4. Andscacs 0.91      799   5413
5. Houdini 5          730   8901
6. Fruit 2.1          643   5203
This result is very similar to OP result.
Isaac
Posts: 265
Joined: Sat Feb 22, 2014 8:37 pm

Re: Another attempt at comparing Evals ELO-wise

Post by Isaac »

Hello Tsvetkov, I think it doesn't make much sense to compare an engine X to an engine Y at fixed depth like you did. That's because the definition of depth differs between engines (it is an iteration of something different according to which engine we consider). There are other more meaningful ways to compare engines.
Some programmer guru correct me if I'm wrong.
User avatar
Rebel
Posts: 6991
Joined: Thu Aug 18, 2011 12:04 pm

Re: Another attempt at comparing Evals ELO-wise

Post by Rebel »

Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote:
Rebel wrote:
Lyudmil Tsvetkov wrote: I hate personalising the thread, so it is better to stop somewhere here.

you said that I should not be criticising the SF concept/model, that I do not understand.
what is so difficult to understand? recursively reaching depth 0, SF starts doing quiescence search, as any other engine around, to only evaluate 'quiet' positions with no available captures, checks or other relevant threat moves.

without quiescence search and depth 1 SF probably plays around 1500 elo
without quiescence search and depth 10 SF probably plays around 2000 elo
with qs and depth 1 probably around same 2000 elo
with qs and depth 10 around 2500 elo, etc.

so, basically, qs should be worth at least around 500-1000 elo alone.

etc., etc., etc., too boring to discuss when minds do not quite meet.
??

SF is a search engine with an excellent eval. What we are discussing is the eval part while you constantly want to involve search into the discussion.
Lyudmil Tsvetkov wrote:I guess the best thing you could currently do is offer another engine that would play 200 elo stronger, but only on Fridays.
:lol:

Your memory is good.
a single ply search is also a search.

ever seen an engine playing with just assessing the root position?
It seems to me you are still under the impression that the fun you made out of 2 positions you posted are the result of a bad full evaluation, isn't it? If so you are wrong.
what do you mean full?
lazy eval, or pruning at root?
You are getting close, there are more explanations.
abulmo2
Posts: 433
Joined: Fri Dec 16, 2016 11:04 am
Location: France
Full name: Richard Delorme

Re: Another attempt at comparing Evals ELO-wise

Post by abulmo2 »

Laskos wrote:A similar approach is to use very short "nodestime", time (of order of ms) to equal desired number of nodes, proportional to engine's speed. I used that on STS 1500 testsuite:

Code: Select all

#  PLAYER             STS  nodes

1. Komodo 11.01       865   5911
2. Deep Shredder 13   848   6446
3. Stockfish dev      815   6421
4. Andscacs 0.91      799   5413
5. Houdini 5          730   8901
6. Fruit 2.1          643   5203
This result is very similar to OP result.
Are the nodes count for the 1500 positions, or an average for a single position?
Richard Delorme
User avatar
Laskos
Posts: 10948
Joined: Wed Jul 26, 2006 10:21 pm
Full name: Kai Laskos

Re: Another attempt at comparing Evals ELO-wise

Post by Laskos »

abulmo2 wrote:
Laskos wrote:A similar approach is to use very short "nodestime", time (of order of ms) to equal desired number of nodes, proportional to engine's speed. I used that on STS 1500 testsuite:

Code: Select all

#  PLAYER             STS  nodes

1. Komodo 11.01       865   5911
2. Deep Shredder 13   848   6446
3. Stockfish dev      815   6421
4. Andscacs 0.91      799   5413
5. Houdini 5          730   8901
6. Fruit 2.1          643   5203
This result is very similar to OP result.
Are the nodes count for the 1500 positions, or an average for a single position?
Average for one position on 1500 positions.