Manually tuned evaluation

maksimKorzh · Post by **maksimKorzh** » Sun Dec 27, 2020 1:36 pm

Hi Guys

Recently I've discovered this article on talkchess:
https://www.chessprogramming.org/Simpli ... n_Function
(strange that it didn't happen 5 years ago)

And I loved it at first sight because of the explanations on how for instance avoid
bishop captures knight on c3/f3 and at the same time force bishop captures knight on c6/f6.
I love the harmony of piece weights and PST values in Tomash's implementation.

However it performs purely - my engine with average depth 8 in blitz horribly looses to TSCP (average depth 6)
almost all the game because of malformed pawn structure and especially passed pawns.
"Simplified evaluation" above assumes NOT to use additional eval params which is totally clear
because it would ruin the harmony of interacting values (I even tried that - total disaster)

I tried simple pawn evaluation with my bitboard engine earlier (this time I have 0x88 one)
and the biggest problem was tuning... Blindly guessing bonuses and penalties for pawns
ended up that my material + PST only version was about 100 Elo points stronger compared
to material + PST + isolated + doubled + passed pawns one. Some extra disaster occured probably
due to the fact that I used PSTs from PeSTO chess engine and bearing in mind they were already texel tuned
to serve as the only eval adding new features probably did the same effect as adding new eval features to "simplified eval"

Ok, enough lyrics, so first let me say what I want to AVOID at any cost:
1. Other but material + PST eval params (because of need to tune them all together)
2. Automated tuning (too complicated, too slow)

Now what I want to achieve is:
1. material + PST eval only
2. incorporate passed pawn eval into PST values

Recalling my previous experience with PeSTO tables I clearly remember that using
tapered eval is very beneficial, but is there a way to tune the following params manually:
1. Middle game material margin (say one minor piece exchanged)
2. Endgame material margin (say no queens or queen + 1 minor piece from each side)
3. pure opening piece values
4. pure endgame piece values
5. pure opening PST values
6. pure endgame PST values

By saying manually I mean using a method similar to the one explained in "simplified evaluation" article.
Maybe there are other not-automated tuning methods are available?

Dann Corbit · Post by **Dann Corbit** » Sun Dec 27, 2020 2:47 pm

For a really simple yet effective evaluation look at olithink.
Olithink emphasizes mobility.
I guess that for the number of lines of code, with the formula:

Elo/L=N

Olithink would have the largest N. I did not measure it, but it seems logical to me since it is quite strong an incredibly small.
OK, a program with zero lines has the largest N, but I am excluding degenerate cases.

jdart · Post by **jdart** » Sun Dec 27, 2020 3:40 pm

There is lots of stuff on evaluation tuning in this forum, see for example this thread:
http://talkchess.com/forum3/viewtopic.php?f=7&t=75234. Most people are automating it now. You can try changing values manually and see how that performs but you are not likely to find optimal values that way.

maksimKorzh · Post by **maksimKorzh** » Sun Dec 27, 2020 4:31 pm

The ironic fact is that you're pointing to one of the threads I've started before)

maksimKorzh · Post by **maksimKorzh** » Sun Dec 27, 2020 4:33 pm

Dann Corbit wrote: ↑Sun Dec 27, 2020 2:47 pm For a really simple yet effective evaluation look at olithink.
Olithink emphasizes mobility.
I guess that for the number of lines of code, with the formula:

Elo/L=N

Olithink would have the largest N. I did not measure it, but it seems logical to me since it is quite strong an incredibly small.
OK, a program with zero lines has the largest N, but I am excluding degenerate cases.

Thanks Dann, totally forgot about OliThink!

Ferdy · Post by **Ferdy** » Sun Dec 27, 2020 4:40 pm

maksimKorzh wrote: ↑Sun Dec 27, 2020 1:36 pm Hi Guys

Recently I've discovered this article on talkchess:
https://www.chessprogramming.org/Simpli ... n_Function
(strange that it didn't happen 5 years ago)

And I loved it at first sight because of the explanations on how for instance avoid
bishop captures knight on c3/f3 and at the same time force bishop captures knight on c6/f6.
I love the harmony of piece weights and PST values in Tomash's implementation.

However it performs purely - my engine with average depth 8 in blitz horribly looses to TSCP (average depth 6)
almost all the game because of malformed pawn structure and especially passed pawns.
"Simplified evaluation" above assumes NOT to use additional eval params which is totally clear
because it would ruin the harmony of interacting values (I even tried that - total disaster)

I tried simple pawn evaluation with my bitboard engine earlier (this time I have 0x88 one)
and the biggest problem was tuning... Blindly guessing bonuses and penalties for pawns
ended up that my material + PST only version was about 100 Elo points stronger compared
to material + PST + isolated + doubled + passed pawns one. Some extra disaster occured probably
due to the fact that I used PSTs from PeSTO chess engine and bearing in mind they were already texel tuned
to serve as the only eval adding new features probably did the same effect as adding new eval features to "simplified eval"

Ok, enough lyrics, so first let me say what I want to AVOID at any cost:
1. Other but material + PST eval params (because of need to tune them all together)
2. Automated tuning (too complicated, too slow)

Now what I want to achieve is:
1. material + PST eval only
2. incorporate passed pawn eval into PST values

Recalling my previous experience with PeSTO tables I clearly remember that using
tapered eval is very beneficial, but is there a way to tune the following params manually:
1. Middle game material margin (say one minor piece exchanged)
2. Endgame material margin (say no queens or queen + 1 minor piece from each side)
3. pure opening piece values
4. pure endgame piece values
5. pure opening PST values
6. pure endgame PST values

By saying manually I mean using a method similar to the one explained in "simplified evaluation" article.
Maybe there are other not-automated tuning methods are available?

You aware of Larry's article?

Ferdy · Post by **Ferdy** » Sun Dec 27, 2020 5:13 pm

maksimKorzh wrote: ↑Sun Dec 27, 2020 1:36 pm Now what I want to achieve is:
1. material + PST eval only
2. incorporate passed pawn eval into PST values

Recalling my previous experience with PeSTO tables I clearly remember that using
tapered eval is very beneficial, but is there a way to tune the following params manually:
1. Middle game material margin (say one minor piece exchanged)
2. Endgame material margin (say no queens or queen + 1 minor piece from each side)
3. pure opening piece values
4. pure endgame piece values
5. pure opening PST values
6. pure endgame PST values

By saying manually I mean using a method similar to the one explained in "simplified evaluation" article.
Maybe there are other not-automated tuning methods are available?

Two years ago I wrote an article on piece value calculation based on mobility. It is not limited to FIDE piece types only, it can also handle musketeer piece types.

Ferdy · Post by **Ferdy** » Sun Dec 27, 2020 5:27 pm

There is an article regarding rebel program by Ed. I am influenced by this writing as I develop Deuterium.

maksimKorzh · Post by **maksimKorzh** » Sun Dec 27, 2020 5:47 pm

Ferdy wrote: ↑Sun Dec 27, 2020 5:27 pm There is an article regarding rebel program by Ed. I am influenced by this writing as I develop Deuterium.

Thank you so much for such an amazing resources!
It's exactly the format of what I was looking for!

maksimKorzh · Post by **maksimKorzh** » Sun Dec 27, 2020 5:58 pm

Ferdy wrote: ↑Sun Dec 27, 2020 5:27 pm There is an article regarding rebel program by Ed. I am influenced by this writing as I develop Deuterium.

Also I'm a bit confused with the fact that author of 3000+ engine is taking his time to play around with my 1600 engine)
Just curious why are you interested?

Manually tuned evaluation

Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation

Re: Manually tuned evaluation