Different eval for white/black

matthewlai · Post by **matthewlai** » Mon Jan 05, 2015 2:40 pm

Random idea of the day -

Has anyone experimented with using different eval functions for white vs black?

When we use the same eval for both, we are assuming the opponent thinks just like we do. That's not a valid assumption if the opponent has a very different playing style, for example.

What if we can tune an eval function to behave more like the opponent, and use it to decide on moves as the opponent?

One possible way to implement this is to apply 2 eval functions on leaf nodes, and propagate the scores back up as a pair.

On plies where it's the program's move, we take the max of the scores of our own eval, and on plies where it's the opponent's move, we take the max of the scores of the eval that is supposed to model the opponent.

I believe most/all current optimizations should still work correctly.

elpapa · Post by **elpapa** » Mon Jan 05, 2015 3:27 pm

So the goal is to improve the engine vs. one specific opponent?

matthewlai · Post by **matthewlai** » Mon Jan 05, 2015 3:29 pm

elpapa wrote:So the goal is to improve the engine vs. one specific opponent?

Yes.

Unless the "opponent eval" can be tuned on the fly based on what the opponent has played so far.

But what I had in mind is tuning against a specific opponent beforehand.

bob · Post by **bob** » Mon Jan 05, 2015 9:07 pm

matthewlai wrote:
elpapa wrote:So the goal is to improve the engine vs. one specific opponent?
Yes.

Unless the "opponent eval" can be tuned on the fly based on what the opponent has played so far.

But what I had in mind is tuning against a specific opponent beforehand.

This should be doable. But it represents a LOT of work for a minimal gain. For example, when you attend the next WCCC with a "tuned program" all you can tune against is existing versions. But most participants will show up with code that has some new ideas included, which means you are now playing against a different opponent than the one you tuned against. Against static programs it would certainly work, but it does represent an enormous computational cost.

sje · Post by **sje** » Tue Jan 06, 2015 6:47 am

bob wrote:This should be doable. But it represents a LOT of work for a minimal gain. For example, when you attend the next WCCC with a "tuned program" all you can tune against is existing versions. But most participants will show up with code that has some new ideas included, which means you are now playing against a different opponent than the one you tuned against. Against static programs it would certainly work, but it does represent an enormous computational cost.

GM level human players do this by preparing opening repertoires targeted to opponents based on prior opponent play. This once provided a big tactical advantage in those years long past when most games went twenty to thirty ply deep with prepared opening moves on both sides. Today with huge opening databases which force diverse play, the idea doesn't work so well.

Having different positional evaluations or search customizations based on opponent modeling sounds like a big bug magnet; tough to implement and tougher to debug. Also, it would be very difficult to accurately measure results against human players -- how do you expect any human to play a thousand or more games needed for testing?

mcostalba · Post by **mcostalba** » Tue Jan 06, 2015 8:49 am

matthewlai wrote:Random idea of the day -

Has anyone experimented with using different eval functions for white vs black?

Stockfish used two have the famous Aggressiveness and Cowardice UCI parameters, they did exactly this: different king safety evaluation according if you were the side to move (at root) or not. The two parameters were target one at the side to move and the other at the defending side.

They have been removed because testing proved them useless...but a lot of people claimed and still claim because we remove from their hands their preferred toy knob

Michel · Post by **Michel** » Tue Jan 06, 2015 9:44 am

They have been removed because testing proved them useless...

"Useless" is not an appropriate term for a technical forum. It means completely different things to different people.

In this context it means : "Has not been proven to yield an elo gain when measured from the starting position."

EDIT: Needless to say that I am personally happy that these asymmetric eval terms were removed.

mcostalba · Post by **mcostalba** » Tue Jan 06, 2015 11:03 am

Michel wrote:
They have been removed because testing proved them useless...

"Useless" is not an appropriate term for a technical forum. It means completely different things to different people.

In this context it means : "Has not been proven to yield an elo gain when measured from the starting position."

EDIT: Needless to say that I am personally happy that these asymmetric eval terms were removed.

I used "useless" like "your post is useless"....

Michel · Post by **Michel** » Tue Jan 06, 2015 11:28 am

I used "useless" like "your post is useless"....

Can you clarify?

hgm · Post by **hgm** » Tue Jan 06, 2015 2:18 pm

mcostalba wrote:..., they did exactly this: different king safety evaluation according if you were the side to move (at root) or not.

Unless I misunderstood the OP this is not what he proposed at all. In his idea both evaluations are always made in every leaf, irrespective of who has the move in the root, and both propagated towards the root based on the chosen move. It is the side to move at the current ply level that decides which of the two evaluations will be used to determine which move is best.

I wonder how badly this interferes with alpha-beta pruning, however. If side A takes a beta cutoff, his score would be a lower bound, because there are unsearched moves that might score higher. There is no guarantee at all that the opponent (B)'s score associated with that move is a lower bound. One of the unsearched moves might have a better A-score, but a lower B-score (both from A point of view), so that the true B-score of the node would actually be lower.

Different eval for white/black

Different eval for white/black

Re: Different eval for white/black

Re: Different eval for white/black

Re: Different eval for white/black

Opening preperation

Re: Different eval for white/black

Re: Different eval for white/black

Re: Different eval for white/black

Re: Different eval for white/black

Re: Different eval for white/black