tuning for the uninformed

jdart · Post by **jdart** » Sun Dec 03, 2017 4:47 pm

If your calculus is rusty, you can use this online tool:

https://www.derivative-calculator.net/

I handle midgame/endgame weighting as follows: each parameter has an entry in an array that is a structure and includes how it should be scaled.

Each time a gradient is updated, I call a scale method that applies the scaling factor for the associated parameter.
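To make the idea concrete, here is a minimal sketch of per-parameter scaling. It is not the code from Arasan's tune.cpp: the names `Scaling`, `TuneParam`, `scale` and `updateGradient` are hypothetical, and the sketch assumes a game phase value running from 0.0 (pure endgame) to 1.0 (pure midgame).

```cpp
#include <cstddef>
#include <vector>

// Hypothetical scaling kinds; illustrative only, not Arasan's definitions.
enum class Scaling { None, Midgame, Endgame };

// One entry per tunable parameter, recording how its gradient is scaled.
struct TuneParam {
    const char* name;
    Scaling scaling;
};

// Weight applied to a parameter's gradient contribution for one position.
// Assumption: phase is 0.0 for a pure endgame and 1.0 for a pure midgame.
double scale(const TuneParam& p, double phase) {
    switch (p.scaling) {
        case Scaling::Midgame: return phase;
        case Scaling::Endgame: return 1.0 - phase;
        case Scaling::None:    return 1.0;
    }
    return 1.0;  // not reached; keeps compilers quiet
}

// Add one position's contribution to the gradient of every parameter.
// raw[i] is the unscaled partial derivative for parameter i.
void updateGradient(std::vector<double>& grad,
                    const std::vector<TuneParam>& params,
                    const std::vector<double>& raw,
                    double phase) {
    for (std::size_t i = 0; i < params.size(); ++i) {
        grad[i] += raw[i] * scale(params[i], phase);
    }
}
```

Keeping the scaling rule in the parameter table means the gradient loop does not need to know which parameters are midgame or endgame terms.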

See tune.cpp and specifically the scale method here:

https://github.com/jdart1/arasan-chess

--Jon

fierz · Post by **fierz** » Tue Dec 12, 2017 10:17 pm

Is there anyone who has a large epd file for download somewhere with the results of the games?

cheers
Martin

AlvaroBegue · Post by **AlvaroBegue** » Tue Dec 12, 2017 11:47 pm

fierz wrote: Is there anyone who has a large epd file for download somewhere with the results of the games?

Give this a try: https://bitbucket.org/alonamaloh/ruy_tu ... th_results

jdart · Post by **jdart** » Thu Dec 14, 2017 4:09 pm

I have done something similar.

It has occurred to me, though, that the current tuning method would work just the same if each position had a winning probability, not just a 0, 0.5, 1 result. So if you really want to burn a lot of CPU cycles, you could run a match of say, 10 games with each position. The result would be a training set with more accurate labels.
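As a rough sketch of that idea: the label for a position becomes the mean outcome of its games, and the squared-error term is unchanged. The function names `labelFromGames` and `squaredError` are made up for illustration, and the fallback of 0.5 for a position with no games is an assumption.

```cpp
#include <vector>

// Average the outcomes of several games played from one position.
// Each outcome is from White's point of view: 1.0 win, 0.5 draw, 0.0 loss.
double labelFromGames(const std::vector<double>& outcomes) {
    if (outcomes.empty()) return 0.5;  // assumption: no games means no preference
    double sum = 0.0;
    for (double r : outcomes) sum += r;
    return sum / static_cast<double>(outcomes.size());
}

// Squared-error term for one position; valid for any label in [0, 1],
// not only for the three game results 0, 0.5 and 1.
double squaredError(double label, double predicted) {
    const double diff = label - predicted;
    return diff * diff;
}
```

For example, two wins, a draw and a loss give a label of 2.5 / 4 = 0.625.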

--Jon

AlvaroBegue · Post by **AlvaroBegue** » Thu Dec 14, 2017 4:15 pm

jdart wrote: I have done something similar.

It has occurred to me, though, that the current tuning method would work just the same if each position had a winning probability, not just a 0, 0.5, 1 result. So if you really want to burn a lot of CPU cycles, you could run a match of say, 10 games with each position. The result would be a training set with more accurate labels.

--Jon

I don't have a strong argument for why I think this, but I think if you had 10 times the CPU budget to spare, you would be better off having 10 times as many positions labelled by a single game.

flok · Post by **flok** » Thu May 31, 2018 11:20 am

AlvaroBegue wrote: ↑Tue Nov 28, 2017 8:37 pm I downloaded games from CCRL, took positions from those games and analyzed them with my program RuyDos. I saved positions on which the evaluation function was being called after searching 1000 nodes. I then labelled each position by running one very quick SF8-vs-SF8 game.

https://bitbucket.org/alonamaloh/ruy_tu ... th_results

EDIT: In that file each position has been replaced by the position from which quiescence search got its score.

In that file, we can see for example:

6k1/8/p4P1B/3b4/1pp3B1/7P/1R4P1/6K1 b - - 1-0

Now, does that 1-0 mean a win for white or for black? I'm asking because it is black to move in this position.

AlvaroBegue · Post by **AlvaroBegue** » Thu May 31, 2018 11:17 pm

1-0 means a win for white. I use the same format as the "Result" tag in PGN.
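A minimal sketch of reading such a line, assuming the result is always the last whitespace-separated token and uses the PGN result strings "1-0", "0-1" and "1/2-1/2". The function name `parseLabelledLine` is hypothetical; it is not taken from RuyTune.

```cpp
#include <cstddef>
#include <string>

// Split a line such as "<position fields> 1-0" into the position text and a
// label from White's point of view (1.0 win, 0.5 draw, 0.0 loss).
// Returns false if the last token is not one of the three results above.
bool parseLabelledLine(const std::string& line, std::string& position, double& label) {
    const std::size_t end = line.find_last_not_of(" \t\r\n");
    if (end == std::string::npos) return false;   // empty or blank line
    const std::size_t sep = line.find_last_of(" \t", end);
    if (sep == std::string::npos) return false;   // only one token
    const std::string result = line.substr(sep + 1, end - sep);

    double value;
    if (result == "1-0") value = 1.0;             // White won
    else if (result == "0-1") value = 0.0;        // Black won
    else if (result == "1/2-1/2") value = 0.5;    // draw
    else return false;

    // Drop the whitespace between the position fields and the result token.
    const std::size_t posEnd = line.find_last_not_of(" \t", sep);
    if (posEnd == std::string::npos) return false;
    position = line.substr(0, posEnd + 1);
    label = value;
    return true;
}
```

The label does not depend on the side to move: for the example line above it is 1.0 even though it is black's turn.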

flok · Post by **flok** » Wed Jun 26, 2019 6:21 pm

Something I forgot to ask:

In

totalError += pow(value_from_fen - calculateSigmoid(eval_score), 2);

value_from_fen is 1.0 for 1-0, 0.0 for 0-1, and so on.

But should eval_score be from the point of view of the side to move in the FEN string, from white's point of view, or something else?

Sven · Post by **Sven** » Wed Jun 26, 2019 6:45 pm

flok wrote: ↑Wed Jun 26, 2019 6:21 pm Something I forgot to ask:

for
totalError += pow(value_from_fen - calculateSigmoid(eval_score), 2);

value_from_fen = 1.0 for 1-0, 0.0 for 0-1 and so on

but eval_score, should it be from the point of view of the fen-string? or from white? or...?

Obviously the sigmoid function (and therefore also eval_score) must represent a value from the same viewpoint as value_from_fen. Otherwise you would get a high "error", e.g. if value_from_fen = 1, it is black's turn, and your eval function sees a large advantage for white, so that the sigmoid function, taken from black's viewpoint, returns a value close to 0.
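One way to keep the viewpoints consistent, sketched under a few assumptions: the labels are from white's viewpoint (as in the 1-0 = white wins convention above), the eval returns a score relative to the side to move, and the sigmoid has the logistic form often used for this kind of tuning. The constant K is a placeholder that has to be fitted per engine, and `toWhiteRelative` and `positionError` are hypothetical helper names.

```cpp
#include <cmath>

// Logistic curve mapping a centipawn score to an expected result in (0, 1).
// Assumption: this particular form and the default K = 1.0 are placeholders.
double calculateSigmoid(double evalScore, double K = 1.0) {
    return 1.0 / (1.0 + std::pow(10.0, -K * evalScore / 400.0));
}

// Convert a score relative to the side to move into a score relative to White.
double toWhiteRelative(double evalSideToMove, bool whiteToMove) {
    return whiteToMove ? evalSideToMove : -evalSideToMove;
}

// Squared error for one position, with label and prediction both from
// White's point of view.
double positionError(double valueFromFen, double evalSideToMove, bool whiteToMove) {
    const double whiteScore = toWhiteRelative(evalSideToMove, whiteToMove);
    const double diff = valueFromFen - calculateSigmoid(whiteScore);
    return diff * diff;
}
```

With black to move, a white win (value_from_fen = 1.0) and a side-to-move score of -300, the converted score is +300 and the error is small. Skipping the conversion would feed -300 to the sigmoid and produce a large error for a position the eval actually judged correctly.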

flok · Post by **flok** » Wed Jun 26, 2019 7:03 pm

Sven wrote: ↑Wed Jun 26, 2019 6:45 pm
flok wrote: ↑Wed Jun 26, 2019 6:21 pm Something I forgot to ask:

for
totalError += pow(value_from_fen - calculateSigmoid(eval_score), 2);

value_from_fen = 1.0 for 1-0, 0.0 for 0-1 and so on

but eval_score, should it be from the point of view of the fen-string? or from white? or...?
Obviously the sigmoid function (and therefore also eval_score) must represent a value from the same viewpoint as value_from_fen. Otherwise you would get a high "error", e.g. if value_from_fen = 1, it is black's turn, and your eval function sees a large advantage for white, so that the sigmoid function, taken from black's viewpoint, returns a value close to 0.

OK, that is what I thought, to be honest, but since my results are currently garbage, I wondered whether I had done it wrong.

Thanks!
