Note that I do not intend a final floating point formula.
I just formulated it that way to find the optimal values.
Taking ideas is not a vice, it is a virtue. We have another word for this. It is called learning.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.
Taking ideas is not a vice, it is a virtue. We have another word for this. It is called learning.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.
I use a STC/LTC test like Stockfish and the gauntlet test is at fast blitz (not hyper-bullet) TC). Will queue up this test but right now it is testing a version of probcut.
The Tune params array is for eval parameters that can be tuned via gradient descent. Doesn't work for search parameters, at least not in the same way.
I have tried the following version, which requires only integer math. This passed STC and LTC (0:60+0.6) self-play tests but failed to improve the gauntlet results (-17 ELO at 2:30+1). Possibly a different parameter set would do better.
I guess your results are about the same as mine, then.
I only tried self play, at fairly rapid pace.
Taking ideas is not a vice, it is a virtue. We have another word for this. It is called learning.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.