Re: patricia devlog
Posted: Sat Feb 17, 2024 1:13 am
Using the 6th filtering program I wrote in Patricia, the one that "saves positions where one side can't castle and has an open king", I filtered about 12 million positions out of the 400 million-position Willow dataset I have on hand. I then retrained Patricia's current network for 10 epochs on the new dataset at a much lower LR (so that the network as a whole doesn't start overfitting), in the hope that it would value "good but not necessarily better than other options" attacks more highly and choose them over quiet play.
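For anyone who wants to play with the idea, here is a rough sketch of what a filter like this could look like. It is not Patricia's actual filter: it assumes the positions are available as FEN strings, and "open king" is my own stand-in definition (at most `max_shield` friendly pawns on the three squares directly in front of the king). The names `parse_board`, `shield_pawns`, `keep_position` and `max_shield` are all made up for the example.

```python
def parse_board(placement):
    """Expand the piece-placement field of a FEN into {(file, rank): piece}."""
    board = {}
    for row_index, row in enumerate(placement.split("/")):
        rank = 7 - row_index  # FEN lists rank 8 first
        file = 0
        for ch in row:
            if ch.isdigit():
                file += int(ch)  # a digit means that many empty squares
            else:
                board[(file, rank)] = ch
                file += 1
    return board


def shield_pawns(board, white):
    """Count friendly pawns on the three squares directly in front of the king."""
    king, pawn, step = ("K", "P", 1) if white else ("k", "p", -1)
    king_file, king_rank = next(sq for sq, pc in board.items() if pc == king)
    return sum(
        board.get((king_file + df, king_rank + step)) == pawn
        for df in (-1, 0, 1)
    )


def keep_position(fen, max_shield=1):
    """Keep a position if either side has no castling rights AND an 'open' king.

    'Open' is an assumed definition: at most max_shield pawns in the shield.
    """
    fields = fen.split()
    board = parse_board(fields[0])
    rights = fields[2]  # e.g. "KQkq", "kq" or "-"
    for white in (True, False):
        can_castle = any(c in rights for c in ("KQ" if white else "kq"))
        if not can_castle and shield_pawns(board, white) <= max_shield:
            return True
    return False
```

Calling `keep_position(fen)` on each position and writing out the ones that return `True` gives the filtered set. A real filter would probably want a less crude notion of king safety than a three-square pawn shield.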
The results were somewhat promising: here's the result of a 3k-game match between the retrained Patricia and the original Patricia.
The increase in sacrifice rate (from 8.05% to 11.17%) is statistically significant, and the strength loss is only about 5-10 Elo. Not bad!
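To put the 5-10 Elo figure in terms of match score, the standard logistic Elo formula can be used in both directions. This is just a small helper sketch; the function names are made up for the example and nothing here comes from Patricia's tooling.

```python
import math


def expected_score(elo_diff):
    """Expected score (between 0 and 1) for a player rated elo_diff above the opponent."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))


def elo_from_score(score):
    """Elo difference implied by a match score strictly between 0 and 1."""
    return -400.0 * math.log10(1.0 / score - 1.0)


# What match score does a 5 or 10 Elo deficit correspond to?
for deficit in (5, 10):
    print(f"-{deficit} Elo -> expected score {expected_score(-deficit):.4f}")
```

A 5-10 Elo loss works out to a match score of roughly 48.6% to 49.3%, which is why it takes thousands of games to measure a difference that small.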
I'm going to do this for all 8 filtering programs. I hope Linux has a good program that can run round-robin tournaments; otherwise, I'll just have them all play matches against the original Patricia and pick the one that is most aggressive without giving up too much Elo. I have no problem giving up 10 Elo per patch for a better playstyle; giving up 100 Elo is more of a problem.
Code:
Rank  EAS-Score  sacs    shorts  draws   moves  Engine/player
-------------------------------------------------------------------
1     117347     11.17%  23.08%  17.82%  70     patty-dev
2     76785      08.05%  16.32%  21.15%  74     patricia
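For a quick sanity check on whether a gap like the one in the sacs column above is significant, a two-proportion z-test is one option. This is only a sketch: the post doesn't say how many games each percentage is computed over, so `n_hypothetical` below is a placeholder and not a number from the match. Plug in the real counts before trusting the result.

```python
import math


def two_proportion_z(p1, n1, p2, n2):
    """Two-sided z-test for the difference between two proportions.

    Returns (z, p_value) using the pooled-variance normal approximation.
    """
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail probability
    return z, p_value


# ASSUMPTION: placeholder sample size, NOT taken from the match data.
n_hypothetical = 1000
z, p = two_proportion_z(0.1117, n_hypothetical, 0.0805, n_hypothetical)
print(f"z = {z:.2f}, two-sided p = {p:.4f}")
```

The smaller the real sample behind each percentage, the weaker the evidence, so the conclusion depends heavily on the actual counts.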