Yes; most people add killers, history, and often many evaluation terms before they add a TT, but I wanted to beat TSCP with the least amount of features possible. Because TSCP is a one-trick pawn-pony, I added the TT first, because it massively helps in pawn endgames. (Because pawns don't have a lot of moves, they cause many TT hits.)algerbrex wrote: ↑Thu Sep 30, 2021 1:21 am Thanks![]()
But keep in mind that if I remember correctly, you added PVS after you had TT move ordering, TT Cuts, and MVV-LVA, and gained ~56 Elo. The 39 Elo I gained was just with using MVV-LVA and TT-Cuts. If my TT move ordering was playing nice with PVS, I think 15-20 Elo more would be realistic. Not to mention killers.
Just add the TT ordering first, by giving the TT-move a higher score than the highest MVV-LVA score, and test again. Then add the killers below the lowest MVV_LVA score and test again.While I'm glad I found the culprit of this bug, it's such an odd bug. Both the killers and the TT add significant speed, node reduction, and Elo gains to the engine, but when PVS is added to the mix, it just makes things worse. But take them away, and then all of sudden PVS is a strength gaining feature again? Huh? I honestly have no clue what might be at play here, but I have a couple of theories, so I suppose I'll just go from there.
Yes. I have had several speed improvements in the current development version, which in total added about 50 Elo (but as the margin is +/- 24, it could also be around 25 or 75). Against some engines in the gauntlet Rustic's performance rating increases (so it wins more games against said engine than it did before the optimizations), while against other engines, nothing seems to have changed. Against two engines rated 2130 and 2160 Rustic only manages 50% score in a head-to-head match, which would put the engine at 2140-2150 level. However, against two engines rated ~2100, Rustic scores +150 Elo for a rating of 2250... and against one 2260 engine, it scores -50 for a rating of 2210.And I've noticed what you mention with regards to Rustic being stronger against certain engines versus others, although Blunder's improvements are usually optimizations and whatnot. This is now why I always try to pair self-pay with gauntlet testing, which I know you do as well. Seems to be the best way to good an accurate estimate of what the engines Elo will end up being on the CCRL.
So... depending on which engines CCRL chooses, the newest Rustic version might be rated 2150, but it could also be 2200 or even higher.
It seems that some engines have, for example, evaluation features that cause them to exploit gaps in Rustic's knowledge. (It only counts material and PST's), and extra speed doesn't fix this. Maybe I should test those two 2130 and 2160 engines against the 2260 engine, to see if that 2260 engine also scores poorly against them; because I know that that engine also only uses PST's and material counting.
I'm not mentioning the specific engines to make sure CCRL doesn't get any idea's about which engines to put in the gauntlet to make it an interesting tournament. I don't want to stack the deck against my own engine, not in favor of it, but also not disadvantaging it.