New engine releases & news 2021

Rebel · Post by **Rebel** » Sat Apr 24, 2021 12:41 pm

pedrox wrote: ↑Fri Apr 23, 2021 10:37 pmIf Ed would like to test the networks with SIM, I would be happy to see the results.

Code: Select all

Default net versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2037    1857    1402    1018    714     1194    68.13   53.55

Default net versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
1962    1831    1481    1069    693     1190    67.06   52.07

Code: Select all

ccrl net versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
1613    1577    1317    1161    840     1715    88.97   43.69

ccrl net versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
1627    1538    1366    1131    874     1691    89.51   43.41

Code: Select all

SF net versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2063    1852    1543    1059    662     1044    63.34   53.87

SF net versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2048    1865    1480    1090    690     1054    62.95   53.79

The costs are 3 bitcoins

Modern Times · Post by **Modern Times** » Sat Apr 24, 2021 1:08 pm

Modern Times wrote: ↑Sat Apr 24, 2021 10:36 am
Does DanaSah 8.8 play chess960 under Cutechess GUI, either as UCI or WinBoard ?

Currently trying it as XBoard at chess960 under Cutechess. Only 10 games played but I think it is probably fine.

pedrox · Post by **pedrox** » Sat Apr 24, 2021 2:10 pm

Modern Times wrote: ↑Sat Apr 24, 2021 1:08 pm
Modern Times wrote: ↑Sat Apr 24, 2021 10:36 am
Does DanaSah 8.8 play chess960 under Cutechess GUI, either as UCI or WinBoard ?
Currently trying it as XBoard at chess960 under Cutechess. Only 10 games played but I think it is probably fine.

DanaSah can play FRC or Chess960 with either xboard or uci protocol. No user configuration is required. It also recognises the 2 commonly used FEN formats, ShredderFEN and xFEN.

I have tried to play in cutechess with the uci protocol:

Code: Select all

variant: normal
Score of danasah88_64bit vs danasah830: 363 - 7 - 33 [0.942]
...      danasah88_64bit playing White: 180 - 3 - 19  [0.938] 202
...      danasah88_64bit playing Black: 183 - 4 - 14  [0.945] 201
...      White vs Black: 184 - 186 - 33  [0.498] 403
Elo difference: 483.3 +/- 59.1, LOS: 100.0 %, DrawRatio: 8.2 %
413 of 1000 games finished.

variant: fischerandom
Score of danasah88_64bit vs danasah830: 343 - 35 - 24 [0.883]
...      danasah88_64bit playing White: 182 - 12 - 8  [0.921] 202
...      danasah88_64bit playing Black: 161 - 23 - 16  [0.845] 200
...      White vs Black: 205 - 173 - 24  [0.540] 402
Elo difference: 351.3 +/- 49.7, LOS: 100.0 %, DrawRatio: 6.0 %
412 of 1000 games finished.

Modern Times · Post by **Modern Times** » Sat Apr 24, 2021 2:29 pm

pedrox wrote: ↑Sat Apr 24, 2021 2:10 pm
DanaSah can play FRC or Chess960 with either xboard or uci protocol. No user configuration is required. It also recognises the 2 commonly used FEN formats, ShredderFEN and xFEN.

I have tried to play in cutechess with the uci protocol:

Ah thank you. I'll try it as UCI instead. I ran previous versions as UCI under ChessGUI, but I no longer use that.

Rebel · Post by **Rebel** » Sat Apr 24, 2021 5:03 pm

pedrox wrote: ↑Fri Apr 23, 2021 10:37 pmIf Ed would like to test the networks with SIM, I would be happy to see the results.

Code: Select all

stock2bd10.nnue versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2063    1852    1543    1059    662     1044    63.34   53.87

stock2bd10.nnue versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2048    1865    1480    1090    690     1054    62.95   53.79

Pedro, I like to ask, how recent is the stock2bd10.nnue net?

pedrox · Post by **pedrox** » Sat Apr 24, 2021 6:15 pm

Rebel wrote: ↑Sat Apr 24, 2021 5:03 pm
pedrox wrote: ↑Fri Apr 23, 2021 10:37 pmIf Ed would like to test the networks with SIM, I would be happy to see the results.
Code: Select all
stock2bd10.nnue versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2063    1852    1543    1059    662     1044    63.34   53.87

stock2bd10.nnue versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2048    1865    1480    1090    690     1054    62.95   53.79
Pedro, I like to ask, how recent is the stock2bd10.nnue net?

Hi Ed,

stock2bd10.nnue was used with danasah or with stockfish? I guess with danasah.

I downloaded from a google drive page in mid march 2000 million positions in binpack format that I think were created with the stockfish master network (sf13?) and with a depth of 10. The file is called: gensfen_multipvdiff_100_d10.binpack, I think it is quite well known in the stockfish nnue group on discord.

Then I used nodchip trainer to train a fresh network. I haven't done a serious test on its strength, but I think it's something like 75 Elo points less than a stockfish net I tested. To make it stronger it would have been better to have more positions and depth, even up to 16000 million.

If I'm not mistaken, with a SIM value of 54 it could even pass as an original job. However I preferred to do something different and instead of using positions with stockfish evaluation or one of their networks I preferred to use ccrl positions and instead of learning by depth I did it by result. This is ccrl402net network and I imagine will have a very low SIM and then the dananet1 network that I use (default) and which is tactically stronger for having trained it in depth 5 with ccrlnet evaluations will have a higher SIM than the ccrl but lower than the one in your test.

Rebel · Post by **Rebel** » Sat Apr 24, 2021 10:20 pm

pedrox wrote: ↑Sat Apr 24, 2021 6:15 pm
Rebel wrote: ↑Sat Apr 24, 2021 5:03 pm
pedrox wrote: ↑Fri Apr 23, 2021 10:37 pmIf Ed would like to test the networks with SIM, I would be happy to see the results.
Code: Select all
stock2bd10.nnue versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2063    1852    1543    1059    662     1044    63.34   53.87

stock2bd10.nnue versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2048    1865    1480    1090    690     1054    62.95   53.79
Pedro, I like to ask, how recent is the stock2bd10.nnue net?
Hi Ed,

stock2bd10.nnue was used with danasah or with stockfish? I guess with danasah.

Yep.

I downloaded from a google drive page in mid march 2000 million positions in binpack format that I think were created with the stockfish master network (sf13?) and with a depth of 10. The file is called: gensfen_multipvdiff_100_d10.binpack, I think it is quite well known in the stockfish nnue group on discord.

Then I used nodchip trainer to train a fresh network. I haven't done a serious test on its strength, but I think it's something like 75 Elo points less than a stockfish net I tested. To make it stronger it would have been better to have more positions and depth, even up to 16000 million.

If I'm not mistaken, with a SIM value of 54 it could even pass as an original job.

Well noticed, and that was the reason I asked. SIM-SCORE is like SIMEX, numbers can only raise doubt, not proof innocence so to say.

However I preferred to do something different and instead of using positions with stockfish evaluation or one of their networks I preferred to use ccrl positions and instead of learning by depth I did it by result. This is ccrl402net network and I imagine will have a very low SIM and then the dananet1 network that I use (default) and which is tactically stronger for having trained it in depth 5 with ccrlnet evaluations will have a higher SIM than the ccrl but lower than the one in your test.

Well done and welcome to the NNUE family

connor_mcmonigle · Post by **connor_mcmonigle** » Tue Apr 27, 2021 11:29 am

I've released Seer 2.0.0 here: https://github.com/connormcmonigle/see ... tag/v2.0.0.

With any luck, it should be more than 100 elo stronger than v1.2.1. With this version, the network weights are now embedded into the binaries. The embedded network is trained entirely on data generated by Seer's search starting with a randomly initialized network. The network is no longer directly or indirectly trained on Stockfish derived training data. The training technique employed is a variant of semisupervised learning and involves starting with a large number of <=6 man positions labeled using Syzygy EGTBs. A lengthier description of the unique training process can be found in the README.

Thanks in advance for testing,
Connor

Damir · Post by **Damir** » Tue Apr 27, 2021 12:37 pm

Thanks a lot for this release Connor!

mar · Post by **mar** » Tue Apr 27, 2021 8:33 pm

Just released Cheng 4.41
link: http://vlasak.biz/cheng

the improvement is modest this time (I'm playing with fire - no pun intended

, estimated +15 against other engines (about half of 4.40 vs 4.39)
(self-play hyperbullet 56% against 4.40 so who knows, I usually get about half of self-play) - my apologies to the testers for not jumping hundreds of Elo points

note that I dropped own book because some people abused it for engine-engine testing (this was never intended) to make Cheng look bad
I'm constantly frustrated and annoyed by computer chess - probably not a hobby I intend to keep, we'll see...

anyway - have fun (hopefully 4.41 will do better in D5 than 4.40 did in D4

New engine releases & news 2021

Re: DanaSah 8.8

Re: DanaSah 8.8

Re: DanaSah 8.8

Re: DanaSah 8.8

Re: DanaSah 8.8

Re: DanaSah 8.8

Re: DanaSah 8.8

Re: New engine releases & news 2021

Re: New engine releases & news 2021

Cheng 4.41