New engine releases & news 2021

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

Rebel
Posts: 7299
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: DanaSah 8.8

Post by Rebel »

pedrox wrote: Fri Apr 23, 2021 10:37 pm
If Ed would like to test the networks with SIM, I would be happy to see the results.

Default net versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2037    1857    1402    1018    714     1194    68.13   53.55

Default net versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
1962    1831    1481    1069    693     1190    67.06   52.07

ccrl net versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
1613    1577    1317    1161    840     1715    88.97   43.69

ccrl net versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
1627    1538    1366    1131    874     1691    89.51   43.41

SF net versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2063    1852    1543    1059    662     1044    63.34   53.87

SF net versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2048    1865    1480    1090    690     1054    62.95   53.79
The costs are 3 bitcoins :D
90% of coding is debugging, the other 10% is writing bugs.
Modern Times
Posts: 3703
Joined: Thu Jun 07, 2012 11:02 pm

Re: DanaSah 8.8

Post by Modern Times »

Modern Times wrote: Sat Apr 24, 2021 10:36 am
Does DanaSah 8.8 play Chess960 under the Cutechess GUI, either as UCI or WinBoard?
Currently trying it as XBoard at chess960 under Cutechess. Only 10 games played but I think it is probably fine.
pedrox
Posts: 1056
Joined: Fri Mar 10, 2006 6:07 am
Location: Basque Country (Spain)

Re: DanaSah 8.8

Post by pedrox »

Modern Times wrote: Sat Apr 24, 2021 1:08 pm
Modern Times wrote: Sat Apr 24, 2021 10:36 am
Does DanaSah 8.8 play Chess960 under the Cutechess GUI, either as UCI or WinBoard?
Currently trying it as XBoard at chess960 under Cutechess. Only 10 games played but I think it is probably fine.
DanaSah can play FRC (Chess960) with either the xboard or the UCI protocol. No user configuration is required. It also recognises the two commonly used FEN formats, Shredder-FEN and X-FEN.
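
For readers unfamiliar with the two formats: they differ mainly in the castling field. Shredder-FEN names the files of the castling rooks (for example HAha), while X-FEN keeps the classic KQkq letters and only falls back to file letters when a rook is ambiguous. A minimal sketch of telling them apart, assuming a well-formed FEN string; the function name is hypothetical and this is not DanaSah's actual code:

```python
def castling_notation(fen: str) -> str:
    """Classify the castling field of a FEN string (illustrative helper)."""
    fields = fen.split()
    if len(fields) < 3:
        raise ValueError("a FEN needs at least board, side to move and castling fields")
    castling = fields[2]
    if castling == "-":
        return "none"        # no castling rights left, both formats agree
    if all(c in "KQkq" for c in castling):
        return "xfen"        # classic letters, as used by X-FEN
    if all(c in "ABCDEFGHabcdefgh" for c in castling):
        return "shredder"    # rook file letters, as used by Shredder-FEN
    return "mixed"           # e.g. X-FEN disambiguating an inner rook


start = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w {} - 0 1"
print(castling_notation(start.format("KQkq")))  # classic letters
print(castling_notation(start.format("HAha")))  # rook files
```

An engine that accepts both formats can branch on a check like this before mapping the castling rights to rook squares.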

I tried it in cutechess with the UCI protocol:

variant: normal
Score of danasah88_64bit vs danasah830: 363 - 7 - 33 [0.942]
...      danasah88_64bit playing White: 180 - 3 - 19  [0.938] 202
...      danasah88_64bit playing Black: 183 - 4 - 14  [0.945] 201
...      White vs Black: 184 - 186 - 33  [0.498] 403
Elo difference: 483.3 +/- 59.1, LOS: 100.0 %, DrawRatio: 8.2 %
413 of 1000 games finished.

variant: fischerandom
Score of danasah88_64bit vs danasah830: 343 - 35 - 24 [0.883]
...      danasah88_64bit playing White: 182 - 12 - 8  [0.921] 202
...      danasah88_64bit playing Black: 161 - 23 - 16  [0.845] 200
...      White vs Black: 205 - 173 - 24  [0.540] 402
Elo difference: 351.3 +/- 49.7, LOS: 100.0 %, DrawRatio: 6.0 %
412 of 1000 games finished.
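
The score, Elo difference and draw ratio in the output above can be recomputed from the win/loss/draw counts. A minimal sketch, assuming the usual logistic Elo model; the function names are mine and are not part of cutechess:

```python
import math


def score_fraction(wins: int, losses: int, draws: int) -> float:
    """Points scored divided by games played (a draw counts half a point)."""
    return (wins + 0.5 * draws) / (wins + losses + draws)


def elo_difference(score: float) -> float:
    """Elo difference implied by a score fraction under the logistic model."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def draw_ratio(wins: int, losses: int, draws: int) -> float:
    """Fraction of games that ended in a draw."""
    return draws / (wins + losses + draws)


# Win - loss - draw counts taken from the two result blocks above.
results = {"normal": (363, 7, 33), "fischerandom": (343, 35, 24)}
for variant, (w, l, d) in results.items():
    s = score_fraction(w, l, d)
    print(f"{variant}: score {s:.3f}, Elo {elo_difference(s):+.1f}, "
          f"draws {100 * draw_ratio(w, l, d):.1f} %")
```

With these counts the sketch agrees with the reported figures to within rounding. The error margins (+/- 59.1 and +/- 49.7) need a variance estimate and are not covered here.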
Modern Times
Posts: 3703
Joined: Thu Jun 07, 2012 11:02 pm

Re: DanaSah 8.8

Post by Modern Times »

pedrox wrote: Sat Apr 24, 2021 2:10 pm
DanaSah can play FRC (Chess960) with either the xboard or the UCI protocol. No user configuration is required. It also recognises the two commonly used FEN formats, Shredder-FEN and X-FEN.

I tried it in cutechess with the UCI protocol:
Ah thank you. I'll try it as UCI instead. I ran previous versions as UCI under ChessGUI, but I no longer use that.
Rebel
Posts: 7299
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: DanaSah 8.8

Post by Rebel »

pedrox wrote: Fri Apr 23, 2021 10:37 pm
If Ed would like to test the networks with SIM, I would be happy to see the results.

stock2bd10.nnue versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2063    1852    1543    1059    662     1044    63.34   53.87

stock2bd10.nnue versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2048    1865    1480    1090    690     1054    62.95   53.79
Pedro, I'd like to ask: how recent is the stock2bd10.nnue net?
pedrox
Posts: 1056
Joined: Fri Mar 10, 2006 6:07 am
Location: Basque Country (Spain)

Re: DanaSah 8.8

Post by pedrox »

Rebel wrote: Sat Apr 24, 2021 5:03 pm
pedrox wrote: Fri Apr 23, 2021 10:37 pm
If Ed would like to test the networks with SIM, I would be happy to see the results.

stock2bd10.nnue versus SF12
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2063    1852    1543    1059    662     1044    63.34   53.87

stock2bd10.nnue versus SF13
dev0    dev1    dev2    dev3    dev4    dev5    RMS     SIM
2048    1865    1480    1090    690     1054    62.95   53.79
Pedro, I'd like to ask: how recent is the stock2bd10.nnue net?
Hi Ed,

Was stock2bd10.nnue used with DanaSah or with Stockfish? I guess with DanaSah.

In mid-March I downloaded 2000 million positions in binpack format from a Google Drive page. I think they were created with the Stockfish master network (SF13?) at depth 10. The file is called gensfen_multipvdiff_100_d10.binpack; I think it is quite well known in the Stockfish NNUE group on Discord.

Then I used the nodchip trainer to train a fresh network. I haven't tested its strength seriously, but I think it is about 75 Elo weaker than a Stockfish net I tested. To make it stronger I would have needed more positions (even up to 16000 million) and greater depth.

If I'm not mistaken, with a SIM value of 54 it could even pass as original work. However, I preferred to do something different: instead of using positions evaluated by Stockfish or one of its networks, I used CCRL positions, and instead of training on scores from a fixed-depth search I trained on game results. That is the ccrl402net network, which I imagine will have a very low SIM. The dananet1 network that I use by default was then trained at depth 5 with ccrlnet evaluations, which makes it tactically stronger; I expect its SIM to be higher than the ccrl net's but lower than the one in your test.
Rebel
Posts: 7299
Joined: Thu Aug 18, 2011 12:04 pm
Full name: Ed Schröder

Re: DanaSah 8.8

Post by Rebel »

pedrox wrote: Sat Apr 24, 2021 6:15 pm
Hi Ed,

Was stock2bd10.nnue used with DanaSah or with Stockfish? I guess with DanaSah.
Yep.
In mid-March I downloaded 2000 million positions in binpack format from a Google Drive page. I think they were created with the Stockfish master network (SF13?) at depth 10. The file is called gensfen_multipvdiff_100_d10.binpack; I think it is quite well known in the Stockfish NNUE group on Discord.

Then I used the nodchip trainer to train a fresh network. I haven't tested its strength seriously, but I think it is about 75 Elo weaker than a Stockfish net I tested. To make it stronger I would have needed more positions (even up to 16000 million) and greater depth.

If I'm not mistaken, with a SIM value of 54 it could even pass as original work.
Well noticed, and that was the reason I asked. SIM-SCORE is like SIMEX: the numbers can only raise doubt, not prove innocence, so to speak.
However, I preferred to do something different: instead of using positions evaluated by Stockfish or one of its networks, I used CCRL positions, and instead of training on scores from a fixed-depth search I trained on game results. That is the ccrl402net network, which I imagine will have a very low SIM. The dananet1 network that I use by default was then trained at depth 5 with ccrlnet evaluations, which makes it tactically stronger; I expect its SIM to be higher than the ccrl net's but lower than the one in your test.
Well done and welcome to the NNUE family :)
connor_mcmonigle
Posts: 544
Joined: Sun Sep 06, 2020 4:40 am
Full name: Connor McMonigle

Re: New engine releases & news 2021

Post by connor_mcmonigle »

I've released Seer 2.0.0 here: https://github.com/connormcmonigle/see ... tag/v2.0.0.

With any luck, it should be more than 100 Elo stronger than v1.2.1. With this version, the network weights are embedded in the binaries. The embedded network is trained entirely on data generated by Seer's search, starting from a randomly initialized network. The network is no longer trained, directly or indirectly, on Stockfish-derived data. The training technique is a variant of semi-supervised learning and starts from a large number of <=6-man positions labeled using Syzygy EGTBs. A lengthier description of the training process can be found in the README.

Thanks in advance for testing,
Connor
Damir
Posts: 2873
Joined: Mon Feb 11, 2008 3:53 pm
Location: Denmark
Full name: Damir Desevac

Re: New engine releases & news 2021

Post by Damir »

Thanks a lot for this release Connor! :)
mar
Posts: 2654
Joined: Fri Nov 26, 2010 2:00 pm
Location: Czech Republic
Full name: Martin Sedlak

Cheng 4.41

Post by mar »

Just released Cheng 4.41
link: http://vlasak.biz/cheng

the improvement is modest this time (I'm playing with fire - no pun intended :), estimated +15 against other engines (about half of 4.40 vs 4.39)
(self-play hyperbullet 56% against 4.40 so who knows, I usually get about half of self-play) - my apologies to the testers for not jumping hundreds of Elo points :lol:
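
A back-of-the-envelope check of the numbers above. This is a minimal sketch, assuming the standard logistic Elo model and taking the "about half of self-play" rule of thumb literally; the helper names are hypothetical and this is not how the estimate in this post was derived:

```python
import math


def elo_from_score(score: float) -> float:
    """Elo difference implied by a score fraction (logistic model)."""
    return 400.0 * math.log10(score / (1.0 - score))


def score_from_elo(elo: float) -> float:
    """Expected score fraction for a given Elo advantage (logistic model)."""
    return 1.0 / (1.0 + 10.0 ** (-elo / 400.0))


self_play_elo = elo_from_score(0.56)   # 56% in self-play against 4.40
halved = self_play_elo / 2.0           # rule of thumb: about half vs other engines
print(f"self-play: {self_play_elo:.1f} Elo, halved: {halved:.1f} Elo")
print(f"+15 Elo corresponds to an expected score of {score_from_elo(15):.3f}")
```

Under these assumptions 56% in self-play is roughly 42 Elo, so halving gives about 21, and the +15 estimate sits on the cautious side of that.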

note that I dropped my own book because some people abused it for engine-engine testing (this was never intended) to make Cheng look bad
I'm constantly frustrated and annoyed by computer chess - probably not a hobby I intend to keep, we'll see...

anyway - have fun (hopefully 4.41 will do better in D5 than 4.40 did in D4 ;)