Chess324

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

peter
Posts: 3387
Joined: Sat Feb 16, 2008 7:38 am
Full name: Peter Martan

Re: Chess324

Post by peter »

Plutie wrote: Tue Sep 06, 2022 5:42 am
Chessqueen wrote: Mon Sep 05, 2022 8:12 pm
Jouni wrote: Mon Sep 05, 2022 7:59 pm According to Discord playing version is buggy!
According to Plutie it was updated after game 154, but it is still playing horribly in Chess324 here is what he wrote
Current working theory - the bad performance so far would be because we submitted an untested branch which ended up having a pretty bad bug. whether that's actually why leela has performed so poorly remains to be seen, but it's the most probable explanation, considering the analysis I was running on the side during games with a known good version. - the playing Leela was updated to a fixed version around game 154.
for the record, that was a bit of a hasty statement - it's definitely still not fixed, but the update was for one bug that was found in the meantime. it's still possible to reproduce the issue with a few positions where leela blundered though.

this position is from game 166, leela went from an easily won position to a draw with 45.. Ng3. first image is from dag-bord-lf-se-2 - known working version, second is from dag-size, which was submitted.

Image
Image
Thanks for showing.
Tried the position with the two compiles I have still installed at my PC

Lc0 v0.30.0-dev+git.75c6ada0

and

Engine: Lc0 v0.30.0-dag+git.c91bf77S

with 3070ti and compared ouput here:

https://forum.computerschach.de/cgi-bin ... #pid157734

Both play the correct 46...Bxd6 at once and keep it in output with clearly winning eval for the few minutes I did let run, in MultiPV=2 46...Ng3? as second best has drawish eval from the very start,
regards
Last edited by peter on Tue Sep 06, 2022 10:17 am, edited 1 time in total.
Peter.
Lazy_Frank
Posts: 74
Joined: Mon Jul 23, 2018 10:56 pm
Location: Latvia
Full name: Raivis Baumanis

Re: Chess324

Post by Lazy_Frank »

CPU Leela gives close to 80% confidence for Bxd6.
peter
Posts: 3387
Joined: Sat Feb 16, 2008 7:38 am
Full name: Peter Martan

Re: Chess324

Post by peter »

peter wrote: Tue Sep 06, 2022 9:47 am Both play the correct 46...Bxd6 at once and keep it in output with clearly winning eval for the few minutes I did let run, in MultiPV=2 46...Ng3? as second best has drawish eval from the very start,
regards
Edit- time over, meant 45... of course, not 46...
Peter.
Chessqueen
Posts: 5685
Joined: Wed Sep 05, 2018 2:16 am
Location: Moving
Full name: Jorge Picado

Re: Chess324

Post by Chessqueen »

Lazy_Frank wrote: Tue Sep 06, 2022 9:50 am CPU Leela gives close to 80% confidence for Bxd6.
Therefore. Leela CPU should be given another chance against the top 6 to see how it play in comparison. Another thing that I noticed is that the percentage of Draws should be lowered at the very top, for instance between Stockfish Vs Dragon there should be more decisive wins and less Draws whereas between Dragon vs Ethereal the percentage of Draws should increased considerably since in positions where Ethereal has the initial first move which create a space advantage Dragon will start from a negative evaluation and will have to fight hard to get Draws even if it does NOT lose any games :roll:
Chessqueen
Posts: 5685
Joined: Wed Sep 05, 2018 2:16 am
Location: Moving
Full name: Jorge Picado

Re: Chess324

Post by Chessqueen »

Chessqueen wrote: Tue Sep 06, 2022 11:31 am
Lazy_Frank wrote: Tue Sep 06, 2022 9:50 am CPU Leela gives close to 80% confidence for Bxd6.
Therefore. Leela CPU should be given another chance against the top 6 to see how it play in comparison. Another thing that I noticed is that the percentage of Draws should be lowered at the very top, for instance between Stockfish Vs Dragon there should be more decisive wins and less Draws whereas between Dragon vs Ethereal the percentage of Draws should increased considerably since in positions where Ethereal has the initial first move which create a space advantage Dragon will start from a negative evaluation and will have to fight hard to get Draws even if it does NOT lose any games :roll:
Sorry, for stating that stupid comment, it is too late for LCO now :mrgreen:
Lazy_Frank
Posts: 74
Joined: Mon Jul 23, 2018 10:56 pm
Location: Latvia
Full name: Raivis Baumanis

Re: Chess324

Post by Lazy_Frank »

Chessqueen wrote: Tue Sep 06, 2022 1:15 pm
Chessqueen wrote: Tue Sep 06, 2022 11:31 am
Lazy_Frank wrote: Tue Sep 06, 2022 9:50 am CPU Leela gives close to 80% confidence for Bxd6.
Therefore. Leela CPU should be given another chance against the top 6 to see how it play in comparison. Another thing that I noticed is that the percentage of Draws should be lowered at the very top, for instance between Stockfish Vs Dragon there should be more decisive wins and less Draws whereas between Dragon vs Ethereal the percentage of Draws should increased considerably since in positions where Ethereal has the initial first move which create a space advantage Dragon will start from a negative evaluation and will have to fight hard to get Draws even if it does NOT lose any games :roll:
Sorry, for stating that stupid comment, it is too late for LCO now :mrgreen:
It's up to CCC organizers and admins i think.
Chessqueen
Posts: 5685
Joined: Wed Sep 05, 2018 2:16 am
Location: Moving
Full name: Jorge Picado

Re: Chess324

Post by Chessqueen »

Lazy_Frank wrote: Tue Sep 06, 2022 1:29 pm
Chessqueen wrote: Tue Sep 06, 2022 1:15 pm
Chessqueen wrote: Tue Sep 06, 2022 11:31 am
Lazy_Frank wrote: Tue Sep 06, 2022 9:50 am CPU Leela gives close to 80% confidence for Bxd6.
Therefore. Leela CPU should be given another chance against the top 6 to see how it play in comparison. Another thing that I noticed is that the percentage of Draws should be lowered at the very top, for instance between Stockfish Vs Dragon there should be more decisive wins and less Draws whereas between Dragon vs Ethereal the percentage of Draws should increased considerably since in positions where Ethereal has the initial first move which create a space advantage Dragon will start from a negative evaluation and will have to fight hard to get Draws even if it does NOT lose any games :roll:
Sorry, for stating that stupid comment, it is too late for LCO now :mrgreen:
It's up to CCC organizers and admins i think.
I believe that LCO CPU vs Ethereal match would be interesting in Chess324 :roll:
Chessqueen
Posts: 5685
Joined: Wed Sep 05, 2018 2:16 am
Location: Moving
Full name: Jorge Picado

Re: Chess324

Post by Chessqueen »

Chessqueen wrote: Tue Sep 06, 2022 1:34 pm
Lazy_Frank wrote: Tue Sep 06, 2022 1:29 pm
Chessqueen wrote: Tue Sep 06, 2022 1:15 pm
Chessqueen wrote: Tue Sep 06, 2022 11:31 am
Lazy_Frank wrote: Tue Sep 06, 2022 9:50 am CPU Leela gives close to 80% confidence for Bxd6.
Therefore. Leela CPU should be given another chance against the top 6 to see how it play in comparison. Another thing that I noticed is that the percentage of Draws should be lowered at the very top, for instance between Stockfish Vs Dragon there should be more decisive wins and less Draws whereas between Dragon vs Ethereal the percentage of Draws should increased considerably since in positions where Ethereal has the initial first move which create a space advantage Dragon will start from a negative evaluation and will have to fight hard to get Draws even if it does NOT lose any games :roll:
Sorry, for stating that stupid comment, it is too late for LCO now :mrgreen:
It's up to CCC organizers and admins i think.
I believe that LCO CPU vs Ethereal match would be interesting in Chess324 :roll:
Ethereal has NOT won one single game versus Dragon, and the percentage of Draws must be around 94% https://www.chess.com/computer-chess-championship
lkaufman
Posts: 6224
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA
Full name: Larry Kaufman

Re: Chess324

Post by lkaufman »

Chessqueen wrote: Tue Sep 06, 2022 4:27 pm
Chessqueen wrote: Tue Sep 06, 2022 1:34 pm
Lazy_Frank wrote: Tue Sep 06, 2022 1:29 pm
Chessqueen wrote: Tue Sep 06, 2022 1:15 pm
Chessqueen wrote: Tue Sep 06, 2022 11:31 am
Lazy_Frank wrote: Tue Sep 06, 2022 9:50 am CPU Leela gives close to 80% confidence for Bxd6.
Therefore. Leela CPU should be given another chance against the top 6 to see how it play in comparison. Another thing that I noticed is that the percentage of Draws should be lowered at the very top, for instance between Stockfish Vs Dragon there should be more decisive wins and less Draws whereas between Dragon vs Ethereal the percentage of Draws should increased considerably since in positions where Ethereal has the initial first move which create a space advantage Dragon will start from a negative evaluation and will have to fight hard to get Draws even if it does NOT lose any games :roll:
Sorry, for stating that stupid comment, it is too late for LCO now :mrgreen:
It's up to CCC organizers and admins i think.
I believe that LCO CPU vs Ethereal match would be interesting in Chess324 :roll:
Ethereal has NOT won one single game versus Dragon, and the percentage of Draws must be around 94% https://www.chess.com/computer-chess-championship
After 93 games in this playoff between the second and third place finishers in the main event, so far the score is 19 wins for Dragon to zero for Ethereal, which is a tad under 80% draws. This is a remarkable improvement over the 98+% draws in the 600 game FRC match a year ago between Dragon and Stockfish. It seems that chess324 is much less drawish than chess960 at this top level. Furthermore, the fact that Ethereal has not won a single game means that this excellent 20+% decisive rate is not due to any of the positions being easily won, at least none of the ones played so far.
Komodo rules!
Chessqueen
Posts: 5685
Joined: Wed Sep 05, 2018 2:16 am
Location: Moving
Full name: Jorge Picado

Re: Chess324

Post by Chessqueen »

lkaufman wrote: Wed Sep 07, 2022 6:10 am
Chessqueen wrote: Tue Sep 06, 2022 4:27 pm
Chessqueen wrote: Tue Sep 06, 2022 1:34 pm
Lazy_Frank wrote: Tue Sep 06, 2022 1:29 pm
Chessqueen wrote: Tue Sep 06, 2022 1:15 pm
Chessqueen wrote: Tue Sep 06, 2022 11:31 am
Lazy_Frank wrote: Tue Sep 06, 2022 9:50 am CPU Leela gives close to 80% confidence for Bxd6.
Therefore. Leela CPU should be given another chance against the top 6 to see how it play in comparison. Another thing that I noticed is that the percentage of Draws should be lowered at the very top, for instance between Stockfish Vs Dragon there should be more decisive wins and less Draws whereas between Dragon vs Ethereal the percentage of Draws should increased considerably since in positions where Ethereal has the initial first move which create a space advantage Dragon will start from a negative evaluation and will have to fight hard to get Draws even if it does NOT lose any games :roll:
Sorry, for stating that stupid comment, it is too late for LCO now :mrgreen:
It's up to CCC organizers and admins i think.
I believe that LCO CPU vs Ethereal match would be interesting in Chess324 :roll:
Ethereal has NOT won one single game versus Dragon, and the percentage of Draws must be around 84% https://www.chess.com/computer-chess-championship
After 93 games in this playoff between the second and third place finishers in the main event, so far the score is 19 wins for Dragon to zero for Ethereal, which is a tad under 80% draws. This is a remarkable improvement over the 98+% draws in the 600 game FRC match a year ago between Dragon and Stockfish. It seems that chess324 is much less drawish than chess960 at this top level. Furthermore, the fact that Ethereal has not won a single game means that this excellent 20+% decisive rate is not due to any of the positions being easily won, at least none of the ones played so far.
I believe that the percentage of Draws between Stockfish Vs Dragon get lower with the near top 2 engines, probably to 72% with more decisive wins.