Cursed win at TCEC

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

mwyoung
Posts: 2727
Joined: Wed May 12, 2010 10:00 pm

Re: Cursed win at TCEC

Post by mwyoung »

syzygy wrote:
mwyoung wrote:
hgm wrote:
mwyoung wrote:Because I said dubious opens. As in not sound.
I still don't see why you think that would skew the results, if each engine has to play both sides. In the worst case the start position is a certain win, and then they would just get a 1-1 on that line, and it is as if the game was never played, and there were just fewer games.
And I don't see why you don't think that it does skew results.

And the games are played and do count in TCEC.

Lets make an absurd example to show the point. I play GM Carlsen a 6 game match with 6 fixed and very unsound openings. So unsound I am able to win with white every game. And so is GM Carlsen.
And that is exactly what is not happening in TCEC.
I respect your opinion.

And I will let TCEC make my point.

From TCEC themselves on the opening book choices played in TCEC season 9.

"For this season I plan to deviate from what I have done in the past. Instead of providing balanced positions that always favor white I will come up with a set of positions that may, in some cases, provoke outcries of alarm. Keep in mind the engines get to play both sides and optimal chess theory isn’t the goal here—we’re trying to produce an outcome that puts the most deserving two engines into the final. Expect the unusual and more blood on the board than we’ve seen this late in the tournament in recent seasons!"
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
bnemias
Posts: 373
Joined: Thu Aug 14, 2008 3:21 am
Location: Albuquerque, NM

Re: Cursed win at TCEC

Post by bnemias »

MikeB wrote:it's clearly one of those cases where the obvious solution is so clear and recognizable by most people
The solution in THIS case is clear, amend the result so the adjudication metric aligns with the metric the engines were fed.

The wider TCEC solution isn't so clear because it is possible to have one engine using Gaviota while the other is using Syzygy (or no TB). Luckily we don't have that case, because I doubt it is possible to adjudicate that case fairly under the current rules.

However, recognizing the possibility of the latter is important in preventing this problem in subsequent seasons.
Norm Pollock
Posts: 1056
Joined: Thu Mar 09, 2006 4:15 pm
Location: Long Island, NY, USA

Re: Cursed win at TCEC

Post by Norm Pollock »

Here are the rules about adjudicating wins:

Code: Select all

 It will adjudicate as won for one side if both playing engines have an eval of at least 6.50 pawns (or -6.50 in case of a black win) for 4 consecutive moves, or 8 plies - this rule is in effect as soon as the game starts. In the website this rule is shown as "TCEC win rule" with a number indicating how many plies there are left until it kicks in. Cutechess will also adjudicate 5-men or less tablebase endgame positions automatically.
Both rules are there to save time and get on with the next game.

The problem that occurred in the finals showed that the second rule is defective. So the simple solution is to remove the last sentence. There will still be one rule left to adjudicate a win.
Dirt
Posts: 2851
Joined: Wed Mar 08, 2006 10:01 pm
Location: Irvine, CA, USA

Re: Cursed win at TCEC

Post by Dirt »

bnemias wrote:The wider TCEC solution isn't so clear because it is possible to have one engine using Gaviota while the other is using Syzygy (or no TB). Luckily we don't have that case, because I doubt it is possible to adjudicate that case fairly under the current rules.
Nonsense. Syzygy is always right as far as I know. Gaviota can be wrong. If an engine uses Gaviota it has to expect it being wrong sometimes.
Deasil is the right way to go.
bnemias
Posts: 373
Joined: Thu Aug 14, 2008 3:21 am
Location: Albuquerque, NM

Re: Cursed win at TCEC

Post by bnemias »

Norm Pollock wrote:

Code: Select all

... Cutechess will also adjudicate 5-men or less tablebase endgame positions automatically.
...

The problem that occurred in the finals showed that the second rule is defective.
I don't think the 2nd rule is defective. The problem (in this case) is not adjudication per se, but rather that the adjudication was based on different TB than the engines were using.

As I mentioned above, we don't have the more difficult case here, where one side uses Gaviota and the other uses one (or none) that obeys the 50 move rule.

But in this case, the 50 move rule doesn't matter wrt adjudication because both engines were using TB that said it was a draw, and the tournament software used a different set that said it was a win for one side.
bnemias
Posts: 373
Joined: Thu Aug 14, 2008 3:21 am
Location: Albuquerque, NM

Re: Cursed win at TCEC

Post by bnemias »

Dirt wrote:
bnemias wrote:The wider TCEC solution isn't so clear because it is possible to have one engine using Gaviota while the other is using Syzygy (or no TB). Luckily we don't have that case, because I doubt it is possible to adjudicate that case fairly under the current rules.
Nonsense. Syzygy is always right as far as I know. Gaviota can be wrong. If an engine uses Gaviota it has to expect it being wrong sometimes.
Not nonsense. The point you are making is very debatable, because it depends if you want to apply the 50 move rule to these games. That question is a different matter, and frankly, I'm not even sure where I stand on it. But that's another thread.

[edit] nvm. I get where you are coming from after rereading your post. Yeah, if you accept that the tournament adheres to the 50 move rule, then Gaviota is indeed sometimes wrong.
Last edited by bnemias on Thu Nov 17, 2016 3:28 am, edited 1 time in total.
Norm Pollock
Posts: 1056
Joined: Thu Mar 09, 2006 4:15 pm
Location: Long Island, NY, USA

Re: Cursed win at TCEC

Post by Norm Pollock »

I consider the rule "defective" in the sense that it deviated from its purpose. Its purpose was to determine the natural outcome without intervention, and impose that outcome to save time. Instead of determining the natural outcome that the engines would eventually determine on their own, it imposed a different outcome.
rabbits23
Posts: 144
Joined: Fri Sep 12, 2014 4:57 am
Location: Randwick Australia

Re: Cursed win at TCEC

Post by rabbits23 »

Good one Patrik!
Allan
Lyudmil Tsvetkov
Posts: 6052
Joined: Tue Jun 12, 2012 12:41 pm

fortress_draw_rule

Post by Lyudmil Tsvetkov »

I especially enjoy fights 1 vs 20 or 30, usually come on top, and usually that means I am rigth.

I have never until now seen so many low-thinking people posting into a single thread. Really astounding! I think the probability this actually happens is somewhere 1 to a zillion, so we have just witnessed an extraordinary phenomenon.

The easiest thing to do to resolve the situation is of course for engines to implement a more general fortress draw rule, for example in the following way:

int fdr=0; // fortress draw rule
makemove(); fdr++; if (move(capture) || move(promotion) || move(pawnmove) || move(castling) ) fdr=0;

void fdrule() // subroutine for fortress draw rule scoring
{if (pawncount(myside)==0 && pawncount(opponentside)==0)
{if (fdr>=200) score=0;} // providing for 70% of the most common endgames falling under that rule, but if you want, as Greg pointed, you might go as far as 1100

else {if (fdr>=100) score=0;}

}

I guess with a similar code the problem is easily solved. Interesting, who will be the first author to implement such a change in his engine?

Concerning the real TCEC game, this is quite similar to FIDE rules concerning the situation when a player has been flagged, but he has just delivered his opponent a mate. In this case, although the flag of the player is already down, thus losing on time, he still wins the game, fully according to FIDE rules, as there is mate on the board, which overrides flagging.

Very similarly, based on TCEC special draw rule, the game was declared drawn, but, when Cutechess adjudicated the tablebase position, it adjudicated a win, as the position is simply won. An existing win on the board, which is the case in current game, actually an existing mate, certainly overrides the special TCEC draw rule.

A win is a win, that is very simple, it would be more than a sin to adjudicate a winning position as a draw.
syzygy
Posts: 5566
Joined: Tue Feb 28, 2012 11:56 pm

Re: Cursed win at TCEC

Post by syzygy »

mwyoung wrote:
syzygy wrote:
Lets make an absurd example to show the point. I play GM Carlsen a 6 game match with 6 fixed and very unsound openings. So unsound I am able to win with white every game. And so is GM Carlsen.
And that is exactly what is not happening in TCEC.
I respect your opinion.

And I will let TCEC make my point.
It is not an opinion but a fact. There have not been any 1-1 openings so far.

Of course we might yet get openings that are so advantageous for one side that 1-1 is inevitable. But as hgm explained, even if that happens there is no damage. It will be as if TCEC had one round less. There would be no actual skew.