Uri's Challenge : TwinFish

Tennison · Post by **Tennison** » Fri Jan 31, 2014 9:46 am

Uri Blass wrote:
lucasart wrote:I can make a few trivial changes to Stockfish and pass the similarity tests, any day!
Then please do it and release the source.
It may be interesting to know how much elo you lose for it and whether the engine you get is stronger than DiscoCheck (note that 60% is not enough; you need a similarity below 55%).

TwinFish 0.07

The similarity is less than 55% and the elo loss is only about 70-80.

This version of TwinFish is based on Stockfish dev 14 01 29 6:02PM (timestamp: 1391014933).

The only change made to reach a "<55%" similarity is a completely asymmetric PST (piece-square table), based on Adam Hair's values.
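As a rough illustration of what an asymmetric PST means, here is a minimal sketch. It assumes "asymmetric" means that a square and its left-right mirror square get different values. The numbers and names (`SYMMETRIC_RANK`, `ASYMMETRIC_RANK`, `is_file_symmetric`) are hypothetical and are not taken from Stockfish, TwinFish or Adam Hair's tables.

```python
# Hypothetical knight piece-square values for one rank, files a..h.
# Illustrative numbers only; not Stockfish's, TwinFish's or Adam Hair's values.
SYMMETRIC_RANK = [-20, -5, 5, 10, 10, 5, -5, -20]
ASYMMETRIC_RANK = [-20, -8, 3, 12, 9, 7, -2, -25]


def is_file_symmetric(rank_values):
    """Return True if every file has the same value as its mirror file."""
    return all(rank_values[f] == rank_values[7 - f] for f in range(8))


def pst_score(rank_values, file_index):
    """Look up the positional bonus for a piece on the given file (0 = a, 7 = h)."""
    return rank_values[file_index]


# With the symmetric table, b-file and g-file knights score the same;
# with the asymmetric one they do not, which can change the move the engine prefers.
print(pst_score(SYMMETRIC_RANK, 1), pst_score(SYMMETRIC_RANK, 6))
print(pst_score(ASYMMETRIC_RANK, 1), pst_score(ASYMMETRIC_RANK, 6))
```

Small per-square differences like these are enough to shift move choices in quiet positions, which is presumably why the similarity figure reacts so strongly to them.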

If you want to see the changes, just search for "Robber" in the source files.

There is only the source code, no binary.
If someone wants to compile good binaries, that would be nice.

Don't forget: this version is only a joke, and I am not stealing Stockfish!

I'm very interested to see the result in the similarity dendrogram now!
According to Don's Similarity Tester, TwinFish 0.07 is more closely related to Toga Hair than to Stockfish. And there is no code from Toga in it!!!

Michel · Post by **Michel** » Fri Jan 31, 2014 11:35 am

The similarity is less than 55% and the elo loss is only about 70-80.

Although this is a laudable attempt, it is not really conclusive, since cloners would never accept a 70-80 elo loss. The challenge is to do it without elo loss.

Tennison · Post by **Tennison** » Fri Jan 31, 2014 11:38 am

No, Michel! Read Uri's challenge...

I just want to prove that it is not difficult to bypass the similarity tester.

But if you want no elo loss, then it's another challenge! (And a much more difficult one than the first.)

And 70-80 elo below the latest dev Stockfish is nearly the same elo as Stockfish 4! It's still nice!

Rebel · Post by **Rebel** » Fri Jan 31, 2014 12:05 pm

And so we are witnessing the death of the similarity tester. Now that the cat is out of the bag I can confirm Ben's findings. During the PST thread in the programmers' forum I did some experiments with several of the posted PSTs and piece values, and indeed they dreadfully bring down the similarity percentage without too much elo loss (20-30).

So folks be aware, cloners will find out anyway.

ThomasJMiller · Post by **ThomasJMiller** » Fri Jan 31, 2014 12:17 pm

Can you please make the exe available so we can test it?

lucasart · Post by **lucasart** » Fri Jan 31, 2014 12:24 pm

My point is that even the least competent developer can make an SF clone and pass the similarity test by butchering the eval. I never said anything about not losing elo or losing no more than 70-80 elo. Of course, if you carelessly butcher the eval, you lose elo, because SF is very well tuned.

With this TwinFish you not only prove my point, but prove an even stronger version of my point, quantifying the maximum elo loss. Basically, using the similarity test to declare a closed source engine clean is simply naive.
Until the source code is revealed, nothing proves that a closed source engine contains no foreign code.

At least going open source means you have nothing to hide. It still puzzles me why people develop private engines (so you can't even run the similarity test?) or closed source engines when they are hundreds of elo below the top engines. Why do they fear to show us their code?

Adam Hair · Post by **Adam Hair** » Fri Jan 31, 2014 12:30 pm

Rebel wrote: And so we are witnessing the death of the similarity tester. Now that the cat is out of the bag I can confirm Ben's findings. During the PST thread in the programmers' forum I did some experiments with several of the posted PSTs and piece values, and indeed they dreadfully bring down the similarity percentage without too much elo loss (20-30).

So folks be aware, cloners will find out anyway.

That is okay. There are more productive things that can be done using the tester and Polyglot. After all, the tester is just a mechanism to send positions and UCI commands to an engine. You can use the tester (with Polyglot to create logs) to evaluate large sets of positions with different engines. There are many things that can be measured, not just engine similarities.
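To make the percentages in this thread concrete, here is a minimal sketch of a move-matching similarity score. It assumes the similarity figure is simply the share of test positions on which two engines choose the same move; that is an assumption about the tester, and the function name `similarity_percent` and the move lists are hypothetical.

```python
def similarity_percent(moves_a, moves_b):
    """Percentage of positions on which two engines chose the same move."""
    if len(moves_a) != len(moves_b) or not moves_a:
        raise ValueError("need two equal-length, non-empty move lists")
    matches = sum(a == b for a, b in zip(moves_a, moves_b))
    return 100.0 * matches / len(moves_a)


# Hypothetical best moves (UCI notation) returned for the same five positions.
engine_a = ["e2e4", "g1f3", "d2d4", "c2c4", "e1g1"]
engine_b = ["e2e4", "g1f3", "d2d4", "b1c3", "f1e1"]

print(similarity_percent(engine_a, engine_b))  # 3 of 5 moves match
```

The same loop could collect other per-position data from the logs (scores, depths, time used) instead of just the chosen move.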

pocopito · Post by **pocopito** » Fri Jan 31, 2014 12:35 pm

I don't know if it's a rhetorical question, but some months ago I opened a poll asking the same question.

Right now I'm on my tablet, but later I can search for it and paste the link.

Laskos · Post by **Laskos** » Fri Jan 31, 2014 1:31 pm

Rebel wrote: And so we are witnessing the death of the similarity tester. Now that the cat is out of the bag I can confirm Ben's findings. During the PST thread in the programmers' forum I did some experiments with several of the posted PSTs and piece values, and indeed they dreadfully bring down the similarity percentage without too much elo loss (20-30).

So folks be aware, cloners will find out anyway.

Still, no false positives with Sim, only false negatives.

Rebel · Post by **Rebel** » Fri Jan 31, 2014 1:33 pm

lucasart wrote: With this TwinFish you not only prove my point,

But you never had a point.

Engines that show a 65+% similarity are derived; you don't need the source code to tell, and that still stands.

Until the source code is revealed, nothing proves that a closed source engine contains no foreign code.

Sure, never claimed otherwise.

The tool is useless for proving an engine original.

See the difference now?
