Yes that is absolutely right. With a small number of games, surprises can and do happen. That, and there is a lack of other tournaments on such high-end hardware and long time control to compare to. But at least it is better than the ICGA WCCC, and a lot of fun to follow.IGarcia wrote: The point is to be clear (without any been offended) is the tournament was not conclusive. There are some people out there telling SF DD and Komodo nTCEC2 are stronger than Houdini 4, using your tournament as back proof.
IPON results for Houdini 4
Moderators: hgm, Rebel, chrisw
-
- Posts: 3546
- Joined: Thu Jun 07, 2012 11:02 pm
Re: IPON results for Houdini 4
-
- Posts: 2434
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: IPON results for Houdini 4
Perhaps take a look at the LS-ratinglist, where each engine plays only against 10 real strong opponents and the score of Houdini 4 was "only" 67%...And each individual match contains 1000 games and can be seen in the crosstable of the LS-top10-tournament.Houdini wrote:To be honest, I like to see these results as well - it gives a nice idea about the general consistency of the individual matches.IWB wrote:But still I have no idea why there is interested in a one on one match with just 150 games?
Bye
Ingo
Looking at the over-all result of nearly 83% that Houdini 4 scores, it becomes clear that we're reaching a limit of the rating list. Next version should score over 85% against the same opponents, that's becoming too much...
Robert
Stefan
http://ls-ratinglist.beepworld.de
-
- Posts: 1471
- Joined: Tue Mar 16, 2010 12:00 am
Re: IPON results for Houdini 4
The LS-Top10 rating list has a different issue: there is too little filtering of participants. The "Top 10" list includes Strelka 5 (=Houdini 1.5), Amitis (=Stockfish) and Bouquet/Pan Chess/Mars/Robbolito (=Ivanhoe).pohl4711 wrote:Perhaps take a look at the LS-ratinglist, where each engine plays only against 10 real strong opponents and the score of Houdini 4 was "only" 67%...And each individual match contains 1000 games and can be seen in the crosstable of the LS-top10-tournament.
Stefan
http://ls-ratinglist.beepworld.de
You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
-
- Posts: 2554
- Joined: Fri Nov 26, 2010 2:00 pm
- Location: Czech Republic
- Full name: Martin Sedlak
Re: IPON results for Houdini 4
Exactly, there are too many clones/derivatives (including Houdini of course). I only counted 5 or 6 original engines so maybe "clone rating list" would be more appropriate.Houdini wrote:You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
-
- Posts: 2434
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: IPON results for Houdini 4
A derivative is not a 100% clone. Of course there are similarities, but the engines are not identical.Houdini wrote:The LS-Top10 rating list has a different issue: there is too little filtering of participants. The "Top 10" list includes Strelka 5 (=Houdini 1.5), Amitis (=Stockfish) and Bouquet/Pan Chess/Mars/Robbolito (=Ivanhoe).pohl4711 wrote:Perhaps take a look at the LS-ratinglist, where each engine plays only against 10 real strong opponents and the score of Houdini 4 was "only" 67%...And each individual match contains 1000 games and can be seen in the crosstable of the LS-top10-tournament.
Stefan
http://ls-ratinglist.beepworld.de
You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
Who can decide, how much similarity is allowed and how much is too much?
As long as an engine is not a 100% clone, I wil not ignore it.
Stefan
-
- Posts: 1600
- Joined: Mon Feb 21, 2011 9:48 am
Re: IPON results for Houdini 4
Houdini wrote: You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
Even Robbodini remember ......
how you admitted decompile his Houdini 3...
and recompiled on Robbolitto ....
This was the funny part of the day ...
No shame gives you ...
-
- Posts: 543
- Joined: Mon Jul 05, 2010 10:27 pm
Re: IPON results for Houdini 4
Besides you are correct pointing the mistake of allowing very "similar" programs its funny the way you do.Houdini wrote:The LS-Top10 rating list has a different issue: there is too little filtering of participants. The "Top 10" list includes Strelka 5 (=Houdini 1.5), Amitis (=Stockfish) and Bouquet/Pan Chess/Mars/Robbolito (=Ivanhoe).pohl4711 wrote:Perhaps take a look at the LS-ratinglist, where each engine plays only against 10 real strong opponents and the score of Houdini 4 was "only" 67%...And each individual match contains 1000 games and can be seen in the crosstable of the LS-top10-tournament.
Stefan
http://ls-ratinglist.beepworld.de
You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
The fact you are selling H2, H3, H4 and nobody is suing you, does not make your engine original. It has been marked as controversial (along with Rybka and other engines) in CCRL and by community in general.
Its funny to see your writings against "clones", where several people thinks H4 is an abomination as same you called robbodini. Probably the strongest abomination today.
Regards.
-
- Posts: 10948
- Joined: Wed Jul 26, 2006 10:21 pm
- Full name: Kai Laskos
Re: IPON results for Houdini 4
Houdini 4 diverged quite a bit from Robbollito, if ICGA will ever set the 60% rule on Sim tester, Houdini will pass it. The only obstacles Houdini has are Strelka and Critter, but the pre-eminence is clear looking at their release dates. Can you guys stop being so arduous justiciaries? Already Rybka 4 was banned for a sin with Rybka 1, now you will cry "fault" on Houdini 7 based on Houdini 1.00?IGarcia wrote:Besides you are correct pointing the mistake of allowing very "similar" programs its funny the way you do.Houdini wrote:The LS-Top10 rating list has a different issue: there is too little filtering of participants. The "Top 10" list includes Strelka 5 (=Houdini 1.5), Amitis (=Stockfish) and Bouquet/Pan Chess/Mars/Robbolito (=Ivanhoe).pohl4711 wrote:Perhaps take a look at the LS-ratinglist, where each engine plays only against 10 real strong opponents and the score of Houdini 4 was "only" 67%...And each individual match contains 1000 games and can be seen in the crosstable of the LS-top10-tournament.
Stefan
http://ls-ratinglist.beepworld.de
You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
The fact you are selling H2, H3, H4 and nobody is suing you, does not make your engine original. It has been marked as controversial (along with Rybka and other engines) in CCRL and by community in general.
Its funny to see your writings against "clones", where several people thinks H4 is an abomination as same you called robbodini. Probably the strongest abomination today.
Regards.
-
- Posts: 2129
- Joined: Thu May 29, 2008 10:43 am
Re: IPON results for Houdini 4
Not sure why Robert is complaining...Laskos wrote:Houdini 4 diverged quite a bit from Robbollito, if ICGA will ever set the 60% rule on Sim tester, Houdini will pass it. The only obstacles Houdini has are Strelka and Critter, but the pre-eminence is clear looking at their release dates. Can you guys stop being so arduous justiciaries? Already Rybka 4 was banned for a sin with Rybka 1, now you will cry "fault" on Houdini 7 based on Houdini 1.00?IGarcia wrote:Besides you are correct pointing the mistake of allowing very "similar" programs its funny the way you do.Houdini wrote:The LS-Top10 rating list has a different issue: there is too little filtering of participants. The "Top 10" list includes Strelka 5 (=Houdini 1.5), Amitis (=Stockfish) and Bouquet/Pan Chess/Mars/Robbolito (=Ivanhoe).pohl4711 wrote:Perhaps take a look at the LS-ratinglist, where each engine plays only against 10 real strong opponents and the score of Houdini 4 was "only" 67%...And each individual match contains 1000 games and can be seen in the crosstable of the LS-top10-tournament.
Stefan
http://ls-ratinglist.beepworld.de
You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
The fact you are selling H2, H3, H4 and nobody is suing you, does not make your engine original. It has been marked as controversial (along with Rybka and other engines) in CCRL and by community in general.
Its funny to see your writings against "clones", where several people thinks H4 is an abomination as same you called robbodini. Probably the strongest abomination today.
Regards.
just like him, others are working on derivatives and trying to find improvements with the Ippolit source code.
Clearly he has had great success in that endeavor...but that does not give him some sort of exclusive 'entitlement' to it
PS I seem to remember from Adams sim tester chess pages that Houdini 1.0 matches Robbolito more than 70% (an extremely high score)
I don't have info on version 2.0, 3, and 4,
but IMO it doesn't really matter now
-
- Posts: 10948
- Joined: Wed Jul 26, 2006 10:21 pm
- Full name: Kai Laskos
Re: IPON results for Houdini 4
Houdini 3 had 57% with Robbolito, IIRC. It had 61% with Critter and 64% with Strelka, but these are results of RE of Houdini 1.5. I guess H4 has even less with Robbo.kranium wrote:Not sure why Robert is complaining...Laskos wrote:Houdini 4 diverged quite a bit from Robbollito, if ICGA will ever set the 60% rule on Sim tester, Houdini will pass it. The only obstacles Houdini has are Strelka and Critter, but the pre-eminence is clear looking at their release dates. Can you guys stop being so arduous justiciaries? Already Rybka 4 was banned for a sin with Rybka 1, now you will cry "fault" on Houdini 7 based on Houdini 1.00?IGarcia wrote:Besides you are correct pointing the mistake of allowing very "similar" programs its funny the way you do.Houdini wrote:The LS-Top10 rating list has a different issue: there is too little filtering of participants. The "Top 10" list includes Strelka 5 (=Houdini 1.5), Amitis (=Stockfish) and Bouquet/Pan Chess/Mars/Robbolito (=Ivanhoe).pohl4711 wrote:Perhaps take a look at the LS-ratinglist, where each engine plays only against 10 real strong opponents and the score of Houdini 4 was "only" 67%...And each individual match contains 1000 games and can be seen in the crosstable of the LS-top10-tournament.
Stefan
http://ls-ratinglist.beepworld.de
You have "solved" the issue of average strength of opponents by letting the same programs play multiple times.
The fact you are selling H2, H3, H4 and nobody is suing you, does not make your engine original. It has been marked as controversial (along with Rybka and other engines) in CCRL and by community in general.
Its funny to see your writings against "clones", where several people thinks H4 is an abomination as same you called robbodini. Probably the strongest abomination today.
Regards.
just like him, others are working on derivatives and trying to find improvements with the Ippolit source code.
Clearly he has had great success in that endeavor...but that does not give him some sort of exclusive 'entitlement' to it
PS I seem to remember from Adams chess pages that Houdini 1.0 matches Robbolito more than 70%
I don't have info on version 2.0, 3, and 4
Sorry, my opinion is that in those tests the literal clones can be substituted by multiple copies of IvanHoe, Iggorit, SF, whatever open source they took. They are almost in minute details all the same (if not even weakened).