IPON results for Houdini 4

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, Rebel, chrisw

Bram Visser
Posts: 52
Joined: Wed Oct 19, 2011 3:37 pm
Location: NL

Re: IPON results for Houdini 4

Post by Bram Visser »

Great to see a recent IPON list !
Daniel Shawul
Posts: 4185
Joined: Tue Mar 14, 2006 11:34 am
Location: Ethiopia

Re: IPON results for Houdini 4

Post by Daniel Shawul »

please post the links that show the tests where Stockfish murders Houdini 4. thx. (not TCEC)
This is me like saying please show me results where stockfish murders Scorpio 4. (not released yet). The only result of stockfish vs houdini 4 (9601) i know of is TCEC and that is a murder. We don't need obscure lists produced by begging the TD. If you look at the games in TCEC, most games of stockfish vs houdning ended very short. Houdini is a tactial amateur compared to Stockfish so changing version numbers is not going to hide that.
ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: IPON results for Houdini 4

Post by ouachita »

Ok then, where at talkchess.com is there a complete or in-progress test showing that SF 'murders" H4?
SIM, PhD, MBA, PE
Daniel Shawul
Posts: 4185
Joined: Tue Mar 14, 2006 11:34 am
Location: Ethiopia

Re: IPON results for Houdini 4

Post by Daniel Shawul »

It seems you either can't read or you do read but you can't comprehend it. Houdini 4 just got released so all we had so far is the closest version 9601 that was murdered in TCEC! There is no reason to expect otherwise since v4 is virtually the same version that played there. Changing version number is not change anything, since one can do daily updates of versions 4.1,4.2 to evade the inevitable, that is stockfish owns Houdini. As to previous version Houdini 3 vs stockfish results, all you have to do is look down threads below.
User avatar
Houdini
Posts: 1471
Joined: Tue Mar 16, 2010 12:00 am

Re: IPON results for Houdini 4

Post by Houdini »

Houdini wrote:Ingo Bauer was so kind to run the IPON rating list matches for the new Houdini 4.

The result is a +39 Elo increase compared to Houdini 3.
The new complete IPON rating list is given below.
People at the Rybka Forum asked for the individual match results, I might as well also publish them here.
Note that in the following list the Elo scale is calibrated differently, setting Houdini 3 at 3000 points.

Code: Select all

Houdini 4 - Komodo 6 (2971)                    89.5  -  60.5    59.67%    Perf=3039
Houdini 4 - Stockfish 4 (2947)                 88.5  -  61.5    59.00%    Perf=3010
Houdini 4 - Gull 2.2 (2908)                   106.5  -  43.5    71.00%    Perf=3063
Houdini 4 - Critter 1.4a (2907)               106.5  -  43.5    71.00%    Perf=3062
Houdini 4 - Deep Rybka 4.1 (2882)             110.5  -  39.5    73.67%    Perf=3060
Houdini 4 - Hannibal 1.4a (2798)              121.0  -  29.0    80.67%    Perf=3046
Houdini 4 - Chiron 1.5 (2779)                 125.5  -  24.5    83.67%    Perf=3062
Houdini 4 - Protector 1.5.0 (2771)            129.0  -  21.0    86.00%    Perf=3086
Houdini 4 - Naum 4.2 (2768)                   131.5  -  18.5    87.67%    Perf=3108
Houdini 4 - HIARCS 14 WCSC 32b (2747)         126.0  -  24.0    84.00%    Perf=3035
Houdini 4 - Deep Shredder 12 (2730)           135.5  -  14.5    90.33%    Perf=3118
Houdini 4 - Jonny 6.00 (2729)                 130.0  -  20.0    86.67%    Perf=3054
Houdini 4 - Deep Sjeng c't 2010 32b (2713)    126.0  -  24.0    84.00%    Perf=3001
Houdini 4 - Spike 1.4 32b (2707)              131.5  -  18.5    87.67%    Perf=3047
Houdini 4 - spark-1.0 (2695)                  138.5  -  11.5    92.33%    Perf=3127
Houdini 4 - Deep Junior 13.3 (2677)           137.0  -  13.0    91.33%    Perf=3086
Houdini 4 - Booot 5.2.0 (2674)                138.5  -  11.5    92.33%    Perf=3106
Houdini 4 - Quazar 0.4 (2665)                 139.5  -  10.5    93.00%    Perf=3114
Houdini 4 - Zappa Mexico II (2652)            139.0  -  11.0    92.67%    Perf=3092
Houdini 4 - Toga II 3.0 32b (2643)            138.5  -  11.5    92.33%    Perf=3075
                                             2488.5  - 511.5    82.95%    Perf=3042
ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: IPON results for Houdini 4

Post by ouachita »

LOL, I can both read and comprehend; but your rants notwithstanding, I and others here are looking for you to show us where the past or current testing of SF v. H3/H4 v. K shows or portends to show that SF is the demonstrably superior or even the equal STC or LTC engine. The tests at TCEC or currently underway at this site do not support your rants.

I also invite Marco or anyone else to show same. I am a correspondence player with no dog in this fight, rather I'm only a player in search of the best LTC (>=120) engine. Actually >500 would be better for me.
SIM, PhD, MBA, PE
Daniel Shawul
Posts: 4185
Joined: Tue Mar 14, 2006 11:34 am
Location: Ethiopia

Re: IPON results for Houdini 4

Post by Daniel Shawul »

The tests at TCEC or currently underway at this site do not support your rants.
Stuck on stupid? Houdini latest version (9601) got disqualified from TCEC in a rather pathetic manner, 3 losses out of 6 to stockfish, and in 30 moves or so like a patzer :) Once you comprehend that, look at the the threads below you for Houdini 3 results, that are even more hilarious.
ouachita
Posts: 454
Joined: Tue Jan 15, 2013 4:33 pm
Location: Ritz-Carlton, NYC
Full name: Bobby Johnson

Re: IPON results for Houdini 4

Post by ouachita »

what does this show?: http://www.talkchess.com/forum/viewtopi ... 8&start=40

what does this show?: http://www.talkchess.com/forum/viewtopi ... 04&start=0

what does this show?: http://www.talkchess.com/forum/viewtopic.php?t=50289

what does this show?: http://cegt.forumieren.com/t43-testing-stockfish-dd-x64

Are these SF "murders?" no, lol! I love this stuff. Get back to me.
SIM, PhD, MBA, PE
lkaufman
Posts: 5960
Joined: Sun Jan 10, 2010 6:15 am
Location: Maryland USA

Re: IPON results for Houdini 4

Post by lkaufman »

ouachita wrote:LOL, I can both read and comprehend; but your rants notwithstanding, I and others here are looking for you to show us where the past or current testing of SF v. H3/H4 v. K shows or portends to show that SF is the demonstrably superior or even the equal STC or LTC engine. The tests at TCEC or currently underway at this site do not support your rants.

I also invite Marco or anyone else to show same. I am a correspondence player with no dog in this fight, rather I'm only a player in search of the best LTC (>=120) engine. Actually >500 would be better for me.
Based on the results posted here so far, the fairest thing to say about Houdini 4 and StockfishDD (or any November version) is that Houdini is clearly stronger in blitz, while at standard chess they are too close to call. A. Huerga's new LTC list, based on at least 600 games per engine, has Stockfish 5 elo points ahead of Houdini 4, well within the error margin. These results do show that Stockfish scales better than Houdini, so I would probably use SF over Houdini for analysis (i.e. for second opinion after Komodo!), but the jury is still out.
Vinvin
Posts: 5228
Joined: Thu Mar 09, 2006 9:40 am
Full name: Vincent Lejeune

Re: IPON results for Houdini 4

Post by Vinvin »

Bram Visser wrote:Great to see a recent IPON list !
But Stockfish DD is missing :-/