I'm surprised to find the results very close between the one with the change and the base version. Has anyone tried this idea and have you noticed the same test results?
Here's my test results by the way:
Code: Select all
Games Completed = 1200 of 1200 (Avg game length = 58.391 sec)
Settings = Gauntlet/32MB/20000ms+200ms/M 1000cp for 12 moves, D 150 moves/EPD:D:\chess\tests\little_blitzer_2.6\NoomenCombined.epd
Time = 23653 sec elapsed, 0 sec remaining
1. Hannibal 20110819 540.5/1200 411-530-259 (L: m=396 t=0 i=0 a=134) (D: r=143 i=37 f=30 s=4 a=45) (tpm=448.5 d=13.9 nps=1230646)
2. spark-1.0 63.5/120 45-38-37 (L: m=13 t=0 i=0 a=25) (D: r=19 i=7 f=5 s=0 a=6) (tpm=383.0 d=12.1 nps=1783891)
3. Protector 1.4.0 x64 60.5/120 42-41-37 (L: m=11 t=0 i=0 a=30) (D: r=22 i=9 f=3 s=0 a=3) (tpm=454.0 d=11.9 nps=901816)
4. Spike 1.4 56.5/120 43-50-27 (L: m=11 t=0 i=0 a=39) (D: r=20 i=5 f=0 s=1 a=1) (tpm=453.3 d=12.4 nps=965890)
5. Gull 1.1 x64 57.0/120 41-47-32 (L: m=21 t=0 i=0 a=26) (D: r=18 i=3 f=3 s=1 a=7) (tpm=421.4 d=12.0 nps=2125374)
6. Gull 1.2 x64 67.5/120 51-36-33 (L: m=5 t=0 i=0 a=31) (D: r=19 i=0 f=8 s=1 a=5) (tpm=420.4 d=13.6 nps=1811320)
7. Critter 0.90 64-bit 68.5/120 59-42-19 (L: m=4 t=0 i=0 a=38) (D: r=12 i=4 f=1 s=0 a=2) (tpm=448.1 d=14.8 nps=1610554)
8. Stockfish 2.1.1 JA 64bit 52.5/120 40-55-25 (L: m=2 t=0 i=0 a=53) (D: r=15 i=4 f=3 s=0 a=3) (tpm=457.6 d=14.7 nps=1094947)
9. Komodo64 2.03 JA 77.5/120 67-32-21 (L: m=10 t=0 i=0 a=22) (D: r=7 i=1 f=3 s=1 a=9) (tpm=431.7 d=12.6 nps=1131603)
10. Critter 1.2 64-bit 70.5/120 63-42-15 (L: m=2 t=0 i=0 a=40) (D: r=7 i=2 f=2 s=0 a=4) (tpm=430.7 d=15.0 nps=1483919)
11. Houdini 1.5a x64 85.5/120 79-28-13 (L: m=0 t=0 i=0 a=28) (D: r=4 i=2 f=2 s=0 a=5) (tpm=385.4 d=13.0 nps=1676299)
Games Completed = 1200 of 1200 (Avg game length = 58.395 sec)
Settings = Gauntlet/32MB/20000ms+200ms/M 1000cp for 12 moves, D 150 moves/EPD:D:\chess\tests\little_blitzer_2.6\NoomenCombined.epd
Time = 25189 sec elapsed, 0 sec remaining
1. Hannibal 20110819x 533.5/1200 393-526-281 (L: m=399 t=0 i=0 a=127) (D: r=167 i=45 f=38 s=2 a=29) (tpm=449.9 d=13.9 nps=1162299)
2. spark-1.0 66.0/120 50-38-32 (L: m=14 t=0 i=0 a=24) (D: r=16 i=8 f=7 s=0 a=1) (tpm=389.9 d=11.9 nps=1652586)
3. Protector 1.4.0 x64 62.5/120 40-35-45 (L: m=13 t=0 i=0 a=22) (D: r=26 i=9 f=4 s=0 a=6) (tpm=446.6 d=11.9 nps=826591)
4. Spike 1.4 58.0/120 40-44-36 (L: m=11 t=0 i=0 a=33) (D: r=34 i=1 f=1 s=0 a=0) (tpm=453.5 d=12.3 nps=929081)
5. Gull 1.1 x64 64.0/120 49-41-30 (L: m=20 t=0 i=0 a=21) (D: r=22 i=3 f=2 s=0 a=3) (tpm=428.8 d=11.9 nps=2033459)
6. Gull 1.2 x64 73.5/120 58-31-31 (L: m=10 t=0 i=0 a=21) (D: r=16 i=4 f=6 s=0 a=5) (tpm=430.3 d=13.5 nps=1687940)
7. Critter 0.90 64-bit 65.5/120 50-39-31 (L: m=2 t=0 i=0 a=37) (D: r=21 i=6 f=4 s=0 a=0) (tpm=434.6 d=15.1 nps=1509492)
8. Stockfish 2.1.1 JA 64bit 55.0/120 44-54-22 (L: m=1 t=0 i=0 a=53) (D: r=12 i=5 f=4 s=1 a=0) (tpm=452.5 d=14.2 nps=1043395)
9. Komodo64 2.03 JA 77.0/120 65-31-24 (L: m=7 t=0 i=0 a=24) (D: r=11 i=5 f=5 s=0 a=3) (tpm=446.1 d=12.9 nps=1040259)
10. Critter 1.2 64-bit 68.0/120 58-42-20 (L: m=2 t=0 i=0 a=40) (D: r=6 i=4 f=3 s=1 a=6) (tpm=427.1 d=15.0 nps=1410248)
11. Houdini 1.5a x64 77.0/120 72-38-10 (L: m=1 t=0 i=0 a=37) (D: r=3 i=0 f=2 s=0 a=5) (tpm=385.2 d=12.8 nps=1569842)
One can notice that the 2 versions perform differently against the opponents. The 19x version performed better against stronger engines like Komodo, Critter1.2 and Houdini while performing worst against the weaker opponents. This suggests that maybe those strong engines are also doing this bad captures pruning/reduction ideas, though I might be wrong.
The difference in Elo is only 4 with the error bars at +-17. So which do you think is better, performing better against stronger engines or performing better against weaker engines?
Sometimes I consider to continue developing versions that perform a little bit worse than the best version when my intuition says that the idea has potential and just needs fine tuning.