lkaufman wrote:Ozymandias wrote:lkaufman wrote:Ozymandias wrote:Back to those odds already? Too bad.
I'm guessing you would have preferred another match like the last one but with larger time odds? I didn't like this so much because making Komodo play much faster than 3' + 1" seems pointless, since hardly anyone uses it with just one or two seconds thinking time I imagine. Or did you have another idea in mind?
 
A faster TC for Komodo, a longer TC for the GM or a stronger human opponent. Any of those variations, or a viable combination, would give us a more even match.
There's also the option of using a popular phone, instead of a laptop.
 
U
We might try this sort of match if we get another top ten opponent. Increasing the human's time further isn't very practical for chess.com tv coverage, and I'm told that single core of laptop already falls within the range of cellphones, so using a cellphone wouldn't be much different. We don't want matches where the winner is not in doubt.
 
Against a 2600 GM, a fair match would be against a mid-range smartphone at 30''+0.3''. Who wants to see such a match?
I have another idea: takebacks. Is it too insulting to GM to allow him taking back? This would solve one prominent issue with humans, that a GM plays generally a very sound chess, but marred by blunders and large inaccuracies. I used "Andscacs Randomizer" engine, which can play a specified percentage of random moves in the total of moves, the rest being regular Andscacs, to compute the value of a blunder defined here as random move. If taken back, this improves the performance of the player by the resulting ELO points. Here are the results, I computed them several weeks ago:
Code: Select all
BLUNDERS (RANDOM MOVES)
Random: 0%   ELO:2700
100%:95%
Score of R2 vs R1: 334 - 36 - 30  [0.873] 400
ELO difference: 334.10 +/- 47.24
Finished match
PlyCount: Total = 400  Range: 3-283  Average = 105.47  StdDev = 36.14
Av.nr.blunders = 2.63675   
Av.ELO per blunder = 126.7
ELO = 2700
Takebacks: 4.7
Random: 5%   ELO:2476
95%:90%
Score of R2 vs R1: 306 - 79 - 15  [0.784] 400
ELO difference: 223.69 +/- 40.49
Finished match
PlyCount: Total = 400  Range: 16-311  Average = 92.0  StdDev = 34.36
Av.nr.blunders = 2.30   
Av.ELO per blunder = 97.3
ELO = 2476
Takebacks: 8.5
Random: 10%   ELO:2324
90%:85%
Score of R2 vs R1: 278 - 113 - 9  [0.706] 400
ELO difference: 152.39 +/- 37.02
Finished match
PlyCount: Total = 400  Range: 17-246  Average = 89.56  StdDev = 34.99
Av.nr.blunders = 2.24  
Av.ELO per blunder: 68.0
ELO = 2324
Takebacks: 14.3
Random: 15%   ELO:2185
85%:80%
Score of R2 vs R1: 269 - 117 - 14  [0.690] 400
ELO difference: 138.99 +/- 36.18
Finished match
PlyCount: Total = 400  Range: 9-282  Average = 90.23  StdDev = 37.57
Av.nr.blunders = 2.26   
Av.ELO per blunder: 61.5
ELO = 2185
Takebacks: 18.1
Random: 20%   ELO:1947
80%:70%
Score of R2 vs R1: 315 - 77 - 8  [0.797] 400
ELO difference: 238.12 +/- 42.06
Finished match
PlyCount: Total = 400  Range: 5-233  Average = 84.18  StdDev = 34.86
Av.nr.blunders = 4.21   
Av.ELO per blunder: 56.6
ELO = 1947
Takebacks: 23.9
Random: 30%   ELO:1784
70%:60%
Score of R2 vs R1: 275 - 100 - 25  [0.719] 400
ELO difference: 162.99 +/- 36.53
Finished match
PlyCount: Total = 400  Range: 5-247  Average = 91.68  StdDev = 43.77
Av.nr.blunders = 4.58
Av.ELO per blunder: 35.6
ELO = 1784
Takebacks: 42.6
A 2700 GM would need to take back thus defined blunders 5 times to reach 3250-3300 FIDE ELO level of Komodo on 24 cores. A 2476 GM would need 8-9 takebacks.  A 2324 player about 14 takebacks, A 2185 about 18 takebacks. So on, and a 1784 ELO player would need 43 takebacks. If the blunder (or large inaccuracy) is on average still better than random move, there is a multiplicaive factor on all these numbers of takebacks. One can probably determine pretty accurately what sort of handicap is needed for what strength, say if a ELO 2700 player needs 10 takebacks, then a 2200 player would need 35 takebacks, and a 1900 player needs to take back all moves at least once. 
To me it would make for good handicap matches because it eliminates one of the main human weaknesses - blunders an serious inaccuracies. If only GMs are not offended by this proposal, I don't know their tastes.