Stockfish vs Stockfish - its a position with 12 pieces. Engine A has 7 man EGTB. A makes a move that forces a draw - and B thinks its a blunder and agrees. Only that after the taking we are left with 7 pieces and B thinks its +0.5. Only A knows this was winning and now can convert to victory.Sesse wrote: ↑Sun Nov 07, 2021 9:28 pm I don't think you can make that conclusion.
The typical case is a drawn endgame with one or two pawns up. Stockfish, even with NNUE, will show a clear positive score; with tablebases, you'll get the appropriate message of a TB draw. Will no-TB Stockfish still be able to hold the draw? Sure! Would TB have been able to get into a non-drawn endgame? Probably not. (So there's no Elo to be had from TBs here, even though the evaluation is clearly wrong. The second simply does not follow from the first, because all that matters for Elo is whether there is a winning move and it has a higher score, not the absolute values of the evaluations.) Does this prove that Stockfish already has the required knowledge? Again, it depends on what your goal is.
There are Lots and Lots of 7 man positions stockfish 14 needs hours to see the end because depth to conversation might be 40+ and depth to mate 120 or so.
Maybe with 8 man it becomes 200 elo because there are infinitely more positions that look good even for nnue to not see the true evaluation.