Madeleine Birchfield wrote: ↑Thu Nov 25, 2021 2:38 pm
Uri Blass wrote: ↑Wed Nov 24, 2021 11:38 pm
Madeleine Birchfield wrote: ↑Wed Nov 24, 2021 2:43 pm
Uri Blass wrote: ↑Wed Nov 24, 2021 7:16 am
I disagree and for me it is not understanding.
For me understanding is something that you can explain why and not hide behind the words:"neural network"
If your engine have a static evaluation of +3 for white and you have no idea why because from your point of view the position is equal then it is not a positional understanding from my point of view.
I hate seeing a huge score without understanding the reason for it and I believe that this huge score is not good from practical point of view because in a game there is a big probablity that the opponent will have no idea how to translate the huge score to victory so from my point of view old engines are better for analysis in many positions because they can give a move that they do not understand that it is losing so there is a good chance that the human opponent is not going to understand how to win and not the new engine that believe I am playing against super players and if everything lose can suggest me a move that gives no practical chances.
That is all equally applicable for any closed source engine, where the average user does not have any access to the source code of the engine. The evaluation function is still a black box, regardless of whether it uses a neural network or only handcrafted evaluation, and the average user simply cannot explain anything beyond hiding behind the words "evaluation function".
And that is even true of open source engines; most people do not have the time or resources to open Stockfish's source code and see how its evaluation function works, so for most people, the evaluation function is functionally a black box to them as well.
There are many positions that handcrafted evaluation using engines would evaluate at +3 but any semi-decent chess player would tell you is an equal position.
There is a big difference between a chess engine or chess player understanding a position and a user/developer understanding how the evaluation function works in an engine, and the former is far more important to the average chess player than the latter. Grandmasters these days like Carlsen, Caurana, and Dubov have all said at some point that after the top engines adopted neural networks, the engines were able to understand positions found in i.e. the French and the KID much better than before. No grandmaster cares about how Stockfish's handcrafted evaluation works.
My experience is that I have more misunderstanding of evaluations of the new engines.
With the old engines cases when the engine said +5 when I had no idea why were rare.
With the new engines cases when the engine say +5 when I have no idea why are common.
I suspect that if you want to predict result of a game between humans with fide rating of 2000 then evaluation of fruit2.1 after searching to depth 10 when you translate it to expected result is going to give you a better prediction than evaluation of the new engines regardless of depth and it may be interesting to check it.
As mvanthoor said, I think that has more to do with the fact that newer engines have stronger searches that allow it to search more deeply than older engines.
I think no and I can give examples for huge scores that I do not understand even at small depths.
[fen]r1b2rk1/bp3p2/p1pp1q1p/4pNp1/3PPnP1/1BP1BQP1/PP3P2/R4RK1 b - - 0 19[/fen]
Stockfish already see more than +3 at small depths when fruit does not see a decisive advantage for white.
From human point of view it is not an endgame material is equal and there are too many pieces on the board to be sure about something.
Stockfish_14.1_win_x64_avx2:
NNUE evaluation using nn-13406b1dcbe0.nnue enabled
1/4 00:00 932 932k -1.24 Bc8xf5 g4xf5 e5xd4
2/2 00:00 2k 1,518k -1.24 Bc8xf5 g4xf5
3/3 00:00 2k 2,280k -2.84 Bc8xf5 g4xf5 e5xd4
4/5 00:00 6k 3,104k -2.21 Nf4-d3 Kg1-g2
5/7 00:00 11k 5,445k -3.47 Nf4-d3 Ra1-d1 Nd3xb2 Rd1-d2 Bc8xf5
6/6 00:00 14k 4,790k -3.47 Nf4-d3 Ra1-d1 Nd3xb2 Rd1-d2 Bc8xf5 g4xf5
7/7 00:00 34k 6,861k -3.73 Nf4-d3 Ra1-d1 Bc8xf5 g4xf5 e5xd4 c3xd4 Nd3xb2
8/10 00:00 60k 8,538k -3.73 Nf4-g6 Ra1-d1 Bc8xf5 g4xf5 e5xd4 c3xd4 Ng6-e7 Kg1-g2 d6-d5
9/12 00:00 143k 7,512k -3.87 Bc8xf5 g4xf5 e5xd4 c3xd4 Nf4-g6 Ra1-d1 Rf8-e8 Bb3-c2 Ng6-f8 Kg1-g2 Nf8-d7 Rf1-h1
10/15 00:00 208k 8,010k -4.19 Bc8xf5 g4xf5 e5xd4 c3xd4 Nf4-g6 Ra1-d1 Rf8-e8 Kg1-g2 Ng6-e7 Bb3-c2 c6-c5 e4-e5 d6xe5 d4xc5
11/13 00:00 296k 7,585k -4.35 Bc8xf5 g4xf5 e5xd4 c3xd4 Nf4-g6 Qf3-h5 Ng6-e7 Bb3-c2 Kg8-g7 Ra1-d1 c6-c5 f2-f4 g5xf4 Be3xf4
12/13 00:00 344k 7,645k -4.40 Bc8xf5 g4xf5 Nf4-g6 Qf3-h5 Ng6-e7 Bb3-c2 e5xd4 c3xd4 Kg8-g7 Ra1-d1 c6-c5 f2-f4 g5xf4
13/19 00:00 791k 7,601k -4.33 Bc8xf5 g4xf5 c6-c5 d4xc5 d6xc5 Rf1-d1 Ra8-d8 a2-a4 Qf6-g7 g3xf4 e5xf4
14/18 00:00 864k 7,718k -3.97 Bc8xf5 g4xf5 c6-c5 d4xe5 d6xe5 Ra1-d1 Kg8-h7 Rd1-d7 c5-c4 Bb3xc4 g5-g4 Qf3xg4 Ba7xe3 f2xe3 Rf8-g8
15/26 00:00 1,702k 7,953k -4.13 Bc8xf5 g4xf5 e5xd4 c3xd4 Nf4-g6 Bb3-c2 Ng6-e7 Qf3-h5 Rf8-e8 Ra1-d1 c6-c5 e4-e5 d6xe5 d4xe5 Qf6xe5 Rd1-d7
16/27 00:00 2,102k 7,842k -4.17 Bc8xf5 g4xf5 Nf4-g6 Qf3-h5 Ng6-e7 Kg1-g2 e5xd4 c3xd4 d6-d5 f2-f4 Ba7xd4 f4xg5 Qf6-e5 Be3xd4 Qe5xd4 Kg2-h3 Qd4-h8 f5-f6 Ne7-g6 Kh3-g2
17/23 00:00 4,253k 7,705k -3.53 Nf4-e6 d4xe5 d6xe5 Be3xa7 Ra8xa7 Ra1-d1 Ne6-g7 Rd1-d6 Bc8-e6 Rf1-d1 Ra7-a8 Bb3xe6 f7xe6 Kg1-g2 Ng7-e8
18/28 00:00 6,115k 7,711k -3.78 Nf4-e6 Ra1-d1 Ne6-g7 d4xe5 d6xe5 Rd1-d6 Bc8-e6 Be3xa7 Ra8xa7 Qf3-e3 Ra7-a8 Rf1-d1 Ng7-e8 Rd6-d2 Ra8-c8 Qe3-b6 Rc8-c7 Rd2-d8 Rc7-c8 Rd8xc8 Be6xc8 Qb6-b4
19/30 00:00 6,470k 7,757k -3.63 Nf4-e6 d4xe5 d6xe5 Be3xa7 Ra8xa7 Ra1-d1 Ne6-g7 Rd1-d6 Bc8-e6 Qf3-e3 Ra7-a8 Rf1-d1 Ng7-e8 Rd6-d2 Ra8-c8 Qe3-b6 Rc8-b8 Qb6-a7 Ne8-g7 Rd2-d6 h6-h5 Nf5xg7 Kg8xg7 g4xh5
20/28 00:00 6,686k 7,775k -3.68 Nf4-e6 d4xe5 d6xe5 Be3xa7 Ra8xa7 Ra1-d1 Ne6-g7 Rd1-d6 Bc8-e6 Qf3-e3 Ra7-a8 Rf1-d1 Ng7-e8 Rd6-d2 Ra8-b8 Qe3-a7 Ne8-g7 Rd2-d6 h6-h5 Nf5xg7 Kg8xg7 g4xh5 Kg7-h6 g3-g4 Qf6-f4 Bb3xe6
21/28 00:00 7,461k 7,845k -3.76 Nf4-e6 d4xe5 d6xe5 Be3xa7 Ra8xa7 Ra1-d1 Ne6-g7 Rd1-d6 Bc8-e6 Qf3-e3 Ra7-a8 Rf1-d1 Ng7-e8 Rd6-d2 Ra8-c8 Qe3-b6 Rc8-b8 Qb6-c5 b7-b6 Qc5-a3 Be6xf5 g4xf5 a6-a5 Rd2-d7 g5-g4 Rd1-d6 Ne8xd6 Rd7xd6
22/39- 00:03 26,559k 7,689k -5.15 Nf4-e6 Ra1-d1
22/45 00:04 38,004k 7,671k -5.13 Bc8xf5 g4xf5 Nf4-g6 Ra1-d1 Rf8-e8 Qf3-h5 Ng6-f4 g3xf4 e5xf4 Qh5-g6+ Qf6xg6 f5xg6 Kg8-g7 g6xf7 Re8xe4 Be3-c1 d6-d5 Bb3-c2 Re4-e7 Rf1-e1 Kg7xf7 Bc2-g6+ Kf7-f6 Re1xe7 Kf6xe7 b2-b3 Ke7-f6 Bg6-h5
23/36- 00:05 43,737k 7,629k -5.21 Bc8xf5 g4xf5
23/38- 00:06 46,936k 7,654k -5.29 Bc8xf5 g4xf5
23/38- 00:08 63,535k 7,667k -5.60 Bc8xf5 g4xf5
23/38 00:09 71,047k 7,656k -5.67 Bc8xf5 g4xf5 Nf4-g6 Ra1-d1 Rf8-e8 Qf3-h5 Ng6-f4 g3xf4 e5xf4 Qh5-g6+ Qf6xg6 f5xg6 Kg8-g7 g6xf7 Re8xe4 Be3-c1 d6-d5 Rf1-e1 Re4xe1+ Rd1xe1 Kg7xf7 Bb3-d1 Kf7-f6 Bd1-h5 a6-a5 Kg1-f1 c6-c5 d4xc5 Ba7xc5 Re1-d1 Ra8-d8 Kf1-g2 d5-d4
24/33+ 00:09 71,060k 7,656k -5.52 Bc8xf5
24/33+ 00:09 71,065k 7,656k -5.43 Bc8xf5
24/33 00:09 71,413k 7,657k -5.46 Bc8xf5 g4xf5 Nf4-g6 Ra1-d1 Rf8-e8 Qf3-h5 Ng6-f4 g3xf4 e5xf4 Qh5-g6+ Qf6xg6 f5xg6 Kg8-g7 g6xf7 Re8xe4 Be3-c1 d6-d5 Rd1-e1 Re4xe1 Rf1xe1 Kg7xf7 Bb3-d1 Kf7-f6 Bd1-h5 Ra8-g8 Kg1-g2 c6-c5 d4xc5
25/33 00:10 77,823k 7,671k -5.46 Bc8xf5 g4xf5 Nf4-g6 Ra1-d1 Rf8-e8 Qf3-h5 Ng6-f4 g3xf4 e5xf4 Qh5-g6+ Qf6xg6 f5xg6 Kg8-g7 g6xf7 Re8xe4 Be3-c1 d6-d5 Rd1-e1 Re4xe1 Rf1xe1 Kg7xf7 Bb3-d1 Kf7-f6 Bd1-h5 Ra8-g8 Kg1-g2 Rg8-d8 b2-b3 c6-c5 d4xc5 Ba7xc5 Bc1-b2 g5-g4