The vast majority of tests show FF2 losing head-to-head vs SF13. Stefan Pohl's is the only one I've seen where FF2 wins, by a small margin.
I ran some tests of my own.
First One
All 960 FRC positions, played reversed sides, so 1,920 games
TimeControl 120+1
256MB hash
1 thread
5-men Syzygy
Score of Stockfish 13 vs Fat Fritz 2 : 201 - 168 - 1551 [0.509]
Elo difference: 6.0 +/- 6.8, LOS: 95.7 %, DrawRatio: 80.8 %
Second One
Pohl's 5mvs_balanced pgn (2,871 positions, reversed sides so 5,742 games)
(not sure how balanced this is in terms of ECO mix)
TimeControl 90+1
256MB hash
1 thread
5-men Syzygy
All too close to call and within error margins. But the question is whether the FF2 NNUE offers anything special for analysis purposes. That is the important thing for most people, and I can't answer it. Engine Elo is a guide but not always definitive.
I've only got the free FF2 net, and it plays a more aggressive and imbalanced game than vanilla SF NNUE, especially with Black.
I find it interesting, but I also think 100 euro is a little too steep a price right now. Maybe I'll wait for their bi-annual sale and get it cheaper that way.
Modern Times wrote: ↑Wed Feb 24, 2021 3:35 am
The vast majority of tests show FF2 losing head-to-head vs SF13. Stefan Pohl's is the only one I've seen where FF2 wins, by a small margin.
I ran some tests of my own.
First One
All 960 FRC positions, played reversed sides, so 1,920 games
TimeControl 120+1
256MB hash
1 thread
5-men Syzygy
Score of Stockfish 13 vs Fat Fritz 2 : 201 - 168 - 1551 [0.509]
Elo difference: 6.0 +/- 6.8, LOS: 95.7 %, DrawRatio: 80.8 %
Second One
Pohl's 5mvs_balanced pgn (2,871 positions, reversed sides so 5,742 games)
(not sure how balanced this is in terms of ECO mix)
TimeControl 90+1
256MB hash
1 thread
5-men Syzygy
All too close to call and within error margins. But the question is whether the FF2 NNUE offers anything special for analysis purposes. That is the important thing for most people, and I can't answer it. Engine Elo is a guide but not always definitive.
AS recommended more representative openings, which would definitely shrink these differences and is perhaps more "fair". However, your first result shows an LOS of 95.7% and your second an LOS of 99.99%+, which is to say the error margins are negligible. Correct me if I'm wrong.
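For anyone who wants to check how the Elo margin and the LOS relate, here is a rough Python sketch that recomputes both from the win/loss/draw counts of the first match (201 - 168 - 1551). It assumes the usual formulas: logistic Elo from the score fraction, a two-sided 95% interval from the per-game score variance, and LOS from a normal approximation on decisive games only. I have not confirmed that the match tool used here does exactly this, and the names `elo_from_score` and `summarize` are just mine. Under these assumptions the numbers come out close to the ones posted. The second match's game counts aren't in the thread, so only the first can be checked this way.

```python
import math


def elo_from_score(score):
    """Logistic Elo difference implied by a score fraction strictly between 0 and 1."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def summarize(wins, losses, draws, z=1.96):
    """Recompute Elo, error margin, LOS and draw ratio from a W-L-D record.

    Assumed formulas (not taken from any particular tool's source):
    - margin: half-width of a two-sided interval, z standard errors on the score
    - LOS: normal approximation using decisive games only
    """
    games = wins + losses + draws
    score = (wins + 0.5 * draws) / games

    # Variance of the per-game result (1, 0.5 or 0) around the mean score.
    variance = (
        wins * (1.0 - score) ** 2
        + losses * (0.0 - score) ** 2
        + draws * (0.5 - score) ** 2
    ) / games
    std_error = math.sqrt(variance / games)

    low = elo_from_score(score - z * std_error)
    high = elo_from_score(score + z * std_error)

    los = 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * (wins + losses))))

    return {
        "score": score,
        "elo": elo_from_score(score),
        "margin": (high - low) / 2.0,
        "los": los,
        "draw_ratio": draws / games,
    }


# First match from the thread: Stockfish 13 vs Fat Fritz 2.
result = summarize(201, 168, 1551)
print(
    "Elo {elo:.1f} +/- {margin:.1f}, LOS {los:.1%}, draws {draw_ratio:.1%}".format(**result)
)
```

The thing to notice is that the +/- figure is a two-sided 95% interval while LOS is a one-sided probability. A result like 6.0 +/- 6.8 can therefore still include zero in its interval and show an LOS a bit above 95% at the same time, so a high LOS by itself does not make the error margin negligible.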
Last edited by gaard on Wed Feb 24, 2021 4:13 am, edited 1 time in total.
carldaman wrote: ↑Wed Feb 24, 2021 4:04 am
I've only got the free FF2 net, and it plays a more aggressive and imbalanced game than vanilla SF NNUE, especially with Black.
I find it interesting, but I also think 100 euro is a little too steep a price right now. Maybe I'll wait for their bi-annual sale and get it cheaper that way.
I'm not sure the "private" net wasn't meant to be made public, and there is actually a link to it on the second page of CCC: GT.