I am new here, I am a Chess FM, but never had anything to do with computerchess (well, i installed stockfish and let it run for 5 sec each move to analyse my games, was more than sufficient enough

However things changed now. A club collegue, who is into correspondence chess gave away his old hardware, because he bought new one, and i was quite surprised, when i started the old machine.
It is a Dual 2699v4 Xeon with 256GB ECC RAM and the machine never did anything else than running 24/7 stockfish since 2016.
Ok, i wiped the harddrive, installed Debian, installed ScidvsPC, compiled stockfish17and other engines and i just wanted to compare this old, but in his time very expensive hardware from 2016 with my "normal" desktop computer from 2023 (AMD 7600, 32 GB Ram), when it comes to computer chess.
Here the trouble started and i realized, things are much more complicated, than i thought and i actually have no understanding, how things work, and I came to the conclusion, that my collegue doesn't know much about benchmarking as well, so i am asking here.
Now my question and i am just looking at one specific chess engine (stockfish 17)
I thought, when comparing hardware, i just have to have a look on the Mn/s and thats it. Double threads and you get nearly double Mn/s, because stockfish scales well on parallelization. So far so good. On my new hardware, HT increases the performance quite a bit, on the 2699 it looks like it doesn't. Ok fine, whatever.
But then i came across the stockfish benchmark tool, because people claim, you can't compare hardware just by looking at the Mn/s but you need to look on the "time-to-depth" as well and the stockfish benchmark is a good tool for both. (Ok, of course, the best tool is to let my two machines play some matches, but lets put that aside)
so what i did is:
Code: Select all
stockfish bench 2048 11 25
Code: Select all
stockfish bench 2048 43 25
Code: Select all
stockfish bench 2048 86 25
11 .... threads
25 .... fixed depth (btw, what is that exactly, in my understanding, it is just a depth for some relevant variations, but what is relevant and what isnt? )
Result AMD: (roughly 10 Mn/s on the starting position)
Total time (ms) : 59644
Nodes searched : 1111302912
Nodes/second : 18632266
Result Xeon with 43 Threads (roughly 20 Mn/s on the starting position):
===========================
Total time (ms) : 117966
Nodes searched : 5223385708
Nodes/second : 44278738
Result Xeon with 86 Threads (still roughly 20 Mn/s on the starting position):
Total time (ms) : 191927
Nodes searched : 10987894123
Nodes/second : 57250382
So what can i assume now?
The Xeon takes much longer to reach depth, but analysis much more positions, and HT makes somehow sense in the benchmarking tool, but not when i let it run on the starting position, but it even takes longer to reach a certain depth. I don't understand it.
So what is the best setting for the Xeon and how much stronger is it than the AMD? (or is latter a dumb question to ask without letting them play against each other?)
I know, a long post, thx a lot for at least reading it