SPCC: Testrun of Obsidian 14.11 finished

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, chrisw, Rebel

User avatar
pohl4711
Posts: 2626
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

SPCC: Testrun of Obsidian 14.11 finished

Post by pohl4711 »

My UHO-Top15 Ratinglist is the world's first engine-ratinglist, using UHO-openings, and the world's first ratinglist offering additionally Gamepair-statistics.

Testrun of Obsidian 14.11 finished.

https://www.sp-cc.de

Also take a look at the EAS-Ratinglist, the world's first engine-ratinglist not measuring strength of engines but engines's style of play:
https://www.sp-cc.de/eas-ratinglist.htm

(Perhaps you have to clear your browsercache with <strg>+<shift>+<delete> to reload the graphics/diagrams on my website)
Paloma
Posts: 1175
Joined: Thu Dec 25, 2008 9:07 pm
Full name: Herbert L

Re: SPCC: Testrun of Obsidian 14.11 finished

Post by Paloma »

pohl4711 wrote: Tue Dec 10, 2024 2:24 pm My UHO-Top15 Ratinglist is the world's first engine-ratinglist, using UHO-openings, and the world's first ratinglist offering additionally Gamepair-statistics.

Testrun of Obsidian 14.11 finished.

https://www.sp-cc.de

Also take a look at the EAS-Ratinglist, the world's first engine-ratinglist not measuring strength of engines but engines's style of play:
https://www.sp-cc.de/eas-ratinglist.htm

(Perhaps you have to clear your browsercache with <strg>+<shift>+<delete> to reload the graphics/diagrams on my website)
Hello Stefan,

thanks for testing Obsidian 14.11

Did you test the pure exe or with net60.bin?
I read something here somewhere about net53.bin and net60.bin

Thanks in advantages
User avatar
pohl4711
Posts: 2626
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

Re: SPCC: Testrun of Obsidian 14.11 finished

Post by pohl4711 »

I tested the avx512 binary friom the official GitHub site of Obsidian.

This one, without any additional net/file:
https://github.com/gab8192/Obsidian/rel ... avx512.exe
Ciekce
Posts: 169
Joined: Sun Oct 30, 2022 5:26 pm
Full name: Conor Anstey

Re: SPCC: Testrun of Obsidian 14.11 finished

Post by Ciekce »

Paloma wrote: Thu Dec 12, 2024 12:00 am Did you test the pure exe or with net60.bin?
I read something here somewhere about net53.bin and net60.bin
hijacking the thread slightly to point out that unless they explicitly say so, engine devs do *not* expect you to mix and match networks, and the default net for an engine will be the strongest available

in this particular case obsidian's net60 is not compatible with the master branch, and if it were stronger than the master network, it would already *be* the master network.
Jouni
Posts: 3487
Joined: Wed Mar 08, 2006 8:15 pm
Full name: Jouni Uski

Re: SPCC: Testrun of Obsidian 14.11 finished

Post by Jouni »

"Obsidian 14.11 catched up Stockfish 15.1 - nice". Not quite. Here's face to face match with 60 + 0,6 HERT book:

Code: Select all

Score of Obsidian1411 vs stockfish15.1: 9 - 25 - 166 [0.460]
...      Obsidian1411 playing White: 9 - 5 - 86  [0.520] 100
...      Obsidian1411 playing Black: 0 - 20 - 80  [0.400] 100
...      White vs Black: 29 - 5 - 166  [0.560] 200
Elo difference: -27.9 +/- 19.6, LOS: 0.3 %, DrawRatio: 83.0 %
200 of 200 games finished.
Jouni
gabe_obsidian
Posts: 7
Joined: Fri Jan 26, 2024 10:21 pm
Full name: Gabriele Lombardo

Re: SPCC: Testrun of Obsidian 14.11 finished

Post by gabe_obsidian »

Jouni wrote: Thu Dec 12, 2024 2:20 pm "Obsidian 14.11 catched up Stockfish 15.1 - nice". Not quite. Here's face to face match with 60 + 0,6 HERT book:

Code: Select all

Score of Obsidian1411 vs stockfish15.1: 9 - 25 - 166 [0.460]
...      Obsidian1411 playing White: 9 - 5 - 86  [0.520] 100
...      Obsidian1411 playing Black: 0 - 20 - 80  [0.400] 100
...      White vs Black: 29 - 5 - 166  [0.560] 200
Elo difference: -27.9 +/- 19.6, LOS: 0.3 %, DrawRatio: 83.0 %
200 of 200 games finished.
i may not have fully caught stockfish 15.1 yet, but why use such a goofy book (83% draw rate) and not any UHO book?
With the same number of games you could've gotten a way more accurate measurement.
User avatar
pohl4711
Posts: 2626
Joined: Sat Sep 03, 2011 7:25 am
Location: Berlin, Germany
Full name: Stefan Pohl

Re: SPCC: Testrun of Obsidian 14.11 finished

Post by pohl4711 »

Jouni wrote: Thu Dec 12, 2024 2:20 pm "Obsidian 14.11 catched up Stockfish 15.1 - nice". Not quite. Here's face to face match with 60 + 0,6 HERT book:
The full quote was: "In my full Ratinglist, Obsidian 14.11 catched up Stockfish 15.1 - nice."
I never said, a head-to-head is on 50%. Because I did not test that.

Code: Select all

   1 Stockfish 17 240906       : 3843    3    3 30000    71.6%   3676   48.6%
   2 Stockfish 16.1 240224     : 3813    2    2 55000    72.5%   3635   47.1%
   3 Torch 3 popavx2           : 3808    3    3 39000    68.9%   3664   47.5%
   4 Torch 2 popavx2           : 3790    3    3 38000    71.1%   3628   47.3%
   5 Stockfish 16 230630       : 3787    3    3 40000    74.0%   3594   45.3%
   6 Obsidian 14.11 a512       : 3774    4    4 14000    59.3%   3707   49.1%
   7 Stockfish 15.1 221204     : 3773    4    4 14000    72.5%   3598   46.7%