My UHO-Top15 Ratinglist is the world's first engine-ratinglist, using UHO-openings, and the world's first ratinglist offering additionally Gamepair-statistics.
Testrun of Obsidian 14.11 finished.
https://www.sp-cc.de
Also take a look at the EAS-Ratinglist, the world's first engine-ratinglist that measures not the strength of engines but the engines' style of play:
https://www.sp-cc.de/eas-ratinglist.htm
(You may have to clear your browser cache with <Ctrl>+<Shift>+<Delete> to reload the graphics/diagrams on my website)
SPCC: Testrun of Obsidian 14.11 finished
-
- Posts: 2626
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
-
- Posts: 1175
- Joined: Thu Dec 25, 2008 9:07 pm
- Full name: Herbert L
Re: SPCC: Testrun of Obsidian 14.11 finished
pohl4711 wrote: ↑Tue Dec 10, 2024 2:24 pm
My UHO-Top15 Ratinglist is the world's first engine-ratinglist, using UHO-openings, and the world's first ratinglist offering additionally Gamepair-statistics.
Testrun of Obsidian 14.11 finished.
https://www.sp-cc.de
Also take a look at the EAS-Ratinglist, the world's first engine-ratinglist that measures not the strength of engines but the engines' style of play:
https://www.sp-cc.de/eas-ratinglist.htm
(You may have to clear your browser cache with <Ctrl>+<Shift>+<Delete> to reload the graphics/diagrams on my website)
Hello Stefan,
thanks for testing Obsidian 14.11
Did you test the pure exe or with net60.bin?
I read something here somewhere about net53.bin and net60.bin
Thanks in advance
-
- Posts: 2626
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: SPCC: Testrun of Obsidian 14.11 finished
I tested the avx512 binary from the official GitHub site of Obsidian.
This one, without any additional net/file:
https://github.com/gab8192/Obsidian/rel ... avx512.exe
-
- Posts: 169
- Joined: Sun Oct 30, 2022 5:26 pm
- Full name: Conor Anstey
Re: SPCC: Testrun of Obsidian 14.11 finished
hijacking the thread slightly to point out that unless they explicitly say so, engine devs do *not* expect you to mix and match networks, and the default net for an engine will be the strongest available
in this particular case obsidian's net60 is not compatible with the master branch, and if it were stronger than the master network, it would already *be* the master network.
-
- Posts: 3487
- Joined: Wed Mar 08, 2006 8:15 pm
- Full name: Jouni Uski
Re: SPCC: Testrun of Obsidian 14.11 finished
"Obsidian 14.11 catched up Stockfish 15.1 - nice". Not quite. Here's a head-to-head match at 60+0.6 with the HERT book:
Code:
Score of Obsidian1411 vs stockfish15.1: 9 - 25 - 166 [0.460]
... Obsidian1411 playing White: 9 - 5 - 86 [0.520] 100
... Obsidian1411 playing Black: 0 - 20 - 80 [0.400] 100
... White vs Black: 29 - 5 - 166 [0.560] 200
Elo difference: -27.9 +/- 19.6, LOS: 0.3 %, DrawRatio: 83.0 %
200 of 200 games finished.
Jouni
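For readers who want to check how the "Elo difference" and "DrawRatio" lines follow from the 9 - 25 - 166 result, here is a minimal sketch. It assumes the usual logistic Elo formula and a normal-approximation 95% interval on the mean score; the post does not say which tool produced the output, so treating this as the method used is an assumption. The function names `elo_from_score` and `match_summary` are made up for illustration.

```python
import math


def elo_from_score(score: float) -> float:
    """Logistic Elo difference implied by an average score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def match_summary(wins: int, losses: int, draws: int, z: float = 1.959964):
    """Return (score, elo, margin, draw_ratio) for one engine's W/L/D record.

    Assumption: the error margin is half the width of the Elo interval
    obtained from a z-sigma interval around the mean score.
    """
    n = wins + losses + draws
    w, l, d = wins / n, losses / n, draws / n
    score = w + 0.5 * d
    # Per-game variance of the result (1, 0.5 or 0) around the mean score.
    var = w * (1.0 - score) ** 2 + d * (0.5 - score) ** 2 + l * (0.0 - score) ** 2
    half_width = z * math.sqrt(var / n)
    elo = elo_from_score(score)
    margin = (elo_from_score(score + half_width)
              - elo_from_score(score - half_width)) / 2.0
    return score, elo, margin, d


# W/L/D taken from the match output above (Obsidian's point of view).
score, elo, margin, draw_ratio = match_summary(wins=9, losses=25, draws=166)
print(f"score={score:.3f}  elo={elo:+.1f} +/- {margin:.1f}  draws={draw_ratio:.1%}")
```

Under these assumptions the sketch reproduces the figures in the output above: a score of 0.460, an Elo difference of -27.9 +/- 19.6 and a draw ratio of 83.0%.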
-
- Posts: 7
- Joined: Fri Jan 26, 2024 10:21 pm
- Full name: Gabriele Lombardo
Re: SPCC: Testrun of Obsidian 14.11 finished
Jouni wrote: ↑Thu Dec 12, 2024 2:20 pm
"Obsidian 14.11 catched up Stockfish 15.1 - nice". Not quite. Here's a head-to-head match at 60+0.6 with the HERT book:
Code:
Score of Obsidian1411 vs stockfish15.1: 9 - 25 - 166 [0.460]
... Obsidian1411 playing White: 9 - 5 - 86 [0.520] 100
... Obsidian1411 playing Black: 0 - 20 - 80 [0.400] 100
... White vs Black: 29 - 5 - 166 [0.560] 200
Elo difference: -27.9 +/- 19.6, LOS: 0.3 %, DrawRatio: 83.0 %
200 of 200 games finished.
i may not have fully caught stockfish 15.1 yet, but why use such a goofy book (83% draw rate) and not any UHO book?
With the same number of games you could've gotten a way more accurate measurement.
-
- Posts: 2626
- Joined: Sat Sep 03, 2011 7:25 am
- Location: Berlin, Germany
- Full name: Stefan Pohl
Re: SPCC: Testrun of Obsidian 14.11 finished
The full quote was: "In my full Ratinglist, Obsidian 14.11 catched up Stockfish 15.1 - nice."
I never said that a head-to-head match would come out at 50%, because I did not test that.
Code:
1 Stockfish 17 240906 : 3843 3 3 30000 71.6% 3676 48.6%
2 Stockfish 16.1 240224 : 3813 2 2 55000 72.5% 3635 47.1%
3 Torch 3 popavx2 : 3808 3 3 39000 68.9% 3664 47.5%
4 Torch 2 popavx2 : 3790 3 3 38000 71.1% 3628 47.3%
5 Stockfish 16 230630 : 3787 3 3 40000 74.0% 3594 45.3%
6 Obsidian 14.11 a512 : 3774 4 4 14000 59.3% 3707 49.1%
7 Stockfish 15.1 221204 : 3773 4 4 14000 72.5% 3598 46.7%