EN-Test 2022 - new testsuite

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

Results so far on AMD Ryzen 3900X, 20 Threads, 4 GB hash, all 3456men Syzygy, 30s:

Solista Classic, Result: 109 out of 120 = 90.8%. Solista-Classic.txt (ZIP)
Solista Attack v2 (default), Result: 107 out of 120 = 89.1%. Solista Attack v2.txt (ZIP)
Blue Marlin 15.3a, Result: 105 out of 120 = 87.5%. BlueMarlin 15.3a.txt (ZIP)
Swordfish 15.3a, Result: 104 out of 120 = 86.6%. Swordfish 15.3a.txt (ZIP)
Corchess 3 171022, Result: 100 out of 120 = 83.3%. Corchess 3 171022.txt (ZIP)
Dark Sister 1.9a, Result: 99 out of 120 = 82.5%. DarkSister 1.9a.txt (ZIP)
Shashchess 25 (default), Result: 98 out of 120 = 81.6%. Shashchess 25 .txt (ZIP)
Stockfish 161022, Result: 97 out of 120 = 80.8%. Stockfish.txt (ZIP)
Eman 8.40, Result: 96 out of 120 = 80.0%. Eman 8.40.txt (ZIP)
Stockfish_FF2 150521, Result: 88 out of 120 = 73.3%. Stockfish_FF2.txt (ZIP)

Text files can be downloaded from my homepage:
https://solistachess.jimdosite.com/testing/
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

ProteusSF-Piranha 220904, Result: 99 out of 120 = 82.5%. ProteusSF-Piranha.txt (ZIP)
Powerfritz 18, Result: 69 out of 120 = 57.5%. Powerfritz 18.txt (ZIP)

Textfiles on my homepage only.
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

Currently

Results AMD Ryzen 3900X, 20 Threads, 4 GB hash, all 3456men Syzygy, 30s:

Solista Classic, Result: 109 out of 120 = 90.8%. Solista-Classic.txt (ZIP)
Solista Attack v2 (default), Result: 107 out of 120 = 89.1%. Solista Attack v2.txt (ZIP)
Blue Marlin 15.3a, Result: 105 out of 120 = 87.5%. BlueMarlin 15.3a.txt (ZIP)
Swordfish 15.3a, Result: 104 out of 120 = 86.6%. Swordfish 15.3a.txt (ZIP)
Corchess 3 171022, Result: 100 out of 120 = 83.3%. Corchess 3 171022.txt (ZIP)
ProteusSF-Piranha 220904, Result: 99 out of 120 = 82.5%. ProteusSF-Piranha.txt (ZIP)
Dark Sister 1.9a, Result: 99 out of 120 = 82.5%. DarkSister 1.9a.txt (ZIP)
Shashchess 25 (default), Result: 98 out of 120 = 81.6%. Shashchess 25 .txt (ZIP)
Crystal 040722, Result: 98 out of 120 = 81.6%. Crystal 040722.txt (ZIP)
BrainLearn 19, Result: 97 out of 120 = 80.8%. BrainLearn 19.txt (ZIP)
Stockfish 161022, Result: 97 out of 120 = 80.8%. Stockfish.txt (ZIP)
Eman 8.40, Result: 96 out of 120 = 80.0%. Eman 8.40.txt (ZIP)
Cfish 250621, Result: 90 out of 120 = 75.0%. Cfish 250621.txt (ZIP)
Stockfish_FF2 150521, Result: 88 out of 120 = 73.3%. Stockfish_FF2.txt (ZIP)
Berserk 20220725, Result: 77 out of 120 = 64.1%. Berserk 20220725.txt (ZIP)
Koivisto 8.16, Result: 75 out of 120 = 62.5%. Koivisto 8.16.txt (ZIP)
Wasp 6.00, Result: 72 out of 120 = 60.0%. Wasp 6.00.txt (ZIP)
Powerfritz 18, Result: 69 out of 120 = 57.5%. Powerfritz 18.txt (ZIP)
Fire NN 1072022, Result: 68 out of 120 = 56.6%. Fire NN 1072022.txt (ZIP)

Textfiles on my Homepage
https://solistachess.jimdosite.com/testing/
perejaslav
Posts: 240
Joined: Sat Mar 18, 2006 4:01 am
Location: Cold

Re: EN-Test 2022 - new testsuite

Post by perejaslav »

EPD plz
perejaslav
Posts: 240
Joined: Sat Mar 18, 2006 4:01 am
Location: Cold

Re: EN-Test 2022 - new testsuite

Post by perejaslav »

I asked and I'll answer. :D

https://pixeldrain.com/u/LfL8y39y - EN-Test 2022.epd
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

perejaslav wrote: Sun Oct 23, 2022 11:32 am I asked and I'll answer. :D

https://pixeldrain.com/u/LfL8y39y - EN-Test 2022.epd
Nice, thank you!

Unfortunately I can't test some engines.

Lc0: I don't have a good GPU.
Dragon, Ethereal, Revenge, Chiron: I'm not willing to buy these engines. If anyone has these engines, I would be happy to see a test result.
perejaslav
Posts: 240
Joined: Sat Mar 18, 2006 4:01 am
Location: Cold

Re: EN-Test 2022 - new testsuite

Post by perejaslav »

Correct/Total:

Code: Select all

Stockfish 15    : 99/120
Stockfish 161022: 98/120
Stockfish 13    : 91/120
Stockfish 14.1  : 89/120
Stockfish 14    : 89/120
Stockfish 12    : 80/120
File name : EN-Test 2022.epd
Total test items : 120
Test for : best moves
Total engines : 6 (threads=1)
Timer : movetime: 60
Expand ply : 1
Elapsed : 45:56
Total tests : 720
Total corrects : 546 (75%)
Ave correct elapse : 4243 ms
Status : not completed
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

Thanks! Currently on my PC:

Results AMD Ryzen 3900X, 20 Threads, 4 GB hash, all 3456men Syzygy, 30s:

Solista Classic, Result: 109 out of 120 = 90.8%. Solista-Classic.txt (ZIP)
Solista Attack v2 (default), Result: 107 out of 120 = 89.1%. Solista Attack v2.txt (ZIP)
Blue Marlin 15.3a, Result: 105 out of 120 = 87.5%. BlueMarlin 15.3a.txt (ZIP)
Swordfish 15.3a, Result: 104 out of 120 = 86.6%. Swordfish 15.3a.txt (ZIP)
Corchess 3 171022, Result: 100 out of 120 = 83.3%. Corchess 3 171022.txt (ZIP)
ProteusSF-Piranha 220904, Result: 99 out of 120 = 82.5%. ProteusSF-Piranha.txt (ZIP)
Dark Sister 1.9a, Result: 99 out of 120 = 82.5%. DarkSister 1.9a.txt (ZIP)
Shashchess 25 (default), Result: 98 out of 120 = 81.6%. Shashchess 25 .txt (ZIP)
Crystal 040722, Result: 98 out of 120 = 81.6%. Crystal 040722.txt (ZIP)
BrainLearn 19, Result: 97 out of 120 = 80.8%. BrainLearn 19.txt (ZIP)
Stockfish 161022, Result: 97 out of 120 = 80.8%. Stockfish.txt (ZIP)
Eman 8.40, Result: 96 out of 120 = 80.0%. Eman 8.40.txt (ZIP)
Cfish 250621, Result: 90 out of 120 = 75.0%. Cfish 250621.txt (ZIP)
Stockfish_FF2 150521, Result: 88 out of 120 = 73.3%. Stockfish_FF2.txt (ZIP)
Berserk 20220725, Result: 77 out of 120 = 64.1%. Berserk 20220725.txt (ZIP)
RubiChess 20220813, Result: 75 out of 120 = 62.5%. RubiChess.txt (ZIP)
Koivisto 8.16, Result: 75 out of 120 = 62.5%. Koivisto 8.16.txt (ZIP)
Wasp 6.00, Result: 72 out of 120 = 60.0%. Wasp 6.00.txt (ZIP)
Powerfritz 18, Result: 69 out of 120 = 57.5%. Powerfritz 18.txt (ZIP)
Fire NN 1072022, Result: 68 out of 120 = 56.6%. Fire NN 1072022.txt (ZIP)
Rebel 15x2, Result: 66 out of 120 = 55.0%. Rebel 15x2.txt (ZIP)

https://solistachess.jimdosite.com/testing/

RubiChess calculating:
Image

Unfortunately, I cannot test SlowChess under Fritz GUI: :roll:
Image
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

New RubiChess now solves 78 positions and replaces August 2022 version:
RubiChess 2022 (1013), Result: 78 out of 120 = 65.0%.
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

Berserk 10 is weaker than the development version from July 2022, I keep the development version in the ranking.

Image

Berserk 20220725, Result: 77 out of 120
Berserk 10, Result: 73 out of 120