EN-Test 2022 - new testsuite

Moderator: Ras

ernst
Posts: 354
Joined: Thu Mar 09, 2006 6:00 pm

Re: EN-Test 2022 - new testsuite

Post by ernst »

Thank you! I tried to compile it myself, but I am missing curl. Where can I get the correct version?
This post may either be cause or result of misunderstandings.
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

051222: Vulkan 031222 (Stockfish patches up to Stockfish 15.1 are integrated. In my EN-Test 2022, Vulkan 021222 is slightly better).

Filehorst.de:
https://filehorst.de/d/efivrjaJ
Pixeldrain:
https://pixeldrain.com/u/XtRMHvKP

and on my homepage:
https://solistachess.jimdosite.com/solista-news/
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

07 Dec 2022: I removed a private engine from my top 10 list. Reason: I don't want people to ask me about this engine, because there is absolutely no chance of getting it.
Paloma
Posts: 1208
Joined: Thu Dec 25, 2008 9:07 pm
Full name: Herbert L

Re: EN-Test 2022 - new testsuite

Post by Paloma »

Eduard wrote: Fri Nov 25, 2022 6:20 am Info:
I am working on a new test suite that I will present in January 2023. I collect good test positions from previous test sets. But I have already implemented some positions that have never been published before! Currently there are 46 positions altogether. I have a large database with many interesting computer games. From this I have selected the best variants, about 2500 games. I've only checked 560 games out of it currently. It's fun to look for new positions.
What is the current status of your new test suite?
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

Unfortunately, I didn't get far. I may be spending too much time on my engine. But the most important things to me are my books:
https://solistachess.jimdosite.com/books/

I edit the books in the formats CTG, BIX, BIN and now also in the Shredder Classic format. I edit the books manually and spend a lot of time on it. I am particularly proud of my Lc0 book. I've been editing this book for many months with a friend who plays with an RTX 3060m. In total, more than 6000 games were played with Lc0, and my friends hardly lose any games on the server with Lc0 anymore. :-)

Since I'm constantly getting new games from my friends, my basic database with the most important variants is constantly growing. Unfortunately, I have checked only 560 of the 2800 most interesting games so far. I can't do more at the moment. :cry:

My new test suite should contain easy and hard positions. In addition, it is not just about whether a position is solved or not. It's not just black and white for me.

Here is the current raw material for my new test suite, currently 80 positions. However, some positions may be deleted. Altogether there should be well over 100 positions.

EN-Short-Test, downloads (PGN, ZIP):

https://pixeldrain.com/u/GotDDiYr
Eduard
Posts: 1439
Joined: Sat Oct 27, 2018 12:58 am
Location: Germany
Full name: N.N.

Re: EN-Test 2022 - new testsuite

Post by Eduard »

EN-Test 2022 - Started on 06 Nov 2022 - final standing 08 Mar 2023

I have started testing with a thinking time of 60 seconds per position. Only the top ten engines are included in this list, and only one engine per author. Computer: AMD Ryzen 3900X, 20 threads, 4 GB hash, all 3-4-5-6-men Syzygy tablebases.
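For anyone who wants to reproduce these settings outside a GUI, here is a minimal sketch of the UCI commands that would correspond to them. It assumes the engine exposes the Stockfish-style option names Threads, Hash and SyzygyPath (not every engine does), and the function name, the tablebase path and the FEN in the usage line are placeholders, not part of the test.

```python
def uci_setup_commands(fen, threads=20, hash_mb=4096,
                       syzygy_path="/path/to/syzygy", movetime_ms=60_000):
    """Build the UCI command sequence for one test position.

    Assumption: the engine uses the Stockfish-style option names
    Threads, Hash and SyzygyPath. syzygy_path is a placeholder.
    """
    return [
        "uci",
        f"setoption name Threads value {threads}",
        f"setoption name Hash value {hash_mb}",      # 4 GB expressed in MB
        f"setoption name SyzygyPath value {syzygy_path}",
        "isready",
        f"position fen {fen}",
        f"go movetime {movetime_ms}",                # 60 seconds per position
    ]


# Usage with the standard starting position as a stand-in for a test position.
START_FEN = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1"
for command in uci_setup_commands(START_FEN):
    print(command)
```

A GUI normally sends these commands for you; the sketch only shows which settings matter for comparing results.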

TOP 10 LIST:

1) Leptir Analyzer, Result: 116 out of 120 = 96.6%. Leptir Analyzer.txt (ZIP)
2) Crystal 5 KWK, Result: 113 out of 120 = 94.1%. Crystal 5 KWK.txt (ZIP)
3) Blue Marlin 15.6, Result: 111 out of 120 = 92.5%. Blue Marlin 15.6.txt (ZIP)
4) Corchess 3 010323, Result: 110 out of 120 = 91.6%. Corchess 3 010323.txt (ZIP)
5-6) Shashchess 29.1, Result: 107 out of 120 = 89.1%. Shashchess 29.1.txt (ZIP)
5-6) Stockfish dev 080323, Result: 107 out of 120 = 89.1%. Stockfish dev 080323.txt (ZIP)
7-9) Dark Sister 1.9a, Result: 106 out of 120 = 88.3%. Dark Sister 1.9a.txt (ZIP)
7-9) Kayra 1.7, Result: 106 out of 120 = 88.3%. Kayra 1.7.txt (ZIP)
7-9) ProteusSF-Piranha 220904, Result: 106 out of 120 = 88.3%. ProteusSF-Piranha 220904.txt (ZIP)
10) Eman 8.70, Result: 104 out of 120 = 86.6%. Eman 8.70.txt (ZIP)
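
The percentages and shared ranks in this list can be recomputed from the solved counts. The following is a minimal sketch, not the tool used for the test: the function names are made up for illustration, and it assumes the percentages are truncated (not rounded) to one decimal place, which is what the figures in the list are consistent with (116 out of 120 is 96.66...%, shown as 96.6%).

```python
from itertools import groupby

TOTAL_POSITIONS = 120

# Solved counts taken from the top 10 list.
RESULTS = {
    "Leptir Analyzer": 116,
    "Crystal 5 KWK": 113,
    "Blue Marlin 15.6": 111,
    "Corchess 3 010323": 110,
    "Shashchess 29.1": 107,
    "Stockfish dev 080323": 107,
    "Dark Sister 1.9a": 106,
    "Kayra 1.7": 106,
    "ProteusSF-Piranha 220904": 106,
    "Eman 8.70": 104,
}


def truncated_percent(solved, total=TOTAL_POSITIONS):
    """Percentage truncated (not rounded) to one decimal place."""
    return (solved * 1000 // total) / 10


def ranked(results):
    """Return (label, name, solved) rows; ties share a label such as '5-6'."""
    ordered = sorted(results.items(), key=lambda item: (-item[1], item[0]))
    rows = []
    position = 1
    for solved, group in groupby(ordered, key=lambda item: item[1]):
        names = [name for name, _ in group]
        last = position + len(names) - 1
        label = str(position) if last == position else f"{position}-{last}"
        rows.extend((label, name, solved) for name in names)
        position = last + 1
    return rows


for label, name, solved in ranked(RESULTS):
    print(f"{label}) {name}, Result: {solved} out of {TOTAL_POSITIONS}"
          f" = {truncated_percent(solved)}%")
```

Engines with the same number of solved positions are ordered alphabetically within their shared rank, as in the list.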

All text files can be downloaded from my homepage:
https://solistachess.jimdosite.com/testing/

This is the final result. Leptir Analyzer is the best engine in this test. This engine benefits from a more precise search and a tactically strong network. Eman 8.70 is last. This engine is probably too selective for such tests.

Soon I will present my new EN Engine Test 2023 (ENET-2023). That's why I finished the EN-Test 2022 today. The new test will include many positions from the EN-Test 2022, because I like them, but there will also be some new positions. A few easy positions will be removed. Unfortunately, some positions of the 2022 test also have a secondary solution that is not much worse than the best move. These are positions 11 (secondary solution Nf3), 17 (secondary solution Rb7) and 49 (secondary solution f3). These positions will also be removed. Also, I will only test engines that are not older than 6 months.