KLc wrote: ↑Tue Dec 14, 2021 12:39 pm
Cornfed wrote: ↑Mon Dec 13, 2021 8:16 pm
UI do however create my own 'Reference base' (despite the fact that 'big data' generally points you in the right direction) with players just over (depends) 2200 or over 2400 where I augment with annotated data from books/magazines and older good players 'pre-elo'.
Could you give some details how to do this? I’m having a hard time figuring out how to extract the really “good/quality” games. After 1970 one could take Elo. But I’m more interested in the “classical era”. There are many games in Mega from the 19th and 20th Century by club player etc which I would somehow like to filter out.
I plan on taking off a few days after the first of the year and this is one of the things I was going to work on. I had a hard drive crash earlier in the year and had no back up for several files....otherwise I would just be 'adding' high quality games to what I had already established.
Basically though, I start with an empty database into which I am going to bring only high quality games. I call it 'Quality Base'. Then I take multiple sources - Megabase, Corr Base, ebook pgn/cbv file, Chess Informants, Chess Publishing files, etc....and spend time "data washing" of the files into 'test bases'. I also like to filter out draws before move 20.I filter out games by rating - I think last time I only kept where 1 player was >2400 and his opponent at least 2200. For "Correspondence" games, I set the low bar at....think it was 2500 for both opponents, I then combine those into Quality Base, where I do one last bit of filtering so as to keep the 'better' game in case of duplicates.
All I am trying to do is create a really good Reference Database that I can use within Chessbase instead of just having something like 'Mega' as the default. I really could care less if two 1500's played a different move than I did when checking a game I played or working on opening files.