Jonathan003 wrote: ↑Wed Feb 02, 2022 5:12 pm
Can I specify somehow that only one player have to have a elo rating of minimum 2000 elo?
So games where one player has for example 1400 elo and the other player has 2100 elo would also be included.
OCGDB doesn’t have that feature yet. We may implement it in the coming time.
BTW, if you know SQL, it is just a doable task for yourself. A database of OCGDB is just a typical SQL/SQLite database. You could study it with any SQLite browser/studio/GUI. You may also select and create new sub-databases with your own conditions, using some SQL statements.
Using a SQL browser to query with SQL statements
Jonathan003 wrote: ↑Wed Feb 02, 2022 5:12 pm
And can I know for sure that it will be all human games without any engine games included?
I think bots may have the prefix BOT. Of course, you can't be sure for cheating accounts that use engines under human names.
Jonathan003 wrote: ↑Wed Feb 02, 2022 5:12 pm
Do you have some idea if I do a search with these setting, how long it is gone take to download the games? And how many games will it be approximately?
Not sure about this question. Do you mean downloading PGN files from Lichess? It depends much on Internet speed. The file of 94 million games is about 25 GB in zip format, perhaps it took me about a night (or a day - I can't remember exactly) to download. I knew the number (94 games) just after converting the file into an SQL database.
Now you can guess the number of games based on file sizes
More information:
As I have mentioned, OCGDB databases could be as small and as fast as SCID’s similar ones, plus it could work with much larger numbers of games.
For so-so/popular sizes of database such as MillionBase 3.45 (3.45 million games) OCGDB could do all tasks within 2 minutes in my 5-years-old 4-cores computer, including creating/converting from a PGN file into a new database, performing approximately-position-searching (scan all records), checking/removing duplicates.
The 94 million games of Lichess require 1 hour 30 minutes to convert from a PGN file into an SQLite database. For tasks of scanning all records (such as for approximately-position-searching), it needs 1-2 hours.
Of course, many other tasks such as finding a game based on ID could run instantly. If your computer is faster and has more cores, it could run faster too.
IMHO, converting from a Lichess PGN file into an SQLite database is a win already:
- You could save a lot of space: the PGN file of Lichess takes over 200 GB (unzip form) on my hard disk. Working with that file is a nightmare of being slow, inconvenient. In contrast, after converting (with removing all comments, site information which is not really useful for users - read more here) the SQLite one takes only 13 GB, so good/reasonable for storing, much faster and more convenient for any action
- You could work with it without using our program or any specific chess program. Just use typical SQL GUIs. You have never worried if we/some developers stopped supporting their chess database programs
- SQLite is built on a SQL engine that is very strong on databases and querying. Stand on the shoulder of a giant typically has many benefits
