How to create a Glaurung 2.1 book?

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

User avatar
beachknight
Posts: 3533
Joined: Tue Jan 09, 2007 8:33 pm
Location: Antalya, Turkey

How to create a Glaurung 2.1 book?

Post by beachknight »

just completed summing a huge PGN file, 241 MB in size,
for creating books.

this process included:

-- reverse engineering of some my arena books
using Dann's and Jim's compiles of Yuri's abk2pgn code.

-- exclusion of doubles, that was really lengthy

and

-- adding few pgn files to this collection.

I easily produced a book, thinker.dat, 100 MB in size,
for Thinker 5.1F. I will repeat this later for Crafty.

But now is the question on the subject.

Best,
hi, merhaba, hallo HT
Dann Corbit
Posts: 12845
Joined: Wed Mar 08, 2006 8:57 pm
Location: Redmond, WA USA

Re: How to create a Glaurung 2.1 book?

Post by Dann Corbit »

beachknight wrote:just completed summing a huge PGN file, 241 MB in size,
for creating books.

this process included:

-- reverse engineering of some my arena books
using Dann's and Jim's compiles of Yuri's abk2pgn code.

-- exclusion of doubles, that was really lengthy

and

-- adding few pgn files to this collection.

I easily produced a book, thinker.dat, 100 MB in size,
for Thinker 5.1F. I will repeat this later for Crafty.

But now is the question on the subject.

Best,
Glaurung makes Polyglot format books. Here is material from a post by Volker Pittlik, describing how he made an excellent book for Glaurung:
[snip]
"Download the archive http://www.vpittlik.org/chess/vp1-pgn.tar.bz2 (~9 MB). You will find 3 SCID databases in it: 2400-2500, 2500-2600 and 2600+. There are games inside from players within these ELO ranges.

1. Create a new pgn file and export the pgn of all ~50000 games from the 2400-2500 database to it. Do the same with the ~32000 games from the 2500-2600 database but do it 2 times. Do the same with the 2600+ games but do it 4 times. At the end there should be ~179000 games in the new pgn file.

2. Download the CEGT and CCRL games and free them from dupes with SCID. Pgn-extract finds more than SCID, but I didn't use it. Because there have been new games added since I downloaded them you may shrink the CEGT and CCRL databases to ~154000 and ~57000 games. After that add these games to the pgn file you created in step 1.

3. (You may possibly skip this step but I didn't). Clean the new created pgn with pgn-extract. I used:
./pgn-extract -7 -N -C -V -llog -o<output.filename> <input.filename>. Do not use -d or -D option because the duplicate games are necessary in the book. Rename the outputfile to whatever you like and move it to the polyglot directory.

4. Create a simple polyglot book:

./polyglot make-book -min-score 50 -pgn <input.filename>.pgn -bin vp1.bin

IIRC min-score 50 is default anyway.

Now you have the book I used.

I guess there is some room for improvement. I think to give the newer games of high rated player more weight is a good idea.
Also I'm going to use more game collections next time.

If someone is interested in the ready made book only he can download it here (~10 MB). "
[snip]
User avatar
beachknight
Posts: 3533
Joined: Tue Jan 09, 2007 8:33 pm
Location: Antalya, Turkey

Re: How to create a Glaurung 2.1 book?

Post by beachknight »

Thank you, Dann.

Glaurung 2.1 tests are going to start in 2 days!

Best,
hi, merhaba, hallo HT