jdart wrote:Can you post your cleaned data set back to Joshua so others can benefit? I'd offer to host it but while I have bandwidth, I don't have the disk.
Personally also I am not generally interested in short blitz and lightning games or games from players < about 1600 on FICS. So for my own use at least I'd cull those.
--Jon
Just now, I am removing dupes in my r db. This will take a while.
After this process, approx. 5 hours later, I'll post the links in four parts
for download as fics2006, fics2007, fics2008 and fics2009.
You are right, Joshua can process them further and add
additional games.
If anyone has a cleaned up version I'd be more than willing to host. Have tons of space free and very little bandwidth used minus occassional spikes like when I release files like this.
Btw I can add a lot of other info, like time between moves, etc. Draw by repetition. I kept all the raw data because I knew there was still a lot that could be taken out of it.
Mr. Taner do you mind if I put these on my server? Might take a little while with rapid share as I don't have an account but willing to host them.
Also if other people have ideas on what kind of data they'd like I can try and make specific releases.
I'm expecting to start collecting A LOT more data. Now that I'm going by history files instead of live games I'm getting more games in a couple hours than I would in a week or more the old method.
Checked and right now my raw data streams from fics over the past 3 years is 61gigs. So up for options on what people would like to have grabbed from it.