my private rating list: GPL Blitz

lucasart · Post by **lucasart** » Tue Dec 20, 2011 12:45 pm

Hello,

I've started a new private rating list. Testing conditions are the following:
* only open source and multi-platform programs: I'm not interested in proprietary and/or Windows-only programs. I try to compile the programs myself (with GCC 4.6) as much as possible, to make sure I'm running an efficient compile.
* time control: 1min + 1sec increment. For any given CPU time, it's certainly better to play ten times more games than to play games that are ten times longer. Any serious engine developer will agree with that.
* book: performance.bin limited to 10 moves (i.e. 20 half-moves)
* 64 bit versions only: who uses a 32 bit CPU these days anyway? Perhaps many people still use a 32 bit OS on 64 bit architectures. Anyway I don't see any good reason to double the testing work by testing both 32 and 64 bit versions of a given engine.
* interface: cutechess-cli. This is a command line interface, which has two benefits compared to GUIs
- it allows multi-threaded testing. For example if engine A and B don't have an SMP search, then I can run 2 games in parallel on my 2 CPU hardware. When A and/or B are SMP, then games must be played one by one, allowing SMP engines to use the 2 CPUs.
- it is very fast and doesn't cause programs to lose on time for such a quick time control.
* elo calculator: BayesElo: certainly better than EloStat for many reasons.
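
To put a rough number on the "more games rather than longer games" point: the statistical error on a measured score shrinks like 1/sqrt(N), so ten times more games cuts the error bar by about a factor of 3.2, whereas longer games leave it unchanged. The sketch below is only an illustration under stated assumptions: independent games, a hypothetical 20% draw rate, and the plain logistic Elo curve. BayesElo computes its intervals differently.

```python
import math

def score_std_error(n_games, score=0.5, draw_rate=0.2):
    """Standard error of the average score over n_games independent games."""
    win_rate = score - draw_rate / 2
    # Per-game variance of a result taking values 1, 0.5 or 0.
    per_game_var = win_rate + draw_rate / 4 - score ** 2
    return math.sqrt(per_game_var / n_games)

def elo_std_error(n_games, score=0.5, draw_rate=0.2):
    """Translate the score error into Elo via the slope of the logistic curve."""
    slope = 400 / (math.log(10) * score * (1 - score))  # d(Elo)/d(score)
    return slope * score_std_error(n_games, score, draw_rate)

for n in (100, 1000):
    print(f"{n:5d} games: about +/- {elo_std_error(n):.1f} Elo (one sigma)")
```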

Here's what I have so far, but of course this list will evolve

Rank Name              Elo    +    - games score oppo. draws 
   1 Fruit 2.1        2740   44   41   225   85%  2450   16% 
   2 Pawny 0.3.1      2519   32   32   300   52%  2506   19% 
   3 Sungorus 1.4     2402   24   24   500   49%  2415   19% 
   4 Jazz 5.01        2391   24   24   500   47%  2417   22% 
   5 DoubleCheck 2.3  2382   20   20   675   47%  2408   20% 
   6 Beowulf 2.4      2343   33   33   250   48%  2359   19% 
   7 GNUChess 5.08    2239   37   39   200   31%  2380   19%
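
As a sanity check on how the Elo, oppo. and score columns relate, the usual logistic formula can be applied to the rating difference. This is a simplified sketch, not BayesElo's actual model (which also fits a draw parameter), and it treats the average opponent rating as if it were a single opponent, so only approximate agreement should be expected.

```python
def expected_score(elo_diff):
    """Expected score for a player rated elo_diff above the opposition."""
    return 1.0 / (1.0 + 10.0 ** (-elo_diff / 400.0))

# (name, Elo, average opponent Elo, listed score) copied from the table above
rows = [
    ("Fruit 2.1", 2740, 2450, 0.85),
    ("Pawny 0.3.1", 2519, 2506, 0.52),
    ("GNUChess 5.08", 2239, 2380, 0.31),
]

for name, elo, oppo, listed in rows:
    predicted = expected_score(elo - oppo)
    print(f"{name:14s} predicted {predicted:.0%}, listed {listed:.0%}")
```

For these three rows the prediction lands within about one percentage point of the listed score.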

The next engine to be tested is Sloppy 0.2.2 (2 CPU). For SMP engines I decided to only test the 2 CPU version. This is because:
- there's no real information to be extracted by testing both 1 CPU and 2 CPU: all you'll see is that the 2 CPU version is (almost) twice as fast, with the corresponding elo gain.
- It is not a trivial task for engine developers to parallelize the search algorithm, so it's only fair to give them that advantage over non SMP programs.

Any suggestions as to which program to test next are welcome. As you can see I'm slowly on my way from the bottom of the list to the top. So I'll include stronger engines step by step, so the ratings make sense. If I just throw a StockFish in there, it will destroy all other participants, and its rating will not be precisely determined. I need to fill the gap first.
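
The reason a lopsided result gives an imprecise rating can be sketched numerically: near 100% the score-to-Elo curve becomes very steep, so the same uncertainty on the score maps to a much wider Elo interval. The code below is an illustration under simplifying assumptions (binomial error with draws ignored, plain logistic curve, a hypothetical 100-game match), not what BayesElo does internally.

```python
import math

def elo_from_score(score):
    """Elo difference implied by a score fraction strictly between 0 and 1."""
    return -400.0 * math.log10(1.0 / score - 1.0)

def elo_interval_width(score, n_games, z=1.96):
    """Width of an approximate 95% Elo interval around a measured score."""
    se = math.sqrt(score * (1.0 - score) / n_games)  # binomial, draws ignored
    lo = max(score - z * se, 1e-6)        # clamp to keep the log finite
    hi = min(score + z * se, 1.0 - 1e-6)
    return elo_from_score(hi) - elo_from_score(lo)

for score in (0.50, 0.85, 0.95):
    width = elo_interval_width(score, n_games=100)
    print(f"score {score:.0%} over 100 games: interval about {width:.0f} Elo wide")
```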

Lucas

PS: as a speed benchmark, my machine runs "stockfish bench" in a little less than 7 seconds (SF version 2.1.1).

Michel · Post by **Michel** » Tue Dec 20, 2011 12:53 pm

If you want to test GnuChess, you shouldn't test 5.08. It is mostly a 10-year-old code base and very buggy. Either version 6 (which is Fruit really) or my bug-fixed and enhanced (unofficial) version 5.07

http://alpha.uhasselt.be/Research/Algeb ... s-release/

would be better choices.

lucasart · Post by **lucasart** » Tue Dec 20, 2011 1:13 pm

Michel wrote: If you want to test GnuChess, you shouldn't test 5.08. It is mostly a 10-year-old code base and very buggy. Either version 6 (which is Fruit really) or my bug-fixed and enhanced (unofficial) version 5.07

http://alpha.uhasselt.be/Research/Algeb ... s-release/

would be better choices.

Hello Michel,

Perhaps it would be good to clarify the GNU Chess tree for everyone (correct me if I'm wrong):
* GNU Chess 5.07 is old: last modified in 2003. It is relatively weak and buggy, but still a strong opponent for human players.
* GNU Chess 5.08 is an improved 5.07. It's not much stronger (cf. my list), but the last modification was in 2011. I tested it because I needed a weak engine, to make sure that Beowulf, DoubleCheck, Jazz and Sungorus had significant ratings. So I chose it on purpose.
* GNU Chess 6.0.1 is Fruit with a GNU branding on it. I looked at the source code and saw no significant changes since Fruit 2.1, so I prefer testing Fruit 2.1 directly. I compiled GNU Chess 6.0.1 and noticed that neither the xboard nor the UCI interface works. So it's Fruit with bugs added... I'll test GNU Chess 6.x when the UCI and/or Xboard interfacing code is fixed.
* "unofficial" GNU Chess 5.0.7 is your program. Is it correct to say it's a fork off the official GNU Chess 5.0.7 ?

I'll be very happy to test your unofficial GNU Chess 5.0.7. I had a look at your link, and I'm not sure which file to download. Which is the latest stable source *.tar.gz that I can compile ? Let me know, and I'll test it after Sloppy 0.2.2.

PS: If I may, wouldn't it make sense to call your fork something other than GNU Chess 5.0.7, if only for the sake of clarity. Perhaps you should increase the version number or change the name, no ?

Thank you!

Michel · Post by **Michel** » Tue Dec 20, 2011 1:38 pm

lucasart wrote:
Michel wrote: If you want to test GnuChess, you shouldn't test 5.08. It is mostly a 10-year-old code base and very buggy. Either version 6 (which is Fruit really) or my bug-fixed and enhanced (unofficial) version 5.07

http://alpha.uhasselt.be/Research/Algeb ... s-release/

would be better choices.
Hello Michel,

Perhaps it would be good to clarify the GNU Chess tree for everyone (correct me if I'm wrong):
* GNU Chess 5.07 is old: last modified in 2003. It is relatively weak and buggy, but still a strong opponent for human players.
* GNU Chess 5.08 is an improved 5.07. It's not much stronger (cf. my list), but the last modification was in 2011. I tested it because I needed a weak engine, to make sure that Beowulf, DoubleCheck, Jazz and Sungorus had significant ratings. So I chose it on purpose.

Yes 5.08 contains a few minor usability fixes (for example my implementation
of the memory command).

* GNU Chess 6.0.1 is Fruit with a GNU branding on it. I looked at the source code and saw no significant changes since Fruit 2.1, so I prefer testing Fruit 2.1 directly. I compiled GNU Chess 6.0.1 and noticed that neither the xboard nor the UCI interface works. So it's Fruit with bugs added... I'll test GNU Chess 6.x when the UCI and/or Xboard interfacing code is fixed.

Well if the interface code is the only obstruction.... Some people succeed in
running GNU Chess 6. It has been playing on FICS for a while.

* "unofficial" GNU Chess 5.0.7 is your program. Is it correct to say it's a fork off the official GNU Chess 5.0.7 ?

Yes by now I guess it is truly a fork (the terrible f* word). You are probably correct that
I should rename it. On FICS I have been using the handle GnuCheese. But
I have always been a fan of GNU Chess so I like to keep that name as well....

I'll be very happy to test your unofficial GNU Chess 5.0.7.

Thanks. The latest version is 5.07.173b. This should be a link to the correct tar ball.

http://alpha.uhasselt.be/Research/Algeb ... 73b.tar.gz

Had a look at your link, and I'm not sure which file to download. Which is the latest stable source *.tar.gz that I can compile ? Let me know, and I'll test it after Sloppy 0.2.2.

PS: If I may, wouldn't it make sense to call your fork something other than GNU Chess 5.0.7, if only for the sake of clarity. Perhaps you should increase the version number or change the name, no ?

Thank you!

lucasart · Post by **lucasart** » Tue Dec 20, 2011 2:03 pm

Michel wrote: Yes by now I guess it is truly a fork (the terrible f* word). You are probably correct that
I should rename it. On FICS I have been using the handle GnuCheese. But
I have always been a fan of GNU Chess so I like to keep that name as well....

OK I downloaded the tarball and compiled it as follows:

./configure
make

Now before I test it, can I at least rename it to GNU Chess 5.1 ? If you like the name GNU Chess, you may at least agree to change the version number, no ?
I ran a test game with cutechess-cli, and it works fine in UCI mode. Good stuff
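
For reference, a cutechess-cli match under the conditions from the first post could be assembled roughly as below. This is a sketch, not the exact command used for the list: the engine paths, the `games.pgn` output name and the helper function are hypothetical, both engines are assumed to speak UCI, and the option names (`-engine`, `-each`, `tc`, `book`, `bookdepth`, `-games`, `-concurrency`, `-pgnout`) should be checked against `cutechess-cli -help` for the installed version.

```python
def cutechess_command(engine_a, engine_b, games, smp=False):
    """Build a cutechess-cli argument list for a two-engine match.

    Non-SMP engines can play 2 games in parallel on a 2 CPU machine;
    when an SMP engine is involved, games are played one at a time.
    """
    concurrency = 1 if smp else 2
    return [
        "cutechess-cli",
        "-engine", f"cmd={engine_a}", "proto=uci",
        "-engine", f"cmd={engine_b}", "proto=uci",
        # 60 seconds + 1 second increment, book limited to 10 moves
        "-each", "tc=60+1", "book=performance.bin", "bookdepth=10",
        "-games", str(games),
        "-concurrency", str(concurrency),
        "-pgnout", "games.pgn",
    ]

# Hypothetical engine paths; pass the list to subprocess.run() to start the match.
cmd = cutechess_command("./engine_a", "./engine_b", games=50)
print(" ".join(cmd))
```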

Michel · Post by **Michel** » Tue Dec 20, 2011 4:04 pm

Now before I test it, can I at least rename it to GNU Chess 5.1 ?

Of course you can. I have refrained from doing this since I don't like bumping
version numbers without the author's consent. Anyway 5.1 < 6 .

Adam Hair · Post by **Adam Hair** » Tue Dec 20, 2011 4:07 pm

Definitely use Michel's branch of GNU Chess. Also, Gaviota would be a good choice.

I will come up with several other engines that fit your criteria a little later.

Edit: Of course, Gaviota would be a good choice if it were open source

jdart · Post by **jdart** » Tue Dec 20, 2011 4:42 pm

There are lots of open source engines. Note though some are open source but not GPL licensed (for example, Crafty, Arasan, Scorpio).

Among the GPL ones I would list: Stockfish, Bison, Umko, Protector.

--Jon

lucasart · Post by **lucasart** » Tue Dec 20, 2011 5:48 pm

jdart wrote:There are lots of open source engines. Note though some are open source but not GPL licensed (for example, Crafty, Arasan, Scorpio).

Among the GPL ones I would list: Stockfish, Bison, Umko, Protector.

--Jon

I'm happy to test open source programs, so long as I can get a Linux compile. The difference between "open source" and GPL (that we should call "free software" to be precise) is essentially political. I'm more on the GPL side, but open source is certainly better than proprietary software.

So far Sloppy 0.2.2 (2 CPU) is scoring 44% after 90 games against Fruit 2.1. Another 10 games to go, then 50 against Pawny 0.3.1 and 50 against Sungorus 1.4. After that I'll test GNU Chess 5.1 (aka unofficial 5.0.7). These engines are especially good for my list, because they will most likely insert themselves into the 200+ elo gap between Pawny 0.3.1 and Fruit 2.1, making the elo estimation more robust and precise.

After that I guess Bison, Umko, Crafty, Protector and StockFish.

lucasart · Post by **lucasart** » Tue Dec 20, 2011 5:53 pm

Michel wrote:
Now before I test it, can I at least rename it to GNU Chess 5.1 ?
Of course you can. I have refrained from doing this since I don't like bumping
version numbers without the author's consent. Anyway 5.1 < 6 .

Well, you can always ask Stuart Cracraft, but I don't see any reason why he'd be against it. The main reasons for changing the version number are:
- if you google for GNU Chess 5.0.7 you'll end up on the official GNU website, and end up with the 10-year-old program. It's a bit confusing admittedly.
- You certainly made some non trivial modifications to the original code base, as your version seems a lot stronger than the original 5.0.7, so it's only logical to increase the number.
- 5.0.7 to 5.1 is hardly a "bump" given the amount of code and elo improvements.

Anyway, that's all up to you!
