A dangerous combination

Discussion of computer chess matches and engine tournaments.

Moderators: hgm, chrisw, Rebel

User avatar
pedrox
Posts: 1056
Joined: Fri Mar 10, 2006 6:07 am
Location: Basque Country (Spain)

Re: Test is finished (better than previous version or is it?

Post by pedrox »

Hi Tony and Michael, I have proven to DanaSah with Naum, about middlegame Naum thinks 3 ply deeper, wins easy by tactics and the evaluation differs enough.

Code: Select all

   Motor           Puntaje                              Na
1: Naum 2.0        52,0/60  ······························ 
2: DanaSah v.2.85k 4,0/30   00=0=00000000000=000=000010100 
2: DanaSah v.3.03  4,0/30   000000=00000=0==0000=10=000000 

60 partidas jugadas / Torneo finalizado
Nombre del torneo: Torneo 159
Lugar/ País: AMD64, Spain
Nivel: Blitz 1/1
Hardware: Dual AMD Athlon(tm) 64 X2 Dual Core Processor 4600+ 2405 MHz con 1.022 MB de Memoria
Sistema operativo: Microsoft Windows XP Home Edition Service Pack 2 (Build 2600)
Archivo PGN: C:\Archivos de programa\Arena\Tournaments\Torneo 159.pgn
Página Web: 
Correo electrónico: 
Tony Thomas

Re: Test is finished (better than previous version or is it?

Post by Tony Thomas »

Michael Sherwin wrote:I would have thought that avoiding extreams would give more solid results. A +/- 1 point is worth 44 ELO when at 90% and only worth 7 ELO at 50%. However, it could be interesting.
Since Romi has climbed to the 4th place, it is no longer in the middle. I am going to add two or three stronger engines to lower Romi back to around 6-8th place and also I am going to take out few weaker engines. One that I am going to take out is Zeus, after 570 games it has the same rating as a 27Kb engine namely Nanosachy.
Michael Sherwin
Posts: 3196
Joined: Fri May 26, 2006 3:00 am
Location: WY, USA
Full name: Michael Sherwin

Re: Test is finished (better than previous version or is it?

Post by Michael Sherwin »

pedrox wrote:Hi Tony and Michael, I have proven to DanaSah with Naum, about middlegame Naum thinks 3 ply deeper, wins easy by tactics and the evaluation differs enough.

Code: Select all

   Motor           Puntaje                              Na
1: Naum 2.0        52,0/60  ······························ 
2: DanaSah v.2.85k 4,0/30   00=0=00000000000=000=000010100 
2: DanaSah v.3.03  4,0/30   000000=00000=0==0000=10=000000 

60 partidas jugadas / Torneo finalizado
Nombre del torneo: Torneo 159
Lugar/ País: AMD64, Spain
Nivel: Blitz 1/1
Hardware: Dual AMD Athlon(tm) 64 X2 Dual Core Processor 4600+ 2405 MHz con 1.022 MB de Memoria
Sistema operativo: Microsoft Windows XP Home Edition Service Pack 2 (Build 2600)
Archivo PGN: C:\Archivos de programa\Arena\Tournaments\Torneo 159.pgn
Página Web: 
Correo electrónico: 
Hi Pedro,

It would appear that Naum has some magic that we will have to discover!

Mike
If you are on a sidewalk and the covid goes beep beep
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
Michael Sherwin
Posts: 3196
Joined: Fri May 26, 2006 3:00 am
Location: WY, USA
Full name: Michael Sherwin

Re: Test is finished (better than previous version or is it?

Post by Michael Sherwin »

Tony Thomas wrote:
Michael Sherwin wrote:I would have thought that avoiding extreams would give more solid results. A +/- 1 point is worth 44 ELO when at 90% and only worth 7 ELO at 50%. However, it could be interesting.
Since Romi has climbed to the 4th place, it is no longer in the middle. I am going to add two or three stronger engines to lower Romi back to around 6-8th place and also I am going to take out few weaker engines. One that I am going to take out is Zeus, after 570 games it has the same rating as a 27Kb engine namely Nanosachy.
Well Romi was sulking around today and acting all bummed out. When I got her to tell me what was wrong, she said; "Tony must not like me anymore, because he made me look really bad." :cry:

Then she ask me; "What are you going to do about it?"

Thanks a lot Tony! :P
If you are on a sidewalk and the covid goes beep beep
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
Tony Thomas

Re: Test is finished (better than previous version or is it?

Post by Tony Thomas »

Michael Sherwin wrote:
Tony Thomas wrote:
Michael Sherwin wrote:I would have thought that avoiding extreams would give more solid results. A +/- 1 point is worth 44 ELO when at 90% and only worth 7 ELO at 50%. However, it could be interesting.
Since Romi has climbed to the 4th place, it is no longer in the middle. I am going to add two or three stronger engines to lower Romi back to around 6-8th place and also I am going to take out few weaker engines. One that I am going to take out is Zeus, after 570 games it has the same rating as a 27Kb engine namely Nanosachy.
Well Romi was sulking around today and acting all bummed out. When I got her to tell me what was wrong, she said; "Tony must not like me anymore, because he made me look really bad." :cry:

Then she ask me; "What are you going to do about it?"

Thanks a lot Tony! :P
Another reason why I decided to add some stronger opposition. I didnt want the pride to get in to your head, and in turn you know that you would slack. Naum 2.0 is probably only 100 to 120 points stronger than Zappa according to my previous rating list. Here is the list with few games of Naum added...Note that Lime is going to stay as a special opponent to boost your ego. :lol: I have not yet made up my mind on the other opponent that I am going to take out, its going to be either Djinn or Delphil and I am going to update the list with a newer version of Dana, and another engine as time permits. :cry:
/12/2007 2:09:18 AM :

Code: Select all

    Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Naum 2.0                       : 2665   73  71    90    72.2 %   2499   17.8 %
  2 Zappa 1.1                      : 2537   27  27   600    67.8 %   2408   16.5 %
  3 Francesca MAD 0.13             : 2497   27  26   570    64.1 %   2396   19.1 %
  4 Arasan 9.5                     : 2492   26  26   600    61.6 %   2410   17.5 %
  5 RomiChessDK5                   : 2465   34  34   360    60.7 %   2390   15.8 %
  6 RomiChessDK6                   : 2462   32  31   390    57.3 %   2411   18.7 %
  7 RomiChessDK3                   : 2438   33  33   360    57.4 %   2386   18.1 %
  8 RomiChessDK4                   : 2432   30  30   420    56.7 %   2385   21.4 %
  9 Danasah 2.85                   : 2431   25  25   571    54.5 %   2400   22.4 %
 10 RomiChessP3j                   : 2416   40  40   248    52.6 %   2398   14.9 %
 11 RomiChessDK2                   : 2411   33  33   360    53.6 %   2386   16.1 %
 12 Phalanx Reborn                 : 2410   26  26   570    51.3 %   2401   15.3 %
 13 Delphil 1.6c                   : 2397   26  26   547    49.5 %   2401   21.0 %
 14 Horizon_4_3_173                : 2389   31  31   420    48.0 %   2403   14.5 %
 15 Djinn 0.925x                   : 2375   26  26   570    46.0 %   2403   15.8 %
 16 Horizon_4_3_175                : 2365   33  33   360    46.0 %   2393   18.1 %
 17 Horizon 4.3                    : 2350   30  30   420    43.1 %   2398   18.6 %
 18 Zeus 1.28                      : 2349   27  27   540    42.2 %   2403   19.3 %
 19 NanoSzachy 2.7                 : 2349   27  27   540    42.2 %   2403   15.9 %
 20 GreKo 5.2                      : 2267   28  28   570    30.7 %   2408   16.1 %
 21 Lime 6.3                       : 2159   33  34   570    18.7 %   2414   10.0 
%
Tony Thomas

Re: Test is finished (better than previous version or is it?

Post by Tony Thomas »

pedrox wrote:Hi Tony and Michael, I have proven to DanaSah with Naum, about middlegame Naum thinks 3 ply deeper, wins easy by tactics and the evaluation differs enough.

Code: Select all

   Motor           Puntaje                              Na
1: Naum 2.0        52,0/60  ······························ 
2: DanaSah v.2.85k 4,0/30   00=0=00000000000=000=000010100 
2: DanaSah v.3.03  4,0/30   000000=00000=0==0000=10=000000 

60 partidas jugadas / Torneo finalizado
Nombre del torneo: Torneo 159
Lugar/ País: AMD64, Spain
Nivel: Blitz 1/1
Hardware: Dual AMD Athlon(tm) 64 X2 Dual Core Processor 4600+ 2405 MHz con 1.022 MB de Memoria
Sistema operativo: Microsoft Windows XP Home Edition Service Pack 2 (Build 2600)
Archivo PGN: C:\Archivos de programa\Arena\Tournaments\Torneo 159.pgn
Página Web: 
Correo electrónico: 
I havent played any games with the new version of Dana yet, but I will post my results against Naum 2.0 vs Dana 2.85k by Friday. As I mentioned in the previous post, I will eventually integrate current version of Dana in to the list as time permits.
User avatar
Graham Banks
Posts: 42944
Joined: Sun Feb 26, 2006 10:52 am
Location: Auckland, NZ

Re: Test is finished (better than previous version or is it?

Post by Graham Banks »

Guys - this thread is becoming too long. Could you please start a new one to continue your discussions and let this one eventually drop off the page?

Thanks, Graham.
Tony Thomas

Re: Test is finished (better than previous version or is it?

Post by Tony Thomas »

Graham Banks wrote:Guys - this thread is becoming too long. Could you please start a new one to continue your discussions and let this one eventually drop off the page?

Thanks, Graham.
I will start a new one when I am testing the next version of Romi (in a few days) I dont like to start a new thread without an adequate reason. May be you should start like the mods at CTF, there they have threads that are 20 pages long and no one is complaining.