New asmFish released

syzygy · Post by **syzygy** » Sun Nov 06, 2016 3:18 pm

APassionForCriminalJustic wrote: Ronald's point is basically that Brainfish is nothing more than Stockfish development with the Cerebellum book. Brainfish's implementation of NUMA was actually taken from Mohammed Li (asmFish's author).

Indeed, and the Cerebellum book and in particular the automatic generation of it is obviously very nice work, but that is not why it was inserted into this thread.

There is no question that Brainfish is Stockfish with:
- Cerebellum book code by Thomas;
- NUMA patch by Mohammed (I think itself based on Texel's code);
- Windows LP (large pages) patch by I don't know who (possibly Thomas himself, but I suspect not).

The NUMA patch reverts the "per-thread CMH table" patch, resulting in a per-node CMH table. On a non-NUMA system this means a single shared CMH table, which the official SF had until recently.

Going from shared to per-thread was known and accepted to lose a few Elo when using multiple threads on a non-NUMA machine (at least at STC, probably a much smaller effect at LTC where the per-thread tables get enough time to fill up). It was accepted because it removed a bottleneck on NUMA machines. It has no effect on single-threaded play or speed.
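To make the three table layouts concrete, here is a minimal Python sketch (not taken from the Stockfish, asmFish or Brainfish sources) of which CMH table a given search thread would use under each policy. The names `Sharing`, `table_index` and `table_count` are made up for illustration, and the sketch assumes threads are assigned to NUMA nodes in contiguous blocks of equal size.

```python
from enum import Enum


class Sharing(Enum):
    """Hypothetical labels for the three CMH table layouts discussed above."""
    SHARED = "shared"          # one table used by all threads
    PER_NODE = "per-node"      # one table per NUMA node
    PER_THREAD = "per-thread"  # one table per search thread


def table_index(policy, thread_id, threads_per_node):
    """Return the index of the CMH table that thread_id would use.

    Assumption: threads 0..threads_per_node-1 run on node 0, the next
    block on node 1, and so on.
    """
    if policy is Sharing.SHARED:
        return 0
    if policy is Sharing.PER_NODE:
        return thread_id // threads_per_node
    return thread_id


def table_count(policy, n_threads, threads_per_node):
    """Count the distinct CMH tables in use for n_threads search threads."""
    return len({table_index(policy, t, threads_per_node) for t in range(n_threads)})
```

On a machine with a single node (so `threads_per_node` equals the thread count), `table_count(Sharing.PER_NODE, 8, 8)` is 1, which is the same single shared table as before the per-thread patch, while `Sharing.PER_THREAD` gives 8 separate tables that each have to fill up on their own.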

Everything can be verified by inspecting the source code. If I missed something, then I can easily be corrected by pointing out the relevant source file and line numbers or so.

The LP patch, once it is enabled via a UCI option, obviously does improve speed a lot when using larger hash tables. (And I can only applaud anyone who adds LP support to an engine. Not using this facility where it is available unnecessarily leaves the hardware underused.)
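The usual explanation for this speed-up is reduced TLB pressure: the same hash table is covered by far fewer page-table entries. A rough Python sketch of the arithmetic, assuming the typical x86-64 page sizes of 4 KiB (normal) and 2 MiB (large); `pages_needed` is a hypothetical helper, not anything from the engine sources.

```python
KIB = 1024
MIB = 1024 * KIB

NORMAL_PAGE = 4 * KIB  # assumed normal page size (typical x86-64)
LARGE_PAGE = 2 * MIB   # assumed large page size (typical x86-64)


def pages_needed(hash_mib, page_bytes):
    """Number of pages required to back a hash table of hash_mib MiB."""
    total_bytes = hash_mib * MIB
    return -(-total_bytes // page_bytes)  # ceiling division
```

For a 1024 MiB hash table this gives `pages_needed(1024, NORMAL_PAGE) == 262144` versus `pages_needed(1024, LARGE_PAGE) == 512`, which is why the effect grows with hash size.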

APassionForCriminalJustic wrote: That NUMA patch failed to show an Elo gain when tested on the Fishtest framework using 32 cores, if I am not mistaken.

Yes, so far it has not shown itself to be a win on NUMA machines except on Windows systems with more than 64 logical cores (where it overcomes the processor group limitation discussed elsewhere).

I have no doubt that scaling can be improved by optimising memory use on NUMA machines, but it is not yet clear by how much. For example, if SF already scales close to perfectly on a two-node machine, then any gains from NUMA modifications on that system will necessarily be very limited.
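The "necessarily very limited" part is simple arithmetic. A small Python sketch, assuming that scaling is measured as a speedup over one thread (for example in nodes per second) and that linear speedup is the ceiling; `numa_headroom` is a hypothetical name, and the numbers in the usage note are invented purely for illustration, not measurements.

```python
def numa_headroom(threads, measured_speedup):
    """Upper bound on the relative speed gain still available.

    Assumes perfect (linear) scaling is the best any NUMA modification
    could reach, so the remaining gain is threads / measured_speedup - 1.
    """
    if not 0 < measured_speedup <= threads:
        raise ValueError("measured_speedup must be in (0, threads]")
    return threads / measured_speedup - 1.0
```

If, hypothetically, 32 threads already ran 30 times faster than one thread, `numa_headroom(32, 30.0)` would be under 7%, and at a measured speedup of 32 there would be nothing left to gain.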

Gusev · Post by **Gusev** » Sun Nov 06, 2016 5:33 pm

Hi Ronald,

Is Cfish 8 out, to match Stockfish 8 and compare?

Dmitri

syzygy · Post by **syzygy** » Sun Nov 06, 2016 7:07 pm

Gusev wrote: Is Cfish 8 out, to match Stockfish 8 and compare?

https://github.com/syzygy1/Cfish/releases/tag/cfish_8

PaulieD · Post by **PaulieD** » Sun Nov 06, 2016 7:16 pm

   # PLAYER         :  RATING  ERROR  PLAYED    W     D    L   (%)  D(%)
   1 SF8 ASMPD      :    3219      6    2000  471  1218  311  54.0  60.9
   2 CF 110516      :    3200      7    2000  396  1206  398  50.0  60.3
   3 Stockfish 8    :    3181      7    2000  318  1206  476  46.0  60.3
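For anyone wanting to sanity-check such a table, here is a minimal Python sketch that recomputes the score percentage from W/D/L and converts it to an Elo difference with the standard logistic Elo formula. Which rating tool produced the table above is not stated, so the use of this exact model is an assumption; `score_fraction` and `elo_diff` are hypothetical helper names.

```python
import math


def score_fraction(wins, draws, losses):
    """Fraction of points scored, counting a draw as half a point."""
    games = wins + draws + losses
    return (wins + 0.5 * draws) / games


def elo_diff(score):
    """Elo difference implied by a score fraction under the logistic model."""
    if not 0.0 < score < 1.0:
        raise ValueError("score must be strictly between 0 and 1")
    return -400.0 * math.log10(1.0 / score - 1.0)
```

For the top row, `score_fraction(471, 1218, 311)` gives 0.54, matching the 54.0 in the (%) column, and under this model that corresponds to roughly 28 Elo above the average opposition.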

Gusev · Post by **Gusev** » Mon Nov 07, 2016 11:23 pm

Thank you very much, I will compile and test!

