Cerebras - but honey, can it play chess?

Discussion of anything and everything relating to chess playing software and machines.

Moderator: Ras

smatovic
Posts: 3331
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Cerebras - but honey, can it play chess?

Post by smatovic »

CS2 with 2.4 petabyte RAM module, able to run GPT-3 model with 120 trillion weights, planned for Q4:

https://www.zdnet.com/article/cerebras- ... -networks/

--
Srdja
User avatar
AdminX
Posts: 6363
Joined: Mon Mar 13, 2006 2:34 pm
Location: Acworth, GA

Re: Cerebras - but honey, can it play chess?

Post by AdminX »

:shock: :shock: :shock: Wow!
"Good decisions come from experience, and experience comes from bad decisions."
__________________________________________________________________
Ted Summers
smatovic
Posts: 3331
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Cerebras - but honey, can it play chess?

Post by smatovic »

Haha, size matters ;)

Image

--
Srdja
User avatar
AdminX
Posts: 6363
Joined: Mon Mar 13, 2006 2:34 pm
Location: Acworth, GA

Re: Cerebras - but honey, can it play chess?

Post by AdminX »

smatovic wrote: Sun Sep 05, 2021 9:42 am Haha, size matters ;)
--
Srdja
:lol: Haha, That' s what she said. Reminds me of and old Richard Pryor Joke. 'If you had two more inches of ...'

***WARNING ADULT AUDIO CONTENT***

Check the 1:48 minute mark:
"Good decisions come from experience, and experience comes from bad decisions."
__________________________________________________________________
Ted Summers
smatovic
Posts: 3331
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Cerebras - but honey, can it play chess?

Post by smatovic »

AdminX wrote: Sun Sep 05, 2021 9:55 am
smatovic wrote: Sun Sep 05, 2021 9:42 am Haha, size matters ;)
--
Srdja
...
That' s what she said.
...
Haha :lol:

--
Srdja
User avatar
towforce
Posts: 12514
Joined: Thu Mar 09, 2006 12:57 am
Location: Birmingham UK
Full name: Graham Laight

Re: Cerebras - but honey, can it play chess?

Post by towforce »

smatovic wrote: Sat Sep 04, 2021 9:38 pm CS2 with 2.4 petabyte RAM module, able to run GPT-3 model with 120 trillion weights, planned for Q4:

https://www.zdnet.com/article/cerebras- ... -networks/

--
Srdja

There are already petascale computers you can buy - link. There are computers beyond that, but you cannot buy them "off the shelf".

You asked whether it can play chess: I've been looking everywhere, and I've found a program I think it's powerful to run - link. :)
Human chess is partly about tactics and strategy, but mostly about memory
smatovic
Posts: 3331
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Cerebras - but honey, can it play chess?

Post by smatovic »

towforce wrote: Sun Sep 05, 2021 5:06 pm ...
Did you read the article? This machine was designed for one purpose, to train and inference large neural networks. It competes with clusters of thousands of GPUs, and according to the PR successful, it moves the bar from 1.6 trillion neural network parameters to 120 trillion, ~100x. If the DoE says it rocks then it does ;)

--
Srdja
User avatar
towforce
Posts: 12514
Joined: Thu Mar 09, 2006 12:57 am
Location: Birmingham UK
Full name: Graham Laight

Re: Cerebras - but honey, can it play chess?

Post by towforce »

smatovic wrote: Sun Sep 05, 2021 8:27 pmDid you read the article? This machine was designed for one purpose, to train and inference large neural networks. It competes with clusters of thousands of GPUs, and according to the PR successful, it moves the bar from 1.6 trillion neural network parameters to 120 trillion, ~100x. If the DoE says it rocks then it does ;)

--
Srdja

It's a cluster of 192 computers, whereas the Nvidia DGX A100 is a single computer. They can improve the efficiency of the clustering, however, by taking advantage of sparse weights (hence sparse gradients):


Image
Human chess is partly about tactics and strategy, but mostly about memory
smatovic
Posts: 3331
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Cerebras - but honey, can it play chess?

Post by smatovic »

towforce wrote: Sun Sep 05, 2021 9:53 pm It's a cluster of 192 computers, whereas the Nvidia DGX A100 is a single computer. They can improve the efficiency of the clustering, however, by taking advantage of sparse weights (hence sparse gradients):
...
Maybe dig deeper into data/model/layer parallelism, how the WSE runs matrices on GB of SRAM fed by the main memory module. A100 has sparsity acceleration too AFAIK.

***edit**

A cluster of ~256 DGX2 (8xA100 each) runs a ~1 trillion model, a CS-2 setup with 192 WSE-2 runs 120 trillion, which one do you prefer?

--
Srdja
smatovic
Posts: 3331
Joined: Wed Mar 10, 2010 10:18 pm
Location: Hamburg, Germany
Full name: Srdja Matovic

Re: Cerebras - but honey, can it play chess?

Post by smatovic »

Followup: the Nvidia DGX2 has 16xV100, the Nvidia DGX A100 has 8xA100.

--
Srdja