Upcoming graphics cards for Leela

Discussion of anything and everything relating to chess playing software and machines.

Moderators: hgm, Rebel, chrisw

whereagles
Posts: 565
Joined: Thu Nov 13, 2014 12:03 pm

Re: Upcoming graphics cards for Leela

Post by whereagles »

Dann Corbit wrote: Wed Aug 22, 2018 1:32 am I just bought a 1080 Ti because you can get them under $700 now.
will it overheat?

PLS I HAVE TO KNOW
Werewolf
Posts: 1795
Joined: Thu Sep 18, 2008 10:24 pm

Re: Upcoming graphics cards for Leela

Post by Werewolf »

jkiliani wrote: Wed Aug 22, 2018 3:48 am
Werewolf wrote: Tue Aug 21, 2018 11:00 am Here's some better info, based on what's now released and in terms of TFLOPS, rather than FPS or something silly like that.

Titan RTX (??)
Quadro RTX 6000 - 15.9
Titan V - 14.9 (29.8 half P)
2080 Ti - 13.4
Quadro P6000 - 11.7
1080 Ti - 11.3
Quadro RTX 5000 - 10.6
2080 - 10.1
Quadro P5000 - 9.5
1080 - 8.9
2070 - 7.5
1060 - 4.4
I think this is incorrect. While we don't have actual test benchmarks yet, the RTX lineup shares wiith the Titan V one important advantage over the GTX 10x0 cards: It can utilize fp16 inference fully, which gives a factor of 2 to speed in addition to the TFLOPS you posted. Currently, a Titan V can do around triple the Leela NPS that a 1080Ti can do, and the 2080Ti should be very close to that.
It would be good if you are right, but it would also be a bit odd that Nvidia would eat into the profits of their more expensive cards.
Werewolf
Posts: 1795
Joined: Thu Sep 18, 2008 10:24 pm

Re: Upcoming graphics cards for Leela

Post by Werewolf »

Milos wrote: Wed Aug 22, 2018 4:22 am
jkiliani wrote: Wed Aug 22, 2018 3:48 am
Werewolf wrote: Tue Aug 21, 2018 11:00 am Here's some better info, based on what's now released and in terms of TFLOPS, rather than FPS or something silly like that.

Titan RTX (??)
Quadro RTX 6000 - 15.9
Titan V - 14.9 (29.8 half P)
2080 Ti - 13.4
Quadro P6000 - 11.7
1080 Ti - 11.3
Quadro RTX 5000 - 10.6
2080 - 10.1
Quadro P5000 - 9.5
1080 - 8.9
2070 - 7.5
1060 - 4.4
I think this is incorrect. While we don't have actual test benchmarks yet, the RTX lineup shares wiith the Titan V one important advantage over the GTX 10x0 cards: It can utilize fp16 inference fully, which gives a factor of 2 to speed in addition to the TFLOPS you posted. Currently, a Titan V can do around triple the Leela NPS that a 1080Ti can do, and the 2080Ti should be very close to that.
Why this BS-ing when it is also known that neither 2070, nor 2080, nor 2080Ti will have real FP16, i.e. all 3 will have it as 10xx cards at 1:64 rate?
Only Quadro's and Titan RTX will have actual FP16.
For some reason Wikipedia doesn't list the FP16 performance of the Quadro cards. They definitely do half precision?

https://en.wikipedia.org/wiki/List_of_N ... xxx_series
Milos
Posts: 4190
Joined: Wed Nov 25, 2009 1:47 am

Re: Upcoming graphics cards for Leela

Post by Milos »

Werewolf wrote: Wed Aug 22, 2018 11:02 am
Milos wrote: Wed Aug 22, 2018 4:22 am
jkiliani wrote: Wed Aug 22, 2018 3:48 am
Werewolf wrote: Tue Aug 21, 2018 11:00 am Here's some better info, based on what's now released and in terms of TFLOPS, rather than FPS or something silly like that.

Titan RTX (??)
Quadro RTX 6000 - 15.9
Titan V - 14.9 (29.8 half P)
2080 Ti - 13.4
Quadro P6000 - 11.7
1080 Ti - 11.3
Quadro RTX 5000 - 10.6
2080 - 10.1
Quadro P5000 - 9.5
1080 - 8.9
2070 - 7.5
1060 - 4.4
I think this is incorrect. While we don't have actual test benchmarks yet, the RTX lineup shares wiith the Titan V one important advantage over the GTX 10x0 cards: It can utilize fp16 inference fully, which gives a factor of 2 to speed in addition to the TFLOPS you posted. Currently, a Titan V can do around triple the Leela NPS that a 1080Ti can do, and the 2080Ti should be very close to that.
Why this BS-ing when it is also known that neither 2070, nor 2080, nor 2080Ti will have real FP16, i.e. all 3 will have it as 10xx cards at 1:64 rate?
Only Quadro's and Titan RTX will have actual FP16.
For some reason Wikipedia doesn't list the FP16 performance of the Quadro cards. They definitely do half precision?

https://en.wikipedia.org/wiki/List_of_N ... xxx_series
Nothing is sure before someone actually tests card for specific performance, but there are quite a few indicators.
If you look for example fp64 performance it is real one (1:2) while in the past it was 1:32.
mirek
Posts: 52
Joined: Sat Mar 24, 2018 4:18 pm

Re: Upcoming graphics cards for Leela

Post by mirek »

Milos wrote: Wed Aug 22, 2018 12:38 pm
Werewolf wrote: Wed Aug 22, 2018 11:02 am For some reason Wikipedia doesn't list the FP16 performance of the Quadro cards. They definitely do half precision?

https://en.wikipedia.org/wiki/List_of_N ... xxx_series
Nothing is sure before someone actually tests card for specific performance, but there are quite a few indicators.
If you look for example fp64 performance it is real one (1:2) while in the past it was 1:32.
Tensor core performance on RTX 8000 was explicitly mentioned during the new Quadro cards reveal.
Going by memory it's 125 TFLOPs for fp16, 250 TOPs for int8 and 500 TOPs for int4
and these numbers seem to be confirmed also by https://wccftech.com/nvidia-quadro-rtx- ... cing-demo/
Werewolf
Posts: 1795
Joined: Thu Sep 18, 2008 10:24 pm

Re: Upcoming graphics cards for Leela

Post by Werewolf »

mirek wrote: Wed Aug 22, 2018 2:39 pm
Tensor core performance on RTX 8000 was explicitly mentioned during the new Quadro cards reveal.
Going by memory it's 125 TFLOPs for fp16,
:shock:

Lc0 can't use Tensor yet though ?
mirek
Posts: 52
Joined: Sat Mar 24, 2018 4:18 pm

Re: Upcoming graphics cards for Leela

Post by mirek »

Werewolf wrote: Wed Aug 22, 2018 5:17 pm
mirek wrote: Wed Aug 22, 2018 2:39 pm
Tensor core performance on RTX 8000 was explicitly mentioned during the new Quadro cards reveal.
Going by memory it's 125 TFLOPs for fp16,
:shock:

Lc0 can't use Tensor yet though ?
There are some benchmarks showing that lc0 is ~ 3x faster on Titan V than on 1080 Ti
Huge part of it I am sure comes from efficient fp16 operations that are enabled on tensor cores.

Also worth noting is that while Titan V has very similar specs to RTX 8000 (110 TFLOPs fp16)
and thus should be theoretically about 10x faster then 1080 TI (11 TFLOPs fp32) this is clearly not the case.
I am not sure if current implementation could be further optimized to take better advantage of the tensor cores or if there
is some fundamental limit i.e. like saturated memory bandwidth for current NN architectire.
Dann Corbit
Posts: 12538
Joined: Wed Mar 08, 2006 8:57 pm
Location: Redmond, WA USA

Re: Upcoming graphics cards for Leela

Post by Dann Corbit »

whereagles wrote: Wed Aug 22, 2018 10:44 am
Dann Corbit wrote: Wed Aug 22, 2018 1:32 am I just bought a 1080 Ti because you can get them under $700 now.
will it overheat?

PLS I HAVE TO KNOW
I doubt it.

I have a 1200 watt power supply, and I have it external to the machine. So all of that heat that is normally given off by the power supply inside the box is dumped into the room instead. The machine and the cards have plenty of cooling. The 1080 ti does not have a history of overheating. So I am not worried about it.

But until you actually do it for a while, there is no way to know for sure.
Taking ideas is not a vice, it is a virtue. We have another word for this. It is called learning.
But sharing ideas is an even greater virtue. We have another word for this. It is called teaching.
Werewolf
Posts: 1795
Joined: Thu Sep 18, 2008 10:24 pm

Re: Upcoming graphics cards for Leela

Post by Werewolf »

mirek wrote: Wed Aug 22, 2018 7:03 pm
Werewolf wrote: Wed Aug 22, 2018 5:17 pm
mirek wrote: Wed Aug 22, 2018 2:39 pm
Tensor core performance on RTX 8000 was explicitly mentioned during the new Quadro cards reveal.
Going by memory it's 125 TFLOPs for fp16,
:shock:

Lc0 can't use Tensor yet though ?
There are some benchmarks showing that lc0 is ~ 3x faster on Titan V than on 1080 Ti
Huge part of it I am sure comes from efficient fp16 operations that are enabled on tensor cores.

Also worth noting is that while Titan V has very similar specs to RTX 8000 (110 TFLOPs fp16)
and thus should be theoretically about 10x faster then 1080 TI (11 TFLOPs fp32) this is clearly not the case.
I am not sure if current implementation could be further optimized to take better advantage of the tensor cores or if there
is some fundamental limit i.e. like saturated memory bandwidth for current NN architectire.
Saw this on the blog today:

The learning rate for the test10 training run has been lowered to 0.0002. Network id 11013 will be the first network trained with the new LR.

This is the last time we lower it for test10 to squeeze some more Elo out of it. It's expected that the result will be visible within a day or two.

The test10 will probably stay for some weeks, and after that the plan is to do a reset and to start a main2 run from scratch again.

What will change after restart:
int8 quantization during training
That's how DeepMind did it. This will produce networks compatible with TensorRT framework which should considerably improve nps on supported hardware.
Lion
Posts: 531
Joined: Fri Mar 31, 2006 1:26 pm
Location: Switzerland

Re: Upcoming graphics cards for Leela

Post by Lion »

Do we know more about the RTX 2080 Ti?

Also are there benchmark available for Titan V and 1080 Ti.

I am considering upgrading my GPU but not sure the the 3k USD is worth for the Titan V....

rgds