Discussion of anything and everything relating to chess playing software and machines.
Moderators: hgm, Dann Corbit, Harvey Williamson
Forum rules
This textbox is used to restore diagrams posted with the [d] tag before the upgrade.
-
smatovic
- Posts: 1621
- Joined: Wed Mar 10, 2010 9:18 pm
- Location: Hamburg, Germany
- Full name: Srdja Matovic
-
Contact:
Post
by smatovic » Fri Sep 18, 2020 5:05 pm
Laskos wrote: ↑Fri Sep 18, 2020 4:47 pm
smatovic wrote: ↑Fri Sep 18, 2020 4:17 pm
Hmm, according to all those articles and papers, a slow down of RTX 3080 compared to RTX 2080 TI in some AI tasks is also possible.
You should compare 3080 with 2080 TI, they have similar SM core count.
--
Srdja
I seem to manage to extrapolate from this chart:
https://www.pugetsystems.com/labs/hpc/R ... nary-1885/
Here we cannot compare directly old RTX 2080 Ti numbers with the RTX 3080 test. RTX 3080 test (1023 images/sec) uses significantly newer drivers, TensorFlow version, testing setup. But we have RTX Titan tested now which got 1082 images/sec on FP16 with the old result which was 653, compared with RTX 2080 Ti of 532 (old result). By extrapolation, with the "old setup" 3080 would get 617 images/sec, and now we can compare to 2080 Ti (532) and 2080 (382). The speed-up RTX 2080 --> 3080 is about 60%, which is good but not great.
In this scope RTX 3080 would be only 1.16x compared to RTX 2080 TI, what can be
explained by frequency increase alone, without any architecture changes or
improvements in SM design.
--
Srdja
-
Laskos
- Posts: 10948
- Joined: Wed Jul 26, 2006 8:21 pm
- Full name: Kai Laskos
Post
by Laskos » Fri Sep 18, 2020 5:18 pm
smatovic wrote: ↑Fri Sep 18, 2020 5:05 pm
Laskos wrote: ↑Fri Sep 18, 2020 4:47 pm
smatovic wrote: ↑Fri Sep 18, 2020 4:17 pm
Hmm, according to all those articles and papers, a slow down of RTX 3080 compared to RTX 2080 TI in some AI tasks is also possible.
You should compare 3080 with 2080 TI, they have similar SM core count.
--
Srdja
I seem to manage to extrapolate from this chart:
https://www.pugetsystems.com/labs/hpc/R ... nary-1885/
Here we cannot compare directly old RTX 2080 Ti numbers with the RTX 3080 test. RTX 3080 test (1023 images/sec) uses significantly newer drivers, TensorFlow version, testing setup. But we have RTX Titan tested now which got 1082 images/sec on FP16 with the old result which was 653, compared with RTX 2080 Ti of 532 (old result). By extrapolation, with the "old setup" 3080 would get 617 images/sec, and now we can compare to 2080 Ti (532) and 2080 (382). The speed-up RTX 2080 --> 3080 is about 60%, which is good but not great.
In this scope RTX 3080 would be only 1.16x compared to RTX 2080 TI, what can be
explained by frequency increase alone, without any architecture changes or
improvements in SM design.
--
Srdja
The second hand Turings would seem the most efficient per dollar GPUs for deep learning, especially the cheapo second hand RTX 2060 Super and 2070 Super. I might end up buying one of those atop of my old RTX 2070 for the same speed as a single RTX 3080. That is, if I am not using GPU for other reasons too.
-
mwyoung
- Posts: 2725
- Joined: Wed May 12, 2010 8:00 pm
Post
by mwyoung » Sat Sep 19, 2020 3:35 am
chrisw wrote: ↑Thu Sep 17, 2020 11:47 am
Laskos wrote: ↑Thu Sep 17, 2020 10:28 am
Laskos wrote: ↑Wed Sep 16, 2020 11:32 pm
Laskos wrote: ↑Wed Sep 16, 2020 10:07 pm
CUDA cores seem to perform well. Maybe for Leela case a 60-70% or so over RTX 2080 (non-super), but let's see the benches.
However maybe a bit less than 50% improvement with 3070 over my 2070 next month. This time 3080 seems the best deal. Last time it was 2070 or 2060. But aside money, 3080 seems way too large, and it consumes like an air conditioner, in real terms in excess of 350W.
After all, if only for Chess and Go, maybe the best is to spend just $250 and get a second hand 2060 Super on top of my 2070, close to 100% boost. Will anyway have to upgrade the case and the overall cooling.
Or buy yourself 20 cores and abandon giant networks that insist on GPU
That is your most insane advice I have seen yet.
Sure buy at 20 core system, but don't spend an extra 500 bucks for a RTX GPU.
That you would need, and want anyway with a 20 core system.
Why, because the nets are 131 MB. And take us 15 seconds to download.

"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
-
chrisw
- Posts: 3851
- Joined: Tue Apr 03, 2012 2:28 pm
Post
by chrisw » Sat Sep 19, 2020 6:37 am
mwyoung wrote: ↑Sat Sep 19, 2020 3:35 am
chrisw wrote: ↑Thu Sep 17, 2020 11:47 am
Laskos wrote: ↑Thu Sep 17, 2020 10:28 am
Laskos wrote: ↑Wed Sep 16, 2020 11:32 pm
Laskos wrote: ↑Wed Sep 16, 2020 10:07 pm
CUDA cores seem to perform well. Maybe for Leela case a 60-70% or so over RTX 2080 (non-super), but let's see the benches.
However maybe a bit less than 50% improvement with 3070 over my 2070 next month. This time 3080 seems the best deal. Last time it was 2070 or 2060. But aside money, 3080 seems way too large, and it consumes like an air conditioner, in real terms in excess of 350W.
After all, if only for Chess and Go, maybe the best is to spend just $250 and get a second hand 2060 Super on top of my 2070, close to 100% boost. Will anyway have to upgrade the case and the overall cooling.
Or buy yourself 20 cores and abandon giant networks that insist on GPU
That is your most insane advice I have seen yet.
Sure buy at 20 core system, but don't spend an extra 500 bucks for a RTX GPU.
That you would need, and want anyway with a 20 core system.
Que? I already have two boxes with 2080ti in each. I know you’re an end user, so your requirements are different. Dev kit for a CPU based engine doesn’t need anything more than the simplest GPU. If there is any GPU dev requirement, and sometimes there is, then there’s capacity on other equipment.
Why, because the nets are 131 MB. And take us 15 seconds to download.
One word of advice. I am not you.
-
Laskos
- Posts: 10948
- Joined: Wed Jul 26, 2006 8:21 pm
- Full name: Kai Laskos
Post
by Laskos » Thu Oct 08, 2020 11:55 am
Finally, some reliable data with Lc0, although using OpenCL backend, but the relative speeds of different RTX GPUs should be reflected:
https://www.phoronix.com/scan.php?page= ... pute&num=6
3080 ---> 33% above 2080 Ti and a whopping 88% over the same price at the release 2080. Good numbers, now I will wait and see the 3070 speeds.

-
Laskos
- Posts: 10948
- Joined: Wed Jul 26, 2006 8:21 pm
- Full name: Kai Laskos
Post
by Laskos » Wed Oct 21, 2020 10:56 am
Hmmm, the things changed a bit with the v26.3 engine and CUDA backend. The previous graph in the thread was for v26 engine and OpenCL backend. The new CUDA backend (it's the fastest one with RTX GPUs) shows the largest improvement with RTX 2xxx GPUs and smaller with RTX 3xxx. So that I saw a benchmark showing now RTX 3090 just 40% above RTX 2080Ti, meaning 3080 would stand at mere 10% or so above 2080Ti with v26.3 CUDA backend. Not that good, and I don't know whether there are foreseen backend improvements targeted towards 3xxx series.
-
smatovic
- Posts: 1621
- Joined: Wed Mar 10, 2010 9:18 pm
- Location: Hamburg, Germany
- Full name: Srdja Matovic
-
Contact:
Post
by smatovic » Wed Oct 21, 2020 11:01 am
Laskos wrote: ↑Wed Oct 21, 2020 10:56 am
Hmmm, the things changed a bit with the v26.3 engine and CUDA backend. The previous graph in the thread was for v26 engine and OpenCL backend. The new CUDA backend (it's the fastest one with RTX GPUs) shows the largest improvement with RTX 2xxx GPUs and smaller with RTX 3xxx. So that I saw a benchmark showing now RTX 3090 just 40% above RTX 2080Ti, meaning 3080 would stand at mere 10% or so above 2080Ti with v26.3 CUDA backend. Not that good, and I don't know whether there are foreseen backend improvements targeted towards 3xxx series.
10% would confirm the speedup via clock increase only with no benefits from architecture changes.
--
Srdja
-
Laskos
- Posts: 10948
- Joined: Wed Jul 26, 2006 8:21 pm
- Full name: Kai Laskos
Post
by Laskos » Wed Oct 21, 2020 11:12 am
smatovic wrote: ↑Wed Oct 21, 2020 11:01 am
Laskos wrote: ↑Wed Oct 21, 2020 10:56 am
Hmmm, the things changed a bit with the v26.3 engine and CUDA backend. The previous graph in the thread was for v26 engine and OpenCL backend. The new CUDA backend (it's the fastest one with RTX GPUs) shows the largest improvement with RTX 2xxx GPUs and smaller with RTX 3xxx. So that I saw a benchmark showing now RTX 3090 just 40% above RTX 2080Ti, meaning 3080 would stand at mere 10% or so above 2080Ti with v26.3 CUDA backend. Not that good, and I don't know whether there are foreseen backend improvements targeted towards 3xxx series.
10% would confirm the speedup via clock increase only with no benefits from architecture changes.
--
Srdja
Yeah, really disappointing. And not a better deal with the coming more affordable RTX 3070 which by extrapolation would be only 30% faster with Lc0 than my current 2070. Maybe a backend improvement targeting 3xxx series will be available sometime soon, I don't know, someone should ask Ankan.
-
George Tsavdaris
- Posts: 1627
- Joined: Thu Mar 09, 2006 11:35 am
Post
by George Tsavdaris » Wed Oct 21, 2020 8:59 pm
Laskos wrote: ↑Wed Oct 21, 2020 10:56 am
Hmmm, the things changed a bit with the v26.3 engine and CUDA backend. The previous graph in the thread was for v26 engine and OpenCL backend. The new CUDA backend (it's the fastest one with RTX GPUs) shows the largest improvement with RTX 2xxx GPUs and smaller with RTX 3xxx. So that I saw a benchmark showing now RTX 3090 just 40% above RTX 2080Ti, meaning 3080 would stand at mere 10% or so above 2080Ti with v26.3 CUDA backend. Not that good, and I don't know whether there are foreseen backend improvements targeted towards 3xxx series.
Where's that CUDA benchmark page link?
After his son's birth they've asked him:
"Is it a boy or girl?"
YES! He replied.....
-
Laskos
- Posts: 10948
- Joined: Wed Jul 26, 2006 8:21 pm
- Full name: Kai Laskos
Post
by Laskos » Wed Oct 21, 2020 9:04 pm
George Tsavdaris wrote: ↑Wed Oct 21, 2020 8:59 pm
Laskos wrote: ↑Wed Oct 21, 2020 10:56 am
Hmmm, the things changed a bit with the v26.3 engine and CUDA backend. The previous graph in the thread was for v26 engine and OpenCL backend. The new CUDA backend (it's the fastest one with RTX GPUs) shows the largest improvement with RTX 2xxx GPUs and smaller with RTX 3xxx. So that I saw a benchmark showing now RTX 3090 just 40% above RTX 2080Ti, meaning 3080 would stand at mere 10% or so above 2080Ti with v26.3 CUDA backend. Not that good, and I don't know whether there are foreseen backend improvements targeted towards 3xxx series.
Where's that CUDA benchmark page link?
I saw on Lc0 discord someone benching with v26.3 CUDA the RTX 3090 as being 40% faster than RTX 2080Ti (30.5 vs 22.0 knps with a large JH net).