lovetb wrote: ↑Tue Mar 05, 2019 1:17 am
Hi, I have been using Lc0 for a week now. The GPU load keeps going up & down.
You can see in the graph below,
It doesn't stay at 100% like the CPUs when playing Stockfish.
Is this normal ?
Yes, looks exactly like my graph. Put a 5xxxx NN on and see what happends.
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
M ANSARI wrote: ↑Tue Mar 05, 2019 8:42 am
Maybe thermal issues and GPU is throttling?
You are right. I left the case open for better air circulation & I unsinstalled MSI Afterburner.
I had better results.
If you reinstall msi afterburner. And go to settings and setup a custom fan curve. You will not have to leave your case open.
Unless your case has inadequate air flow.
The Factory fan curve is not good when running Lc0.
I have my fan curve hitting 100% fan speed when temps hit 60c. And this fan curve has worked very well.
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
I think throttling of the GPU was the cause of a few lost points for Lc0 in TCEC 14. There were a few moves that simply made no sense and I was using the same network with similar hardware. It might actually make sense to have some sort of water cooling for the GPU's.
M ANSARI wrote: ↑Tue Mar 05, 2019 7:51 pm
I think throttling of the GPU was the cause of a few lost points for Lc0 in TCEC 14. There were a few moves that simply made no sense and I was using the same network with similar hardware. It might actually make sense to have some sort of water cooling for the GPU's.
Yea
I built a custom water cooling for my 275 Watt Fury X, and for hobbyists i can recommend a Zalman Reserator all in one passive cooler plus 120mm radiator and a 120mm water-temperature controlled fan. I really like the Zalman, i estimate it alone cools about 150 Watt...
M ANSARI wrote: ↑Tue Mar 05, 2019 7:51 pm
I think throttling of the GPU was the cause of a few lost points for Lc0 in TCEC 14. There were a few moves that simply made no sense and I was using the same network with similar hardware. It might actually make sense to have some sort of water cooling for the GPU's.
Lc0 is very hard on a GPU. If you do not have a beefy air cooler. It will beat up a GPU. I bought a card with a beefy air cooler with triple fans. And I still had use a custom fan curve to keep temp under control. At full load now the card runs in the low 60's c.
After testing I found that it was not my temps causing weak play and moves. It was using minibatchsize = 1024. I lost some NPS by going back to 256, but test play is now much better. And the latest 4xxxx nets are on the verge of overtaking the latest DEV. of Stockfish playing on a 2950x at 5+5 TC. Around 10 elo difference.
"The worst thing that can happen to a forum is a running wild attacking moderator(HGM) who is not corrected by the community." - Ed Schröder
But my words like silent raindrops fell. And echoed in the wells of silence.
If I try to host the GPU out of the box using a raiser card adopter something like shown below, will it affect the performance?
The idea is to give the gpu more cooler air to breath & to cool down.
lovetb wrote: ↑Wed Mar 06, 2019 11:46 am
Thanks for all your replies.
If I try to host the GPU out of the box using a raiser card adopter something like shown below, will it affect the performance?
The idea is to give the gpu more cooler air to breath & to cool down.
Maybe this 6 way LC0 rig via PCIe risers is of interest for you