I'm using NNCacheSize = 100000000, and I have 128 GB of RAM. According to Lc0's own log file, this should have been more than enough for the search I was running.
The longest I have run a single position is 5 1/2 hours. It ran with 64 GB of RAM on a 2080 Ti without issues, and at some point it started using my NVMe drive as RAM, also without issues.
I have also run Lc0 in match play for over a week without issues.
Did you log your GPU temp?
The only issue I have had running Lc0 is when I overclock my GPU. Even if I overclock it by only 1%, Lc0 will crash at some point. If I do not overclock my GPU, Lc0 never crashes.
mwyoung wrote: ↑Sat Aug 10, 2019 4:47 am
The longest I have run a single position is 5 1/2 hours. It ran with 64 GB of RAM on a 2080 Ti without issues, and at some point it started using my NVMe drive as RAM, also without issues.
I have also run Lc0 in match play for over a week without issues.
Did you log your GPU temp?
The only issue I have had running Lc0 is when I overclock my GPU. Even if I overclock it by only 1%, Lc0 will crash at some point. If I do not overclock my GPU, Lc0 never crashes.
I didn't log the GPU temp. I've been checking it sporadically during searches, and I've never seen it above 73 °C. But I have been using Nvidia's X Server Settings app to give a small positive offset to the graphics clock. Perhaps this is the culprit. I'll return to the default setting and see if the crashes stop. Thanks.
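For anyone who wants a continuous record instead of spot checks, here is a rough sketch of a logger that polls nvidia-smi and appends the GPU temperature and graphics clock to a CSV file. It assumes nvidia-smi is on the PATH and that the card reports both fields as plain numbers; the function names, the file name gpu_log.csv, and the 5-second interval are my own choices for illustration, not anything Lc0 provides.

```python
import csv
import subprocess
import time
from datetime import datetime

# Fields requested from nvidia-smi: core temperature (C) and graphics clock (MHz).
QUERY = "temperature.gpu,clocks.gr"


def parse_sample(line):
    """Parse one 'temp, clock' line (csv,noheader,nounits format) into two ints."""
    temp_c, clock_mhz = (int(field.strip()) for field in line.split(","))
    return temp_c, clock_mhz


def read_gpu(gpu_index=0):
    """Ask nvidia-smi for the current temperature and graphics clock of one GPU."""
    out = subprocess.run(
        [
            "nvidia-smi",
            f"--id={gpu_index}",
            f"--query-gpu={QUERY}",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return parse_sample(out.strip().splitlines()[0])


def log_gpu(path="gpu_log.csv", interval_s=5.0, samples=None, read=read_gpu):
    """Append timestamped readings to `path`; runs until interrupted if samples is None."""
    taken = 0
    with open(path, "a", newline="") as f:
        writer = csv.writer(f)
        while samples is None or taken < samples:
            temp_c, clock_mhz = read()
            stamp = datetime.now().isoformat(timespec="seconds")
            writer.writerow([stamp, temp_c, clock_mhz])
            f.flush()  # push each row out promptly so a crash loses as little as possible
            taken += 1
            if samples is None or taken < samples:
                time.sleep(interval_s)
```

Calling log_gpu() in a second terminal before starting the search should leave a file whose last rows show the temperature and clock just before a crash, which makes it easier to compare runs with and without the clock offset.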