lucario6607 wrote: ↑Wed Dec 03, 2025 1:16 pm
You do realize that multipv does nothing for leela besides making it output the moves to uci?
That was true as for about version 0.28 and the nets actual then, from later onwards there are differences in time to solution to be seen again and again, even if (no deterministic output of Lc0 at all, not with single CPU- thread neither, unlike as for A-B-engines) it's always a matter of enough data for statistically significant proof. But take a look at the list above, when I ad two runs of older Lc0- versions and nets MultiPV=1, nr32 and 42 were not in list before but are now:
Code: Select all
Program Elo +/- Matches Score Av.Op. S.Pos. MST1 MST2 RIndex
8 Stockfish-251112-8t-MuPV4 : 3554 2 38124 57.9 % 3498 189/256 3.4s 10.4s 0.61
10 Stockfish17.1-8t-MuPV4 : 3550 2 38999 57.5 % 3498 195/256 4.3s 10.4s 0.55
13 Lc0v0.32.0-3070ti-1740-MuPV4 : 3547 2 37773 56.9 % 3499 193/256 4.8s 11.0s 0.57
17 Lc0v0.32.1-RTX5070-6147500PT-MuPV4 : 3543 2 38481 56.4 % 3499 187/256 4.1s 11.0s 0.55
25 PlentyChess7.0.22-8t-MuPV4 : 3540 2 37664 55.8 % 3499 181/256 3.8s 11.4s 0.54
26 Lc0v0.32.0-1740-MuPV4-RTX5070 : 3540 2 38376 55.9 % 3499 184/256 4.0s 11.3s 0.52
32 Lc0v0.32.0-dev-1740-MuPV1 : 3539 2 37103 55.6 % 3499 180/256 4.0s 11.7s 0.57
42 Lc0v0.32.0-4520-MuPV1 : 3535 2 37370 55.0 % 3500 180/256 4.6s 12.1s 0.54
48 Lc0v0.31.0onnx-RTX5070-BT5-3700M : 3532 2 38003 54.8 % 3499 185/256 5.1s 12.0s 0.49
71 Lc0v0.31.0-dag-onnx-3070ti-BT5-3700M : 3519 2 37659 52.9 % 3499 175/256 5.4s 13.2s 0.44
Theses two runs both were with older one GPU (3070ti), the more versions and nets I added, the less I did let MuPV1 and MuPV4 run both to see direct comparison, because the differences weren't never ever big compared to A-B MultiPV1 and MultiPV4 at all and got even smaller with newer versions and nets again, now with RTX 5070 even more like that probably, so I could as well have MuPV1 runs for Lc0 only as well as MuPV4 runs only. The reason, I don't delete the runs of little interest is, EloStatTS gets lower error bar with each and every new run in same list, computing Elo and error for each and every old and new run position- and engine- wise again, and I have A-B-engines run with MultiPV4 (if engine supports that) ho-hum. especially SF- clones with internal MultiPV- mode profit much from best of their settings in suites and hardware- TC of that kind, so I like to have Lc0 compared that way now and then too, you see? It's just some kind of list- cosmetics

Full list is 173 runs big in meantime, if pasting in fora, I just copy the parts of interest and the error bars get lower with the bigger number of compared to each other runs, so what, regards
Peter.