LC0 graph has gone impressive

Guenther · Post by **Guenther** » Fri Nov 30, 2018 10:58 pm

chrisw wrote: ↑Fri Nov 30, 2018 10:11 pm
sorry for being stupid, but these matches are what exactly? the current version versus its predecessor? test games from lc0.org?

so, the test procedure is to check if x+1 wins against x? if it doesn’t, what happens, it goes forward anyway, or it gets junked, and x stays, until there’s an x+1 that beats it?

Those are the matches which are used for the selfplay rating and also for propagating new nets
(they only fail, though, if much weaker than x-1; I don't know the exact number now, maybe it was 100...)

I started those stats only to show that a huge 'French invasion' had crept in a while ago,
and all other openings were played much less than in previous nets; therefore the selfplay rating had become inflated
in comparison to the real rating. I was also surprised to discover such a high number of dupes (measured over 60 plies).

If I understand it correctly, this is a consequence of policy sharpening, which is now being reduced again.
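A minimal sketch of how a dupe count like the one mentioned above could be made, assuming each game is already available as a list of moves. The function name and the input format are hypothetical, not the script actually used for these stats.

```python
from collections import Counter

def count_duplicate_openings(games, plies=60):
    """Count games whose first `plies` half-moves repeat an earlier game.

    `games` is assumed to be an iterable of move lists (one string per ply).
    Games shorter than `plies` are compared on their full move list.
    """
    prefixes = Counter(tuple(moves[:plies]) for moves in games)
    # Every occurrence of a prefix beyond the first one is a duplicate.
    return sum(n - 1 for n in prefixes.values())

if __name__ == "__main__":
    sample = [
        ["e4", "e6", "d4"],
        ["e4", "e6", "d4"],
        ["e4", "e5", "Nf3"],
    ]
    print(count_duplicate_openings(sample, plies=3))
```

Comparing only a fixed prefix keeps the check cheap; two games that diverge after ply 60 still count as dupes under this definition.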

chrisw · Post by **chrisw** » Fri Nov 30, 2018 11:28 pm

Guenther wrote: ↑Fri Nov 30, 2018 10:58 pm
chrisw wrote: ↑Fri Nov 30, 2018 10:11 pm
sorry for being stupid, but these matches are what exactly? the current version versus its predecessor? test games from lc0.org?

so, the test procedure is to check if x+1 wins against x? if it doesn’t, what happens, it goes forward anyway, or it gets junked, and x stays, until there’s an x+1 that beats it?
Those are the matches which are used for the selfplay rating and also for propagating new nets
(they only fail, though, if much weaker than x-1; I don't know the exact number now, maybe it was 100...)

I started those stats only to show that a huge 'French invasion' had crept in a while ago,
and all other openings were played much less than in previous nets; therefore the selfplay rating had become inflated
in comparison to the real rating. I was also surprised to discover such a high number of dupes (measured over 60 plies).

If I understand it correctly, this is a consequence of policy sharpening, which is now being reduced again.

Ok, thanks. I thought that was what they were doing, but didn’t know the figure at which they junked a new version.
I think this test and development process results in running away with good-looking “elo” climbs that are not actually generalising, and in leaving behind, unnoticed, “actual real good versions”. The stars are not identified, and the effort goes into chasing things that don’t go anywhere.
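A rough sketch of the kind of gate described above, assuming the standard logistic Elo model and a rejection threshold of 100 Elo. That figure is only the half-remembered number from the quoted post, and the function names are hypothetical, not the project's actual code.

```python
import math

def elo_diff(wins, draws, losses):
    """Elo difference implied by a match score (logistic model).

    Assumes at least one game was played.
    """
    games = wins + draws + losses
    score = (wins + 0.5 * draws) / games
    # Clamp so a 100% or 0% score does not produce an infinite difference.
    score = min(max(score, 1e-6), 1 - 1e-6)
    return 400 * math.log10(score / (1 - score))

def passes_gate(wins, draws, losses, max_drop=100.0):
    """Accept the new net unless it is more than `max_drop` Elo weaker.

    `max_drop=100.0` is an assumption taken from the thread, not a confirmed value.
    """
    return elo_diff(wins, draws, losses) > -max_drop
```

With a gate this loose, a net that scores 40% against its predecessor (about 70 Elo weaker) would still pass, which is why self-play Elo can keep climbing on paper without the nets getting stronger against outside opposition.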

carldaman · Post by **carldaman** » Sat Dec 01, 2018 1:01 am

To avoid needlessly chasing its own proverbial tail, I still think they should also test LC0 against known entities with a fixed, established rating - at least periodically. That should automatically be part of the testing protocol.

chrisw · Post by **chrisw** » Sat Dec 01, 2018 1:24 am

carldaman wrote: ↑Sat Dec 01, 2018 1:01 am To avoid needlessly chasing its own proverbial tail, I still think they should also test LC0 against known entities with a fixed, established rating - at least periodically. That should automatically be part of the testing protocol.

which lc0 would you like to test? this is the production run since yesterday .....

Number	Run	Network	Elo	Games	Blocks	Filters	Time
31703	2	57295884	8416.67	12400	20	256	2018-12-01 01:54:28.65156 +0200 EET
31702	2	ab48b39e	8414.49	28750	20	256	2018-12-01 01:02:34.643888 +0200 EET
31701	2	9e07e3b8	8402.20	13088	20	256	2018-12-01 00:38:39.055414 +0200 EET
31700	2	e7c4f087	8418.66	12808	20	256	2018-12-01 00:14:38.327586 +0200 EET
31699	2	e42a5e1b	8420.10	11878	20	256	2018-11-30 23:50:52.869934 +0200 EET
31698	2	1b6ad0c6	8413.59	10154	20	256	2018-11-30 23:30:06.756969 +0200 EET
31697	2	71f289ea	8413.59	73889	20	256	2018-11-30 21:21:08.675226 +0200 EET
31696	2	1419ef38	8426.71	173565	20	256	2018-11-30 16:17:07.891541 +0200 EET
31695	2	c548ee97	8422.32	31386	20	256	2018-11-30 15:16:58.104439 +0200 EET
31694	2	fc3ee33a	8417.91	31957	20	256	2018-11-30 14:18:54.933111 +0200 EET
31693	2	fcbb45ff	8398.88	30031	20	256	2018-11-30 13:21:49.295867 +0200 EET
31692	2	e098d156	8392.31	32003	20	256	2018-11-30 12:20:21.040578 +0200 EET
31691	2	60f5efdf	8388.64	31519	20	256	2018-11-30 11:21:02.184178 +0200 EET
31690	2	6ef0c80e	8394.48	33645	20	256	2018-11-30 10:20:54.401215 +0200 EET
31689	2	91feb60a	8381.33	33258	20	256	2018-11-30 09:22:42.461197 +0200 EET
31688	2	f70a40de	8384.26	30468	20	256	2018-11-30 08:28:33.058866 +0200 EET
31687	2	d7eaa579	8362.38	31883	20	256	2018-11-30 07:32:56.78735 +0200 EET
31686	2	747d376c	8357.25	31506	20	256	2018-11-30 06:38:22.334805 +0200 EET
31685	2	974f42b8	8353.62	32618	20	256	2018-11-30 05:40:51.846122 +0200 EET
31684	2	7a880a4c	8352.89	32996	20	256	2018-11-30 04:41:56.865696 +0200 EET
31683	2	530362c2	8358.71	31874	20	256	2018-11-30 03:44:42.809853 +0200 EET
31682	2	340834bc	8351.45	33019	20	256	2018-11-30 02:44:07.140749 +0200 EET
31681	2	cc2d1da2	8347.80	16761	20	256	2018-11-30 02:12:21.557707 +0200 EET
31680	2	f690f70e	8325.19	11919	20	256	2018-11-30 01:49:00.702385 +0200 EET
31679	2	bea370f3	8319.43	12325	20	256	2018-11-30 01:25:17.398009 +0200 EET
31678	2	d38bdfde	8327.39	11981	20	256	2018-11-30 01:01:57.01215 +0200 EET
31677	2	2b626476	8333.16	10945	20	256	2018-11-30 00:40:43.728277 +0200 EET

mwyoung · Post by **mwyoung** » Sat Dec 01, 2018 1:55 am

I think LC0 is progressing rapidly. It is now close to overtaking any version of Stockfish. And this is with LC0 running on a slow graphics card vs. an i7-6700. The progress in recent weeks has been impressive in testing with real games.

carldaman · Post by **carldaman** » Sat Dec 01, 2018 5:57 am

Then why is the Leela site stating this?

"As our previous 'run to completion' has not made much progress recently, main page and default contributions have been reset to our new actually testing run while we wait for lc0 0.20.0 to be prepared with a new network architecture for our next main run."

https://lczero.org/

Nay Lin Tun · Post by **Nay Lin Tun** » Sat Dec 01, 2018 6:11 am

carldaman wrote: ↑Sat Dec 01, 2018 5:57 am Then why is the Leela site stating this?

"As our previous 'run to completion' has not made much progress recently, main page and default contributions have been reset to our new actually testing run while we wait for lc0 0.20.0 to be prepared with a new network architecture for our next main run."

https://lczero.org/

They are testing new ideas in the 30xx series while waiting for the version 0.20 engine with a 40-block net. After full training, 30xx may be on par with 11xx or minimally better than 11xx, but it is unlikely to overtake SF 10 in tcec/cccc at this stage.

As 40-block nets in Go have undoubtedly been proven better than A/B pruning, we are expecting similar results in chess.
If that becomes reality, A/B engines will be history (like Deep Blue technology).

Viral jokes about "End of an Era in 6 months" have been popular in cccc/tcec since Sept 2018, so March 2019 should be the deadline for whether those rumors come true or not.

Uri Blass · Post by **Uri Blass** » Sat Dec 01, 2018 6:36 am

mwyoung wrote: ↑Sat Dec 01, 2018 1:55 am I think LC0 is progressing rapidly. It is now close to overtaking any version of Stockfish. And this is with LC0 running on a slow graphics card vs. an i7-6700. The progress in recent weeks has been impressive in testing with real games.

When you say slow graphics card, what do you mean?

What is the price of the graphics card and what is the price of the i7-6700?

For the i7-6700 I see 303-312 dollars at the following link:
https://ark.intel.com/products/88196/In ... -4-00-GHz-

Laskos · Post by **Laskos** » Sat Dec 01, 2018 7:04 am

mwyoung wrote: ↑Sat Dec 01, 2018 1:55 am I think LC0 is progressing rapidly. It is now close to overtaking any version of Stockfish. And this is with LC0 running on a slow graphics card vs. an i7-6700. The progress in recent weeks has been impressive in testing with real games.

You mean the recent test30 nets? Even on my very powerful RTX 2070 GPU they are nowhere close to SF_10 on 4 threads. My CPU is an overclocked i7-4790, probably quite close to your i7-6700. Only the best test10 nets can be stronger than SF_10 on my RTX 2070. On a "slow graphics card", even if you mean by that a GTX 1060 (which is not that slow), at best only some test10 nets can be level with SF_8 on 4 threads, and test30 is at about SF_7 or SF_6 level. And there has been almost no progress in real Elo points in test30 over the last 400 nets or 3000+ points of self-Elo "improvement": at best some 30-40 Elo points.

Dann Corbit · Post by **Dann Corbit** » Sat Dec 01, 2018 7:11 am

Uri Blass wrote: ↑Sat Dec 01, 2018 6:36 am
mwyoung wrote: ↑Sat Dec 01, 2018 1:55 am I think LC0 is progressing rapidly. It is now close to overtaking any version of Stockfish. And this is with LC0 running on a slow graphics card vs. an i7-6700. The progress in recent weeks has been impressive in testing with real games.
When you say slow graphics card, what do you mean?

What is the price of the graphics card and what is the price of the i7-6700?

For the i7-6700 I see 303-312 dollars at the following link:
https://ark.intel.com/products/88196/In ... -4-00-GHz-

That's about the price of an AMD Ryzen Threadripper 1950X @3.8GHz ($313), which can do 48,391,000 NPS in the bench run by Ipman chess.
These GPUs, in turn:
Radeon RX Vega
GeForce GTX 1070
are at about that same price point.
