Well,that's my way..you have at least 100engines,every week/month you have a new version/update/tuned chess engine..if you have to let them run all 1000 games,is for me not possible.
So after more then 20 years testing,you find a way that for you is the most commen and i'am agree we need more games!
But you have so many lists,just put them together and you have the averange Elo from this engine.When i do 10 games(for starting) you find quickly out how strong this engine is.Because i let them play so much i can against different engines!And then you see very fast whitch engine he like to play and to others he loose.
So if you just by luck choose a engine where he don't like the play style and you run first time 1000 games,you get a bad Elo..so this engine is under his value..you play against a engine he likes very much,you get a much higher Elo. It's like with chess players..to some people you like to play,to others you don't know how to handle his style.
So i take a big range in engines..and i also comes to 1000games(if i have the time) and you get a much nicer averange Elo from this engine.
There are top engines who beat top engines but have it more difficult against less strong engines and inverse.
Look to this example TTK plays agaist TogaII141SE6 and don't like it at all
and against my stronger in my list TogaII142JD he likes very much to play
So now if i want i can continue with this two versions and play 1000 games and at the end i gonne gave a averange Elo when i put the games together.
Then i don't talk yet to use all the different openings books..is again the same some engines like this one book and not the other one.
But this is just may way to see fast how strong a engine is and you see it quickly where he take place in the Elo list.
So,don't think i just only run 10 games against one engine,when i have time enough they get a second round,a third and so on..but at that time there is already a new engine to test
Blitz 5min Core i7 @3.89Ghz 2009
TTK.cirebonb1Y.st.4cpu_b 2800 - Stockfish_13_win32_ja 2800 5.0 - 5.0 +4/-4/=2 50.00%
TTK.cirebonb1Y.st.4cpu_b 2800 - Grapefruit 1.0 alpha 3 2800 5.0 - 5.0 +3/-3/=4 50.00%
TTK.cirebonb1Y.st.4cpu_b 2800 - MP-x86-Inert---Thinker 5.4D 2800 5.5 - 4.5 +4/-3/=3 55.00%
TTK.cirebonb1Y.st.4cpu_b 2800 - TogaII141SE6-4cpu 2800 3.0 - 7.0 +1/-5/=4 30.00%
TTK.cirebonb1Y.st.4cpu_b 2800 - Glaurung22_win32_ja 2800 6.0 - 4.0 +3/-1/=6 60.00%
TTK.cirebonb1Y.st.4cpu_b 2800 - Bright-0.4a 2800 5.5 - 4.5 +5/-4/=1 55.00%
TTK.cirebonb1Y.st.4cpu_b 2800 - TogaII142JD-4cpu 2800 7.0 - 3.0 +4/-0/=6 70.00%
TTK.cirebonb1Y.st.4cpu_b 2800 - MP-x86-Inert---Thinker 5.4C 2800 3.0 - 7.0 +1/-5/=4 30.00%
JP.