Go has fallen to computer domination?

diep · Post by **diep** » Sun Feb 07, 2016 6:33 pm

Xann wrote:
diep wrote:What's this Go thing?

...
Hi Vincent,

Welcome back!

Fabien.

In the meantime it's clear to me what alphago is.

Thanks to a posting of some go players who figured it out...

It's just new incarnation of what ran on the supercomputer in Amsterdam it seems in 2007 and 2008 and which won in 2010 the ICGA 19x19 go title.

With some improvements and a bunch more cores it'll be very strong.

What i do not understand is this neural net story and gpu. You have to search on the gpu in order to do the neural net on the gpu.

Communication between gpu and cpu means you can do roughly 3000 communications a second between gpu and cpu. Maybe newer tesla's that's 10k. That means you can do only 10k evaluations a second. That's rather little nodes per second, whereas running inside the gpu, according to my calculation even with 100k clocks per node you get dozens of millions of nps.

So maybe they just reserve a huge supercomputer from NCSA and run on that and toy at googlehome a little with GPU's.

Interesting to know is how Google legally managed to obtain this as this software was setup by government subsidies from The Netherlands and INRIA (France).

The only thing i use from INRIA is the public prime number software

As far as i know all this isn't public software. How did Deepmind legally get the rights for that? They sure can afford it!

Or did one of the teammembers just go work at deepmind and take an USB stick with him from INRIA with that program and then say: "hello there i was!"

Still then there is the mystery of the GPU's. Do they also search inside the gpu's?
Or is this just a story to keep Nvidia happy and meanwhile search occurs only at the cpu's?

Alternative is that they get very few nodes a second and just throw a massive amount of cpu's at it. Waste of those GPU's of course as those are 99.99% of the time idle then, but they just take the "speedup" for some of the cores then?

That's what Don Dailey did do with cilkchess of course. Slow it down factor 40 first in order to then scale well at a lot of cpu's

diep · Post by **diep** » Sun Feb 07, 2016 6:41 pm

bob wrote:
diep wrote:Please note Rein that such sportplayers are so rich they have a manager and therefore that is a company.

For example if you buy a few items as a private person in EU then you have a warranty. If on other hand you buy the same items as a company - you don't have that warranty. Deal is deal and that's it. You can't return the items in short.

I have had this at several occasions already that i couldn't return items to a computershop for example (for example bunch of 18CM fans that didn't blow the amount of CFM i had bought them for). The company (in this town by the way) simply refused to take them back as my company bought the items.

That's how it works also in EU if you are at company level. All those sporters a bit stronger with enough cash income (say a few million a year) have their own company of course.

So basically there is nothing that protects you from signing any deal nor sales.

Until recently under Dutch law also bribing fees paid in 3d world nations you could tax reduce.

That's just business. Ever seen anyone sign a huge deal in a 3d world nation WITHOUT bribing?

Business with dudes from Russia is far worse there in fact than doing business in Nigeria.

Even salaries paid to employees, like 10% is white money where they paid that 13% tax over and 90% of the salary goes black.
I must say another reason to NOT live in the EU.

In USA it is worse. We have a legal 1 to 2 year customer warranty. That's for EU citizens. So you get warranty on your products up to a year after you bought it, if it has problems.

In USA you don't have that protection.

The reason to not live in the nations here is that what Obama tried to do to USA already is reality in Europe with as a result rightwing and leftwing gaining momentum. The politicians in the center get very few votes now - that is a very sad thing. yet that's not for this thread to continue discussing.

The expansion of the EU start this century and introduction of euro basically "delayed" the problems that western european nations had as of course it boosted economy having that euro and a lot.

It's unclear what happens now.

Rein Halbersma · Post by **Rein Halbersma** » Sun Feb 07, 2016 7:51 pm

diep wrote:
Rein Halbersma wrote: So Kasparov was stupid enough to have a clause in his contract determining the outcome (do you really think IBM would commit securities fraud like that??!!), but he didn'tthink of a rematch clause? Get serious
Contract is a contract you just need to find someone stupid enough to sign it. American law is total different from Dutch law there Rein.

I can believe that Kasparov was naive/arrogant enough to think that a rematch would be guaranteed, no matter the outcome.

But it is inconceivable that Kasparov would agree to losing the match, and it is also inconceivable that IBM would even suggest it. Kasparov had at that point a pretty much unbeaten record in any serious chess match event, why would he ever throw that away?

For IBM there would be even more at stake. For a stock-traded company to contract the outcome of a public event like Deep Blue-Kasparov, would constitute insider trading, and that carries stiff (jail time) penalties.

For the same reason, I don't expect Lee Sedol - AlphaGo to be a foregone conclusion.

bob · Post by **bob** » Sun Feb 07, 2016 8:40 pm

diep wrote:
bob wrote:
diep wrote:Please note Rein that such sportplayers are so rich they have a manager and therefore that is a company.

For example if you buy a few items as a private person in EU then you have a warranty. If on other hand you buy the same items as a company - you don't have that warranty. Deal is deal and that's it. You can't return the items in short.

I have had this at several occasions already that i couldn't return items to a computershop for example (for example bunch of 18CM fans that didn't blow the amount of CFM i had bought them for). The company (in this town by the way) simply refused to take them back as my company bought the items.

That's how it works also in EU if you are at company level. All those sporters a bit stronger with enough cash income (say a few million a year) have their own company of course.

So basically there is nothing that protects you from signing any deal nor sales.

Until recently under Dutch law also bribing fees paid in 3d world nations you could tax reduce.

That's just business. Ever seen anyone sign a huge deal in a 3d world nation WITHOUT bribing?

Business with dudes from Russia is far worse there in fact than doing business in Nigeria.

Even salaries paid to employees, like 10% is white money where they paid that 13% tax over and 90% of the salary goes black.
I must say another reason to NOT live in the EU.
In USA it is worse. We have a legal 1 to 2 year customer warranty. That's for EU citizens. So you get warranty on your products up to a year after you bought it, if it has problems.

In USA you don't have that protection.

The reason to not live in the nations here is that what Obama tried to do to USA already is reality in Europe with as a result rightwing and leftwing gaining momentum. The politicians in the center get very few votes now - that is a very sad thing. yet that's not for this thread to continue discussing.

The expansion of the EU start this century and introduction of euro basically "delayed" the problems that western european nations had as of course it boosted economy having that euro and a lot.

It's unclear what happens now.

We have warranties in the US. And they are enforced carefully. There is no "nationally mandated warranty period" but in order to sell goods, one has to offer a warranty to attract business. IE computers are typically 2-3 years. Disk drives are 5 years. Automobiles vary but at a minimum we get 5 years or 60K miles and quite a few go to 100K miles. Or even lifetime. I just bought a battery-driven electric drill/screwdriver, lifetime warranty on everything including batteries. Ditto for my garage door openers.

And the warranties are not disabled if you are a business buying things. They apply equally to all. We just had Seagate replace about 80 1tb disk drives that failed just inside the 5 year warranty. No questions asked. And of course "we" are a company here at UAB. A huge one.

duncan · Post by **duncan** » Sun Feb 07, 2016 10:51 pm

diep wrote:
duncan wrote:
diep wrote: 2200 at most yeah. Realize also they had a very bad book. Just some pgn games. No nothing manual.

could you give some samples of weak 2200 play by deep blue ?
There is an excellent analysis of all the weak moves in match2 in the june edition 1997 from ICGA by GM Yasser Seirawan.

just lay those 6 games with all those bad moves next to games against strong GMs that kasparov played in the same months. take also 6. How many bad moves did kasparov play THERE?

Against those GMs kasparov DOES play his best openingslines. Against deep blue with its very bad book he didn't want to waste on such exhibition match good lines of his so he just do some random openingsmoves.

So kasparov's largest advantage he didn't use - opening.

do you know who did the review here of game 2 1997 ? seemed to be just a bit critical of one move from deep blue

Daniel Shawul · Post by **Daniel Shawul** » Sun Feb 07, 2016 11:10 pm

Hello Vincent,
It is nice to see you post here, though I may not agree with some of your points

You can read the paper here http://airesearch.com/wp-content/upload ... ing-go.pdf

David & Aja have been doing computer-go for some time so they know what they are doing.

They use the GPUs first to train the neural nets (supervised + reinforcement learning), two of them one for selcetion (policy network) and the other for evaluation (value network). The later is sort of new because in typical MCTS engine we do randon-playouts for evaluation of a position. The thing that is astounding for me is that using only evaluation ( the value network ), they are able to beat a state-of-the-art MCTS program, pachi, 85% of the time, without any search!! This is because the network is 13-layers deep and using convolutional neural nets, which has revolutionized visual/sound recognition fields. That is the main achievement that Google wants to showcase with a toy computer game

Then, they mix neural nets evaluation with evaluation using random-playouts 50-50, which they say have found to be best. But it seems they could have used alpha-beta + neural nets eval as well since the value networks repsonse is deterministic.

During search they use the gpus to evaluate the neural (not for doing the MCTS-search itself which is still done on the CPUs). Hey, I did MCTS on the GPUs some years ago, so they can add me as a team member if they wish

They have found a way to do the on-CPU search and GPU evaluation asynchronously to counter CPU-GPU latency. You can read the paper for more details.

Daniel

megamau · Post by **megamau** » Tue Feb 16, 2016 7:51 am

Dear Vincent,

I have been a lurker on this forum for at least 15 years (Ruffian anyone ?), but I never felt the need to write.
However the amount of wrong information that you convey is such, that I needed to reply to this topic.

In the meantime it's clear to me what alphago is. Thanks to a posting of some go players who figured it out. It's just new incarnation of what ran on the supercomputer in Amsterdam it seems in 2007 and 2008 and which won in 2010 the ICGA 19x19 go title. With some improvements and a bunch more cores it'll be very strong.

This is equivalent to saying that Stockfish is just a new incarnation of Glaurung, with "some improvement" that give it 700 elo point advantage. Aja Huang has experience in computer Go and was working with Remi Coloumn on the program Erica, which won the 2010 Computer Go Olympiad, but this in not related to AlphaGo.

What i do not understand is this neural net story and gpu. You have to search on the gpu in order to do the neural net on the gpu.
Communication between gpu and cpu means you can do roughly 3000 communications a second between gpu and cpu. Maybe newer tesla's that's 10k. That means you can do only 10k evaluations a second. That's rather little nodes per second, whereas running inside the gpu, according to my calculation even with 100k clocks per node you get dozens of millions of nps.

This thing that you don't understand is the central point of why AlphaGo is revolutionary with respect to previous programs. The evaluation function is very slow, but ALONE (i.e. with NO montecarlo search) it overperforms the current strongest computer program (CrazyStone). Refer to the table 6 of the article in nature.

So maybe they just reserve a huge supercomputer from NCSA and run on that and toy at googlehome a little with GPU's.
Interesting to know is how Google legally managed to obtain this as this software was setup by government subsidies from The Netherlands and INRIA (France). The only thing i use from INRIA is the public prime number software Smile
As far as i know all this isn't public software. How did Deepmind legally get the rights for that? They sure can afford it!
Or did one of the teammembers just go work at deepmind and take an USB stick with him from INRIA with that program and then say: "hello there i was!"

The computer that they use is very clearly specified in the article. It is a 1200 CPU machine: big hardware, but not particularly costly for Google. I would guess that Deep Blue cost was similar if not more.

Still then there is the mystery of the GPU's. Do they also search inside the gpu's? Or is this just a story to keep Nvidia happy and meanwhile search occurs only at the cpu's?

Once again, this would be very clear if you had read the article: "This architecture consists of a single master machine that executes the main search, many remote worker CPUs that execute asynchronous rollouts, and many remote worker GPUs that execute asynchronous policy and value network evaluations. The entire search tree is stored on the master, which only executes the in-tree phase of each simulation."

My suggestion is to completely ignore the match against the 2 dan guy. That's elo 1900 or something. A guy who makes lots of small mistakes and no one ever confronted that guy with those mistakes (they don't have very strong programs right now). Even if that would be closer to FM level than 1900 level, his only job was to take care he lost all games and quickly hand over the job to a 9-dan guy as it's google man.
BY THE WAY - i'm very amazed they called that 2 dan player "european champion". Some years ago i played here a 4-dan guy (he of course toasted me in a manner world hadn't seen before you could toast someone that bad). And he spoke Dutch... ...yet odds are he sits in Japan already for a few years now...

This borders on the outrageous. The "2 dan guy" is indeed the european champion, and currently the 475th player in the world.
In chess this is equivalent to a GM with more than 2500 elo, for sure stronger than a 1900 FM.
Please note that *professional* dans are different than *amateur* dans. I'm sure a 4dan amateur would toast you, but he would still lose with an advantage of 2 stones to a 7dan amateur, which is still weaker than a professional.
As an idea, a 1dan professional is about equivalent to an 8dan amateur.

That's why the world top is to scary always to play a mainline. Go is a different game. like from those 361 moves you can make first move, already 350 you can throw away. You can hard prune 300. That's not possible in chess.

So you think there are 11 reasonable opening moves in Go. How many in Chess ?
perft 1 = 20 and not all of them are reasonable.
And after the first move Go explodes so much that there can not be opening theory, only "josekis" (i.e. local sequences).

John, Seriously, if go would've been a world wide game like chess is and chess just would be popular in japan and korea - then in 1997 kasparov would have played go and lost from IBM deep blue go and we would be busy with computerchess right now. It's about how much scientific effort you put in a game.

You manage to pack so much wrong information in short sentences.

Go is popular in all the east (including China, the most populous country). The active FIDE players are approximately 100'000 worldwide. The active players in the European Go Database alone are 6100 (and Europe is tiny in Go).The worldwide popularity of the two games is comparable, and I seriously doubt there are more than 2 chess players for each go player.

There has been a lot of scientific effort towards Go, and a lot of computer Go competitions.
Maybe not on the level of Computer Chess, but surely within a factor of 3.

The reason why IBM could not chose Go (or Jeopardy) in 1997 is that the strongest program (Handtalk) was still losing with 11 stones to Inseis (kids studying to become professional). Consider that usually handicap is capped at 9 stones !
The best Go programs were weak amateurs in 1997, while in chess even simple programs were playing at national master level.

Note i posted some years later some of the results i had seen. Namely that nullmove worked far better in go than it did do in chess.
The branching factor in go, without what later would be called LMR and which i already was doing back in the 90s (1999 version of diep for example), and which GCP also had invented long before Tord Romstad was posting about it, so without reductions which would be so effective in nowadays go yet only nullmove, i reached already quite remarkable searchdepths at a pentium-5 100Mhz, whereas there was 361 possibilities to start the game.
As an evaluation function i used something similar like Mark Boon at that time described.
So i did do a 3d order evaluation calculating the influence function of each group and did do incremental recognition on how large each group was. Of course without evlauation function at all searching deep would be easy.

And how strong did this program play ?
Do you have any game ?

Yet the branching factor is what i was amazed about - that it was this small - whereas i had expected a huge branching factor.
You simply have to get that 30+ plies (ladders and similar not counted) or you won't be able to play high level go. before you add LMR type approaches.
Instead of LMR for Go i would rather use a different sort of selectivity: namely you have a static function determining how good each move is and how many plies "reduction" each move gets. In Go this is a better concept than in chess.
This concept works very well for hashtable and hashtable isn't "dicking" you like it does with LMR.
A pure LMR in itself would be total idiocy in go as it's far easier to recognize which moves are interesting to investigate deeper. Quite some of the moves you easily can give a 20 ply reduction for example, whereas in chess that's impossible to do.
So with quite some cpu's it would be very easy to achieve 30+ plies in go.
At which point indeed evaluation function is far more important. Yet you have to go through that horizon barrier of 30 plies.
Once you get through this 30+ plies you beat the crap out of all these Monte Carlo type randomized approaches of course.

So you reached 30 plies depth at tournament time control ?
Any logs ?

Do not forget the huge difference between chess and go - after the first few moves in go - the rest of the moves you can play pretty perfect if you are at this high dan level. In chess - forget perfection.

This has to be the most outrageous claim of all. You can play perfectly after the first few moves if you are a Dan player ?
Have you ever followed commentaries of games ? How many times the Pro make mistakes in semeai or end game ?
And how many do you consider few ?

There is even a book, which has a matematical analysis of how you can play perfectly the last 8/10 moves of a game (with a monster computational/ theorical effort) which showed that even professional are not perfect in these relatively super-simple endings.

duncan · Post by **duncan** » Tue Mar 08, 2016 11:07 pm

http://english.yonhapnews.co.kr/culture ... l?843bed68

A day before taking on the advanced human-like algorithm in the ancient board game Go, also known as "baduk" here, Lee said he now feels some pressure and the score he predicted last month -- 5-0 or 4-1 in his favor after five matches -- may not happen.

"I now think AlphaGo can imitate human intuition a little bit," Lee said at Tuesday's press conference in Seoul. "I should be a little nervous about the match."
..

The 33-year-old South Korean, who went pro at the age of 12, said that the key to beating the machine will be reducing "human errors." Despite feeling tension, Lee said that doesn't mean that he will allow a loss.

"I'm still confident I can win," he said. "First of all, I have never thought about getting beaten in the first match."

http://www.theverge.com/2016/3/8/111784 ... -lee-sedol

But what’s going to happen? Really, the one thing I know is that no one knows for sure, whether they’re hardcore Go players or deep in the world of artificial intelligence. "AlphaGo learnt a lot from studying professional play, but it’s very difficult to get better than professional," says Manning. "It’s difficult to get better than your teacher just by listening to what your teacher tells you. Personally I suspect that AlphaGo will not beat Lee Se-dol. I expect it’ll be 4-1 or 3-2 in Lee Se-dol’s favour. But I’ve been wrong before!"

"From one data point it’s hard for me to tell," says IBM’s Campbell. "Lee Se-dol is clearly a couple of classes better at least than the European champion. If I have to put my neck out? I’d say that the computer will win but it will be close."
lee-sedol-se-dol-go-Sam Byford-01

"On one hand, I believe that Lee Se-dol has an advantage of experience," says the University of Alberta’s Jonathan Schaeffer, a computer scientist who wrote Chinook, the first software program to solve checkers. "Lee knows a lot about Go. AlphaGo has only been learning for less than a year. It is possible that there are gaps in AlphaGo that are not yet apparent — they have not had enough time to learn well how to play all the scenarios that might arise in a game. On the other hand, never underestimate technology."

Uri Blass · Post by **Uri Blass** » Wed Mar 09, 2016 4:10 am

duncan wrote:

http://english.yonhapnews.co.kr/culture ... l?843bed68

A day before taking on the advanced human-like algorithm in the ancient board game Go, also known as "baduk" here, Lee said he now feels some pressure and the score he predicted last month -- 5-0 or 4-1 in his favor after five matches -- may not happen.

"I now think AlphaGo can imitate human intuition a little bit," Lee said at Tuesday's press conference in Seoul. "I should be a little nervous about the match."
..

The 33-year-old South Korean, who went pro at the age of 12, said that the key to beating the machine will be reducing "human errors." Despite feeling tension, Lee said that doesn't mean that he will allow a loss.

"I'm still confident I can win," he said. "First of all, I have never thought about getting beaten in the first match."

http://www.theverge.com/2016/3/8/111784 ... -lee-sedol

But what’s going to happen? Really, the one thing I know is that no one knows for sure, whether they’re hardcore Go players or deep in the world of artificial intelligence. "AlphaGo learnt a lot from studying professional play, but it’s very difficult to get better than professional," says Manning. "It’s difficult to get better than your teacher just by listening to what your teacher tells you. Personally I suspect that AlphaGo will not beat Lee Se-dol. I expect it’ll be 4-1 or 3-2 in Lee Se-dol’s favour. But I’ve been wrong before!"

"From one data point it’s hard for me to tell," says IBM’s Campbell. "Lee Se-dol is clearly a couple of classes better at least than the European champion. If I have to put my neck out? I’d say that the computer will win but it will be close."
lee-sedol-se-dol-go-Sam Byford-01

"On one hand, I believe that Lee Se-dol has an advantage of experience," says the University of Alberta’s Jonathan Schaeffer, a computer scientist who wrote Chinook, the first software program to solve checkers. "Lee knows a lot about Go. AlphaGo has only been learning for less than a year. It is possible that there are gaps in AlphaGo that are not yet apparent — they have not had enough time to learn well how to play all the scenarios that might arise in a game. On the other hand, never underestimate technology."

The following is simply nonsense.

""It’s difficult to get better than your teacher just by listening to what your teacher tells you. "

The reason that Alphago is strong is not only the software but also the hardware and I do not think that learning from human games is learning what your teacher tell you.

I will not be surprised if AlphaGo wins every game because it is obvious that in every complex game a computer is going to beat every human if the programmers are good enough.

Dirt · Post by **Dirt** » Wed Mar 09, 2016 4:56 am

Uri Blass wrote:I will not be surprised if AlphaGo wins every game because it is obvious that in every complex game a computer is going to beat every human if the programmers are good enough.

Of course, but the question is whether the programmers are good enough. I'd bet on the human, this time.

Unfortunately, since the broadcast is Flash based I can't watch it.

Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?

Re: Go has fallen to computer domination?