It doesn't seem so much like a human brain which can remember an entire game "PGN". The NN system only remembers "principles". I think move legality and selection is run by the MCTS and the NN is consulted for positional scoring. As far as the NN is concerned, the game scores themselves are consigned to the memory hole.Michael Sherwin wrote:But is that correct? There is evidence that they keep a learn file with wins, losses, draws and a percentage chance of winning. Are there enough neurons to remember millions of these stats or is there more to it? In a human brain data storage and neurons are synonymous but in a computer brain neurons and data storage is not synonymous and yet if one wants to use the human brain function as a parable then memory and neurons would be talked about as they are synonymous. So yes the games as a separate entity have been discarded but the 'memory' of the games is stored--in a learn file.
I can't believe that so many people don't get it!
Moderators: hgm, Rebel, chrisw
-
- Posts: 13447
- Joined: Wed Mar 08, 2006 9:02 pm
- Location: Dallas, Texas
- Full name: Matthew Hull
Re: I can't believe that so many people don't get it!
Matthew Hull
-
- Posts: 3196
- Joined: Fri May 26, 2006 3:00 am
- Location: WY, USA
- Full name: Michael Sherwin
Re: I can't believe that so many people don't get it!
Thanks Ed! And another thought is that they might have started the main match right after the 4 hours of learning so the w,l,d,p data may have still been in memory for speed and not stored on a hard drive. That would make sense so they could say that the w,l,d,p data was part of the NN hiding the separation.Rebel wrote:Indeed. But we are not dealing with academics. We are dealing with a commercial company with a bad reputation. They have their mouth full of ethics but their actions are criminal, like the copying of books, like buying youtube while knowing its massive illegal copyrighted material and only God knows what they do with the data we publicly trust to the internet and thus to them, my name might as well be colored red after this posthgm wrote:This is not what the paper describes.
Ranting aside, let's talk about what the paper doesn't reveal, the start positions of the 100 games. Those 50 start positions certainly can be learned as Mike described.
If you are on a sidewalk and the covid goes beep beep
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
-
- Posts: 27817
- Joined: Fri Mar 10, 2006 10:06 am
- Location: Amsterdam
- Full name: H G Muller
Re: I can't believe that so many people don't get it!
The 100 games all started from the normal start position. And all training games started also from that position. So yes, in a sense that is learning how to play good moves from the start position. But isn't that exactly what playing Chess is? Btw, from the way they describe it, I have no doubt that training on the standard opening position will make it do just as well in a match from any other start position. After all, the 100 match games for the largest part consisted out of positions AlphaZero could have never seen before, in these millions of training games. If it would not be able to do superb moves in positions it had never seen before, it would lose to Stockfish 100-0.
Even if you think they must be lying through their teeth because of their evil and greedy nature, anything you claim they would have been doing instead would still purely be a fabrication. You cannot possibly kow that. In that case the simplest explanation is that it is all a complete hoax.
But there also is this: in my judgement, what they describe is likely to work. What you think they have done instead: no way that could ever work...
Even if you think they must be lying through their teeth because of their evil and greedy nature, anything you claim they would have been doing instead would still purely be a fabrication. You cannot possibly kow that. In that case the simplest explanation is that it is all a complete hoax.
But there also is this: in my judgement, what they describe is likely to work. What you think they have done instead: no way that could ever work...
Last edited by hgm on Mon Dec 18, 2017 5:45 pm, edited 1 time in total.
-
- Posts: 88
- Joined: Fri Mar 10, 2006 11:20 am
- Location: Bangalore India
Re: I can't believe that so many people don't get it!
where is this evidence?There is evidence that they keep a learn file with wins, losses, draws and a percentage chance of winning.
-
- Posts: 3196
- Joined: Fri May 26, 2006 3:00 am
- Location: WY, USA
- Full name: Michael Sherwin
Re: I can't believe that so many people don't get it!
As I wrote above they may have started the main match after the 4 hours of training thus keeping the MCTS in memory. They did keep actual w,l,d,p data for every move. The NN's job was to pick the best move. That is what the evidence is strongly suggesting. Once again there is no way they stored w,l,d,p information from millions of games in the NN without losing 99% of that information. People might be thinking in terms of pure NN and not a hybrid system also requiring some amount of stored data. For the type of reinforcement learning they describe that stored data would be around 20 to 30 gigabytes.mhull wrote:It doesn't seem so much like a human brain which can remember an entire game "PGN". The NN system only remembers "principles". I think move legality and selection is run by the MCTS and the NN is consulted for positional scoring. As far as the NN is concerned, the game scores themselves are consigned to the memory hole.Michael Sherwin wrote:But is that correct? There is evidence that they keep a learn file with wins, losses, draws and a percentage chance of winning. Are there enough neurons to remember millions of these stats or is there more to it? In a human brain data storage and neurons are synonymous but in a computer brain neurons and data storage is not synonymous and yet if one wants to use the human brain function as a parable then memory and neurons would be talked about as they are synonymous. So yes the games as a separate entity have been discarded but the 'memory' of the games is stored--in a learn file.
If you are on a sidewalk and the covid goes beep beep
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
-
- Posts: 4185
- Joined: Tue Mar 14, 2006 11:34 am
- Location: Ethiopia
Re: I can't believe that so many people don't get it!
Wow Micheal you are now at a stage of not implying but vividly saying i invented this shit.
What you don't get (or continually ignore) is your learning stuff is position and player specific -- and that is the elephant in the room.
If you learn something against rybka, it doesn't work against stockfish and other opponents .... That form of learning doesn't even work against the same opponent if it randomizes its moves. Ask yourself why Romi is not at Rybka's level even though it beat it with learning. When you come up with learning that works against every opponent, you may then claim you invented this kind of stuff.
The form of learning you use has been there forever and infact i may have had it at one time. Tournament organizers often forbid it as they consider it as a cheap trick.
The most general form of learning that is not opponent/posiiton specific is TD-lambda, which adjusts its evaluation weights. This one is a real algorithm.
To call monte-carlo tree search learning hugely demonstrates your ignorance with what AlphaGo is doing. The fact that MCTS stores its statistics tree in memory is not learning, you could have done the same with alpha-beta. Also traning a NN is same as training a handcrafted evaluation. What exactly are you accusing them of ?
Daniel
What you don't get (or continually ignore) is your learning stuff is position and player specific -- and that is the elephant in the room.
If you learn something against rybka, it doesn't work against stockfish and other opponents .... That form of learning doesn't even work against the same opponent if it randomizes its moves. Ask yourself why Romi is not at Rybka's level even though it beat it with learning. When you come up with learning that works against every opponent, you may then claim you invented this kind of stuff.
The form of learning you use has been there forever and infact i may have had it at one time. Tournament organizers often forbid it as they consider it as a cheap trick.
The most general form of learning that is not opponent/posiiton specific is TD-lambda, which adjusts its evaluation weights. This one is a real algorithm.
To call monte-carlo tree search learning hugely demonstrates your ignorance with what AlphaGo is doing. The fact that MCTS stores its statistics tree in memory is not learning, you could have done the same with alpha-beta. Also traning a NN is same as training a handcrafted evaluation. What exactly are you accusing them of ?
Daniel
Last edited by Daniel Shawul on Mon Dec 18, 2017 5:53 pm, edited 3 times in total.
-
- Posts: 13447
- Joined: Wed Mar 08, 2006 9:02 pm
- Location: Dallas, Texas
- Full name: Matthew Hull
Re: I can't believe that so many people don't get it!
But NNs don't store input data or output data. They use inputs/outputs to learn but keep neither. They only thing they keep is the values within the NN matrix which is not input data or output data.Michael Sherwin wrote:As I wrote above they may have started the main match after the 4 hours of training thus keeping the MCTS in memory. They did keep actual w,l,d,p data for every move. The NN's job was to pick the best move. That is what the evidence is strongly suggesting. Once again there is no way they stored w,l,d,p information from millions of games in the NN without losing 99% of that information. People might be thinking in terms of pure NN and not a hybrid system also requiring some amount of stored data. For the type of reinforcement learning they describe that stored data would be around 20 to 30 gigabytes.mhull wrote:It doesn't seem so much like a human brain which can remember an entire game "PGN". The NN system only remembers "principles". I think move legality and selection is run by the MCTS and the NN is consulted for positional scoring. As far as the NN is concerned, the game scores themselves are consigned to the memory hole.Michael Sherwin wrote:But is that correct? There is evidence that they keep a learn file with wins, losses, draws and a percentage chance of winning. Are there enough neurons to remember millions of these stats or is there more to it? In a human brain data storage and neurons are synonymous but in a computer brain neurons and data storage is not synonymous and yet if one wants to use the human brain function as a parable then memory and neurons would be talked about as they are synonymous. So yes the games as a separate entity have been discarded but the 'memory' of the games is stored--in a learn file.
Matthew Hull
-
- Posts: 27817
- Joined: Fri Mar 10, 2006 10:06 am
- Location: Amsterdam
- Full name: H G Muller
Re: I can't believe that so many people don't get it!
Sorry, but it seems you are making this up completely. What evidence? All of what you say is in direct contradiction with the paper. Even if you think they must be lying about everything they claim, how would that be evidence for what they did instead? How could you know more than what the paper states? Do you have a spy inside DeepMind?Michael Sherwin wrote:As I wrote above they may have started the main match after the 4 hours of training thus keeping the MCTS in memory. They did keep actual w,l,d,p data for every move. The NN's job was to pick the best move. That is what the evidence is strongly suggesting. Once again there is no way they stored w,l,d,p information from millions of games in the NN without losing 99% of that information. People might be thinking in terms of pure NN and not a hybrid system also requiring some amount of stored data. For the type of reinforcement learning they describe that stored data would be around 20 to 30 gigabytes.
-
- Posts: 3196
- Joined: Fri May 26, 2006 3:00 am
- Location: WY, USA
- Full name: Michael Sherwin
Re: I can't believe that so many people don't get it!
My eyes are shot. This all I could find.vvarkey wrote:where is this evidence?There is evidence that they keep a learn file with wins, losses, draws and a percentage chance of winning.
"Monte Carlo search does not use a tradition eval as we know it, so mobility, king safety etc. are irrelevant.
It uses a struct to hold info likes wins, losses, draws, win %, etc.,
then simply references accumulated data for the current position to select the move with the highest probability of winning."
The current position info has to be stored in memory somewhere!
If you are on a sidewalk and the covid goes beep beep
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
Just step aside or you might have a bit of heat
Covid covid runs through the town all day
Can the people ever change their ways
Sherwin the covid's after you
Sherwin if it catches you you're through
-
- Posts: 4185
- Joined: Tue Mar 14, 2006 11:34 am
- Location: Ethiopia
Re: I can't believe that so many people don't get it!
Wow you are so ignorant of what MCTS is doing it is not funny ...Michael Sherwin wrote:My eyes are shot. This all I could find.vvarkey wrote:where is this evidence?There is evidence that they keep a learn file with wins, losses, draws and a percentage chance of winning.
"Monte Carlo search does not use a tradition eval as we know it, so mobility, king safety etc. are irrelevant.
It uses a struct to hold info likes wins, losses, draws, win %, etc.,
then simply references accumulated data for the current position to select the move with the highest probability of winning."
The current position info has to be stored in memory somewhere!
Please stop trolling.