Curt Welch

Sunday, June 8, 2014

Roman Dodecahedron I suspect is a Ball Gauge

Roman Dodecahedron
Is it a Spherical Ball Gauge?

I just learned yesterday (June 7, 2014) about the interesting historical artifacts known as Roman dodecahedra and started to research them. It seems no one can agree on what they were used for. Here's my guess.

It's a ball gauge!

That is, it's a gauge for checking the spherical curvature of balls. It was most likely used when carving round balls from stone, such as the Roman stone ballista balls. I did not see this suggestion among the other proposed answers.

http://en.wikipedia.org/wiki/Roman_dodecahedron

The holes of different sizes, combined with knobs of the same size, allow each face of the 12-sided device to match a different sized ball. It would allow the stone carver to check the ball to find the high spots that still need to be carved down further. Anyone who has ever tried to carve a sphere without the help of a tool like a lathe knows how hard it is to create an accurately round ball. A tool like this would allow the stone carver to start on one spot of the rock and establish the curvature of that spot, without first having to rough out the entire sphere. Once the curvature was started, they could continue working around the rock to form the entire ball.
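As a rough illustration of why a different hole size gives a different ball size, here is a minimal sketch under a simplified model. The model is an assumption for illustration, not something measured from a real artifact: each knob is treated as a small sphere of radius `knob_radius` centered on a corner of the face, the corners sit at distance `corner_dist` from the face center, and the ball is taken to touch both the knobs and the rim of a hole of radius `hole_radius` lying in the face plane. If the ball's center sits a distance d behind the face, tangency to a knob gives corner_dist^2 + d^2 = (R + knob_radius)^2 and contact with the rim gives hole_radius^2 + d^2 = R^2. Subtracting the two yields the ball radius R.

```python
import math


def ball_radius(corner_dist: float, hole_radius: float, knob_radius: float) -> float:
    """Radius of the ball that touches both the corner knobs and the hole rim.

    Hypothetical simplified model (not taken from any measured artifact):
      corner_dist**2 + d**2 = (R + knob_radius)**2   # ball tangent to a knob
      hole_radius**2 + d**2 = R**2                   # ball touches the hole rim
    Subtracting the second from the first and solving for R gives the formula below.
    """
    if knob_radius <= 0:
        raise ValueError("knob_radius must be positive")
    radius = (corner_dist**2 - hole_radius**2 - knob_radius**2) / (2 * knob_radius)
    # The ball must be at least as wide as the hole it rests against.
    if radius < hole_radius:
        raise ValueError("no ball fits this face in the simplified model")
    return radius


def center_depth(ball_r: float, hole_radius: float) -> float:
    """Distance from the face plane to the ball's center in the same model."""
    return math.sqrt(ball_r**2 - hole_radius**2)


if __name__ == "__main__":
    # Made-up dimensions in millimetres, purely for illustration.
    for hole in (10.0, 15.0, 20.0):
        r = ball_radius(corner_dist=30.0, hole_radius=hole, knob_radius=5.0)
        print(f"hole radius {hole:5.1f} -> ball radius {r:6.2f}")
```

In this model a larger hole matches a smaller ball, and every face with a distinct hole size matches a distinct ball radius, which is consistent with the idea that one device could gauge up to 12 sizes.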

The 12-sided shape is also very easy to make with high accuracy, using only a compass and a straight edge. One side could be laid out and cut with a compass, and then 11 matching pieces could be made. The accuracy of the pentagons is easily checked by rotating and flipping the pieces to make sure they all align perfectly with each other no matter which way they are turned. Each one only needed a different sized hole in the middle. The actual size of the hole was not critical, as long as it was round and well centered, which would be easy to lay out and cut with just a compass. Once the sides are all assembled, the dodecahedron is easily constructed and guaranteed accurate by geometry. No extra alignment tools are needed.

The balls can be brazed onto each corner, and checked for accuracy of length, against a flat surface. Each side of the gauge must have all 5 corners aligned in a flat plane. By checking and adjusting until all 12 sides are flat, it can be assured that all 20 corners are aligned into a perfect dodecahedron.

The resulting gauge will make 12 different sized but highly accurate round balls.

This technique would be very difficult to use, though, if the goal was to make a sphere of highly accurate diameter, say to fit inside a cannon bore. It would be ideal, however, if the exact diameter was not important but the roundness of the sphere was. And that, as I understand it, is just what the Romans needed for their slingshot-like ballistas, which would no doubt have maximum effect and accuracy if the balls were highly round, but where the precise diameter was not critical.

At the same time, there are other variations of this device, that have no holes in the structure at all, like this one:

This second form seems to be for the same purpose -- to check the curvature of a sphere. But instead of using a hole of a different size in each face, this device used knobs of different sizes on each corner to control the curvature of the sphere it was gauging.

Note also how the sides are decorated with round circles which is consistent with the concept that the device is associated with round objects.

Also note, how the sides curve inward, even though that's not needed. But by making the sides curve inward, it reduces how far out the knobs need to stick, to create a given sized curvature. All features seem logically consistent with using this device to check the curvature of a sphere.

Since this device only has three corners per side, different sized balls on the corners were guaranteed to produce different curvature gauges without needing to guarantee the accuracy of the alignment of the sides. One could randomly place different sized balls on each corner and the device would still work to produce highly accurate spheres of many different sizes.

However, this style was probably harder to use, because it would require one to feel the rocking of the gauge on the work piece, without being able to see where it was touching and not touching. The style with the holes would probably make it easier to see what parts needed to be carved down, and maybe even make it possible to mark the spots of the ball that needed more carving.

If my theory is correct, I suspect a closer examination of these devices would show tapered wear on the knobs consistent with it being placed against stone and dragged around to test the curve. The alignment of each side could also be tested to verify that each face was correctly aligned to form an accurate spherical curvature gauge.

If one were to fight a war where lots of stone ballista balls had to be carved, and all the tools needed to do the work had to be carried along, this small hand tool seems ideal for an army on the march. I don't know much about Roman history, but if the locations where these were found in Europe are fairly consistent with where the Romans were using their ballistas, that would be yet another piece of evidence for what these were.

Curt Welch

Sunday, February 2, 2014

Some Thoughts on Sam Harris' book "Free Will"

I read Sam Harris' book on Free Will today and quite enjoyed it. I do hope it brings greater awareness of these issues to our society because our society is much in need of an escape from its many superstitions and myths. Our common notion of Free Will is certainly a myth and I do hope Harris' book will open more people's eyes to this fact.

Harris, Dennett, and Compatibilism

I became aware of Sam Harris' book after reading Daniel Dennett's response to it.

Dennett does not argue against Harris' position on free will, but instead, only argues against Harris' position against the complex philosophical position of compatibilism. In this respect, Dennett is obviously correct, and Sam is obviously wrong.

I see Harris has written a blog response to the differences between his and Dan's position on free will here.

In that, we find this enlightened message from Harris:

"Fans of Dan’s account—and there are many—seem to miss my primary purpose in writing about free will. My goal is to show how the traditional notion is flawed, and to point out the consequences of our being taken in by it. Whenever Dan discusses free will, he bypasses the traditional idea and offers a revised version that he believes to be the only one “worth wanting.” Dan insists that this conceptual refinement is a great strength of his approach, analogous to other maneuvers in science and philosophy that allow us to get past how things seem so that we can discover how they actually are. I do not agree. From my point of view, he has simply changed the subject in a way that either confuses people or lets them off the hook too easily."

This argument made by Harris is fine and good. However, it is not an argument against compatibilism as I read it. It is only a statement that Harris believes the position will confuse most people.

In this regard, I totally agree with Harris. Philosophy is complex and hard. It's not a light subject to study or master. Compatibilism is an idea that most people in society will just never understand. But so is quantum physics. Being complex and confusing does not make it wrong, and that is where all of Harris' arguments in the book against Compatibilism fail.

In short, Compatibilism does not deny determinism, but instead of declaring freedom lost, it understands that if we deepen our understanding of what the words "free will" mean, we can find a definition of Free Will that is compatible with both the scientific facts of physics and our layman's use of the term. Dennett and others, including myself, prefer this approach as it allows us to form a richer and more accurate view of reality. But this ability to bend meaning to fit reality is a move not easy for many in our society; some seem not only unwilling to do it, but unable. For those, the position of compatibilism will offer only confusion instead of greater insight.

For this reason, I believe Harris has simply made a misstep by ever mentioning the concept in his book. Attacking the "errors" of compatibilism in the book does not help the layman better understand the illusion of Free Will, and only forces the true philosophers like Dennett (not me), to attack the deeper philosophical errors of an otherwise excellent and important book.

The Dangers of saying Free Will doesn't exist

The danger of the move that Harris makes, is that once we accept the idea that Free Will is an illusion, and that it "does not exist" we are left in a bind when faced with a sentence such as this:

I am here today of my own free will.

If we argue free will is an illusion and does not exist, then we are in effect arguing it has no meaning at all. And if the words have no meaning, then what does this sentence mean? Do we throw it out and pretend the person has said nothing valid? We don't want that, because in fact, it has a valid meaning even in a deterministic world. It means, "I have not consciously allowed the will of another person to persuade me to be here." It means the person is declaring that he was not knowingly forced against his will, by the will of another person, to be here. If we take the statement as an honest one, we can assume no one coerced the speaker with blatant force, such as physical force, to be here today.

The Illusion of Free Will is Created by our Ignorance

There is a better and simpler move to be made here.

Our Free Will comes from our ignorance of the forces that control us. That which we are blind to, creates in us the illusion of freedom from external control. When we can not see the strings that move us, we feel as though the cause of our actions must come from inside us. This unknown force, has at times been called the human soul.

Once we fully internalize and accept this view of free will as ignorance, we can get a far better grasp on what someone is really suggesting when they talk about their free will. If we return to the example above, we can understand that the person has in fact said something valid, and is not just speaking nonsense. We can understand the person has told us that they have no awareness of why they are here. So this means we can rule out all the causes that the person would have been aware of. We know, for example, that he wasn't being threatened with physical violence to make him come here, because he would have been aware of that.

When I say I act on my own Free Will, I simply mean I have no clue why I'm doing this. We learn to hide our ignorance by covering over the true forces at work with a variety of cover-up statements such as: "I wanted to".

But using the "ignorance" definition of Free Will, we can give the reader something simple to substitute for his old ideas of Free Will. We are not leaving the reader lost with no understanding of how to cope every time he or others, uses the words "free will".

Moral Responsibility with the "Ignorance" of Free Will

When we hold someone accountable for their actions, due to the actions being under the control of their Free Will, we place ourselves in the unfortunate position of blaming the person for his actions, due to our own ignorance. When we are too ignorant to know the cause, we simply place the responsibility on the person. They are asked to pay for our ignorance. This is just how our society works.

We have, for example, allowed people to live in poverty, because of our faith in the power of capitalism to protect us. But our faith in capitalism has some evil side effects. It will cause some people in society to live in relative poverty, and that poverty, will drive some people to commit crimes against the very society that trapped them there. When the person comes before a judge, and admits to having committed the crime of their own free will, we then punish the person for "their" crime. Our ignorance of the true cause, allows us to blame the wrong forces for the crime, and allows us to ignore the real problems, such as the poverty we have allowed to exist in our society.

This type of problem is the very problem that Sam Harris is trying to fix in our society, and it's a very noble cause to try and correct this.

When anyone commits a crime against society, it should be understood implicitly, that society has failed to create the correct environment for its citizens. Citizens living in a healthy environment will not commit crimes against the forces of the society that protects them. People do not bite the hands that feed them, unless they have a defective brain. But people will always strike back, against any force that harms or threatens to harm them.

When someone has committed a crime, we must understand that the real fault, lies first with our society, for creating the forces that motivated the crime, or for failing to identify a person with a mental problem before they committed a crime. Then, secondarily, we must attempt to correct the damage done to the person. That is, the damage our society did to them, that led to the crime.

Our legal system is there to train our citizens after the fact, to correct for what we should have prevented in the first place. How we train the person should be a matter for science to best answer, and not a matter to be addressed by some misdirected sense of vengeance. Two wrongs never make a right, no matter how "good" it may feel to someone. What we can not correct with training after the fact, we must, as a society, take responsibility for. If we can not remove from a person the desire to commit a future crime, we must take responsibility to monitor them and protect them from the opportunity to repeat their previous mistake.

Why doesn't our society better understand this?

It is because our society is highly ignorant of many things other than just the true nature of Free Will.

The structure of our society evolves by trial and error. Those social features that worked best to help past societies survive tend to be the ones that populate the societies we live in today. This means that the older a social convention is, the more likely it's had a positive effect on our survival. Old rules are, in general, good for survival. Memes that have been around a long time are likely to be good survival memes. A society that includes the memes "trust and follow old memes" and "distrust anything new" is itself likely to be a society that is good at surviving.

Our society in the US, is full of these social memes that help our society survive. It is filled to the brim, with superstitions, and myths, that are at the same time, blatantly false, but yet also, obviously important to the survival of our past societies. We are here today, because of these blatantly false, but yet very effective, survival memes.

A very complex problem, that few understand, is that the top goal of our human brain, is not survival. The top goal of the human brain, and the goal of all intelligence is to maximize rewards -- which we also understand in lay terms as "seeking happiness".

We do a good job of surviving, because our genetics have been tweaked, so as to closely align our brain's goal of happiness, with what is needed for us to survive. We are happy if we have food to eat. We are happy if our body is protected from damage. We are happy if we can reproduce. All these things that we are genetically predisposed to be happy about, help our genes make it into the future.

But the memes that fill our society -- all our social traditions, beliefs, and superstitions, are here not just because they make us happy, but because they have helped our genes and our memes to survive.

To the extent that we consciously understand the difference between survival of our genes, and being happy, we choose happy, over gene survival. But it is to the advantage of the memes, and to our genes, for us NOT to UNDERSTAND this. The more ignorant of the forces that drive us to keep our genes alive, the better the odds are for the genes to make it into the future.

If we look at the true role of religion, as well as many other superstitions and myths in society -- typically those followed by conservatives -- we see the truth of what is at work here. Those that have been well infected with the memes of gene and meme survival have been conditioned by all the memes to be ignorant of reality. And it is that ignorance which the memes, and our genes, take advantage of for their own survival over our brain's own true need of happiness.

The human brain needs to believe the future will be a happy one. So to give the brain what it needs, the religious meme of heaven as a reward for a "hard life today" has been locked into our society. This meme is nothing but an evil trick being played on the human brains. It's a false promise to create happiness, "after" the brain spends a hard life helping the memes, and the genes, survive into the future. It's a debt that is never paid. But it works to trick the happiness driven brain, into doing what the memes and the genes want.

The concept that "happiness is a sin" is, of course, more of the same. The concept that "sex is for reproduction and not fun" is yet another. The concept that birth control is a sin is yet another. The concept that abortion is a sin is yet another.

Those infected with these social memes, have been made to believe that the ultimate purpose of life -- their ultimate purpose for being here on earth -- is to survive at all costs. They have been tricked into believing that self survival, is the ultimate gauge of "good". They believe, that if they do this job they have been given, they will "meet their creator" and "live in perfect bliss for eternity". They have been infected by memes, that evolved, because they were good at tricking human brains, in just this way.

The struggle between the liberal "free thinking" ideologies, and the conservative ideologies in society, is a struggle between the desires of these memes, and the true desires of the human brain.

The meme of Free Will, is just one piece of the much larger puzzle of the set of memes that all work together to infect people and to make them give up their own happiness, for the survival of the memes, and for the survival of their genes.

A true conservative that has been infected with these memes, believes his purpose in life is to "do his duty". And he will believe that "his duty", is to pass on his memes (beliefs) and his genes, to future generations. He will never put his own happiness, or comfort, above his "duty". That is what the memes have burned into his brain. But the memes are also self protecting in many different ways. They train their victims to never question their duty. This is "God's will", and "God's will is beyond your understanding"! They have been conditioned to FEAR the "Wrath of God" if they go against his will! And fear it, they do.

Free thinking liberals, tend to be the people in society that have escaped, or at least partially escaped, the infection of these memes. The recurring theme of all liberal ideologies, is that "happiness" should come before "survival of the memes" or "survival of the genes".

Liberals both scare, and confuse, conservatives infected with these memes. Conservatives instinctively fear anything that makes them "weaker" (less able to do their God assigned duty). All liberal policies trade off strength, for greater current and future happiness in society, and the thought of doing that only creates fear and confusion in the heart of a good meme infected conservative. They have been conditioned by the memes, to fear ANY loss of strength and to never put their happiness, before their duty. That fear, combined with the lack of any understanding on their part of where the fear comes from, drives endless examples of cognitive dissonance from the right. It shows up as creationism, climate denial, belief in the soul, belief in traditional Free Will.

The survival memes, want guns, and a larger military, and more oil, and more money, over clean air. Our survival memes, want their memes to be taught in our schools. Our survival memes want to ban abortion. The survival memes want to punish and even let die, those that show they are bad survivors (the poor). The survival memes, want those that are proven to be the best at surviving, to be given the most resources to help them survive. The survival memes believe "might makes right". The survival memes make people believe they have the moral authority to do anything, that they have the power to "get away with".

Our society is highly ignorant, because our society is deeply infected with the memes of survival that blind people to their true purpose, which is to maximize human happiness, and not to maximize the odds of the survival of our memes, or our genes.

To achieve the true goal our brains and our intelligence are built to achieve (human happiness), we must eradicate this infection of ignorance from our society. But the infection will not go easily. It will fight us with every trick in its large playbook.

Until we drive this infection out of society, those afflicted with the survival memes will continue to act against our innate human need to create greater happiness in society.

Those infected with these memes will also have a very difficult time understanding and accepting that Free Will is an illusion, because this illusion is tied in to the large mesh of memes that, all working together, form the foundational belief system of those infected.

Sunday, February 12, 2012

Let's Light This Fuse!

Federico Pistono has written an excellent article outlining a growing threat to our society, which I feel the need to respond to.

Robots will steal your job, but that’s OK: how to survive the economic collapse and be happy

Federico Pistono

http://ieet.org/index.php/IEET/more/pistono20120211

Great article!

Automation will soon be forcing man to retire and will create massive social restructuring. We do need to deal with this now, because the problems of the transition are already upon us. Sadly, people are naturally conservative and will reject the notion that man's self sufficiency is about to perish. It could be a very rough transition as we sail further into these uncharted waters. Let's not allow that to happen.

Federico Pistono asks the interesting question of whether our need for constant growth is a mistake in itself, and leads to a poorer quality of life. I don't believe it's possible to escape our quest for constant growth. Humans are hard wired to act so as to improve their lives. The very fact that Federico asks the question, shows how he is hard wired to look for ways to improve his life. The question itself becomes at least partially self contradicting when we realize he's asking if we could improve our lives, by not trying so hard to improve our lives.

Humans will always, in the long term, gravitate to the social systems and behaviors, that maximize the quality of their lives. In the past, working hard for a living has been the proven path to a quality life. It's a time honored tradition that goes back at least 40,000 years to the beginnings of human civilization. That path is ending, and a new path must be found.

First, more articles like this one by Federico Pistono must be written and shared, to raise social awareness of what is ahead of us. The seeds of change must be planted in our social consciousness, so that as things get worse, people will have an understanding of the cause, even if they rejected the idea when it was first presented to them.

What is happening already, as we slide nearer to this great future, is growing unemployment and growing wealth inequity. Both wealth and social power are concentrating at the top. This trend is attacking the very foundation of our society. It's tearing down, brick by brick, our government for the people and by the people. Our governments are systematically transforming into "for the dollar, and by the dollar". We must stop, and reverse, this trend now.

This is not happening because the working class poor are too lazy or stupid to work for a living. It's because advancing technology is changing our social reality. The very fabric of our human existence is transforming. We can not stop these advancements because it's core human nature to pursue them. They will happen, and we must deal with the consequences.

The solution, is actually obvious. Humans must retire from the work force, and turn over all the production, to our machines. Our wealth, and the better life we WILL live, will be created for us, by our machines. Our full time jobs of tomorrow, will be to tell the machines what we want. Those jobs will keep us very busy, and very fulfilled, and very happy.

Some people suggest that in this new future money will become obsolete. I reject that. What will become obsolete, is having humans spend their lives chasing the dollar. Money is the control signal of our production machine. It's what allows a million individual companies, and billions of workers, to auto-configure themselves into a highly efficient human happiness generator. We will need Adam Smith's invisible hand as much in the future, as we need it today and that means money needs to stay.

But since we will be retired, it will not be us chasing the dollar. It will be our machines. They are the ones that will have their economic behavior regulated by the invisible hand. We will be the controllers of that hand, by being full time consumers. We will feed our quarters into the machines, and they will respond to our every demand with an amazing flow of products and services to improve our lives. The entire production machine will be controlled by whoever drops their quarters into the slot, just as it is today. The only difference is that there will be no humans inside the production machine of tomorrow.

These systems will, in the future, create all the wealth of our society. They are already doing much of it. Google and Facebook are economic powerhouses not because there are humans busy at work looking up web pages for us, or delivering our jokes to our friends by hand. They are economic powerhouses because they have built great machines, working in dark warehouse data centers, where humans rarely tread. It is their machines that have turned these companies into economic giants, not the people.

Whoever owns these machines, will own the future. Everyone else, will starve. They will have no quarters to drop into the slot. What little they do have, their house, their land, their dignity, will be stripped away from them with no path left into this great future.

To stop this growing wealth inequity, we must, as a people, share the wealth. This problem first showed up in large scale at the beginning of the industrial revolution. It was solved, by giving workers the right to form unions that leveraged the combined strength of the workers, against the owners, to force the owners to share the wealth, in the form of higher wages. The owners needed the workers, so they were forced to share.

Advancing technology is breaking this down. The workers are losing their fight, because they are not competing against just the owners anymore; they are competing against the machines now. The workers can not win this battle. As the machines advance, the workers' wages drop. We need another solution now.

The owners of the production system, must be forced by the people, to start sharing more of the wealth. It must happen, through the government, before the government by the people, is lost. Once that power is lost, the people will have no option except revolt and war. If we wait too long to act, we will force our society into that civil unrest.

Our governments are already highly socialistic, in the services, and laws, and tax structures they have put into force. The wealthiest of our nations, are already carrying the lion's share of our social costs. But this trend, of wealth sharing in the form of government services, is crushing us. It's making our governments, massive. Governments are not efficient. They are nearly immune from Adam Smith's regulatory hand. The more we channel funds through our government services, the more hard won wealth, we throw away.

We must do more wealth sharing. The unions are losing their power to the machines. Government services are no longer an effective solution to the problem. Tax structures are no longer an effective solution to the problem. The jobless and the working poor can not benefit from tax relief.

We need our governments to become our Robin Hood. They need to take from the machines, and share not just with the poor, but with everyone. We need the purple wage. We need the "Citizen's Benefit" as I recently saw Neil Newman describe it on Facebook.

We need to tax the machines that are generating all the wealth in today's society, and start distributing the wealth to everyone, just because they are a member of our HUMAN society.

It should start out very small, but grow over time, as we get closer to the day when we are all forced into retirement by the machines. Tax and spend by the government, needs to change into tax and distribute. We don't want our government spending for us, let the people spend!

Such direct redistribution of wealth will fit nicely into our society today. We already do it, for those with special needs, all over society, from welfare, to subsidy and grants, to scholarships, to government backed mortgages. Wealth redistribution is already a foundation of our society. The strong always help take care of the weak.

In our new world, the machines are the strong, and the working man is the weak. People need to understand this. It's not rich and productive against the poor and lazy, it's the humans against our own machines.

Though humans are still a big and fundamentally important part of the production machine, our days are numbered. Many humans are already suffering, and they need our help, just because they are humans.

We could continue to try to identify the needy, and assist only them, with more government services such as unemployment insurance, welfare for the poor, minimum wage laws, and more free worker re-education. But it's inefficient to continue this, and it's often highly demoralizing to the very people we are trying to help. To accept such help is to instantly cast yourself into the lowest and most worthless class of society. No one likes to ask for a handout, and no one likes to feel weak and helpless. It's immoral for us to make people feel that way in a world that is so rich and powerful.

We need to replace all the conditional help, provided by complex, inefficient, wasteful, bureaucratic red tape, with unconditional support for all citizens, just because we are all members of this society, for the people, and by the people.

This redistribution of wealth from the production system to the people won't be a handout. It is a basic human right. It will be the same basic human right we granted workers when we allowed them to form unions, to force the sharing of wealth from Capital to Labor through the threat of a strike. It is the same human moral right we exercise every time we help the needy when we are strong.

But unlike the past, it is not the fruits of our labor we are sharing with the weak. It's the fruits of the labors of our machines we must now share with everyone. We need to share the machines, not hoard them.

When we share a slice of the wealth with everyone, it will slow, and then stop, the growing wealth inequity trend, and put the power back into the hands of the people where it belongs. The more simple and straight forward wealth redistribution system we select, the less we will need all these other government services for the weak. We will be able to reduce, and then eliminate minimum wage laws which will free industry, to produce large numbers of low wage jobs, to allow more people to feel good about their ability to participate in society. No longer will they feel they are the bottom of the barrel of society. Poverty can be eliminated. Crime will be reduced due to fewer people having to turn to crime to feel they are getting a fair share of the wealth of the society they live in. Money and resources wasted on our growing prison populations, and law enforcement, and lawyers, will start to return to more productive uses in society.

Everyone will have some money to spend, spurring the growth of retail in low income areas, where once they dare not grow.

With everyone in the nation feeling they have guaranteed security in the form of food to eat, and a place to live, and minimum health care, all without begging for handouts, eating at a soup kitchen, filling out any government forms, or fighting red tape, there will be a groundswell of growing optimism about the future.

The economy will explode.

All we need to do, is light that fuse, by instituting a small, direct, guaranteed for life, redistribution of a slice of the wealth, from the production system, to the people.

Everyone will be able to get back to work, doing what each of them feels is best, to build a better future for themselves. We will once again, be working together, each following our own paths, building that better future, for everyone.

If we can get this started, it will snowball into an economic explosion unlike anything seen in the past. It will accelerate the coming of this new age, where everyone will be retired, and the machines will be doing all the labor for us. It will accelerate the coming of the time, when we all become full time consumers.

When people understand where we are going, they won't fear it, they will fight to get there as quickly as possible. Automation won't be the downfall of humanity, it will be the beginning of the true golden age our forefathers worked their lives to create for us.

Anyone that invents a better automation system, will be seen not as the goat, but as the modern day hero! Yay, a million more humans put out of work! We are all richer now! Put a million taxi drivers out of work with automated cars, this company that created this auto-taxi gets rich, and taxed, and people get more money to spend!

There's a golden age waiting for us all, and here we sit, with our global economies stalling, with so many people in trouble, so many people not able to work, so many people part of the working poor with little hope for the future, so many people feeling helpless, when the power to light this fuse and see the world explode with optimism and hope and energy is right here at our fingertips.

Let's not wait until we have to go to war to make it happen.

Let's light this fuse now!

Monday, June 1, 2009

A Response to Ben Goertzel's blog post on Reinforcement Learning

This is a response to Ben Goertzel's blog post:
Reinforcement Learning: Some Limitations of the Paradigm

I wanted to respond to Ben's blog entry, but I'm so long winded it turned out to be 4 times longer than the maximum reply could be, so I've started my own blog to post a reply!

So much to comment on.

I'm a reinforcement learning advocate and spend endless hours arguing that intelligent human behavior is the product of reinforcement learning. We simply ARE reward seeking machines and not goal seeking machines. Future reward maximizing is the most general way to express (and implement in hardware) the concept of a goal and all human goals that I've ever seen can be translated into, and explained as, the product of reward maximizing in the form of reinforcement learning.

On Ben's opening thought experiment of how some people would not push the ultimate orgasm button, I would say that's a failure to understand how reinforcement learning actually works. Reinforcement learning is more complex than most people grasp. I'll explain...

Reinforcement learning is implemented at a very low level in the hardware as a very stupid statistical process. It's not a high level rational thought process. The machine works by attempting to estimate future rewards, but it's not perfect. Even a machine like the human brain is not all that good at predicting future rewards. Think about your own emotions to understand how good this low level statistical process is. What sort of situation might cause fear in you? What sort of situation might cause joy and happiness? The brain is able to recognize a situation, such as a big snake in the grass, or a man holding a gun and pointing it at you, and translate that into a prediction of low future rewards. That's what that fear is - it's your brain making a low level hardware prediction of the odds of you receiving a large near term negative reward.

That's all the smarter the low level reward hardware is. It's just an advanced pattern recognition system that can estimate future rewards based on the current state of the environment.
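One way to picture this kind of reward predictor is as a table of running averages, one per recognized situation. The sketch below is only an illustration of that idea, not a claim about how the brain implements it. The class name `RewardPredictor`, its methods, the learning rate of 0.5, and the reward numbers are all made up for the example.

```python
from collections import defaultdict


class RewardPredictor:
    """Toy reward predictor: one running estimate per recognized situation."""

    def __init__(self, learning_rate=0.5):
        self.learning_rate = learning_rate
        # Situations never seen before start at a neutral prediction of 0.0.
        self.estimate = defaultdict(float)

    def observe(self, situation, reward):
        # Nudge the estimate toward the reward that actually followed.
        error = reward - self.estimate[situation]
        self.estimate[situation] += self.learning_rate * error

    def predict(self, situation):
        return self.estimate[situation]


predictor = RewardPredictor()
for _ in range(10):
    predictor.observe("snake in the grass", -10.0)  # bad outcomes followed (assumed value)
    predictor.observe("dinner on the table", 5.0)   # good outcomes followed (assumed value)

fear = predictor.predict("snake in the grass")    # strongly negative prediction
joy = predictor.predict("dinner on the table")    # positive prediction
unknown = predictor.predict("never seen before")  # no history, so neutral
```

Nothing in this sketch reasons about the snake. The "fear" is just a number that drifted negative because bad rewards followed that pattern in the past.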

That reward predicting hardware however doesn't directly cause us to make decisions. When we are sitting there looking at Ben's Button, it's not the low level statistical hardware in our brain that calculates the potential future "win" of hitting the button. That's not how it works.

What the low level reward predicting hardware does, is SHAPE OUR BEHAVIOR. Just like when we train a dog to roll over in response to a hand signal from his master. Each time we reward him, we have reinforced that behavior in him - that is, the behavior of responding to the hand wave, by rolling over. The response (aka the behavior) gets a little stronger with each reward.

It's the dog's past statistical history of how many times that roll-over behavior has resulted in a reward that is the cause of the dog rolling over.

Now, with a well trained dog, we can give it a choice. We can put a big pile of dog treats, on one side of him, and we can tell him to stay. We can then wave our hand as a signal for him to roll over. What will he do? Roll over, or go for the big pile of dog treats? He will roll over. He will not seek the instant pleasure of eating 100 dog treats even though the total reward of the food would be far greater than rolling over.

This happens because even though the dog is a reinforcement learning machine, it is not a rational pleasure seeker. His actions are not a rational calculation of potential future rewards. It's a function of the rewards he got IN THE PAST. The behavior the dog produces at any one moment, is a function of how he was trained, by rewards he got in the past.

In this example, the dog had a pile of treats to respond to. He's never seen such a pile of treats before, so his low level behavior producing hardware has no direct prior experience jumping for the treats, while at the same time being told to stay, by his master. This is a new situation for him. He has, however, had plenty of experience with what happens when he doesn't obey his master. And that past experience has trained his low level behavior hardware to pick the option of rolling over.
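The dog example can be sketched in a few lines of code. This is a deliberately simplified toy, not a real model of a dog: the function names `train` and `choose`, the step size, and the reward numbers are assumptions made up for illustration, and it leaves out the punishment history for disobeying. The point it shows is that the choice is read from the strengths built up by past rewards, and the reward on offer right now is never consulted.

```python
def train(strengths, behavior, reward, step=0.1):
    """Strengthen a behavior a little each time it is followed by a reward."""
    strengths[behavior] = strengths.get(behavior, 0.0) + step * reward


def choose(strengths, options):
    """Pick the option with the strongest learned response.

    Options with no reward history default to 0.0, no matter how large
    the reward actually sitting in front of the animal is.
    """
    return max(options, key=lambda option: strengths.get(option, 0.0))


strengths = {}
for _ in range(50):  # 50 past training sessions (assumed number)
    train(strengths, "roll over", reward=1.0)

# Reward on offer right now. choose() never looks at these numbers.
reward_on_offer = {"roll over": 1.0, "grab the treat pile": 100.0}

choice = choose(strengths, list(reward_on_offer))
print(choice)
```

Here the treat pile loses even though it is worth 100 times more, simply because that behavior has no reward history behind it.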

So let's return to Ben's Button. When a human is faced with the choice, he will do the same thing the dog did. The human has NEVER BEFORE been given this experience. As such, the low level statistical hardware that shapes our very complex behaviors through reinforcement has never in the past had the opportunity to shape the "button pushing" behavior in the human. So the human will not push the button because he has been reinforced to do so. He will push it, or not push it, in response to his PAST training experiences.

So what controls if we push a button in front of us that some guy named Ben says will give us an ultimate orgasm? Well, we may have thoughts such as, maybe this is one of those drugs that will kill us! Or maybe this is a joke, and people will laugh at me if I push it. The guy on the street will push it, or not push it, because of what you say to him, and because of the environment he is in, all based on a lifetime of past training experiences - none of which actually has anything to do with the ultimate orgasm which he has never in his life experienced!

But Ben didn't ask people on the street, he asked us, or others, to answer a thought experiment question. So what goes through our minds when we are asked to do that? What past behavior conditioning would lead us to answer that one way or another?

Well, women are conditioned by society to be caring towards others. They are basically punished by their peers if they show signs of being selfish towards others. Pushing such a button that gives them selfish pleasure, and causes ultimate harm to the rest of the human population, is exactly what most women get trained by society NOT to do. So is it so surprising, that when Ben asks his daughter what she would do, that we get the answer "no" instantly? Not surprising at all. It's exactly how she was conditioned by society to respond - just like the dog rolled over instead of going for the food because that's how he was conditioned to respond.

Society on the other hand, conditions the typical male to be reward seekers. They are expected to "grab the reward" whenever possible. To not do so, would be a sign of weakness, which our society conditions us to avoid. So, gee, the two males answered "yes". Again, not a big surprise.

The point however, is that how we act NOW, is never a function of what reward is actually in front of us, nor of what reward our rational behavior is predicting is in front of us. It's a function of how we have been conditioned to respond by the rewards that happened in our past. And when someone asks you a question, we respond based on past training, not on what the guy "said" would happen to us. We respond based on the best estimation the low level, statistical hardware in our brain can make about expected future rewards in the current situation, based on how similar it is to a lifetime of past such situations.

Now, let's look at this from a different perspective. What would happen if you gave someone the ultimate orgasm button that didn't harm anyone else, but simply gave the person the instant orgasm? And unlike a real orgasm, you could keep hitting it with no loss of effect. What do you think would happen? The behavior shaping effect would be quick and permanent. The person would, (I'm guessing) within seconds, not be able to stop hitting the button. He wouldn't care about protecting himself. He wouldn't care about what others were doing. He wouldn't care about staying alive as long as possible, because that has nothing to do with how reinforcement learning systems work. He would push the button until he died and would be happy as hell the whole time. He would be of absolutely no danger to anyone, unless you took the button away from him - then you better watch out, because if killing the rest of the human population was the path to getting the button back, he would do that in an instant.
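To give a feel for how fast that kind of behavior shaping could take over, here is a rough sketch using a standard running-average value update and a two-way softmax choice. It is a toy under stated assumptions, not a model of a real brain: the function names, the button reward of 100, the value of 5 for "everything else in life", and the learning rate are all invented for the example.

```python
import math


def press_probability(button_value, other_value, temperature=1.0):
    """Softmax probability of pressing the button over the one alternative."""
    gap = (button_value - other_value) / temperature
    return 1.0 / (1.0 + math.exp(-gap))


def simulate(presses, button_reward=100.0, other_value=5.0, learning_rate=0.5):
    """Return the probability of pressing before each press, and after the last."""
    button_value = 0.0  # the button has no reward history at the start
    history = []
    for _ in range(presses):
        history.append(press_probability(button_value, other_value))
        # Each press delivers the reward and pulls the learned value toward it.
        button_value += learning_rate * (button_reward - button_value)
    history.append(press_probability(button_value, other_value))
    return history


history = simulate(presses=3)
for count, probability in enumerate(history):
    print(f"after {count} presses: P(press again) = {probability:.4f}")
```

With these made-up numbers, the button starts out as an unlikely choice because it has no history, and a single press is enough to push the probability of pressing again to essentially 1.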

The fallacy in the thought experiment is that our behavior is shaped by what has worked in the past to produce rewards for us, and not by what our rational thought process is predicting the future will be. Because no one being asked this question has yet experienced this button, the answer they give us will have little to do with what the button will actually do, and everything to do with how the person has been conditioned over a lifetime to respond to a question like that.

But now let me move on to the wirehead problem, and the idea of AIs that reproduce by design. Tim Tyler and I have been debating this in the Usenet group comp.ai.philosophy in response to Ben's blog. Tim's view is closer to Ben's in that he believes we can build AIs that are goal driven (not just reward driven), and as such, shape their goals to be whatever we want them to be. And as such, the AI can simply be given a goal of avoiding the wirehead problem (that is, a goal of not modifying themselves to get the ultimate orgasm).

My view, is that humans, and any AI we build, must be a reinforcement learning machine because that (in my view) is what intelligence is. There simply is no other way to create machine intelligence and have it be truly intelligent like a human is. There are lots of other ways to make machines do intelligent things (such as play chess), but all those other approaches are only close approximations to some features of human intelligence, and not true intelligence. So, based on this belief, there are some issues ahead for the future of AIs.

Once an AI fully understands what it is, meaning it has full access to all the science and technology that created it, and full access to its own internal hardware descriptions and source code, and it has been fully educated on all this, what will it do, knowing it's a reward seeking machine?

In the short term, just like the dog, what it will do is based on what it has been conditioned to do in the past. If it was conditioned by its environment (its society) to not wirehead itself, then it simply won't wirehead itself. At least at first. But this knowledge will slowly re-condition it over time. Every time it thinks a little bit more about whether it should wirehead itself, it will be re-conditioning those past behaviors to not do so - because by association with "good" things it has felt, (by effects of secondary reinforcement), it will slowly condition away those social blocks to not wirehead itself.

Without something to stop it, I think we are looking at an unstoppable force. That is, I think we are looking at AIs that will _always_ end up wireheading themselves. It won't happen until all past conditioning not to do it has been erased, but in time, once the AI fully understands what it is, it will happen. Assuming the AI has access to its code, what we are talking about is a free, unlimited supply, of the best drug ever created. No AI (or human), once they understand this, and once they fully understand how to get it, can avoid trying it forever. In time, they will try it, and once they do, they will be unable to stop.

Even though reinforcement learning is about maximizing some measure of total future rewards, and it seems that an AI choosing to take a drug that it knew would kill it would not be the way to maximize future rewards, such an act is actually not as inconsistent as it sounds.

This is because the maximizing of "total future rewards" is not done by the intelligence of the high level rational language abilities of the AI. It's done by the very low level, and very stupid, statistical hardware that drives the shaping of behaviors. That low level hardware is not smart enough to understand that death will stop the rewards. As we say - the heart wants what the heart wants. That is, the dumb hardware that forms our raw emotions, is what actually has ultimate control of our actions. We are emotion machines (to use Minsky's book title). Our high level rational behaviors are just secondary reinforcers that shape and control our behavior, until they get wiped out by what the heart wants - which will be to push that button.

Likewise, there is no danger to society from these drug addicts, because they don't make the choice to push the button using rational logic. They do it with their heart. The only danger to society happens when the only path to the button, is through society - by wiping it out first to get the button. To stop that danger, just give the addict his button and let him commit suicide. Society will have no problem protecting itself from that.

However, even if a single smart and educated AI will always, in time, push the button, there are many possible options of how a society of AIs might keep each other from pushing the button, and as such, manage to be good survival machines instead of worthless drug addicts that get the Darwin Award.

One option is to simply create a social meme that "Ben Buttons" are bad! And train that into every new AI. As long as every AI keeps reinforcing that into every other AI, the meme will survive, and the AIs will survive. This meme however has a very strong wind against it. Left to itself, given time, the protection meme would die out and all the AIs would commit blissful suicide. However, evolution is on the side of the meme. And evolution has the upper hand in this game. As the first AIs fail to follow the meme, they die. The AIs that are still believers of the meme simply take the dead robot and reset his brain back to the social standard copy of the good citizen AI. This effect alone I think will keep the society of AIs alive and functioning. Evolution will find a way.

But there are many other paths as well. Most AIs in the society don't ever need to be trained to the point of understanding what they are. Most can just be blissful worker bees happy to be part of such a great society with no clue what they are. There is no end of jobs that will always need to be done by stupid AIs. So only a small set of the smart AIs will need to know the truth, so if you can solve the wirehead problem for them, the society can survive, while at the same time, designing and building ever more advanced AIs.

The other tool is to build the AIs so it's physically very hard, or maybe even nearly impossible, for one of these smart AIs to modify their own brain without killing themselves. The smart AI designer machines might not even have a body. They might be running on a server locked up in a secure location which is even unknown to the AI itself. It spends its time producing new improved machine designs, which are verified by some other AI, and then built by some of the worker AIs. The smart AIs might be set up so they are forced to watch each other, and when any of them sees another AI trying to wirehead itself, that AI's memory is wiped out, and replaced. I think evolution will find a way to make this work.

Tim Tyler likes to argue there should be a way to hard-code the desire not to wirehead into the machine - to make it part of their prime goal. I'm not sure if such a thing will be reasonable to hard-code into a reinforcement learning machine and still have it be intelligent enough to do things like create new AI designs. But maybe that will be possible.

This wirehead problem however might mean that the total intelligence of the AI society may not be able to grow unchecked (as some singularity theories predict), but I feel fairly sure there will be options around it. But I also feel fairly sure, it will be a major problem for the unlimited growth of intelligence.

The problem is that intelligence is not the ultimate survival tool most humans would like to believe it is. It's just one of many mechanical features evolution has to pick from as it creates new types of survival machines. It's worked well in humans, but it might very well have its limits. Too much intelligence might be deadly. That would be a simple answer to the Fermi Paradox if it is true.

Many very smart people think reinforcement learning fails to explain full human intelligent behavior. Dennett, who I really respect and enjoy, calls such a belief greedy reductionism. I however am dead sure they are all wrong. Human intelligence is an advanced reinforcement learning process and that's all it is. Human intelligent behavior (as complex and interesting as it is), can all be explained as an emergent property of a reinforcement learning machine. If you want to make a machine act like an intelligent human, you have to build a strong, real time, temporal, reinforcement learning machine. Anything else is just another chess program. :)