You're welcome, Toby. Thanks in return. - You make a brave foray into the steering problem from a moral vantage, but fail to master that vantage (I think), and your own thesis.
Thank you for your honest and critical appraisal of my essay. I did find it a challenge to communicate the ideas that I thought most relevant to the essay topic. Your feedback has been helpful in prompting me to think further about making the ideas more precise and how to communicate them more clearly. I hope you might also do me the favour of reading and replying to my responses.
The light manner in which you dismiss all prior moral philosophy (waving the wand of moral relativism, p. 2) doesn't encourage the reader to trust your judgement, or to give a fair reading to your own ideas.
My intention wasn't to dismiss all prior moral philosophy. I tried to suggest that the Markov decision process model can be used to represent both consequentialist and deontological moralities. I also found that representing a moral framework as a reward function didn't automatically bring us any closer to knowing which moral framework is the "best". This problem struck me as similar to the problem of moral relativism, and I consider it further evidence that meta-ethical moral relativism is a problem that needs addressing.
I tried, but couldn't assume with you that "each agent's morality, from a person to a nation, may be calculated as a reward function." (p. 3)
Perhaps that assumption was poorly worded and much stronger than the following argument required. In reality, agents are often irrational and are unlikely to have moralities consistent and well-defined enough to be translated into a reward function. This is especially true of agents with many cognitive biases, such as people, and of agents composed of many individuals with differing systems of moral preferences, such as parliamentary governments. Even if an agent's morality can be expressed as a reward function, it might not be possible to calculate in practice because it could require many input variables. This may suggest that people and other agents should be encouraged to adopt more explicit systems of moral preferences.
In any case, the focus may have been better placed on the assumption that systems of moral preferences have associated reward functions.
A reward function is an abstract way of looking at the moral preferences for actions and states that an agent may have. A system of moral preferences may be translated into numerical values associated with states and actions, with greater value indicating greater preference. This could be an arbitrary one-to-one mapping of all combinations of states and actions to numerical values: for example, praying facing a particular direction at the right time of day might have a large positive value. Alternatively, the reward might be calculated through a formula, such as summing all human happiness in that state. Either could fit into the general concept of a reward function.
I think we should be able to assume that, at the very least, an arbitrary one-to-one mapping of reward values exists for any agent with unchanging moral preferences. However, I think it may be more informative to try to design functions based on moral principles and attempt to calculate the reward (moral preference) of each state and action. The calculated rewards could then be compared with our intuitions and with other reward/preference functions more easily.
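To make the contrast concrete, here is a minimal sketch of the two kinds of reward function described above. This is my own illustration rather than anything from the essay: the state is assumed to be a simple dictionary, actions are strings, and every name and value is hypothetical.

```python
# A minimal, illustrative sketch (not from the essay) of the two kinds of
# reward function discussed above. All names and numbers are hypothetical.

def deontological_reward(state, action):
    """Action-based rewards: an arbitrary lookup over (state, action) pairs."""
    # e.g. a large positive value for a prescribed ritual action
    table = {
        ("dawn", "pray_facing_east"): 10.0,
        ("dawn", "steal"): -100.0,
    }
    return table.get((state["time_of_day"], action), 0.0)

def consequentialist_reward(state, action):
    """State-based rewards: a formula over the state, e.g. summing the
    happiness of everyone in that state."""
    return sum(person["happiness"] for person in state["people"])

state = {"time_of_day": "dawn",
         "people": [{"happiness": 0.5}, {"happiness": 0.25}]}
print(deontological_reward(state, "pray_facing_east"))  # 10.0
print(consequentialist_reward(state, "any_action"))     # 0.75 (0.5 + 0.25)
```

Both fit the same signature of mapping a state and action to a number, which is all the general concept of a reward function requires.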
From here, I saw you struggle earnestly to deduce what might more convincingly have been assumed: that we value our own existence.
I don't think nihilist and non-anthropocentric viewpoints should be dismissed out of hand. For this reason and others, I think there is great value in finding a logical argument for valuing ourselves and others.
These ideas were challenging to formulate and communicate. I don't think it would be wasted effort on my part to make the argument more convincing, so if there is any part of the argument in particular that you disagree with, I would like to know which.
You then deduce that we place a supreme value on learning in all decision agents, human and non-human (p. 4), but this doesn't seem well supported by the argument, nor does it seem a moral principle in substance or form - not like those of the philosophers you dismissed earlier.
Valuing learning was a suggestion rather than something that was deduced in this essay. Perhaps I should have tried to argue how valuing the learning of all agents can be the basis for any number of moral decisions.
As a moral principle, valuing learning suggests:
- Agents should act to learn and experience as much as they can.
- Actions and states that cause disability and death are immoral, as they diminish the ability or opportunity for agents to learn.
- It's immoral to destroy sources of information and experiences, including artwork, books, historical artefacts, animals and ecosystems.
- It's immoral to deny people knowledge and experiences unless it would cause disability, death or destruction.
- Even though the negative value of other outcomes may be greater, any event or experience that results in learning has at least some value.
To put this moral principle into practice would require weighing up immediate learning (reward) against future learning (expected value); a sketch of that trade-off follows below. Beyond that, I think valuing learning as a moral principle is quite an intriguing idea with many intuitively positive ethical results.
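As a rough illustration of that weighing-up, the standard MDP approach is a discounted return, where a factor gamma sets how much future learning counts relative to learning right now. This sketch is my own and not from the essay; the two options and all reward values are hypothetical.

```python
# A minimal sketch (not from the essay) of weighing immediate learning
# (reward) against future learning (expected value) via a discounted return,
# the standard MDP formulation. The options and numbers are hypothetical.

def discounted_return(learning_rewards, gamma=0.9):
    """Value of a trajectory: r_0 + gamma*r_1 + gamma^2*r_2 + ...
    gamma near 1 weights future learning almost as much as immediate learning;
    gamma near 0 cares almost only about learning right now."""
    return sum((gamma ** t) * r for t, r in enumerate(learning_rewards))

# Option A: learn a little now, much more later (e.g. time spent studying).
# Option B: learn a lot now, little later (e.g. a destructive experiment).
option_a = [1.0, 2.0, 4.0]
option_b = [5.0, 0.5, 0.5]

print(discounted_return(option_a))  # 1 + 0.9*2 + 0.81*4, about 6.04
print(discounted_return(option_b))  # 5 + 0.9*0.5 + 0.81*0.5, about 5.86
```

With a high gamma, the agent prefers option A even though option B offers more learning immediately, which is the kind of trade-off I had in mind.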
Your conclusion feels equally shaky: that we should "search for good mechanisms" for steering, but "using scientific processes to decide whether a political or economic mechanism is effective". You spring this in the final two sentences without explaining it further (p. 7), as though you yourself were not quite convinced.
I can only really apologise for the shaky and unpolished conclusion. I was running up against the submission deadline, so the conclusion was rushed. I would like to follow up these ideas further in future writing (and maybe try to come up with a more convincing conclusion), so I appreciate that you have taken the time to review my essay.