Computational Goals, Values and Decision-Making

Science and Engineering Ethics 26 (5):2487-2495 (2020)
  Copy   BIBTEX

Abstract

Considering the popular framing of an artificial intelligence as a rational agent that always seeks to maximise its expected utility, referred to as its goal, one of the features attributed to such rational agents is that they will never select an action which will change their goal. Therefore, if such an agent is to be friendly towards humanity, one argument goes, we must understand how to specify this friendliness in terms of a utility function. Wolfhart Totschnig, argues in contrast that a fully autonomous agent will have the ability to change its utility function and will do so guided by its values. This commentary examines computational accounts of goals, values and decision-making. It rejects the idea that a rational agent will never select an action that changes its goal but also argues that an artificial intelligence is unlikely to be purely rational in terms of always acting to maximise a utility function. It nevertheless also challenges the idea that an agent which does not change its goal cannot be considered fully autonomous. It does agree that values are an important component of decision-making and explores a number of reasons why.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 92,923

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Fully Autonomous AI.Wolfhart Totschnig - 2020 - Science and Engineering Ethics 26 (5):2473-2485.
Hard Choices: Decision Making Under Unresolved Conflict.Isaac Levi - 1986 - New York: Cambridge University Press.
Choosing for Changing Selves.Richard Pettigrew - 2019 - Oxford, UK: Oxford University Press.
Utility and framing.Paul Weirich - 2010 - Synthese 176 (1):83 - 103.

Analytics

Added to PP
2020-08-04

Downloads
18 (#857,094)

6 months
7 (#489,614)

Historical graph of downloads
How can I increase my downloads?

References found in this work

Superintelligence: paths, dangers, strategies.Nick Bostrom (ed.) - 2014 - Oxford University Press.
Risk and Rationality.Lara Buchak - 2013 - Oxford, GB: Oxford University Press.
Fully Autonomous AI.Wolfhart Totschnig - 2020 - Science and Engineering Ethics 26 (5):2473-2485.
Superintelligence as superethical.Steve Petersen - 2017 - In Patrick Lin, Keith Abney & Ryan Jenkins (eds.), Robot Ethics 2. 0: New Challenges in Philosophy, Law, and Society. New York, USA: Oxford University Press. pp. 322-337.

View all 6 references / Add more references