Approval-directed agency and the decision theory of Newcomb-like problems

Synthese 198 (Suppl 27):6491-6504 (2019)
  Copy   BIBTEX

Abstract

Decision theorists disagree about how instrumentally rational agents, i.e., agents trying to achieve some goal, should behave in so-called Newcomb-like problems, with the main contenders being causal and evidential decision theory. Since the main goal of artificial intelligence research is to create machines that make instrumentally rational decisions, the disagreement pertains to this field. In addition to the more philosophical question of what the right decision theory is, the goal of AI poses the question of how to implement any given decision theory in an AI. For example, how would one go about building an AI whose behavior matches evidential decision theory’s recommendations? Conversely, we can ask which decision theories describe the behavior of any existing AI design. In this paper, we study what decision theory an approval-directed agent, i.e., an agent whose goal it is to maximize the score it receives from an overseer, implements. If we assume that the overseer rewards the agent based on the expected value of some von Neumann–Morgenstern utility function, then such an approval-directed agent is guided by two decision theories: the one used by the agent to decide which action to choose in order to maximize the reward and the one used by the overseer to compute the expected utility of a chosen action. We show which of these two decision theories describes the agent’s behavior in which situations.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,219

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Success-First Decision Theories.Preston Greene - 2018 - In Arif Ahmed (ed.), Newcomb's Problem. Cambridge University Press. pp. 115–137.
Rationality revisited.Reed Richter - 1984 - Australasian Journal of Philosophy 62 (4):392 – 403.
Conditional Preference and Causal Expected Utility.Brad Armendt - 1988 - In W. L. Harper & B. Skyrms (eds.), Causation in Decision, Belief Change, and Statistics, vol. II. Kluwer Academic Publishers. pp. 3-24.
Evidential decision theory and medical newcomb problems.Arif Ahmed - 2005 - British Journal for the Philosophy of Science 56 (2):191-198.
Decision-theoretic paradoxes as voting paradoxes.Rachael Briggs - 2010 - Philosophical Review 119 (1):1-30.
Are Newcomb problems really decisions?James M. Joyce - 2006 - Synthese 156 (3):537-562.

Analytics

Added to PP
2019-02-25

Downloads
17 (#819,600)

6 months
4 (#698,851)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Caspar Oesterheld
Duke University

Citations of this work

No citations found.

Add more citations

References found in this work

Causality: Models, Reasoning and Inference.Judea Pearl - 2000 - New York: Cambridge University Press.
Superintelligence: paths, dangers, strategies.Nick Bostrom (ed.) - 2014 - Oxford University Press.
Causality: Models, Reasoning and Inference.Judea Pearl - 2000 - Tijdschrift Voor Filosofie 64 (1):201-202.

View all 32 references / Add more references