Literature Review: What Artificial General Intelligence Safety Researchers Have Written About the Nature of Human Values

Alexey Turchin; David Denkenberger

Literature Review: What Artificial General Intelligence Safety Researchers Have Written About the Nature of Human Values

Abstract

Abstract: The field of artificial general intelligence (AGI) safety is quickly growing. However, the nature of human values, with which future AGI should be aligned, is underdefined. Different AGI safety researchers have suggested different theories about the nature of human values, but there are contradictions. This article presents an overview of what AGI safety researchers have written about the nature of human values, up to the beginning of 2019. 21 authors were overviewed, and some of them have several theories. A theory classification method is suggested, where the theories are judged according to the level of their complexity and behaviorists-internalists scale, as well as the level of their generality-humanity. We suggest that a multiplicity of well-supported theories means that the nature of human values is difficult to define, and some meta-level theory is needed.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

Author's Profile

Alexey Turchin

Keywords

Artificial intelligence human values AI Safety

Reprint years

My notes

Similar books and articles

AI Alignment Problem: “Human Values” don’t Actually Exist.Alexey Turchin - manuscript

Safety Engineering for Artificial General Intelligence.Roman Yampolskiy & Joshua Fox - 2013 - Topoi 32 (2):217-226.

Re-creating the Philosopher’s Mind: Artificial Life from Artificial Intelligence.Maurice H. T. Ling - 2012 - Human-Level Intelligence 2:1.

Ai: Its Nature and Future.Margaret A. Boden - 2016 - Oxford University Press UK.

A Case for Machine Ethics in Modeling Human-Level Intelligent Agents.Robert James M. Boyles - 2018 - Kritike 12 (1):182–200.

A Value-Sensitive Design Approach to Intelligent Agents.Steven Umbrello & Angelo Frank De Bellis - 2018 - In Yampolskiy Roman (ed.), Artificial Intelligence Safety and Security. CRC Press. pp. 395-410.

Artificial Intelligence and the Body: Dreyfus, Bickhard, and the Future of AI.Daniel Susser - 2013 - In Vincent C. Müller (ed.), Philosophy and Theory of Artificial Intelligence. Berlin: Springer. pp. 277-287.

Artificial Intelligence Safety and Security.Turchin Alexey & David Denkenberger - 2018 - CRC Press.

The Convergence of Machine and Human Nature a Critique of the Computer Metaphor of Mind and Artificial Intelligence.A. E. Mcclintock - 1995

Neuroscience, artificial intelligence, and human nature: Theological and philosophical reflections.Ian G. Barbour - 1999 - In Zygon. Notre Dame: University Notre Dame Press. pp. 361-398.

Global Solutions vs. Local Solutions for the AI Safety Problem.Alexey Turchin - 2019 - Big Data Cogn. Comput 3 (1).

Human Values in Management.R. K. Dasgupta - 1997 - Journal of Human Values 3 (2):145-160.

Artificial Intelligence: Its Scope and Limits.James H. Fetzer - 1990 - Kluwer Academic Publishers.

Risks of artificial general intelligence.Vincent C. Müller (ed.) - 2014 - Taylor & Francis (JETAI).

Artificial Consciousness or Artificial Intelligence.Florin Spanache - 2016 - Dialogo 3 (2):135-143.

Analytics

Added to PP
2019-04-25

Downloads
444 (#42,077)

6 months
73 (#59,301)

Historical graph of downloads

How can I increase my downloads?

Author's Profile

Alexey Turchin

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Literature Review: What Artificial General Intelligence Safety Researchers Have Written About the Nature of Human Values

Abstract

Author's Profile

Categories

Keywords

Reprint years

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Author's Profile

Citations of this work

References found in this work