The argument for near-term human disempowerment through AI

AI and Society:1-14 (2024)

Abstract

Many researchers and intellectuals warn about extreme risks from artificial intelligence. However, these warnings have typically come without systematic supporting arguments. This paper provides an argument that AI will lead to the permanent disempowerment of humanity, e.g. human extinction, by 2100. It rests on four substantive premises, which it motivates and defends. First, the speed of advances in AI capability, as well as the capability level current systems have already reached, suggests that it is practically possible to build AI systems capable of disempowering humanity by 2100. Second, due to incentives and coordination problems, if it is possible to build such AI, it will be built. Third, since building AI that is aligned with the goals of its designers appears to be a hard technical problem, and many actors might build powerful AI, misaligned powerful AI will be built. Fourth, because disempowering humanity is useful for a large range of misaligned goals, such AI will try to disempower humanity. If AI is capable of disempowering humanity and tries to disempower humanity by 2100, then humanity will be disempowered by 2100. This conclusion has immense moral and prudential significance.

Analytics

Added to PP
2023-09-20

Downloads
544 (#36,052)

6 months
188 (#19,154)

Author's Profile

Leonard Dung
Universität Erlangen-Nürnberg

Citations of this work

Understanding Artificial Agency. Leonard Dung - forthcoming - Philosophical Quarterly.
