Introspective Capabilities in Large Language Models

Journal of Consciousness Studies 30 (9):143-153 (2023)
  Copy   BIBTEX

Abstract

This paper considers the kind of introspection that large language models (LLMs) might be able to have. It argues that LLMs, while currently limited in their introspective capabilities, are not inherently unable to have such capabilities: they already model the world, including mental concepts, and already have some introspection-like capabilities. With deliberate training, LLMs may develop introspective capabilities. The paper proposes a method for such training for introspection, situates possible LLM introspection in the 'possible forms of introspection' framework proposed by Kammerer and Frankish, and considers the ethical ramifications of introspection and self-report in AI systems.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,592

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Holding Large Language Models to Account.Ryan Miller - 2023 - In Berndt Müller (ed.), Proceedings of the AISB Convention. Society for the Study of Artificial Intelligence and the Simulation of Behaviour. pp. 7-14.
You are what you’re for: Essentialist categorization in large language models.Siying Zhang, Selena She, Tobias Gerstenberg & David Rose - forthcoming - Proceedings of the 45Th Annual Conference of the Cognitive Science Society.
Large Language Models and the Reverse Turing Test.Terrence Sejnowski - 2023 - Neural Computation 35 (3):309–342.
Introspection Is Signal Detection.Jorge Morales - forthcoming - British Journal for the Philosophy of Science.
You Don't Know How You Think: Introspection and Language of Thought.Edouard Machery - 2005 - British Journal for the Philosophy of Science 56 (3):469-485.

Analytics

Added to PP
2023-10-06

Downloads
121 (#147,851)

6 months
114 (#36,664)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Robert Long
New York University

Citations of this work

No citations found.

Add more citations

References found in this work

Word meaning in minds and machines.Brenden M. Lake & Gregory L. Murphy - 2023 - Psychological Review 130 (2):401-431.

Add more references