Unjustified untrue "beliefs": AI hallucinations and justification logics

In Kordula Świętorzecka, Filip Grgić & Anna Brozek (eds.), Logic, Knowledge, and Tradition. Essays in Honor of Srecko Kovac (forthcoming)
  Copy   BIBTEX

Abstract

In artificial intelligence (AI), responses generated by machine-learning models (most often large language models) may be unfactual information presented as a fact. For example, a chatbot might state that the Mona Lisa was painted in 1815. Such phenomenon is called AI hallucinations, seeking inspiration from human psychology, with a great difference of AI ones being connected to unjustified beliefs (that is, AI “beliefs”) rather than perceptual failures). AI hallucinations may have their source in the data itself, that is, the source content, or in the training procedure, i.e. the way the knowledge was encoded in the model’s parameters, so that errors in encoding and decoding textual and non-textual representations can cause hallucinations. In this paper, we will observe how such errors come to life and how they might be mitigated. For this purpose, we will analyze the usability of justification logics, to behave as a proof checker for validating the correctness of large language models’ (LLM) responses. Justification logic was developed by S. Artemov, and later on mostly by Artemov and M. Fitting, deriving its main idea from the logic of proofs (LP): knowledge and belief modalities are seen as justification terms, i.e. t:X stands for t is a (proper) justification for X. Justification logic originated from attempts to create semantics for intuitionistic logic where proofs were the most proper justifications, but in further development, justification logic could be applied to different kinds of justifications). With the recent attempts to mitigate incorrect LLM responses, we will analyze various guardrails that are currently used for LLM responses, and see how the logic of justification may provide its benefits as an AI safety layer against false data.

Links

PhilArchive

External links

  • This entry has no external links. Add one.
Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Justification Logic with Confidence.Ted Shear & John Quiggin - 2020 - Studia Logica 108 (4):751-778.
Justifications for common knowledge.Samuel Bucheli, Roman Kuznets & Thomas Studer - 2011 - Journal of Applied Non-Classical Logics 21 (1):35-60.
Realizations and LP.Melvin Fitting - 2010 - Annals of Pure and Applied Logic 161 (3):368-387.
Labeled sequent calculus for justification logics.Meghdad Ghari - 2017 - Annals of Pure and Applied Logic 168 (1):72-111.
The logic of justification.Sergei Artemov - 2008 - Review of Symbolic Logic 1 (4):477-513.
Temporal Justification Logic.S. Bucheli, M. Ghari & T. Studer - 2017 - Proceedings of the Ninth Workshop on Methods for Modalities (M4M9 2017), Indian Institute of Technology, Kanpur, India, 8th to 10th January 2017, Electronic Proceedings in Theoretical Computer Science 243, Pages 59–74.

Analytics

Added to PP
2024-05-29

Downloads
0

6 months
0

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Kristina Šekrst
University of Zagreb

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references