Event Knowledge in Large Language Models: The Gap Between the Impossible and the Unlikely

Cognitive Science 47 (11):e13386 (2023)
  Copy   BIBTEX

Abstract

Word co‐occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs’ semantic abilities is whether they acquire generalized knowledge of common events. Here, we test whether five pretrained LLMs (from 2018's BERT to 2023's MPT) assign a higher likelihood to plausible descriptions of agent−patient interactions than to minimally different implausible versions of the same event. Using three curated sets of minimal sentence pairs (total n = 1215), we found that pretrained LLMs possess substantial event knowledge, outperforming other distributional language models. In particular, they almost always assign a higher likelihood to possible versus impossible events (The teacher bought the laptop vs. The laptop bought the teacher). However, LLMs show less consistent preferences for likely versus unlikely events (The nanny tutored the boy vs. The boy tutored the nanny). In follow‐up analyses, we show that (i) LLM scores are driven by both plausibility and surface‐level sentence features, (ii) LLM scores generalize well across syntactic variants (active vs. passive constructions) but less well across semantic variants (synonymous sentences), (iii) some LLM errors mirror human judgment ambiguity, and (iv) sentence plausibility serves as an organizing dimension in internal LLM representations. Overall, our results show that important aspects of event knowledge naturally emerge from distributional linguistic patterns, but also highlight a gap between representations of possible/impossible and likely/unlikely events.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,435

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Event and Milieu.Andrei Rodin - 1998 - The Paideia Archive: Twentieth World Congress of Philosophy 45:217-221.
On Metaknowledge and Truth.Wybraniec-Skardowska Urszula - 2009 - In Jacek Malinowski David Makinson & Wansing Heinrich (eds.), Towards Mathematical Philosophy. Springer. pp. 319-243.
Knowledge of language in action.Cheng-Hung Tsai - 2015 - Philosophical Explorations 18 (1):68-89.
Monotonicity Reasoning in the Age of Neural Foundation Models.Zeming Chen & Qiyue Gao - 2023 - Journal of Logic, Language and Information 33 (1):49-68.
How Concrete Do We Get Telling Stories?Piek Vossen, Tommaso Caselli & Agata Cybulska - 2018 - Topics in Cognitive Science 10 (3):621-640.
What We Know When We Know a Language.Barry C. Smith - 2006 - In Ernest Lepore & Barry C. Smith (eds.), The Oxford Handbook of Philosophy of Language. Oxford University Press. pp. 941.
Practical knowledge of language.Cheng-Hung Tsai - 2010 - Philosophia 38 (2):331-341.
Are machines radically contextualist?Ryan M. Nefdt - 2023 - Mind and Language 38 (3):750-771.

Analytics

Added to PP
2023-11-28

Downloads
13 (#1,022,934)

6 months
13 (#187,082)

Historical graph of downloads
How can I increase my downloads?