Event Knowledge in Large Language Models: The Gap Between the Impossible and the Unlikely

Carina Kauf; Anna A. Ivanova; Giulia Rambelli; Emmanuele Chersoni; Jingyuan Selena She; Zawad Chowdhury; Evelina Fedorenko; Alessandro Lenci

Cognitive Science 47 (11):e13386 (2023)

Abstract

Word co‐occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs’ semantic abilities is whether they acquire generalized knowledge of common events. Here, we test whether five pretrained LLMs (from 2018's BERT to 2023's MPT) assign a higher likelihood to plausible descriptions of agent−patient interactions than to minimally different implausible versions of the same event. Using three curated sets of minimal sentence pairs (total n = 1215), we found that pretrained LLMs possess substantial event knowledge, outperforming other distributional language models. In particular, they almost always assign a higher likelihood to possible versus impossible events (The teacher bought the laptop vs. The laptop bought the teacher). However, LLMs show less consistent preferences for likely versus unlikely events (The nanny tutored the boy vs. The boy tutored the nanny). In follow‐up analyses, we show that (i) LLM scores are driven by both plausibility and surface‐level sentence features, (ii) LLM scores generalize well across syntactic variants (active vs. passive constructions) but less well across semantic variants (synonymous sentences), (iii) some LLM errors mirror human judgment ambiguity, and (iv) sentence plausibility serves as an organizing dimension in internal LLM representations. Overall, our results show that important aspects of event knowledge naturally emerge from distributional linguistic patterns, but also highlight a gap between representations of possible/impossible and likely/unlikely events.
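The minimal-pair comparison described in the abstract can be illustrated with a short sketch. It assumes only that some scoring function returns a sentence-level score (for example, a log-likelihood) and counts how often the plausible member of a pair scores higher. The names `minimal_pair_accuracy`, `toy_score`, and `TOY_SCORES` are hypothetical, and the numbers are invented placeholders, not scores from any model or results from the paper.

```python
from typing import Callable, Iterable, List, Tuple

SentencePair = Tuple[str, str]  # (plausible, implausible)


def minimal_pair_accuracy(
    pairs: Iterable[SentencePair],
    score: Callable[[str], float],
) -> float:
    """Return the fraction of pairs whose plausible sentence scores strictly higher."""
    pair_list: List[SentencePair] = list(pairs)
    if not pair_list:
        raise ValueError("need at least one sentence pair")
    wins = sum(
        1 for plausible, implausible in pair_list
        if score(plausible) > score(implausible)
    )
    return wins / len(pair_list)


# Invented placeholder scores standing in for a model's sentence scores.
# These are NOT real model outputs; they only make the sketch runnable.
TOY_SCORES = {
    "The teacher bought the laptop.": -20.0,
    "The laptop bought the teacher.": -35.0,
    "The nanny tutored the boy.": -24.0,
    "The boy tutored the nanny.": -23.5,
}


def toy_score(sentence: str) -> float:
    """Hypothetical scorer: look up a made-up score for a known sentence."""
    return TOY_SCORES[sentence]


PAIRS: List[SentencePair] = [
    # possible vs. impossible event
    ("The teacher bought the laptop.", "The laptop bought the teacher."),
    # likely vs. unlikely event
    ("The nanny tutored the boy.", "The boy tutored the nanny."),
]

accuracy = minimal_pair_accuracy(PAIRS, toy_score)
print(f"Minimal-pair accuracy on toy scores: {accuracy:.2f}")
```

In practice `toy_score` would be replaced by a function that queries an actual language model; how the paper derives sentence scores for each model is not specified in the abstract, so that step is left open here.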

Keywords

Artificial neural networks; Generalized event knowledge; Language models; Plausibility; Semantics; Syntax; Typicality; World knowledge

DOI

10.1111/cogs.13386
