Predicting Age of Acquisition for Children's Early Vocabulary in Five Languages Using Language Model Surprisal

Cognitive Science 47 (9):e13334 (2023)
  Copy   BIBTEX

Abstract

What makes a word easy to learn? Early‐learned words are frequent and tend to name concrete referents. But words typically do not occur in isolation. Some words are predictable from their contexts; others are less so. Here, we investigate whether predictability relates to when children start producing different words (age of acquisition; AoA). We operationalized predictability in terms of a word's surprisal in child‐directed speech, computed using n‐gram and long‐short‐term‐memory (LSTM) language models. Predictability derived from LSTMs was generally a better predictor than predictability derived from n‐gram models. Across five languages, average surprisal was positively correlated with the AoA of predicates and function words but not nouns. Controlling for concreteness and word frequency, more predictable predicates and function words were learned earlier. Differences in predictability between languages were associated with cross‐linguistic differences in AoA: the same word (when it was a predicate) was produced earlier in languages where the word was more predictable.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 92,611

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Knowledge of language and phrasal vocabulary acquisition.Koenraad Kuiper - 2006 - Behavioral and Brain Sciences 29 (3):291-292.
What Is the “Context” for Contextual Vocabulary Acquisition?William J. Rapaport - 2003 - Proceedings of the 4th Joint International Conference on Cognitive Science/7th Australasian Society for Cognitive Science Conference 2:547-552.

Analytics

Added to PP
2023-09-12

Downloads
4 (#1,630,023)

6 months
2 (#1,206,551)

Historical graph of downloads
How can I increase my downloads?