Learning Diphone-Based Segmentation

Robert Daland; Janet B. Pierrehumbert

Download from

www.socsci.uci.edu

More download options

Learning Diphone-Based Segmentation

Robert Daland & Janet B. Pierrehumbert

Cognitive Science 35 (1):119-155 (2011) Copy BIBT_EX

Abstract

This paper reconsiders the diphone-based word segmentation model of Cairns, Shillcock, Chater, and Levy (1997) and Hockema (2006), previously thought to be unlearnable. A statistically principled learning model is developed using Bayes’ theorem and reasonable assumptions about infants’ implicit knowledge. The ability to recover phrase-medial word boundaries is tested using phonetic corpora derived from spontaneous interactions with children and adults. The (unsupervised and semi-supervised) learning models are shown to exhibit several crucial properties. First, only a small amount of language exposure is required to achieve the model’s ceiling performance, equivalent to between 1 day and 1 month of caregiver input. Second, the models are robust to variation, both in the free parameter and the input representation. Finally, both the learning and baseline models exhibit undersegmentation, argued to have significant ramifications for speech processing as a whole

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

Keywords

Word segmentation Language acquisition Computational model Bayesian Unsupervised learning

Reprint years

DOI

10.1111/j.1551-6709.2010.01160.x

My notes

Analytics

Added to PP
2010-12-10

Downloads
118 (#149,267)

6 months
8 (#346,782)

Historical graph of downloads

How can I increase my downloads?

Citations of this work

Cognitive science in the era of artificial intelligence: A roadmap for reverse-engineering the infant language-learner.Emmanuel Dupoux - 2018 - Cognition 173 (C):43-59.

Learning Phonemes With a Proto-Lexicon.Andrew Martin, Sharon Peperkamp & Emmanuel Dupoux - 2013 - Cognitive Science 37 (1):103-124.

How much does prosody help word segmentation? A simulation study on infant-directed speech.Bogdan Ludusan, Alejandrina Cristia, Reiko Mazuka & Emmanuel Dupoux - 2022 - Cognition 219 (C):104961.

Does morphological complexity affect word segmentation? Evidence from computational modeling.Georgia Loukatou, Sabine Stoll, Damian Blasi & Alejandrina Cristia - 2022 - Cognition 220 (C):104960.

Consequences of phonological variation for algorithmic word segmentation.Caroline Beech & Daniel Swingley - 2023 - Cognition 235 (C):105401.

View all 8 citations / Add more citations

References found in this work

Finding Structure in Time.Jeffrey L. Elman - 1990 - Cognitive Science 14 (2):179-211.

Shortlist B: A Bayesian model of continuous speech recognition.Dennis Norris & James M. McQueen - 2008 - Psychological Review 115 (2):357-395.

Distributional regularity and phonotactic constraints are useful for segmentation.Michael R. Brent & Timothy A. Cartwright - 1996 - Cognition 61 (1-2):93-125.

A Bayesian framework for word segmentation: Exploring the effects of context.Sharon Goldwater, Thomas L. Griffiths & Mark Johnson - 2009 - Cognition 112 (1):21-54.

The role of exposure to isolated words in early vocabulary development.Michael R. Brent & Jeffrey Mark Siskind - 2001 - Cognition 81 (2):B33-B44.

View all 13 references / Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Learning Diphone-Based Segmentation

Abstract

Categories

Keywords

Reprint years

DOI

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Citations of this work

References found in this work