The Role of Negative Information in Distributional Semantic Learning

Brendan T. Johns; Douglas J. K. Mewhort; Michael N. Jones

Cognitive Science 43 (5):e12730 (2019)

Abstract

Distributional models of semantics learn word meanings from contextual co‐occurrence patterns across a large sample of natural language. Early models, such as LSA and HAL (Landauer & Dumais, 1997; Lund & Burgess, 1996), counted co‐occurrence events; later models, such as BEAGLE (Jones & Mewhort, 2007), replaced counting co‐occurrences with vector accumulation. All of these models learned from positive information only: Words that occur together within a context become related to each other. A recent class of distributional models, referred to as neural embedding models, is based on a prediction process embedded in the functioning of a neural network: Such models predict words that should surround a target word in a given context (e.g., word2vec; Mikolov, Sutskever, Chen, Corrado, & Dean, 2013). An error signal derived from the prediction is used to update each word's representation via backpropagation. However, another key difference in predictive models is their use of negative information in addition to positive information to develop a semantic representation. The models use negative examples to predict words that should not surround a word in a given context. As before, an error signal derived from the prediction prompts an update of the word's representation, a procedure referred to as negative sampling. Standard uses of word2vec recommend a greater or equal ratio of negative to positive sampling. The use of negative information in developing a representation of semantic information is often thought to be intimately associated with word2vec's prediction process. We assess the role of negative information in developing a semantic representation and show that its power does not reflect the use of a prediction mechanism. Finally, we show how negative information can be efficiently integrated into classic count‐based semantic models using parameter‐free analytical transformations.
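
The negative-sampling procedure described in the abstract can be illustrated with a minimal sketch of a single skip-gram update. This is not the authors' code and not the word2vec reference implementation; the function name `sgns_step`, the matrices `W_in` and `W_out`, and the learning rate `lr` are hypothetical names chosen for illustration. The observed context word is treated as a positive example (label 1) and each sampled word as a negative example (label 0); the prediction error for each drives the update.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def sgns_step(W_in, W_out, target, context, negatives, lr=0.05):
    """One illustrative negative-sampling update for a (target, context) pair.

    W_in and W_out are (vocab_size, dim) arrays of target and context
    vectors (hypothetical names). `negatives` lists indices of sampled
    words that did NOT occur with the target. Arrays are updated in place.
    """
    v = W_in[target].copy()
    grad_v = np.zeros_like(v)
    # Label 1.0 for the observed context word, 0.0 for each negative sample.
    examples = [(context, 1.0)] + [(n, 0.0) for n in negatives]
    for word, label in examples:
        u = W_out[word]
        err = sigmoid(v @ u) - label      # prediction error for this word
        grad_v += err * u                 # accumulate gradient for the target
        W_out[word] = u - lr * err * v    # move context vector against error
    W_in[target] = v - lr * grad_v
    return W_in, W_out


# Usage: toy vocabulary of 3 words with 2-dimensional vectors.
W_in = np.full((3, 2), 0.5)
W_out = np.full((3, 2), 0.5)
sgns_step(W_in, W_out, target=0, context=1, negatives=[2])
```

In this toy setup the update raises the target's similarity to the observed context word and lowers its similarity to the sampled negative, which is the sense in which the negative example carries information about what should not co-occur.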

Keywords

Big data; Cognitive modeling; Distributional semantics; Machine learning; Natural language processing

DOI

10.1111/cogs.12730

Citations of this work

Investigating the Extent to which Distributional Semantic Models Capture a Broad Range of Semantic Relations. Kevin S. Brown, Eiling Yee, Gitte Joergensen, Melissa Troyer, Elliot Saltzman, Jay Rueckl, James S. Magnuson & Ken McRae - 2023 - Cognitive Science 47 (5):e13291.

Determining the Relativity of Word Meanings Through the Construction of Individualized Models of Semantic Memory. Brendan T. Johns - 2024 - Cognitive Science 48 (2):e13413.

References found in this work

A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Thomas K. Landauer & Susan T. Dumais - 1997 - Psychological Review 104 (2):211-240.

Topics in semantic representation. Thomas L. Griffiths, Mark Steyvers & Joshua B. Tenenbaum - 2007 - Psychological Review 114 (2):211-244.

Representing word meaning and order information in a composite holographic lexicon. Michael N. Jones & Douglas J. K. Mewhort - 2007 - Psychological Review 114 (1):1-37.

Optimal foraging in semantic memory. Thomas T. Hills, Michael N. Jones & Peter M. Todd - 2012 - Psychological Review 119 (2):431-440.

The Myth of Cognitive Decline: Non‐Linear Dynamics of Lifelong Learning. Michael Ramscar, Peter Hendrix, Cyrus Shaoul, Petar Milin & Harald Baayen - 2014 - Topics in Cognitive Science 6 (1):5-42.
