Accurate Unlexicalized Parsing

Dan Klein; Christopher D. Manning

nlp.stanford.edu

Accurate Unlexicalized Parsing

Abstract

We demonstrate that an unlexicalized PCFG can parse much more accurately than previously shown, by making use of simple, linguistically motivated state splits, which break down false independence assumptions latent in a vanilla treebank grammar. Indeed, its performance of 86.36% (LP/LR F1) is better than that of early lexicalized PCFG models, and surprisingly close to the current state-of-theart. This result has potential uses beyond establishing a strong lower bound on the maximum possible accuracy of unlexicalized models: an unlexicalized PCFG is much more compact, easier to replicate, and easier to interpret than more complex lexical models, and the parsing algorithms are simpler, more widely understood, of lower asymptotic complexity, and easier to optimize.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

Author's Profile

Daniel Klein

Harvard University

Keywords

Add keywords

Reprint years

My notes

Analytics

Added to PP
2010-12-22

Downloads
36 (#431,270)

6 months
5 (#652,053)

Historical graph of downloads

How can I increase my downloads?

Author's Profile

Daniel Klein

Harvard University

Citations of this work

Expectation-based syntactic comprehension.Roger Levy - 2008 - Cognition 106 (3):1126-1177.

Recurrent neural network-based models for recognizing requisite and effectuation parts in legal texts.Truong-Son Nguyen, Le-Minh Nguyen, Satoshi Tojo, Ken Satoh & Akira Shimazu - 2018 - Artificial Intelligence and Law 26 (2):169-199.

Calibrating Generative Models: The Probabilistic Chomsky-Schützenberger Hierarchy.Thomas Icard - 2020 - Journal of Mathematical Psychology 95.

Generating Typed Dependency Parses from Phrase Structure Parses.Christopher Manning - unknown

Appellate Court Modifications Extraction for Portuguese.William Paulo Ducca Fernandes, Luiz José Schirmer Silva, Isabella Zalcberg Frajhof, Guilherme da Franca Couto Fernandes de Almeida, Carlos Nelson Konder, Rafael Barbosa Nasser, Gustavo Robichez de Carvalho, Simone Diniz Junqueira Barbosa & Hélio Côrtes Vieira Lopes - 2020 - Artificial Intelligence and Law 28 (3):327-360.

View all 23 citations / Add more citations

References found in this work

No references found.

Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Accurate Unlexicalized Parsing

Abstract

Author's Profile

Categories

Keywords

Reprint years

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Author's Profile

Citations of this work

References found in this work