Natural Language Grammar Induction using a Constituent-Context Model

Abstract

This paper presents a novel approach to the unsupervised learning of syntactic analyses of natural language text. Most previous work has focused on maximizing likelihood according to generative PCFG models. In contrast, we employ a simpler probabilistic model over trees based directly on constituent identity and linear context, and use an EM-like iterative procedure to induce structure. This method produces much higher quality analyses, giving the best published results on the ATIS dataset.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,139

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

  • Only published works are available at libraries.

Similar books and articles

A grammar systems approach to natural language grammar.M. Dolores Jiménez López - 2006 - Linguistics and Philosophy 29 (4):419 - 454.
Talking about trees and truth-conditions.Reinhard Muskens - 2001 - Journal of Logic, Language and Information 10 (4):417-455.

Analytics

Added to PP
2010-12-22

Downloads
19 (#732,197)

6 months
3 (#760,965)

Historical graph of downloads
How can I increase my downloads?