
Robust garden path parsing

Published online by Cambridge University Press: 23 February 2004

BRIAN ROARK
Affiliation:
AT&T Labs - Research, 180 Park Avenue, Building 103, Room E145, Florham Park, NJ 07932-0971, USA e-mail: [email protected]

Abstract

This paper presents modifications to a standard probabilistic context-free grammar that enable a predictive parser to avoid garden pathing without resorting to any ad hoc heuristic repair. The resulting parser is shown to apply efficiently to both newspaper text and telephone conversations with complete coverage and excellent accuracy. The distribution over trees is peaked enough to allow the parser to find parses efficiently, even with the much larger search space resulting from overgeneration. Empirical results are provided for both Wall Street Journal and Switchboard test corpora.

Type
Papers
Copyright
© 2004 Cambridge University Press
