Article contents
PP-Attachment Disambiguation for Swedish: Combining Unsupervised and Supervised Training Data
Published online by Cambridge University Press: 22 December 2008
Abstract
Structural ambiguity, particularly attachment of prepositional phrases, is a serious type of global ambiguity in Natural Language. The disambiguation becomes crucial when a syntactic analyzer must make the correct decision among at least two equally grammatical parse-trees for the same sentence. This paper attempts to find answers to the problem of how attachment ambiguity can be resolved by utilizing Machine Learning (ML) techniques. ML is founded on the assumption that the performance in cognitive tasks is based on the similarity of new situations (testing) to stored representations of earlier experiences (training). Therefore, a large amount of training data is an important prerequisite for providing a solution to the problem. A combination of unsupervised and restricted supervised acquisition of such data will be reported. Training is performed both on a subset of the content of the Gothenburg Lexical Database (GLDB), and on instances of large corpora annotated with coarse-grained semantic information. Testing is performed on corpora instances using a range of different algorithms and metrics. The application language is written Swedish.
- Type
- Research Article
- Information
- Copyright
- Copyright © Cambridge University Press 2000
References
REFERENCES
- 3
- Cited by