Abstract
There is significant evidence in the literature that integrating knowledge about multiword expressions can improve shallow parsing accuracy. We present an experimental study to quantify this improvement, focusing on compound nominals, proper names and adjective-noun constructions. The evaluation set of multiword expressions is derived from Word-Net and the textual data are downloaded from the web. We use a classification method to aid human annotation of output parses. This method allows us to conduct experiments on a large dataset of unannotated data. Experiments show that knowledge about multiword expressions leads to an increase of between 7.5% and 9.5% in accuracy of shallow parsing in sentences containing these multiword expressions.
Original language | English |
---|---|
Title of host publication | Human language technologies |
Subtitle of host publication | the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics |
Place of Publication | Stroudsburg, PA |
Publisher | Association for Computational Linguistics |
Pages | 636-644 |
Number of pages | 9 |
ISBN (Print) | 978-1-932432-65-7 |
Publication status | Published - 2010 |
Event | 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics - Los Angeles, United States Duration: 2 Jun 2010 → 4 Jun 2010 |
Conference
Conference | 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics |
---|---|
Country/Territory | United States |
City | Los Angeles |
Period | 2/06/10 → 4/06/10 |