Hostname: page-component-cd9895bd7-lnqnp Total loading time: 0 Render date: 2024-12-26T02:56:17.597Z Has data issue: false hasContentIssue false

The Danish SIMPLE lexicon and its application in content-based querying

Published online by Cambridge University Press:  04 May 2004

Bolette Sandford Pedersen
Affiliation:
Center for Sprogteknologi, Københavns Universitet, Njalsgade 80, DK-2300 S. E-mail: [email protected]
Patrizia Paggio
Affiliation:
Center for Sprogteknologi, Københavns Universitet, Njalsgade 80, DK-2300 S. E-mail: [email protected]
Get access

Abstract

This paper deals with the SIMPLE-DK lexicon, a computational lexicon for Danish developed at the Centre for Language Technology in Copenhagen within the European Union project SIMPLE. The general SIMPLE model, on which the Danish lexicon is based, is presented, and the way in which several specific aspects of Danish, such as nominal compounds and time expressions, are accommodated in this model is then described. Phrasal verbs – in particular phrasal motion verbs – are shown to be a challenging phenomenon since they are difficult to place in the SIMPLE event ontology, and pose problems regarding the interpretation of the directional particle they combine with. The encoding strategy that is proposed here accounts for compositional and non-compositional types of phrasal verb, and captures the relation between act-denoting and transition-denoting senses of the same verb in terms of regular polysemy. The final part of the paper deals with the exploitation of SIMPLE-DK as an ontological and lexical source in the Danish project on content-based querying OntoQuery. In the OntoQuery ontology, the structured concepts in SIMPLE-DK are combined with nutrition concepts, and the resulting ontology is used for matching evaluation. It is also discussed how selectional restrictions and qualia roles from SIMPLE-DK can be included in a conceptual grammar to be used for query and text analysis.

Type
Research Article
Copyright
© 2004 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)