Header

UZH-Logo

Maintenance Infos

Ontologies in cross-language information retrieval


Volk, Martin; Vintar, S; Buitelaar, P (2003). Ontologies in cross-language information retrieval. In: 2nd Conference on Professional Knowledge Management, Luzern, 2003 - 2003.

Abstract

We present an approach to using ontologies as interlingua in cross-language information retrieval in the medical domain. Our approach is based on using the Unified Medical Language System (UMLS) as the primary ontology. Documents and queries are annotated with multiple layers of linguistic information (part-of-speech tags, lemmas, phrase chunks). Based on this we identify medical terms and semantic relations between them and map them to their position in the ontology. The paper describes experiments in monolingual and cross-language document retrieval, performed on a corpus of medical abstracts. Results show that semantic information, specifically the combined use of concepts and relations, increases the precision in monolingual retrieval. In cross-language retrieval the semantic annotation outperforms machine translation of the queries, but the best results are achieved by combining a similarity thesaurus with the semantic codes.

Abstract

We present an approach to using ontologies as interlingua in cross-language information retrieval in the medical domain. Our approach is based on using the Unified Medical Language System (UMLS) as the primary ontology. Documents and queries are annotated with multiple layers of linguistic information (part-of-speech tags, lemmas, phrase chunks). Based on this we identify medical terms and semantic relations between them and map them to their position in the ontology. The paper describes experiments in monolingual and cross-language document retrieval, performed on a corpus of medical abstracts. Results show that semantic information, specifically the combined use of concepts and relations, increases the precision in monolingual retrieval. In cross-language retrieval the semantic annotation outperforms machine translation of the queries, but the best results are achieved by combining a similarity thesaurus with the semantic codes.

Statistics

Downloads

135 downloads since deposited on 24 Aug 2009
18 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:2003
Deposited On:24 Aug 2009 14:17
Last Modified:19 Feb 2018 21:12
OA Status:Green

Download

Download PDF  'Ontologies in cross-language information retrieval'.
Preview
Filetype: PDF
Size: 1MB