Gesellschaft für Informatik e.V.

Lecture Notes in Informatics

WM 2003: Professionelles Wissesmanagement - Erfahrungen und Visionen, Beiträge der 2. Konferenz Professionelles Wissensmanagement, 2.-4. April 2003 in Luzern. P-28, 43-50 (2003).

GI, Gesellschaft für Informatik, Bonn


Ulrich Reimer (ed.), Andreas Abecker (ed.), Steffen Staab (ed.), Gerd Stumme (ed.)

Copyright © GI, Gesellschaft für Informatik, Bonn


Ontologies in cross-language information retrieval

Martin Volk , Spela Vintar and Paul Buitelaar


We present an approach to using ontologies as interlingua in cross-language information retrieval in the medical domain. Our approach is based on using the Unified Medical Language System (UMLS) as the primary ontology. Documents and queries are annotated with multiple layers of linguistic information (part-of-speech tags, lemmas, phrase chunks). Based on this we identify medical terms and semantic relations between them and map them to their position in the ontology. The paper describes experiments in monolingual and cross-language document retrieval, performed on a corpus of medical abstracts. Results show that semantic information, specifically the combined use of concepts and relations, increases the precision in monolingual retrieval. In cross-language retrieval the semantic annotation outperforms machine translation of the queries, but the best results are achieved by combining a similarity thesaurus with the semantic codes.

Full Text: PDF

GI, Gesellschaft für Informatik, Bonn
ISBN 3-88579-357-1

Last changed 04.10.2013 17:56:54