Enriching a thesaurus to improve retrieval of audiovisual documents

Laura Hollink, Véronique Malaisé, Guus Schreiber

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

2 Scopus citations

Abstract

In many archives of audiovisual documents, retrieval is done using metadata from a structured vocabulary or thesaurus. In practice, many of these thesauri have limited or no structure. The objective of this paper is to find out whether retrieval of audiovisual resources from a collection indexed with an in-house thesaurus can be improved by anchoring the thesaurus to an external, semantically richer thesaurus. We propose a method to enrich the structure of a thesaurus and we investigate its added value for retrieval purposes. We first anchor the thesaurus to an external resource, WordNet. From this anchoring we infer relations between pairs of terms in the thesaurus that were previously unrelated. We employ the enriched thesaurus in a retrieval experiment on a TRECVid 2007 dataset. The results are promising: with simple techniques we are able to enrich a thesaurus in such a way that it adds to retrieval performance.
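The anchoring and inference steps described in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the inference rules, so everything here is an assumption made for illustration: a small hand-written hypernym map stands in for WordNet, the `ANCHORS` table is a hypothetical anchoring of in-house terms to synset identifiers, and the rule "term A is narrower than term B if B's anchor is a transitive hypernym of A's anchor" is one plausible way to infer relations between previously unrelated terms. The `expand_query` helper shows one hypothetical way such relations might be used at retrieval time; it is not taken from the paper.

```python
from itertools import permutations

# Toy stand-in for WordNet: synset id -> direct hypernym synset id.
# Hypothetical data, hand-written for this sketch.
HYPERNYM = {
    "dog.n.01": "canine.n.02",
    "canine.n.02": "carnivore.n.01",
    "cat.n.01": "feline.n.01",
    "feline.n.01": "carnivore.n.01",
    "carnivore.n.01": "mammal.n.01",
    "mammal.n.01": "animal.n.01",
    "car.n.01": "motor_vehicle.n.01",
    "motor_vehicle.n.01": "vehicle.n.01",
}

# Hypothetical anchoring: in-house thesaurus term -> external synset id.
ANCHORS = {
    "dogs": "dog.n.01",
    "cats": "cat.n.01",
    "mammals": "mammal.n.01",
    "cars": "car.n.01",
    "vehicles": "vehicle.n.01",
}


def ancestors(synset):
    """Return all transitive hypernyms of a synset in the toy hierarchy."""
    found = []
    while synset in HYPERNYM:
        synset = HYPERNYM[synset]
        found.append(synset)
    return found


def infer_broader(anchors):
    """Infer (narrower, broader) term pairs from the anchored hierarchy."""
    inferred = set()
    for narrow, broad in permutations(anchors, 2):
        # Assumed rule: the broader term's anchor is an ancestor of the narrower term's anchor.
        if anchors[broad] in ancestors(anchors[narrow]):
            inferred.add((narrow, broad))
    return inferred


def expand_query(term, relations):
    """Expand a query term with every term inferred to be narrower than it."""
    return {term} | {narrow for narrow, broad in relations if broad == term}


relations = infer_broader(ANCHORS)
expanded = expand_query("mammals", relations)
```

With this toy data, a query for "mammals" would also match documents indexed with "dogs" or "cats", even though the in-house thesaurus itself contained no relation between those terms.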

Original language: English
Title of host publication: Semantic Multimedia - Third International Conference on Semantic and Digital Media Technologies, SAMT 2008, Proceedings
Pages: 47-60
Number of pages: 14
State: Published - 2008
Externally published: Yes
Event: 3rd International Conference on Semantic and Digital Media Technologies, SAMT 2008 - Koblenz, Germany
Duration: Dec 3 2008 to Dec 5 2008

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5392 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 3rd International Conference on Semantic and Digital Media Technologies, SAMT 2008
Country/Territory: Germany
City: Koblenz
Period: 12/3/08 to 12/5/08
