TY - GEN
T1 - Enriching a thesaurus to improve retrieval of audiovisual documents
AU - Hollink, Laura
AU - Malaisé, Véronique
AU - Schreiber, Guus
PY - 2008
Y1 - 2008
N2 - In many archives of audiovisual documents, retrieval is done using metadata from a structured vocabulary or thesaurus. In practice, many of these thesauri have limited or no structure. The objective of this paper is to find out whether retrieval of audiovisual resources from a collection indexed with an in-house thesaurus can be improved by anchoring the thesaurus to an external, semantically richer thesaurus. We propose a method to enrich the structure of a thesaurus and we investigate its added value for retrieval purposes. We first anchor the thesaurus to an external resource, WordNet. From this anchoring we infer relations between pairs of terms in the thesaurus that were previously unrelated. We employ the enriched thesaurus in a retrieval experiment on a TRECVid 2007 dataset. The results are promising: with simple techniques we are able to enrich a thesaurus in such a way that it adds to retrieval performance.
AB - In many archives of audiovisual documents, retrieval is done using metadata from a structured vocabulary or thesaurus. In practice, many of these thesauri have limited or no structure. The objective of this paper is to find out whether retrieval of audiovisual resources from a collection indexed with an in-house thesaurus can be improved by anchoring the thesaurus to an external, semantically richer thesaurus. We propose a method to enrich the structure of a thesaurus and we investigate its added value for retrieval purposes. We first anchor the thesaurus to an external resource, WordNet. From this anchoring we infer relations between pairs of terms in the thesaurus that were previously unrelated. We employ the enriched thesaurus in a retrieval experiment on a TRECVid 2007 dataset. The results are promising: with simple techniques we are able to enrich a thesaurus in such a way that it adds to retrieval performance.
UR - http://www.scopus.com/inward/record.url?scp=58849128249&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-92235-3_6
DO - 10.1007/978-3-540-92235-3_6
M3 - Contribución a la conferencia
AN - SCOPUS:58849128249
SN - 3540922342
SN - 9783540922346
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 47
EP - 60
BT - Semantic Multimedia - Third International Conference on Semantic and Digital Media Technologies, SAMT 2008, Proceedings
T2 - 3rd International Conference on Semantic and Digital Media Technologies, SAMT 2008
Y2 - 3 December 2008 through 5 December 2008
ER -