TY - GEN
T1 - A generalized vector space model for text retrieval based on semantic relatedness
AU - Tsatsaronis, George
AU - Panagiotopoulou, Vicky
PY - 2009
Y1 - 2009
N2 - Generalized Vector Space Models (GVSM) extend the standard Vector Space Model (VSM) by embedding additional types of information, besides terms, in the representation of documents. An interesting type of information that can be used in such models is semantic information from word thesauri like WordNet. Previous attempts to construct GVSM reported contradicting results. The most challenging problem is to incorporate the semantic information in a theoretically sound and rigorous manner and to modify the standard interpretation of the VSM. In this paper we present a new GVSM model that exploits WordNet's semantic information. The model is based on a new measure of semantic relatedness between terms. Experimental study conducted in three TREC collections reveals that semantic information can boost text retrieval performance with the use of the proposed GVSM.
AB - Generalized Vector Space Models (GVSM) extend the standard Vector Space Model (VSM) by embedding additional types of information, besides terms, in the representation of documents. An interesting type of information that can be used in such models is semantic information from word thesauri like WordNet. Previous attempts to construct GVSM reported contradicting results. The most challenging problem is to incorporate the semantic information in a theoretically sound and rigorous manner and to modify the standard interpretation of the VSM. In this paper we present a new GVSM model that exploits WordNet's semantic information. The model is based on a new measure of semantic relatedness between terms. Experimental study conducted in three TREC collections reveals that semantic information can boost text retrieval performance with the use of the proposed GVSM.
UR - http://www.scopus.com/inward/record.url?scp=70349962448&partnerID=8YFLogxK
U2 - 10.3115/1609179.1609188
DO - 10.3115/1609179.1609188
M3 - Contribución a la conferencia
AN - SCOPUS:70349962448
SN - 9781932432169
T3 - EACL 2009 - 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings
SP - 70
EP - 78
BT - Student Research Workshop, Demonstrations, Tutorial Abstracts
PB - Association for Computational Linguistics (ACL)
T2 - 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009
Y2 - 30 March 2009 through 3 April 2009
ER -