An experimental study on unsupervised graph-based word sense disambiguation

George Tsatsaronis, Iraklis Varlamis, Kjetil Nørv̊ag

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

17 Scopus citations

Abstract

Recent research works on unsupervised word sense disambiguation report an increase in performance, which reduces their handicap from the respective supervised approaches for the same task. Among the latest state of the art methods, those that use semantic graphs reported the best results. Such methods create a graph comprising the words to be disambiguated and their corresponding candidate senses. The graph is expanded by adding semantic edges and nodes from a thesaurus. The selection of the most appropriate sense per word occurrence is then made through the use of graph processing algorithms that offer a degree of importance among the graph vertices. In this paper we experimentally investigate the performance of such methods. We additionally evaluate a new method, which is based on a recently introduced algorithm for computing similarity between graph vertices, P-Rank. We evaluate the performance of all alternatives in two benchmark data sets, Senseval 2 and 3, using WordNet. The current study shows the differences in the performance of each method, when applied on the same semantic graph representation, and analyzes the pros and cons of each method for each part of speech separately. Furthermore, it analyzes the levels of inter-agreement in the sense selection level, giving further insight on how these methods could be employed in an unsupervised ensemble for word sense disambiguation.

Original languageEnglish
Title of host publicationComputational Linguistics and Intelligent Text Processing - 11th International Conference, CICLing 2010, Proceedings
Pages184-198
Number of pages15
DOIs
StatePublished - 2010
Externally publishedYes
Event11th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2010 - Iasi, Romania
Duration: Mar 21 2010Mar 27 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6008 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2010
Country/TerritoryRomania
CityIasi
Period03/21/1003/27/10

Fingerprint

Dive into the research topics of 'An experimental study on unsupervised graph-based word sense disambiguation'. Together they form a unique fingerprint.

Cite this