Using automatically acquired predominant senses for word sense disambiguation

Diana McCarthy, Rob Koeling, Julie Weeds, John Carroll

Research output: Contribution to conferencePaperpeer-review

23 Scopus citations

Abstract

In word sense disambiguation (WSD), the heuristic of choosing the most common sense is extremely powerful because the distribution of the senses of a word is often skewed. The first (or predominant) sense heuristic assumes the availability of hand-tagged data. Whilst there are hand-tagged corpora available for some languages, these are relatively small in size and many word forms either do not occur, or occur infrequently. In this paper we investigate the performance of an unsupervised first sense heuristic where predominant senses are acquired automatically from raw text. We evaluate on both the SENSEVAL-2 and SENSEVAL-3 English all-words data. For accurate WSD the first sense heuristic should be used only as a back-off, where the evidence from the context is not strong enough. In this paper however, we examine the performance of the automatically acquired first sense in isolation since it turned out that the first sense taken from SemCor outperformed many systems in SENSEVAL-2.

Original languageEnglish
Pages151-154
Number of pages4
StatePublished - 2004
Externally publishedYes
Event3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, SENSEVAL@ACL 2004 - Barcelona, Spain
Duration: Jul 25 2004Jul 26 2004

Conference

Conference3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, SENSEVAL@ACL 2004
Country/TerritorySpain
CityBarcelona
Period07/25/0407/26/04

Fingerprint

Dive into the research topics of 'Using automatically acquired predominant senses for word sense disambiguation'. Together they form a unique fingerprint.

Cite this