Scalable semantic annotation of text using lexical and Web resources

Elias Zavitsanos, George Tsatsaronis, Iraklis Varlamis, Georgios Paliouras

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

In this paper we are dealing with the task of adding domain-specific semantic tags to a document, based solely on the domain ontology and generic lexical and Web resources. In this manner, we avoid the need for trained domain-specific lexical resources, which hinder the scalability of semantic annotation. More specifically, the proposed method maps the content of the document to concepts of the ontology, using the WordNet lexicon and Wikipedia. The method comprises a novel combination of measures of semantic relatedness and word sense disambiguation techniques to identify the most related ontology concepts for the document. We test the method on two case studies: (a) a set of summaries, accompanying environmental news videos, (b) a set of medical abstracts. The results in both cases show that the proposed method achieves reasonable performance, thus pointing to a promising path for scalable semantic annotation of documents.

Original languageEnglish
Title of host publicationArtificial Intelligence
Subtitle of host publicationTheories, Models and Applications - 6th Hellenic Conference on AI, SETN 2010, Proceedings
Pages287-296
Number of pages10
DOIs
StatePublished - 2010
Externally publishedYes
Event6th Hellenic Conference on Artificial Intelligence: Theories, Models and Applications, SETN 2010 - Athens, Greece
Duration: May 4 2010May 7 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6040 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th Hellenic Conference on Artificial Intelligence: Theories, Models and Applications, SETN 2010
Country/TerritoryGreece
CityAthens
Period05/4/1005/7/10

Fingerprint

Dive into the research topics of 'Scalable semantic annotation of text using lexical and Web resources'. Together they form a unique fingerprint.

Cite this