Traffic analytics for linked data publishers

Luca Costabello, Pierre Yves Vandenbussche, Gofran Shukair, Corine Deliot, Neil Wilson

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We present a traffic analytics platform for servers that publish Linked Data. To the best of our knowledge, this is the first system that mines access logs of registered Linked Data servers to extract traffic insights on daily basis and without human intervention. The framework extracts Linked Data-specific traffic metrics from log records of HTTP lookups and SPARQL queries, and provides insights not available in traditional web analytics tools. Among all, we detect visitor sessions with a variant of hierarchical agglomerative clustering. We also identify workload peaks of SPARQL endpoints by detecting heavy and light SPARQL queries with supervised learning. The platform has been tested on 13 months of access logs of the British National Bibliography RDF dataset.

Original languageEnglish
Title of host publicationThe Semantic Web - 14th International Conference, ESWC 2017, Proceedings
EditorsDiana Maynard, Aldo Gangemi, Rinke Hoekstra, Eva Blomqvist, Olaf Hartig, Pascal Hitzler
PublisherSpringer Verlag
Pages3-18
Number of pages16
ISBN (Print)9783319580678
DOIs
StatePublished - 2017
Externally publishedYes
Event14th Extended Semantic Web Conference, ESWC 2017 - Portoroz, Slovenia
Duration: May 28 2017Jun 1 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10249 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th Extended Semantic Web Conference, ESWC 2017
Country/TerritorySlovenia
CityPortoroz
Period05/28/1706/1/17

Keywords

  • Data publication
  • Linked data
  • SPARQL
  • Traffic analytics

Fingerprint

Dive into the research topics of 'Traffic analytics for linked data publishers'. Together they form a unique fingerprint.

Cite this