Effectively identifying users' research interests for scholarly reference management and discovery

Marco Rossetti, Benjamin Pettit, Saúl Vargas, Daniel Kershaw, Kris Jack, Davide Magatti, Maya Hristakeva

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Discovering users' interests is essential in order to help them explore resources in large digital repositories. In particular, correctly identifying users' interests is commonly a good approach for organising information and providing personalised recommendations. We consider here the case of discovering users' research interests in Mendeley a research platform for scholarly article management and discovery. Prior work in this area has considered approaches such as matrix factorisation and text-based topic modelling for inferring topics of interest in recommendation scenarios. These approaches present several problems, such as little or no interpretability of the inferred topics and difficulty handling similarities in vocabulary in different research disciplines. We present an effective solution for extracting coherent and interpretable research topics that leverages the reference management data in Mendeley in a three-step approach: 1) a topic model based on the interactions between users and articles rather than article content, 2) keyword extraction to label the topics using article titles and author-declared keywords and 3) identifying the research interests of users based on the articles that they have added to their libraries. An evaluation comprised of a research interest prediction task and an article recommendation task shows the validity of our proposal in different research disciplines (clearly outperforming a text-based latent topic model) and provides further insights regarding the effects of number of latent topics in the model and the trade-off between recency and quantity of the users' libraries.

Original languageEnglish
Title of host publicationProceedings of the 1st Workshop on Scholarly Web Mining, SWM 2017
PublisherAssociation for Computing Machinery, Inc
Pages17-24
Number of pages8
ISBN (Electronic)9781450352406
DOIs
Publication statusPublished - Feb 10 2017
Externally publishedYes
Event1st Workshop on Scholarly Web Mining, SWM 2017 - Cambridge, United Kingdom
Duration: Feb 10 2017 → …

Publication series

NameACM International Conference Proceeding Series
VolumePart F127853

Conference

Conference1st Workshop on Scholarly Web Mining, SWM 2017
CountryUnited Kingdom
CityCambridge
Period02/10/17 → …

    Fingerprint

Keywords

  • Collaborative filtering
  • Explanations
  • Scholarly article recommendations
  • Topic labelling
  • Topic modelling

Cite this

Rossetti, M., Pettit, B., Vargas, S., Kershaw, D., Jack, K., Magatti, D., & Hristakeva, M. (2017). Effectively identifying users' research interests for scholarly reference management and discovery. In Proceedings of the 1st Workshop on Scholarly Web Mining, SWM 2017 (pp. 17-24). (ACM International Conference Proceeding Series; Vol. Part F127853). Association for Computing Machinery, Inc. https://doi.org/10.1145/3057148.3057151