Summarizing answers in non-factoid community question-answering

H Song, Zhaochun Ren, Shangsong Liang, P Li, Jun Ma, Maarten De Rijke

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

44 Scopus citations

Abstract

We aim at summarizing answers in community question-answering (CQA). While most previous work focuses on factoid questionanswering, we focus on the non-factoid question-answering. Unlike factoid CQA, non-factoid question-answering usually requires passages as answers. The shortness, sparsity and diversity of answers form interesting challenges for summarization. To tackle these challenges, we propose a sparse coding-based summarization strategy that includes three core ingredients: short document expansion, sentence vectorization, and a sparse-coding optimization framework. Specifically, we extend each answer in a questionanswering thread to a more comprehensive representation via entity linking and sentence ranking strategies. From answers extended in this manner, each sentence is represented as a feature vector trained from a short text convolutional neural network model. We then use these sentence representations to estimate the saliency of candidate sentences via a sparse-coding framework that jointly considers candidate sentences and Wikipedia sentences as reconstruction items. Given the saliency vectors for all candidate sentences, we extract sentences to generate an answer summary based on a maximal marginal relevance algorithm. Experimental results on a benchmark data collection confirm the effectiveness of our proposed method in answer summarization of non-factoid CQA, and moreover, its significant improvement compared to state-of-the-art baselines in terms of ROUGE metrics.

Original languageAmerican English
Title of host publicationWSDM 2017 - Proceedings of the 10th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery, Inc
Pages405-414
Number of pages10
ISBN (Electronic)9781450346757
DOIs
StatePublished - 2017
Event10th ACM International Conference on Web Search and Data Mining, WSDM 2017 - Cambridge, United Kingdom
Duration: Feb 6 2017Feb 10 2017

Publication series

NameWSDM 2017 - Proceedings of the 10th ACM International Conference on Web Search and Data Mining

Conference

Conference10th ACM International Conference on Web Search and Data Mining, WSDM 2017
Country/TerritoryUnited Kingdom
CityCambridge
Period02/6/1702/10/17

Keywords

  • Community question-answering
  • Document summarization
  • Short text processing
  • Sparse coding

Fingerprint

Dive into the research topics of 'Summarizing answers in non-factoid community question-answering'. Together they form a unique fingerprint.

Cite this