Evaluating Open Information Extraction on Scientific and Medical Text

Dataset

Description

This dataset is the result of crowdsourced labelling of the extractions produced by two open information extraction tools, Open IE 4 and MinIE (linked below). Extractions were performed on two sets of sentences: randomly selected sentences from Wikipedia and randomly selected sentences from the OA-STM corpus.

The aim is to evaluate the effectiveness of open information extraction tools on scientific and medical text.

The initial datasets, the code for applying information extraction, the HITs, the labelling instructions, and the analysis code are all included above.
Date made available: Feb 16 2018
Publisher: Mendeley Data
