TY - GEN
T1 - TRUMIT
T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2011
AU - Neumayer, Robert
AU - Tsatsaronis, George
AU - Nørvåg, Kjetil
PY - 2011
Y1 - 2011
N2 - Due to the nature of textual data the application of association rule mining in text corpora has attracted the focus of the research scientific community for years. In this paper we demonstrate a system that can efficiently mine association rules from text. The system annotates terms using several annotators, and extracts text association rules between terms or categories of terms. An additional contribution of this work is the inclusion of novel unsupervised evaluation measures for weighting and ranking the importance of the text rules. We demonstrate the functionalities of our system with two text collections, a set of Wikileaks documents, and one from TREC-7.
AB - Due to the nature of textual data the application of association rule mining in text corpora has attracted the focus of the research scientific community for years. In this paper we demonstrate a system that can efficiently mine association rules from text. The system annotates terms using several annotators, and extracts text association rules between terms or categories of terms. An additional contribution of this work is the inclusion of novel unsupervised evaluation measures for weighting and ranking the importance of the text rules. We demonstrate the functionalities of our system with two text collections, a set of Wikileaks documents, and one from TREC-7.
UR - http://www.scopus.com/inward/record.url?scp=80052396601&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-23808-6_48
DO - 10.1007/978-3-642-23808-6_48
M3 - Contribución a la conferencia
AN - SCOPUS:80052396601
SN - 9783642238079
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 646
EP - 649
BT - Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2011, Proceedings
Y2 - 5 September 2011 through 9 September 2011
ER -