TY - GEN
T1 - Exploring Unsupervised Features in Conditional Random Fields for Spanish Named Entity Recognition
AU - Copara, Jenny
AU - Ochoa, Jose
AU - Thorne, Camilo
AU - Glavas, Goran
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/2/1
Y1 - 2017/2/1
N2 - Unsupervised features such as word representations mostly given by word embeddings have been shown significantly improve semi supervised Named Entity Recognition (NER) for English language. In this work we investigate whether unsupervised features can boost (semi) supervised NER in Spanish. To do so, we use word representations and collocations as additional features in a linear chain Conditional Random Field (CRF) classifier. Experimental results (82.44% F-score on the CoNLL-2002 corpus and 65.72% F-score on Ancora Corpus) show that our approach is comparable to some state-of-art Deep Learning approaches for Spanish, in particular when using cross-lingual Word Representations.
AB - Unsupervised features such as word representations mostly given by word embeddings have been shown significantly improve semi supervised Named Entity Recognition (NER) for English language. In this work we investigate whether unsupervised features can boost (semi) supervised NER in Spanish. To do so, we use word representations and collocations as additional features in a linear chain Conditional Random Field (CRF) classifier. Experimental results (82.44% F-score on the CoNLL-2002 corpus and 65.72% F-score on Ancora Corpus) show that our approach is comparable to some state-of-art Deep Learning approaches for Spanish, in particular when using cross-lingual Word Representations.
KW - Conditional Random Fields
KW - NER for Spanish
KW - Unsupervised features
KW - Word embeddings
KW - Word Representations
UR - http://www.scopus.com/inward/record.url?scp=85015146121&partnerID=8YFLogxK
U2 - 10.1109/BRACIS.2016.059
DO - 10.1109/BRACIS.2016.059
M3 - Contribución a la conferencia
AN - SCOPUS:85015146121
T3 - Proceedings - 2016 5th Brazilian Conference on Intelligent Systems, BRACIS 2016
SP - 283
EP - 288
BT - Proceedings - 2016 5th Brazilian Conference on Intelligent Systems, BRACIS 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th Brazilian Conference on Intelligent Systems, BRACIS 2016
Y2 - 9 October 2016 through 12 October 2016
ER -