TY - JOUR
T1 - ContextD
T2 - An algorithm to identify contextual properties of medical terms in a dutch clinical corpus
AU - Afzal, Zubair
AU - Pons, Ewoud
AU - Kang, Ning
AU - Sturkenboom, Miriam C.J.M.
AU - Schuemie, Martijn J.
AU - Kors, Jan A.
N1 - Publisher Copyright:
© Afzal et al.; licensee BioMed Central Ltd.
PY - 2014/11/29
Y1 - 2014/11/29
N2 - Background: In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. Results: The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. Conclusions: The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development.
AB - Background: In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. Results: The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. Conclusions: The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development.
KW - Contextual features
KW - Dutch electronic medical records
KW - Negation detection
UR - http://www.scopus.com/inward/record.url?scp=84923933223&partnerID=8YFLogxK
U2 - 10.1186/s12859-014-0373-3
DO - 10.1186/s12859-014-0373-3
M3 - Artículo
C2 - 25432799
AN - SCOPUS:84923933223
SN - 1471-2105
VL - 15
JO - BMC Bioinformatics
JF - BMC Bioinformatics
IS - 1
M1 - 373
ER -