TY - JOUR
T1 - Detecting Chemical Reactions in Patents
AU - Yoshikawa, Hiyori
AU - Nguyen, Dat Quoc
AU - Zhai, Zenan
AU - Druckenbrodt, Christian
AU - Thorne, Camilo
AU - Akhondi, Saber A.
AU - Baldwin, Timothy
AU - Verspoor, Karin
N1 - Publisher Copyright:
© 2019, Australasian Language Technology Association. All rights reserved.
PY - 2019
Y1 - 2019
N2 - Extracting chemical reactions from patents is a crucial task for chemists working on chemical exploration. In this paper we introduce the novel task of detecting the textual spans that describe or refer to chemical reactions within patents. We formulate this task as a paragraph-level sequence tagging problem, where the system is required to return a sequence of paragraphs that contain a description of a reaction. To address this new task, we construct an annotated dataset from an existing proprietary database of chemical reactions manually extracted from patents. We introduce several baseline methods for the task and evaluate them over our dataset. Through error analysis, we discuss what makes the task complex and challenging, and suggest possible directions for future research.
AB - Extracting chemical reactions from patents is a crucial task for chemists working on chemical exploration. In this paper we introduce the novel task of detecting the textual spans that describe or refer to chemical reactions within patents. We formulate this task as a paragraph-level sequence tagging problem, where the system is required to return a sequence of paragraphs that contain a description of a reaction. To address this new task, we construct an annotated dataset from an existing proprietary database of chemical reactions manually extracted from patents. We introduce several baseline methods for the task and evaluate them over our dataset. Through error analysis, we discuss what makes the task complex and challenging, and suggest possible directions for future research.
UR - http://www.scopus.com/inward/record.url?scp=85084176097&partnerID=8YFLogxK
M3 - Artículo de la conferencia
AN - SCOPUS:85084176097
SN - 1834-7037
VL - 17
JO - Proceedings of the Australasian Language Technology Workshop
JF - Proceedings of the Australasian Language Technology Workshop
T2 - 17th Annual Workshop of the Australasian Language Technology Association, ALTA 2019
Y2 - 4 December 2019 through 6 December 2019
ER -