sa ›› 2018, Vol. 23 ›› Issue (3): 392-.doi: 10.1007/s12204-018-1954-5
QIN Ying (秦颖), ZENG Yingfei (曾颖菲)
出版日期:
2018-05-31
发布日期:
2018-06-17
通讯作者:
QIN Ying (秦颖)
E-mail:qinying@bfsu.edu.cn
QIN Ying (秦颖), ZENG Yingfei (曾颖菲)
Online:
2018-05-31
Published:
2018-06-17
Contact:
QIN Ying (秦颖)
E-mail:qinying@bfsu.edu.cn
摘要: Electronic Medical Records (EMR) with unstructured sentences and various conceptual expressions provide rich information for medical information extraction. However, common Named Entity Recognition (NER) in Natural Language Processing (NLP) are not well suitable for clinical NER in EMR. This study aims at applying neural networks to clinical concept extractions. We integrate Bidirectional Long Short-Term Memory Networks (Bi-LSTM) with a Conditional Random Fields (CRF) layer to detect three types of clinical named entities. Word representations fed into the neural networks are concatenated by character-based word embeddings and Contin- uous Bag of Words (CBOW) embeddings trained both on domain and non-domain corpus. We test our NER system on i2b2/VA open datasets and compare the performance with six related works, achieving the best result of NER with F1 value 0.853 7. We also point out a few speciˉc problems in clinical concept extractions which will give some hints to deeper studies.
中图分类号:
QIN Ying (秦颖), ZENG Yingfei (曾颖菲). Research of Clinical Named Entity Recognition Based on Bi-LSTM-CRF[J]. sa, 2018, 23(3): 392-.
QIN Ying (秦颖), ZENG Yingfei (曾颖菲). Research of Clinical Named Entity Recognition Based on Bi-LSTM-CRF[J]. Journal of Shanghai Jiao Tong University (Science), 2018, 23(3): 392-.
[1] | SAGER N, FRIEDMAN C, LYMAN M S. Review ofmedical language processing: computer managementof narrative data [J]. Computational Linguistics, 1989,15(3): 195-198. |
[2] | UZUNER O, SOUTH B R, SHEN S, et al. 2010i2b2/VA challenge on concepts, assertions, and rela-tions in clinical text [J]. Journal of the American Med-ical Informatics Association. 2011, 18(5): 552-556. |
[3] | CURRAN J R, CLARK S. Language indepen-dent NER using a maximum entropy tagger[C]//Proceedings of the 7th Conference on Natu-ral Language Learning at HLT-NAACL. Edmonton,Canada: ACL, 2003: 164-167. |
[4] | TJONG KIM SANG E F, DE MEULDER F.Introduction to the CoNLL-2003 shared task:Language-Independent named entity recognition[C]//Proceedings of the 7th Conference on NaturalLanguage Learning at HLT-NAACL. Edmonton,Canada: ACL, 2003: 142-147. |
[5] | COLLOBERT R, WESTON J, BOTTOU L, et al.Natural language processing (almost) from scratch [J].Journal of Machine Learning Research, 2011, 12(8):2493-2537. |
[6] | HUANG Z, XU W, YU K. Bidirectional LSTM-CRFmodels for sequence tagging [EB/OL]. (2015-08-19).[2017-06-21]. https://arxiv.org/pdf/1508.01991v1.pdf. |
[7] | LAMPLE G, BALLESTEROS M, SUBRAMANIAN S, et al. Neural architectures for named entity recog-nition [C]//Proceedings of NAACL-2016, San Diego,US: ACL, 2016: 260-270. |
[8] | HOCHREITER S, SCHMIDHUBER J. Long shortterm memory [J].Neural Computation, 1997, 9(8):1735-1780. |
[9] | LAFFERTY J, MCCALLUM A, PEREIRA F C N. Conditional random ˉelds: Probabilistic models for segmenting and labeling sequence data[C]//Proceedings of the 18th International Conferenceon Machine Learning. Williamstown, US: IMLS, 2001:282-289. |
[10] | BOAG W, WACOME K, NAUMANN T, et al.CliNER: A lightweight tool for clinical named entityrecognition [C]//AMIA Joint Summits on Clinical Research Informatics. San Francisco, CA: AMIA, 2015. |
[11] | DE BRUIJN B, CHERRY C, KIRITCHENKO S, et al.Machine-Learned solutions for three stages of clinicalinformation extraction: the state of the art at i2b22010 [J].Journal of the American Medical InformaticsAssociation, 2011, 18(5): 557-562. |
[12] | WU Y H, XU J, JIANG M, et al. A study of neuralword embeddings for named entity recognition in clinical text [C]//AMIA Annual Symposium Proceedings.2015: 1326-1333. |
[13] | JONNALAGADDA S, COHEN T, WU S, et al.Enhancing clinical concept extraction with distribu-tional semantics [J]. Journal of Biomedical Informatics, 2012,45(1): 129-140. |
[14] | CHALAPATHY R, BORZESHI, E Z, PICCARDIM. Bidirectional LSTM-CRF for clinical conceptextraction [EB/OL]. (2016-10-19). [2017-06-21].https://arxiv.org/pdf/1610.05858.pdf. |
[15] | MIKOLOV T, CHEN K, CORRADO G, et al.E±cient estimation of word representations invector space [EB/OL]. (2013-09-07). [2017-06-21].https://arxiv.org/pdf/1301.3781v3.pdf. |
[16] | BENGIO Y, SIMARD P, FRASCONI P. Learninglong-term dependencies with gradient descent is di±-cult [J].IEEE Transactions on Neural Networks, 1994,5(2): 157-166. |
[17] | GRAVES A, SCHMIDHUBER J. Framewise phonemeclassiˉcation with bidirectional LSTM and other neu-ral network architectures [J]. Neural Networks, 2005,18(5/6): 602-610. |
[18] | MIKOLOV T, SUTSKEVER I, CHEN K, et al. Dis-tributed representations of words and phrases andtheir compositionality [EB/OL]. (2013-10-16). [2017-06-21]. https://arxiv.org/pdf/1310.4546.pdf. |
[19] | FU X, ANANIADOU S. Improving the extraction ofclinical concepts from clinical records [C]//Proceedingsof the 4th Workshop on Building and Evaluating Re-sources for Health and Biomedical Text Processing.Reykjavik, Iceland: ELRA, 2014. |
[1] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(6): 757-767. |
[2] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(2): 190-201. |
[3] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(1): 99-111. |
[4] | . [J]. J Shanghai Jiaotong Univ Sci, 2021, 26(5): 577-586. |
[5] | . [J]. J Shanghai Jiaotong Univ Sci, 2021, 26(5): 587-597. |
[6] | ZHAN Zhu (占竹), ZHANG Wenjun (张文俊), CHEN Xia (陈霞), WANG Jun (汪军) . Objective Evaluation of Fabric Flatness Grade Based on Convolutional Neural Network[J]. J Shanghai Jiaotong Univ Sci, 2021, 26(4): 503-510. |
[7] | XU Jiangchang (许江长), HE Shamin (何莎敏), YU Dedong (于德栋), WU Yiqun (吴轶群), CHEN Xiaojun, (陈晓军). Automatic Segmentation Method for Cone-Beam Computed Tomography Image of the Bone Graft Region within Maxillary Sinus Based on the Atrous Spatial Pyramid Convolution Network[J]. J Shanghai Jiaotong Univ Sci, 2021, 26(3): 298-305. |
[8] | ZHANG Yue (张月), LIU Shijie (刘世界), LI Chunlai (李春来), WANG Jianyu (王建宇). Rethinking the Dice Loss for Deep Learning Lesion Segmentation in Medical Images[J]. J Shanghai Jiaotong Univ Sci, 2021, 26(1): 93-102. |
[9] | WU Jin, MIN Yu, YANG Xiaodie, MA Simin . Micro-Expression Recognition Algorithm Based on Information Entropy Feature[J]. Journal of Shanghai Jiao Tong University(Science), 2020, 25(5): 589-599. |
[10] | LIU Min, DENG Bin, TANG Ying, WU Minghu, WANG Juan . Low-Cost Approach for Improving Video Transmission Efficiency in WVSN[J]. Journal of Shanghai Jiao Tong University(Science), 2020, 25(5): 600-605. |
[11] | WANG Yuzong (王毓综), DENG Fei (邓飞), ZHAO Daxu (赵大旭), YE Jiaying (叶佳英), WANG Peixin. Monocular Dynamic Machine Vision-Based Pearl Shape Detection[J]. Journal of Shanghai Jiao Tong University (Science), 2019, 24(5): 654-662. |
[12] | LI Dan (李丹), NIU Zhongbin (牛中彬), PENG Dongxu (彭冬旭) . Magnetic Tile Surface Defect Detection Based on Texture Feature Clustering[J]. Journal of Shanghai Jiao Tong University (Science), 2019, 24(5): 663-670. |
[13] | XUE Ankang (薛安康), LI Fan* (李凡), XIONG Yin (熊吟). Automatic Identification of Butterfly Species Based on Gray-Level Co-occurrence Matrix Features of Image Block[J]. Journal of Shanghai Jiao Tong University (Science), 2019, 24(2): 220-225. |
[14] | ZHOU Jingmei *(周经美), ZHAO Xiangmo (赵祥模), CHENG Xin (程鑫), XU Zhigang (徐志刚), ZHAO. Vehicle Ego-Localization Based on Streetscape Image Database Under Blind Area of Global Positioning System[J]. Journal of Shanghai Jiao Tong University (Science), 2019, 24(1): 122-129. |
[15] | MA Jin (马进), XUE Teng (薛腾), SHAO Quanquan (邵全全), HU Jie (胡洁), WANG Weiming (王伟明. Research on Spatially Adaptive High-Order Total Variation Model for Weak Fluorescence Image Restoration[J]. Journal of Shanghai Jiao Tong University (Science), 2018, 23(Sup. 1): 1-7. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||