Guo Weijia. Method for Extracting Data Elements from Chinese Electronic Medical Records. 2024. biomedRxiv.202404.00038
Method for Extracting Data Elements from Chinese Electronic Medical Records
Corresponding author: Guo Weijia, guowj2022@163.com
DOI: 10.12201/bmr.202404.00038
-
Abstract: Purpose/Significance Extracting data elements that comply with national standards from EMR (Electronic Medical Records) can help to achieve fine-grained sharing of EMR data. Method/Process This paper proposes a method for extracting data elements from Chinese EMRs. Firstly, it uses the ALBERT, BILSTM and CRF models to perform sequence labeling on EMRs, and generates a set of candidate data elements based on labeling results. Then, for any candidate data element, its contextual information is collected to form an enhanced key vector. Finally, the similarity between the vector and the standard vector is calculated to determine whether the candidate data element is valid. Result/Conclusion The results show that the F1 value is 90.32%, indicating good performance. The shortcomings are the small size of the experimental dataset and the uneven distribution of data element types.
Key words: Electronic medical record; Data element; ALBERT; Sequence labeling; Token vectorSubmit time: 26 April 2024
Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity. -
图表
-
ZHAO Jia-Qi, WANG Xiao-Feng, FAN Yu-Yu, ZHANG Wei, WANG Hui-Xuan, LI Jin-Shan. Research on the Quality and Countermeasures of Electronic Medical Record Data. 2020. doi: 10.12201/bmr.202011.00008
wuxuehong. A method of recognizing entities from Chinese Electronic Medical Record based on domain word vector combined with word attributes reasoning. 2021. doi: 10.12201/bmr.202109.00016
zhang lixin, sun haixia, tang mingkun, qian qing. A Review of Real World Electronic Medical Record Data Evaluation. 2021. doi: 10.12201/bmr.202106.00015
renhuiling, lixiaoying, wangweijie, wangxu, zhangying. Research on Chinese electronic medical record entity mapping method by fusing similarity algorithm and pre-trained model. 2023. doi: 10.12201/bmr.202305.00015
chenjieqing, zhangfeng. Named Entity Recognition in Chinese Electronic Medical Records Using Knowledge Graph Construction. 2023. doi: 10.12201/bmr.202312.00011
wuhuan, hekunlun. Construction of general medical knowledge graph based on evidence-based medicine and electronic medical record data. 2024. doi: 10.12201/bmr.202409.00027
SUN Chenghao, LIU Fen, ZHAO Feng. Research on electronic Medical Record System based on Block chain technology. 2020. doi: 10.12201/bmr.202007.00012
Yang Liu, Li Xiaolong, Li Shanping, Wu Yirong. Research on the Construction of Electronic Health Record Data Quality Assessment Index System. 2023. doi: 10.12201/bmr.202303.00021
Ying Fang, Zhi Chen, Wenhua Jian, Jinping Zheng, Dongying Zhang. Multi-center Data Integration and Application Base on Respiratory Data Platform. 2020. doi: 10.12201/bmr.202009.00009
Deng Lan, Du Tongzhou. An Efficient, Secure and Multi-keyword Search Scheme on Encrypted Electronic Medical Records. 2021. doi: 10.12201/bmr.202105.00008
-
-
Public Anonymous To author only
Get Citation
Article Metrics
- Read: 127
- Download: 2
- Comment: 0