• 国家药监局综合司 国家卫生健康委办公厅
  • 国家药监局综合司 国家卫生健康委办公厅

Method for Extracting Data Elements from Chinese Electronic Medical Records

Corresponding author: Guo Weijia, guowj2022@163.com
DOI: 10.12201/bmr.202404.00038
Statement: This article is a preprint and has not been peer-reviewed. It reports new research that has yet to be evaluated and so should not be used to guide clinical practice.
  •  

    Abstract: Purpose/Significance Extracting data elements that comply with national standards from EMR (Electronic Medical Records) can help to achieve fine-grained sharing of EMR data. Method/Process This paper proposes a method for extracting data elements from Chinese EMRs. Firstly, it uses the ALBERT, BILSTM and CRF models to perform sequence labeling on EMRs, and generates a set of candidate data elements based on labeling results. Then, for any candidate data element, its contextual information is collected to form an enhanced key vector. Finally, the similarity between the vector and the standard vector is calculated to determine whether the candidate data element is valid. Result/Conclusion The results show that the F1 value is 90.32%, indicating good performance. The shortcomings are the small size of the experimental dataset and the uneven distribution of data element types.

    Key words: Electronic medical record; Data element; ALBERT; Sequence labeling; Token vector

    Submit time: 26 April 2024

    Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity.
  • 图表

  • ZHAO Jia-Qi, WANG Xiao-Feng, FAN Yu-Yu, ZHANG Wei, WANG Hui-Xuan, LI Jin-Shan. Research on the Quality and Countermeasures of Electronic Medical Record Data. 2020. doi: 10.12201/bmr.202011.00008

    wuxuehong. A method of recognizing entities from Chinese Electronic Medical Record based on domain word vector combined with word attributes reasoning. 2021. doi: 10.12201/bmr.202109.00016

    zhang lixin, sun haixia, tang mingkun, qian qing. A Review of Real World Electronic Medical Record Data Evaluation. 2021. doi: 10.12201/bmr.202106.00015

    renhuiling, lixiaoying, wangweijie, wangxu, zhangying. Research on Chinese electronic medical record entity mapping method by fusing similarity algorithm and pre-trained model. 2023. doi: 10.12201/bmr.202305.00015

    chenjieqing, zhangfeng. Named Entity Recognition in Chinese Electronic Medical Records Using Knowledge Graph Construction. 2023. doi: 10.12201/bmr.202312.00011

    wuhuan, hekunlun. Construction of general medical knowledge graph based on evidence-based medicine and electronic medical record data. 2024. doi: 10.12201/bmr.202409.00027

    SUN Chenghao, LIU Fen, ZHAO Feng. Research on electronic Medical Record System based on Block chain technology. 2020. doi: 10.12201/bmr.202007.00012

    Yang Liu, Li Xiaolong, Li Shanping, Wu Yirong. Research on the Construction of Electronic Health Record Data Quality Assessment Index System. 2023. doi: 10.12201/bmr.202303.00021

    Ying Fang, Zhi Chen, Wenhua Jian, Jinping Zheng, Dongying Zhang. Multi-center Data Integration and Application Base on Respiratory Data Platform. 2020. doi: 10.12201/bmr.202009.00009

    Deng Lan, Du Tongzhou. An Efficient, Secure and Multi-keyword Search Scheme on Encrypted Electronic Medical Records. 2021. doi: 10.12201/bmr.202105.00008

  • ID Submit time Number Download
    2 2023-12-14

    bmr.202404.00038V2

    Download
    1 2023-12-14

    bmr.202404.00038V1

    Download
  • Public  Anonymous  To author only

Get Citation

Guo Weijia. Method for Extracting Data Elements from Chinese Electronic Medical Records. 2024. biomedRxiv.202404.00038

Article Metrics

  • Read: 127
  • Download: 2
  • Comment: 0

Email This Article

User name:
Email:*请输入正确邮箱
Code:*验证码错误