• 国家药监局综合司 国家卫生健康委办公厅
  • 国家药监局综合司 国家卫生健康委办公厅

Research on entity recognition of liver cancer electronic medical records based on RoBERTa-CRF

Corresponding author: Zhou Yi, zhouyi@mail.sysu.edu.cn
DOI: 10.12201/bmr.202303.00027
Statement: This article is a preprint and has not been peer-reviewed. It reports new research that has yet to be evaluated and so should not be used to guide clinical practice.
  •  

    Abstract: Purpose Electronic medical records of liver cancer contain a large amount of medical knowledge, and most of the knowledge is in the form of unstructured data which is difficult to extract automatically. Knowledge extraction is important in the construction of clinical decision support systems and medical knowledge graphs in the area of liver cancer.Method This paper builds a named entity recognition model combined with RoBERTa algorithm and CRF algorithm and the model achieves excellent effect. The real data of self-labeled electronic medical records of liver cancer are used for model training and testing. Result RoBERTa-CRF model is better than other baseline models and has good entity recognition effect.

    Key words: electronic medical records of liver cancer; entity recognition; knowledge extraction; RoBERTa-CRF model

    Submit time: 22 March 2023

    Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity.
  • 图表

  • chenjieqing, zhangfeng. Named Entity Recognition in Chinese Electronic Medical Records Using Knowledge Graph Construction. 2023. doi: 10.12201/bmr.202312.00011

    xiaoxiaoxia. Research on named entity recognition of Chinese medical records based on BERT-BiLSTM-CRF with Chinese radicals. 2023. doi: 10.12201/bmr.202303.00004

    wuxuehong. A method of recognizing entities from Chinese Electronic Medical Record based on domain word vector combined with word attributes reasoning. 2021. doi: 10.12201/bmr.202109.00016

    renhuiling, lixiaoying, wangweijie, wangxu, zhangying. Research on Chinese electronic medical record entity mapping method by fusing similarity algorithm and pre-trained model. 2023. doi: 10.12201/bmr.202305.00015

    chenjianqiu, huangxiaofang. Joint extraction of Chinese EMR entity relationship based on bert. 2022. doi: 10.12201/bmr.202206.00003

    shenrongrong, xiashuaishuai, yanjunfeng. Review on Research of Named Entity Recognition in Chinese Medicine. 2022. doi: 10.12201/bmr.202207.00038

    HU Haiyang, ZHAO Congpu, Ma Lian, JIANG Huizhen, ZHANG Jing, ZHU Weiguo. Attention Mechanism And Dilated Convolution Neural Networks for Named Entity Recognition. 2021. doi: 10.12201/bmr.202102.00004

    Deng Lan, Du Tongzhou. An Efficient, Secure and Multi-keyword Search Scheme on Encrypted Electronic Medical Records. 2021. doi: 10.12201/bmr.202105.00008

    SUN Chenghao, LIU Fen, ZHAO Feng. Research on electronic Medical Record System based on Block chain technology. 2020. doi: 10.12201/bmr.202007.00012

    pangzhen, GuJiYu, WuYuFei, YanSshiXing, LiWangYang, SunYue. A study on the solution of the problem of extracting essential substance of TCM diagnosis and treatment of hypertension based on triple extraction strategy. 2021. doi: 10.12201/bmr.202107.00015

  • ID Submit time Number Download
    1 2022-12-02

    bmr.202303.00027V1

    Download
  • Public  Anonymous  To author only

Get Citation

Deng Jiale, Hu Zhensheng, Lian Wanmin, Hua Yunpeng, Zhou Yi. Research on entity recognition of liver cancer electronic medical records based on RoBERTa-CRF. 2023. biomedRxiv.202303.00027

Article Metrics

  • Read: 424
  • Download: 0
  • Comment: 0

Email This Article

User name:
Email:*请输入正确邮箱
Code:*验证码错误