Establishing a model based on data mining for predicting the recurrence factor of breast cancer

Corresponding author: QIAO Qiong, qiaoqiong13@126.com
DOI: 10.12201/bmr.202009.00011
Statement: This article is a preprint and has not been peer-reviewed. It reports new research that has yet to be evaluated and so should not be used to guide clinical practice.

    Abstract: Objective: To build a prediction model for the recurrence factors of breast cancer using a dataset from the open machine learning database OpenML (https://www.openml.org/d/13), and to identify the best-performing algorithm and the main recurrence factor. Methods: The model was built with SPSS Modeler 18.0. After running the auto-classifier, the five algorithms with the highest overall accuracy were each evaluated over ten rounds of random sampling. Model performance was assessed by the area under the receiver operating characteristic curve (AUC). Results: The best-performing algorithm was an artificial neural network with a multilayer perceptron. The AUC was 0.869 on the training data and 0.894 on the testing data. The clinical stage of breast cancer could be the main factor in recurrence. Conclusion: For this breast cancer recurrence dataset, the artificial neural network was the best algorithm, the clinical stage could be the main recurrence factor, and the model could support the prediction of breast cancer recurrence.

    Key words: Breast cancer; Recurrence factor; Model prediction; SPSS Modeler; Artificial neural network

    Submit time: 23 September 2020

    Copyright: The copyright holder for this preprint is the author/funder, who has granted biomedRxiv a license to display the preprint in perpetuity.
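
The evaluation procedure named in the abstract (ten rounds of random sampling, scored by AUC) can be illustrated with a minimal Python sketch. This is not the authors' SPSS Modeler workflow: the data below are synthetic, the single "stage" feature and the per-stage rate model are hypothetical stand-ins, and the function names `auc`, `repeated_split_auc`, and `fit_rate_by_stage` are assumptions introduced only for illustration.

```python
import random
from statistics import mean


def auc(labels, scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def repeated_split_auc(rows, fit, n_repeats=10, test_fraction=0.3, seed=0):
    """Average test AUC over n_repeats random train/test splits.

    rows: list of (features, label) pairs.
    fit:  callable that takes training rows and returns a scoring function.
    """
    rng = random.Random(seed)
    per_split = []
    for _ in range(n_repeats):
        shuffled = rows[:]
        rng.shuffle(shuffled)
        cut = len(shuffled) - round(len(shuffled) * test_fraction)
        train, test = shuffled[:cut], shuffled[cut:]
        score = fit(train)
        per_split.append(auc([y for _, y in test], [score(x) for x, _ in test]))
    return mean(per_split), per_split


def fit_rate_by_stage(train):
    """Hypothetical stand-in model: score a case by the recurrence rate
    observed in the training rows that share its stage."""
    counts = {}
    for (stage,), y in train:
        n, k = counts.get(stage, (0, 0))
        counts[stage] = (n + 1, k + y)
    overall = sum(y for _, y in train) / len(train)
    return lambda x: counts[x[0]][1] / counts[x[0]][0] if x[0] in counts else overall


# Synthetic data (an assumption, not the OpenML dataset): recurrence is made
# more likely at higher stages so that the feature carries some signal.
data_rng = random.Random(42)
rows = []
for _ in range(200):
    stage = data_rng.choice([1, 2, 3])
    label = 1 if data_rng.random() < 0.15 * stage else 0
    rows.append(((stage,), label))

avg, per_split = repeated_split_auc(rows, fit_rate_by_stage)
print(f"mean test AUC over {len(per_split)} splits: {avg:.3f}")
```

Averaging over several random splits, rather than reporting a single split, reduces the dependence of the AUC estimate on one particular partition of a small dataset.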

  • Zhu Xiaoxiao, Qian Aibing. Analysis of Network Attention Characteristics of Breast Cancer Prevention and Treatment Health Information Based on Baidu Index. 2020. doi: 10.12201/bmr.201906.00001

    Zhan Haixia, Hu Dong, Zhang Wenting, Gu Ying. Effect of cluster nursing mode on shoulder function recovery and quality of life of patients with breast cancer after modified radical mastectomy. 2020. doi: 10.12201/bmr.202004.00015

    jinlizhu, gehui, guoqing, lishaoqiong, duxuejie. Early warning of influenza epidemics using meteorological factors and machine learning. 2021. doi: 10.12201/bmr.202012.00008

    Ci Yan, Peng Wang, Yue Yang, Jin Ren, Ruihao Wu, Yin Guan, Qian Zhang. Planning and construction of provincial cancer big data center. 2020. doi: 10.12201/bmr.202009.00002

    Zhai Xing, Li Guoliang, Guo Fengying. Research on the construction of the course system of big data management and application in traditional Chinese medicine Universities: An exploratory analysis based on the needs of College Students. 2020. doi: 10.12201/bmr.202008.00008

    Establishing a symptom management system based on electronic patient-reported outcome. 2020. doi: 10.12201/bmr.202004.00023

    Wei Jingming, Gao Qilong, Huang Minzhuo, Dong Hengjin. Study on the Operation Efficiency of County Medical Community in Zhejiang Province Based on DEA Model. 2021. doi: 10.12201/bmr.202005.00252

    Yufan Zhu, Xin Zhao, Zhiqiang Yang, Houcheng Zhong, Lin Cai, Yuanlong Xie. Prospect of the Training Model for Artificial Intelligence + Medicine Inter-disciplinary Talents. 2020. doi: 10.12201/bmr.202008.00010

    chenquan, hu hongpu. Construction of Mobile Health Intervention Strategy for Elderly Diabetes Patients Based on Behavior. 2020. doi: 10.12201/bmr.202005.00245

  • ID: 1; Submit time: 2020-09-17; Number: bmr.202009.00011V1

Get Citation

HUANG Yucheng, YANG Xuming, QIAO Qiong. Establishing a model based on data mining for predicting the recurrence factor of breast cancer. 2020. biomedRxiv.202009.00011

Article Metrics

  • Read: 3749
  • Download: 3
  • Comment: 0
