PL EN


Preferencje help
Widoczny [Schowaj] Abstrakt
Liczba wyników
Tytuł artykułu

Heart failure disease prediction and stratification with temporal electronic health records data using patient representation

Identyfikatory
Warianty tytułu
Języki publikacji
EN
Abstrakty
EN
Accurate early prediction of heart failure and identification of heart failure sub-phenotypes can enable in-time interventions and treatments, assist with policy decisions, and lead to a better understanding of disease pathophysiology in groups of patients. However, decision making more challenging for clinicians since the available data is complex, heterogeneous, temporal, and different in granularity. Even with much data, it is difficult for a cardiologist to pre-judge a patient’s heart condition at the next visit by relying on data from only one visit. Moreover, complicated and overloaded information bewilders clinicians, bringing obstacles to the stratification of patients and the mining of disease typical patterns in subgroups. To overcome these issues, this study proposes a novel Patient Representation model based on a temporal Bidirectional neural network with an Attention mechanism deep learning model called tBNA-PR. tBNA-PR effectively models heterogeneous and temporal Electronic Health Records (tEHRs) data from past and future directions to obtain informative patient representation to realize accurate heart failure prediction and reasonable patient stratification. Additionally, this study extracts typical diagnosis and prescriptions for disease patterns exploration and identifies significant features of sub-phenotypes for subgroup explanation in the context of complex clinical settings to provide better quality healthcare services and clinical decision support. This study leverages a real-world dataset MIMIC-III database. We carried out experiments on the prediction of heart failure to investigate tBNA-PR, which obtains prediction accuracy of 0.78, F1-Score of 0.7671, and AUC of 0.7198, showing a certain superiority compared with several state-of-the-art benchmarks. Moreover, we identified three distinct sub-phenotypes in all heart failure patients in the dataset with the clustering method and subgroup analysis. Sub-phenotype I has characteristics of more long-term anticoagulants. This sub-group has more patients who have the thrombotic disease. Sub-phenotype II has features of more patients having kidney disease, pneumonia, urinary tract infection, and coronary heart disease surgery history. Subphenotype III has characteristics of more patients having acidosis, depressive disorder, esophageal reflux, obstructive sleep apnea, and acquired hypothyroidism. Statistical tests show that the features, including age, creatinine, hemoglobin, urea nitrogen, and blood potassium, are significantly different among the three sub-phenotypes and have particular high importance. The resultant findings from this work have practical implications for clinical decision support.
Twórcy
autor
  • Institute of Systems Engineering, Dalian University of Technology, Dalian, Liaoning, China
autor
  • Institute of Systems Engineering, Dalian University of Technology, Dalian, Liaoning 116024, China
Bibliografia
  • [1] Heidenreich PA, Bozkurt B, Aguilar D, et al. 2022 AHA/ACC/ HFSA guideline for the management of heart failure: a report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. J Am Coll Cardiol 2022;79(17):e263-421.
  • [2] Bradley J, Schelbert EB, Bonnett LJ, et al. Predicting hospitalisation for heart failure and death in patients with, or at risk of, heart failure before first hospitalisation: A retrospective model development and external validation study. Lancet Digital Health 2022;4(6):e445-54.
  • [3] Suri JS, Bhagawati M, Paul S, et al. A powerful paradigm for cardiovascular risk stratification using multiclass, multilabel, and ensemble-based machine learning paradigms: A narrative review. Diagnostics 2022;12(3):722.
  • [4] Ahmad GN, Fatima H, Abbas M, et al. Mixed machine learning approach for efficient prediction of human heart disease by identifying the numerical and categorical features. Appl Sci 2022;12(15):7449.
  • [5] National center for health statistics. National Health and Nutrition Examination Survey (NHANES) public use data flies. Centers for disease control and prevention website. https:// www.cdc.gov/nchs/nhanes/. Accessed May 5, 2022.
  • [6] Tsao CW, Aday AW, Almarzooq ZI, et al. Heart disease and stroke statistics-2022 update: A report from the American Heart Association. Circulation 2022;145(8):e153-639.
  • [7] Nagamine T, Gillette B, Kahoun J, et al. Data-driven identification of heart failure disease states and progression pathways using electronic health records. Sci Rep 2022;12 (1):1-20.
  • [8] Sabbah HN. Silent disease progression in clinically stable heart failure. Eur J Heart Fail 2017;19(4):469-78.
  • [9] Yin T, Shi S, Zhu X, et al. A survival prediction for acute heart failure patients via web-based dynamic nomogram with internal validation: A prospective cohort study. J Inflammat Res 2022;15:1953-67.
  • [10] McDonagh TA, Metra M, Adamo M, et al. 2021 ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure: Developed by the Task Force for the diagnosis and treatment of acute and chronic heart failure of the European Society of Cardiology (ESC) With the special contribution of the Heart Failure Association (HFA) of the ESC. Eur Heart J 2021;42(36):3599-726.
  • [11] Shea S, Blaha MJ. Long-term risk prediction for heart failure, disparities, and early prevention. Circul Res 2022;130 (2):210-2.
  • [12] Yip C, Lindsay M, Oudit G, et al. Complexity of acute care journey of females with heart failure following hospitalization. Can J Cardiol 2021;37(2):e7.
  • [13] Sarwar T, Seifollahi S, Chan J, et al. The secondary use of electronic health records for data mining: Data characteristics and challenges. ACM Comput Surv (CSUR) 2022;55(2):1-40.
  • [14] Ayaad O, Alloubani A, ALhajaa EA, et al. The role of electronic medical records in improving the quality of health care services: comparative study. Int J Med Informat 2019;127:63-7.
  • [15] Tiwari P, Colborn KL, Smith DE, et al. Assessment of a machine learning model applied to harmonized electronic health record data for the prediction of incident atrial fibrillation. JAMA Network Open 2020;3(1):e1919396.
  • [16] Najafabadipour M, Tuñas JM, Rodríguez-González A, et al. Analysis of electronic health records to identify the patient’s treatment lines: Challenges and opportunities. In: International Conference on Innovative Techniques and Applications of Artificial Intelligence. Springer; 2019. p. 437-42.
  • [17] Jetley G, Zhang H. Electronic health records in IS research: Quality issues, essential thresholds and remedial actions. Decis Support Syst 2019;126:113137.
  • [18] Mathis MR, Engoren MC, Williams AM, et al. Prediction of postoperative deterioration in cardiac surgery patients using electronic health record and physiologic waveform data. Anesthesiology 2022;137(5):586-601.
  • [19] Chu J, Dong W, Huang Z. Endpoint prediction of heart failure using electronic health records. J Biomed Inform 2020;109:103518.
  • [20] Qaisar SM, Khan SI, Dallet D, et al. Signal-piloted processing metaheuristic optimization and wavelet decomposition based elucidation of arrhythmia for mobile healthcare. Biocybernet Biomed Eng 2022;42(2):681-94.
  • [21] Overmars LM, van Es B, Groepenhoff F, et al. Preventing unnecessary imaging in patients suspect of coronary artery disease through machine learning of electronic health records. Eur Heart J-Digital Health 2022;3(1):11-9.
  • [22] McBeath K, Angermann C, Cowie M. Digital technologies to support better outcome and experience of care in patients with heart failure. Curr Heart Failure Rep 2022;19:75-108.
  • [23] Williams BA, Voyce S, Sidney S, et al. Establishing a national cardiovascular disease surveillance system in the United States using electronic health record data: Key strengths and limitations. J Am Heart Assoc 2022;11:e024409.
  • [24] Huang Y, Wang N, Zhang Z, et al. Patient representation from structured electronic medical records based on embedding technique: Development and validation study. JMIR Med Informat 2021;9(7):e19905.
  • [25] Anetta K, Horak A, Wojakowski W, et al. Deep learning analysis of polish electronic health records for diagnosis prediction in patients with cardiovascular diseases. J Personalized Med 2022;12(6):869.
  • [26] Davazdahemami B, Zolbanin HM, Delen D. An explanatory machine learning framework for studying pandemics: The case of COVID-19 emergency department readmissions. Decis Support Syst 2022;113730.
  • [27] Hong D, Fort D, Shi L, et al. Electronic medical record risk modeling of cardiovascular outcomes among patients with type 2 diabetes. Diabetes Therapy 2021;12(7):2007-17.
  • [28] Rahman F, Finkelstein N, Alyakin A, et al. Using machine learning for early prediction of cardiogenic shock in patients with acute heart failure. J Soc Cardiovasc Angiography Intervent 2022;1(3):100308.
  • [29] Chen R, Stewart WF, Sun J, et al. Recurrent neural networks for early detection of heart failure from longitudinal electronic health record data: Implications for temporal modeling with respect to time before diagnosis, data density, data quantity, and data type. Circulation: Cardiovasc Quality Outcomes 2019;12(10):e005114.
  • [30] Maragatham G, Devi S. LSTM model for prediction of heart failure in big data. J Med Syst 2019;43(5):1-13.
  • [31] Rodrigues-Jr JF, Gutierrez MA, Spadon G, et al. LIGDoctor: Efficient patient trajectory prediction using bidirectional minimal gated-recurrent networks. Inf Sci 2021;545:813-27.
  • [32] Choi E, Bahadori MT, Schuetz A, et al. Doctor ai: Predicting clinical events via recurrent neural networks. In: Machine Learning for Healthcare Conference. PMLR; 2016. p. 301-18.
  • [33] Choi E, Bahadori MT, Sun J, et al. Retain: An interpretable predictive model for healthcare using reverse time attention mechanism. In: the 30th Conference on Neural Information Processing Systems. p. 3504-12.
  • [34] Miotto R, Li L, Kidd BA, et al. Deep patient: An unsupervised representation to predict the future of patients from the electronic health records. Sci Rep 2016;6(1):1-10.
  • [35] Che Z, Liu Y. Deep learning solutions to computational phenotyping in health care. In: 2017 IEEE International Conference on Data Mining Workshops. IEEE; 2017. p. 1100-9.
  • [36] Yu R, Zheng Y, Zhang R, et al. Using a multi-task recurrent neural network with attention mechanisms to predict hospital mortality of patients. IEEE J Biomed Health Informat 2019;24(2):486-92.
  • [37] An Y, Huang N, Chen X, et al. High-risk prediction of cardiovascular diseases via attention-based deep neural networks. IEEE/ACM Trans Comput Biol Bioinf 2019;18 (3):1093-105.
  • [38] Lee JM, Hauskrecht M. Modeling multivariate clinical event time-series with recurrent temporal mechanisms. Artif Intell Med 2021;112:102021.
  • [39] Liu S, Wang X, Xiang Y, et al. Multi-channel fusion LSTM for medical event prediction using EHRs. J Biomed Inform 2022;127:104011.
  • [40] Ma F, Chitta R, Zhou J, et al. Dipole: Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks. In: the 23th International Conference on Knowledge Discovery and Data Mining; 2017. p. 1903-11.
  • [41] Zhao J, Zhang Y, Schlueter DJ, et al. Detecting time-evolving phenotypic topics via tensor factorization on electronic health records: Cardiovascular disease case study. J Biomed Inform 2019;98:103270.
  • [42] Gao H, Wang K, Zhao W, et al. Cardiorenal risk profiles among data-driven type 2 diabetes sub-phenotypes: A post-hoc analysis of the china health and nutrition survey. Front Endocrinol 2022;514.
  • [43] Urban S, Błaziak M, Jura M, et al. Machine learning approach to understand worsening renal function in acute heart failure. Biomolecules 2022;12(11):1616.
  • [44] Seymour CW, Kennedy JN, Wang S, et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 2019;321(20):2003-17.
  • [45] Luo C, Zhu Y, Zhu Z, et al. A machine learning-based risk stratification tool for in-hospital mortality of intensive care unit patients with heart failure. J Transl Med 2022;20(1):1-9.
  • [46] Smole T, Žunkovič B, Pičulin M, et al. A machine learning-based risk stratification model for ventricular tachycardia and heart failure in hypertrophic cardiomyopathy. Comput Biol Med 2021;135:104648.
  • [47] Uijl A, Savarese G, Vaartjes I, et al. Identification of distinct phenotypic clusters in heart failure with preserved ejection fraction. Eur J Heart Fail 2021;23(6):973-82.
  • [48] Harada D, Asanoi H, Noto T, Takagawa J. Different pathophysiology and outcomes of heart failure with preserved ejection fraction stratified by k-means clustering. Front Cardiovasc Med 2020;7:607760.
  • [49] Cikes M, Sanchez-Martinez S, Claggett B, et al. Machine learning-based phenogrouping in heart failure to identify responders to cardiac resynchronization therapy. Eur J Heart Fail 2019;21(1):74-85.
  • [50] Fereshtehnejad SM, Zeighami Y, Dagher A, et al. Clinical criteria for subtyping Parkinson’s disease: biomarkers and longitudinal progression. Brain 2017;140(7):1959-76.
  • [51] Zhang X, Chou J, Liang J, et al. Data-driven subtyping of Parkinson’s disease using longitudinal clinical records: A cohort study. Sci Rep 2019;9(1):1-12.
  • [52] Johnson AE, Pollard TJ, Shen L, et al. MIMIC-III, a freely accessible critical care database. Sci Data 2016;3(1):1-9.
  • [53] Centers for disease control and prevention, International Classification of Disease, Ninth Revision, Clinical Modification (ICD-9-CM). https://www.cdc.gov/nchs/icd/icd9cm.htm. Accessed May 26, 2022.
  • [54] Xie F, Yuan H, Ning Y, et al. Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies. J Biomed Inform 2022;126:103980.
  • [55] Aggarwal Y, Das J, Mazumder PM, et al. Heart rate variability features from nonlinear cardiac dynamics in identification of diabetes using artificial neural network and support vector machine. Biocybernet Biomed Eng 2020;40(3):1002-9.
  • [56] Pal M, Parija S, Panda G, et al. Risk prediction of cardiovascular disease using machine learning classifiers. Open Med 2022;17(1):1100-13.
  • [57] Mullin S, Zola J, Lee R, et al. Longitudinal K-means approaches to clustering and analyzing EHR opioid use trajectories for clinical subtypes. J Biomed Inform 2021;122:103889.
  • [58] Yin C, Liu R, Zhang D, et al. Identifying sepsis subphenotypes via time-aware multi-modal auto-encoder. In: The 26th International Conference on Knowledge Discovery and Data Mining. p. 862-72.
  • [59] Rousseeuw PJ. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 1987;20:53-65.
  • [60] Caliński T, Harabasz J. A dendrite method for cluster analysis. Commun Stat-Theory Methods 1974;3(1):1-27.
  • [61] Davies DL and Bouldin DW. A cluster separation measure. IEEE Trans Pattern Anal Mach Intell 1979;PAMI-1(2):224-227.
  • [62] Manfredini M, Poli PP, Creminelli L, et al. Comparative risk of bleeding of anticoagulant therapy with vitamin k antagonists (Vkas) and with non-vitamin k antagonists in patients undergoing dental surgery. J Clin Med 2021;10(23):5526.
Uwagi
PL
Opracowanie rekordu ze środków MEiN, umowa nr SONP/SP/546092/2022 w ramach programu "Społeczna odpowiedzialność nauki" - moduł: Popularyzacja nauki i promocja sportu (2022-2023)
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-a0922187-82f3-403f-b0a0-3ed81186ef25
JavaScript jest wyłączony w Twojej przeglądarce internetowej. Włącz go, a następnie odśwież stronę, aby móc w pełni z niej korzystać.