Machine learning models for predicting patients survival after liver transplantation

Jarmulski, W.; Wieczorkowska, A.; Trzaska, M.; Ciszek, M.; Paczek, L.

doi:10.7494/csci.2018.19.2.2746

Artykuł - szczegóły

Tytuł artykułu

Machine learning models for predicting patients survival after liver transplantation

Autorzy

Jarmulski W. , Wieczorkowska A. , Trzaska M. , Ciszek M. , Paczek L.

Treść / Zawartość

Pełne teksty:

Pobierz

Identyfikatory

DOI

10.7494/csci.2018.19.2.2746

Warianty tytułu

Języki publikacji

Abstrakty

In our work, we have built models predicting whether a patient will lose an organ after a liver transplant within a specified time horizon. We have used the observations of bilirubin and creatinine in the whole first year after the transplantation to derive predictors, capturing not only their static value but also their variability. Our models indeed have a predictive power that proves the value of incorporating variability of biochemical measurements, and it is the first contribution of our paper. As the second contribution we have identified that full-complexity models such as random forests and gradient boosting lack sufficient interpretability despite having the best predictive power, which is important in medicine. We have found that generalized additive models (GAM) provide the desired interpretability, and their predictive power is closer to the predictions of full-complexity models than to the predictions of simple linear models.

Słowa kluczowe

machine learning models interpretability survival prediction generalized additive models (GAM) liver transplant

Wydawca

Wydawnictwa AGH

Czasopismo

Computer Science

Rocznik

2018

Tom

Vol. 19 (2)

Strony

223--239

Opis fizyczny

Bibliogr. 27 poz., rys., wykr., tab.

Twórcy

autor

Jarmulski W.

wojciech.jarmulski@pja.edu.pl

Polish-Japanese Academyof Information Technology

autor

Wieczorkowska A.

alicja@poljap.edu.pl

Polish-Japanese Academyof Information Technology

autor

Trzaska M.

mtrzaska@pjwstk.edu.pl

Polish-Japanese Academyof Information Technology

autor

Ciszek M.

michal.ciszek@wum.edu.pl

Medical University of Warsaw. Department of Immunology, Transplantology and Internal Diseases

autor

Paczek L.

leszek.paczek@wum.edu.pl

Polish-Japanese Academyof Information Technology

Bibliografia

[1] Boyd J.: Statistical analysis and presentation of data. In: Evidence-Based Laboratory Medicine, pp. 113–140. AACC Press Washington, DC, 2007.
[2] Breiman L.: Random forests. In: Machine learning, vol. 45(1), pp. 5–32, 2001.
[3] Caruana R., Lou Y., Gehrke J., Koch P., Sturm M., Elhadad N.: Intelligible models for healthcare: Predicting pneumonia risk and hospital 30-day readmission. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1721–1730. ACM, 2015.
[4] Chawla N.V., Bowyer K.W., Hall L.O., Kegelmeyer W.P.: SMOTE: synthetic minority over-sampling technique. In: Journal of artificial intelligence research, vol. 16, pp. 321–357, 2002.
[5] Chen T., Guestrin C.: XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd SIGKDD Conference on Knowledge Discovery and Data Mining. ACM, 2016.
[6] Cholongitas E., Marelli L., Shusang V., Senzolo M., Rolles K., Patch D., Burroughs A.K.: A systematic review of the performance of the model for end-stage liver disease (MELD) in the setting of liver transplantation. In: Liver transplantation, vol. 12(7), pp. 1049–1061, 2006.
[7] Fernandez-Delgado M., Cernadas E., Barro S., Amorim D.: Do we need hundreds of classifiers to solve real world classification problems. In: J. Mach. Learn. Res, vol. 15(1), pp. 3133–3181, 2014.
[8] Friedman J.H.: Greedy function approximation: a gradient boosting machine. In: Annals of statistics, pp. 1189–1232, 2001.
[9] Friedman J.H.: Stochastic gradient boosting. In: Computational Statistics & Data Analysis, vol. 38(4), pp. 367–378, 2002.
[10] Habib S., Berk B., Chang C.C.H., Demetris A.J., Fontes P., Dvorchik I., Eghtesad B., Marcos A., Shakil A.O.: MELD and prediction of post–liver transplantation survival. In: Liver transplantation, vol. 12(3), pp. 440–447, 2006.
[11] Hastie T., Tibshirani R., Friedman J.: The elements of statistical learning: data mining, inference, and prediction. Second Edition. Springer, 2009.
[12] Hastie T.J., Tibshirani R.J.: Generalized additive models, vol. 43. CRC Press, 1990.
[13] Herland M., Khoshgoftaar T.M., Wald R.: A review of data mining using big data in health informatics. In: Journal Of Big Data, vol. 1(1), p. 2, 2014.
[14] Kuhn M., Johnson K.: Applied predictive modeling. Springer, 2013.
[15] Liu K.H., Huang D.S.: Cancer classification using Rotation Forest. In: Computers in biology and medicine, vol. 38(5), pp. 601–10, 2008.
[16] Lou Y., Caruana R., Gehrke J.: Intelligible models for classification and regression. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 150–158. ACM, 2012.
[17] Luca A., Angermayr B., Bertolini G., Koenig F., Vizzini G., Ploner M., Peck- Radosavljevic M., Gridelli B., Bosch J.: An integrated MELD model including serum sodium and age improves the prediction of early mortality in patients with cirrhosis. In: Liver transplantation, vol. 13(8), pp. 1174–1180, 2007.
[18] Mazzaferro V., Llovet J.M., Miceli R., Bhoori S., Schiavo M., Mariani L., Camerini T., Roayaie S., Schwartz M.E., Grazi G.L., et al.: Predicting survival after liver transplantation in patients with hepatocellular carcinoma beyond the Milan criteria: a retrospective, exploratory analysis. In: The lancet oncology, vol. 10(1), pp. 35–43, 2009.
[19] Menardi G., Torelli N.: Training and assessing classification rules with imbalanced data. In: Data Mining and Knowledge Discovery, 2014.
[20] Miotto R., Li L., Kidd B.A., Dudley J.T.: Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records. In: Scientific Reports, vol. 6, p. 26094, 2016. ISSN 2045-2322. URL http://dx.doi.org/10.1038/srep26094.
[21] Ozcift A., Gulten A.: Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms. In: Computer methods and programs in biomedicine, vol. 104(3), pp. 443–51, 2011.
[22] Pratt, Daniel S.; Kaplan M.M.: Evaluation of Liver Function. In: Harrison’s Principles of Internal Medicine, pp. 1923–1926. McGraw-Hill Medical Publishing Division, New York, 17th ed., 2008. ISBN 978-0-07-146633-2.
[23] Roberts M.S., Angus D.C., Bryce C.L., Valenta Z., Weissfeld L.: Survival after liver transplantation in the United States: a disease-specific analysis of the UNOS database. In: Liver transplantation, vol. 10(7), pp. 886–897, 2004.
[24] Thongkam J., Xu G., Zhang Y., Huang F.: Breast cancer survivability via Ada- Boost algorithms. In: Proceedings of the second Australasian workshop on Health data and knowledge management, vol. 80, pp. 55–64, 2008.
[25] Tsujitani M., Tanaka Y.: Analysis of heart transplant survival data using generalized additive models. In: Computational and mathematical methods in medicine, 2013.
[26] Watt K., Menke T., Lyden E., McCashland T.M.: Mortality while awaiting liver retransplantation: predictability of MELD scores. In: Transplantation proceedings, vol. 37, pp. 2172–2173. Elsevier, 2005.
[27] Wood S.: Generalized additive models: an introduction with R. CRC press, 2006.

Uwagi

Opracowanie rekordu w ramach umowy 509/P-DUN/2018 ze środków MNiSW przeznaczonych na działalność upowszechniającą naukę (2018).

Typ dokumentu

Bibliografia

Identyfikator YADDA

bwmeta1.element.baztech-d8f103bb-b483-401b-909d-bb59cae80e03