Author information
1Newcastle University, Newcastle upon Tyne, United Kingdom.
2Novartis Institute for Biomedical Research, Cambridge, Massachusetts, United States of America.
3Novartis Pharmaceuticals, East Hanover, New Jersey, United States of America.
4Institute of Cardiometabolism and Nutrition, Paris, France.
5Department of Medicine II, University Medical Center Homburg and Saarland University, Homburg, Germany.
6University of Torino, Turin, Italy.
7University Hospital Würzburg, Würzburg, Germany.
8Servicio Andaluz de Salud, Seville, Spain.
9University of Bern, Bern, Switzerland.
10Linköping University, Linköping, Sweden.
11Antwerp University Hospital, Antwerp, Belgium.
12University of Helsinki, Helsinki, Finland.
13University of Cambridge, Cambridge, United Kingdom.
14Università degli Studi di Milano, Milan, Italy.
15Università Cattolica del Sacro Cuore, Rome, Italy.
16University of Oxford, Oxford, United Kingdom.
17Medical School of National & Kapodistrian University of Athens, Athens, Greece.
18AMC Amsterdam, Amsterdam, The Netherlands.
19Translational & Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom.
20Newcastle NIHR Biomedical Research Centre NUTH NHS Trust, Newcastle upon Tyne, United Kingdom.
Abstract
Aims: Metabolic dysfunction Associated Steatotic Liver Disease (MASLD) outcomes such as MASH (metabolic dysfunction associated steatohepatitis), fibrosis and cirrhosis are ordinarily determined by resource-intensive and invasive biopsies. We aim to show that routine clinical tests offer sufficient information to predict these endpoints.
Methods: Using the LITMUS Metacohort derived from the European NAFLD Registry, the largest MASLD dataset in Europe, we create three combinations of features which vary in degree of procurement including a 19-variable feature set that are attained through a routine clinical appointment or blood test. This data was used to train predictive models using supervised machine learning (ML) algorithm XGBoost, alongside missing imputation technique MICE and class balancing algorithm SMOTE. Shapley Additive exPlanations (SHAP) were added to determine relative importance for each clinical variable.
Results: Analysing nine biopsy-derived MASLD outcomes of cohort size ranging between 5385 and 6673 subjects, we were able to predict individuals at training set AUCs ranging from 0.719-0.994, including classifying individuals who are At-Risk MASH at an AUC = 0.899. Using two further feature combinations of 26-variables and 35-variables, which included composite scores known to be good indicators for MASLD endpoints and advanced specialist tests, we found predictive performance did not sufficiently improve. We are also able to present local and global explanations for each ML model, offering clinicians interpretability without the expense of worsening predictive performance.
Conclusions: This study developed a series of ML models of accuracy ranging from 71.9-99.4% using only easily extractable and readily available information in predicting MASLD outcomes which are usually determined through highly invasive means.