Author information
1PathAI, Inc., Boston, MA, USA.
2Pinnacle Clinical Research, San Antonio, TX, USA.
3AbbVie, Inc., Irvine, CA, USA.
4Virginia Commonwealth University, Richmond, VA, USA.
5Whoop Inc., Boston, MA, USA.
6Invicro, Needham, MA, USA.
7Harvard Medical School, Boston, MA, USA.
8Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
9Department of Pathology and Laboratory Medicine, Rhode Island Hospital, Brown University, Providence, RI, USA.
10OrsoBio, Inc., Palo Alto, CA, USA.
11Inipharm, Inc., Bellevue, WA, USA.
12Gilead Sciences, Inc., Foster City, CA, USA.
13Novo Nordisk, Bagsvaerd, Denmark.
14Bristol Myers Squibb, Princeton, NJ, USA.
15Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK.
16UCSD School of Medicine, San Diego, CA, USA.
17Sorbonne Université, ICAN Institute for Cardiometabolism and Nutrition, Assisstance Publique Hôpitaux de Paris, INSERM UMRS, Paris, France.
18Amgen, Thousand Oaks, CA, USA.
19PathAI, Inc., Boston, MA, USA. katy.wack@pathai.com.
Abstract
Metabolic dysfunction-associated steatohepatitis (MASH) is a major cause of liver-related morbidity and mortality, yet treatment options are limited. Manual scoring of liver biopsies, currently the gold standard for clinical trial enrollment and endpoint assessment, suffers from high reader variability. This study represents the most comprehensive multisite analytical and clinical validation of an artificial intelligence (AI)-based pathology system, AI-based measurement of metabolic dysfunction-associated steatohepatitis (AIM-MASH), to assist pathologists in MASH trial histology scoring. AIM-MASH demonstrated high repeatability and reproducibility compared to manual scoring. AIM-MASH-assisted reads by expert MASH pathologists were superior to unassisted reads in accurately assessing inflammation, ballooning, MAS ≥ 4 with ≥1 in each score category and MASH resolution, while maintaining non-inferiority in steatosis and fibrosis assessment. These findings suggest that AIM-MASH could mitigate reader variability, providing a more reliable assessment of therapeutics in MASH clinical trials.