TY - JOUR
T1 - Machine learning models predict long COVID outcomes based on baseline clinical and immunologic factors
AU - IMPACC Network
AU - Doni Jayavelu, Naresh
AU - Samaha, Hady
AU - Wimalasena, Sonia Tandon
AU - Hoch, Annmarie
AU - Gygi, Jeremy P.
AU - Gabernet, Gisela
AU - Ozonoff, Al
AU - Liu, Shanshan
AU - Milliren, Carly E.
AU - Levy, Ofer
AU - Baden, Lindsey R.
AU - Melamed, Esther
AU - Ehrlich, Lauren I.R.
AU - McComsey, Grace A.
AU - Sekaly, Rafick P.
AU - Cairns, Charles B.
AU - Haddad, Elias K.
AU - Schaenman, Joanna
AU - Shaw, Albert C.
AU - Hafler, David A.
AU - Montgomery, Ruth R.
AU - Corry, David B.
AU - Kheradmand, Farrah
AU - Atkinson, Mark A.
AU - Brakenridge, Scott C.
AU - Agudelo Higuit, Nelson I.
AU - Metcalf, Jordan P.
AU - Hough, Catherine L.
AU - Messer, William B.
AU - Pulendran, Bali
AU - Nadeau, Kari C.
AU - Davis, Mark M.
AU - Gen, Linda N.
AU - Fernandez Sesma, Ana
AU - Simon, Viviana
AU - Krammer, Florian
AU - Kraft, Monica
AU - Bime, Chris
AU - Calfee, Carolyn S.
AU - Erle, David J.
AU - Langelier, Charles R.
AU - Tipan, Pablo Guaman
AU - Rogers, Jacob E.
AU - Siles, Nadia
AU - Geltman, Janelle N.
AU - Hurley, Kerin C.
AU - Rousseau, Justin F.
AU - Wylie, Dennis
AU - Maguire, Cole
AU - Guerrero, Yanedth Sanchez
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2026/12
Y1 - 2026/12
N2 - Background: The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will develop long COVID is challenging due to the absence of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models may address this gap by leveraging clinical data to enhance diagnostic precision. Methods: Clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, are used to predict the likelihood of acute COVID-19 progressing to long COVID. Machine learning models are trained and evaluated for predictive performance. Feature importance analysis is performed to identify the most influential predictors. Results: The machine learning models achieve median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating predictive capabilities. Low antibody titers and high viral loads at hospital admission emerge as the strongest predictors of long COVID outcomes. Comorbidities—such as chronic respiratory, cardiac, and neurologic diseases—and female sex are also identified as significant risk factors. Conclusions: Machine learning models identify patients at risk for developing long COVID based on baseline clinical characteristics. These models guide early interventions, improve patient outcomes, and mitigate the long-term public health impacts of SARS-CoV-2.
AB - Background: The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will develop long COVID is challenging due to the absence of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models may address this gap by leveraging clinical data to enhance diagnostic precision. Methods: Clinical data, including antibody titers and viral load measurements collected at the time of hospital admission, are used to predict the likelihood of acute COVID-19 progressing to long COVID. Machine learning models are trained and evaluated for predictive performance. Feature importance analysis is performed to identify the most influential predictors. Results: The machine learning models achieve median AUROC values ranging from 0.64 to 0.66 and AUPRC values between 0.51 and 0.54, demonstrating predictive capabilities. Low antibody titers and high viral loads at hospital admission emerge as the strongest predictors of long COVID outcomes. Comorbidities—such as chronic respiratory, cardiac, and neurologic diseases—and female sex are also identified as significant risk factors. Conclusions: Machine learning models identify patients at risk for developing long COVID based on baseline clinical characteristics. These models guide early interventions, improve patient outcomes, and mitigate the long-term public health impacts of SARS-CoV-2.
UR - https://www.scopus.com/pages/publications/105031548760
UR - https://www.scopus.com/pages/publications/105031548760#tab=citedBy
U2 - 10.1038/s43856-025-01230-w
DO - 10.1038/s43856-025-01230-w
M3 - Article
C2 - 41484172
AN - SCOPUS:105031548760
SN - 2730-664X
VL - 6
JO - Communications Medicine
JF - Communications Medicine
IS - 1
M1 - 1
ER -