ABSTRACT

Background
Temporal distribution shift negatively impacts the performance of clinical prediction models over time. Pretraining foundation models using self-supervised learning on electronic health records (EHR) may be effective in acquiring informative global patterns that can improve the robustness of task-specific models.

Objective
To evaluate the utility of EHR foundation models in improving the in-distribution (ID) and out-of-distribution (OOD) performance of clinical prediction models.

Methods
The cohort consisted of adult inpatients admitted between 2009 and 2021. Gated recurrent unit (GRU)- and transformer (TRANS)-based foundation models were pretrained on the EHR of patients admitted between 2009 and 2012 and were subsequently used to construct patient representations (CLMBR). These representations were used to learn logistic regression models (CLMBR_GRU and CLMBR_TRANS) to predict hospital mortality, long length of stay, 30-day readmission, and ICU admission. We compared CLMBR_GRU and CLMBR_TRANS with baseline logistic regression models learned on count-based representations (count-LR) and with end-to-end (ETE) GRU and transformer models in ID (2009-2012) and OOD (2013-2021) year groups. Performance was measured using the area under the receiver operating characteristic curve, the area under the precision-recall curve, and absolute calibration error.

Results
Models trained on CLMBR generally showed better discrimination than count-LR in both ID and OOD year groups, and they often matched or exceeded their ETE counterparts. Finally, the foundation models' performance on the self-supervised learning task tracked closely with the ID and OOD performance of the downstream models.

Conclusions
These results suggest that pretraining foundation models on electronic health records is a useful approach for developing clinical prediction models that perform well in the presence of temporal distribution shift.
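
As a rough illustration of the downstream evaluation described in Methods, the sketch below fits a logistic regression probe on frozen patient representations (pretrained CLMBR or count-based features) and scores it on ID and OOD year groups with AUROC, AUPRC, and a binned absolute calibration error. The function names, the binned calibration estimator, and the data layout are assumptions made for illustration; this is not the authors' implementation.

    # Hypothetical sketch (not the authors' code): score a logistic regression probe
    # trained on frozen patient representations across ID and OOD year groups.
    import numpy as np
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import roc_auc_score, average_precision_score

    def absolute_calibration_error(y_true, y_prob, n_bins=10):
        """Simple binned approximation: mean absolute gap between predicted
        and observed risk, weighted by bin size (an assumption, not the
        paper's exact estimator)."""
        y_true = np.asarray(y_true, dtype=float)
        y_prob = np.asarray(y_prob, dtype=float)
        bins = np.clip((y_prob * n_bins).astype(int), 0, n_bins - 1)
        err, total = 0.0, len(y_true)
        for b in range(n_bins):
            mask = bins == b
            if mask.any():
                err += mask.sum() / total * abs(y_prob[mask].mean() - y_true[mask].mean())
        return err

    def evaluate_probe(train_repr, y_train, eval_sets):
        """Fit a logistic regression on frozen representations, then score
        each evaluation set, e.g. {"ID 2009-2012": (X, y), "OOD 2013-2021": (X, y)}."""
        clf = LogisticRegression(max_iter=1000).fit(train_repr, y_train)
        results = {}
        for name, (X, y) in eval_sets.items():
            p = clf.predict_proba(X)[:, 1]
            results[name] = {
                "AUROC": roc_auc_score(y, p),
                "AUPRC": average_precision_score(y, p),
                "ACE": absolute_calibration_error(y, p),
            }
        return results

The same routine can be run once per representation (CLMBR_GRU, CLMBR_TRANS, count-based) and per task (mortality, long length of stay, 30-day readmission, ICU admission) to reproduce the ID-versus-OOD comparison at a high level.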