TY - JOUR
T1 - Predicting AT(N) pathologies in Alzheimer’s disease from blood-based proteomic data using neural networks
AU - Zhang, Yuting
AU - Ghose, Upamanyu
AU - Buckley, Noel J.
AU - Engelborghs, Sebastiaan
AU - Sleegers, Kristel
AU - Frisoni, Giovanni B.
AU - Wallin, Anders
AU - Lleó, Alberto
AU - Popp, Julius
AU - Martinez-Lage, Pablo
AU - Legido-Quigley, Cristina
AU - Barkhof, Frederik
AU - Zetterberg, Henrik
AU - Visser, Pieter Jelle
AU - Bertram, Lars
AU - Lovestone, Simon
AU - Nevado-Holgado, Alejo J.
AU - Shi, Liu
N1 - Funding Information:
This research was conducted as part of the EMIF-AD project which has received support from the Innovative Medicines Initiative Joint Undertaking under EMIF grant agreement no. 115372, resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and EFPIA companies’ in-kind contribution. The authors declare that they have received funding from Astra Zeneca (SL) and Janssen (SL and ANH). The funders were not involved in the study design, collection, analysis, interpretation of data, the writing of this article, or the decision to submit it for publication. The DESCRIPA study was funded by the European Commission within the 5th framework program (QLRT-2001-2455). The EDAR study was funded by the European Commission within the 5th framework program (contract # 37670). The San Sebastian GAP study was partially funded by the Department of Health of the Basque Government (allocation 17.0.1.08.12.0000.2.454.01.41142.001.H). The research at VIB-CMN was funded in part by the University of Antwerp Research Fund. LS is funded by the Virtual Brain Cloud from European commission (grant no. H2020-SC1-DTH-2018-1). HZ is a Wallenberg Scholar supported by grants from the Swedish Research Council (#2018–02532), the European Research Council (#681712 and #101053962), Swedish State Support for Clinical Research (#ALFGBG-71320), the Alzheimer Drug Discovery Foundation (ADDF), United States (#201809–2016862), the AD Strategic Fund and the Alzheimer’s Association (#ADSF-21-831376-C, #ADSF-21-831381-C, and #ADSF-21-831377-C), the Bluefield Project, the Olav Thon Foundation, the Erling-Persson Family Foundation, Stiftelsen för Gamla Tjänarinnor, Hjärnfonden, Sweden (#FO2022-0270), the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement no 860197 (MIRIADE), the European Union Joint Programme – Neurodegenerative Disease Research (JPND2021-00694), the UK Dementia Research Institute at UCL (UKDRI-1003), and the Lausanne cohort was supported by grants from the Swiss National Science Foundation (SNF 320030_141179), Synapsis Foundation – Dementia Research Switzerland (grant no. 2017-PI01). This work was supported by the Centre for Artificial Intelligence in Precision Medicines of the University of Oxford and King Abdulaziz University.
Publisher Copyright:
Copyright © 2022 Zhang, Ghose, Buckley, Engelborghs, Sleegers, Frisoni, Wallin, Lleó, Popp, Martinez-Lage, Legido-Quigley, Barkhof, Zetterberg, Visser, Bertram, Lovestone, Nevado-Holgado and Shi.
PY - 2022/11/29
Y1 - 2022/11/29
N2 - Background and objective: Blood-based biomarkers represent a promising approach to help identify early Alzheimer’s disease (AD). Previous research has applied traditional machine learning (ML) to analyze plasma omics data and search for potential biomarkers, but the most modern ML methods based on deep learning has however been scarcely explored. In the current study, we aim to harness the power of state-of-the-art deep learning neural networks (NNs) to identify plasma proteins that predict amyloid, tau, and neurodegeneration (AT[N]) pathologies in AD. Methods: We measured 3,635 proteins using SOMAscan in 881 participants from the European Medical Information Framework for AD Multimodal Biomarker Discovery study (EMIF-AD MBD). Participants underwent measurements of brain amyloid β (Aβ) burden, phosphorylated tau (p-tau) burden, and total tau (t-tau) burden to determine their AT(N) statuses. We ranked proteins by their association with Aβ, p-tau, t-tau, and AT(N), and fed the top 100 proteins along with age and apolipoprotein E (APOE) status into NN classifiers as input features to predict these four outcomes relevant to AD. We compared NN performance of using proteins, age, and APOE genotype with performance of using age and APOE status alone to identify protein panels that optimally improved the prediction over these main risk factors. Proteins that improved the prediction for each outcome were aggregated and nominated for pathway enrichment and protein–protein interaction enrichment analysis. Results: Age and APOE alone predicted Aβ, p-tau, t-tau, and AT(N) burden with area under the curve (AUC) scores of 0.748, 0.662, 0.710, and 0.795. The addition of proteins significantly improved AUCs to 0.782, 0.674, 0.734, and 0.831, respectively. The identified proteins were enriched in five clusters of AD-associated pathways including human immunodeficiency virus 1 infection, p53 signaling pathway, and phosphoinositide-3-kinase–protein kinase B/Akt signaling pathway. Conclusion: Combined with age and APOE genotype, the proteins identified have the potential to serve as blood-based biomarkers for AD and await validation in future studies. While the NNs did not achieve better scores than the support vector machine model used in our previous study, their performances were likely limited by small sample size.
AB - Background and objective: Blood-based biomarkers represent a promising approach to help identify early Alzheimer’s disease (AD). Previous research has applied traditional machine learning (ML) to analyze plasma omics data and search for potential biomarkers, but the most modern ML methods based on deep learning has however been scarcely explored. In the current study, we aim to harness the power of state-of-the-art deep learning neural networks (NNs) to identify plasma proteins that predict amyloid, tau, and neurodegeneration (AT[N]) pathologies in AD. Methods: We measured 3,635 proteins using SOMAscan in 881 participants from the European Medical Information Framework for AD Multimodal Biomarker Discovery study (EMIF-AD MBD). Participants underwent measurements of brain amyloid β (Aβ) burden, phosphorylated tau (p-tau) burden, and total tau (t-tau) burden to determine their AT(N) statuses. We ranked proteins by their association with Aβ, p-tau, t-tau, and AT(N), and fed the top 100 proteins along with age and apolipoprotein E (APOE) status into NN classifiers as input features to predict these four outcomes relevant to AD. We compared NN performance of using proteins, age, and APOE genotype with performance of using age and APOE status alone to identify protein panels that optimally improved the prediction over these main risk factors. Proteins that improved the prediction for each outcome were aggregated and nominated for pathway enrichment and protein–protein interaction enrichment analysis. Results: Age and APOE alone predicted Aβ, p-tau, t-tau, and AT(N) burden with area under the curve (AUC) scores of 0.748, 0.662, 0.710, and 0.795. The addition of proteins significantly improved AUCs to 0.782, 0.674, 0.734, and 0.831, respectively. The identified proteins were enriched in five clusters of AD-associated pathways including human immunodeficiency virus 1 infection, p53 signaling pathway, and phosphoinositide-3-kinase–protein kinase B/Akt signaling pathway. Conclusion: Combined with age and APOE genotype, the proteins identified have the potential to serve as blood-based biomarkers for AD and await validation in future studies. While the NNs did not achieve better scores than the support vector machine model used in our previous study, their performances were likely limited by small sample size.
UR - http://www.scopus.com/inward/record.url?scp=85144030940&partnerID=8YFLogxK
U2 - 10.3389/fnagi.2022.1040001
DO - 10.3389/fnagi.2022.1040001
M3 - Journal articles
AN - SCOPUS:85144030940
SN - 1663-4365
VL - 14
JO - Frontiers in Aging Neuroscience
JF - Frontiers in Aging Neuroscience
M1 - 1040001
ER -