TY - JOUR
T1 - STIGMA
T2 - Single-cell tissue-specific gene prioritization using machine learning
AU - Balachandran, Saranya
AU - Prada-Medina, Cesar A.
AU - Mensah, Martin A.
AU - Kakar, Naseebullah
AU - Nagel, Inga
AU - Pozojevic, Jelena
AU - Audain, Enrique
AU - Hitz, Marc Phillip
AU - Kircher, Martin
AU - Sreenivasan, Varun K.A.
AU - Spielmann, Malte
N1 - Publisher Copyright:
© 2023 The Authors
Copyright © 2023 The Authors. Published by Elsevier Inc. All rights reserved.
PY - 2024/1
Y1 - 2024/1
N2 - Clinical exome and genome sequencing have revolutionized the understanding of human disease genetics. Yet many genes remain functionally uncharacterized, complicating the establishment of causal disease links for genetic variants. While several scoring methods have been devised to prioritize these candidate genes, these methods fall short of capturing the expression heterogeneity across cell subpopulations within tissues. Here, we introduce single-cell tissue-specific gene prioritization using machine learning (STIGMA), an approach that leverages single-cell RNA-seq (scRNA-seq) data to prioritize candidate genes associated with rare congenital diseases. STIGMA prioritizes genes by learning the temporal dynamics of gene expression across cell types during healthy organogenesis. To assess the efficacy of our framework, we applied STIGMA to mouse limb and human fetal heart scRNA-seq datasets. In a cohort of individuals with congenital limb malformation, STIGMA prioritized 469 variants in 345 genes, with UBA2 as a notable example. For congenital heart defects, we detected 34 genes harboring nonsynonymous de novo variants (nsDNVs) in two or more individuals from a set of 7,958 individuals, including the ortholog of Prdm1, which is associated with hypoplastic left ventricle and hypoplastic aortic arch. Overall, our findings demonstrate that STIGMA effectively prioritizes tissue-specific candidate genes by utilizing single-cell transcriptome data. The ability to capture the heterogeneity of gene expression across cell populations makes STIGMA a powerful tool for the discovery of disease-associated genes and facilitates the identification of causal variants underlying human genetic disorders.
AB - Clinical exome and genome sequencing have revolutionized the understanding of human disease genetics. Yet many genes remain functionally uncharacterized, complicating the establishment of causal disease links for genetic variants. While several scoring methods have been devised to prioritize these candidate genes, these methods fall short of capturing the expression heterogeneity across cell subpopulations within tissues. Here, we introduce single-cell tissue-specific gene prioritization using machine learning (STIGMA), an approach that leverages single-cell RNA-seq (scRNA-seq) data to prioritize candidate genes associated with rare congenital diseases. STIGMA prioritizes genes by learning the temporal dynamics of gene expression across cell types during healthy organogenesis. To assess the efficacy of our framework, we applied STIGMA to mouse limb and human fetal heart scRNA-seq datasets. In a cohort of individuals with congenital limb malformation, STIGMA prioritized 469 variants in 345 genes, with UBA2 as a notable example. For congenital heart defects, we detected 34 genes harboring nonsynonymous de novo variants (nsDNVs) in two or more individuals from a set of 7,958 individuals, including the ortholog of Prdm1, which is associated with hypoplastic left ventricle and hypoplastic aortic arch. Overall, our findings demonstrate that STIGMA effectively prioritizes tissue-specific candidate genes by utilizing single-cell transcriptome data. The ability to capture the heterogeneity of gene expression across cell populations makes STIGMA a powerful tool for the discovery of disease-associated genes and facilitates the identification of causal variants underlying human genetic disorders.
UR - http://www.scopus.com/inward/record.url?scp=85183512103&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/248abff6-279b-3a7c-9836-8bcc350c3add/
U2 - 10.1016/j.ajhg.2023.12.011
DO - 10.1016/j.ajhg.2023.12.011
M3 - Journal articles
C2 - 38228144
AN - SCOPUS:85183512103
SN - 0002-9297
VL - 111
SP - 338
EP - 349
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 2
ER -