TY - JOUR
T1 - A structural variation reference for medical and population genetics
AU - Genome Aggregation Database Consortium
AU - Genome Aggregation Database Production Team
AU - Collins, Ryan L.
AU - Brand, Harrison
AU - Karczewski, Konrad J.
AU - Zhao, Xuefang
AU - Alföldi, Jessica
AU - Francioli, Laurent
AU - Khera, Amit V.
AU - Lowther, Chelsea
AU - Gauthier, Laura D.
AU - Wang, Harold
AU - Watts, Nicholas A.
AU - Solomonson, Matthew
AU - O’Donnell-Luria, Anne
AU - Baumann, Alexander
AU - Munshi, Ruchi
AU - Walker, Mark
AU - Whelan, Christopher W.
AU - Huang, Yongqing
AU - Brookings, Ted
AU - Sharpe, Ted
AU - Stone, Matthew R.
AU - Valkanas, Elise
AU - Fu, Jack
AU - Tiao, Grace
AU - Laricchia, Kristen M.
AU - Ruano-Rubio, Valentin
AU - Stevens, Christine
AU - Gupta, Namrata
AU - Cusick, Caroline
AU - Margolin, Lauren
AU - Alföldi, Jessica
AU - Armean, Irina M.
AU - Banks, Eric
AU - Bergelson, Louis
AU - Cibulskis, Kristian
AU - Collins, Ryan L.
AU - Connolly, Kristen M.
AU - Covarrubias, Miguel
AU - Cummings, Beryl
AU - Daly, Mark J.
AU - Donnelly, Stacey
AU - Farjoun, Yossi
AU - Ferriera, Steven
AU - Francioli, Laurent
AU - Gabriel, Stacey
AU - Gauthier, Laura D.
AU - Gentry, Jeff
AU - Gupta, Namrata
AU - Jeandet, Thibault
AU - Erdmann, Jeanette
PY - 2020/5/28
Y1 - 2020/5/28
N2 - Structural variants (SVs) rearrange large segments of DNA1 and can have profound consequences in evolution and human disease2,3. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD)4 have become integral in the interpretation of single-nucleotide variants (SNVs)5. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25–29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage6. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings7. This SV resource is freely distributed via the gnomAD browser8 and will have broad utility in population genetics, disease-association studies, and diagnostic screening.
AB - Structural variants (SVs) rearrange large segments of DNA1 and can have profound consequences in evolution and human disease2,3. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD)4 have become integral in the interpretation of single-nucleotide variants (SNVs)5. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25–29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage6. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings7. This SV resource is freely distributed via the gnomAD browser8 and will have broad utility in population genetics, disease-association studies, and diagnostic screening.
UR - http://www.scopus.com/inward/record.url?scp=85085567000&partnerID=8YFLogxK
U2 - 10.1038/s41586-020-2287-8
DO - 10.1038/s41586-020-2287-8
M3 - Journal articles
C2 - 32461652
AN - SCOPUS:85085567000
SN - 0028-0836
VL - 581
SP - 444
EP - 451
JO - Nature
JF - Nature
IS - 7809
ER -