TY - JOUR
T1 - Integration of Hi-C with short and long-read genome sequencing reveals the structure of germline rearranged genomes
AU - Schöpflin, Robert
AU - Melo, Uirá Souto
AU - Moeinzadeh, Hossein
AU - Heller, David
AU - Laupert, Verena
AU - Hertzberg, Jakob
AU - Holtgrewe, Manuel
AU - Alavi, Nico
AU - Klever, Marius Konstantin
AU - Jungnitsch, Julius
AU - Comak, Emel
AU - Türkmen, Seval
AU - Horn, Denise
AU - Duffourd, Yannis
AU - Faivre, Laurence
AU - Callier, Patrick
AU - Sanlaville, Damien
AU - Zuffardi, Orsetta
AU - Tenconi, Romano
AU - Kurtas, Nehir Edibe
AU - Giglio, Sabrina
AU - Prager, Bettina
AU - Latos-Bielenska, Anna
AU - Vogel, Ida
AU - Bugge, Merete
AU - Tommerup, Niels
AU - Spielmann, Malte
AU - Vitobello, Antonio
AU - Kalscheuer, Vera M.
AU - Vingron, Martin
AU - Mundlos, Stefan
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - Structural variants are a common cause of disease and contribute to a large extent to inter-individual variability, but their detection and interpretation remain a challenge. Here, we investigate 11 individuals with complex genomic rearrangements including germline chromothripsis by combining short- and long-read genome sequencing (GS) with Hi-C. Large-scale genomic rearrangements are identified in Hi-C interaction maps, allowing for an independent assessment of breakpoint calls derived from the GS methods, resulting in >300 genomic junctions. Based on a comprehensive breakpoint detection and Hi-C, we achieve a reconstruction of whole rearranged chromosomes. Integrating information on the three-dimensional organization of chromatin, we observe that breakpoints occur more frequently than expected in lamina-associated domains (LADs) and that a majority reshuffle topologically associating domains (TADs). By applying phased RNA-seq, we observe an enrichment of genes showing allelic imbalanced expression (AIG) within 100 kb around the breakpoints. Interestingly, the AIGs hit by a breakpoint (19/22) display both up- and downregulation, thereby suggesting different mechanisms at play, such as gene disruption and rearrangements of regulatory information. However, the majority of interpretable genes located 200 kb around a breakpoint do not show significant expression changes. Thus, there is an overall robustness in the genome towards large-scale chromosome rearrangements.
AB - Structural variants are a common cause of disease and contribute to a large extent to inter-individual variability, but their detection and interpretation remain a challenge. Here, we investigate 11 individuals with complex genomic rearrangements including germline chromothripsis by combining short- and long-read genome sequencing (GS) with Hi-C. Large-scale genomic rearrangements are identified in Hi-C interaction maps, allowing for an independent assessment of breakpoint calls derived from the GS methods, resulting in >300 genomic junctions. Based on a comprehensive breakpoint detection and Hi-C, we achieve a reconstruction of whole rearranged chromosomes. Integrating information on the three-dimensional organization of chromatin, we observe that breakpoints occur more frequently than expected in lamina-associated domains (LADs) and that a majority reshuffle topologically associating domains (TADs). By applying phased RNA-seq, we observe an enrichment of genes showing allelic imbalanced expression (AIG) within 100 kb around the breakpoints. Interestingly, the AIGs hit by a breakpoint (19/22) display both up- and downregulation, thereby suggesting different mechanisms at play, such as gene disruption and rearrangements of regulatory information. However, the majority of interpretable genes located 200 kb around a breakpoint do not show significant expression changes. Thus, there is an overall robustness in the genome towards large-scale chromosome rearrangements.
UR - http://www.scopus.com/inward/record.url?scp=85140915097&partnerID=8YFLogxK
U2 - 10.1038/s41467-022-34053-7
DO - 10.1038/s41467-022-34053-7
M3 - Journal articles
C2 - 36309531
AN - SCOPUS:85140915097
SN - 1751-8628
VL - 13
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 6470
ER -