Zur Hauptnavigation wechseln Zur Suche wechseln Zum Hauptinhalt wechseln

Massively parallel characterization of transcriptional regulatory elements

Vikram Agarwal*, Fumitaka Inoue, Max Schubach, Dmitry Penzar, Beth K. Martin, Pyaree Mohan Dash, Pia Keukeleire, Zicong Zhang, Ajuni Sohota, Jingjing Zhao, Ilias Georgakopoulos-Soares, William S. Noble, Galip Gürkan Yardımcı, Ivan V. Kulakovskiy, Martin Kircher, Jay Shendure*, Nadav Ahituv*

*Korrespondierende/r Autor/-in für diese Arbeit

Abstract

The human genome contains millions of candidate cis-regulatory elements (cCREs) with cell-type-specific activities that shape both health and many disease states1. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these cCREs. Here we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of more than 680,000 sequences, representing an extensive set of annotated cCREs among three cell types (HepG2, K562 and WTC11), and found that 41.7% of these sequences were active. By testing sequences in both orientations, we find promoters to have strand-orientation biases and their 200-nucleotide cores to function as non-cell-type-specific ‘on switches’ that provide similar expression levels to their associated gene. By contrast, enhancers have weaker orientation biases, but increased tissue-specific characteristics. Utilizing our lentiMPRA data, we develop sequence-based models to predict cCRE function and variant effects with high accuracy, delineate regulatory motifs and model their combinatorial effects. Testing a lentiMPRA library encompassing 60,000 cCREs in all three cell types further identified factors that determine cell-type specificity. Collectively, our work provides an extensive catalogue of functional CREs in three widely used cell lines and showcases how large-scale functional measurements can be used to dissect regulatory grammar.

OriginalspracheEnglisch
Aufsatznummer219
ZeitschriftNature
Jahrgang639
Ausgabenummer8054
Seiten (von - bis)411-420
Seitenumfang10
ISSN0028-0836
DOIs
PublikationsstatusVeröffentlicht - 13.03.2025

Fördermittel

TrägerTrägernummer
Howard Hughes Medical Institute
Ministry of Education, Culture, Sports, Science and Technology
National Human Genome Research InstituteUM1HG011966, U24 HG009446, UM1HG009408
RSF Social Finance20-74-10075
NRSA NIH5T32HL007093
MSHERF075-15-2021-1344
Deutsche Forschungsgemeinschaft464313370

    UN SDGs

    Dieser Output leistet einen Beitrag zu folgendem(n) Ziel(en) für nachhaltige Entwicklung

    1. SDG 3 – Gesundheit und Wohlergehen
      SDG 3 – Gesundheit und Wohlergehen

    Strategische Forschungsbereiche und Zentren

    • Querschnittsbereich: Medizinische Genetik

    DFG-Fachsystematik

    • 2.22-03 Humangenetik

    Fingerprint

    Untersuchen Sie die Forschungsthemen von „Massively parallel characterization of transcriptional regulatory elements“. Zusammen bilden sie einen einzigartigen Fingerprint.

    Zitieren