We investigate the large-scale organization of human genes with respect to “master” replication origins that were previously identified as bordering nucleotide compositional skew domains. We separate genes in two categories depending on their CpG enrichment at the promoter which can be considered as a marker of germline DNA methylation. Using expression data in mouse, we confirm that CpG-rich genes are highly expressed in germline whereas CpG-poor genes are in a silent state. We further show that, whether tissue-specific or broadly expressed (housekeeping genes), the CpG-rich genes are over-represented close to the replication skew domain borders suggesting some coordination of replication and transcription. We also reveal that the transcription of the longest CpG-rich genes is co-oriented with replication fork progression so that the promoter of these transcriptionally active genes be located into the accessible open chromatin environment surrounding the master replication origins that border the replication skew domains. The observation of a similar gene organization in the mouse genome confirms the interplay of replication, transcription and chromatin structure as the cornerstone of mammalian genome architecture.
Lamia Zaghloul 1, 2; Antoine Baker 1, 2; Benjamin Audit 1, 2; Alain Arneodo 1, 2
@article{CRMECA_2012__340_11-12_745_0, author = {Lamia Zaghloul and Antoine Baker and Benjamin Audit and Alain Arneodo}, title = {Gene organization inside replication domains in mammalian genomes}, journal = {Comptes Rendus. M\'ecanique}, pages = {745--757}, publisher = {Elsevier}, volume = {340}, number = {11-12}, year = {2012}, doi = {10.1016/j.crme.2012.10.023}, language = {en}, }
TY - JOUR AU - Lamia Zaghloul AU - Antoine Baker AU - Benjamin Audit AU - Alain Arneodo TI - Gene organization inside replication domains in mammalian genomes JO - Comptes Rendus. Mécanique PY - 2012 SP - 745 EP - 757 VL - 340 IS - 11-12 PB - Elsevier DO - 10.1016/j.crme.2012.10.023 LA - en ID - CRMECA_2012__340_11-12_745_0 ER -
%0 Journal Article %A Lamia Zaghloul %A Antoine Baker %A Benjamin Audit %A Alain Arneodo %T Gene organization inside replication domains in mammalian genomes %J Comptes Rendus. Mécanique %D 2012 %P 745-757 %V 340 %N 11-12 %I Elsevier %R 10.1016/j.crme.2012.10.023 %G en %F CRMECA_2012__340_11-12_745_0
Lamia Zaghloul; Antoine Baker; Benjamin Audit; Alain Arneodo. Gene organization inside replication domains in mammalian genomes. Comptes Rendus. Mécanique, Out of Equilibrium Dynamics, Volume 340 (2012) no. 11-12, pp. 745-757. doi : 10.1016/j.crme.2012.10.023. https://comptes-rendus.academie-sciences.fr/mecanique/articles/10.1016/j.crme.2012.10.023/
[1] Essential Cell Biology: An Introduction to the Molecular Biology of the Cell, Garland Publishing, 1998
[2] Replication-associated gene dosage effects shape the genomes of fast-growing bacteria but only for transcription and translation genes, Mol. Microbiol., Volume 59 (2006), pp. 1506-1518
[3] Gene essentiality determines chromosome organisation in bacteria, Nucleic Acids Res., Volume 31 (2003), pp. 6570-6577
[4] Essentiality, not expressiveness, drives gene-strand bias in bacteria, Nat. Genet., Volume 34 (2003), pp. 377-378
[5] et al. Initial sequencing and analysis of the human genomes, Nature, Volume 409 (2001), pp. 860-921
[6] Initial sequencing and comparative analysis of the mouse genome, Nature, Volume 420 (2002), pp. 520-562
[7] The mosaic genome of warm-blooded vertebrates, Science, Volume 228 (1985), pp. 953-958
[8] The distribution of genes in the human genome, Gene, Volume 100 (1991), pp. 181-187
[9] Isochores and the evolutionary genomics of vertebrates, Gene, Volume 241 (2000), pp. 3-17
[10] A unification of mosaic structures in the human genome, Hum. Mol. Genet., Volume 12 (2003), pp. 2411-2415
[11] From DNA sequence analysis to modeling replication in the human genome, Phys. Rev. Lett., Volume 94 (2005), p. 248103
[12] Replication-associated strand asymmetries in mammalian genomes: Toward detection of replication origins, Proc. Natl. Acad. Sci. USA, Volume 102 (2005), pp. 9836-9841
[13] Wavelet-based method to disentangle transcription- and replication-associated strand asymmetries in mammalian genomes, Appl. Comput. Harmon. Anal., Volume 28 (2010), pp. 150-170
[14] Multi-scale coding of genomic information: From DNA sequence to genome structure and function, Phys. Rep., Volume 498 (2011), pp. 45-188
[15] Human gene organization driven by the coordination of replication and transcription, Genome Res., Volume 17 (2007), pp. 1278-1285
[16] DNA replication timing data corroborate in silico human replication origin predictions, Phys. Rev. Lett., Volume 99 (2007), p. 248102
[17] Replication-associated mutational asymmetry in the human genome, Mol. Biol. Evol., Volume 28 (2011), pp. 2327-2337
[18] Replication fork polarity gradients revealed by megabase-sized U-shaped replication timing domains in human cell lines, PLoS Comput. Biol., Volume 8 (2012), p. e1002443
[19] Linking the DNA strand asymmetry to the spatio-temporal replication program. I. About the role of the replication fork polarity in genome evolution, Eur. Phys. E, Volume 35 (2012), p. 92
[20] A. Baker, C.-L. Chen, H. Julienne, B. Audit, Y. dʼAubenton Carafa, C. Thermes, A. Arneodo, Linking the DNA strand asymmetry to the spatio-temporal replication program. II. Accounting for neighbor-dependent substitution rates, Eur. Phys. E (2012), in press.
[21] Spontaneous emergence of sequence-dependent rosettelike folding of chromatin fiber, Phys. Rev. E, Volume 77 (2008), p. 061923
[22] Open chromatin encoded in DNA sequence is the signature of “master” replication origins in human cells, Nucleic Acids Res., Volume 37 (2009), pp. 6064-6075
[23] The UCSC genome browser database, Nucleic Acids Res., Volume 31 (2003), pp. 51-54
[24] Transcription-coupled TA and GC strand asymmetries in the human genome, FEBS Lett., Volume 555 (2003), pp. 579-582
[25] Transcription-coupled and splicing-coupled strand asymmetries in eukaryotic genomes, Nucleic Acids Res., Volume 32 (2004), pp. 4969-4978
[26] From scale invariance to deterministic chaos in DNA sequences: Towards a deterministic description of gene organization in the human genome, Physica A, Volume 342 (2004), pp. 270-280
[27] Low frequency rhythms in human DNA sequences: A key to the organization of gene location and orientation?, Phys. Rev. Lett., Volume 93 (2004), p. 108101
[28] Bifractality of human DNA strand-asymmetry profiles results from transcription, Phys. Rev. E, Volume 75 (2007), p. 032902
[29] L. Zaghloul, Transcriptional activity, chromatin state and replication timing in domains of compositional skew in the human genome, Ph.D. thesis, Université de Lyon, Ecole Normale Supérieure de Lyon, 2009.
[30] No evidence for tissue-specific adaptation of synonymous codon usage in humans, Mol. Biol. Evol., Volume 23 (2006), pp. 523-529
[31] Evolutionary origin and maintenance of coexpressed gene clusters in mammals, Mol. Biol. Evol., Volume 23 (2006), pp. 1715-1723
[32] A gene atlas of the mouse and human protein-encoding transcriptomes, Proc. Natl. Acad. Sci. USA, Volume 101 (2004), pp. 6062-6067
[33] The conserved transcriptome in human and rodent male gametogenesis, Proc. Natl. Acad. Sci. USA, Volume 104 (2007), pp. 8346-8351
[34] DNA methylation landscapes: Provocative insights from epigenomics, Nat. Rev. Genet., Volume 9 (2008), pp. 465-476
[35] Unmethylated domains in vertebrate DNA, Nucleic Acids Res., Volume 11 (1983), pp. 647-658
[36] A fraction of the mouse genome that is derived from islands of nonmethylated, CpG-rich DNA, Cell, Volume 40 (1985), pp. 91-99
[37] CpG islands in vertebrate genomes, J. Mol. Biol., Volume 196 (1987), pp. 261-282
[38] Number of CpG islands and genes in human and mouse, Proc. Natl. Acad. Sci. USA, Volume 90 (1993), pp. 11995-11999
[39] An alternative promoter in the mouse major histocompatibility complex class II I-Abeta gene: Implications for the origin of CpG islands, Mol. Cell. Biol., Volume 18 (1998), pp. 4433-4443
[40] Determinants of CpG islands: Expression in early embryo and isochore structure, Genome Res., Volume 11 (2001), pp. 1854-1860
[41] Structure, function and evolution of CpG island promoters, Cell. Mol. Life Sci., Volume 60 (2003), pp. 1647-1658
[42] Initiation of DNA replication at CpG islands in mammalian chromosomes, EMBO J., Volume 17 (1998), pp. 2426-2435
[43] CpG islands as genomic footprints of promoters that are associated with replication origins, Curr. Biol., Volume 9 (1999), p. R661-R667
[44] Genome-wide studies highlight indirect links between human replication origins and gene regulation, Proc. Natl. Acad. Sci. USA, Volume 105 (2008), pp. 15837-15842
[45] Transcription initiation activity sets replication origin efficiency in mammalian cells, PLoS Genet., Volume 5 (2009), p. e1000446
[46] The relationship between DNA replication and human genome organization, Mol. Biol. Evol., Volume 26 (2009), pp. 729-741
[47] Comprehensive analysis of CpG islands in human chromosomes 21 and 22, Proc. Natl. Acad. Sci. USA, Volume 99 (2002), pp. 3740-3745
[48] A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters, Proc. Natl. Acad. Sci. USA, Volume 103 (2006), pp. 1412-1417
[49] Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome, Nat. Genet., Volume 39 (2007), pp. 457-466
[50] A structural split in the human genome, PLoS One, Volume 2 (2007), p. e603
[51] Genetics and epigenetics: Stability and plasticity during cellular differentiation, Trends Genet., Volume 25 (2009), pp. 129-136
[52] Genome-wide analysis of mammalian promoter architecture and evolution, Nat. Genet., Volume 38 (2006), pp. 626-635
[53] CpG islands, genes and isochores in the genomes of vertebrates, Gene, Volume 106 (1991), pp. 185-195
[54] Analysis of fine-scale mammalian evolutionary breakpoints provides new insight into their relation to genome organisation, BMC Genomics, Volume 10 (2009), p. 335
[55] CpG islands as gene markers in the human genome, Genomics, Volume 13 (1992), pp. 1095-1107
[56] Selection for short introns in highly expressed genes, Nat. Genet., Volume 31 (2002), pp. 415-418
[57] Human housekeeping genes are compact, Trends Genet., Volume 19 (2003), pp. 362-365
[58] The signature of selection mediated by expression on human genes, Genome Res., Volume 13 (2003), pp. 2260-2264
[59] Statistical analysis of vertebrate sequences reveals that long genes are scarce in GC-rich isochores, J. Mol. Evol., Volume 40 (1995), pp. 308-317
[60] Ripples from neighbouring transcription, Nat. Cell Biol., Volume 10 (2008), pp. 1106-1113
[61] Evolution of chromosome organization driven by selection for reduced gene expression noise, Nat. Genet., Volume 39 (2007), pp. 945-949
[62] Lineage-specific polycomb targets and de novo DNA methylation define restriction and potential of neuronal progenitors, Mol. Cell, Volume 30 (2008), pp. 755-766
[63] Global mapping of DNA methylation in mouse promoters reveals epigenetic reprogramming of pluripotency genes, PLoS Genet., Volume 4 (2008), p. e1000116
[64] Understanding what determines the frequency and pattern of human germline mutations, Nat. Rev. Genet., Volume 10 (2009), pp. 478-488
[65] A novel CpG island set identifies tissue-specific methylation at developmental gene loci, PLoS Biol., Volume 6 (2008), p. e22
[66] Replication timing of human chromosome 6, Cell Cycle, Volume 4 (2005), pp. 172-176
[67] Predictable dynamic program of timing of DNA replication in human cells, Genome Res., Volume 19 (2009), pp. 2288-2299
[68] Impact of replication timing on non-CpG and CpG substitution rates in mammalian genomes, Genome Res., Volume 4 (2010), pp. 447-457
[69] Sequencing newly replicated DNA reveals widespread plasticity in human replication timing, Proc. Natl. Acad. Sci. USA, Volume 107 (2010), pp. 139-144
[70] Global organization of replication time zones of the mouse genome, Genome Res., Volume 18 (2008), pp. 1562-1570
[71] Global reorganization of replication domains during embryonic stem cell differentiation, PLoS Biol., Volume 6 (2008), p. e245
[72] Evolutionarily conserved replication timing profiles predict long-range chromatin interactions and distinguish closely related cell types, Genome Res., Volume 20 (2010), pp. 761-770
[73] Genome-wide dynamics of replication timing revealed by in vitro models of mouse embryogenesis, Genome Res., Volume 20 (2010), pp. 155-169
[74] Inferring where and when replication initiates from genome-wide replication timing data, Phys. Rev. Lett., Volume 108 (2012), p. 268101
[75] B. Audit, A. Baker, C.-L. Chen, A. Rappailles, G. Guilbaud, H. Julienne, A. Goldar, Y. dʼAubenton-Carafa, O. Hyrien, C. Thermes, A. Arneodo, Multi-scale analysis of genome wide replication timing profiles using a wavelet-based signal-processing algorithm, Nat. Protoc. (2012), in press.
[76] 3D chromatin conformation correlates with replication timing and is conserved in resting cells, Nucleic Acids Res., Volume 40 (2012), pp. 9470-9481
[77] Mathematical modelling of eukaryotic DNA replication, Chromosome Res., Volume 18 (2010), pp. 147-161
[78] Evidence for sequential and increasing activation of replication origins along replication timing gradients in the human genome, PLoS Comput. Biol., Volume 7 (2011), p. e1002322
Cited by Sources:
Comments - Policy