ACGT-containing element gene transcriptions

Jump to navigation Jump to search

Associate Editor(s)-in-Chief: Henry A. Hoff

The "binding affinities of both bZIP proteins were similar to CREA/T (ATGACGTCAT), a CRE sequence with flanking adenine and thymine (A/T) at positions -4 and +4. [The] bZIP domains of both STF1 and HY5 have similar binding properties for recognizing ACGT-containing elements (ACEs). [Although] the G-box is a known target site for the HY5 protein, the C-box sequences are the preferred binding sites for both STF1 and HY5."[1]

"The combination of an [ACGT-containing element] ACE and a MRE confers light responsiveness to the CFI, F3H and FLS promoters."[2]

"Upstream from the transcriptional start site, several motifs were found [...]. A typical TATA box is located at -43. The CAAT consensus sequence cannot be found between -80 and -120; however, two sequence motifs (GCGCCC, GGGCAG), which are homologous to the consensus sequences for the Spl-binding site, GGGCGG (GC box) [19] were found around -114 and -570. The GC box has been found in promoters of many viral and cellular genes [20], and acts as a binding site of a protein, Spl, which is necessary for transcriptional activity. A pyrimidine box (CCTTT) and Box I (GCAGTG) which are part of the GA response complex [21] were found at -208 and -256. Two 8 bp sequences (CACGTCGC, CACGTAAC) which are similar to an ABA response element (ABRE, CACGTGGC) [22] were located at -308, -648 relative to the + 1 site. The core sequence of the ABA response element (ACGT) is the binding site for basic leucine zipper transcriptional factors or common plant regulatory factors (CPRFs) [23]."[3]

ABA-response elements

"The ABA responsive element (ABRE) is a key cis‐regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA‐induced genes in rice aleurone cells, suggesting other ABREs may exist."[4]

"Many ABA‐inducible genes in various species contain a conserved cis‐regulatory ABA responsive element (ABRE) with the consensus sequence ACGTG(G/T)C (Hattori et al. 2002; Shen et al. 2004)."[4]

A boxes

Most bZIP proteins show high binding affinity for the ACGT motifs, which include [...] TACGTA (A box) [...].[5][6][7]

Carbohydrate response elements

"The putative ChREBP binding sites ChoRE1 (CACGTGACCGGATCTTG, -324 to -308) and ChoRE2 (TCCGCCCCCATCACGTG, -298 to - 282) were mutated into CACGTGACGGATCTTG and TCCGCCCCATCACGTG respectively, where the 5-nt spacer between the two E-boxes in ChoRE motifs were shortened to 4-nt (underlined) as previously studies showed (10,35)."[8]

The E-boxes in ChoRE1 and ChoRE2 are CACGTG and ATCTTG and TCCGCC and CACGTG.[8]

Cbf1 regulatory factors

"Cbf1 is a GRF that binds the palindromic E-box motif (CACGTG) and utilizes DNA shape to discriminate between potential binding sites (Gordan et al. 2013). [...] Like the other GRFs, almost all (88% of 113) in vivo Cbf1-bound promoter sites were also detected in vitro [...]. There were also a substantial number of low occupancy “in vitro-only sites,” typically in ORFs [...]."[9]

"Previous studies have shown that Cbf1 prefers to bind E-boxes with a “T” at the 5′ end of the E-box (Zhou and O'Shea 2011) [TCACGTG]. This manifests as a specific DNA shape flanking both sides of the palindromic core motif (Gordan et al. 2013). Our PB-exo experiments confirmed these preferences for Cbf1 in DNA sequence (Supplemental Fig. S11E) and DNA shape readout [...]. [...] While the discriminatory DNA shape (and DNA sequence) information at one end of the motif is sufficient to support binding, at the strongly bound Cbf1 motifs it is enriched at both ends ([...] with a “T” on the 5′ and “A” on the 3′ end)."[9]

C boxes

Most bZIP proteins show high binding affinity for the ACGT motifs, which include [...] GACGTC (C box) [...].[5][6][7]

C-boxes

Analysis "of the recombinant (soybean [Glycine max] TGACG-motif binding factor 1) STF1 protein revealed the C-box (nGACGTCn) to be a high-affinity binding site (Cheong et al., 1998). [...] To test whether STF1 and HY5 have similar DNA-binding properties, the binding properties of each were compared with eight different DNA sequences that represent G-, C-, and C/G-box motifs [TGACGTGT]. C-box sequences carrying the mammalian cAMP responsive element (CRE; TGACGTCA) motif and the Hex sequence (TGACGTGGC), a hybrid C/G-box (Cheong et al., 1998), were high-affinity binding sites for both proteins [...]."[1]

Copying the C-box consensus sequence 5'-nGACGTCn-3' (Cheong et al., 1998) and putting the sequence in "⌘F" finds one location between ZNF497 and A1BG or no locations between ZSCAN22 and A1BG as can be found by the computer programs.

CRE boxes

"Within the cAMP-responsive element of the somatostatin gene, we observed an 8-base palindrome, 5'-TGACGTCA-3', which is highly conserved in many other genes whose expression is regulated by cAMP."[10]

The upstream activating sequence (UAS) for the Aca1p, the basic "leucine zipper (bZIP) transcription factor [55] involved in carbon source utilization" is 5'-TGACGTCA-3'[11] the same as a CRE.

The upstream activating sequence (UAS) for the Sko1p, involved "in osmotic and oxidative stress responses" is 5'-TGACGTCA-3'[11] the same as a CRE.

G boxes

"Two distinct sequence elements, the H-box (consensus CCTACC(N)7CT) and the G-box (CACGTG), are required for stimulation of the chsl5 promoter by [p-coumaric acid] 4-CA."[12]

The "perfect palindrome 5'-GCCACGTGGC-3' which is also known as the G-box motif."[13]

"A G-box-related motif, containing the core sequence CACGTG is also present in the 5' regions of two other classes of light-responsive genes".[13]

Most bZIP proteins show high binding affinity for the ACGT motifs, which include CACGTG (G box) [...].[5][6][7]

Binding "activity to the G-box of the light-responsive unit 1 (U1) region of the parsley (Petroselinum crispum) CHS promoter (CHS-U1: TCCACGTGGC; Schulze-Lefert et al., 1989) or the G-box of GmAux28 (TCCACGTGTC) was much weaker than to the PA G-box [...]."[1]

Hypoxia response elements

"The hypoxia response element (HRE) and estrogen response element (ERE) were located on −154 to −150 "ACGTG", and −94 to −80 "AGGTTATTGCCTCCT" on the transcript, respectively."[14]

Metal responsive elements

"[T]hree potential metal response elements (MREs) [overlap] the E-boxes in the repeats, (TGCACGT with TGCRCNC being the consensus sequence; 17,18)."[15]

ORE1 binding sites

"As a transcription factor, ORE1 was reported to bind to consensus DNA sequences of [ACG][CA]GT[AG]N{5,6}[CT]AC[AG] [29] or T[TAG][GA]CGT[GA][TCA][TAG] [37]."[16]

Consensus sequences are 5'-(A/C/G)(A/C)GT(A/G)N5,6(C/T)AC(A/G)-3' or 5'-T(A/G/T)(A/G)CGT(A/G)(A/C/T)(A/G/T)-3'.[16]

Copying 5'-TTACGTG-3' in "⌘F" yields none between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

Phosphate starvation-response transcription factors

Abscisic acid-responsive elements (CACGTG).[17]

"The [palindromic E-box motif (CACGTG)] motif is bound by the transcription factor Pho4, [and has the] class of basic helix-loop-helix DNA binding domain and core recognition sequence (Zhou and O'Shea 2011)."[9]

The upstream activating sequence (UAS) for Pho4p is 5'-CAC(A/G)T(T/G)-3' in the promoters of HIS4 and PHO5 regarding phosphate limitation with respect to regulation of the purine and histidine biosynthesis pathways [66].[11]

Root specific elements

Root specific elements (TGACGTCA).[17]

Synaptic Activity-Responsive Elements

"A unique synaptic activity-responsive element (SARE) sequence, composed of the consensus binding sites for SRF, MEF2 and CREB, is necessary for control of transcriptional upregulation of the Arc gene in response to synaptic activity."[18]

"Within the cAMP-responsive element of the somatostatin gene, we observed an 8-base palindrome, 5'-TGACGTCA-3', which is highly conserved in many other genes whose expression is regulated by cAMP."[10]

T boxes

"The different inducing activities of Xbra, VegT and Eomesodermin suggest that the proteins might recognise different DNA target sequences. [...] All three proteins prove to recognise the same core sequence of TCACACCT with some differences in flanking nucleotides."[19]

Most bZIP proteins show high binding affinity for the ACGT motifs, which include [...] AACGTT (T box) [...].[5][6][7]

"Despite sequence variations within the Tbox DBD between family members, all members of the family appear to bind to the same DNA consensus sequence, TCACACCT. In several in vitro binding-site selection studies, members of the Tbox family were found to bind preferentially sequences containing two or more of these core motifs arranged in various orientations; however, the significance of such double sites in vivo is uncertain, as most Tbox target gene sites have been found to contain only a single consensus motif (18)."[20]

Z boxes

"The HY5 protein interacts with both the G- (CACGTG) and Z- (ATACGTGT) boxes of the light-regulated promoter of RbcS1A (ribulose bisphosphate carboxylase small subunit) and the CHS (chalcone synthase) genes (Ang et al., 1998; Chattopadhyay et al., 1998; Yadav et al., 2002)."[1]

Consensus sequences

"The ABRE contains the core sequence, ACGT, also known as the G‐box (Marcotte et al. 1989; Yamaguchi‐Shinozaki et al. 1990)."[4]

5'-ACGT-3'.[3]

The consensus sequence for the ACGT-containing elements (ACEs) is 5'-CACGT-3'.[2]

Hypotheses

  1. A1BG has no ACEs in either promoter.

Samplings

  1. ACGT elements, negative strand, negative direction: 24 between 150 and 4338 nts.
  2. ACGT elements, negative strand, positive direction: 2, 3'-ACGT-5' at 569, 3'-ACGT-5' at 3254.
  3. ACGT elements, positive strand, negative direction: 4, 3'-ACGT-5' at 342, 3'-ACGT-5' at 531, 3'-ACGT-5' at 1772, 3'-ACGT-5' at 4236.
  4. ACGT elements, positive strand, positive direction: 44 between 192 and 4341 nts.

ACGT-containing elements include these metal responsive elements:

  1. complement, negative strand, negative direction: 6 between 1348 and 4341 nts.
  2. complement, positive strand, negative direction: 6 between 549 and 3323 nts.
  3. inverse, negative strand, negative direction: 2, 3'-CTCACGT-5' at 1470, 3'-CACACGT-5' at 2863.
  4. inverse, positive strand, negative direction: 2, 3'-CACACGT-5' at 531, 3'-CTCACGT-5' at 1772.
  5. inverse, positive strand, positive direction: 6 between 546 and 3883 nts.

ACGT-containing elements include these cAMP response elements (CRE):

  1. negative strand in the negative direction (from ZSCAN22 to A1BG): 1, 3'-TGACGTCA-5' at 4317.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

See also

References

  1. 1.0 1.1 1.2 1.3 Young Hun Song, Cheol Min Yoo, An Pio Hong, Seong Hee Kim, Hee Jeong Jeong, Su Young Shin, Hye Jin Kim, Dae-Jin Yun, Chae Oh Lim, Jeong Dong Bahk, Sang Yeol Lee, Ron T. Nagao, Joe L. Key, and Jong Chan Hong (April 2008). "DNA-Binding Study Identifies C-Box and Hybrid C/G-Box or C/A-Box Motifs as High-Affinity Binding Sites for STF1 and LONG HYPOCOTYL5 Proteins" (PDF). Plant Physiology. 146 (4): 1862–1877. doi:10.1104/pp.107.113217. Retrieved 26 March 2019.
  2. 2.0 2.1 Ulrike Hartmann, Martin Sagasser, Frank Mehrtens, Ralf Stracke and Bernd Weisshaar (January 2005). "Differential combinatorial interactions of cis-acting elements recognized by R2R3-MYB, BZIP, and BHLH factors control light-responsive and tissue-specific activation of phenylpropanoid biosynthesis genes" (PDF). Plant Molecular Biology. 57 (2): 155–171. doi:10.1007/s11103-004-6910-0. Retrieved 10 November 2018.
  3. 3.0 3.1 Lingqiang Guan, Alexis N. Polidoros and John G. Scandalios (March 1996). "Isolation, characterization and expression of the maize Cat2 catalase gene" (PDF). Plant Molecular Biology. 30 (5): 913–24. doi:10.1007/BF00020803. Retrieved 19 April 2019.
  4. 4.0 4.1 4.2 Kenneth A. Watanabe, Arielle Homayouni, Lingkun Gu, Kuan‐Ying Huang, Tuan‐Hua David Ho, Qingxi J. Shen (18 June 2017). "Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element". Plant, Cell & Environment. 40 (9): 2004–2016. doi:10.1111/pce.13006. Retrieved 5 October 2020.
  5. 5.0 5.1 5.2 5.3 Landschulz WH, Johnson PF, McKnight SL (June 1988). "The leucine zipper: a hypothetical structure common to a new class of DNA binding proteins". Science. 240 (4860): 1759–64. Bibcode:1988Sci...240.1759L. doi:10.1126/science.3289117. PMID 3289117.
  6. 6.0 6.1 6.2 6.3 Z G E, Zhang YP, Zhou JH, Wang L (April 2014). "Mini review roles of the bZIP gene family in rice". Genetics and Molecular Research. 13 (2): 3025–36. doi:10.4238/2014.April.16.11. PMID 24782137. Vancouver style error: name (help)
  7. 7.0 7.1 7.2 7.3 Nijhawan A, Jain M, Tyagi AK, Khurana JP (February 2008). "Genomic survey and gene expression analysis of the basic leucine zipper transcription factor family in rice". Plant Physiology. 146 (2): 333–50. doi:10.1104/pp.107.112821. PMID 18065552.
  8. 8.0 8.1 Jianyin Long, Daniel L. Galvan, Koki Mise, Yashpal S. Kanwar, Li Li, Naravat Poungavrin, Paul A. Overbeek, Benny H. Chang, and Farhad R. Danesh (28 May 2020). "Role for carbohydrate response element-binding protein (ChREBP) in high glucose-mediated repression of long noncoding RNA Tug1" (PDF). Journal of Biological Chemistry. 5 (28). doi:10.1074/jbc.RA120.013228. Retrieved 6 October 2020.
  9. 9.0 9.1 9.2 Matthew J. Rossi, William K.M. Lai and B. Franklin Pugh (21 March 2018). "Genome-wide determinants of sequence-specific DNA binding of general regulatory factors". Genome Research. 28: 497–508. doi:10.1101/gr.229518.117. PMID 29563167. Retrieved 31 August 2020.
  10. 10.0 10.1 Marc R. Montminy, Kevin A. Sevarino, John A. Wagner, Gail Mandel, and Richard H. Goodman (September 1986). "Identification of a cyclic-AMP-responsive element within the rat somatostatin gene" (PDF). Proceedings of the National Academy of Sciences of the USA. 83 (18): 6382–6. PMID 2875459. Retrieved 17 September 2018.
  11. 11.0 11.1 11.2 Hongting Tang, Yanling Wu, Jiliang Deng, Nanzhu Chen, Zhaohui Zheng, Yongjun Wei, Xiaozhou Luo, and Jay D. Keasling (6 August 2020). "Promoter Architecture and Promoter Engineering in Saccharomyces cerevisiae". Metabolites. 10 (8): 320–39. doi:10.3390/metabo10080320. PMID 32781665 Check |pmid= value (help). Retrieved 18 September 2020.
  12. Gary J. Loake, Ouriel Faktor, Christopher J. Lamb, and Richard A. Dixon (October 1992). "Combination of H-box [CCTACC(N)7CT] and G-box (CACGTG) cis elements is necessary for feed-forward stimulation of a chalcone synthase promoter by the phenylpropanoid-pathway intermediate p-coumaricacid" (PDF). Proceedings of the National Academy of Sciences USA. 89: 9230–4. Retrieved 5 May 2020.
  13. 13.0 13.1 K Oeda, J Salinas, and N H Chua (July 1991). "A tobacco bZip transcription activator (TAF-1) binds to a G-box-like motif conserved in plant genes". The EMBO Journal. 10 (7): 1793–1802. PMID 2050116. Retrieved 2017-02-13.
  14. Takuya Matsumoto, Saemi Kitajima, Chisato Yamamoto, Mitsuru Aoyagi, Yoshiharu Mitoma, Hiroyuki Harada and Yuji Nagashima (9 August 2020). "Cloning and tissue distribution of the ATP-binding cassette subfamily G member 2 gene in the marine pufferfish Takifugu rubripes" (PDF). Fisheries Science. 86: 873–887. doi:10.1007/s12562-020-01451-z. Retrieved 27 September 2020.
  15. Barbara Levinson, Rebecca Conant, Rhonda Schnur, Soma Das, Seymour Packman and Jane Gitschier (1996). "A Repeated Element in the Regulatory Region of the MNK Gene and Its Deletion in A Patient With Occipital Horn Syndrome". Human Molecular Genetics. 5 (11): 1737–42. doi:10.1093/hmg/5.11.1737. Retrieved 2013-04-15.
  16. 16.0 16.1 Kai Qiu, Zhongpeng Li, Zhen Yang, Junyi Chen, Shouxin Wu, Xiaoyu Zhu, Shan Gao, Jiong Gao, Guodong Ren, Benke Kuai, and Xin Zhou (July 2015). "EIN3 and ORE1 Accelerate Degreening during Ethylene-Mediated Leaf Senescence by Directly Activating Chlorophyll Catabolic Genes in Arabidopsis". PLoS Genetics. 11 (7): e1005399. doi:10.1371/journal.pgen.1005399. PMID 26218222. Retrieved 4 October 2020.
  17. 17.0 17.1 Bhaskar Sharma & Joemar Taganna (12 June 2020). "Genome-wide analysis of the U-box E3 ubiquitin ligase enzyme gene family in tomato". Scientific Reports. 10 (9581). doi:10.1038/s41598-020-66553-1. PMID 32533036 Check |pmid= value (help). Retrieved 27 August 2020.
  18. Fernanda M. Rodríguez-Tornos, Iñigo San Aniceto, Beatriz Cubelos, Marta Nieto (31 January 2013). "Enrichment of Conserved Synaptic Activity-Responsive Element in Neuronal Genes Predicts a Coordinated Response of MEF2, CREB and SRF". PLoS ONE. 8 (1): e53848. doi:10.1371/journal.pone.0053848. PMID 23382855. Retrieved 12 November 2018.
  19. Frank L. Conlon, Lynne Fairclough, Brenda M. J. Price, Elena S. Casey and J. C. Smith (2001). "Determinants of T box protein specificity" (PDF). Development. 128 (19): 3749–3758. PMID 11585801. Retrieved 17 November 2018.
  20. Ce Feng Liu, Gabriel S. Brandt, Quyen Q. Hoang, Natalia Naumova, Vanja Lazarevic, Eun Sook Hwang, Job Dekker, Laurie H. Glimcher, Dagmar Ringe, and Gregory A. Petsko (25 October 2016). "Crystal structure of the DNA binding domain of the transcription factor T-bet suggests simultaneous recognition of distant genome sites". Proceedings of the National Academy of Sciences of the USA. 113 (43): E6572–E6581. doi:10.1073/pnas.1613914113. PMID 27791029. Retrieved 28 August 2020.

External links