CGCG box gene transcriptions

Jump to navigation Jump to search

WikiDoc Resources for CGCG box gene transcriptions

Articles

Most recent articles on CGCG box gene transcriptions

Most cited articles on CGCG box gene transcriptions

Review articles on CGCG box gene transcriptions

Articles on CGCG box gene transcriptions in N Eng J Med, Lancet, BMJ

Media

Powerpoint slides on CGCG box gene transcriptions

Images of CGCG box gene transcriptions

Photos of CGCG box gene transcriptions

Podcasts & MP3s on CGCG box gene transcriptions

Videos on CGCG box gene transcriptions

Evidence Based Medicine

Cochrane Collaboration on CGCG box gene transcriptions

Bandolier on CGCG box gene transcriptions

TRIP on CGCG box gene transcriptions

Clinical Trials

Ongoing Trials on CGCG box gene transcriptions at Clinical Trials.gov

Trial results on CGCG box gene transcriptions

Clinical Trials on CGCG box gene transcriptions at Google

Guidelines / Policies / Govt

US National Guidelines Clearinghouse on CGCG box gene transcriptions

NICE Guidance on CGCG box gene transcriptions

NHS PRODIGY Guidance

FDA on CGCG box gene transcriptions

CDC on CGCG box gene transcriptions

Books

Books on CGCG box gene transcriptions

News

CGCG box gene transcriptions in the news

Be alerted to news on CGCG box gene transcriptions

News trends on CGCG box gene transcriptions

Commentary

Blogs on CGCG box gene transcriptions

Definitions

Definitions of CGCG box gene transcriptions

Patient Resources / Community

Patient resources on CGCG box gene transcriptions

Discussion groups on CGCG box gene transcriptions

Patient Handouts on CGCG box gene transcriptions

Directions to Hospitals Treating CGCG box gene transcriptions

Risk calculators and risk factors for CGCG box gene transcriptions

Healthcare Provider Resources

Symptoms of CGCG box gene transcriptions

Causes & Risk Factors for CGCG box gene transcriptions

Diagnostic studies for CGCG box gene transcriptions

Treatment of CGCG box gene transcriptions

Continuing Medical Education (CME)

CME Programs on CGCG box gene transcriptions

International

CGCG box gene transcriptions en Espanol

CGCG box gene transcriptions en Francais

Business

CGCG box gene transcriptions in the Marketplace

Patents on CGCG box gene transcriptions

Experimental / Informatics

List of terms related to CGCG box gene transcriptions

Editor-In-Chief: Henry A. Hoff

Arabidopsis thaliana, the thale cress or mouse-ear cress is a small flowering plant native to Eurasia and Africa. Credit: Roepers.{{free media}}

A. thaliana is a popular model organism in plant biology and genetics. For a complex multicellular eukaryote, A. thaliana has a relatively small genome of approximately 135 megabase pairs (Mbp).[1]

"The minimum DNA-binding elements are 6-bp CGCG box, (A/C/G)CGCG(C/G/T)."[2]

"AtSR1 [Arabidopsis thaliana signal-responsive genes] targets the nucleus and specifically recognizes a novel 6-bp CGCG box (A/C/G)CGCG(G/T/C). The multiple CGCG cis-elements are found in promoters of genes such as those involved in ethylene signaling, abscisic acid signaling, and light signal perception. The DNA-binding domain in AtSR1 is located on the N-terminal 146 bp where all AtSR1-related proteins share high similarity but have no similarity to other known DNA-binding proteins. The calmodulin-binding nuclear proteins isolated from wounded leaves exhibit specific CGCG box DNA binding activities. These results suggest that the AtSR gene family encodes a family of calmodulin-binding/DNA-binding proteins involved in multiple signal transduction pathways in plants."[2]

"Ca2+-mediated signaling is involved in the transduction of physical signals such as temperature, wind, touch, light, and gravity; oxidative signals such as those arising from pathogen attacks; and hormone signals such as ethylene, abscisic acid (ABA),1 gibberellins, and auxin (2-7). All these signals have been shown to trigger changes in amplitude or oscillation in cytosolic free Ca2+ level. Recently, the signal-induced nuclear free calcium changes were also observed (8). Free Ca2+ changes are sensed by a number of Ca2+-binding proteins that usually contain a common structural motif, the “EF-hand,” a helix-loop-helix structure (9). One of the best characterized Ca2+-binding proteins is calmodulin (CaM), a highly conserved and multifunctional regulatory protein in eukaryotes. Its regulatory activities are triggered by its ability to modulate the activity of a certain set of CaM-binding proteins after binding to Ca2+, and thereby generating physiological responses to various stimuli (10-15)."[2]

"The CaM-regulated basic helix-loop-helix family of transcription factors was reported in mammals, where CaM inhibits the protein-DNA interaction by competing with the DNA-binding domain in certain proteins (16)."[2]

"cis-acting elements ACGCGG/CCGCGT were present in the promoter regions of about 130 genes (more than two copies) in Arabidopsis genome."[2]

"The promoter regions are assumed to be within ∼1 kb upstream of the starting transcription site (for the known genes) or the first ATG (for the predicted genes). These genes are related to ethylene signaling (EIN3) and ABA signaling (a putative ABA responsive protein), light perception (phytochrome A, phyA), stress responsive such as the DNA repairing protein, heat shock protein, touch protein (TCH 4), and CaM-regulated ion channel. CaM genes (CaM2 andCaM3) and AtSR6 also contains CGCGcis-elements in their promoter regions."[2]

CGCG box samplings

For the Basic programs (starting with SuccessablesCGCG.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for (A/C/G)CGCG(C/G/T), and found:

  1. Negative strand, negative direction: 2, CCGCGC at 1761, GCGCGT at 161.
  2. Negative strand, positive direction: 9, CCGCGC at 1650, CCGCGG at 1437, CCGCGG at 1337, GCGCGT at 1215, ACGCGG at 971, ACGCGG at 871, GCGCGC at 683, CCGCGC at 681, GCGCGT at 543.
  3. Positive strand, negative direction: 1, GCGCGG at 1762.
  4. Positive strand, positive direction: 23, CCGCGG at 1769, ACGCGG at 1656, CCGCGT at 1550, ACGCGT at 1523, ACGCGG at 1498, ACGCGG at 1454, ACGCGT at 1414, ACGCGG at 1398, ACGCGG at 1354, ACGCGT at 1314, CCGCGT at 1298, ACGCGG at 1246, CCGCGC at 1214, ACGCGG at 1162, ACGCGG at 1078, CCGCGT at 1046, CCGCGT at 976, CCGCGT at 876, GCGCGT at 684, GCGCGC at 682, CCGCGC at 542, ACGCGG at 452, CCGCGC at 161.

The inverse complements are the same as the directs.

CGCG negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: CCGCGC at 1761, GCGCGT at 161.
  2. Positive strand, negative direction: GCGCGG at 1762.

CGCG positive direction (4050-1) distal promoters

  1. Negative strand, positive direction: CCGCGC at 1650, CCGCGG at 1437, CCGCGG at 1337, GCGCGT at 1215, ACGCGG at 971, ACGCGG at 871, GCGCGC at 683, CCGCGC at 681, GCGCGT at 543.
  2. Positive strand, positive direction: CCGCGG at 1769, ACGCGG at 1656, CCGCGT at 1550, ACGCGT at 1523, ACGCGG at 1498, ACGCGG at 1454, ACGCGT at 1414, ACGCGG at 1398, ACGCGG at 1354, ACGCGT at 1314, CCGCGT at 1298, ACGCGG at 1246, CCGCGC at 1214, ACGCGG at 1162, ACGCGG at 1078, CCGCGT at 1046, CCGCGT at 976, CCGCGT at 876, GCGCGT at 684, GCGCGC at 682, CCGCGC at 542, ACGCGG at 452, CCGCGC at 161.

CGCG box random dataset samplings

  1. CGCGr0: 5, GCGCGC at 3885, GCGCGC at 3238, GCGCGG at 3116, GCGCGG at 2069, CCGCGG at 1893.
  2. CGCGr1: 3, CCGCGG at 3800, GCGCGG at 2791, ACGCGG at 1299.
  3. CGCGr2: 13, CCGCGG at 3874, CCGCGG at 3564, CCGCGG at 3370, ACGCGT at 3086, CCGCGT at 2847, GCGCGT at 2588, ACGCGC at 2214, ACGCGG at 1931, CCGCGC at 1818, CCGCGT at 1807, CCGCGC at 1535, GCGCGC at 1151, CCGCGT at 418.
  4. CGCGr3: 8, ACGCGG at 3721, CCGCGC at 2377, ACGCGT at 2218, ACGCGT at 2211, ACGCGG at 1763, GCGCGC at 650, ACGCGT at 588, GCGCGG at 8.
  5. CGCGr4: 7, ACGCGT at 4517, ACGCGT at 4441, ACGCGG at 4018, CCGCGG at 2117, ACGCGC at 1939, ACGCGG at 565, CCGCGC at 18.
  6. CGCGr5: 7, CCGCGC at 4412, ACGCGT at 4313, CCGCGT at 3325, CCGCGC at 598, CCGCGG at 400, ACGCGG at 206, ACGCGG at 105.
  7. CGCGr6: 8, GCGCGG at 4386, ACGCGC at 4305, CCGCGC at 4275, GCGCGC at 2071, GCGCGG at 1968, CCGCGC at 1281, ACGCGC at 979, GCGCGG at 319.
  8. CGCGr7: 6, CCGCGC at 4215, CCGCGC at 3956, ACGCGT at 3560, CCGCGC at 2991, CCGCGG at 2747, ACGCGG at 985.
  9. CGCGr8: 2, ACGCGG at 4216, CCGCGT at 1573.
  10. CGCGr9: 9, ACGCGG at 3892, CCGCGG at 3885, CCGCGT at 3382, ACGCGG at 2881, ACGCGT at 2118, CCGCGC at 1954, GCGCGT at 1769, GCGCGC at 1767, ACGCGC at 683.

CGCGr arbitrary (evens) (4560-2846) UTRs

  1. CGCGr0: GCGCGC at 3885, GCGCGC at 3238, GCGCGG at 3116.
  2. CGCGr2: CCGCGG at 3874, CCGCGG at 3564, CCGCGG at 3370, ACGCGT at 3086, CCGCGT at 2847.
  3. CGCGr4: ACGCGT at 4517, ACGCGT at 4441, ACGCGG at 4018.
  4. CGCGr6: GCGCGG at 4386, ACGCGC at 4305, CCGCGC at 4275.
  5. CGCGr8: ACGCGG at 4216.

CGCGr alternate (odds) (4560-2846) UTRs

  1. CGCGr1: CCGCGG at 3800.
  2. CGCGr3: ACGCGG at 3721.
  3. CGCGr5: CCGCGC at 4412, ACGCGT at 4313, CCGCGT at 3325.
  4. CGCGr7: CCGCGC at 4215, CCGCGC at 3956, ACGCGT at 3560, CCGCGC at 2991.
  5. CGCGr9: ACGCGG at 3892, CCGCGG at 3885, CCGCGT at 3382, ACGCGG at 2881.

CGCGr arbitrary positive direction (odds) (4445-4265) core promoters

  1. CGCGr5: CCGCGC at 4412, ACGCGT at 4313.

CGCGr alternate positive direction (evens) (4445-4265) core promoters

  1. CGCGr4: ACGCGT at 4441.
  2. CGCGr6: GCGCGG at 4386, ACGCGC at 4305, CCGCGC at 4275.

CGCGr alternate negative direction (odds) (2811-2596) proximal promoters

  1. CGCGr1: GCGCGG at 2791.
  2. CGCGr7: CCGCGG at 2747.

CGCGr arbitrary positive direction (odds) (4265-4050) proximal promoters

  1. CGCGr7: CCGCGC at 4215.

CGCGr alternate positive direction (evens) (4265-4050) proximal promoters

  1. CGCGr8: ACGCGG at 4216.

CGCGr arbitrary negative direction (evens) (2596-1) distal promoters

  1. CGCGr0: GCGCGG at 2069, CCGCGG at 1893.
  2. CGCGr2: GCGCGT at 2588, ACGCGC at 2214, ACGCGG at 1931, CCGCGC at 1818, CCGCGT at 1807, CCGCGC at 1535, GCGCGC at 1151, CCGCGT at 418.
  3. CGCGr4: CCGCGG at 2117, ACGCGC at 1939, ACGCGG at 565, CCGCGC at 18.
  4. CGCGr6: GCGCGC at 2071, GCGCGG at 1968, CCGCGC at 1281, ACGCGC at 979, GCGCGG at 319.
  5. CGCGr8: CCGCGT at 1573.

CGCGr alternate negative direction (odds) (2596-1) distal promoters

  1. CGCGr1: ACGCGG at 1299.
  2. CGCGr3: CCGCGC at 2377, ACGCGT at 2218, ACGCGT at 2211, ACGCGG at 1763, GCGCGC at 650, ACGCGT at 588, GCGCGG at 8.
  3. CGCGr5: CCGCGC at 598, CCGCGG at 400, ACGCGG at 206, ACGCGG at 105.
  4. CGCGr7: ACGCGG at 985.
  5. CGCGr9: ACGCGT at 2118, CCGCGC at 1954, GCGCGT at 1769, GCGCGC at 1767, ACGCGC at 683.

CGCGr arbitrary positive direction (odds) (4050-1) distal promoters

  1. CGCGr1: CCGCGG at 3800, GCGCGG at 2791, ACGCGG at 1299.
  2. CGCGr3: ACGCGG at 3721, CCGCGC at 2377, ACGCGT at 2218, ACGCGT at 2211, ACGCGG at 1763, GCGCGC at 650, ACGCGT at 588, GCGCGG at 8.
  3. CGCGr5: CCGCGT at 3325, CCGCGC at 598, CCGCGG at 400, ACGCGG at 206, ACGCGG at 105.
  4. CGCGr7: CCGCGC at 3956, ACGCGT at 3560, CCGCGC at 2991, CCGCGG at 2747, ACGCGG at 985.
  5. CGCGr9: ACGCGG at 3892, CCGCGG at 3885, CCGCGT at 3382, ACGCGG at 2881, ACGCGT at 2118, CCGCGC at 1954, GCGCGT at 1769, GCGCGC at 1767, ACGCGC at 683.

CGCGr alternate positive direction (evens) (4050-1) distal promoters

  1. CGCGr0: GCGCGC at 3885, GCGCGC at 3238, GCGCGG at 3116, GCGCGG at 2069, CCGCGG at 1893.
  2. CGCGr2: CCGCGG at 3874, CCGCGG at 3564, CCGCGG at 3370, ACGCGT at 3086, CCGCGT at 2847, GCGCGT at 2588, ACGCGC at 2214, ACGCGG at 1931, CCGCGC at 1818, CCGCGT at 1807, CCGCGC at 1535, GCGCGC at 1151, CCGCGT at 418.
  3. CGCGr4: ACGCGG at 4018, CCGCGG at 2117, ACGCGC at 1939, ACGCGG at 565, CCGCGC at 18.
  4. CGCGr6: GCGCGC at 2071, GCGCGG at 1968, CCGCGC at 1281, ACGCGC at 979, GCGCGG at 319.
  5. CGCGr8: CCGCGT at 1573.

CGCG box analysis and results

All of the real CGCG boxes found are more closely associated with ZSCAN22 or ZNF497 than A1BG.

The real promoters only have three CGCG boxes closer to ZSCAN22 (for an occurrence of 1.5) or thirty-two closer to ZNF497 than A1BG (for an occurrence of 16.0).

The random datasets had fifteen CGCG boxes in the A1BG UTR on the ZSCAN22 side for an occurrence of 3.0.

The random datasets had two CGCG boxes in the core promoter (arbitrary positive direction) for an occurrences of 0.2.

In the proximal promoters the random datasets had one (positive direction) for an occurrence of 0.1.

The distal promoters had twenty sequences in the arbitrary negative direction some closer to A1BG some to ZSCAN22 for an occurrence of 4.0. There were thirty sequences in the arbitrary positive direction some closer to A1BG than to ZNF497 for an occurrence of 6.0.

The disparity between the real consensus sequences regarding their occurrences closer to the zinc fingers than A1BG and between the values of occurrences in the distal promoters 1.5 in the negative direction vs 4.0 and 16.0 vs 6.0 in the positive direction suggests that the real consensus sequences are likely active or activable probably for the respective zinc fingers.

"The minimum DNA-binding elements are 6-bp CGCG box, (A/C/G)CGCG(C/G/T)."[2]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 0 2 0 0
Randoms UTR arbitrary negative 15 10 1.5 1.4
Randoms UTR alternate negative 13 10 1.3 1.4
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 2 10 0.2 0.3
Randoms Core alternate negative 4 10 0.4 0.3
Reals Core positive 0 2 0 0
Randoms Core arbitrary positive 0 10 0 0
Randoms Core alternate positive 0 10 0 0
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 0 10 0 0.1
Randoms Proximal alternate negative 2 10 0.2 0.1
Reals Proximal positive 0 2 0 0
Randoms Proximal arbitrary positive 1 10 0.1 0.1
Randoms Proximal alternate positive 1 10 0.1 0.1
Reals Distal negative 3 2 1.5 1.5
Randoms Distal arbitrary negative 20 10 2.0 1.9
Randoms Distal alternate negative 18 10 1.8 1.9
Reals Distal positive 32 2 16 16 ± 7 (-+9,++23)
Randoms Distal arbitrary positive 30 10 3 2.95
Randoms Distal alternate positive 29 10 2.9 2.95

Comparison:

The occurrences of real CGCG negative distals are less than the randoms and the positive distals are greater than the randoms. This suggests that the real CGCGs are likely active or activable.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

See also

References

  1. Genome Assembly. The Arabidopsis Information Resource. Retrieved 29 March 2016.
  2. 2.0 2.1 2.2 2.3 2.4 2.5 2.6 Tianbao Yang and B. W. Poovaiah (22 November 2002). "A calmodulin-binding/CGCG box DNA-binding protein family involved in multiple signaling pathways in plants". Journal of Biological Chemistry. 277 (47): 45049–45058. doi:10.1074/jbc.M207941200. Retrieved 2017-02-05.

External links