ABA-response element gene transcriptions

Jump to navigation Jump to search

Associate Editor(s)-in-Chief: Henry A. Hoff

"The key cis-elements in [non-yellow coloring 1] NYC1 promoter, namely, ABA-response element (ABRE) (ACGTG), ACGT, GCCcore (GCCGCC), and ethylene-inducible 3 [EIN3-Like1] (EIN3)/EIL1-binding sequence (T[TAG][GA]CGT[GA][TCA][TAG]), can be targeted by ABA insensitive 3 (ABI3), ABI5, and ABF2, 3, 4 in the ABA-signaling pathway [60,61]. GCCGCC and EIN3/EIL1-binding sequence (T[TAG][GA]CGT[GA][TCA][TAG]) are induced by ethylene-inducible TF and EIN3/EIL1 in the ethylene signaling pathway [61]. Therefore, ABA signaling is crucial for [chlorophyll] Chl b reductase activities to catalyze the Chl degradation, the first part of leaf senescence."[1]

"The hypoxia response element (HRE) and estrogen response element (ERE) were located on −154 to −150 "ACGTG", and −94 to −80 "AGGTTATTGCCTCCT" on the transcript, respectively."[2]

Consensus sequences

"The ABA responsive element (ABRE) is a key cis‐regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA‐induced genes in rice aleurone cells, suggesting other ABREs may exist."[3]

"Many ABA‐inducible genes in various species contain a conserved cis‐regulatory ABA responsive element (ABRE) with the consensus sequence ACGTG(G/T)C (Hattori et al. 2002; Shen et al. 2004)."[3]

"The [novel ABRE] ABREN, [GATCGATC], had the highest average score and highest average sites in the 1 kb upstream of the start codon of the highly ABA‐induced genes in aleurone. The CGATCGAT motif [...] was identified by Bioprospector more times, but the average score and number of sites were lower. This sequence is almost identical to the ABREN but with a leading cytosine. The ABRE [ACGTGTCC] was the third highest scoring motif."[3]

Using the ABREN (GATCGATC), the CGATCGAT motif, and the ABRE (ACGTGTCC)[3] suggests a general consensus sequence of 5'-(A/C/G)(A/C/G)(A/G/T)(C/T)(C/G)(A/G/T)(A/C/T)(C/T)-3' which would allow 1944 combinations including the principal three variations. As the ABREN has the repeat pattern "GATC" and the CGATCGAT motif is similar with "CGAT", looking for the smaller GATCGAT may be more productive, while the ABRE as "ACGTGTCC" can be searched separately.

The more reduced consensus sequence 5'-GATCGAT-3' may be a better choice while the ABRE 5'-ACGTGTCC-3' could be examined separately.

Hypotheses

  1. A1BG has no ABA-response elements in either promoter.
  2. A1BG is not transcribed by an ABA-response element.
  3. ABA-response elements do not participate in the transcription of A1BG.

ABREN samplings

Copying 5'-CCGTCTCC-3' in "⌘F" yields two between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

Copying 5'-CAGCCACC-3' in "⌘F" yields none between ZSCAN22 and A1BG and one between ZNF497 and A1BG as can be found by the computer programs.

The last two above could be combined as 5'-C(A/C)G(C/T)C(A/T)CC-3' as another possible consensus sequence.

Copying 5'-GATCGAT-3' in "⌘F" yields none between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

Copying 5'-T[TAG][GA]CGT[GA][TCA][TAG]-3' is 5'-T(A/G/T)(A/G)CGT(A/G)(A/C/T)(A/G/T)-3'.

For the Basic programs testing consensus sequence 3'-GATCGAT-5' (starting with SuccessablesABA.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABA--.bas, looking for 3'-GATCGAT-5', 0.
  2. negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABA-+.bas, looking for 3'-GATCGAT-5', 0.
  3. positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABA+-.bas, looking for 3'-GATCGAT-5', 0.
  4. positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABA++.bas, looking for 3'-GATCGAT-5', 0.
  5. complement, negative strand, negative direction is SuccessablesABAc--.bas, looking for 3'-CTAGCTA-5', 0.
  6. complement, negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAc-+.bas, looking for 3'-CTAGCTA-5', 0.
  7. complement, positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAc+-.bas, looking for 3'-CTAGCTA-5', 0.
  8. complement, positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAc++.bas, looking for 3'-CTAGCTA-5', 0.
  9. inverse complement, negative strand, negative direction is SuccessablesABAci--.bas, looking for 3'-ATCGATC-5', 0.
  10. inverse complement, negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAci-+.bas, looking for 3'-ATCGATC-5', 0.
  11. inverse complement, positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAci+-.bas, looking for 3'-ATCGATC-5', 0.
  12. inverse complement, positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAci++.bas, looking for 3'-ATCGATC-5', 0.
  13. inverse negative strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAi--.bas, looking for 3'-TAGCTAG-5', 0.
  14. inverse negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAi-+.bas, looking for 3'-TAGCTAG-5', 0.
  15. inverse positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAi+-.bas, looking for 3'-TAGCTAG-5', 0.
  16. inverse positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAi++.bas, looking for 3'-TAGCTAG-5', 0.

ABRE samplings

Copying 5'-ACGTG-3' in "⌘F" yields eight between ZSCAN22 and A1BG and one between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence 3'-ACGTG(G/T)C-5' (starting with SuccessablesABA.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABA--.bas, looking for 3'-ACGTG(G/T)C-5', 0.
  2. negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABA-+.bas, looking for 3'-ACGTG(G/T)C-5', 0.
  3. positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABA+-.bas, looking for 3'-ACGTG(G/T)C-5', 1, 3'-ACGTGGC-5', 4239.
  4. positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABA++.bas, looking for 3'-ACGTG(G/T)C-5', 2, 3'-ACGTGTC-5', 1823, 3'-ACGTGGC-5', 4344.
  5. complement, negative strand, negative direction is SuccessablesABAc--.bas, looking for 3'-TGCAC(A/C)G-5', 1, 3'-TGCACCG-5', 4239.
  6. complement, negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAc-+.bas, looking for 3'-TGCAC(A/C)G-5', 2, 3'-TGCACAG-5', 1823, 3'-TGCACCG-5', 4344.
  7. complement, positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAc+-.bas, looking for 3'-TGCAC(A/C)G-5', 0.
  8. complement, positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAc++.bas, looking for 3'-TGCAC(A/C)G-5', 0.
  9. inverse complement, negative strand, negative direction is SuccessablesABAci--.bas, looking for 3'-G(A/C)CACGT-5', 1, 3'-GACACGT-5', 3429.
  10. inverse complement, negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAci-+.bas, looking for 3'-G(A/C)CACGT-5', 0.
  11. inverse complement, positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAci+-.bas, looking for 3'-G(A/C)CACGT-5', 0.
  12. inverse complement, positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAci++.bas, looking for 3'-G(A/C)CACGT-5', 1, 3'-GACACGT-5', 2960.
  13. inverse negative strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAi--.bas, looking for 3'-C(G/T)GTGCA-5', 0.
  14. inverse negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAi-+.bas, looking for 3'-C(G/T)GTGCA-5', 1, 3'-CTGTGCA-5', 2960.
  15. inverse positive strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesABAi+-.bas, looking for 3'-C(G/T)GTGCA-5', 1, 3'-CTGTGCA-5', 3429.
  16. inverse positive strand in the positive direction (from ZNF497 to A1BG) is SuccessablesABAi++.bas, looking for 3'-C(G/T)GTGCA-5', 0.

UTR ABRE promoters

  1. Negative strand, negative direction: GACACGT at 3429.
  1. Positive strand, negative direction: ACGTGGC at 4239.

ABRE Core promoters

Positive strand, positive direction: ACGTGGC at 4344.

ABRE Distal promoters

Positive strand, positive direction: GACACGT at 2960, ACGTGTC at 1823.

ABRE random dataset samplings

  1. ABREr0: 0.
  2. ABREr1: 0.
  3. ABREr2: 1, ACGTGTC at 1596.
  4. ABREr3: 1, ACGTGGC at 1439.
  5. ABREr4: 0.
  6. ABREr5: 0.
  7. ABREr6: 1, ACGTGTC at 2907.
  8. ABREr7: 1, ACGTGGC at 3536.
  9. ABREr8: 1, ACGTGTC at 916.
  10. ABREr9: 0.
  11. ABREr0ci: 0.
  12. ABREr1ci: 3, GACACGT at 3788, GCCACGT at 3298, GCCACGT at 191.
  13. ABREr2ci: 0.
  14. ABREr3ci: 2, GCCACGT at 4080, GCCACGT at 1482.
  15. ABREr4ci: 1, GACACGT at 2286.
  16. ABREr5ci: 0.
  17. ABREr6ci: 0.
  18. ABREr7ci: 1, GACACGT at 1391.
  19. ABREr8ci: 1, GCCACGT at 897.
  20. ABREr9ci: 1, GCCACGT at 874.

ABREr UTRs

  1. ABREr6: ACGTGTC at 2907.

ABREr proximal promoters

  1. ABREr3ci: GCCACGT at 4080.

ABREr distal promoters

  1. ABREr2: ACGTGTC at 1596.
  2. ABREr8: ACGTGTC at 916.
  3. ABREr4ci: GACACGT at 2286.
  4. ABREr8ci: GCCACGT at 897.


  1. ABREr3: ACGTGGC at 1439.
  2. ABREr7: ACGTGGC at 3536.
  3. ABREr1ci: GACACGT at 3788, GCCACGT at 3298, GCCACGT at 191.
  4. ABREr3ci: GCCACGT at 1482.
  5. ABREr7ci: GACACGT at 1391.
  6. ABREr9ci: GCCACGT at 874.

ABRE analysis and results

"The ABA responsive element (ABRE) is a key cis‐regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA‐induced genes in rice aleurone cells, suggesting other ABREs may exist."[3]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 2 2 1.0 1.0 ± 0 (--1,+-1)
Randoms UTR arbitrary negative 1 10 0.1 0.25 ± 0.15
Randoms UTR alternate negative 4 10 0.4 0.25 ± 0.15
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 1 2 0.5 0.5 ± 0.5 (-+0,++1)
Randoms Core arbitrary positive 0 10 0 0
Randoms Core alternate positive 0 10 0 0
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 0 10 0 0
Randoms Proximal alternate negative 0 10 0 0
Reals Proximal positive 0 2 0 0
Randoms Proximal arbitrary positive 1 10 0.1 0.05 ± 0.05
Randoms Proximal alternate positive 0 10 0 0.05 ± 0.05
Reals Distal negative 0 2 0 0
Randoms Distal arbitrary negative 4 10 0.4 0.45 ± 0.05
Randoms Distal alternate negative 5 10 0.5 0.45 ± 0.05
Reals Distal positive 2 2 1 1 ± 1 (-+0,++1)
Randoms Distal arbitrary positive 8 10 0.8 0.65 ± 0.15
Randoms Distal alternate positive 5 10 0.5 0.65 ± 0.15

Comparison:

The occurrences of real UTR, core and distal ABREs are greater than the randoms. This suggests that the real ABREs are likely active or activable.

ACGTGTCC samplings

For the Basic programs testing consensus sequence 3'-ACGTGTCC-5' (starting with SuccessablesABA.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction is SuccessablesABA--.bas, looking for 3'-ACGTGTCC-5', 0.
  2. negative strand, positive direction is SuccessablesABA-+.bas, looking for 3'-ACGTGTCC-5', 0.
  3. positive strand, negative direction is SuccessablesABA+-.bas, looking for 3'-ACGTGTCC-5', 0.
  4. positive strand, positive direction is SuccessablesABA++.bas, looking for 3'-ACGTGTCC-5', 0.
  5. complement, negative strand, negative direction is SuccessablesABAc--.bas, looking for 3'-TGCACAGG-5', 0.
  6. complement, negative strand, positive direction is SuccessablesABAc-+.bas, looking for 3'-TGCACAGG-5', 0.
  7. complement, positive strand, negative direction is SuccessablesABAc+-.bas, looking for 3'-TGCACAGG-5', 0.
  8. complement, positive strand, positive direction is SuccessablesABAc++.bas, looking for 3'-TGCACAGG-5', 0.
  9. inverse complement, negative strand, negative direction is SuccessablesABAci--.bas, looking for 3'-GGACACGT-5', 0.
  10. inverse complement, negative strand, positive direction is SuccessablesABAci-+.bas, looking for 3'-GGACACGT-5', 0.
  11. inverse complement, positive strand, negative direction is SuccessablesABAci+-.bas, looking for 3'-GGACACGT-5', 0.
  12. inverse complement, positive strand, positive direction is SuccessablesABAci++.bas, looking for 3'-GGACACGT-5', 0.
  13. inverse negative strand, negative direction is SuccessablesABAi--.bas, looking for 3'-CCTGTGCA-5', 0.
  14. inverse negative strand, positive direction is SuccessablesABAi-+.bas, looking for 3'-CCTGTGCA-5', 0.
  15. inverse positive strand, negative direction is SuccessablesABAi+-.bas, looking for 3'-CCTGTGCA-5', 0.
  16. inverse positive strand, positive direction is SuccessablesABAi++.bas, looking for 3'-CCTGTGCA-5', 0.

Discussion

Abscisic acid (ABA) response elements (ABREs) have been identified for example in the UTR for A1BG between ZSCAN22 and A1BG. If these response elements are active then A1BG can be transcribed as a key cis-regulatory element in ABA signaling. "However, for ABA responsive transcription to occur, a single copy of the ABRE is not sufficient. In barley, the combination of an ABRE and one of two known coupling elements CE1 (TGCCACCGG) and CE3 (GCGTGTC) constitutes an ABA responsive complex (ABRC) in the regulation of the ABA‐inducible genes HVA1 and HVA22 (Shen and Ho 1995; Shen et al. 1996). It was also shown that a pair of ABREs can function as an ABRC with the second ABRE playing the role of the coupling element in rice (Hobo et al. 1999), barley (Shen et al. 1996) and Arabidopsis (Nakashima et al. 2006). Coupling of two CE3s is much less active in conferring ABA response to the minimal promoter (Shen et al. 2004). Interestingly, CE3 appears to be specific to monocots. In Arabidopsis, the CE3 element is practically absent; thus, Arabidopsis relies on paired ABREs to form ABRCs (Gomez‐Porras et al. 2007) or on the coupling of a DRE (TACCGACAT) with ABRE (Narusaka et al. 2003; Nakashima et al. 2006)."[3]

No coupling elements occur in the negative direction between ZSCAN22 and A1BG. Two ABREs occur in both directions suggesting pairs may be available to function as an ABRC on either side of A1BG, subject to needed proximity. The random datasets have more occurrences further away from A1BG and always much fewer occurrences per dataset. These suggest that those around A1BG are not random and are likely active or activatable.

No DRE (TACCGACAT) occurs in either direction of A1BG.

Testing on Google Scholar: A1BG "ABA responsive element" - did not match any articles. Testing: A1BG ABRE produced 6 articles but ABRE in the first was not Abscisic acid (ABA) response element (ABRE). One other contained the Spanish word abre. The only ABRE found in the promoters of A1BG is ACGTG(G/T)C (Watanabe) with four occurrences. Coupling element (CE1) is absent but CE3 is present as GCGTGTC (Watanabe) and CACGCG (Ding). The DREB box or (CRT/DREB box) are absent.

An NIH Gene database search of "Homo sapiens" and "Abscisic acid" or ABRE produced no genes found.

ABA-signaling pathway

"The key cis-elements in [non-yellow coloring 1] NYC1 promoter, namely, ABA-response element (ABRE) (ACGTG), ACGT, GCCcore (GCCGCC), and ethylene-inducible 3 [EIN3-Like1] (EIN3)/EIL1-binding sequence (T[TAG][GA]CGT[GA][TCA][TAG]), can be targeted by ABA insensitive 3 (ABI3), ABI5, and ABF2, 3, 4 in the ABA-signaling pathway [60,61]."[1]

Ethylene signaling pathway

"GCCGCC and EIN3/EIL1-binding sequence (T[TAG][GA]CGT[GA][TCA][TAG]) are induced by ethylene-inducible TF and EIN3/EIL1 in the ethylene signaling pathway [61]."[1]

Gene ID: 1543, CYP1A1 cytochrome P450 family 1 subfamily A member 1 [ Homo sapiens (human) ]. Location: 15q24.1. "This gene, CYP1A1, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme's endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. The gene has been associated with lung cancer risk. A related family member, CYP1A2, is located approximately 25 kb away from CYP1A1 on chromosome 15. Alternative splicing results in multiple transcript variants encoding distinct isoforms."[4] Traceable author statement: "involved in ethylene metabolic process"[5]

Gene ID: 1571, CYP2E1 cytochrome P450 family 2 subfamily E member 1 [ Homo sapiens (human) ]. Location: 10q26.3. "This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is induced by ethanol, the diabetic state, and starvation. The enzyme metabolizes both endogenous substrates, such as ethanol, acetone, and acetal, as well as exogenous substrates including benzene, carbon tetrachloride, ethylene glycol, and nitrosamines which are premutagens found in cigarette smoke. Due to its many substrates, this enzyme may be involved in such varied processes as gluconeogenesis, hepatic cirrhosis, diabetes, and cancer."[6] Metabolism of dichloroethylene by CYP450 WikiPathways WP3666.

Gene ID: 219681, ARMC3 armadillo repeat containing 3 [ Homo sapiens (human) ]. Location: 10p12.2. "Armadillo/beta-catenin (CTNNB1; MIM 116806)-like (ARM) domains are imperfect 45-amino acid repeats involved in protein-protein interactions. ARM domain-containing proteins, such as ARMC3, function in signal transduction, development, cell adhesion and mobility, and tumor initiation and metastasis (Li et al., 2006 [PubMed 16915934])."[7] Transcript Variant: This variant (2) uses an alternate in-frame splice site in the 3' coding region, compared to variant 1. This results in a shorter protein (isoform 2), compared to isoform 1. pfam14381 Location: 713 → 851: EDR1; Ethylene-responsive protein kinase Le-CTR1. Transcript Variant: This variant (4) lacks three exons in the 5' region and uses a downstream start codon compared to variant 1. It encodes isoform 4 which has a shorter N-terminus compared to isoform 1. pfam14381 Location: 457 → 595: EDR1; Ethylene-responsive protein kinase Le-CTR1. Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1). pfam14381 Location: 720 → 858: EDR1; Ethylene-responsive protein kinase Le-CTR1.

cl00033: AP2 Superfamily: "DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein). In EREBPs the domain specifically binds to the 11bp GCC box of the ethylene response element (ERE), a promotor element essential for ethylene responsiveness. EREBPs and the C-repeat binding factor CBF1, which is involved in stress response, contain a single copy of the AP2 domain. APETALA2-like proteins, which play a role in plant development contain two copies."[8]

cl04813: EIN3 Superfamily: "Ethylene insensitive 3 (EIN3) proteins are a family of plant DNA-binding proteins that regulate transcription in response to the gaseous plant hormone ethylene, and are essential for ethylene-mediated responses including the triple response, cell growth inhibition, and accelerated senescence."[9]

GATC samplings

For the Basic programs testing consensus sequence GATC (starting with SuccessablesGATC1.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. Negative strand, negative direction: 6, GATC at 3493, GATC at 3462, GATC at 3369, GATC at 2313, GATC at 2093, GATC at 1306.
  2. Positive strand, negative direction: 20, GATC at 4475, GATC at 4288, GATC at 4157, GATC at 4006, GATC at 3488, GATC at 3276, GATC at 3097, GATC at 2574, GATC at 2413, GATC at 2239, GATC at 1987, GATC at 1812, GATC at 1481, GATC at 1167, GATC at 972, GATC at 877, GATC at 703, GATC at 589, GATC at 525, GATC at 430.
  3. Negative strand, positive direction: 6, GATC at 4080, GATC at 2513, GATC at 2377, GATC at 2230, GATC at 964, GATC at 864.
  4. Positive strand, positive direction: 7, GATC at 4076, GATC at 4064, GATC at 3521, GATC at 3483, GATC at 2638, GATC at 2481, GATC at 2167.
  5. inverse complement, is the same GATC.

GATC (4560-2846) UTRs

  1. Negative strand, negative direction: GATC at 3493, GATC at 3462, GATC at 3369.
  2. Positive strand, negative direction: GATC at 4475, GATC at 4288, GATC at 4157, GATC at 4006, GATC at 3488, GATC at 3276, GATC at 3097.

GATC positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: GATC at 4080.
  2. Positive strand, positive direction: GATC at 4076, GATC at 4064.

GATC negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: GATC at 2313, GATC at 2093, GATC at 1306.
  2. Positive strand, negative direction: GATC at 2574, GATC at 2413, GATC at 2239, GATC at 1987, GATC at 1812, GATC at 1481, GATC at 1167, GATC at 972, GATC at 877, GATC at 703, GATC at 589, GATC at 525, GATC at 430.

GATC positive direction (4050-1) distal promoters

  1. Negative strand, positive direction: GATC at 2513, GATC at 2377, GATC at 2230, GATC at 964, GATC at 864.
  2. Positive strand, positive direction: GATC at 3521, GATC at 3483, GATC at 2638, GATC at 2481, GATC at 2167.

GATC random dataset samplings

  1. GATCr0: 14, GATC at 4322, GATC at 4271, GATC at 3571, GATC at 3140, GATC at 3084, GATC at 2761, GATC at 2667, GATC at 2249, GATC at 1949, GATC at 1852, GATC at 924, GATC at 817, GATC at 736, GATC at 94.
  2. GATCr1: 14, GATC at 4398, GATC at 4079, GATC at 3860, GATC at 2995, GATC at 2834, GATC at 2757, GATC at 2724, GATC at 2626, GATC at 2250, GATC at 2230, GATC at 1798, GATC at 1789, GATC at 1779, GATC at 1335.
  3. GATCr2: 9, GATC at 3761, GATC at 3436, GATC at 3336, GATC at 2360, GATC at 1749, GATC at 1418, GATC at 1192, GATC at 266, GATC at 157.
  4. GATCr3: 9, GATC at 4551, GATC at 4372, GATC at 3824, GATC at 2566, GATC at 2434, GATC at 1755, GATC at 1125, GATC at 968, GATC at 846.
  5. GATCr4: 15, GATC at 4379, GATC at 4310, GATC at 3922, GATC at 3294, GATC at 3132, GATC at 3092, GATC at 3047, GATC at 2972, GATC at 2808, GATC at 2798, GATC at 2690, GATC at 1669, GATC at 1494, GATC at 1173, GATC at 65.
  6. GATCr5: 15, GATC at 4518, GATC at 4463, GATC at 4288, GATC at 4119, GATC at 4030, GATC at 2884, GATC at 2829, GATC at 2428, GATC at 1812, GATC at 1782, GATC at 1746, GATC at 1705, GATC at 1119, GATC at 943, GATC at 528.
  7. GATCr6: 18, GATC at 4310, GATC at 4049, GATC at 4038, GATC at 3047, GATC at 3026, GATC at 2936, GATC at 2874, GATC at 2390, GATC at 2221, GATC at 2213, GATC at 1012, GATC at 879, GATC at 755, GATC at 751, GATC at 732, GATC at 701, GATC at 508, GATC at 166.
  8. GATCr7: 12, GATC at 4552, GATC at 3550, GATC at 3367, GATC at 2594, GATC at 2530, GATC at 2460, GATC at 1921, GATC at 1285, GATC at 1162, GATC at 1116, GATC at 1002, GATC at 501.
  9. GATCr8: 15, GATC at 4188, GATC at 3702, GATC at 3016, GATC at 2192, GATC at 1872, GATC at 1500, GATC at 1138, GATC at 1057, GATC at 974, GATC at 966, GATC at 864, GATC at 442, GATC at 337, GATC at 103, GATC at 26.
  10. GATCr9: 7, GATC at 3582, GATC at 3444, GATC at 2724, GATC at 2484, GATC at 2430, GATC at 2282, GATC at 2236.

GATCr arbitrary (evens) (4560-2846) UTRs

  1. GATCr0: GATC at 4322, GATC at 4271, GATC at 3571, GATC at 3140, GATC at 3084.
  2. GATCr2: GATC at 3761, GATC at 3436, GATC at 3336.
  3. GATCr4: GATC at 4379, GATC at 4310, GATC at 3922, GATC at 3294, GATC at 3132, GATC at 3092, GATC at 3047, GATC at 2972.
  4. GATCr6: GATC at 4310, GATC at 4049, GATC at 4038, GATC at 3047, GATC at 3026, GATC at 2936, GATC at 2874.
  5. GATCr8: GATC at 4188, GATC at 3702, GATC at 3016.

GATCr alternate (odds) (4560-2846) UTRs

  1. GATCr1: GATC at 4398, GATC at 4079, GATC at 3860, GATC at 2995.
  2. GATCr3: GATC at 4551, GATC at 4372, GATC at 3824.
  3. GATCr5: GATC at 4518, GATC at 4463, GATC at 4288, GATC at 4119, GATC at 4030, GATC at 2884.
  4. GATCr7: GATC at 4552, GATC at 3550, GATC at 3367.
  5. GATCr9: GATC at 3582, GATC at 3444.

GATCr alternate negative direction (odds) (2846-2811) core promoters

  1. GATCr1: GATC at 2834.
  2. GATCr5: GATC at 2829.

GATCr arbitrary positive direction (odds) (4445-4265) core promoters

  1. GATCr1: GATC at 4398.
  2. GATCr3: GATC at 4372.
  3. GATCr5: GATC at 4288.

GATCr alternate positive direction (evens) (4445-4265) core promoters

  1. GATCr0: GATC at 4322, GATC at 4271.
  2. GATCr4: GATC at 4379, GATC at 4310.
  3. GATCr6: GATC at 4310.

GATCr arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. GATCr0: GATC at 2761, GATC at 2667.
  2. GATCr4: GATC at 2808, GATC at 2798, GATC at 2690.

GATCr alternate negative direction (odds) (2811-2596) proximal promoters

  1. GATCr1: GATC at 2757, GATC at 2724, GATC at 2626.
  2. GATCr9: GATC at 2724.

GATCr arbitrary positive direction (odds) (4265-4050) proximal promoters

  1. GATCr1: GATC at 4079.
  2. GATCr5: GATC at 4119.

GATCr alternate positive direction (evens) (4265-4050) proximal promoters

  1. GATCr8: GATC at 4188.

GATCr arbitrary negative direction (evens) (2596-1) distal promoters

  1. GATCr0: GATC at 2249, GATC at 1949, GATC at 1852, GATC at 924, GATC at 817, GATC at 736, GATC at 94.
  2. GATCr2: GATC at 2360, GATC at 1749, GATC at 1418, GATC at 1192, GATC at 266, GATC at 157.
  3. GATCr4: GATC at 1669, GATC at 1494, GATC at 1173, GATC at 65.
  4. GATCr6: GATC at 2390, GATC at 2221, GATC at 2213, GATC at 1012, GATC at 879, GATC at 755, GATC at 751, GATC at 732, GATC at 701, GATC at 508, GATC at 166.
  5. GATCr8: GATC at 2192, GATC at 1872, GATC at 1500, GATC at 1138, GATC at 1057, GATC at 974, GATC at 966, GATC at 864, GATC at 442, GATC at 337, GATC at 103, GATC at 26.

GATCr alternate negative direction (odds) (2596-1) distal promoters

  1. GATCr1: GATC at 2250, GATC at 2230, GATC at 1798, GATC at 1789, GATC at 1779, GATC at 1335.
  2. GATCr3: GATC at 2566, GATC at 2434, GATC at 1755, GATC at 1125, GATC at 968, GATC at 846.
  3. GATCr5: GATC at 2428, GATC at 1812, GATC at 1782, GATC at 1746, GATC at 1705, GATC at 1119, GATC at 943, GATC at 528.
  4. GATCr7: GATC at 2594, GATC at 2530, GATC at 2460, GATC at 1921, GATC at 1285, GATC at 1162, GATC at 1116, GATC at 1002, GATC at 501.
  5. GATCr9: GATC at 2484, GATC at 2430, GATC at 2282, GATC at 2236.

GATCr arbitrary positive direction (odds) (4050-1) distal promoters

  1. GATCr1: GATC at 3860, GATC at 2995, GATC at 2834, GATC at 2757, GATC at 2724, GATC at 2626, GATC at 2250, GATC at 2230, GATC at 1798, GATC at 1789, GATC at 1779, GATC at 1335.
  2. GATCr3: GATC at 3824, GATC at 2566, GATC at 2434, GATC at 1755, GATC at 1125, GATC at 968, GATC at 846.
  3. GATCr5: GATC at 4030, GATC at 2884, GATC at 2829, GATC at 2428, GATC at 1812, GATC at 1782, GATC at 1746, GATC at 1705, GATC at 1119, GATC at 943, GATC at 528.
  4. GATCr7: GATC at 3550, GATC at 3367, GATC at 2594, GATC at 2530, GATC at 2460, GATC at 1921, GATC at 1285, GATC at 1162, GATC at 1116, GATC at 1002, GATC at 501.
  5. GATCr9: GATC at 3582, GATC at 3444, GATC at 2724, GATC at 2484, GATC at 2430, GATC at 2282, GATC at 2236.

GATCr alternate positive direction (evens) (4050-1) distal promoters

  1. GATCr0: GATC at 3571, GATC at 3140, GATC at 3084, GATC at 2761, GATC at 2667, GATC at 2249, GATC at 1949, GATC at 1852, GATC at 924, GATC at 817, GATC at 736, GATC at 94.
  2. GATCr2: GATC at 3761, GATC at 3436, GATC at 3336, GATC at 2360, GATC at 1749, GATC at 1418, GATC at 1192, GATC at 266, GATC at 157.
  3. GATCr4: GATC at 3922, GATC at 3294, GATC at 3132, GATC at 3092, GATC at 3047, GATC at 2972, GATC at 2808, GATC at 2798, GATC at 2690, GATC at 1669, GATC at 1494, GATC at 1173, GATC at 65.
  4. GATCr6: GATC at 4049, GATC at 4038, GATC at 3047, GATC at 3026, GATC at 2936, GATC at 2874, GATC at 2390, GATC at 2221, GATC at 2213, GATC at 1012, GATC at 879, GATC at 755, GATC at 751, GATC at 732, GATC at 701, GATC at 508, GATC at 166.
  5. GATCr8: GATC at 3702, GATC at 3016, GATC at 2192, GATC at 1872, GATC at 1500, GATC at 1138, GATC at 1057, GATC at 974, GATC at 966, GATC at 864, GATC at 442, GATC at 337, GATC at 103, GATC at 26.

GATC analysis and results

"The [novel ABRE] ABREN, [GATCGATC], had the highest average score and highest average sites in the 1 kb upstream of the start codon of the highly ABA‐induced genes in aleurone."[3]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 10 2 5 5 ± 2 (--3,+-7)
Randoms UTR arbitrary negative 26 10 2.6 2.2 ± 0.4
Randoms UTR alternate negative 18 10 1.8 2.2 ± 0.4
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0.1
Randoms Core alternate negative 2 10 0.2 0.1
Reals Core positive 0 2 0 0
Randoms Core arbitrary positive 3 10 0.3 0.4
Randoms Core alternate positive 5 10 0.5 0.4
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 5 10 0.5 0.45
Randoms Proximal alternate negative 4 10 0.4 0.45
Reals Proximal positive 2 2 1 1 ± 0 (-+1,++1)
Randoms Proximal arbitrary positive 2 10 0.2 0.15
Randoms Proximal alternate positive 1 10 0.1 0.15
Reals Distal negative 16 2 8 8 ± 5 (--3, +-13)
Randoms Distal arbitrary negative 40 10 4.0 3.65
Randoms Distal alternate negative 33 10 3.3 3.65
Reals Distal positive 10 2 5 5 ± 0 (-+5,++5)
Randoms Distal arbitrary positive 48 10 4.8 5.65 ± 0.85
Randoms Distal alternate positive 65 10 6.5 5.65 ± 0.85

Comparison:

The occurrences of real GATC UTRs, proximals and positive strand, negative direction distals are greater than the randoms, but the negative strand, negative direction distals and positive direction distals are likely randoms. This suggests that the real GATCs are likely active or activable.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

Initial content for this page in some instances came from Wikipedia.

Initial content for this page in some instances incorporates text from the United States National Library of Medicine.

See also

References

  1. 1.0 1.1 1.2 Muhammad Asad Ullah Asad, Shamsu Ado Zakari, Qian Zhao, Lujian Zhou, Yu Ye and Fangmin Cheng (10 January 2019). "Abiotic Stresses Intervene with ABA Signaling to Induce Destructive Metabolic Pathways Leading to Death: Premature Leaf Senescence in Plants". International Journal of Molecular Sciences. 20 (2): 256–278. doi:10.3390/ijms20020256. PMID 30634648. Retrieved 27 August 2020.
  2. Takuya Matsumoto, Saemi Kitajima, Chisato Yamamoto, Mitsuru Aoyagi, Yoshiharu Mitoma, Hiroyuki Harada and Yuji Nagashima (9 August 2020). "Cloning and tissue distribution of the ATP-binding cassette subfamily G member 2 gene in the marine pufferfish Takifugu rubripes" (PDF). Fisheries Science. 86: 873–887. doi:10.1007/s12562-020-01451-z. Retrieved 27 September 2020.
  3. 3.0 3.1 3.2 3.3 3.4 3.5 3.6 Kenneth A. Watanabe, Arielle Homayouni, Lingkun Gu, Kuan‐Ying Huang, Tuan‐Hua David Ho, Qingxi J. Shen (18 June 2017). "Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element". Plant, Cell & Environment. 40 (9): 2004–2016. doi:10.1111/pce.13006. Retrieved 5 October 2020.
  4. RefSeq (January 2016). CYP1A1 cytochrome P450 family 1 subfamily A member 1 [ Homo sapiens (human) ]. 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 3 May 2023.
  5. TAS (January 2016). CYP1A1 cytochrome P450 family 1 subfamily A member 1 [ Homo sapiens (human) ]. 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 3 May 2023.
  6. RefSeq (July 2008). CYP2E1 cytochrome P450 family 2 subfamily E member 1 [ Homo sapiens (human) ]. 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 3 May 2023.
  7. OMIM (March 2008). ARMC3 armadillo repeat containing 3 [ Homo sapiens (human) ]. 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 3 May 2023.
  8. NCBI (8 March 2022). Conserved Protein Domain Family AP2. 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 3 May 2023.
  9. NCBI (24 November 2020). Conserved Protein Domain Family EIN3. 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 3 May 2023.

External links