Estrogen response element gene transcriptions: Difference between revisions

Jump to navigation Jump to search
Marshallsumter (talk | contribs)
Marshallsumter (talk | contribs)
 
(One intermediate revision by the same user not shown)
Line 229: Line 229:
! Reals or randoms !! Promoters !! direction !! Numbers !! Strands !! Occurrences !! Averages (± 0.1)  
! Reals or randoms !! Promoters !! direction !! Numbers !! Strands !! Occurrences !! Averages (± 0.1)  
|-
|-
| Reals || UTR || negative || 2 || 2 || 1 || 1 ± (+-1,--1)  
| Reals || UTR || negative || 2 || 2 || 1 || 1 ± 0 (+-1,--1)  
|-
|-
| Randoms || UTR || arbitrary negative || 7 || 10 || 0.7 || 0.9 ± 0.2  
| Randoms || UTR || arbitrary negative || 7 || 10 || 0.7 || 0.9 ± 0.2  
Line 241: Line 241:
| Randoms || Core || alternate negative || 0 || 10 || 0 || 0  
| Randoms || Core || alternate negative || 0 || 10 || 0 || 0  
|-
|-
| Reals || Core || positive || 1 || 2 || 0.5 || 0.5 ± (-+1,++0)  
| Reals || Core || positive || 1 || 2 || 0.5 || 0.5 ± 0.5 (-+1,++0)  
|-
|-
| Randoms || Core || arbitrary positive || 2 || 10 || 0.2 || 0.15
| Randoms || Core || arbitrary positive || 2 || 10 || 0.2 || 0.15
Line 274: Line 274:
Comparison:
Comparison:


The occurrences of real ERE1 UTRs overlap at the high end, cores, proximals and distals are greater than the randoms. This suggests that the real ERE1s are likely active or activable.
The occurrences of real ERE1 UTRs are within the randoms, cores, proximals and distals are greater than the randoms. This suggests that the real ERE1s are likely active or activable.


==ERE2 (Driscoll) samplings==
==ERE2 (Driscoll) samplings==
Line 389: Line 389:
! Reals or randoms !! Promoters !! direction !! Numbers !! Strands !! Occurrences !! Averages (± 0.1)  
! Reals or randoms !! Promoters !! direction !! Numbers !! Strands !! Occurrences !! Averages (± 0.1)  
|-
|-
| Reals || UTR || negative || 1 || 2 || 0.5 || 0.5  
| Reals || UTR || negative || 1 || 2 || 0.5 || 0.5 ± 0.5 (--1,+-0)
|-
|-
| Randoms || UTR || arbitrary negative || 7 || 10 || 0.7 || 0.85 ± 0.15  
| Randoms || UTR || arbitrary negative || 7 || 10 || 0.7 || 0.85 ± 0.15  
Line 413: Line 413:
| Randoms || Proximal || alternate negative || 0 || 10 || 0 || 0.05  
| Randoms || Proximal || alternate negative || 0 || 10 || 0 || 0.05  
|-
|-
| Reals || Proximal || positive || 1 || 2 || 0.5 || 0.5  
| Reals || Proximal || positive || 1 || 2 || 0.5 || 0.5 ± 0.5 (-+1,++0)
|-
|-
| Randoms || Proximal || arbitrary positive || 1 || 10 || 0.1 || 0.05  
| Randoms || Proximal || arbitrary positive || 1 || 10 || 0.1 || 0.05  
Line 434: Line 434:
Comparison:
Comparison:


The occurrences of real ERE2 UTRs are less than the randoms, cores, proximals and distals are greater than the randoms. This suggests that the real ERE2s are likely active or activable.
The occurrences of real ERE2 UTRs overlap the randoms, cores, proximals and distals are greater than the randoms. This suggests that the real ERE2s are likely active or activable.


==See also==
==See also==

Latest revision as of 02:58, 29 August 2023

Associate Editor(s)-in-Chief: Henry A. Hoff

Human genes

Consensus sequences

"The [...] estrogen response element (ERE) [was] located on [...] −94 to −80 "AGGTTATTGCCTCCT" on the transcript, respectively."[1]

Note: GGTTA is the complement of CCAAT. AGGTTATTGCCTCCT is the complement of TCCAATAACGGAGGA.

"Our in silico analysis revealed a putative ERE sequence (GGTCAggaTGACA; wherein the non-consensus nucleotide A is underlined) starting at position −242 from the translation initiation codon (ATG, A being +1; [...])."[2]

"The estrogen receptor (ER)1 is a transcription factor that regulates the expression of estrogen-responsive genes by binding to a specific DNA sequence found in their regulatory regions. This sequence is called the estrogen response element (ERE). [...] The essential ERE was determined to have the consensus sequence 5′-GGTCAnnnTGACC-3′ (4, 7)."[3]

The "sequence requirements for a “minimal” consensus ERE [...] for ER binding encompasses a wide variety of possible combinations, which display stable ER binding. The optimal binding sequence is 5′-C(A/G)GGTCAnnnTGACC(T/C)G-3′. Any change from this optimal consensus diminishes ER-ERE affinity. The underlined core consensus sequence represents the minimum length for binding activity."[3]

ERE (Matsumoto) samplings

Copying an abridged consensus sequence for the estrogen response element of AGGTTA (ci=TAACCT) and putting it in "⌘F" finds none located between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence AGGTTA (starting with SuccessablesEREM.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction: 0.
  2. positive strand, negative direction: 0.
  3. negative strand, positive direction: 0.
  4. positive strand, positive direction: 0.
  5. inverse complement, negative strand, negative direction: 0.
  6. inverse complement, positive strand, negative direction: 0.
  7. inverse complement, negative strand, positive direction: 0.
  8. inverse complement, positive strand, positive direction: 0.

For the Basic programs testing consensus sequence 5'-AGGTTATTGCCTCCT-3' (starting with SuccessablesERE.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction, looking for 5'-AGGTTATTGCCTCCT-3', 0.
  2. negative strand, positive direction, looking for 5'-AGGTTATTGCCTCCT-3', 0.
  3. positive strand, negative direction, looking for 5'-AGGTTATTGCCTCCT-3', 0.
  4. positive strand, positive direction, looking for 5'-AGGTTATTGCCTCCT-3', 0.
  5. complement, negative strand, negative direction, looking for 5'-TCCAATAACGGAGGA-3', 0.
  6. complement, negative strand, positive direction, looking for 5'-TCCAATAACGGAGGA-3', 0.
  7. complement, positive strand, negative direction, looking for 5'-TCCAATAACGGAGGA-3', 0.
  8. complement, positive strand, positive direction, looking for 5'-TCCAATAACGGAGGA-3', 0.
  9. inverse complement, negative strand, negative direction, looking for 5'-AGGAGGCAATAACCT-3', 0.
  10. inverse complement, negative strand, positive direction, looking for 5'-AGGAGGCAATAACCT-3', 0.
  11. inverse complement, positive strand, negative direction, looking for 5'-AGGAGGCAATAACCT-3', 0.
  12. inverse complement, positive strand, positive direction, looking for 5'-AGGAGGCAATAACCT-3', 0.
  13. inverse negative strand, negative direction, looking for 5'-TCCTCCGTTATTGGA-3', 0.
  14. inverse negative strand, positive direction, looking for 5'-TCCTCCGTTATTGGA-3', 0.
  15. inverse positive strand, negative direction, looking for 5'-TCCTCCGTTATTGGA-3', 0.
  16. inverse positive strand, positive direction, looking for 5'-TCCTCCGTTATTGGA-3', 0.

ERE (Yasar) samplings

Copying an abridged consensus sequence for the estrogen response element of GGTCAGGAT and putting it in "⌘F" finds none located between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence 5'-GGTCAGGATGAC-3' (starting with SuccessablesERE2.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction, looking for 5'-GGTCAGGATGAC-3', 0.
  2. negative strand, positive direction, looking for 5'-GGTCAGGATGAC-3', 0.
  3. positive strand, negative direction, looking for 5'-GGTCAGGATGAC-3', 0.
  4. positive strand, positive direction, looking for 5'-GGTCAGGATGAC-3', 0.
  5. complement, negative strand, negative direction, looking for 5'-CCAGTCCTACTG-3', 0.
  6. complement, negative strand, positive direction, looking for 5'-CCAGTCCTACTG-3', 0.
  7. complement, positive strand, negative direction, looking for 5'-CCAGTCCTACTG-3', 0.
  8. complement, positive strand, positive direction, looking for 5'-CCAGTCCTACTG-3', 0.
  9. inverse complement, negative strand, negative direction, looking for 5'-GTCATCCTGACC-3', 0.
  10. inverse complement, negative strand, positive direction, looking for 5'-GTCATCCTGACC-3', 0.
  11. inverse complement, positive strand, negative direction, looking for 5'-GTCATCCTGACC-3', 0.
  12. inverse complement, positive strand, positive direction, looking for 5'-GTCATCCTGACC-3', 0.
  13. inverse negative strand, negative direction, looking for 5'-CAGTAGGACTGG-3', 0.
  14. inverse negative strand, positive direction, looking for 5'-CAGTAGGACTGG-3', 0.
  15. inverse positive strand, negative direction, looking for 5'-CAGTAGGACTGG-3', 0.
  16. inverse positive strand, positive direction, looking for 5'-CAGTAGGACTGG-3', 0.

ERE1 (Driscoll) samplings

Copying a responsive elements consensus sequence GGTCA and putting the sequence in "⌘F" finds five between ZNF497 and A1BG or six between ZSCAN22 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence GGTCA (starting with SuccessablesERE1.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction: 6, GGTCA at 4415, GGTCA at 2654, GGTCA at 2601, GGTCA at 2211, GGTCA at 1532, GGTCA at 1352.
  2. positive strand, negative direction: 8, GGTCA at 4307, GGTCA at 2585, GGTCA at 2248, GGTCA at 712, GGTCA at 576, GGTCA at 568, GGTCA at 439, GGTCA at 206.
  3. negative strand, positive direction: 5, GGTCA at 4269, GGTCA at 3820, GGTCA at 2220, GGTCA at 2035, GGTCA at 153.
  4. positive strand, positive direction: 7, GGTCA at 3841, GGTCA at 3379, GGTCA at 3082, GGTCA at 2996, GGTCA at 2605, GGTCA at 2024, GGTCA at 1250.

ERE1 (4560-2846) UTRs

  1. Negative strand, negative direction: GGTCA at 4415.
  2. Positive strand, negative direction: GGTCA at 4307.

ERE1 positive direction (4445-4265) core promoters

  1. Negative strand, positive direction: GGTCA at 4269.

ERE1 negative direction (2811-2596) proximal promoters

  1. Negative strand, negative direction: GGTCA at 2654, GGTCA at 2601.

ERE1 negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: GGTCA at 2211, GGTCA at 1532, GGTCA at 1352.
  2. Positive strand, negative direction: GGTCA at 2585, GGTCA at 2248, GGTCA at 712, GGTCA at 576, GGTCA at 568, GGTCA at 439, GGTCA at 206.

ERE1 positive direction (4050-1) distal promoters

  1. Negative strand, positive direction: GGTCA at 3820, GGTCA at 2220, GGTCA at 2035, GGTCA at 153.
  2. Positive strand, positive direction: GGTCA at 3841, GGTCA at 3379, GGTCA at 3082, GGTCA at 2996, GGTCA at 2605, GGTCA at 2024, GGTCA at 1250.

ERE1 (Driscoll) random dataset samplings

  1. ERE1r0: 4, GGTCA at 3832, GGTCA at 1984, GGTCA at 845, GGTCA at 699.
  2. ERE1r1: 6, GGTCA at 4483, GGTCA at 3425, GGTCA at 2777, GGTCA at 2237, GGTCA at 1750, GGTCA at 1572.
  3. ERE1r2: 7, GGTCA at 4381, GGTCA at 3905, GGTCA at 3163, GGTCA at 2185, GGTCA at 467, GGTCA at 141, GGTCA at 23.
  4. ERE1r3: 9, GGTCA at 3843, GGTCA at 3505, GGTCA at 2854, GGTCA at 2303, GGTCA at 2238, GGTCA at 1631, GGTCA at 1051, GGTCA at 747, GGTCA at 600.
  5. ERE1r4: 1, GGTCA at 3806.
  6. ERE1r5: 3, GGTCA at 4296, GGTCA at 4051, GGTCA at 2696.
  7. ERE1r6: 3, GGTCA at 2736, GGTCA at 1832, GGTCA at 60.
  8. ERE1r7: 4, GGTCA at 4457, GGTCA at 3894, GGTCA at 972, GGTCA at 634.
  9. ERE1r8: 3, GGTCA at 3493, GGTCA at 3433, GGTCA at 2694.
  10. ERE1r9: 3, GGTCA at 4379, GGTCA at 3055, GGTCA at 2056.

ERE1r arbitrary (evens) (4560-2846) UTRs

  1. ERE1r0: GGTCA at 3832.
  2. ERE1r2: GGTCA at 4381, GGTCA at 3905, GGTCA at 3163.
  3. ERE1r4: GGTCA at 3806.
  4. ERE1r8: GGTCA at 3493, GGTCA at 3433.

ERE1r alternate (odds) (4560-2846) UTRs

  1. ERE1r1: GGTCA at 4483, GGTCA at 3425.
  2. ERE1r3: GGTCA at 3843, GGTCA at 3505, GGTCA at 2854.
  3. ERE1r5: GGTCA at 4296, GGTCA at 4051.
  4. ERE1r7: GGTCA at 4457, GGTCA at 3894.
  5. ERE1r9: GGTCA at 4379, GGTCA at 3055.

ERE1r arbitrary positive direction (odds) (4445-4265) core promoters

  1. ERE1r5: GGTCA at 4296.
  2. ERE1r9: GGTCA at 4379.

ERE1r alternate positive direction (evens) (4445-4265) core promoters

  1. ERE1r2: GGTCA at 4381.

ERR1r arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. ERE1r6: GGTCA at 2736.
  2. ERE1r8: GGTCA at 2694.

ERE1r alternate negative direction (odds) (2811-2596) proximal promoters

  1. ERE1r1: GGTCA at 2777.
  2. ERE1r5: GGTCA at 2696.

ERE1r arbitrary positive direction (odds) (4265-4050) proximal promoters

  1. ERE1r5: GGTCA at 4051.

ERE1r arbitrary negative direction (evens) (2596-1) distal promoters

  1. ERE1r0: GGTCA at 1984, GGTCA at 845, GGTCA at 699.
  2. ERE1r2: GGTCA at 2185, GGTCA at 467, GGTCA at 141, GGTCA at 23.
  3. ERE1r6: GGTCA at 1832, GGTCA at 60.

ERE1r alternate negative direction (odds) (2596-1) distal promoters

  1. ERE1r1: GGTCA at 2237, GGTCA at 1750, GGTCA at 1572.
  2. ERE1r3: GGTCA at 2303, GGTCA at 2238, GGTCA at 1631, GGTCA at 1051, GGTCA at 747, GGTCA at 600.
  3. ERE1r7: GGTCA at 972, GGTCA at 634.
  4. ERE1r9: GGTCA at 2056.

ERE1r arbitrary positive direction (odds) (4050-1) distal promoters

  1. ERE1r1: GGTCA at 3425, GGTCA at 2777, GGTCA at 2237, GGTCA at 1750, GGTCA at 1572.
  2. ERE1r3: GGTCA at 3843, GGTCA at 3505, GGTCA at 2854, GGTCA at 2303, GGTCA at 2238, GGTCA at 1631, GGTCA at 1051, GGTCA at 747, GGTCA at 600.
  3. ERE1r5: GGTCA at 2696.
  4. ERE1r7: GGTCA at 3894, GGTCA at 972, GGTCA at 634.
  5. ERE1r9: GGTCA at 3055, GGTCA at 2056.

ERE1r alternate positive direction (evens) (4050-1) distal promoters

  1. ERE1r0: GGTCA at 3832, GGTCA at 1984, GGTCA at 845, GGTCA at 699.
  2. ERE1r2: GGTCA at 3905, GGTCA at 3163, GGTCA at 2185, GGTCA at 467, GGTCA at 141, GGTCA at 23.
  3. ERE1r4: GGTCA at 3806.
  4. ERE1r6: GGTCA at 2736, GGTCA at 1832, GGTCA at 60.
  5. ERE1r8: GGTCA at 3493, GGTCA at 3433, GGTCA at 2694.

ERE1 (Driscoll) analysis and results

"The underlined core consensus sequence [GGTCA] represents the minimum length for binding activity."[3]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 2 2 1 1 ± 0 (+-1,--1)
Randoms UTR arbitrary negative 7 10 0.7 0.9 ± 0.2
Randoms UTR alternate negative 11 10 1.1 0.9 ± 0.2
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 1 2 0.5 0.5 ± 0.5 (-+1,++0)
Randoms Core arbitrary positive 2 10 0.2 0.15
Randoms Core alternate positive 1 10 0.1 0.15
Reals Proximal negative 2 2 1 1 ± 1 (--2,+-0)
Randoms Proximal arbitrary negative 2 10 0.2 0.2
Randoms Proximal alternate negative 2 10 0.2 0.2
Reals Proximal positive 0 2 0 0
Randoms Proximal arbitrary positive 1 10 0.1 0.05
Randoms Proximal alternate positive 0 10 0 0.05
Reals Distal negative 10 2 5 5 ± 2 (--3,+-7)
Randoms Distal arbitrary negative 9 10 0.9 1.05
Randoms Distal alternate negative 12 10 1.2 1.05
Reals Distal positive 11 2 5.5 5.5 ± 1.5 (-+4,++7)
Randoms Distal arbitrary positive 20 10 2.0 1.85 ± 0.15
Randoms Distal alternate positive 17 10 1.7 1.85 ± 0.15

Comparison:

The occurrences of real ERE1 UTRs are within the randoms, cores, proximals and distals are greater than the randoms. This suggests that the real ERE1s are likely active or activable.

ERE2 (Driscoll) samplings

Copying a responsive elements consensus sequence TGACC and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence TGACC (starting with SuccessablesERE2.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. inverse complement, negative strand, negative direction: 1, TGACC at 3749.
  2. inverse complement, positive strand, negative direction: 2, TGACC at 2189, TGACC at 734.
  3. inverse complement, negative strand, positive direction: 7, TGACC at 4216, TGACC at 4018, TGACC at 3784, TGACC at 3714, TGACC at 2873, TGACC at 2213, TGACC at 1662.
  4. inverse complement, positive strand, positive direction: 5, TGACC at 3345, TGACC at 3117, TGACC at 1953, TGACC at 1140, TGACC at 347.

ERE2 (4560-2846) UTRs

  1. Negative strand, negative direction: TGACC at 3749.

ERE2 positive direction (4445-4265) core promoters

  1. Positive strand, positive direction: TGACC at 3345, TGACC at 3117.

ERE2 positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: TGACC at 4216.

ERE2 negative direction (2596-1) distal promoters

  1. Positive strand, negative direction: TGACC at 2189, TGACC at 734.

ERE2 positive direction (4050-1) distal promoters

  1. Negative strand, positive direction: TGACC at 4018, TGACC at 3784, TGACC at 3714, TGACC at 2873, TGACC at 2213, TGACC at 1662.
  2. Positive strand, positive direction: TGACC at 3345, TGACC at 3117, TGACC at 1953, TGACC at 1140, TGACC at 347.

ERE2 (Driscoll) random dataset samplings

  1. ERE2r0ci: 5, TGACC at 4380, TGACC at 2730, TGACC at 1834, TGACC at 1659, TGACC at 727.
  2. ERE2r1ci: 6, TGACC at 3635, TGACC at 3358, TGACC at 3323, TGACC at 3051, TGACC at 1160, TGACC at 154.
  3. ERE2r2ci: 4, TGACC at 3755, TGACC at 3580, TGACC at 3542, TGACC at 853.
  4. ERE2r3ci: 3, TGACC at 4438, TGACC at 3102, TGACC at 2442.
  5. ERE2r4ci: 3, TGACC at 2892, TGACC at 2131, TGACC at 344.
  6. ERE2r5ci: 2, TGACC at 2537, TGACC at 505.
  7. ERE2r6ci: 5, TGACC at 3520, TGACC at 3192, TGACC at 2398, TGACC at 1623, TGACC at 1345.
  8. ERE2r7ci: 3, TGACC at 3800, TGACC at 3140, TGACC at 898.
  9. ERE2r8ci: 4, TGACC at 1241, TGACC at 776, TGACC at 539, TGACC at 509.
  10. ERE2r9ci: 2, TGACC at 4209, TGACC at 3680.

ERE2r arbitrary (evens) (4560-2846) UTRs

  1. ERE2r0ci: TGACC at 4380.
  2. ERE2r2ci: TGACC at 3755, TGACC at 3580, TGACC at 3542.
  3. ERE2r4ci: TGACC at 2892.
  4. ERE2r6ci: TGACC at 3520, TGACC at 3192.

ERE2r alternate (odds) (4560-2846) UTRs

  1. ERE2r1ci: TGACC at 3635, TGACC at 3358, TGACC at 3323, TGACC at 3051.
  2. ERE2r3ci: TGACC at 4438, TGACC at 3102.
  3. ERE2r7ci: TGACC at 3800, TGACC at 3140.
  4. ERE2r9ci: TGACC at 4209, TGACC at 3680.

ERE2r arbitrary positive direction (odds) (4445-4265) core promoters

  1. ERE2r3ci: TGACC at 4438.

ERE2r alternate positive direction (evens) (4445-4265) core promoters

  1. ERE2r0ci: TGACC at 4380.

ERE2r arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. ERE2r0ci: TGACC at 2730.

ERE2r arbitrary positive direction (odds) (4265-4050) proximal promoters

  1. ERE2r9ci: TGACC at 4209.

ERE2r arbitrary negative direction (evens) (2596-1) distal promoters

  1. ERE2r0ci: TGACC at 1834, TGACC at 1659, TGACC at 727.
  2. ERE2r2ci: TGACC at 853.
  3. ERE2r4ci: TGACC at 2131, TGACC at 344.
  4. ERE2r6ci: TGACC at 2398, TGACC at 1623, TGACC at 1345.
  5. ERE2r8ci: TGACC at 1241, TGACC at 776, TGACC at 539, TGACC at 509.

ERE2r alternate negative direction (odds) (2596-1) distal promoters

  1. ERE2r1ci: TGACC at 1160, TGACC at 154.
  2. ERE2r3ci: TGACC at 2442.
  3. ERE2r5ci: TGACC at 2537, TGACC at 505.
  4. ERE2r7ci: TGACC at 898.

ERE2r arbitrary positive direction (odds) (4050-1) distal promoters

  1. ERE2r1ci: TGACC at 3635, TGACC at 3358, TGACC at 3323, TGACC at 3051, TGACC at 1160, TGACC at 154.
  2. ERE2r3ci: TGACC at 3102, TGACC at 2442.
  3. ERE2r5ci: TGACC at 2537, TGACC at 505.
  4. ERE2r7ci: TGACC at 3800, TGACC at 3140, TGACC at 898.
  5. ERE2r9ci: TGACC at 3680.

ERE2r alternate positive direction (evens) (4050-1) distal promoters

  1. ERE2r0ci: TGACC at 2730, TGACC at 1834, TGACC at 1659, TGACC at 727.
  2. ERE2r2ci: TGACC at 3755, TGACC at 3580, TGACC at 3542, TGACC at 853.
  3. ERE2r4ci: TGACC at 2892, TGACC at 2131, TGACC at 344.
  4. ERE2r6ci: TGACC at 3520, TGACC at 3192, TGACC at 2398, TGACC at 1623, TGACC at 1345.
  5. ERE2r8ci: TGACC at 1241, TGACC at 776, TGACC at 539, TGACC at 509.

ERE2 (Driscoll) analysis and results

"The underlined core consensus sequence [TGACC] represents the minimum length for binding activity."[3]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 1 2 0.5 0.5 ± 0.5 (--1,+-0)
Randoms UTR arbitrary negative 7 10 0.7 0.85 ± 0.15
Randoms UTR alternate negative 10 10 1.0 0.85 ± 0.15
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 2 2 1 1 ± 1 (-+0,++2)
Randoms Core arbitrary positive 1 10 0.1 0.1
Randoms Core alternate positive 1 10 0.1 0.1
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 1 10 0.1 0.05
Randoms Proximal alternate negative 0 10 0 0.05
Reals Proximal positive 1 2 0.5 0.5 ± 0.5 (-+1,++0)
Randoms Proximal arbitrary positive 1 10 0.1 0.05
Randoms Proximal alternate positive 0 10 0 0.05
Reals Distal negative 2 2 1 1 ± 1 (--0,+-2)
Randoms Distal arbitrary negative 13 10 1.3 0.95 ± 0.35
Randoms Distal alternate negative 6 10 0.6 0.95 ± 0.35
Reals Distal positive 11 2 5.5 5.5 ± 0.5 (-+6,++5)
Randoms Distal arbitrary positive 14 10 1.4 1.7 ± 0.3
Randoms Distal alternate positive 20 10 2.0 1.7 ± 0.3

Comparison:

The occurrences of real ERE2 UTRs overlap the randoms, cores, proximals and distals are greater than the randoms. This suggests that the real ERE2s are likely active or activable.

See also

References

  1. Takuya Matsumoto, Saemi Kitajima, Chisato Yamamoto, Mitsuru Aoyagi, Yoshiharu Mitoma, Hiroyuki Harada and Yuji Nagashima (9 August 2020). "Cloning and tissue distribution of the ATP-binding cassette subfamily G member 2 gene in the marine pufferfish Takifugu rubripes" (PDF). Fisheries Science. 86: 873–887. doi:10.1007/s12562-020-01451-z. Retrieved 27 September 2020.
  2. Pelin Yaşar, Gamze Ayaz & Mesut Muyan (25 November 2016). "Estradiol-Estrogen Receptor α Mediates the Expression of the CXXC5 Gene through the Estrogen Response Element-Dependent Signaling Pathway". Scientific Reports. 6: 37808. doi:10.1038/srep37808. Retrieved 6 October 2020.
  3. 3.0 3.1 3.2 3.3 Mark D. Driscoll, G. Sathya, Mesut Muyan, Carolyn M. Klinge, Russell Hilf and Robert A. Bambara (November 1998). "Sequence Requirements for Estrogen Receptor Binding to Estrogen Response Elements". Journal of Biological Chemistry Nucleic Acids, Protein Synthesis, and Molecular Genetics. 273 (45): 29321–29330. doi:10.1074/jbc.273.45.29321. Retrieved 8 March 2023.

External links