Downstream promoter element gene transcriptions


Editor-In-Chief: Henry A. Hoff

The diagram is an overview of four core promoter elements. Credit: Jennifer E.F. Butler & James T. Kadonaga.

The figure on the right is an overview of four core promoter elements: the B recognition element (BRE), TATA box, initiator element (Inr), and downstream promoter element (DPE), showing their respective consensus sequences and their distance from the transcription start site.[1]

The downstream promoter element (DPE) is a core promoter element present in various species, including humans, but not in Saccharomyces cerevisiae.[2]

Gene transcriptions

"Transcription by RNA polymerase II is directed by cis-acting [acting on the same DNA molecule] DNA sequences that typically consist of a core promoter along with regulatory elements, such as enhancers [DNA elements that can act at a distance from the promoter], that contain binding sites for sequence-specific transcriptional activator and/or repressor proteins."[3]

Core promoters

"[T]he core promoter [consists of] the DNA sequences, which encompass the transcription start site (within about -40 and +40 [nucleotides] relative to the +1 start site)"[3].

"[T]he core sequence of the DPE is located at precisely +28 to +32 relative to the A+1 nucleotide in the Inr"[4]. The DPE is also described as lying about 28–33 nucleotides downstream of the transcription start site.[2]

DPE-dependent basal transcription depends highly on the Inr (and vice versa) and on correct spacing between the two elements.[5][3][6]
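The placement of the DPE relative to the A+1 nucleotide of the Inr can be illustrated with a short Python sketch. It is not taken from any of the cited sources: the function name downstream_window, the placeholder sequences, and the use of N for unspecified nucleotides are all assumptions made for illustration. The only rule it encodes is that promoter numbering has no position 0, so position +n lies n - 1 nucleotides after +1.

```python
def downstream_window(sequence, plus_one_index, start=28, end=32):
    """Return the nucleotides at positions +start..+end relative to +1.

    plus_one_index is the 0-based index of the A+1 nucleotide in sequence.
    Promoter numbering has no position 0, so +n sits at
    plus_one_index + n - 1.
    """
    first = plus_one_index + start - 1
    last = plus_one_index + end - 1
    if first < 0 or last >= len(sequence):
        raise ValueError("window falls outside the sequence")
    return sequence[first:last + 1]


# Hypothetical promoter: 40 upstream placeholders, the A+1 nucleotide,
# 26 placeholders, then a 5-nucleotide motif occupying +28 to +32.
promoter = "N" * 40 + "A" + "N" * 26 + "AGACG" + "N" * 8
print(downstream_window(promoter, 40))   # AGACG

# Same motif pushed 3 nucleotides further downstream (hypothetical):
# the +28 to +32 window no longer holds the whole motif.
shifted = "N" * 40 + "A" + "N" * 29 + "AGACG" + "N" * 8
print(downstream_window(shifted, 40))    # NNNAG
```

The second call shows why spacing matters for a position-based test: a motif that is present but displaced by 3 nucleotides is no longer found in the +28 to +32 window.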

Initiator elements

"There is a strict requirement for spacing between the [Initiator element] Inr and DPE motifs, as an increase or decrease of 3 nucleotides in the distance between the Inr and DPE causes a seven- to eightfold reduction in transcription as well as a significant reduction in the binding of purified TFIID."[3]

Consensus sequences

The early DPE consensus sequence was RGWCGTG.[5][7]

The DPE consensus sequence is the more general sequence RGWYVT, or (A/G)G(A/T)(C/T)(A/C/G)T.[2]
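As an illustration of how such consensus strings are read, the following Python sketch translates the IUPAC codes used here (R, W, Y, V) into a regular expression. The function name consensus_to_regex and the test sequences are hypothetical and are not taken from any gene or from the cited sources.

```python
import re

# IUPAC nucleotide codes used by the consensus sequences in this section.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",   # purine: A or G
    "Y": "[CT]",   # pyrimidine: C or T
    "W": "[AT]",   # weak: A or T
    "V": "[ACG]",  # any nucleotide except T
}


def consensus_to_regex(consensus):
    """Translate an IUPAC consensus string into a compiled regular expression."""
    return re.compile("".join(IUPAC[code] for code in consensus))


dpe = consensus_to_regex("RGWYVT")
early_dpe = consensus_to_regex("RGWCGTG")

# AGACGTG and AGACTT are hypothetical test sequences.
print(bool(early_dpe.fullmatch("AGACGTG")))  # True
print(bool(dpe.match("AGACGTG")))            # True: the first six nucleotides fit RGWYVT
print(bool(dpe.fullmatch("AGACTT")))         # False: T is not allowed at the V position
```

The second result reflects that the first six positions of the early consensus, RGWCGT, are a special case of RGWYVT (Y = C, V = G).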

The DPE in "the ATP‐binding cassette subfamily G member 2 gene in the marine pufferfish Takifugu rubripes" is 5'-AGTCTC-3'.[8]

DPE-containing promoters

"The ... Drosophila Antennapedia P2 (Antp P2) [promoter contains] a 7-nucleotide sequence that conforms to the DPE consensus"[3]. GeneID: 40835 Antp Antennapedia [Drosophila melanogaster] is also known as Antp P2.[9] GeneID: 3204 HOXA7 homeobox A7 [Homo sapiens] is also known as ANTP, and "[t]his gene is highly similar to the antennapedia (Antp) gene of Drosophila."[10] Given this similarity, GeneID: 3204 may have a DPE as the Drosophila gene core promoter does.

"[T]he TATA-less Drosophila Abdominal-B (Abd-B) promoter [has a] partial DPE sequence"[3]. GeneID: 3205 HOXA9 homeobox A9 [Homo sapiens] is also known as ABD-B, and "[t]his gene is highly similar to the abdominal-B (Abd-B) gene of Drosophila."[11] By the same reasoning, GeneID: 3205 may also be TATA-less and have a DPE.

General transcription factor II Ds

The DPE "is required for the binding of purified [general transcription factor II D] TFIID to a subset of TATA-less promoters"[4].

"Photo-cross-linking analysis of purified TFIID with a TATA-less DPE-containing promoter revealed specific cross-linking of dTAFII60 [TAF6 GeneID: 6878] and dTAFII40 [TAF11 GeneID: 6882] to the DPE, with a higher efficiency of cross-linking to dTAFII60 than to dTAFII40. These data, combined with the previously well-characterized interactions between the two TAFs and their homology to histones H4 and H3, suggest that a dTAFII60–dTAFII40 heterotetramer binds to the DPE."[3]

Hypotheses

  1. The DPE is not used to transcribe A1BG.

DPE (Butler 2002) samplings

The Basic programs (starting with SuccessablesDPEB.bas) test the consensus sequence RGWYV, or (A/G)G(A/T)(C/T)(A/C/G), by comparing it with the nucleotide sequences on either the template strand (-) or the coding strand (+) of the DNA, in the negative direction (-) or the positive direction (+). Each entry below gives the strand and direction searched, the number of matches found, and each matching sequence with its position:

  1. negative strand, negative direction: 163, GGACC at 4546, GGACC at 4494, AGTCG at 4489, GGTCG at 4480, AGTCC at 4436, AGATG at 4430, GGTCA at 4415, GGACA at 4369, GGACC at 4349, GGTCG at 4345, GGACC at 4300, GGTCG at 4261, GGTCC at 4253, AGATG at 4212, GGACA at 4208, AGTTC at 4178, GGTCC at 4170, AGTCC at 4138, GGTCG at 4130, GGACA at 4121, GGTCC at 4102, AGATG at 4062, GGACC at 4037, GGTCG at 4033, AGTTC at 4027, GGTTC at 4019, GGTTG at 3979, GGACA at 3970, GGTCC at 3951, GGACC at 3906, GGTCC at 3885, GGTCC at 3871, GGACG at 3861, AGTTC at 3844, AGACC at 3835, GGACC at 3744, GGTCG at 3731, AGACG at 3706, GGTCG at 3701, GGTCG at 3682, GGTCC at 3585, GGACG at 3579, GGTCC at 3564, AGACA at 3556, AGTTG at 3523, AGTCC at 3396, AGACA at 3319, GGACC at 3298, GGTCG at 3294, GGTTC at 3273, GGTCC at 3249, AGTCC at 3217, GGTCG at 3209, AGTCG at 3204, GGACA at 3200, AGATG at 3158, GGTTG at 3137, GGACC at 3128, GGTCG at 3124, AGTCC at 3110, GGTCG at 3070, GGACA at 3061, GGATA at 2996, AGATG at 2988, GGTTA at 2848, GGACC at 2770, GGTCG at 2766, GGACC at 2720, GGTCG at 2681, GGACA at 2672, GGTCA at 2654, AGTCG at 2650, GGTTG at 2610, GGTCA at 2601, AGTCC at 2587, GGTTG at 2547, GGACA at 2538, GGTCC at 2519, AGTTA at 2496, GGACC at 2435, GGTCG at 2431, GGACC at 2385, GGTCG at 2346, GGACA at 2337, AGATG at 2294, GGACC at 2268, GGTCG at 2264, AGTCC at 2250, GGTCA at 2211, AGATG at 2169, GGTTG at 2148, AGTCC at 2134, GGATC at 2093, GGTCC at 2077, AGACA at 2029, GGACC at 2009, GGTCG at 2005, GGACC at 1959, GGTCG at 1920, GGACA at 1911, AGATG at 1867, GGACC at 1841, GGTTC at 1817, GGTCG at 1785, AGACA at 1776, GGTCG at 1611, GGTCA at 1532, AGATA at 1525, AGTTG at 1513, AGTCG at 1486, GGTCC at 1460, AGACA at 1452, AGTTG at 1406, AGACC at 1356, GGTCA at 1352, GGATC at 1306, AGTCC at 1275, GGTCG at 1267, GGACA at 1258, AGATG at 1224, GGTTG at 1203, GGACC at 1198, GGTCG at 1194, GGTCG at 1140, GGACA at 1131, AGACA at 1085, GGTCG at 1061, GGACC at 1015, AGTCC at 984, GGTCG at 976, 
GGACA at 967, GGTCC at 948, AGACA at 919, GGACC at 899, GGTCG at 895, GGTTC at 874, GGTCC at 850, GGTCG at 810, GGACA at 801, AGATG at 758, GGTCG at 737, GGTCG at 728, AGTCC at 714, GGTTC at 692, GGTCG at 676, GGACA at 667, GGTCC at 648, AGATG at 624, GGACC at 596, AGTCC at 578, GGTTC at 556, GGTCG at 540, GGACC at 508, GGTCG at 504, AGATG at 481, GGACC at 459, AGTCC at 441, GGTTC at 419, GGTCG at 403, GGACA at 394, GGTCC at 262, AGATA at 234, GGTCG at 35.
  2. positive strand, negative direction: 101, AGACA at 4507, AGTCC at 4500, AGATC at 4475, GGACA at 4468, AGTTC at 4417, AGACC at 4365, GGTCA at 4307, GGATC at 4288, AGACG at 4235, AGACC at 4204, AGACA at 4181, AGTTC at 4175, GGATC at 4157, AGTCC at 4126, AGTTG at 4096, AGACC at 4030, AGTTC at 4024, GGATC at 4006, GGTTG at 3945, AGATG at 3919, GGACC at 3868, GGTCG at 3813, GGTTG at 3804, AGACC at 3761, GGACA at 3756, GGATA at 3655, AGATG at 3627, AGATG at 3620, GGTTG at 3605, GGTTG at 3532, AGATC at 3488, AGATA at 3465, AGACA at 3433, GGACA at 3389, AGATC at 3276, GGTTG at 3261, AGACC at 3121, AGTTG at 3115, GGATC at 3097, AGATA at 2981, AGACA at 2948, AGATG at 2905, AGATG at 2894, AGACA at 2880, AGTTG at 2733, AGTTG at 2704, AGACC at 2598, AGTTG at 2592, GGTCA at 2585, GGATC at 2574, AGTCC at 2543, AGATC at 2413, GGTTG at 2398, GGACA at 2271, AGACC at 2261, GGTCA at 2248, GGATC at 2239, GGTTG at 2234, AGATA at 2177, AGACC at 2145, AGACC at 2121, GGACA at 2117, AGATC at 1987, AGACC at 1834, AGATG at 1828, GGATC at 1812, AGATA at 1595, AGACA at 1569, AGATG at 1438, GGTTG at 1319, AGTTC at 1177, GGATC at 1167, GGACG at 1151, GGTTG at 1028, AGATC at 972, AGATC at 877, GGTTG at 862, GGATG at 784, AGACC at 725, AGTTC at 719, GGTCA at 712, GGATC at 703, AGATC at 589, GGTCA at 576, GGTCA at 568, AGACA at 559, GGATC at 525, GGTCA at 439, GGATC at 430, AGACA at 422, AGTTC at 253, AGATG at 244, GGTCA at 206, AGACA at 170, AGTCG at 157, GGATA at 108, GGATA at 98, AGTTG at 84, GGATA at 74, AGATA at 57, GGACC at 32.
  3. negative strand, positive direction: 73, GGTCC at 4420, AGACA at 4332, AGACG at 4319, GGTCA at 4269, GGACA at 4252, AGTTC at 4200, GGATG at 4099, GGATC at 4080, GGTTC at 4073, AGACA at 3893, AGTCC at 3863, GGTCA at 3820, GGATG at 3574, AGACC at 3550, GGACC at 3545, GGACA at 3530, GGTTG at 3490, AGATG at 3475, GGATG at 3457, AGATG at 3418, AGTTA at 3381, AGTCG at 3283, GGACC at 3172, GGACA at 3131, AGTCC at 3084, GGTTG at 3050, GGTTA at 3024, AGTCC at 2998, AGTTC at 2954, GGTTC at 2922, AGACC at 2861, GGATA at 2737, GGATG at 2714, AGTTA at 2666, GGATA at 2659, AGTCA at 2618, AGTCA at 2613, AGTCA at 2607, GGACA at 2460, GGATG at 2409, AGATC at 2230, GGTCA at 2220, AGTTA at 2134, AGTCA at 2100, GGTCA at 2035, AGTCC at 2026, AGTTC at 1987, GGTTC at 1926, GGATG at 1878, GGACA at 1869, AGTCC at 1841, AGTCC at 1826, GGACA at 1693, GGTCG at 1687, GGACG at 1670, AGTCG at 1528, AGATC at 964, AGATC at 864, AGTCC at 757, AGACA at 712, AGACC at 440, GGACG at 410, AGACG at 398, GGACG at 359, GGACG at 323, GGTTC at 305, GGTCC at 218, GGACC at 187, AGTCC at 172, AGATG at 166, GGTCA at 153, GGATG at 59, GGACC at 37.
  4. positive strand, positive direction: 159, GGACC at 4424, AGACC at 4416, GGACC at 4409, AGTCA at 4271, GGACG at 4231, AGATC at 4076, AGATC at 4064, AGTCG at 4052, GGTCC at 4032, AGTCG at 4023, AGTCG at 3997, AGTCC at 3868, GGTCA at 3841, GGACC at 3787, AGTCG at 3775, GGACC at 3758, AGTCC at 3728, GGTCG at 3720, GGTCC at 3687, GGACC at 3679, GGTTG at 3633, GGACA at 3622, GGACA at 3617, GGTCC at 3536, GGACC at 3496, GGACA at 3434, AGTTA at 3424, AGACC at 3405, GGTCA at 3379, GGACC at 3362, AGACG at 3358, AGACG at 3306, GGACC at 3296, AGTTG at 3290, AGACG at 3278, AGACG at 3267, AGATA at 3258, GGTCG at 3239, AGTCG at 3155, GGTCC at 3111, GGTCA at 3082, AGACG at 3060, GGACC at 3047, AGTCG at 3041, AGTCC at 3034, AGACC at 3021, GGTCC at 3016, GGTCA at 2996, GGACC at 2988, AGACC at 2983, AGACG at 2975, AGACA at 2957, AGTCA at 2936, AGACA at 2925, GGTTA at 2908, GGACC at 2891, AGACC at 2883, GGTCC at 2876, AGACG at 2856, GGTCC at 2780, AGTCC at 2620, AGTTC at 2615, GGTCA at 2605, GGTTC at 2593, GGTCC at 2574, GGACC at 2569, AGTCG at 2526, GGACG at 2520, AGTTC at 2508, GGACC at 2501, GGATC at 2481, GGACC at 2433, GGTTC at 2398, AGTCG at 2390, AGTCC at 2372, GGTCC at 2316, AGACA at 2308, AGACA at 2260, GGACA at 2250, AGTTA at 2233, AGTCG at 2198, AGACA at 2182, AGATC at 2167, AGTCC at 2115, AGTCG at 2102, AGTCA at 2098, AGTCA at 2060, GGTCG at 2052, GGTCA at 2024, GGTTG at 2012, AGACC at 1992, GGTCC at 1893, AGACC at 1864, GGACA at 1860, GGTCC at 1855, GGACC at 1815, GGACG at 1776, AGACG at 1733, AGTTG at 1621, AGTCG at 1603, GGATG at 1573, AGACG at 1495, AGACC at 1476, GGACG at 1469, GGTCG at 1463, GGTCG at 1457, GGACG at 1411, AGACG at 1395, AGACC at 1376, GGACG at 1369, GGTCG at 1363, GGTCG at 1357, GGACG at 1311, GGATG at 1283, GGTTG at 1279, GGTCG at 1271, AGTCG at 1267, GGTCA at 1250, GGACC at 1199, GGATG at 1195, GGTCC at 1175, GGTCG at 1127, GGACG at 1118, GGACG at 1075, GGACA at 991, GGACC at 947, GGTTG at 943, AGTCG at 931, GGACG at 907, GGACA at 891, GGACC at 
847, GGTTG at 843, AGTCG at 831, GGACG at 807, GGTCC at 707, GGATG at 649, GGTCG at 623, GGTCG at 617, AGTCG at 613, GGTTG at 607, GGACC at 598, GGTCC at 515, AGTCG at 511, GGACG at 435, GGTCC at 424, GGTCG at 329, GGACC at 286, AGACC at 270, AGACG at 223, GGTCC at 215, GGACG at 191, GGTTC at 177, GGACA at 144, AGACC at 102, AGACA at 98, AGTCC at 90, GGACC at 40, GGTCC at 33, GGTCC at 8.
  5. inverse complement, negative strand, negative direction: 174, GGACC at 4546, TGTCT at 4518, GGACC at 4494, GGTCT at 4448, GGACC at 4349, GGACC at 4300, GAACT at 4294, CGTCC at 4282, CGACT at 4276, GAACC at 4268, GGTCC at 4253, GGTCT at 4233, GAACC at 4188, GGTCC at 4170, CGACT at 4145, GGTCC at 4102, CAACC at 4097, TATCT at 4079, GGACC at 4037, GAACT at 4012, CGACT at 3994, GGTCC at 3951, CAACC at 3946, CAACC at 3942, GGACT at 3932, TGTCT at 3917, GGACC at 3906, GGTCC at 3885, GGTCC at 3871, CGACC at 3864, CATCT at 3820, CAACT at 3805, GAACC at 3793, GGACT at 3781, TGACC at 3749, GGACC at 3744, CGACC at 3719, TGTCT at 3672, CGACT at 3649, CAACC at 3606, CGTCT at 3589, GGTCC at 3585, GAACT at 3571, GGTCC at 3564, TGACT at 3542, CAACT at 3533, TAACC at 3529, CAACT at 3505, GGTCT at 3486, GATCT at 3463, TATCC at 3447, CGTCT at 3431, TATCT at 3422, GAACT at 3401, TAACT at 3358, GGACC at 3298, CATCT at 3256, GGTCC at 3249, GAACT at 3242, CGACT at 3224, CGACC at 3180, GGACC at 3128, CAACC at 3116, GAACT at 3103, CGACT at 3085, CGACC at 3041, CGACC at 3035, GAACC at 2921, TATCT at 2903, TGTCT at 2878, TGTCT at 2778, GGACC at 2770, CGACT at 2744, GGACC at 2720, GAACT at 2714, CAACT at 2705, CGACT at 2696, TGTCC at 2689, CAACT at 2593, GAACT at 2580, CGTCC at 2568, CGACT at 2562, GGTCC at 2519, TGTCC at 2514, TGTCT at 2443, GGACC at 2435, CGTCC at 2389, GGACC at 2385, GAACT at 2379, CGTCC at 2367, CGACT at 2361, CGACC at 2326, GGACC at 2268, CAACC at 2235, CGACT at 2226, GAACT at 2127, TGTCT at 2119, CGACT at 2109, GGTCC at 2077, CGACC at 2069, TGTCT at 2017, GGACC at 2009, CGTCT at 1967, GGACC at 1959, CGTCC at 1941, TGACT at 1935, GAACC at 1927, CGACC at 1891, CAACT at 1853, GGACC at 1841, CGTCC at 1823, CGACT at 1800, CGACC at 1756, CGACC at 1746, TATCT at 1710, GGTCT at 1670, CATCT at 1653, GAACC at 1649, GGACT at 1623, CGTCT at 1614, TGTCT at 1567, GGTCT at 1518, CGACC at 1464, GGTCC at 1460, GGTCT at 1411, CGTCT at 1314, GATCC at 1307, GAACT at 1300, CGTCC at 
1288, CGACT at 1282, GGACC at 1198, GGACT at 1173, CGACC at 1111, TGTCT at 1073, TAACC at 1045, CGTCT at 1023, GGACC at 1015, GAACT at 1009, CGTCC at 997, CGACT at 991, CATCT at 970, GGTCC at 948, TGTCT at 907, GGACC at 899, GGTCC at 850, GAACT at 843, CGTCC at 831, CGACT at 825, CGACC at 781, GGACT at 732, CGTCC at 697, GGTCC at 648, TAACC at 643, GGACC at 596, TAACT at 585, CGTCC at 565, TGTCC at 561, GGACC at 508, GGACC at 459, TGTCC at 424, TATCT at 355, GAACC at 328, TGACT at 307, TGTCT at 289, CATCT at 284, GGTCC at 262, TGTCT at 168, CGACT at 140, TGACT at 130, CATCC at 119, TATCT at 100, CAACT at 85, TGACT at 17, TGTCT at 13.
  6. inverse complement, positive strand, negative direction: 58, GATCC at 4476, CATCC at 4456, GAACC at 4451, TGTCT at 4371, GGACT at 4327, TGTCT at 4210, CATCT at 4058, CATCC at 3903, GGACC at 3868, CAACT at 3849, TGTCT at 3833, GAACC at 3784, GGACT at 3747, CGTCC at 3698, GGACT at 3640, CATCT at 3551, CAACT at 3524, GAACT at 3460, TGTCT at 3321, GAACC at 3245, CATCT at 3154, TGTCT at 2986, CAACT at 2911, CAACC at 2844, TGACT at 2786, GAACC at 2717, GAACC at 2382, CATCT at 2290, TGACC at 2189, TGTCT at 2165, TGTCT at 2031, GAACC at 1956, CATCT at 1863, CATCC at 1838, GATCC at 1813, CGTCT at 1774, TATCT at 1731, GAACT at 1685, CATCC at 1572, TATCC at 1529, CAACC at 1514, GATCT at 1482, CAACC at 1407, GAACC at 1303, TGTCT at 1222, CGACC at 1191, TGTCT at 1087, TGACT at 1051, GAACC at 1012, GATCC at 973, TGTCT at 921, GAACC at 846, CGTCT at 754, TGACC at 734, TAACC at 614, CATCC at 593, TGTCT at 479, GGACC at 32.
  7. inverse complement, negative strand, positive direction: 95, GGTCC at 4420, GGTCT at 4414, GGTCT at 4380, TGTCC at 4367, CGACC at 4358, TGACC at 4216, CATCC at 4183, GATCC at 4081, CGTCT at 4056, GAACT at 4048, TGACC at 4018, GAACC at 3937, CAACC at 3911, GAACC at 3856, GAACC at 3838, CGACT at 3801, TGACC at 3784, GGTCT at 3771, TAACT at 3733, TGACC at 3714, CATCC at 3629, TGTCC at 3619, GGTCT at 3608, TGTCC at 3577, GGACC at 3545, CATCT at 3403, TGTCT at 3392, TATCC at 3384, CATCT at 3329, GGTCT at 3299, CAACT at 3291, CGTCT at 3256, GGTCT at 3245, CGTCC at 3203, GGACC at 3172, CATCC at 3108, TGTCT at 3053, TGACT at 3029, GGTCT at 3019, TGTCT at 3004, TGACT at 2945, GGTCT at 2941, TGACC at 2873, CATCT at 2852, GGACT at 2820, GAACC at 2776, GGACT at 2672, GAACC at 2579, GATCC at 2514, GGTCT at 2489, TGTCT at 2466, GATCC at 2378, CGACT at 2359, GGACT at 2271, GGTCT at 2258, GAACC at 2225, TGACC at 2213, TGTCT at 2172, CAACC at 2120, CATCT at 2111, TGTCT at 2078, CAACC at 2013, GGTCT at 1958, GAACT at 1951, CGTCC at 1930, TGTCT at 1862, GAACC at 1811, TGTCT at 1731, GGTCT at 1711, TGACC at 1662, CAACC at 1616, CGTCT at 1493, CGTCT at 1393, CAACC at 1280, CGACT at 998, TGTCC at 993, GATCC at 965, CAACC at 944, GGACT at 914, CGACT at 898, TGTCC at 893, GATCC at 865, CAACC at 844, GGACT at 814, GGACT at 746, CATCC at 698, CATCC at 629, CAACC at 608, TGTCT at 268, GGTCC at 218, GGACC at 187, TGTCT at 100, TGTCC at 82, GGACC at 37, CATCC at 30.
  8. inverse complement, positive strand, positive direction: 152, GGACC at 4424, GGACC at 4409, GGTCT at 4330, CGTCT at 4317, GAACC at 4300, GGACT at 4214, GGACT at 4186, CGACC at 4177, TAACT at 4161, GAACT at 4131, TGACT at 4089, GATCC at 4077, TGTCC at 4070, GATCT at 4065, CATCT at 4036, GGTCC at 4032, GAACT at 4016, CGACC at 3989, TGTCC at 3975, CGTCT at 3916, GGTCT at 3891, CGTCT at 3831, GGTCT at 3806, GGACC at 3787, CGACT at 3778, CGTCC at 3768, GGACC at 3758, CATCC at 3753, TGACT at 3735, CGTCC at 3694, GGTCC at 3687, GGACC at 3679, CGTCC at 3662, TGTCC at 3636, CGACT at 3588, TGTCC at 3571, GGTCT at 3548, GGTCC at 3536, CGACC at 3526, GATCC at 3522, GGACC at 3496, GATCC at 3484, CGTCT at 3473, CGTCC at 3466, CATCT at 3416, GGACC at 3362, TGACC at 3345, GGACC at 3296, CGACC at 3242, GGTCT at 3221, CGTCT at 3214, TGTCT at 3179, CGTCC at 3147, TGTCT at 3133, CGTCC at 3128, TGACC at 3117, GGTCC at 3111, GGTCT at 3091, GGACC at 3047, GGTCC at 3016, GGACC at 2988, GGACT at 2968, CGACT at 2915, GGACC at 2891, GGTCC at 2876, CGTCT at 2859, TGTCT at 2837, CAACC at 2816, CGACC at 2810, GGTCC at 2780, CGACC at 2770, CGTCC at 2745, CGACC at 2734, CGTCT at 2721, CGTCC at 2683, TGACT at 2674, TGTCT at 2652, GATCC at 2639, TATCT at 2627, GGTCC at 2574, GGACC at 2569, TATCC at 2550, CAACC at 2541, GGACC at 2501, GATCC at 2482, GGACC at 2433, TGTCT at 2414, CGACC at 2405, CGACC at 2320, GGTCC at 2316, CGTCC at 2296, CATCC at 2255, GGTCT at 2228, GGACT at 2211, CAACC at 2185, TGTCC at 2125, TGTCC at 1966, TGACC at 1953, CGTCT at 1937, CGTCC at 1905, GGTCC at 1893, CATCC at 1875, GGTCC at 1855, GGACC at 1815, GAACC at 1799, CGTCC at 1788, CGACC at 1779, GGTCT at 1742, CGACC at 1736, GGACT at 1676, GGACT at 1660, GGTCT at 1631, CGTCT at 1416, CGTCT at 1316, TGACT at 1286, GGACC at 1199, GGTCC at 1175, TGACC at 1140, GGACT at 959, GGACC at 947, GGTCT at 935, GGACT at 859, GGACC at 847, GGTCT at 835, CGACC at 779, GGACT at 725, GGTCC at 707, CGTCC at 658, GGACC at 598, TGTCC at 
552, GGTCC at 515, GGTCT at 468, CGTCT at 438, GGTCC at 424, CGACC at 417, CGTCT at 396, CGACC at 386, CGTCC at 379, TGTCC at 365, TGACC at 347, CGTCC at 318, GGACC at 286, CGACC at 277, GGTCC at 215, GGTCT at 204, CGTCC at 194, TGTCC at 157, GGACC at 40, GGTCC at 33, TAACC at 24, GGTCT at 15, GGTCC at 8.
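The kind of search summarized in the list above can be sketched in Python. This is not the author's Basic program: the names inverse_complement and find_hits, the test sequence, and the convention of reporting the 1-based position of the first nucleotide of each match are assumptions, since the position and direction conventions of the Basic programs are not stated here.

```python
import re

# Character classes for the IUPAC codes in RGWYV and its inverse complement.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "W": "[AT]",
         "V": "[ACG]", "B": "[CGT]"}

# Complement of each IUPAC code (W is its own complement).
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C",
              "R": "Y", "Y": "R", "W": "W", "V": "B", "B": "V"}


def inverse_complement(consensus):
    """Reverse the consensus and complement every code."""
    return "".join(COMPLEMENT[code] for code in reversed(consensus))


def find_hits(sequence, consensus):
    """Return (motif, 1-based start) for every match, overlaps included."""
    body = "".join(IUPAC[code] for code in consensus)
    # A lookahead keeps the scan from skipping overlapping matches.
    pattern = re.compile("(?=(" + body + "))")
    return [(m.group(1), m.start() + 1) for m in pattern.finditer(sequence)]


print(inverse_complement("RGWYV"))            # BRWCY

sequence = "TTGGACCTGTCTAA"                   # hypothetical test sequence
print(find_hits(sequence, "RGWYV"))           # [('GGACC', 3)]
print(find_hits(sequence, inverse_complement("RGWYV")))
# [('GGACC', 3), ('TGTCT', 8)]
```

In this toy sequence GGACC satisfies both RGWYV and its inverse complement BRWCY, which is consistent with motifs such as GGACC appearing in both the direct and the inverse complement lists above.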

DPEB (4560-2846) UTRs

Negative strand UTRs

  1. Negative strand, negative direction: GGACC at 4546, GGACC at 4494, AGTCG at 4489, GGTCG at 4480, AGTCC at 4436, AGATG at 4430, GGTCA at 4415, GGACA at 4369, GGACC at 4349, GGTCG at 4345, GGACC at 4300, GGTCG at 4261, GGTCC at 4253, AGATG at 4212, GGACA at 4208, AGTTC at 4178, GGTCC at 4170, AGTCC at 4138, GGTCG at 4130, GGACA at 4121, GGTCC at 4102, AGATG at 4062, GGACC at 4037, GGTCG at 4033, AGTTC at 4027, GGTTC at 4019, GGTTG at 3979, GGACA at 3970, GGTCC at 3951, GGACC at 3906, GGTCC at 3885, GGTCC at 3871, GGACG at 3861, AGTTC at 3844, AGACC at 3835, GGACC at 3744, GGTCG at 3731, AGACG at 3706, GGTCG at 3701, GGTCG at 3682, GGTCC at 3585, GGACG at 3579, GGTCC at 3564, AGACA at 3556, AGTTG at 3523, AGTCC at 3396, AGACA at 3319, GGACC at 3298, GGTCG at 3294, GGTTC at 3273, GGTCC at 3249, AGTCC at 3217, GGTCG at 3209, AGTCG at 3204, GGACA at 3200, AGATG at 3158, GGTTG at 3137, GGACC at 3128, GGTCG at 3124, AGTCC at 3110, GGTCG at 3070, GGACA at 3061, GGATA at 2996, AGATG at 2988, GGTTA at 2848.
  2. Negative strand, negative direction: TGTCT at 4518, GGTCT at 4448, GAACT at 4294, CGTCC at 4282, CGACT at 4276, GAACC at 4268, GGTCT at 4233, GAACC at 4188, CGACT at 4145, CAACC at 4097, TATCT at 4079, GAACT at 4012, CGACT at 3994, CAACC at 3946, CAACC at 3942, GGACT at 3932, TGTCT at 3917, CGACC at 3864, CATCT at 3820, CAACT at 3805, GAACC at 3793, GGACT at 3781, TGACC at 3749, CGACC at 3719, TGTCT at 3672, CGACT at 3649, CAACC at 3606, CGTCT at 3589, GAACT at 3571, TGACT at 3542, CAACT at 3533, TAACC at 3529, CAACT at 3505, GGTCT at 3486, GATCT at 3463, TATCC at 3447, CGTCT at 3431, TATCT at 3422, GAACT at 3401, TAACT at 3358, CATCT at 3256, GAACT at 3242, CGACT at 3224, CGACC at 3180, CAACC at 3116, GAACT at 3103, CGACT at 3085, CGACC at 3041, CGACC at 3035, GAACC at 2921, TATCT at 2903, TGTCT at 2878.

Positive strand UTRs

  1. Positive strand, negative direction: AGACA at 4507, AGTCC at 4500, AGATC at 4475, GGACA at 4468, AGTTC at 4417, AGACC at 4365, GGTCA at 4307, GGATC at 4288, AGACG at 4235, AGACC at 4204, AGACA at 4181, AGTTC at 4175, GGATC at 4157, AGTCC at 4126, AGTTG at 4096, AGACC at 4030, AGTTC at 4024, GGATC at 4006, GGTTG at 3945, AGATG at 3919, GGACC at 3868, GGTCG at 3813, GGTTG at 3804, AGACC at 3761, GGACA at 3756, GGATA at 3655, AGATG at 3627, AGATG at 3620, GGTTG at 3605, GGTTG at 3532, AGATC at 3488, AGATA at 3465, AGACA at 3433, GGACA at 3389, AGATC at 3276, GGTTG at 3261, AGACC at 3121, AGTTG at 3115, GGATC at 3097, AGATA at 2981, AGACA at 2948, AGATG at 2905, AGATG at 2894, AGACA at 2880.
  2. Positive strand, negative direction: GATCC at 4476, CATCC at 4456, GAACC at 4451, TGTCT at 4371, GGACT at 4327, TGTCT at 4210, CATCT at 4058, CATCC at 3903, CAACT at 3849, TGTCT at 3833, GAACC at 3784, GGACT at 3747, CGTCC at 3698, GGACT at 3640, CATCT at 3551, CAACT at 3524, GAACT at 3460, TGTCT at 3321, GAACC at 3245, CATCT at 3154, TGTCT at 2986, CAACT at 2911.

DPEB negative direction (2846-2811) core promoters

  1. Positive strand, negative direction: CAACC at 2844.

DPEB positive direction (4445-4265) core promoters

  1. Negative strand, positive direction: GGTCC at 4420, AGACA at 4332, AGACG at 4319, GGTCA at 4269.
  2. Negative strand, positive direction: GGTCT at 4414, GGTCT at 4380, TGTCC at 4367, CGACC at 4358.
  3. Positive strand, positive direction: GGACC at 4424, AGACC at 4416, GGACC at 4409, AGTCA at 4271.
  4. Positive strand, positive direction: GGTCT at 4330, CGTCT at 4317, GAACC at 4300.

DPEB negative direction (2811-2596) proximal promoters

  1. Negative strand, negative direction: GGACC at 2770, GGTCG at 2766, GGACC at 2720, GGTCG at 2681, GGACA at 2672, GGTCA at 2654, AGTCG at 2650, GGTTG at 2610, GGTCA at 2601.
  2. Negative strand, negative direction: TGTCT at 2778, CGACT at 2744, GAACT at 2714, CAACT at 2705, CGACT at 2696, TGTCC at 2689.
  3. Positive strand, negative direction: AGTTG at 2733, AGTTG at 2704, AGACC at 2598.
  4. Positive strand, negative direction: TGACT at 2786, GAACC at 2717.

DPEB positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: GGACA at 4252, AGTTC at 4200, GGATG at 4099, GGATC at 4080, GGTTC at 4073.
  2. Negative strand, positive direction: TGACC at 4216, CATCC at 4183, GATCC at 4081, CGTCT at 4056.
  3. Positive strand, positive direction: GGACG at 4231, AGATC at 4076, AGATC at 4064, AGTCG at 4052.
  4. Positive strand, positive direction: GGACT at 4214, GGACT at 4186, CGACC at 4177, TAACT at 4161, GAACT at 4131, TGACT at 4089, GATCC at 4077, TGTCC at 4070, GATCT at 4065.

DPEB negative direction (2596-1) distal promoters

Negative strand

  1. Negative strand, negative direction: AGTCC at 2587, GGTTG at 2547, GGACA at 2538, GGTCC at 2519, AGTTA at 2496, GGACC at 2435, GGTCG at 2431, GGACC at 2385, GGTCG at 2346, GGACA at 2337, AGATG at 2294, GGACC at 2268, GGTCG at 2264, AGTCC at 2250, GGTCA at 2211, AGATG at 2169, GGTTG at 2148, AGTCC at 2134, GGATC at 2093, GGTCC at 2077, AGACA at 2029, GGACC at 2009, GGTCG at 2005, GGACC at 1959, GGTCG at 1920, GGACA at 1911, AGATG at 1867, GGACC at 1841, GGTTC at 1817, GGTCG at 1785, AGACA at 1776, GGTCG at 1611, GGTCA at 1532, AGATA at 1525, AGTTG at 1513, AGTCG at 1486, GGTCC at 1460, AGACA at 1452, AGTTG at 1406, AGACC at 1356, GGTCA at 1352, GGATC at 1306, AGTCC at 1275, GGTCG at 1267, GGACA at 1258, AGATG at 1224, GGTTG at 1203, GGACC at 1198, GGTCG at 1194, GGTCG at 1140, GGACA at 1131, AGACA at 1085, GGTCG at 1061, GGACC at 1015, AGTCC at 984, GGTCG at 976, GGACA at 967, GGTCC at 948, AGACA at 919, GGACC at 899, GGTCG at 895, GGTTC at 874, GGTCC at 850, GGTCG at 810, GGACA at 801, AGATG at 758, GGTCG at 737, GGTCG at 728, AGTCC at 714, GGTTC at 692, GGTCG at 676, GGACA at 667, GGTCC at 648, AGATG at 624, GGACC at 596, AGTCC at 578, GGTTC at 556, GGTCG at 540, GGACC at 508, GGTCG at 504, AGATG at 481, GGACC at 459, AGTCC at 441, GGTTC at 419, GGTCG at 403, GGACA at 394, GGTCC at 262, AGATA at 234, GGTCG at 35.
  2. Negative strand, negative direction: CAACT at 2593, GAACT at 2580, CGTCC at 2568, CGACT at 2562, TGTCC at 2514, TGTCT at 2443, CGTCC at 2389, GAACT at 2379, CGTCC at 2367, CGACT at 2361, CGACC at 2326, CAACC at 2235, CGACT at 2226, GAACT at 2127, TGTCT at 2119, CGACT at 2109, CGACC at 2069, TGTCT at 2017, CGTCT at 1967, CGTCC at 1941, TGACT at 1935, GAACC at 1927, CGACC at 1891, CAACT at 1853, CGTCC at 1823, CGACT at 1800, CGACC at 1756, CGACC at 1746, TATCT at 1710, GGTCT at 1670, CATCT at 1653, GAACC at 1649, GGACT at 1623, CGTCT at 1614, TGTCT at 1567, GGTCT at 1518, CGACC at 1464, GGTCT at 1411, CGTCT at 1314, GATCC at 1307, GAACT at 1300, CGTCC at 1288, CGACT at 1282, GGACT at 1173, CGACC at 1111, TGTCT at 1073, TAACC at 1045, CGTCT at 1023, GAACT at 1009, CGTCC at 997, CGACT at 991, CATCT at 970, TGTCT at 907, GAACT at 843, CGTCC at 831, CGACT at 825, CGACC at 781, GGACT at 732, CGTCC at 697, TAACC at 643, TAACT at 585, CGTCC at 565, TGTCC at 561, TGTCC at 424, TATCT at 355, GAACC at 328, TGACT at 307, TGTCT at 289, CATCT at 284, TGTCT at 168, CGACT at 140, TGACT at 130, CATCC at 119, TATCT at 100, CAACT at 85, TGACT at 17, TGTCT at 13.

Positive strand

  1. Positive strand, negative direction: AGTTG at 2592, GGTCA at 2585, GGATC at 2574, AGTCC at 2543, AGATC at 2413, GGTTG at 2398, GGACA at 2271, AGACC at 2261, GGTCA at 2248, GGATC at 2239, GGTTG at 2234, AGATA at 2177, AGACC at 2145, AGACC at 2121, GGACA at 2117, AGATC at 1987, AGACC at 1834, AGATG at 1828, GGATC at 1812, AGATA at 1595, AGACA at 1569, AGATG at 1438, GGTTG at 1319, AGTTC at 1177, GGATC at 1167, GGACG at 1151, GGTTG at 1028, AGATC at 972, AGATC at 877, GGTTG at 862, GGATG at 784, AGACC at 725, AGTTC at 719, GGTCA at 712, GGATC at 703, AGATC at 589, GGTCA at 576, GGTCA at 568, AGACA at 559, GGATC at 525, GGTCA at 439, GGATC at 430, AGACA at 422, AGTTC at 253, AGATG at 244, GGTCA at 206, AGACA at 170, AGTCG at 157, GGATA at 108, GGATA at 98, AGTTG at 84, GGATA at 74, AGATA at 57, GGACC at 32.
  2. Positive strand, negative direction: GAACC at 2382, CATCT at 2290, TGACC at 2189, TGTCT at 2165, TGTCT at 2031, GAACC at 1956, CATCT at 1863, CATCC at 1838, GATCC at 1813, CGTCT at 1774, TATCT at 1731, GAACT at 1685, CATCC at 1572, TATCC at 1529, CAACC at 1514, GATCT at 1482, CAACC at 1407, GAACC at 1303, TGTCT at 1222, CGACC at 1191, TGTCT at 1087, TGACT at 1051, GAACC at 1012, GATCC at 973, TGTCT at 921, GAACC at 846, CGTCT at 754, TGACC at 734, TAACC at 614, CATCC at 593, TGTCT at 479.

DPEB positive direction (4050-1) distal promoters

Negative strand distals

  1. Negative strand, positive direction: AGACA at 3893, AGTCC at 3863, GGTCA at 3820, GGATG at 3574, AGACC at 3550, GGACC at 3545, GGACA at 3530, GGTTG at 3490, AGATG at 3475, GGATG at 3457, AGATG at 3418, AGTTA at 3381, AGTCG at 3283, GGACC at 3172, GGACA at 3131, AGTCC at 3084, GGTTG at 3050, GGTTA at 3024, AGTCC at 2998, AGTTC at 2954, GGTTC at 2922, AGACC at 2861, GGATA at 2737, GGATG at 2714, AGTTA at 2666, GGATA at 2659, AGTCA at 2618, AGTCA at 2613, AGTCA at 2607, GGACA at 2460, GGATG at 2409, AGATC at 2230, GGTCA at 2220, AGTTA at 2134, AGTCA at 2100, GGTCA at 2035, AGTCC at 2026, AGTTC at 1987, GGTTC at 1926, GGATG at 1878, GGACA at 1869, AGTCC at 1841, AGTCC at 1826, GGACA at 1693, GGTCG at 1687, GGACG at 1670, AGTCG at 1528, AGATC at 964, AGATC at 864, AGTCC at 757, AGACA at 712, AGACC at 440, GGACG at 410, AGACG at 398, GGACG at 359, GGACG at 323, GGTTC at 305, GGTCC at 218, GGACC at 187, AGTCC at 172, AGATG at 166, GGTCA at 153, GGATG at 59, GGACC at 37.
  2. Negative strand, positive direction: GAACT at 4048, TGACC at 4018, GAACC at 3937, CAACC at 3911, GAACC at 3856, GAACC at 3838, CGACT at 3801, TGACC at 3784, GGTCT at 3771, TAACT at 3733, TGACC at 3714, CATCC at 3629, TGTCC at 3619, GGTCT at 3608, TGTCC at 3577, CATCT at 3403, TGTCT at 3392, TATCC at 3384, CATCT at 3329, GGTCT at 3299, CAACT at 3291, CGTCT at 3256, GGTCT at 3245, CGTCC at 3203, CATCC at 3108, TGTCT at 3053, TGACT at 3029, GGTCT at 3019, TGTCT at 3004, TGACT at 2945, GGTCT at 2941, TGACC at 2873, CATCT at 2852, GGACT at 2820, GAACC at 2776, GGACT at 2672, GAACC at 2579, GATCC at 2514, GGTCT at 2489, TGTCT at 2466, GATCC at 2378, CGACT at 2359, GGACT at 2271, GGTCT at 2258, GAACC at 2225, TGACC at 2213, TGTCT at 2172, CAACC at 2120, CATCT at 2111, TGTCT at 2078, CAACC at 2013, GGTCT at 1958, GAACT at 1951, CGTCC at 1930, TGTCT at 1862, GAACC at 1811, TGTCT at 1731, GGTCT at 1711, TGACC at 1662, CAACC at 1616, CGTCT at 1493, CGTCT at 1393, CAACC at 1280, CGACT at 998, TGTCC at 993, GATCC at 965, CAACC at 944, GGACT at 914, CGACT at 898, TGTCC at 893, GATCC at 865, CAACC at 844, GGACT at 814, GGACT at 746, CATCC at 698, CATCC at 629, CAACC at 608, TGTCT at 268, TGTCT at 100, TGTCC at 82, CATCC at 30.

Positive strand distals

  1. Positive strand, positive direction: GGTCC at 4032, AGTCG at 4023, AGTCG at 3997, AGTCC at 3868, GGTCA at 3841, GGACC at 3787, AGTCG at 3775, GGACC at 3758, AGTCC at 3728, GGTCG at 3720, GGTCC at 3687, GGACC at 3679, GGTTG at 3633, GGACA at 3622, GGACA at 3617, GGTCC at 3536, GGACC at 3496, GGACA at 3434, AGTTA at 3424, AGACC at 3405, GGTCA at 3379, GGACC at 3362, AGACG at 3358, AGACG at 3306, GGACC at 3296, AGTTG at 3290, AGACG at 3278, AGACG at 3267, AGATA at 3258, GGTCG at 3239, AGTCG at 3155, GGTCC at 3111, GGTCA at 3082, AGACG at 3060, GGACC at 3047, AGTCG at 3041, AGTCC at 3034, AGACC at 3021, GGTCC at 3016, GGTCA at 2996, GGACC at 2988, AGACC at 2983, AGACG at 2975, AGACA at 2957, AGTCA at 2936, AGACA at 2925, GGTTA at 2908, GGACC at 2891, AGACC at 2883, GGTCC at 2876, AGACG at 2856, GGTCC at 2780, AGTCC at 2620, AGTTC at 2615, GGTCA at 2605, GGTTC at 2593, GGTCC at 2574, GGACC at 2569, AGTCG at 2526, GGACG at 2520, AGTTC at 2508, GGACC at 2501, GGATC at 2481, GGACC at 2433, GGTTC at 2398, AGTCG at 2390, AGTCC at 2372, GGTCC at 2316, AGACA at 2308, AGACA at 2260, GGACA at 2250, AGTTA at 2233, AGTCG at 2198, AGACA at 2182, AGATC at 2167, AGTCC at 2115, AGTCG at 2102, AGTCA at 2098, AGTCA at 2060, GGTCG at 2052, GGTCA at 2024, GGTTG at 2012, AGACC at 1992, GGTCC at 1893, AGACC at 1864, GGACA at 1860, GGTCC at 1855, GGACC at 1815, GGACG at 1776, AGACG at 1733, AGTTG at 1621, AGTCG at 1603, GGATG at 1573, AGACG at 1495, AGACC at 1476, GGACG at 1469, GGTCG at 1463, GGTCG at 1457, GGACG at 1411, AGACG at 1395, AGACC at 1376, GGACG at 1369, GGTCG at 1363, GGTCG at 1357, GGACG at 1311, GGATG at 1283, GGTTG at 1279, GGTCG at 1271, AGTCG at 1267, GGTCA at 1250, GGACC at 1199, GGATG at 1195, GGTCC at 1175, GGTCG at 1127, GGACG at 1118, GGACG at 1075, GGACA at 991, GGACC at 947, GGTTG at 943, AGTCG at 931, GGACG at 907, GGACA at 891, GGACC at 847, GGTTG at 843, AGTCG at 831, GGACG at 807, GGTCC at 707, GGATG at 649, GGTCG at 623, GGTCG at 617, AGTCG at 613, GGTTG 
at 607, GGACC at 598, GGTCC at 515, AGTCG at 511, GGACG at 435, GGTCC at 424, GGTCG at 329, GGACC at 286, AGACC at 270, AGACG at 223, GGTCC at 215, GGACG at 191, GGTTC at 177, GGACA at 144, AGACC at 102, AGACA at 98, AGTCC at 90, GGACC at 40, GGTCC at 33, GGTCC at 8.
  2. Positive strand, positive direction: CATCT at 4036, GAACT at 4016, CGACC at 3989, TGTCC at 3975, CGTCT at 3916, GGTCT at 3891, CGTCT at 3831, GGTCT at 3806, CGACT at 3778, CGTCC at 3768, CATCC at 3753, TGACT at 3735, CGTCC at 3694, CGTCC at 3662, TGTCC at 3636, CGACT at 3588, TGTCC at 3571, GGTCT at 3548, CGACC at 3526, GATCC at 3522, GATCC at 3484, CGTCT at 3473, CGTCC at 3466, CATCT at 3416, TGACC at 3345, CGACC at 3242, GGTCT at 3221, CGTCT at 3214, TGTCT at 3179, CGTCC at 3147, TGTCT at 3133, CGTCC at 3128, TGACC at 3117, GGTCT at 3091, GGACT at 2968, CGACT at 2915, CGTCT at 2859, TGTCT at 2837, CAACC at 2816, CGACC at 2810, CGACC at 2770, CGTCC at 2745, CGACC at 2734, CGTCT at 2721, CGTCC at 2683, TGACT at 2674, TGTCT at 2652, GATCC at 2639, TATCT at 2627, TATCC at 2550, CAACC at 2541, GATCC at 2482, TGTCT at 2414, CGACC at 2405, CGACC at 2320, CGTCC at 2296, CATCC at 2255, GGTCT at 2228, GGACT at 2211, CAACC at 2185, TGTCC at 2125, TGTCC at 1966, TGACC at 1953, CGTCT at 1937, CGTCC at 1905, CATCC at 1875, GAACC at 1799, CGTCC at 1788, CGACC at 1779, GGTCT at 1742, CGACC at 1736, GGACT at 1676, GGACT at 1660, GGTCT at 1631, CGTCT at 1416, CGTCT at 1316, TGACT at 1286, TGACC at 1140, GGACT at 959, GGTCT at 935, GGACT at 859, GGTCT at 835, CGACC at 779, GGACT at 725, CGTCC at 658, TGTCC at 552, GGTCT at 468, CGTCT at 438, CGACC at 417, CGTCT at 396, CGACC at 386, CGTCC at 379, TGTCC at 365, TGACC at 347, CGTCC at 318, CGACC at 277, GGTCT at 204, CGTCC at 194, TGTCC at 157, TAACC at 24, GGTCT at 15.

DPEB random dataset samplings

  1. DPEBr0: 103, AGATG at 4444, AGTTA at 4390, AGACG at 4363, GGATC at 4322, GGACC at 4314, GGATC at 4271, AGACA at 4234, GGTCG at 4187, AGTCA at 4169, GGTCC at 4116, AGACC at 4048, AGTCA at 4043, GGTCC at 3950, GGACA at 3847, GGTCA at 3832, AGTCG at 3811, GGTCG at 3747, AGTTA at 3680, GGTTA at 3630, GGTCC at 3612, GGTCG at 3559, GGATA at 3537, AGTTG at 3440, GGTTC at 3256, GGATA at 3218, AGTCA at 3188, AGTTG at 3165, AGACC at 3154, GGATC at 3140, AGTCC at 3134, AGACG at 3039, GGTTG at 3017, GGATA at 2990, GGTTA at 2974, GGTTG at 2932, AGACG at 2907, GGACA at 2883, GGACC at 2801, GGATA at 2796, AGATC at 2761, GGTCC at 2747, AGTCG at 2721, GGATC at 2667, AGACG at 2536, AGTCC at 2488, AGTTC at 2457, AGTCG at 2447, AGATA at 2423, AGACA at 2330, GGTCG at 2311, AGATC at 2249, GGACA at 2205, AGACA at 2182, AGTTA at 2139, GGACC at 2103, GGTTA at 2096, AGTTA at 2081, GGACG at 2073, GGTTC at 2048, GGACC at 2035, AGTTG at 2021, GGTCA at 1984, AGATC at 1949, AGATC at 1852, GGTCG at 1752, GGACA at 1702, AGTTG at 1667, AGATA at 1537, AGATA at 1465, AGACG at 1438, GGACA at 1433, GGTCG at 1417, GGATA at 1405, AGACC at 1315, GGACG at 1309, AGACG at 1236, GGATG at 1217, AGTTC at 1130, AGATG at 1125, GGATG at 979, AGTTG at 965, GGACC at 942, GGATC at 924, GGTTC at 884, AGTCC at 875, AGATA at 849, GGTCA at 845, GGTTC at 780, GGATG at 762, GGATC at 736, GGTTG at 722, GGTCA at 699, AGTTC at 687, GGACC at 678, AGTTA at 557, GGTTC at 535, AGTCC at 388, AGTTA at 188, AGTTG at 103, GGATC at 94, GGTTC at 84, GGTCC at 71, AGACC at 10.
  2. DPEBr1: 93, AGTCG at 4547, AGACG at 4488, GGTCA at 4483, GGACC at 4466, AGTTA at 4412, AGTTG at 4395, GGTTA at 4384, GGTTA at 4378, GGTCC at 4276, GGTCG at 4202, AGACA at 4196, GGTCG at 4155, GGTTA at 4122, GGTTG at 4018, AGACC at 3991, GGACA at 3985, GGATA at 3935, AGATC at 3860, AGTCA at 3854, GGTCC at 3642, GGTCA at 3425, AGTTG at 3388, AGACA at 3256, GGACA at 3141, GGACC at 3116, GGTCG at 3003, AGATC at 2995, GGTTA at 2961, GGATG at 2945, GGTTC at 2872, GGTTC at 2844, AGATC at 2834, GGTCA at 2777, AGACC at 2765, AGACA at 2748, AGTTA at 2743, GGATC at 2724, GGACA at 2617, GGACA at 2593, GGTCC at 2560, AGTTA at 2499, AGACA at 2469, GGACC at 2461, GGTCC at 2430, GGTTA at 2343, AGATC at 2250, GGTCA at 2237, GGATC at 2230, GGATA at 2154, GGACC at 2122, GGTCG at 2011, AGTTC at 1971, AGACC at 1949, GGACA at 1892, AGATC at 1798, GGATC at 1789, AGATC at 1779, GGTCA at 1750, AGTCC at 1717, GGTCG at 1682, AGTCG at 1652, GGATG at 1632, GGTCA at 1572, GGACG at 1510, GGTTC at 1499, GGTTA at 1471, GGATG at 1448, GGACA at 1341, GGATA at 1302, GGACG at 1183, GGTTA at 1084, GGTTG at 1074, AGTTC at 855, AGACC at 758, AGATA at 751, AGACC at 678, GGTCG at 578, AGTTC at 560, GGTCG at 510, GGATA at 492, AGTCG at 482, GGTCG at 416, GGACG at 407, AGATG at 400, AGATG at 345, AGTTC at 323, GGACA at 317, GGACC at 214, AGATA at 162, GGACA at 88, GGACG at 48, AGACA at 33, GGATA at 14.
  3. DPEBr2: 98, GGACG at 4555, AGTTG at 4536, GGATA at 4470, AGATA at 4445, GGTCA at 4381, AGTCA at 4374, AGTTA at 4331, GGTTA at 4302, GGACC at 4268, AGTTG at 4236, AGACA at 4196, AGACA at 4044, AGATA at 3943, GGTTG at 3934, GGTCA at 3905, GGACC at 3877, GGATG at 3867, AGTCA at 3849, AGTTG at 3792, AGATG at 3728, GGTCC at 3599, GGACG at 3513, GGATC at 3436, AGTCC at 3376, GGTTC at 3324, AGACC at 3310, GGACA at 3249, AGTTG at 3245, GGTCA at 3163, GGACG at 3134, AGTTG at 3126, AGTTG at 3044, GGATA at 3040, GGTCC at 2982, AGACG at 2903, AGTCA at 2899, GGACC at 2888, AGTCG at 2872, GGTCC at 2843, GGTTC at 2737, AGTCA at 2680, GGTTC at 2660, GGTCC at 2613, AGTTG at 2570, GGACC at 2404, AGACG at 2389, GGTTA at 2309, AGTTA at 2247, AGACC at 2232, GGTCA at 2185, GGTCC at 2145, AGATA at 2113, AGTTA at 2044, GGTTA at 2040, GGACA at 2018, AGACG at 2011, AGTTG at 1987, AGACG at 1860, AGATG at 1826, GGACA at 1776, GGTTA at 1765, AGATG at 1647, GGACG at 1602, GGTCC at 1575, AGACC at 1506, GGTCG at 1499, GGTTC at 1484, GGTCC at 1477, GGATA at 1429, GGATC at 1418, GGTCC at 1400, AGTTG at 1396, GGTTG at 1285, GGTTC at 1084, AGTTG at 1035, GGATA at 987, AGTTC at 878, AGACC at 682, GGATG at 664, GGATA at 656, GGTTA at 651, GGTTC at 635, AGACC at 607, AGTCC at 521, GGACA at 494, AGTTA at 480, GGTCA at 467, AGTTA at 378, AGTTG at 299, GGACG at 255, GGACA at 217, AGTCA at 199, GGTTC at 178, GGATC at 157, GGTCA at 141, GGTTC at 104, GGTCA at 23, AGTCG at 10.
  4. DPEBr3: 107, AGTTC at 4541, GGTTG at 4512, GGTTC at 4465, GGTTA at 4413, AGATG at 4408, GGACG at 4360, AGTTC at 4218, GGACG at 4201, AGTTC at 4158, GGTTG at 4107, AGTTC at 3987, GGACA at 3928, GGACA at 3884, GGTTA at 3877, GGATA at 3864, GGTCA at 3843, AGATC at 3824, GGTTA at 3820, AGACC at 3737, AGACG at 3718, AGTCG at 3684, GGTTC at 3662, AGTCC at 3570, GGTTG at 3551, GGTCA at 3505, AGACA at 3485, GGTCG at 3478, AGACC at 3387, GGATG at 3325, GGACC at 3318, AGACC at 3306, AGTTA at 3201, GGTCC at 3127, GGATG at 3038, AGTCC at 3027, GGTCG at 2945, GGTTG at 2907, GGTCC at 2887, GGTCA at 2854, AGATA at 2773, AGACA at 2741, AGTCG at 2686, GGACC at 2651, GGATC at 2566, GGACG at 2562, AGTCC at 2549, GGATC at 2434, AGTCC at 2336, GGTTC at 2322, GGTCA at 2303, GGTTC at 2279, AGTTG at 2263, GGATG at 2255, GGTCA at 2238, GGTTC at 2124, GGTTG at 2035, GGTTG at 2014, AGACC at 1961, GGACA at 1892, GGTCG at 1887, GGATG at 1790, AGATA at 1775, AGATG at 1714, GGATG at 1688, GGTTC at 1682, GGTCA at 1631, AGTTG at 1617, GGTCC at 1585, AGTTC at 1538, GGTTG at 1507, GGATG at 1419, AGATG at 1408, AGTTA at 1379, GGACG at 1371, GGACG at 1328, GGACC at 1274, GGTTA at 1247, GGTCC at 1234, GGTTG at 1192, GGACA at 1166, AGACG at 1101, AGACC at 1074, GGTCA at 1051, AGTTC at 1018, AGTTG at 998, GGTTA at 994, GGATC at 968, AGTCC at 949, AGATA at 940, GGTTC at 857, GGTCA at 747, AGACC at 723, GGATG at 703, AGACA at 660, GGTCG at 644, GGACG at 628, GGTCA at 600, GGTTG at 528, AGTCA at 477, GGTCC at 347, GGACC at 342, GGTTC at 328, GGTCG at 302, AGATG at 298, GGTTA at 257, GGACC at 75, AGACC at 15.
  5. DPEBr4: 96, GGACG at 4499, GGTTC at 4459, AGTTG at 4419, GGATA at 4275, GGATA at 4264, AGTCA at 4258, GGTTG at 4245, GGTCG at 4241, GGACC at 4235, AGATG at 4202, AGACC at 4170, GGACC at 4133, GGATG at 4070, AGTCC at 3970, GGATC at 3922, AGTCA at 3883, GGACA at 3831, GGTCA at 3806, AGATG at 3742, AGTCC at 3598, GGTCG at 3552, GGTTG at 3529, AGTTC at 3438, GGACA at 3386, AGACA at 3367, GGTTG at 3361, GGTCG at 3342, AGTCC at 3316, GGTCC at 3299, AGTCA at 3239, GGTTA at 3223, AGATG at 3201, GGTTC at 3148, AGATC at 3132, AGTCA at 3118, GGATC at 3092, AGATG at 3061, AGATC at 3047, GGATG at 2941, AGACG at 2880, AGTCG at 2824, AGATC at 2798, GGATC at 2690, GGTCC at 2666, GGACG at 2526, GGTCC at 2475, GGTCG at 2469, GGTTG at 2434, GGTTC at 2372, GGACG at 2312, GGTTA at 2292, AGTCA at 2248, GGTTG at 2160, GGTCC at 2091, AGTTG at 2006, AGACA at 1966, GGTTC at 1927, GGATG at 1902, GGACG at 1834, GGTCC at 1813, GGACC at 1758, AGATC at 1669, AGTTA at 1638, GGACG at 1604, AGTTG at 1578, AGACC at 1573, GGATC at 1494, AGACG at 1451, GGTTA at 1447, GGATG at 1288, GGACG at 1278, AGACG at 1225, GGATC at 1173, GGTTC at 1143, AGACG at 1111, AGTCA at 985, AGATA at 941, GGTTA at 886, GGACC at 852, GGTCC at 798, GGTTC at 749, GGTTC at 742, GGATG at 727, GGTCC at 676, AGTCC at 631, GGTTC at 570, GGATA at 510, AGTCC at 497, GGTCG at 357, AGTTG at 341, GGACG at 290, AGACC at 259, AGTCC at 226, GGTCC at 206, GGTCC at 130, AGACA at 74.
  6. DPEBr5: 0.
  7. DPEBr6: 0.
  8. DPEBr7: 0.
  9. DPEBr8: 0.
  10. DPEBr9: 0.
  11. DPEBr0ci: 0.
  12. DPEBr1ci: 0.
  13. DPEBr2ci: 0.
  14. DPEBr3ci: 0.
  15. DPEBr4ci: 0.
  16. DPEBr5ci: 0.
  17. DPEBr6ci: 0.
  18. DPEBr7ci: 0.
  19. DPEBr8ci: 0.
  20. DPEBr9ci: 0.
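
As a rough plausibility check on the counts above, the number of RGWYV matches expected by chance can be estimated. The following is a minimal sketch, not the method used for the samplings: it assumes each random dataset is 4560 nucleotides long (the upper bound in the section headings below) and that the four bases are equally likely and independent. The names `IUPAC`, `match_probability` and `expected_hits` are hypothetical.

```python
from fractions import Fraction

# IUPAC nucleotide codes appearing in the consensus, mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "W": "AT", "Y": "CT", "V": "ACG",
}


def match_probability(motif):
    """Chance that one window matches the motif (assumes uniform, independent bases)."""
    p = Fraction(1)
    for code in motif:
        p *= Fraction(len(IUPAC[code]), 4)
    return p


def expected_hits(motif, length):
    """Expected number of matching windows in a random sequence of the given length."""
    windows = length - len(motif) + 1
    return float(match_probability(motif) * windows)


print(match_probability("RGWYV"))               # 3/128
print(round(expected_hits("RGWYV", 4560), 1))   # 106.8
```

Under these assumptions about 107 matches are expected per scan, which is of the same order as the 93 to 107 matches listed for DPEBr0 through DPEBr4.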

DPEBr arbitrary (evens) (4560-2846) UTRs

  1. DPEBr0: AGATG at 4444, AGTTA at 4390, AGACG at 4363, GGATC at 4322, GGACC at 4314, GGATC at 4271, AGACA at 4234, GGTCG at 4187, AGTCA at 4169, GGTCC at 4116, AGACC at 4048, AGTCA at 4043, GGTCC at 3950, GGACA at 3847, GGTCA at 3832, AGTCG at 3811, GGTCG at 3747, AGTTA at 3680, GGTTA at 3630, GGTCC at 3612, GGTCG at 3559, GGATA at 3537, AGTTG at 3440, GGTTC at 3256, GGATA at 3218, AGTCA at 3188, AGTTG at 3165, AGACC at 3154, GGATC at 3140, AGTCC at 3134, AGACG at 3039, GGTTG at 3017, GGATA at 2990, GGTTA at 2974, GGTTG at 2932, AGACG at 2907, GGACA at 2883.
  2. DPEBr2: GGACG at 4555, AGTTG at 4536, GGATA at 4470, AGATA at 4445, GGTCA at 4381, AGTCA at 4374, AGTTA at 4331, GGTTA at 4302, GGACC at 4268, AGTTG at 4236, AGACA at 4196, AGACA at 4044, AGATA at 3943, GGTTG at 3934, GGTCA at 3905, GGACC at 3877, GGATG at 3867, AGTCA at 3849, AGTTG at 3792, AGATG at 3728, GGTCC at 3599, GGACG at 3513, GGATC at 3436, AGTCC at 3376, GGTTC at 3324, AGACC at 3310, GGACA at 3249, AGTTG at 3245, GGTCA at 3163, GGACG at 3134, AGTTG at 3126, AGTTG at 3044, GGATA at 3040, GGTCC at 2982, AGACG at 2903, AGTCA at 2899, GGACC at 2888, AGTCG at 2872.
  3. DPEBr4: GGACG at 4499, GGTTC at 4459, AGTTG at 4419, GGATA at 4275, GGATA at 4264, AGTCA at 4258, GGTTG at 4245, GGTCG at 4241, GGACC at 4235, AGATG at 4202, AGACC at 4170, GGACC at 4133, GGATG at 4070, AGTCC at 3970, GGATC at 3922, AGTCA at 3883, GGACA at 3831, GGTCA at 3806, AGATG at 3742, AGTCC at 3598, GGTCG at 3552, GGTTG at 3529, AGTTC at 3438, GGACA at 3386, AGACA at 3367, GGTTG at 3361, GGTCG at 3342, AGTCC at 3316, GGTCC at 3299, AGTCA at 3239, GGTTA at 3223, AGATG at 3201, GGTTC at 3148, AGATC at 3132, AGTCA at 3118, GGATC at 3092, AGATG at 3061, AGATC at 3047, GGATG at 2941, AGACG at 2880.

DPEBr alternate (odds) (4560-2846) UTRs

  1. DPEBr1: AGTCG at 4547, AGACG at 4488, GGTCA at 4483, GGACC at 4466, AGTTA at 4412, AGTTG at 4395, GGTTA at 4384, GGTTA at 4378, GGTCC at 4276, GGTCG at 4202, AGACA at 4196, GGTCG at 4155, GGTTA at 4122, GGTTG at 4018, AGACC at 3991, GGACA at 3985, GGATA at 3935, AGATC at 3860, AGTCA at 3854, GGTCC at 3642, GGTCA at 3425, AGTTG at 3388, AGACA at 3256, GGACA at 3141, GGACC at 3116, GGTCG at 3003, AGATC at 2995, GGTTA at 2961, GGATG at 2945, GGTTC at 2872.
  2. DPEBr3: AGTTC at 4541, GGTTG at 4512, GGTTC at 4465, GGTTA at 4413, AGATG at 4408, GGACG at 4360, AGTTC at 4218, GGACG at 4201, AGTTC at 4158, GGTTG at 4107, AGTTC at 3987, GGACA at 3928, GGACA at 3884, GGTTA at 3877, GGATA at 3864, GGTCA at 3843, AGATC at 3824, GGTTA at 3820, AGACC at 3737, AGACG at 3718, AGTCG at 3684, GGTTC at 3662, AGTCC at 3570, GGTTG at 3551, GGTCA at 3505, AGACA at 3485, GGTCG at 3478, AGACC at 3387, GGATG at 3325, GGACC at 3318, AGACC at 3306, AGTTA at 3201, GGTCC at 3127, GGATG at 3038, AGTCC at 3027, GGTCG at 2945, GGTTG at 2907, GGTCC at 2887, GGTCA at 2854.

DPEBr arbitrary negative direction (evens) (2846-2811) core promoters

  1. DPEBr2: GGTCC at 2843.
  2. DPEBr4: AGTCG at 2824.

DPEBr alternate negative direction (odds) (2846-2811) core promoters

  1. DPEBr1: GGTTC at 2844, AGATC at 2834.

DPEBr arbitrary positive direction (odds) (4445-4265) core promoters

  1. DPEBr1: AGTTA at 4412, AGTTG at 4395, GGTTA at 4384, GGTTA at 4378, GGTCC at 4276.
  2. DPEBr3: GGTTA at 4413, AGATG at 4408, GGACG at 4360.

DPEBr alternate positive direction (evens) (4445-4265) core promoters

  1. DPEBr0: AGATG at 4444, AGTTA at 4390, AGACG at 4363, GGATC at 4322, GGACC at 4314, GGATC at 4271.
  2. DPEBr2: AGATA at 4445, GGTCA at 4381, AGTCA at 4374, AGTTA at 4331, GGTTA at 4302, GGACC at 4268.
  3. DPEBr4: AGTTG at 4419, GGATA at 4275.

DPEBr arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. DPEBr0: GGACC at 2801, GGATA at 2796, AGATC at 2761, GGTCC at 2747, AGTCG at 2721, GGATC at 2667.
  2. DPEBr2: GGTTC at 2737, AGTCA at 2680, GGTTC at 2660, GGTCC at 2613.
  3. DPEBr4: AGATC at 2798, GGATC at 2690, GGTCC at 2666.

DPEBr alternate negative direction (odds) (2811-2596) proximal promoters

  1. DPEBr1: GGTCA at 2777, AGACC at 2765, AGACA at 2748, AGTTA at 2743, GGATC at 2724, GGACA at 2617.
  2. DPEBr3: AGATA at 2773, AGACA at 2741, AGTCG at 2686, GGACC at 2651.

DPEBr arbitrary positive direction (odds) (4265-4050) proximal promoters

  1. DPEBr1: GGTCG at 4202, AGACA at 4196, GGTCG at 4155, GGTTA at 4122.
  2. DPEBr3: AGTTC at 4218, GGACG at 4201, AGTTC at 4158, GGTTG at 4107.

DPEBr alternate positive direction (evens) (4265-4050) proximal promoters

  1. DPEBr0: AGACA at 4234, GGTCG at 4187, AGTCA at 4169, GGTCC at 4116.
  2. DPEBr2: AGTTG at 4236, AGACA at 4196.
  3. DPEBr4: GGATA at 4264, AGTCA at 4258, GGTTG at 4245, GGTCG at 4241, GGACC at 4235, AGATG at 4202, AGACC at 4170, GGACC at 4133, GGATG at 4070.

DPEBr arbitrary negative direction (evens) (2596-1) distal promoters

  1. DPEBr0: AGACG at 2536, AGTCC at 2488, AGTTC at 2457, AGTCG at 2447, AGATA at 2423, AGACA at 2330, GGTCG at 2311, AGATC at 2249, GGACA at 2205, AGACA at 2182, AGTTA at 2139, GGACC at 2103, GGTTA at 2096, AGTTA at 2081, GGACG at 2073, GGTTC at 2048, GGACC at 2035, AGTTG at 2021, GGTCA at 1984, AGATC at 1949, AGATC at 1852, GGTCG at 1752, GGACA at 1702, AGTTG at 1667, AGATA at 1537, AGATA at 1465, AGACG at 1438, GGACA at 1433, GGTCG at 1417, GGATA at 1405, AGACC at 1315, GGACG at 1309, AGACG at 1236, GGATG at 1217, AGTTC at 1130, AGATG at 1125, GGATG at 979, AGTTG at 965, GGACC at 942, GGATC at 924, GGTTC at 884, AGTCC at 875, AGATA at 849, GGTCA at 845, GGTTC at 780, GGATG at 762, GGATC at 736, GGTTG at 722, GGTCA at 699, AGTTC at 687, GGACC at 678, AGTTA at 557, GGTTC at 535, AGTCC at 388, AGTTA at 188, AGTTG at 103, GGATC at 94, GGTTC at 84, GGTCC at 71, AGACC at 10.
  2. DPEBr2: AGTTG at 2570, GGACC at 2404, AGACG at 2389, GGTTA at 2309, AGTTA at 2247, AGACC at 2232, GGTCA at 2185, GGTCC at 2145, AGATA at 2113, AGTTA at 2044, GGTTA at 2040, GGACA at 2018, AGACG at 2011, AGTTG at 1987, AGACG at 1860, AGATG at 1826, GGACA at 1776, GGTTA at 1765, AGATG at 1647, GGACG at 1602, GGTCC at 1575, AGACC at 1506, GGTCG at 1499, GGTTC at 1484, GGTCC at 1477, GGATA at 1429, GGATC at 1418, GGTCC at 1400, AGTTG at 1396, GGTTG at 1285, GGTTC at 1084, AGTTG at 1035, GGATA at 987, AGTTC at 878, AGACC at 682, GGATG at 664, GGATA at 656, GGTTA at 651, GGTTC at 635, AGACC at 607, AGTCC at 521, GGACA at 494, AGTTA at 480, GGTCA at 467, AGTTA at 378, AGTTG at 299, GGACG at 255, GGACA at 217, AGTCA at 199, GGTTC at 178, GGATC at 157, GGTCA at 141, GGTTC at 104, GGTCA at 23, AGTCG at 10.
  3. DPEBr4: GGACG at 2526, GGTCC at 2475, GGTCG at 2469, GGTTG at 2434, GGTTC at 2372, GGACG at 2312, GGTTA at 2292, AGTCA at 2248, GGTTG at 2160, GGTCC at 2091, AGTTG at 2006, AGACA at 1966, GGTTC at 1927, GGATG at 1902, GGACG at 1834, GGTCC at 1813, GGACC at 1758, AGATC at 1669, AGTTA at 1638, GGACG at 1604, AGTTG at 1578, AGACC at 1573, GGATC at 1494, AGACG at 1451, GGTTA at 1447, GGATG at 1288, GGACG at 1278, AGACG at 1225, GGATC at 1173, GGTTC at 1143, AGACG at 1111, AGTCA at 985, AGATA at 941, GGTTA at 886, GGACC at 852, GGTCC at 798, GGTTC at 749, GGTTC at 742, GGATG at 727, GGTCC at 676, AGTCC at 631, GGTTC at 570, GGATA at 510, AGTCC at 497, GGTCG at 357, AGTTG at 341, GGACG at 290, AGACC at 259, AGTCC at 226, GGTCC at 206, GGTCC at 130, AGACA at 74.

DPEBr alternate negative direction (odds) (2596-1) distal promoters

  1. DPEBr1: GGACA at 2593, GGTCC at 2560, AGTTA at 2499, AGACA at 2469, GGACC at 2461, GGTCC at 2430, GGTTA at 2343, AGATC at 2250, GGTCA at 2237, GGATC at 2230, GGATA at 2154, GGACC at 2122, GGTCG at 2011, AGTTC at 1971, AGACC at 1949, GGACA at 1892, AGATC at 1798, GGATC at 1789, AGATC at 1779, GGTCA at 1750, AGTCC at 1717, GGTCG at 1682, AGTCG at 1652, GGATG at 1632, GGTCA at 1572, GGACG at 1510, GGTTC at 1499, GGTTA at 1471, GGATG at 1448, GGACA at 1341, GGATA at 1302, GGACG at 1183, GGTTA at 1084, GGTTG at 1074, AGTTC at 855, AGACC at 758, AGATA at 751, AGACC at 678, GGTCG at 578, AGTTC at 560, GGTCG at 510, GGATA at 492, AGTCG at 482, GGTCG at 416, GGACG at 407, AGATG at 400, AGATG at 345, AGTTC at 323, GGACA at 317, GGACC at 214, AGATA at 162, GGACA at 88, GGACG at 48, AGACA at 33, GGATA at 14.
  2. DPEBr3: GGATC at 2566, GGACG at 2562, AGTCC at 2549, GGATC at 2434, AGTCC at 2336, GGTTC at 2322, GGTCA at 2303, GGTTC at 2279, AGTTG at 2263, GGATG at 2255, GGTCA at 2238, GGTTC at 2124, GGTTG at 2035, GGTTG at 2014, AGACC at 1961, GGACA at 1892, GGTCG at 1887, GGATG at 1790, AGATA at 1775, AGATG at 1714, GGATG at 1688, GGTTC at 1682, GGTCA at 1631, AGTTG at 1617, GGTCC at 1585, AGTTC at 1538, GGTTG at 1507, GGATG at 1419, AGATG at 1408, AGTTA at 1379, GGACG at 1371, GGACG at 1328, GGACC at 1274, GGTTA at 1247, GGTCC at 1234, GGTTG at 1192, GGACA at 1166, AGACG at 1101, AGACC at 1074, GGTCA at 1051, AGTTC at 1018, AGTTG at 998, GGTTA at 994, GGATC at 968, AGTCC at 949, AGATA at 940, GGTTC at 857, GGTCA at 747, AGACC at 723, GGATG at 703, AGACA at 660, GGTCG at 644, GGACG at 628, GGTCA at 600, GGTTG at 528, AGTCA at 477, GGTCC at 347, GGACC at 342, GGTTC at 328, GGTCG at 302, AGATG at 298, GGTTA at 257, GGACC at 75, AGACC at 15.

DPEBr arbitrary positive direction (odds) (4050-1) distal promoters

  1. DPEBr1: GGTTG at 4018, AGACC at 3991, GGACA at 3985, GGATA at 3935, AGATC at 3860, AGTCA at 3854, GGTCC at 3642, GGTCA at 3425, AGTTG at 3388, AGACA at 3256, GGACA at 3141, GGACC at 3116, GGTCG at 3003, AGATC at 2995, GGTTA at 2961, GGATG at 2945, GGTTC at 2872, GGTTC at 2844, AGATC at 2834, GGTCA at 2777, AGACC at 2765, AGACA at 2748, AGTTA at 2743, GGATC at 2724, GGACA at 2617, GGACA at 2593, GGTCC at 2560, AGTTA at 2499, AGACA at 2469, GGACC at 2461, GGTCC at 2430, GGTTA at 2343, AGATC at 2250, GGTCA at 2237, GGATC at 2230, GGATA at 2154, GGACC at 2122, GGTCG at 2011, AGTTC at 1971, AGACC at 1949, GGACA at 1892, AGATC at 1798, GGATC at 1789, AGATC at 1779, GGTCA at 1750, AGTCC at 1717, GGTCG at 1682, AGTCG at 1652, GGATG at 1632, GGTCA at 1572, GGACG at 1510, GGTTC at 1499, GGTTA at 1471, GGATG at 1448, GGACA at 1341, GGATA at 1302, GGACG at 1183, GGTTA at 1084, GGTTG at 1074, AGTTC at 855, AGACC at 758, AGATA at 751, AGACC at 678, GGTCG at 578, AGTTC at 560, GGTCG at 510, GGATA at 492, AGTCG at 482, GGTCG at 416, GGACG at 407, AGATG at 400, AGATG at 345, AGTTC at 323, GGACA at 317, GGACC at 214, AGATA at 162, GGACA at 88, GGACG at 48, AGACA at 33, GGATA at 14.
  2. DPEBr3: AGTTC at 3987, GGACA at 3928, GGACA at 3884, GGTTA at 3877, GGATA at 3864, GGTCA at 3843, AGATC at 3824, GGTTA at 3820, AGACC at 3737, AGACG at 3718, AGTCG at 3684, GGTTC at 3662, AGTCC at 3570, GGTTG at 3551, GGTCA at 3505, AGACA at 3485, GGTCG at 3478, AGACC at 3387, GGATG at 3325, GGACC at 3318, AGACC at 3306, AGTTA at 3201, GGTCC at 3127, GGATG at 3038, AGTCC at 3027, GGTCG at 2945, GGTTG at 2907, GGTCC at 2887, GGTCA at 2854, AGATA at 2773, AGACA at 2741, AGTCG at 2686, GGACC at 2651, GGATC at 2566, GGACG at 2562, AGTCC at 2549, GGATC at 2434, AGTCC at 2336, GGTTC at 2322, GGTCA at 2303, GGTTC at 2279, AGTTG at 2263, GGATG at 2255, GGTCA at 2238, GGTTC at 2124, GGTTG at 2035, GGTTG at 2014, AGACC at 1961, GGACA at 1892, GGTCG at 1887, GGATG at 1790, AGATA at 1775, AGATG at 1714, GGATG at 1688, GGTTC at 1682, GGTCA at 1631, AGTTG at 1617, GGTCC at 1585, AGTTC at 1538, GGTTG at 1507, GGATG at 1419, AGATG at 1408, AGTTA at 1379, GGACG at 1371, GGACG at 1328, GGACC at 1274, GGTTA at 1247, GGTCC at 1234, GGTTG at 1192, GGACA at 1166, AGACG at 1101, AGACC at 1074, GGTCA at 1051, AGTTC at 1018, AGTTG at 998, GGTTA at 994, GGATC at 968, AGTCC at 949, AGATA at 940, GGTTC at 857, GGTCA at 747, AGACC at 723, GGATG at 703, AGACA at 660, GGTCG at 644, GGACG at 628, GGTCA at 600, GGTTG at 528, AGTCA at 477, GGTCC at 347, GGACC at 342, GGTTC at 328, GGTCG at 302, AGATG at 298, GGTTA at 257, GGACC at 75, AGACC at 15.

DPEBr alternate positive direction (evens) (4050-1) distal promoters

  1. DPEBr0: AGACC at 4048, AGTCA at 4043, GGTCC at 3950, GGACA at 3847, GGTCA at 3832, AGTCG at 3811, GGTCG at 3747, AGTTA at 3680, GGTTA at 3630, GGTCC at 3612, GGTCG at 3559, GGATA at 3537, AGTTG at 3440, GGTTC at 3256, GGATA at 3218, AGTCA at 3188, AGTTG at 3165, AGACC at 3154, GGATC at 3140, AGTCC at 3134, AGACG at 3039, GGTTG at 3017, GGATA at 2990, GGTTA at 2974, GGTTG at 2932, AGACG at 2907, GGACA at 2883, GGACC at 2801, GGATA at 2796, AGATC at 2761, GGTCC at 2747, AGTCG at 2721, GGATC at 2667, AGACG at 2536, AGTCC at 2488, AGTTC at 2457, AGTCG at 2447, AGATA at 2423, AGACA at 2330, GGTCG at 2311, AGATC at 2249, GGACA at 2205, AGACA at 2182, AGTTA at 2139, GGACC at 2103, GGTTA at 2096, AGTTA at 2081, GGACG at 2073, GGTTC at 2048, GGACC at 2035, AGTTG at 2021, GGTCA at 1984, AGATC at 1949, AGATC at 1852, GGTCG at 1752, GGACA at 1702, AGTTG at 1667, AGATA at 1537, AGATA at 1465, AGACG at 1438, GGACA at 1433, GGTCG at 1417, GGATA at 1405, AGACC at 1315, GGACG at 1309, AGACG at 1236, GGATG at 1217, AGTTC at 1130, AGATG at 1125, GGATG at 979, AGTTG at 965, GGACC at 942, GGATC at 924, GGTTC at 884, AGTCC at 875, AGATA at 849, GGTCA at 845, GGTTC at 780, GGATG at 762, GGATC at 736, GGTTG at 722, GGTCA at 699, AGTTC at 687, GGACC at 678, AGTTA at 557, GGTTC at 535, AGTCC at 388, AGTTA at 188, AGTTG at 103, GGATC at 94, GGTTC at 84, GGTCC at 71, AGACC at 10.
  2. DPEBr2: AGACA at 4044, AGATA at 3943, GGTTG at 3934, GGTCA at 3905, GGACC at 3877, GGATG at 3867, AGTCA at 3849, AGTTG at 3792, AGATG at 3728, GGTCC at 3599, GGACG at 3513, GGATC at 3436, AGTCC at 3376, GGTTC at 3324, AGACC at 3310, GGACA at 3249, AGTTG at 3245, GGTCA at 3163, GGACG at 3134, AGTTG at 3126, AGTTG at 3044, GGATA at 3040, GGTCC at 2982, AGACG at 2903, AGTCA at 2899, GGACC at 2888, AGTCG at 2872, GGTCC at 2843, GGTTC at 2737, AGTCA at 2680, GGTTC at 2660, GGTCC at 2613, AGTTG at 2570, GGACC at 2404, AGACG at 2389, GGTTA at 2309, AGTTA at 2247, AGACC at 2232, GGTCA at 2185, GGTCC at 2145, AGATA at 2113, AGTTA at 2044, GGTTA at 2040, GGACA at 2018, AGACG at 2011, AGTTG at 1987, AGACG at 1860, AGATG at 1826, GGACA at 1776, GGTTA at 1765, AGATG at 1647, GGACG at 1602, GGTCC at 1575, AGACC at 1506, GGTCG at 1499, GGTTC at 1484, GGTCC at 1477, GGATA at 1429, GGATC at 1418, GGTCC at 1400, AGTTG at 1396, GGTTG at 1285, GGTTC at 1084, AGTTG at 1035, GGATA at 987, AGTTC at 878, AGACC at 682, GGATG at 664, GGATA at 656, GGTTA at 651, GGTTC at 635, AGACC at 607, AGTCC at 521, GGACA at 494, AGTTA at 480, GGTCA at 467, AGTTA at 378, AGTTG at 299, GGACG at 255, GGACA at 217, AGTCA at 199, GGTTC at 178, GGATC at 157, GGTCA at 141, GGTTC at 104, GGTCA at 23, AGTCG at 10.
  3. DPEBr4: AGTCC at 3970, GGATC at 3922, AGTCA at 3883, GGACA at 3831, GGTCA at 3806, AGATG at 3742, AGTCC at 3598, GGTCG at 3552, GGTTG at 3529, AGTTC at 3438, GGACA at 3386, AGACA at 3367, GGTTG at 3361, GGTCG at 3342, AGTCC at 3316, GGTCC at 3299, AGTCA at 3239, GGTTA at 3223, AGATG at 3201, GGTTC at 3148, AGATC at 3132, AGTCA at 3118, GGATC at 3092, AGATG at 3061, AGATC at 3047, GGATG at 2941, AGACG at 2880, AGTCG at 2824, AGATC at 2798, GGATC at 2690, GGTCC at 2666, GGACG at 2526, GGTCC at 2475, GGTCG at 2469, GGTTG at 2434, GGTTC at 2372, GGACG at 2312, GGTTA at 2292, AGTCA at 2248, GGTTG at 2160, GGTCC at 2091, AGTTG at 2006, AGACA at 1966, GGTTC at 1927, GGATG at 1902, GGACG at 1834, GGTCC at 1813, GGACC at 1758, AGATC at 1669, AGTTA at 1638, GGACG at 1604, AGTTG at 1578, AGACC at 1573, GGATC at 1494, AGACG at 1451, GGTTA at 1447, GGATG at 1288, GGACG at 1278, AGACG at 1225, GGATC at 1173, GGTTC at 1143, AGACG at 1111, AGTCA at 985, AGATA at 941, GGTTA at 886, GGACC at 852, GGTCC at 798, GGTTC at 749, GGTTC at 742, GGATG at 727, GGTCC at 676, AGTCC at 631, GGTTC at 570, GGATA at 510, AGTCC at 497, GGTCG at 357, AGTTG at 341, GGACG at 290, AGACC at 259, AGTCC at 226, GGTCC at 206, GGTCC at 130, AGACA at 74.
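
The region-by-region listings above amount to sorting each hit by its position into the ranges named in the section headings. The following is a minimal sketch of that binning, not the original Basic programs. The names `NEGATIVE_REGIONS`, `POSITIVE_REGIONS` and `bin_hits` are hypothetical, and the treatment of boundary positions (upper bound included, lower bound excluded) is an assumption; the listings only show that position 4445 is counted in the 4445-4265 core promoter.

```python
# Region boundaries (upper, lower) copied from the section headings.
# Assumption: a hit belongs to a region when lower < position <= upper,
# so the distal lower bound is written as 0 to include position 1.
NEGATIVE_REGIONS = {
    "UTR": (4560, 2846),
    "core": (2846, 2811),
    "proximal": (2811, 2596),
    "distal": (2596, 0),
}
POSITIVE_REGIONS = {
    "core": (4445, 4265),
    "proximal": (4265, 4050),
    "distal": (4050, 0),
}


def bin_hits(hits, regions):
    """Group (motif, position) hits by the first region containing the position."""
    binned = {name: [] for name in regions}
    for motif, position in hits:
        for name, (upper, lower) in regions.items():
            if lower < position <= upper:
                binned[name].append((motif, position))
                break
    return binned


# A few DPEBr4 hits from the sampling list, chosen around the region boundaries.
sample = [("AGACG", 2880), ("AGTCG", 2824), ("AGATC", 2798),
          ("GGATC", 2690), ("GGTCC", 2666), ("GGACG", 2526)]
for region, members in bin_hits(sample, NEGATIVE_REGIONS).items():
    print(region, members)
```

With these sample hits the sketch places AGACG at 2880 in the UTR, AGTCG at 2824 in the core promoter, three hits in the proximal promoter and GGACG at 2526 in the distal promoter, in agreement with the DPEBr4 negative-direction listings above.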

DPEB analysis and results

The sequence analyzed is RGWYV, or (A/G)G(A/T)(C/T)(A/C/G).[12]

| Reals or randoms | Promoters | Direction | Numbers | Strands | Occurrences | Averages (± 0.1) |
|---|---|---|---|---|---|---|
| Reals | UTR | negative | 183 | 2 | 91.5 | 91.5 ± 25.5 (--117, +-66) |
| Randoms | UTR | arbitrary negative | 0 | 10 | 0 | 0 |
| Randoms | UTR | alternate negative | 0 | 10 | 0 | 0 |
| Reals | Core | negative | 1 | 2 | 0.5 | 0.5 |
| Randoms | Core | arbitrary negative | 0 | 10 | 0 | 0 |
| Randoms | Core | alternate negative | 0 | 10 | 0 | 0 |
| Reals | Core | positive | 15 | 2 | 7.5 | 7.5 ± 0.5 (-+8, ++7) |
| Randoms | Core | arbitrary positive | 0 | 10 | 0 | 0 |
| Randoms | Core | alternate positive | 0 | 10 | 0 | 0 |
| Reals | Proximal | negative | 20 | 2 | 10 | 10 ± 5 (--15, +-5) |
| Randoms | Proximal | arbitrary negative | 0 | 10 | 0 | 0 |
| Randoms | Proximal | alternate negative | 0 | 10 | 0 | 0 |
| Reals | Proximal | positive | 22 | 2 | 11 | 11 ± 2 (-+9, ++13) |
| Randoms | Proximal | arbitrary positive | 0 | 10 | 0 | 0 |
| Randoms | Proximal | alternate positive | 0 | 10 | 0 | 0 |
| Reals | Distal | negative | 251 | 2 | 125.5 | 125.5 ± 40.5 (--166, +-85) |
| Randoms | Distal | arbitrary negative | 0 | 10 | 0 | 0 |
| Randoms | Distal | alternate negative | 0 | 10 | 0 | 0 |
| Reals | Distal | positive | 397 | 2 | 198.5 | 198.5 ± 53.5 (-+145, ++252) |
| Randoms | Distal | arbitrary positive | 0 | 10 | 0 | 0 |
| Randoms | Distal | alternate positive | 0 | 10 | 0 | 0 |
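
The averages for the reals appear to be the mean of the two per-strand counts, with the ± value equal to half the difference between them. The following is a minimal sketch of that arithmetic under this reading; the function name `strand_average` is hypothetical.

```python
def strand_average(negative_strand, positive_strand):
    """Return (total, mean, half-range) for the counts on the two strands."""
    total = negative_strand + positive_strand
    mean = total / 2
    half_range = abs(negative_strand - positive_strand) / 2
    return total, mean, half_range


# UTR, negative direction: 117 on the negative strand, 66 on the positive strand.
print(strand_average(117, 66))    # (183, 91.5, 25.5)
# Distal, positive direction: 145 on the negative strand, 252 on the positive strand.
print(strand_average(145, 252))   # (397, 198.5, 53.5)
```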

Comparison:

In every promoter region tabulated, the occurrences of real DPEBs are greater than those of the randoms. This suggests that the real DPEBs are likely active or activable.

DPE (Juven-Gershon) samplings

Basic programs (starting with SuccessablesDPE.bas) were written to compare nucleotide sequences with the sequences on either the template strand (-) or the coding strand (+) of the DNA, in the negative direction (-) or the positive direction (+). The programs, expanded in the positive direction from 958 to 4445, looked for the DPE (Juven-Gershon) sequence and its inverse complement and found:

  1. Negative strand, negative direction: 63, AGTCCT at 4437, AGATGT at 4213, AGTTCT at 4179, GGTCCT at 4171, AGTCCT at 4139, GGACAT at 4122, AGATGT at 4063, AGTTCT at 4028, GGTTCT at 4020, GGTTGT at 3980, GGACAT at 3971, GGACCT at 3907, AGACCT at 3836, GGACCT at 3745, GGTCGT at 3732, GGTTCT at 3274, GGTCCT at 3250, AGTCCT at 3218, GGTTGT at 3138, AGTCCT at 3111, GGTCGT at 3071, GGACAT at 3062, AGATGT at 2989, GGTTAT at 2849, GGACAT at 2673, GGTTGT at 2611, AGTCCT at 2588, GGTTGT at 2548, GGACAT at 2539, AGTTAT at 2497, GGACAT at 2338, GGACCT at 2269, AGTCCT at 2251, GGTCAT at 2212, GGTTGT at 2149, AGTCCT at 2135, GGACAT at 1912, GGTCGT at 1786, AGACAT at 1777, GGTCGT at 1612, AGATAT at 1526, AGTCCT at 1276, GGACAT at 1259, AGATGT at 1225, GGTTGT at 1204, GGTCGT at 1141, GGACAT at 1132, AGTCCT at 985, GGACAT at 968, GGTTCT at 875, GGTCCT at 851, GGACAT at 802, AGTCCT at 715, GGTCGT at 677, GGACAT at 668, AGTCCT at 579, GGTTCT at 557, GGTCGT at 541, AGATGT at 482, AGTCCT at 442, GGTTCT at 420, GGTCGT at 404, GGACAT at 395.
  2. Negative strand, positive direction: 18, GGTTCT at 4074, AGTCCT at 3864, GGATGT at 3575, AGACCT at 3551, AGTTAT at 3382, GGTTGT at 3051, GGTTAT at 3025, AGTCCT at 2999, AGTTCT at 2955, GGTTCT at 2923, AGACCT at 2862, GGATGT at 2715, GGATAT at 2660, AGTTCT at 1988, GGACAT at 1870, AGTCCT at 758, GGTCCT at 219, GGACCT at 38.
  3. Positive strand, negative direction: 16, AGACAT at 4508, AGTTCT at 4418, AGACGT at 4236, AGATGT at 3621, AGATAT at 3466, AGACAT at 3434, AGATAT at 2982, AGACAT at 2949, AGACAT at 2881, AGATAT at 1596, AGACAT at 1570, GGATGT at 785, AGATGT at 245, AGACAT at 171, GGATAT at 109, GGATAT at 75.
  4. Positive strand, positive direction: 31, AGATCT at 4065, AGTCGT at 4024, AGTCCT at 3869, GGTCGT at 3721, GGTTGT at 3634, AGTTAT at 3425, GGACCT at 3363, AGACGT at 3279, AGACGT at 3268, AGTCGT at 3156, AGACGT at 3061, AGTCGT at 3042, AGACGT at 2857, AGTCCT at 2621, AGTCGT at 2199, AGTCGT at 2103, GGACGT at 1470, GGTCGT at 1458, GGACGT at 1370, GGTCGT at 1358, GGACGT at 1119, GGTCCT at 708, GGTCGT at 618, GGACCT at 599, GGACGT at 436, GGTCCT at 425, AGACCT at 271, AGACGT at 224, GGACGT at 192, GGTTCT at 178, GGACCT at 41.
  5. inverse complement, negative strand, negative direction: 18, AGGACC at 4546, ACAACC at 3942, AGGACC at 3906, ACGACC at 3864, AGAACC at 3793, AGGTCC at 3585, ATGACT at 3542, ATAACC at 3529, ACGTCT at 3431, ATATCT at 2903, ACGACC at 2326, ACAACT at 1853, AGGACC at 1841, AGAACC at 1649, ATGTCT at 1567, ACATCT at 970, AGGACC at 596, ACATCT at 284.
  6. inverse complement, negative strand, positive direction: 16, ATGTCC at 4367, AGAACT at 4048, ATGACC at 3784, AGGTCT at 3771, ATGTCC at 3577, ACGTCT at 3256, ATGACT at 3029, AGGTCT at 3019, AGAACC at 2776, AGGTCT at 2258, AGAACC at 2225, AGAACT at 1951, AGAACC at 1811, AGATCC at 965, AGATCC at 865, AGGTCC at 218.
  7. inverse complement, positive strand, negative direction: 12, AGATCC at 4476, AGAACC at 4451, ATGTCT at 3833, AGGACT at 3640, ATGTCT at 2986, ACAACC at 2844, ATGACT at 2786, ATGACC at 2189, ACGTCT at 1774, ACATCC at 1572, ATATCC at 1529, AGATCC at 973.
  8. inverse complement, positive strand, positive direction: 39, AGGACC at 4409, ACGTCT at 4317, AGGACT at 4186, ACGACC at 4177, ATAACT at 4161, AGAACT at 4131, AGATCC at 4077, AGATCT at 4065, AGGTCC at 4032, AGGTCT at 3891, ACGTCT at 3831, AGGTCT at 3806, AGGTCC at 3687, ACGTCC at 3466, AGGACC at 3296, ATGACC at 3117, AGGTCC at 3111, ACGTCT at 2859, ACAACC at 2816, ACGTCC at 2745, ACGTCT at 2721, ACGTCC at 2683, ATATCC at 2550, AGGACC at 2501, ACATCC at 2255, AGGACT at 2211, ACAACC at 2185, ACGTCT at 1937, ACATCC at 1875, ACGTCC at 1788, ACGACC at 1779, ACGACC at 1736, ATGACT at 1286, ACGTCC at 658, ACGTCT at 438, ACGTCC at 194, AGGTCC at 33, AGGTCT at 15, AGGTCC at 8.
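
The kind of search listed above can be sketched in a few lines. This is a minimal illustration, not the original Basic programs: it scans one strand for the DPE consensus RGWYVT and for its inverse complement, allowing overlapping matches. The names `IUPAC`, `COMPLEMENT`, `inverse_complement` and `find_motif` are hypothetical, the test sequence is invented, and reporting the 1-based start of each match is an assumption, since the position convention of the listings is not stated.

```python
import re

# IUPAC nucleotide codes as regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
    "M": "[AC]", "K": "[GT]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}
# Complement of each IUPAC code.
COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "R": "Y", "Y": "R", "W": "W", "S": "S",
    "M": "K", "K": "M", "B": "V", "V": "B",
    "D": "H", "H": "D", "N": "N",
}


def inverse_complement(motif):
    """Reverse the motif and complement each code."""
    return "".join(COMPLEMENT[code] for code in reversed(motif))


def find_motif(sequence, motif):
    """Return (matched text, 1-based start) for every match, overlaps included."""
    body = "".join(IUPAC[code] for code in motif)
    # A lookahead keeps the scan advancing one base at a time.
    pattern = re.compile("(?=(" + body + "))")
    return [(m.group(1), m.start() + 1) for m in pattern.finditer(sequence)]


dpe = "RGWYVT"
sequence = "TTAGGACCTGG"  # invented example sequence
print(inverse_complement(dpe))                        # ABRWCY
print(find_motif(sequence, dpe))                      # [('GGACCT', 4)]
print(find_motif(sequence, inverse_complement(dpe)))  # [('AGGACC', 3)]
```

The invented sequence reproduces a pattern seen in the listings, where an inverse-complement match (AGGACC) sits one position before a direct match (GGACCT). The opposite strand can be scanned the same way by passing its sequence to `find_motif`.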

DPE (Juven-Gershon) (4560-2846) UTRs

  1. Negative strand, negative direction: AGGACC at 4546, AGTCCT at 4437, AGATGT at 4213, AGTTCT at 4179, GGTCCT at 4171, AGTCCT at 4139, GGACAT at 4122, AGATGT at 4063, AGTTCT at 4028, GGTTCT at 4020, GGTTGT at 3980, GGACAT at 3971, ACAACC at 3942, GGACCT at 3907, AGGACC at 3906, ACGACC at 3864, AGACCT at 3836, AGAACC at 3793, GGACCT at 3745, GGTCGT at 3732, AGGTCC at 3585, ATGACT at 3542, ATAACC at 3529, ACGTCT at 3431, GGTTCT at 3274, GGTCCT at 3250, AGTCCT at 3218, GGTTGT at 3138, AGTCCT at 3111, GGTCGT at 3071, GGACAT at 3062, AGATGT at 2989, ATATCT at 2903, GGTTAT at 2849.
  2. Positive strand, negative direction: AGACAT at 4508, AGATCC at 4476, AGAACC at 4451, AGTTCT at 4418, AGACGT at 4236, ATGTCT at 3833, AGGACT at 3640, AGATGT at 3621, AGATAT at 3466, AGACAT at 3434, ATGTCT at 2986, AGATAT at 2982, AGACAT at 2949, AGACAT at 2881.

DPE (Juven-Gershon) negative direction (2846-2811) core promoters

  1. Positive strand, negative direction: ACAACC at 2844.

DPE (Juven-Gershon) positive direction (4445-4265) core promoters

  1. Negative strand, positive direction: ATGTCC at 4367.
  2. Positive strand, positive direction: AGGACC at 4409, ACGTCT at 4317.

DPE (Juven-Gershon) negative direction (2811-2596) proximal promoters

  1. Negative strand, negative direction: GGACAT at 2673, GGTTGT at 2611.
  2. Positive strand, negative direction: ATGACT at 2786.

DPE (Juven-Gershon) positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: GGTTCT at 4074.
  2. Positive strand, positive direction: AGGACT at 4186, ACGACC at 4177, ATAACT at 4161, AGAACT at 4131, AGATCC at 4077, AGATCT at 4065.

DPE (Juven-Gershon) negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: AGTCCT at 2588, GGTTGT at 2548, GGACAT at 2539, AGTTAT at 2497, GGACAT at 2338, ACGACC at 2326, GGACCT at 2269, AGTCCT at 2251, GGTCAT at 2212, GGTTGT at 2149, AGTCCT at 2135, GGACAT at 1912, ACAACT at 1853, AGGACC at 1841, GGTCGT at 1786, AGACAT at 1777, AGAACC at 1649, GGTCGT at 1612, AGATAT at 1596, AGACAT at 1570, ATGTCT at 1567, AGATAT at 1526, AGTCCT at 1276, GGACAT at 1259, AGATGT at 1225, GGTTGT at 1204, GGTCGT at 1141, GGACAT at 1132, AGTCCT at 985, ACATCT at 970, GGACAT at 968, GGTTCT at 875, GGTCCT at 851, GGACAT at 802, AGTCCT at 715, GGTCGT at 677, GGACAT at 668, AGGACC at 596, AGTCCT at 579, GGTTCT at 557, GGTCGT at 541, AGATGT at 482, AGTCCT at 442, GGTTCT at 420, GGTCGT at 404, GGACAT at 395, ACATCT at 284.
  2. Positive strand, negative direction: ATGACC at 2189, ACGTCT at 1774, ACATCC at 1572, ATATCC at 1529, AGATCC at 973, GGATGT at 785, AGATGT at 245, AGACAT at 171, GGATAT at 109, GGATAT at 75.

DPE (Juven-Gershon) positive direction (4050-1) distal promoters

Negative strand

  1. Negative strand, positive direction: AGAACT at 4048, AGTCCT at 3864, ATGACC at 3784, AGGTCT at 3771, ATGTCC at 3577, GGATGT at 3575, AGACCT at 3551, AGTTAT at 3382, ACGTCT at 3256, GGTTGT at 3051, ATGACT at 3029, GGTTAT at 3025, AGGTCT at 3019, AGTCCT at 2999, AGTTCT at 2955, GGTTCT at 2923, AGACCT at 2862, AGAACC at 2776, GGATGT at 2715, GGATAT at 2660, AGGTCT at 2258, AGAACC at 2225, AGTTCT at 1988, AGAACT at 1951, GGACAT at 1870, AGAACC at 1811, AGATCC at 965, AGATCC at 865, AGTCCT at 758, GGTCCT at 219, AGGTCC at 218, GGACCT at 38.

Positive strand

  1. Positive strand, positive direction: AGGTCC at 4032, AGTCGT at 4024, AGGTCT at 3891, AGTCCT at 3869, ACGTCT at 3831, AGGTCT at 3806, GGTCGT at 3721, AGGTCC at 3687, GGTTGT at 3634, ACGTCC at 3466, AGTTAT at 3425, GGACCT at 3363, AGGACC at 3296, AGACGT at 3279, AGACGT at 3268, AGTCGT at 3156, ATGACC at 3117, AGGTCC at 3111, AGACGT at 3061, AGTCGT at 3042, ACGTCT at 2859, AGACGT at 2857, ACAACC at 2816, ACGTCC at 2745, ACGTCT at 2721, ACGTCC at 2683, AGTCCT at 2621, ATATCC at 2550, AGGACC at 2501, ACATCC at 2255, AGGACT at 2211, AGTCGT at 2199, ACAACC at 2185, AGTCGT at 2103, ACGTCT at 1937, ACATCC at 1875, ACGTCC at 1788, ACGACC at 1779, ACGACC at 1736, GGACGT at 1470, GGTCGT at 1458, GGACGT at 1370, GGTCGT at 1358, ATGACT at 1286, GGACGT at 1119, GGTCCT at 708, ACGTCC at 658, GGTCGT at 618, GGACCT at 599, ACGTCT at 438, GGACGT at 436, GGTCCT at 425, AGACCT at 271, AGACGT at 224, ACGTCC at 194, GGACGT at 192, GGTTCT at 178, GGACCT at 41, AGGTCC at 33, AGGTCT at 15, AGGTCC at 8.
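The grouping of hits into UTRs, core, proximal, and distal promoters used in the headings above can be sketched as a lookup on the position ranges. This is a minimal sketch, not the original Basic programs: the names `REGIONS` and `classify` are hypothetical, and treating the upper bound of each range as inclusive and the lower bound as exclusive is an assumption, since no listed hit falls exactly on a boundary.

```python
# Region boundaries copied from the section headings as (name, high, low).
# Assumption: a hit at `position` belongs to a region when low < position <= high.
REGIONS = {
    "negative": [
        ("UTR", 4560, 2846),
        ("core promoter", 2846, 2811),
        ("proximal promoter", 2811, 2596),
        ("distal promoter", 2596, 0),
    ],
    "positive": [
        ("core promoter", 4445, 4265),
        ("proximal promoter", 4265, 4050),
        ("distal promoter", 4050, 0),
    ],
}


def classify(position, direction):
    """Return the region name for a hit, or None if it is outside every range."""
    for name, high, low in REGIONS[direction]:
        if low < position <= high:
            return name
    return None


def group_hits(hits, direction):
    """Group (sequence, position) hits by region for one direction."""
    grouped = {}
    for sequence, position in hits:
        grouped.setdefault(classify(position, direction), []).append((sequence, position))
    return grouped


if __name__ == "__main__":
    # Three positive strand, positive direction hits taken from the lists above.
    sample = [("AGGACC", 4409), ("AGGACT", 4186), ("AGGTCC", 4032)]
    for region, found in group_hits(sample, "positive").items():
        print(region, found)
```

With these ranges, ACAACC at 2844 in the negative direction falls in the core promoter and ATGACT at 2786 falls in the proximal promoter, in agreement with the lists above.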

DPE (Juven-Gershon) random dataset samplings

  1. DPEJGr0: 23, GGACCT at 4315, AGTCAT at 4170, AGACCT at 4049, GGACAT at 3848, AGTCGT at 3812, GGTCCT at 3613, GGTCGT at 3560, GGTTGT at 3018, GGTTGT at 2933, AGACGT at 2537, AGTTAT at 2140, GGACCT at 2104, GGTTCT at 2049, GGTCGT at 1753, AGTTGT at 1668, AGATAT at 1538, AGATAT at 1466, GGATCT at 925, GGATGT at 763, GGATCT at 737, GGTTGT at 723, GGTCAT at 700, AGTTAT at 558.
  2. DPEJGr1: 25, AGACGT at 4489, GGTCGT at 4203, AGACAT at 4197, GGTTAT at 4123, AGATCT at 3861, GGTCCT at 3643, GGACAT at 3142, GGTTCT at 2873, GGTTCT at 2845, GGACAT at 2618, GGTCAT at 2238, GGTCGT at 2012, AGACCT at 1950, AGTCCT at 1718, AGTCGT at 1653, GGACGT at 1511, GGTTCT at 1500, GGTTAT at 1472, GGATGT at 1449, GGACAT at 1342, GGACGT at 1184, AGACCT at 759, AGATGT at 401, AGATGT at 346, GGACAT at 89.
  3. DPEJGr2: 20, AGACAT at 4045, GGACAT at 3250, AGTCGT at 2873, GGTTCT at 2738, GGACCT at 2405, AGATAT at 2114, AGACGT at 1861, GGACAT at 1777, AGATGT at 1648, GGACGT at 1603, AGACCT at 1507, GGTCCT at 1401, AGTTGT at 1036, AGTTCT at 879, AGACCT at 683, GGACAT at 495, GGACAT at 218, AGTCAT at 200, GGTTCT at 179, GGATCT at 158.
  4. DPEJGr3: 32, GGTTGT at 4513, GGTTAT at 4414, AGTTCT at 4219, AGTTCT at 3988, GGACAT at 3885, AGTCCT at 3028, GGTCAT at 2855, AGTCGT at 2687, GGACCT at 2652, GGATCT at 2567, AGTCCT at 2550, GGATCT at 2435, GGTCAT at 2304, AGTTGT at 2264, GGATGT at 1791, AGATGT at 1715, GGTTCT at 1683, GGTCAT at 1632, AGTTCT at 1539, GGTTGT at 1193, GGACAT at 1167, AGTTCT at 1019, AGATAT at 941, GGTCAT at 748, AGACAT at 661, GGACGT at 629, GGTTGT at 529, AGTCAT at 478, GGTCCT at 348, GGTTCT at 329, GGTTAT at 258, GGACCT at 76.
  5. DPEJGr4: 24, GGTTCT at 4460, GGATGT at 4071, AGTCCT at 3971, GGATCT at 3923, GGTTGT at 3530, AGACAT at 3368, AGTCCT at 3317, AGATCT at 3133, GGTCCT at 2667, GGTTGT at 2435, GGACGT at 2313, GGTTAT at 2293, AGTCAT at 2249, GGTTGT at 2161, AGTTGT at 2007, AGATCT at 1670, GGACGT at 1605, GGATCT at 1495, AGACGT at 1452, GGTCCT at 799, GGATGT at 728, GGTCGT at 358, GGACGT at 291, AGACCT at 260.
  6. DPEJGr5: 17, GGACCT at 4345, AGTCAT at 4250, GGTCAT at 4052, GGTTCT at 3378, GGTTAT at 3233, GGTCGT at 3121, AGTTCT at 3089, AGTCGT at 2940, AGATCT at 2830, GGTTCT at 2557, GGATCT at 2429, AGTCGT at 2195, AGACGT at 1586, GGACCT at 1383, AGACAT at 1257, AGACGT at 711, GGTTCT at 210.
  7. DPEJGr6: 22, GGTCCT at 4465, GGTTCT at 4117, AGTCCT at 4012, AGTCGT at 3837, GGACAT at 3657, AGTCGT at 3489, GGACCT at 3416, AGTTCT at 3091, AGATCT at 2937, AGTTAT at 2806, GGATGT at 2708, AGTTGT at 2639, AGATCT at 2391, GGACCT at 2016, GGTTCT at 1515, AGATAT at 1242, AGTCAT at 1110, AGACCT at 855, GGACAT at 792, GGACGT at 609, GGTTGT at 586, GGTCGT at 157.
  8. DPEJGr7: 38, GGATAT at 4531, GGACAT at 4378, AGTCGT at 4300, GGATGT at 4170, GGATGT at 4100, AGTTAT at 4081, AGACGT at 4032, GGTTGT at 3908, AGTTAT at 3773, AGTCCT at 3577, GGACGT at 3402, GGATCT at 3368, GGACAT at 3168, AGATAT at 3054, AGTTGT at 2938, GGACGT at 2805, GGTCGT at 2781, AGTTCT at 2572, GGATCT at 2461, AGTCGT at 2263, GGTTGT at 2138, AGTCCT at 2097, GGTCCT at 2052, AGTCCT at 2013, GGTTGT at 1883, AGTTAT at 1806, GGACAT at 1648, GGTTGT at 1310, AGTCAT at 1015, GGTCAT at 973, GGACAT at 798, GGTTCT at 735, GGACGT at 696, GGTCAT at 635, GGATGT at 315, GGTCGT at 264, AGTCAT at 203, GGATGT at 117.
  9. DPEJGr8: 28, GGTCCT at 4350, AGACCT at 4264, AGACGT at 4065, GGACAT at 4027, AGATAT at 3980, GGTCGT at 3844, GGACGT at 3601, GGATAT at 3555, AGACGT at 3542, AGACAT at 3218, GGACCT at 3008, AGATAT at 2999, GGTCCT at 2877, AGTCCT at 2804, AGATGT at 2727, GGTTGT at 2380, AGTCCT at 2208, GGATAT at 1701, GGTCCT at 1457, GGTTAT at 1335, GGTCCT at 1196, AGACAT at 1155, AGTCCT at 1013, GGTTAT at 552, AGATGT at 522, GGTCGT at 247, GGTTAT at 92, AGATCT at 27.
  10. DPEJGr9: 18, GGTTCT at 4465, AGATGT at 4418, GGATGT at 4388, AGTTCT at 4286, GGATAT at 4226, GGTTAT at 4176, AGATAT at 4020, AGACAT at 3639, GGACGT at 3509, AGATAT at 2994, AGACAT at 2918, GGTTAT at 2893, GGTCGT at 2613, GGACCT at 2334, GGACAT at 1846, AGACAT at 1631, GGATGT at 1468, GGTCCT at 1459.
  11. DPEJGr0ci: 23, ACATCC at 4416, AGGACC at 4314, AGAACC at 4308, AGAACC at 3587, AGAACT at 3494, ATGTCC at 3377, AGAACC at 3322, AGGACC at 2801, AGATCC at 2762, AGGTCC at 2747, ATGACC at 2730, ACGACC at 2628, ACGTCC at 2594, ACGTCT at 2539, AGATCC at 2250, ATGTCT at 2115, ATAACT at 1792, AGGTCT at 1047, AGGACC at 942, ATGACT at 704, AGGACC at 678, ATAACT at 431, AGAACC at 55.
  12. DPEJGr1ci: 31, AGGACC at 4466, ACAACC at 4321, ACATCT at 4097, AGATCT at 3861, ATGACC at 3635, ATAACC at 3475, ACGTCC at 3430, ACGTCC at 3300, AGGACT at 3277, ACGACT at 3246, AGGACC at 3116, AGATCC at 2835, ACAACT at 2751, AGGTCC at 2560, AGGACC at 2461, ATAACT at 2157, AGGACC at 2122, ACGACT at 1828, AGATCC at 1799, ACGACC at 1762, ACGACC at 1736, ATGACT at 1635, ACATCC at 1203, ACGTCC at 1186, ACATCT at 1130, AGAACT at 1035, AGGACT at 647, ACGACT at 618, AGAACT at 471, ATATCT at 59, ATGTCT at 24.
  13. DPEJGr2ci: 17, ACAACC at 3839, ATAACC at 3824, ATGACT at 3809, ACATCC at 3179, ACATCC at 2810, AGGTCC at 2613, ATATCT at 2415, AGGACT at 2333, ACGACC at 2317, ACAACC at 2193, AGGACT at 1676, ACGTCT at 1661, ACAACT at 1309, ATAACC at 1140, ATGTCT at 920, ATGACT at 373, ACATCC at 220.
  14. DPEJGr3ci: 22, AGAACT at 4531, ATGTCT at 4388, ATAACC at 3903, AGATCC at 3825, ATAACC at 3674, ATAACC at 3447, ATAACC at 3339, AGGACC at 3318, ACATCT at 3285, ATGTCC at 3188, ACGTCT at 2585, AGAACC at 2243, AGAACC at 1301, ATAACT at 1290, AGGACC at 1274, ATATCT at 943, AGAACC at 691, AGGTCT at 679, AGGACC at 342, AGGTCT at 246, AGAACT at 221, AGAACC at 188.
  15. DPEJGr4ci: 25, ATATCT at 4475, ACGTCC at 4367, ACGTCT at 4334, ACATCT at 4298, ACGACC at 4045, ACGACT at 3938, ACAACT at 3712, ACGTCT at 3509, AGATCT at 3133, AGATCC at 3048, AGAACT at 2998, AGGACT at 2956, ACAACC at 2856, ATAACT at 2378, ACGTCT at 2315, AGAACT at 1747, AGAACT at 1740, AGATCT at 1670, AGGTCT at 980, AGGTCC at 676, AGAACT at 579, ATAACT at 466, ACAACC at 421, AGAACT at 267, AGGACT at 9.
  16. DPEJGr5ci: 14, ACGTCC at 4403, AGATCC at 4031, AGAACC at 3986, AGAACC at 3394, AGGACT at 3365, AGATCT at 2830, ATAACC at 2093, AGAACC at 1948, ATAACT at 1873, AGAACC at 1430, ACGTCC at 905, AGAACC at 613, ATGACC at 505, AGAACT at 342.
  17. DPEJGr6ci: 25, AGAACC at 4404, AGGACT at 3989, ACGACC at 3784, ATGACC at 3192, ACGACC at 2954, AGATCT at 2937, ACGTCT at 2927, AGATCT at 2391, ACAACC at 2358, ATATCC at 2352, ATAACT at 2241, ATATCC at 2054, ATATCT at 2038, ACAACC at 1917, ACAACT at 1808, ATGTCT at 1291, AGGACC at 1225, ATAACT at 1133, ACATCT at 810, AGGACC at 780, ACGTCC at 611, AGGACC at 387, AGGACC at 374, AGGTCC at 306, ATGTCC at 46.
  18. DPEJGr7ci: 24, ACGTCC at 4259, ATGTCT at 4004, ACAACC at 3418, ACATCC at 3170, ACGTCC at 2860, ATAACT at 2828, AGGTCC at 2812, AGGACC at 2795, ACGTCT at 2705, AGGACC at 2688, AGGTCC at 2676, ACAACC at 2550, ACAACC at 2408, ACGACT at 2363, ATAACT at 2329, AGGACT at 2070, ACGTCC at 1576, ACGACC at 1522, ATAACC at 1447, ATGTCC at 1145, ACGTCC at 946, ACGTCT at 698, ATGACT at 393, ACAACC at 63.
  19. DPEJGr8ci: 23, AGGTCC at 4349, ACGACT at 4184, ACAACC at 4082, ACATCC at 4029, ACGACC at 3935, AGGACC at 3647, ACAACC at 3233, ACATCT at 3220, AGAACC at 3043, ACAACC at 2604, ATAACT at 2144, ACAACC at 2105, AGGTCC at 2075, ACAACT at 1906, AGGACT at 1765, AGGTCT at 1517, AGGTCC at 1417, ATGACT at 1133, AGAACC at 1069, AGGTCC at 1023, AGAACT at 885, AGGTCT at 399, AGATCT at 27.
  20. DPEJGr9ci: 22, ATATCT at 4329, ATATCC at 4228, ATAACC at 4143, ACATCC at 3750, ATGACT at 3704, ATGACC at 3680, AGGACC at 3378, ATGTCT at 3298, ATATCC at 2996, AGGTCT at 2648, AGATCC at 2431, ACGACC at 2148, ATATCC at 2097, ATGACT at 2080, ACATCC at 2024, AGGACT at 1596, ACATCC at 1523, AGGACT at 1385, AGGTCC at 575, AGGTCT at 450, ACGACC at 288, ACAACT at 120.
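The source does not state how the random datasets were produced. As an illustration only, the sketch below assumes a uniformly random sequence of A, C, G, and T with a length of 4560 nucleotides, and counts matches to RGWYVT and to its complement inverse (the "ci" entries). The function names, the seed, and the convention of reporting the 1-based position of the last nucleotide of each match are assumptions.

```python
import random
import re

# RGWYVT and its complement inverse written as regular expressions.
# The lookahead (?=...) lets overlapping matches be reported as well.
FORWARD = re.compile(r"(?=([AG]G[AT][CT][ACG]T))")
COMPLEMENT_INVERSE = re.compile(r"(?=(A[CGT][AG][AT]C[CT]))")


def random_sequence(length, seed):
    """Return a uniformly random nucleotide string (assumed sampling scheme)."""
    rng = random.Random(seed)
    return "".join(rng.choice("ACGT") for _ in range(length))


def find_hits(pattern, sequence):
    """Return (match, position) pairs; position is the 1-based index of the
    last nucleotide of the match (assumed reporting convention)."""
    return [
        (m.group(1), m.start() + len(m.group(1)))
        for m in pattern.finditer(sequence)
    ]


if __name__ == "__main__":
    for index in range(3):
        sequence = random_sequence(4560, seed=index)
        forward = find_hits(FORWARD, sequence)
        inverse = find_hits(COMPLEMENT_INVERSE, sequence)
        print(f"sample {index}: {len(forward)} forward, {len(inverse)} ci")
```

Under the uniform assumption, 24 of the 4096 possible hexamers match RGWYVT, so a sequence of 4560 nucleotides is expected to contain about 4555 × 24/4096 ≈ 26.7 matches, which is of the same order as the counts of 17 to 38 listed for DPEJGr0 to DPEJGr9.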

DPEJGr arbitrary (evens) (4560-2846) UTRs

  1. DPEJGr0: GGACCT at 4315, AGTCAT at 4170, AGACCT at 4049, GGACAT at 3848, AGTCGT at 3812, GGTCCT at 3613, GGTCGT at 3560, GGTTGT at 3018, GGTTGT at 2933.
  2. DPEJGr2: AGACAT at 4045, GGACAT at 3250, AGTCGT at 2873.
  3. DPEJGr4: GGTTCT at 4460, GGATGT at 4071, AGTCCT at 3971, GGATCT at 3923, GGTTGT at 3530, AGACAT at 3368, AGTCCT at 3317, AGATCT at 3133.
  4. DPEJGr6: GGTCCT at 4465, GGTTCT at 4117, AGTCCT at 4012, AGTCGT at 3837, GGACAT at 3657, AGTCGT at 3489, GGACCT at 3416, AGTTCT at 3091, AGATCT at 2937.
  5. DPEJGr8: GGTCCT at 4350, AGACCT at 4264, AGACGT at 4065, GGACAT at 4027, AGATAT at 3980, GGTCGT at 3844, GGACGT at 3601, GGATAT at 3555, AGACGT at 3542, AGACAT at 3218, GGACCT at 3008, AGATAT at 2999, GGTCCT at 2877.
  6. DPEJGr0ci: ACATCC at 4416, AGGACC at 4314, AGAACC at 4308, AGAACC at 3587, AGAACT at 3494, ATGTCC at 3377, AGAACC at 3322.
  7. DPEJGr2ci: ACAACC at 3839, ATAACC at 3824, ATGACT at 3809, ACATCC at 3179.
  8. DPEJGr4ci: ATATCT at 4475, ACGTCC at 4367, ACGTCT at 4334, ACATCT at 4298, ACGACC at 4045, ACGACT at 3938, ACAACT at 3712, ACGTCT at 3509, AGATCT at 3133, AGATCC at 3048, AGAACT at 2998, AGGACT at 2956, ACAACC at 2856.
  9. DPEJGr6ci: AGAACC at 4404, AGGACT at 3989, ACGACC at 3784, ATGACC at 3192, ACGACC at 2954, AGATCT at 2937, ACGTCT at 2927.
  10. DPEJGr8ci: AGGTCC at 4349, ACGACT at 4184, ACAACC at 4082, ACATCC at 4029, ACGACC at 3935, AGGACC at 3647, ACAACC at 3233, ACATCT at 3220, AGAACC at 3043.

DPEJGr alternate (odds) (4560-2846) UTRs

  1. DPEJGr1: AGACGT at 4489, GGTCGT at 4203, AGACAT at 4197, GGTTAT at 4123, AGATCT at 3861, GGTCCT at 3643, GGACAT at 3142, GGTTCT at 2873.
  2. DPEJGr3: GGTTGT at 4513, GGTTAT at 4414, AGTTCT at 4219, AGTTCT at 3988, GGACAT at 3885, AGTCCT at 3028, GGTCAT at 2855.
  3. DPEJGr5: GGACCT at 4345, AGTCAT at 4250, GGTCAT at 4052, GGTTCT at 3378, GGTTAT at 3233, GGTCGT at 3121, AGTTCT at 3089, AGTCGT at 2940.
  4. DPEJGr7: GGATAT at 4531, GGACAT at 4378, AGTCGT at 4300, GGATGT at 4170, GGATGT at 4100, AGTTAT at 4081, AGACGT at 4032, GGTTGT at 3908, AGTTAT at 3773, AGTCCT at 3577, GGACGT at 3402, GGATCT at 3368, GGACAT at 3168, AGATAT at 3054, AGTTGT at 2938.
  5. DPEJGr9: GGTTCT at 4465, AGATGT at 4418, GGATGT at 4388, AGTTCT at 4286, GGATAT at 4226, GGTTAT at 4176, AGATAT at 4020, AGACAT at 3639, GGACGT at 3509, AGATAT at 2994, AGACAT at 2918, GGTTAT at 2893.
  6. DPEJGr1ci: AGGACC at 4466, ACAACC at 4321, ACATCT at 4097, AGATCT at 3861, ATGACC at 3635, ATAACC at 3475, ACGTCC at 3430, ACGTCC at 3300, AGGACT at 3277, ACGACT at 3246, AGGACC at 3116.
  7. DPEJGr2ci: 17, ACAACC at 3839, ATAACC at 3824, ATGACT at 3809, ACATCC at 3179, ACATCC at 2810, AGGTCC at 2613, ATATCT at 2415, AGGACT at 2333, ACGACC at 2317, ACAACC at 2193, AGGACT at 1676, ACGTCT at 1661, ACAACT at 1309, ATAACC at 1140, ATGTCT at 920, ATGACT at 373, ACATCC at 220.
  8. DPEJGr3ci: AGAACT at 4531, ATGTCT at 4388, ATAACC at 3903, AGATCC at 3825, ATAACC at 3674, ATAACC at 3447, ATAACC at 3339, AGGACC at 3318, ACATCT at 3285, ATGTCC at 3188.
  9. DPEJGr5ci: ACGTCC at 4403, AGATCC at 4031, AGAACC at 3986, AGAACC at 3394, AGGACT at 3365.
  10. DPEJGr7ci: ACGTCC at 4259, ATGTCT at 4004, ACAACC at 3418, ACATCC at 3170, ACGTCC at 2860.
  11. DPEJGr9ci: ATATCT at 4329, ATATCC at 4228, ATAACC at 4143, ACATCC at 3750, ATGACT at 3704, ATGACC at 3680, AGGACC at 3378, ATGTCT at 3298, ATATCC at 2996.

DPEJGr alternate negative direction (odds) (2846-2811) core promoters

  1. DPEJGr1: GGTTCT at 2845.
  2. DPEJGr5: AGATCT at 2830.
  3. DPEJGr1ci: AGATCC at 2835.
  4. DPEJGr5ci: AGATCT at 2830.
  5. DPEJGr7ci: ATAACT at 2828, AGGTCC at 2812.

DPEJGr arbitrary positive direction (odds) (4445-4265) core promoters

  1. DPEJGr3: GGTTAT at 4414.
  2. DPEJGr5: GGACCT at 4345.
  3. DPEJGr7: GGACAT at 4378, AGTCGT at 4300.
  4. DPEJGr9: AGATGT at 4418, GGATGT at 4388, AGTTCT at 4286.
  5. DPEJGr1ci: ACAACC at 4321.
  6. DPEJGr2ci: 17, ACAACC at 3839, ATAACC at 3824, ATGACT at 3809, ACATCC at 3179, ACATCC at 2810, AGGTCC at 2613, ATATCT at 2415, AGGACT at 2333, ACGACC at 2317, ACAACC at 2193, AGGACT at 1676, ACGTCT at 1661, ACAACT at 1309, ATAACC at 1140, ATGTCT at 920, ATGACT at 373, ACATCC at 220.
  7. DPEJGr3ci: ATGTCT at 4388.
  8. DPEJGr5ci: ACGTCC at 4403.
  9. DPEJGr9ci: ATATCT at 4329.

DPEJGr alternate positive direction (evens) (4445-4265) core promoters

  1. DPEJGr0: GGACCT at 4315.
  2. DPEJGr8: GGTCCT at 4350.
  3. DPEJGr0ci: ACATCC at 4416, AGGACC at 4314, AGAACC at 4308.
  4. DPEJGr4ci: ACGTCC at 4367, ACGTCT at 4334, ACATCT at 4298.
  5. DPEJGr6ci: AGAACC at 4404.
  6. DPEJGr8ci: AGGTCC at 4349.

DPEJGr arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. DPEJGr2: GGTTCT at 2738.
  2. DPEJGr4: GGTCCT at 2667.
  3. DPEJGr6: AGTTAT at 2806, GGATGT at 2708, AGTTGT at 2639.
  4. DPEJGr8: AGTCCT at 2804, AGATGT at 2727.
  5. DPEJGr0ci: AGGACC at 2801, AGATCC at 2762, AGGTCC at 2747, ATGACC at 2730, ACGACC at 2628.
  6. DPEJGr2ci: ACATCC at 2810, AGGTCC at 2613.
  7. DPEJGr8ci: ACAACC at 2604.

DPEJGr alternate negative direction (odds) (2811-2596) proximal promoters

  1. DPEJGr1: GGACAT at 2618.
  2. DPEJGr3: AGTCGT at 2687, GGACCT at 2652.
  3. DPEJGr7: GGACGT at 2805, GGTCGT at 2781.
  4. DPEJGr9: GGTCGT at 2613.
  5. DPEJGr1ci: ACAACT at 2751.
  6. DPEJGr7ci: AGGACC at 2795, ACGTCT at 2705, AGGACC at 2688, AGGTCC at 2676.
  7. DPEJGr9ci: AGGTCT at 2648.

DPEJGr arbitrary positive direction (odds) (4265-4050) proximal promoters

  1. DPEJGr1: GGTCGT at 4203, AGACAT at 4197, GGTTAT at 4123.
  2. DPEJGr3: AGTTCT at 4219.
  3. DPEJGr5: AGTCAT at 4250, GGTCAT at 4052.
  4. DPEJGr7: GGATGT at 4170, GGATGT at 4100, AGTTAT at 4081.
  5. DPEJGr9: GGATAT at 4226, GGTTAT at 4176.
  6. DPEJGr1ci: ACATCT at 4097.
  7. DPEJGr7ci: ACGTCC at 4259.
  8. DPEJGr9ci: ATATCC at 4228, ATAACC at 4143.

DPEJGr alternate positive direction (evens) (4265-4050) proximal promoters

  1. DPEJGr0: AGTCAT at 4170.
  2. DPEJGr4: GGATGT at 4071.
  3. DPEJGr6: GGTTCT at 4117.
  4. DPEJGr8: AGACCT at 4264, AGACGT at 4065.
  5. DPEJGr8ci: ACGACT at 4184, ACAACC at 4082.

DPEJGr arbitrary negative direction (evens) (2596-1) distal promoters

  1. DPEJGr0: AGACGT at 2537, AGTTAT at 2140, GGACCT at 2104, GGTTCT at 2049, GGTCGT at 1753, AGTTGT at 1668, AGATAT at 1538, AGATAT at 1466, GGATCT at 925, GGATGT at 763, GGATCT at 737, GGTTGT at 723, GGTCAT at 700, AGTTAT at 558.
  2. DPEJGr2: GGACCT at 2405, AGATAT at 2114, AGACGT at 1861, GGACAT at 1777, AGATGT at 1648, GGACGT at 1603, AGACCT at 1507, GGTCCT at 1401, AGTTGT at 1036, AGTTCT at 879, AGACCT at 683, GGACAT at 495, GGACAT at 218, AGTCAT at 200, GGTTCT at 179, GGATCT at 158.
  3. DPEJGr4: GGTTGT at 2435, GGACGT at 2313, GGTTAT at 2293, AGTCAT at 2249, GGTTGT at 2161, AGTTGT at 2007, AGATCT at 1670, GGACGT at 1605, GGATCT at 1495, AGACGT at 1452, GGTCCT at 799, GGATGT at 728, GGTCGT at 358, GGACGT at 291, AGACCT at 260.
  4. DPEJGr6: AGATCT at 2391, GGACCT at 2016, GGTTCT at 1515, AGATAT at 1242, AGTCAT at 1110, AGACCT at 855, GGACAT at 792, GGACGT at 609, GGTTGT at 586, GGTCGT at 157.
  5. DPEJGr8: GGTTGT at 2380, AGTCCT at 2208, GGATAT at 1701, GGTCCT at 1457, GGTTAT at 1335, GGTCCT at 1196, AGACAT at 1155, AGTCCT at 1013, GGTTAT at 552, AGATGT at 522, GGTCGT at 247, GGTTAT at 92, AGATCT at 27.
  6. DPEJGr0ci: ACGTCC at 2594, ACGTCT at 2539, AGATCC at 2250, ATGTCT at 2115, ATAACT at 1792, AGGTCT at 1047, AGGACC at 942, ATGACT at 704, AGGACC at 678, ATAACT at 431, AGAACC at 55.
  7. DPEJGr2ci: ATATCT at 2415, AGGACT at 2333, ACGACC at 2317, ACAACC at 2193, AGGACT at 1676, ACGTCT at 1661, ACAACT at 1309, ATAACC at 1140, ATGTCT at 920, ATGACT at 373, ACATCC at 220.
  8. DPEJGr4ci: ATAACT at 2378, ACGTCT at 2315, AGAACT at 1747, AGAACT at 1740, AGATCT at 1670, AGGTCT at 980, AGGTCC at 676, AGAACT at 579, ATAACT at 466, ACAACC at 421, AGAACT at 267, AGGACT at 9.
  9. DPEJGr6ci: AGATCT at 2391, ACAACC at 2358, ATATCC at 2352, ATAACT at 2241, ATATCC at 2054, ATATCT at 2038, ACAACC at 1917, ACAACT at 1808, ATGTCT at 1291, AGGACC at 1225, ATAACT at 1133, ACATCT at 810, AGGACC at 780, ACGTCC at 611, AGGACC at 387, AGGACC at 374, AGGTCC at 306, ATGTCC at 46.
  10. DPEJGr8ci: ATAACT at 2144, ACAACC at 2105, AGGTCC at 2075, ACAACT at 1906, AGGACT at 1765, AGGTCT at 1517, AGGTCC at 1417, ATGACT at 1133, AGAACC at 1069, AGGTCC at 1023, AGAACT at 885, AGGTCT at 399, AGATCT at 27.

DPEJGr alternate negative direction (odds) (2596-1) distal promoters

  1. DPEJGr1: GGTCAT at 2238, GGTCGT at 2012, AGACCT at 1950, AGTCCT at 1718, AGTCGT at 1653, GGACGT at 1511, GGTTCT at 1500, GGTTAT at 1472, GGATGT at 1449, GGACAT at 1342, GGACGT at 1184, AGACCT at 759, AGATGT at 401, AGATGT at 346, GGACAT at 89.
  2. DPEJGr3: GGATCT at 2567, AGTCCT at 2550, GGATCT at 2435, GGTCAT at 2304, AGTTGT at 2264, GGATGT at 1791, AGATGT at 1715, GGTTCT at 1683, GGTCAT at 1632, AGTTCT at 1539, GGTTGT at 1193, GGACAT at 1167, AGTTCT at 1019, AGATAT at 941, GGTCAT at 748, AGACAT at 661, GGACGT at 629, GGTTGT at 529, AGTCAT at 478, GGTCCT at 348, GGTTCT at 329, GGTTAT at 258, GGACCT at 76.
  3. DPEJGr5: AGTCGT at 2940, AGATCT at 2830, GGTTCT at 2557, GGATCT at 2429, AGTCGT at 2195, AGACGT at 1586, GGACCT at 1383, AGACAT at 1257, AGACGT at 711, GGTTCT at 210.
  4. DPEJGr7: AGTTCT at 2572, GGATCT at 2461, AGTCGT at 2263, GGTTGT at 2138, AGTCCT at 2097, GGTCCT at 2052, AGTCCT at 2013, GGTTGT at 1883, AGTTAT at 1806, GGACAT at 1648, GGTTGT at 1310, AGTCAT at 1015, GGTCAT at 973, GGACAT at 798, GGTTCT at 735, GGACGT at 696, GGTCAT at 635, GGATGT at 315, GGTCGT at 264, AGTCAT at 203, GGATGT at 117.
  5. DPEJGr9: GGACCT at 2334, GGACAT at 1846, AGACAT at 1631, GGATGT at 1468, GGTCCT at 1459.
  6. DPEJGr1ci: AGGTCC at 2560, AGGACC at 2461, ATAACT at 2157, AGGACC at 2122, ACGACT at 1828, AGATCC at 1799, ACGACC at 1762, ACGACC at 1736, ATGACT at 1635, ACATCC at 1203, ACGTCC at 1186, ACATCT at 1130, AGAACT at 1035, AGGACT at 647, ACGACT at 618, AGAACT at 471, ATATCT at 59, ATGTCT at 24.
  7. DPEJGr3ci: ACGTCT at 2585, AGAACC at 2243, AGAACC at 1301, ATAACT at 1290, AGGACC at 1274, ATATCT at 943, AGAACC at 691, AGGTCT at 679, AGGACC at 342, AGGTCT at 246, AGAACT at 221, AGAACC at 188.
  8. DPEJGr5ci: ATAACC at 2093, AGAACC at 1948, ATAACT at 1873, AGAACC at 1430, ACGTCC at 905, AGAACC at 613, ATGACC at 505, AGAACT at 342.
  9. DPEJGr7ci: ACAACC at 2550, ACAACC at 2408, ACGACT at 2363, ATAACT at 2329, AGGACT at 2070, ACGTCC at 1576, ACGACC at 1522, ATAACC at 1447, ATGTCC at 1145, ACGTCC at 946, ACGTCT at 698, ATGACT at 393, ACAACC at 63.
  10. DPEJGr9ci: AGATCC at 2431, ACGACC at 2148, ATATCC at 2097, ATGACT at 2080, ACATCC at 2024, AGGACT at 1596, ACATCC at 1523, AGGACT at 1385, AGGTCC at 575, AGGTCT at 450, ACGACC at 288, ACAACT at 120.

DPEJGr arbitrary positive direction (odds) (4050-1) distal promoters

  1. DPEJGr1: AGATCT at 3861, GGTCCT at 3643, GGACAT at 3142, GGTTCT at 2873, GGTTCT at 2845, GGACAT at 2618, GGTCAT at 2238, GGTCGT at 2012, AGACCT at 1950, AGTCCT at 1718, AGTCGT at 1653, GGACGT at 1511, GGTTCT at 1500, GGTTAT at 1472, GGATGT at 1449, GGACAT at 1342, GGACGT at 1184, AGACCT at 759, AGATGT at 401, AGATGT at 346, GGACAT at 89.
  2. DPEJGr3: AGTTCT at 3988, GGACAT at 3885, AGTCCT at 3028, GGTCAT at 2855, AGTCGT at 2687, GGACCT at 2652, GGATCT at 2567, AGTCCT at 2550, GGATCT at 2435, GGTCAT at 2304, AGTTGT at 2264, GGATGT at 1791, AGATGT at 1715, GGTTCT at 1683, GGTCAT at 1632, AGTTCT at 1539, GGTTGT at 1193, GGACAT at 1167, AGTTCT at 1019, AGATAT at 941, GGTCAT at 748, AGACAT at 661, GGACGT at 629, GGTTGT at 529, AGTCAT at 478, GGTCCT at 348, GGTTCT at 329, GGTTAT at 258, GGACCT at 76.
  3. DPEJGr5: GGTTCT at 3378, GGTTAT at 3233, GGTCGT at 3121, AGTTCT at 3089, AGTCGT at 2940, AGATCT at 2830, GGTTCT at 2557, GGATCT at 2429, AGTCGT at 2195, AGACGT at 1586, GGACCT at 1383, AGACAT at 1257, AGACGT at 711, GGTTCT at 210.
  4. DPEJGr7: AGACGT at 4032, GGTTGT at 3908, AGTTAT at 3773, AGTCCT at 3577, GGACGT at 3402, GGATCT at 3368, GGACAT at 3168, AGATAT at 3054, AGTTGT at 2938, GGACGT at 2805, GGTCGT at 2781, AGTTCT at 2572, GGATCT at 2461, AGTCGT at 2263, GGTTGT at 2138, AGTCCT at 2097, GGTCCT at 2052, AGTCCT at 2013, GGTTGT at 1883, AGTTAT at 1806, GGACAT at 1648, GGTTGT at 1310, AGTCAT at 1015, GGTCAT at 973, GGACAT at 798, GGTTCT at 735, GGACGT at 696, GGTCAT at 635, GGATGT at 315, GGTCGT at 264, AGTCAT at 203, GGATGT at 117.
  5. DPEJGr9: AGATAT at 4020, AGACAT at 3639, GGACGT at 3509, AGATAT at 2994, AGACAT at 2918, GGTTAT at 2893, GGTCGT at 2613, GGACCT at 2334, GGACAT at 1846, AGACAT at 1631, GGATGT at 1468, GGTCCT at 1459.
  6. DPEJGr1ci: AGATCT at 3861, ATGACC at 3635, ATAACC at 3475, ACGTCC at 3430, ACGTCC at 3300, AGGACT at 3277, ACGACT at 3246, AGGACC at 3116, AGATCC at 2835, ACAACT at 2751, AGGTCC at 2560, AGGACC at 2461, ATAACT at 2157, AGGACC at 2122, ACGACT at 1828, AGATCC at 1799, ACGACC at 1762, ACGACC at 1736, ATGACT at 1635, ACATCC at 1203, ACGTCC at 1186, ACATCT at 1130, AGAACT at 1035, AGGACT at 647, ACGACT at 618, AGAACT at 471, ATATCT at 59, ATGTCT at 24.
  7. DPEJGr3ci: ATAACC at 3903, AGATCC at 3825, ATAACC at 3674, ATAACC at 3447, ATAACC at 3339, AGGACC at 3318, ACATCT at 3285, ATGTCC at 3188, ACGTCT at 2585, AGAACC at 2243, AGAACC at 1301, ATAACT at 1290, AGGACC at 1274, ATATCT at 943, AGAACC at 691, AGGTCT at 679, AGGACC at 342, AGGTCT at 246, AGAACT at 221, AGAACC at 188.
  8. DPEJGr5ci: AGATCC at 4031, AGAACC at 3986, AGAACC at 3394, AGGACT at 3365, AGATCT at 2830, ATAACC at 2093, AGAACC at 1948, ATAACT at 1873, AGAACC at 1430, ACGTCC at 905, AGAACC at 613, ATGACC at 505, AGAACT at 342.
  9. DPEJGr7ci: ATGTCT at 4004, ACAACC at 3418, ACATCC at 3170, ACGTCC at 2860, ATAACT at 2828, AGGTCC at 2812, AGGACC at 2795, ACGTCT at 2705, AGGACC at 2688, AGGTCC at 2676, ACAACC at 2550, ACAACC at 2408, ACGACT at 2363, ATAACT at 2329, AGGACT at 2070, ACGTCC at 1576, ACGACC at 1522, ATAACC at 1447, ATGTCC at 1145, ACGTCC at 946, ACGTCT at 698, ATGACT at 393, ACAACC at 63.
  10. DPEJGr9ci: ACATCC at 3750, ATGACT at 3704, ATGACC at 3680, AGGACC at 3378, ATGTCT at 3298, ATATCC at 2996, AGGTCT at 2648, AGATCC at 2431, ACGACC at 2148, ATATCC at 2097, ATGACT at 2080, ACATCC at 2024, AGGACT at 1596, ACATCC at 1523, AGGACT at 1385, AGGTCC at 575, AGGTCT at 450, ACGACC at 288, ACAACT at 120.

DPEJGr alternate positive direction (evens) (4050-1) distal promoters

  1. DPEJGr0: AGACCT at 4049, GGACAT at 3848, AGTCGT at 3812, GGTCCT at 3613, GGTCGT at 3560, GGTTGT at 3018, GGTTGT at 2933, AGACGT at 2537, AGTTAT at 2140, GGACCT at 2104, GGTTCT at 2049, GGTCGT at 1753, AGTTGT at 1668, AGATAT at 1538, AGATAT at 1466, GGATCT at 925, GGATGT at 763, GGATCT at 737, GGTTGT at 723, GGTCAT at 700, AGTTAT at 558.
  2. DPEJGr2: AGACAT at 4045, GGACAT at 3250, AGTCGT at 2873, GGTTCT at 2738, GGACCT at 2405, AGATAT at 2114, AGACGT at 1861, GGACAT at 1777, AGATGT at 1648, GGACGT at 1603, AGACCT at 1507, GGTCCT at 1401, AGTTGT at 1036, AGTTCT at 879, AGACCT at 683, GGACAT at 495, GGACAT at 218, AGTCAT at 200, GGTTCT at 179, GGATCT at 158.
  3. DPEJGr4: AGTCCT at 3971, GGATCT at 3923, GGTTGT at 3530, AGACAT at 3368, AGTCCT at 3317, AGATCT at 3133, GGTCCT at 2667, GGTTGT at 2435, GGACGT at 2313, GGTTAT at 2293, AGTCAT at 2249, GGTTGT at 2161, AGTTGT at 2007, AGATCT at 1670, GGACGT at 1605, GGATCT at 1495, AGACGT at 1452, GGTCCT at 799, GGATGT at 728, GGTCGT at 358, GGACGT at 291, AGACCT at 260.
  4. DPEJGr6: AGTCCT at 4012, AGTCGT at 3837, GGACAT at 3657, AGTCGT at 3489, GGACCT at 3416, AGTTCT at 3091, AGATCT at 2937, AGTTAT at 2806, GGATGT at 2708, AGTTGT at 2639, AGATCT at 2391, GGACCT at 2016, GGTTCT at 1515, AGATAT at 1242, AGTCAT at 1110, AGACCT at 855, GGACAT at 792, GGACGT at 609, GGTTGT at 586, GGTCGT at 157.
  5. DPEJGr8: GGACAT at 4027, AGATAT at 3980, GGTCGT at 3844, GGACGT at 3601, GGATAT at 3555, AGACGT at 3542, AGACAT at 3218, GGACCT at 3008, AGATAT at 2999, GGTCCT at 2877, AGTCCT at 2804, AGATGT at 2727, GGTTGT at 2380, AGTCCT at 2208, GGATAT at 1701, GGTCCT at 1457, GGTTAT at 1335, GGTCCT at 1196, AGACAT at 1155, AGTCCT at 1013, GGTTAT at 552, AGATGT at 522, GGTCGT at 247, GGTTAT at 92, AGATCT at 27.
  6. DPEJGr0ci: AGAACC at 3587, AGAACT at 3494, ATGTCC at 3377, AGAACC at 3322, AGGACC at 2801, AGATCC at 2762, AGGTCC at 2747, ATGACC at 2730, ACGACC at 2628, ACGTCC at 2594, ACGTCT at 2539, AGATCC at 2250, ATGTCT at 2115, ATAACT at 1792, AGGTCT at 1047, AGGACC at 942, ATGACT at 704, AGGACC at 678, ATAACT at 431, AGAACC at 55.
  7. DPEJGr2ci: ACAACC at 3839, ATAACC at 3824, ATGACT at 3809, ACATCC at 3179, ACATCC at 2810, AGGTCC at 2613, ATATCT at 2415, AGGACT at 2333, ACGACC at 2317, ACAACC at 2193, AGGACT at 1676, ACGTCT at 1661, ACAACT at 1309, ATAACC at 1140, ATGTCT at 920, ATGACT at 373, ACATCC at 220.
  8. DPEJGr4ci: ACGACC at 4045, ACGACT at 3938, ACAACT at 3712, ACGTCT at 3509, AGATCT at 3133, AGATCC at 3048, AGAACT at 2998, AGGACT at 2956, ACAACC at 2856, ATAACT at 2378, ACGTCT at 2315, AGAACT at 1747, AGAACT at 1740, AGATCT at 1670, AGGTCT at 980, AGGTCC at 676, AGAACT at 579, ATAACT at 466, ACAACC at 421, AGAACT at 267, AGGACT at 9.
  9. DPEJGr6ci: AGGACT at 3989, ACGACC at 3784, ATGACC at 3192, ACGACC at 2954, AGATCT at 2937, ACGTCT at 2927, AGATCT at 2391, ACAACC at 2358, ATATCC at 2352, ATAACT at 2241, ATATCC at 2054, ATATCT at 2038, ACAACC at 1917, ACAACT at 1808, ATGTCT at 1291, AGGACC at 1225, ATAACT at 1133, ACATCT at 810, AGGACC at 780, ACGTCC at 611, AGGACC at 387, AGGACC at 374, AGGTCC at 306, ATGTCC at 46.
  10. DPEJGr8ci: ACATCC at 4029, ACGACC at 3935, AGGACC at 3647, ACAACC at 3233, ACATCT at 3220, AGAACC at 3043, ACAACC at 2604, ATAACT at 2144, ACAACC at 2105, AGGTCC at 2075, ACAACT at 1906, AGGACT at 1765, AGGTCT at 1517, AGGTCC at 1417, ATGACT at 1133, AGAACC at 1069, AGGTCC at 1023, AGAACT at 885, AGGTCT at 399, AGATCT at 27.

DPE (Juven-Gershon) analysis and results

The DPE consensus sequence is the more general sequence RGWYVT, or (A/G)G(A/T)(C/T)(A/C/G)T.[2]

Reals or randoms  Promoters  Direction           Numbers  Strands  Occurrences  Averages (± 0.1)
Reals             UTR        negative            48       2        24           24 ± 10 (--34,+-14)
Randoms           UTR        arbitrary negative  82       10       8.2          9.45 ± 1.25
Randoms           UTR        alternate negative  107      10       10.7         9.45 ± 1.25
Reals             Core       negative            1        2        0.5          0.5
Randoms           Core       arbitrary negative  0        10       0            0.3
Randoms           Core       alternate negative  6        10       0.6          0.3
Reals             Core       positive            3        2        1.5          1.5
Randoms           Core       arbitrary positive  28       10       2.8          1.9 ± 0.9
Randoms           Core       alternate positive  10       10       1            1.9 ± 0.9
Reals             Proximal   negative            3        2        1.5          1.5
Randoms           Proximal   arbitrary negative  15       10       1.5          1.35 ± 0.15
Randoms           Proximal   alternate negative  12       10       1.2          1.35 ± 0.15
Reals             Proximal   positive            7        2        3.5          3.5 ± 2.5 (-+1,++6)
Randoms           Proximal   arbitrary positive  15       10       1.5          1.1 ± 0.4
Randoms           Proximal   alternate positive  7        10       0.7          1.1 ± 0.4
Reals             Distal     negative            57       2        28.5         28.5 ± 18.5 (--47,+-10)
Randoms           Distal     arbitrary negative  133      10       13.3         13.5 ± 0.2
Randoms           Distal     alternate negative  137      10       13.7         13.5 ± 0.2
Reals             Distal     positive            93       2        46.5         46.5 ± 14.5 (-+32,++61)
Randoms           Distal     arbitrary positive  211      10       21.1         21.05 ± 0.05
Randoms           Distal     alternate positive  210      10       21.0         21.05 ± 0.05
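The table does not say how the Occurrences and Averages columns were derived. The values are consistent with Occurrences being the number of hits divided by the number of strands (or random datasets), and Averages being the midpoint of two values plus or minus half their difference. The sketch below works through that arithmetic under this assumption; the function names are hypothetical.

```python
def per_strand(count, strands):
    """Occurrences column: hits divided by the number of strands or datasets."""
    return count / strands


def midpoint_and_half_range(first, second):
    """Averages column (assumed): midpoint of two values and half their spread."""
    return (first + second) / 2, abs(first - second) / 2


if __name__ == "__main__":
    # Real UTRs, negative direction: 34 hits on one strand, 14 on the other.
    mean, spread = midpoint_and_half_range(34, 14)
    print(f"reals UTR negative: {mean:g} ± {spread:g}")

    # Random UTRs: 82 hits in the arbitrary group, 107 in the alternate group,
    # each spread over 10 entries.
    arbitrary = per_strand(82, 10)
    alternate = per_strand(107, 10)
    mean, spread = midpoint_and_half_range(arbitrary, alternate)
    print(f"randoms UTR negative: {mean:.2f} ± {spread:.2f}")
```

For the real UTRs, `midpoint_and_half_range(34, 14)` gives 24 and 10, matching the entry "24 ± 10 (--34,+-14)"; for the random UTRs the two per-dataset values 8.2 and 10.7 give 9.45 ± 1.25.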

Comparison:

The occurrences of real DPE (Juven-Gershon) sequences in the UTRs are greater than those of the randoms. The core promoters are within the ranges of the randoms. The negative direction proximals are at the limits of the randoms, but the positive strand, positive direction proximal is greater than the randoms. The negative strand, negative direction distals are outside the range of the randoms, and the positive direction distals are greater than the randoms. This suggests that most or all of the real DPE (Juven-Gershon)s are likely active or activable.

DPE (Kadonaga) samplings

Copying the consensus sequence (A/G)G(A/T)CGTG and pasting it into a "⌘F" search finds two occurrences between ZNF497 and A1BG, or seven between ZSCAN22 and A1BG, as can also be found by the computer programs.

The Basic programs (starting with SuccessablesDPEK.bas) test the consensus sequence (A/G)G(A/T)CGTG by comparing it with the nucleotide sequences on either the template strand (-) or the coding strand (+) of the DNA, in the negative direction (-) or the positive direction (+). The programs, the sequences they look for, and the occurrences found are:

  1. negative strand, negative direction, looking for (A/G)G(A/T)CGTG, 7, GGTCGTG at 3733, GGTCGTG at 3072, GGTCGTG at 1787, GGTCGTG at 1142, GGTCGTG at 678, GGTCGTG at 542, GGTCGTG at 405.
  2. positive strand, negative direction, looking for (A/G)G(A/T)CGTG, 1, AGACGTG at 4237.
  3. positive strand, positive direction, looking for (A/G)G(A/T)CGTG, 8, AGTCGTG at 3043, AGTCGTG at 2200, AGTCGTG at 2104, GGACGTG at 1471, GGTCGTG at 1459, GGACGTG at 1371, GGTCGTG at 1359, GGTCGTG at 619.
  4. negative strand, positive direction, looking for (A/G)G(A/T)CGTG, 0.
  5. complement, negative strand, negative direction, looking for (C/T)C(A/T)GCAC, 1, TCTGCAC at 4237.
  6. complement, positive strand, negative direction, looking for (C/T)C(A/T)GCAC, 7, CCAGCAC at 3733, CCAGCAC at 3072, CCAGCAC at 1787, CCAGCAC at 1142, CCAGCAC at 678, CCAGCAC at 542, CCAGCAC at 405.
  7. complement, positive strand, positive direction, looking for (C/T)C(A/T)GCAC, 0.
  8. complement, negative strand, positive direction, looking for (C/T)C(A/T)GCAC, 8, TCAGCAC at 3043, TCAGCAC at 2200, TCAGCAC at 2104, CCAGCAC at 1471, CCAGCAC at 1459, CCAGCAC at 1371, CCAGCAC at 1359, CCAGCAC at 619.
  9. inverse complement, negative strand, negative direction, looking for CACG(A/T)C(C/T), 1, CACGTCT at 3431.
  10. inverse complement, positive strand, negative direction, looking for CACG(A/T)C(C/T), 1, CACGTCT at 1774.
  11. inverse complement, positive strand, positive direction, looking for CACG(A/T)C(C/T), 3, CACGTCC at 3466, CACGTCC at 2683, CACGTCC at 1788.
  12. inverse complement, negative strand, positive direction, looking for CACG(A/T)C(C/T), 1, CACGTCT at 3256.
  13. inverse negative strand, negative direction, looking for GTGC(A/T)G(A/G), 1, GTGCAGA at 1774.
  14. inverse positive strand, negative direction, looking for GTGC(A/T)G(A/G), 1, GTGCAGA at 3431.
  15. inverse positive strand, positive direction, looking for GTGC(A/T)G(A/G), 1, GTGCAGA at 3256.
  16. inverse negative strand, positive direction, looking for GTGC(A/T)G(A/G), 3, GTGCAGG at 3466, GTGCAGG at 2683, GTGCAGG at 1788.
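
The four search patterns in the list above (direct, complement, inverse complement, and inverse) can be derived mechanically from the consensus sequence. The following is a minimal Python sketch of that derivation and of a degenerate search; it is not the Basic programs themselves. The function names are hypothetical, the toy sequence is invented for illustration, and reporting 1-based start positions is an assumption that may not match the programs' numbering.

```python
import re

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def parse(consensus):
    """Split e.g. '(A/G)G(A/T)CGTG' into one set of allowed bases per position."""
    tokens = re.findall(r"\(([ACGT/]+)\)|([ACGT])", consensus)
    return [frozenset((group or single).split("/")) for group, single in tokens]


def render(positions):
    """Write per-position base sets back in the (A/G) notation used on this page."""
    return "".join(
        next(iter(p)) if len(p) == 1 else "(" + "/".join(sorted(p)) + ")"
        for p in positions
    )


def variants(consensus):
    """Return the direct, complement, inverse complement and inverse patterns."""
    direct = parse(consensus)
    comp = [frozenset(COMPLEMENT[b] for b in p) for p in direct]
    return {
        "direct": render(direct),
        "complement": render(comp),
        "inverse complement": render(comp[::-1]),
        "inverse": render(direct[::-1]),
    }


def find_all(consensus, sequence):
    """List (1-based start, matched text) for every match, overlaps included."""
    regex = "".join("[" + "".join(sorted(p)) + "]" for p in parse(consensus))
    # A lookahead keeps the scan from skipping overlapping matches.
    return [(m.start() + 1, m.group(1))
            for m in re.finditer("(?=(" + regex + "))", sequence)]


# Toy sequence, invented for illustration only (not from the A1BG region).
toy = "TTGGTCGTGAAAGACGTGCC"
for name, pattern in variants("(A/G)G(A/T)CGTG").items():
    print(name, pattern, find_all(pattern, toy))
```

Each of the sixteen list entries then corresponds to running one of the four patterns against one strand in one direction.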

DPE (Kadonaga) UTRs

Negative strand, negative direction: GGTCGTG at 3733, CACGTCT at 3431, GGTCGTG at 3072.

Positive strand, negative direction: AGACGTG at 4237.

DPE (Kadonaga) distal promoters

Negative strand, negative direction: GGTCGTG at 1787, GGTCGTG at 1142, GGTCGTG at 678, GGTCGTG at 542, GGTCGTG at 405.

Positive strand, negative direction: CACGTCC at 3466, CACGTCC at 2683, CACGTCC at 1788, CACGTCT at 1774.

Positive strand, positive direction: AGTCGTG at 3043, AGTCGTG at 2200, AGTCGTG at 2104, GGACGTG at 1471, GGTCGTG at 1459, GGACGTG at 1371, GGTCGTG at 1359, GGTCGTG at 619.

Negative strand, positive direction: CACGTCT at 3256.

DPE (Kadonaga) random dataset samplings

  1. DPEKr0: 0.
  2. DPEKr1: 0.
  3. DPEKr2: 0.
  4. DPEKr3: 0.
  5. DPEKr4: 1, AGACGTG at 1453.
  6. DPEKr5: 2, AGACGTG at 1587, AGACGTG at 712.
  7. DPEKr6: 0.
  8. DPEKr7: 1, AGTCGTG at 4301.
  9. DPEKr8: 1, GGTCGTG at 3845.
  10. DPEKr9: 0.
  11. DPEKr0ci: 1, CACGTCC at 2594.
  12. DPEKr1ci: 4, CACGTCC at 3430, CACGTCC at 3300, CACGACT at 3246, CACGACC at 1762.
  13. DPEKr2ci: 0.
  14. DPEKr3ci: 0.
  15. DPEKr4ci: 2, CACGTCC at 4367, CACGTCT at 3509.
  16. DPEKr5ci: 0.
  17. DPEKr6ci: 1, CACGTCT at 2927.
  18. DPEKr7ci: 2, CACGACC at 1522, CACGTCC at 946.
  19. DPEKr8ci: 0.
  20. DPEKr9ci: 1, CACGACC at 2148.

DPEKr arbitrary (evens) (4560-2846) UTRs

  1. DPEKr8: GGTCGTG at 3845.
  2. DPEKr4ci: CACGTCC at 4367, CACGTCT at 3509.
  3. DPEKr6ci: CACGTCT at 2927.

DPEKr alternate (odds) (4560-2846) UTRs

  1. DPEKr7: AGTCGTG at 4301.
  2. DPEKr1ci: CACGTCC at 3430, CACGTCC at 3300, CACGACT at 3246.

DPEKr arbitrary positive direction (odds) (4445-4265) core promoters

  1. DPEKr7: AGTCGTG at 4301.

DPEKr alternate positive direction (evens) (4445-4265) core promoters

  1. DPEKr4ci: CACGTCC at 4367.

DPEKr arbitrary negative direction (evens) (2596-1) distal promoters

  1. DPEKr4: AGACGTG at 1453.
  2. DPEKr0ci: CACGTCC at 2594.

DPEKr alternate negative direction (odds) (2596-1) distal promoters

  1. DPEKr5: AGACGTG at 1587, AGACGTG at 712.
  2. DPEKr1ci: CACGACC at 1762.
  3. DPEKr7ci: CACGACC at 1522, CACGTCC at 946.
  4. DPEKr9ci: CACGACC at 2148.

DPEKr arbitrary positive direction (odds) (4050-1) distal promoters

  1. DPEKr5: AGACGTG at 1587, AGACGTG at 712.
  2. DPEKr1ci: CACGTCC at 3430, CACGTCC at 3300, CACGACT at 3246, CACGACC at 1762.
  3. DPEKr7ci: CACGACC at 1522, CACGTCC at 946.
  4. DPEKr9ci: CACGACC at 2148.

DPEKr alternate positive direction (evens) (4050-1) distal promoters

  1. DPEKr4: AGACGTG at 1453.
  2. DPEKr8: GGTCGTG at 3845.
  3. DPEKr0ci: CACGTCC at 2594.
  4. DPEKr4ci: CACGTCT at 3509.
  5. DPEKr6ci: CACGTCT at 2927.
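
The groupings above assign each random hit to a region by its position. A minimal Python sketch of that binning follows; the coordinate ranges are copied from the headings above, while the dictionary layout and the function name `regions_for` are hypothetical. The proximal promoter ranges are not given in this section, so they are left out rather than guessed.

```python
# Inclusive coordinate ranges, copied from the section headings above.
REGIONS = {
    ("UTR", "negative"): (2846, 4560),
    ("core", "positive"): (4265, 4445),
    ("distal", "negative"): (1, 2596),
    ("distal", "positive"): (1, 4050),
}


def regions_for(position, direction):
    """Return the names of the regions containing a hit for one direction."""
    return [
        name
        for (name, region_direction), (low, high) in REGIONS.items()
        if region_direction == direction and low <= position <= high
    ]


# DPEKr4ci hits from the random dataset list: 4367 and 3509.
for hit in (4367, 3509):
    print(hit, "negative:", regions_for(hit, "negative"),
          "positive:", regions_for(hit, "positive"))
```

The same hit can count in one region for the negative direction and in another for the positive direction, which is why DPEKr4ci appears in the UTR, core promoter, and distal promoter groupings.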

DPE (Kadonaga) analysis and results

The early DPE consensus sequence was RGWCGTG, or (A/G)G(A/T)CGTG.[5][7]

Reals or randoms | Promoters | Direction          | Numbers | Strands | Occurrences | Averages (± 0.1)
Reals            | UTR       | negative           | 4       | 2       | 2           | 2 ± 1 (--3,+-1)
Randoms          | UTR       | arbitrary negative | 4       | 10      | 0.4         | 0.4
Randoms          | UTR       | alternate negative | 4       | 10      | 0.4         | 0.4
Reals            | Core      | negative           | 0       | 2       | 0           | 0
Randoms          | Core      | arbitrary negative | 0       | 10      | 0           | 0
Randoms          | Core      | alternate negative | 0       | 10      | 0           | 0
Reals            | Core      | positive           | 0       | 2       | 0           | 0
Randoms          | Core      | arbitrary positive | 1       | 10      | 0.1         | 0.1
Randoms          | Core      | alternate positive | 1       | 10      | 0.1         | 0.1
Reals            | Proximal  | negative           | 0       | 2       | 0           | 0
Randoms          | Proximal  | arbitrary negative | 0       | 10      | 0           | 0
Randoms          | Proximal  | alternate negative | 0       | 10      | 0           | 0
Reals            | Proximal  | positive           | 0       | 2       | 0           | 0
Randoms          | Proximal  | arbitrary positive | 0       | 10      | 0           | 0
Randoms          | Proximal  | alternate positive | 0       | 10      | 0           | 0
Reals            | Distal    | negative           | 9       | 2       | 4.5         | 4.5 ± 0.5 (--5,+-4)
Randoms          | Distal    | arbitrary negative | 2       | 10      | 0.2         | 0.4
Randoms          | Distal    | alternate negative | 6       | 10      | 0.6         | 0.4
Reals            | Distal    | positive           | 9       | 2       | 4.5         | 4.5 ± 3.5 (-+1,++8)
Randoms          | Distal    | arbitrary positive | 9       | 10      | 0.9         | 0.7
Randoms          | Distal    | alternate positive | 5       | 10      | 0.5         | 0.7
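
The arithmetic behind the table appears to be as follows, inferred from the rows rather than stated in the source: Occurrences is Numbers divided by Strands (2 strands for the reals, 10 datasets for the randoms), the ± value for the reals is half the difference between the two per-strand counts shown in parentheses, and the random average pools the arbitrary and alternate rows. A short Python sketch under those assumptions, with hypothetical function names:

```python
def per_strand_summary(strand_counts):
    """Return (mean, half-range) of the per-strand hit counts for the reals."""
    mean = sum(strand_counts) / len(strand_counts)
    half_range = (max(strand_counts) - min(strand_counts)) / 2
    return mean, half_range


def random_occurrences(total_hits, datasets=10):
    """Hits per random dataset for one row of the table."""
    return total_hits / datasets


def pooled_random_average(arbitrary_hits, alternate_hits, datasets=10):
    """Average of the arbitrary and alternate rows, each over `datasets` sets."""
    return (arbitrary_hits + alternate_hits) / (2 * datasets)


# Reals, distal, negative direction: (--5,+-4) in the table.
print(per_strand_summary([5, 4]))
# Reals, distal, positive direction: (-+1,++8) in the table.
print(per_strand_summary([1, 8]))
# Randoms, distal, positive direction: 9 and 5 hits over 10 datasets each.
print(random_occurrences(9), random_occurrences(5), pooled_random_average(9, 5))
```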

Comparison:

Real DPE (Kadonaga) occurrences are greater than the randoms in the UTRs and the distal promoters; none occur in the core or proximal promoters. This suggests that the real DPE (Kadonaga) elements are likely active or activable.

DPE (Matsumoto) samplings

Copying the responsive element consensus sequence AGTCTC into a "⌘F" search finds four occurrences between ZNF497 and A1BG or two between ZSCAN22 and A1BG, as can also be found by the computer programs.

The Basic programs testing the consensus sequence AGTCTC (starting with SuccessablesDPEM.bas) compare nucleotide sequences with the sequences on either the template strand (-) or the coding strand (+) of the DNA, in the negative direction (-) or the positive direction (+). The programs, the sequence each looked for, and what each found are:

  1. negative strand, negative direction, looking for AGTCTC, 1, AGTCTC at 3645.
  2. positive strand, negative direction, looking for AGTCTC, 1, AGTCTC at 1445.
  3. positive strand, positive direction, looking for AGTCTC, 3, AGTCTC at 2730, AGTCTC at 2700, AGTCTC at 2610.
  4. negative strand, positive direction, looking for AGTCTC, 1, AGTCTC at 3188.
  5. complement, negative strand, negative direction, looking for TCAGAG, 1, TCAGAG at 1445.
  6. complement, positive strand, negative direction, looking for TCAGAG, 1, TCAGAG at 3645.
  7. complement, positive strand, positive direction, looking for TCAGAG, 1, TCAGAG at 3188.
  8. complement, negative strand, positive direction, looking for TCAGAG, 3, TCAGAG at 2730, TCAGAG at 2700, TCAGAG at 2610.
  9. inverse complement, negative strand, negative direction, looking for GAGACT, 0.
  10. inverse complement, positive strand, negative direction, looking for GAGACT, 4, GAGACT at 4053, GAGACT at 1933, GAGACT at 1081, GAGACT at 915.
  11. inverse complement, positive strand, positive direction, looking for GAGACT, 2, GAGACT at 3123, GAGACT at 255.
  12. inverse complement, negative strand, positive direction, looking for GAGACT, 0.
  13. inverse negative strand, negative direction, looking for CTCTGA, 4, CTCTGA at 4053, CTCTGA at 1933, CTCTGA at 1081, CTCTGA at 915.
  14. inverse positive strand, negative direction, looking for CTCTGA, 0.
  15. inverse positive strand, positive direction, looking for CTCTGA, 0.
  16. inverse negative strand, positive direction, looking for CTCTGA, 2, CTCTGA at 3123, CTCTGA at 255.

DPE (Matsumoto) UTRs

Negative strand, negative direction: AGTCTC at 3645.

Positive strand, negative direction: GAGACT at 4053.

DPE (Matsumoto) distal promoters

Positive strand, negative direction: GAGACT at 1933, AGTCTC at 1445, GAGACT at 1081, GAGACT at 915.

Negative strand, positive direction: AGTCTC at 3188.

Positive strand, positive direction: GAGACT at 3123, AGTCTC at 2730, AGTCTC at 2700, AGTCTC at 2610, GAGACT at 255.

DPE (Matsumoto) random dataset samplings

  1. DPEMr0: 0.
  2. DPEMr1: 1, AGTCTC at 905.
  3. DPEMr2: 1, AGTCTC at 4519.
  4. DPEMr3: 1, AGTCTC at 3217.
  5. DPEMr4: 0.
  6. DPEMr5: 1, AGTCTC at 4010.
  7. DPEMr6: 0.
  8. DPEMr7: 0.
  9. DPEMr8: 0.
  10. DPEMr9: 0.
  11. DPEMr0ci: 1, GAGACT at 2272.
  12. DPEMr1ci: 1, GAGACT at 1413.
  13. DPEMr2ci: 1, GAGACT at 1114.
  14. DPEMr3ci: 0.
  15. DPEMr4ci: 1, GAGACT at 3618.
  16. DPEMr5ci: 0.
  17. DPEMr6ci: 2, GAGACT at 3055, GAGACT at 538.
  18. DPEMr7ci: 1, GAGACT at 4351.
  19. DPEMr8ci: 1, GAGACT at 4467.
  20. DPEMr9ci: 0.
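
A random-dataset sampling of this kind can be sketched in Python as below. This section does not state how the random datasets were generated, so everything here is an assumption: uniformly random bases, a length of 4560 nucleotides (taken from the largest coordinate in the headings), ten seeded datasets, and the hypothetical function names. The sketch searches each dataset for the direct sequence AGTCTC and for its inverse complement GAGACT, the two sequences that appear in the list above.

```python
import random


def random_sequence(length, seed):
    """Build a reproducible random nucleotide string (uniform bases assumed)."""
    rng = random.Random(seed)
    return "".join(rng.choice("ACGT") for _ in range(length))


def positions(motif, sequence):
    """Return the 1-based start of every occurrence of an exact motif."""
    found = []
    start = sequence.find(motif)
    while start != -1:
        found.append(start + 1)
        start = sequence.find(motif, start + 1)
    return found


# Ten hypothetical random datasets, numbered 0 to 9 like DPEMr0 to DPEMr9.
for index in range(10):
    dataset = random_sequence(4560, seed=index)
    print(f"r{index}: AGTCTC at {positions('AGTCTC', dataset)}, "
          f"GAGACT at {positions('GAGACT', dataset)}")
```

The counts printed by this sketch will differ from the list above, since the seeds and the generator are not those used for the page.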

DPEMr arbitrary (evens) (4560-2846) UTRs

  1. DPEMr2: AGTCTC at 4519.
  2. DPEMr4ci: GAGACT at 3618.
  3. DPEMr6ci: GAGACT at 3055.
  4. DPEMr8ci: GAGACT at 4467.

DPEMr alternate (odds) (4560-2846) UTRs

  1. DPEMr3: AGTCTC at 3217.
  2. DPEMr5: AGTCTC at 4010.
  3. DPEMr7ci: GAGACT at 4351.

DPEMr arbitrary positive direction (odds) (4445-4265) core promoters

  1. DPEMr7ci: GAGACT at 4351.

DPEMr arbitrary negative direction (evens) (2596-1) distal promoters

  1. DPEMr0ci: GAGACT at 2272.
  2. DPEMr2ci: GAGACT at 1114.
  3. DPEMr6ci: GAGACT at 538.

DPEMr alternate negative direction (odds) (2596-1) distal promoters

  1. DPEMr1: AGTCTC at 905.
  2. DPEMr1ci: GAGACT at 1413.

DPEMr arbitrary positive direction (odds) (4050-1) distal promoters

  1. DPEMr1: AGTCTC at 905.
  2. DPEMr3: AGTCTC at 3217.
  3. DPEMr5: AGTCTC at 4010.
  4. DPEMr1ci: GAGACT at 1413.

DPEMr alternate positive direction (evens) (4050-1) distal promoters

  1. DPEMr0ci: GAGACT at 2272.
  2. DPEMr2ci: GAGACT at 1114.
  3. DPEMr4ci: GAGACT at 3618.
  4. DPEMr6ci: GAGACT at 3055, GAGACT at 538.
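
The "evens" and "odds" groupings above split the random datasets by the parity of their dataset number before counting hits inside a region. A minimal Python sketch of that tally follows, using the hit positions listed under the DPE (Matsumoto) random dataset samplings; the function name `parity_counts` and the data layout are hypothetical, and datasets with no hits are omitted.

```python
import re

# Hit positions copied from the DPE (Matsumoto) random dataset samplings.
HITS = {
    "DPEMr1": [905], "DPEMr2": [4519], "DPEMr3": [3217], "DPEMr5": [4010],
    "DPEMr0ci": [2272], "DPEMr1ci": [1413], "DPEMr2ci": [1114],
    "DPEMr4ci": [3618], "DPEMr6ci": [3055, 538], "DPEMr7ci": [4351],
    "DPEMr8ci": [4467],
}


def parity_counts(hits, low, high):
    """Count hits within [low, high], split by even or odd dataset number."""
    counts = {"evens": 0, "odds": 0}
    for name, positions in hits.items():
        number = int(re.search(r"r(\d+)", name).group(1))
        key = "evens" if number % 2 == 0 else "odds"
        counts[key] += sum(low <= p <= high for p in positions)
    return counts


print("UTRs (4560-2846):", parity_counts(HITS, 2846, 4560))
print("Distal, negative direction (2596-1):", parity_counts(HITS, 1, 2596))
```

These tallies correspond to the Numbers column for the random UTR and negative direction distal rows in the analysis table.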

DPE (Matsumoto) analysis and results

The DPE in "the ATP‐binding cassette subfamily G member 2 gene in the marine pufferfish Takifugu rubripes" is 5'-AGTCTC-3'.[8]

Reals or randoms | Promoters | Direction          | Numbers | Strands | Occurrences | Averages (± 0.1)
Reals            | UTR       | negative           | 2       | 2       | 1           | 1
Randoms          | UTR       | arbitrary negative | 4       | 10      | 0.4         | 0.35
Randoms          | UTR       | alternate negative | 3       | 10      | 0.3         | 0.35
Reals            | Core      | negative           | 0       | 2       | 0           | 0
Randoms          | Core      | arbitrary negative | 0       | 10      | 0           | 0
Randoms          | Core      | alternate negative | 0       | 10      | 0           | 0
Reals            | Core      | positive           | 0       | 2       | 0           | 0
Randoms          | Core      | arbitrary positive | 1       | 10      | 0.1         | 0.05
Randoms          | Core      | alternate positive | 0       | 10      | 0           | 0.05
Reals            | Proximal  | negative           | 0       | 2       | 0           | 0
Randoms          | Proximal  | arbitrary negative | 0       | 10      | 0           | 0
Randoms          | Proximal  | alternate negative | 0       | 10      | 0           | 0
Reals            | Proximal  | positive           | 0       | 2       | 0           | 0
Randoms          | Proximal  | arbitrary positive | 0       | 10      | 0           | 0
Randoms          | Proximal  | alternate positive | 0       | 10      | 0           | 0
Reals            | Distal    | negative           | 4       | 2       | 2           | 2 ± 2 (--0,+-4)
Randoms          | Distal    | arbitrary negative | 3       | 10      | 0.3         | 0.25
Randoms          | Distal    | alternate negative | 2       | 10      | 0.2         | 0.25
Reals            | Distal    | positive           | 6       | 2       | 3           | 3 ± 2 (-+1,++5)
Randoms          | Distal    | arbitrary positive | 4       | 10      | 0.4         | 0.45
Randoms          | Distal    | alternate positive | 5       | 10      | 0.5         | 0.45

Comparison:

Real DPE (Matsumoto) occurrences are greater than the randoms in the UTRs and the distal promoters; none occur in the core or proximal promoters. This suggests that the real DPE (Matsumoto) elements are likely active or activable.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

References

  1. Jennifer E.F. Butler, James T. Kadonaga (October 15, 2002). "The RNA polymerase II core promoter: a key component in the regulation of gene expression". Genes & Development. 16 (20): 2583–92. doi:10.1101/gad.1026202. PMID 12381658.
  2. Tamar Juven-Gershon, James T. Kadonaga (March 15, 2010). "Regulation of Gene Expression via the Core Promoter and the Basal Transcriptional Machinery". Developmental Biology. 339 (2): 225–9. doi:10.1016/j.ydbio.2009.08.009. PMC 2830304. PMID 19682982.
  3. Thomas W. Burke and James T. Kadonaga (November 15, 1997). "The downstream core promoter element, DPE, is conserved from Drosophila to humans and is recognized by TAFII60 of Drosophila". Genes & Development. 11 (22): 3020–31. doi:10.1101/gad.11.22.3020. PMC 316699. PMID 9367984.
  4. Stephen T. Smale and James T. Kadonaga (July 2003). "The RNA Polymerase II Core Promoter" (PDF). Annual Review of Biochemistry. 72 (1): 449–79. doi:10.1146/annurev.biochem.72.121801.161520. PMID 12651739. Retrieved 2012-05-07.
  5. T.W. Burke and James T. Kadonaga (15 March 1996). "Drosophila TFIID binds to a conserved downstream basal promoter element that is present in many TATA-box-deficient promoters" (PDF). Genes & Development. 10 (6): 711–724. doi:10.1101/gad.10.6.711. PMID 8598298.
  6. Kutach, Alan K.; Kadonaga, James T. (July 2000). "The Downstream Promoter Element DPE Appears To Be as Widely Used as the TATA Box in Drosophila Core Promoters". Molecular and Cellular Biology. 20 (13): 4754–4764. doi:10.1128/MCB.20.13.4754-4764.2000. PMC 85905. PMID 10848601.
  7. James T. Kadonaga (September 2002). "The DPE, a core promoter element for transcription by RNA polymerase II" (PDF). Experimental & Molecular Medicine. 34 (4): 259–264. PMID 12515390.
  8. Takuya Matsumoto, Saemi Kitajima, Chisato Yamamoto, Mitsuru Aoyagi, Yoshiharu Mitoma, Hiroyuki Harada and Yuji Nagashima (9 August 2020). "Cloning and tissue distribution of the ATP-binding cassette subfamily G member 2 gene in the marine pufferfish Takifugu rubripes" (PDF). Fisheries Science. 86: 873–887. doi:10.1007/s12562-020-01451-z. Retrieved 27 September 2020.
  9. FlyBase (February 3, 2013). "Antp Antennapedia [ Drosophila melanogaster ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2013-02-07.
  10. HGNC (February 5, 2013). "HOXA7 homeobox A7 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2013-02-07.
  11. HGNC (February 5, 2013). "HOXA9 homeobox A9 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2013-02-07.
