HY box gene transcriptions

Jump to navigation Jump to search

Editor-In-Chief: Henry A. Hoff

"Deletion, mutagenesis, and tandem repeat analyses identified the core responsive element as the region between −89 and −60 bp (termed the hypertrophy box [HY box]), which showed specific binding to RUNX‐2."[1]

The "HY box is the core element responsive to RUNX‐2 in human COL10A1 promoter."[1]

Boxes

A repeating sequence of nucleotides that forms a transcription or a regulatory signal is a box.

Consensus sequences

"Deletion analysis by a series of 5′-deletion constructs identified the responsive region to RUNX-2 as being between −81 bp and −76 bp, containing a putative RUNX-2 binding sequence (TGAGGG), which is similar to that identified in the promoter region of human interleukin-3 (TGTGGG) (33)."[1] This suggests a consensus sequence of 3'-TG(A/T)GGG-5' on the template strand in the direction of transcription.

RUNX2

The gene RUNX2 "is a member of the RUNX family of transcription factors and encodes a nuclear protein with an Runt DNA-binding domain. This protein is essential for osteoblastic differentiation and skeletal morphogenesis and acts as a scaffold for nucleic acids and regulatory factors involved in skeletal gene expression. The protein can bind DNA both as a monomer or, with more affinity, as a subunit of a heterodimeric complex."[2]

COL10A1

The gene COL10A1 "encodes the alpha chain of type X collagen, a short chain collagen expressed by hypertrophic chondrocytes during endochondral ossification. Unlike type VIII collagen, the other short chain collagen, type X collagen is a homotrimer."[3]

Human COL10A1, GeneID: 1300, has an HY box as the core responsive element.[1]

Hypotheses

  1. A1BG is not transcribed by an HY box.

HY box samplings

For the Basic programs (starting with SuccessablesHY.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand in the negative direction is SuccessablesHY--.bas, looking for TG(A/T)GGG, 1, TGTGGG at 749.
  2. negative strand in the positive direction is SuccessablesHY-+.bas, looking for TG(A/T)GGG, 5, TGTGGG at 4395, TGAGGG at 3906, TGAGGG at 3879, TGAGGG at 3479, TGAGGG at 258.
  3. positive strand in the negative direction is SuccessablesHY+-.bas, looking for TG(A/T)GGG, 5, TGAGGG at 4558, TGTGGG at 3712, TGAGGG at 3652, TGAGGG at 2699, TGAGGG at 88.
  4. positive strand in the positive direction is SuccessablesHY++.bas, looking for TG(A/T)GGG, 2, TGTGGG at 3533, TGTGGG at 2965.
  5. inverse complement, negative strand, negative direction is SuccessablesHYci--.bas, looking for CCC(A/T)CA, 4, CCCTCA at 4498, CCCTCA at 3889, CCCACA at 3184, CCCTCA at 2702.
  6. inverse complement, negative strand, positive direction is SuccessablesHYci-+.bas, looking for CCC(A/T)CA, 3, CCCTCA at 3503, CCCTCA at 3207, CCCTCA at 88.
  7. inverse complement, positive strand, negative direction is SuccessablesHYci+-.bas, looking for CCC(A/T)CA, 0,
  8. inverse complement, positive strand, positive direction is SuccessablesHYci++.bas, looking for CCC(A/T)CA, 5, CCCTCA at 3185, CCCACA at 1803, CCCTCA at 1783, CCCTCA at 662, CCCTCA at 494.

HY box UTRs

  1. Negative strand, negative direction: CCCTCA at 4498, CCCTCA at 3889, CCCACA at 3184.
  2. Positive strand, negative direction: TGAGGG at 4558, TGTGGG at 3712, TGAGGG at 3652.

HY box core promoters

  1. Negative strand, positive direction: TGTGGG at 4395.

HY box proximal promoters

  1. Negative strand, negative direction: CCCTCA at 2702.
  2. Positive strand, negative direction: TGAGGG at 2699.

HY box distal promoters

  1. Negative strand, negative direction: TGTGGG at 749.
  2. Positive strand, negative direction: TGAGGG at 88.
  1. Negative strand, positive direction: TGAGGG at 3906, TGAGGG at 3879, CCCTCA at 3503, TGAGGG at 3479, CCCTCA at 3207, TGAGGG at 258, CCCTCA at 88.
  2. Positive strand, positive direction: TGTGGG at 3533, CCCTCA at 3185, TGTGGG at 2965, CCCACA at 1803, CCCTCA at 1783, CCCTCA at 662, CCCTCA at 494.

HY boxes random dataset samplings

  1. HYboxr0: 0.
  2. HYboxr1: 2, TGTGGG at 873, TGTGGG at 699.
  3. HYboxr2: 4, TGAGGG at 2839, TGAGGG at 2396, TGTGGG at 2001, TGAGGG at 170.
  4. HYboxr3: 4, TGTGGG at 3855, TGAGGG at 2864, TGAGGG at 2319, TGTGGG at 513.
  5. HYboxr4: 2, TGTGGG at 2762, TGTGGG at 1825.
  6. HYboxr5: 4, TGAGGG at 3839, TGAGGG at 2381, TGTGGG at 1723, TGAGGG at 517.
  7. HYboxr6: 0.
  8. HYboxr7: 5, TGAGGG at 4318, TGTGGG at 4228, TGAGGG at 1728, TGAGGG at 1189, TGAGGG at 631.
  9. HYboxr8: 2, TGAGGG at 4205, TGAGGG at 2864.
  10. HYboxr9: 2, TGTGGG at 4391, TGAGGG at 227.
  11. HYboxr0ci: 3, CCCACA at 3394, CCCTCA at 2874, CCCTCA at 588.
  12. RDr1ci: 0.
  13. RDr2ci: 0.
  14. RDr3ci: 0.
  15. RDr4ci: 0.
  16. RDr5ci: 0.
  17. RDr6ci: 0.
  18. RDr7ci: 0.
  19. RDr8ci: 0.
  20. RDr9ci: 0.

HYboxr arbitrary UTRs

  1. HYboxr8: TGAGGG at 4205, TGAGGG at 2864.

HYboxr alternate UTRs

  1. HYboxr3: TGTGGG at 3855, TGAGGG at 2864.
  2. HYboxr5: TGAGGG at 3839.
  3. HYboxr7: TGAGGG at 4318, TGTGGG at 4228.
  4. HYboxr9: TGTGGG at 4391.
  5. HYboxr0ci: CCCACA at 3394, CCCTCA at 2874.

HYboxr arbitrary negative direction core promoters

  1. HYboxr2: TGAGGG at 2839.

RDr alternate negative direction core promoters

HYboxr arbitrary positive direction core promoters

  1. HYboxr7: TGAGGG at 4318.
  2. HYboxr9: TGTGGG at 4391.

RDr alternate positive direction core promoters

HYboxr arbitrary negative direction proximal promoters

  1. HYboxr4: TGTGGG at 2762.

RDr alternate negative direction proximal promoters

HYboxr arbitrary positive direction proximal promoters

  1. HYboxr7: TGTGGG at 4228.

HYboxr alternate positive direction proximal promoters

  1. HYboxr8: TGAGGG at 4205.

HYboxr arbitrary negative direction distal promoters

  1. HYboxr2: TGAGGG at 2396, TGTGGG at 2001, TGAGGG at 170.
  2. HYboxr4: TGTGGG at 1825.
  3. HYboxr0ci: CCCTCA at 588.

HYboxr alternate negative direction distal promoters

  1. HYboxr3: TGAGGG at 2319, TGTGGG at 513.
  2. HYboxr5: TGAGGG at 2381, TGTGGG at 1723, TGAGGG at 517.
  3. HYboxr7: TGAGGG at 1728, TGAGGG at 1189, TGAGGG at 631.
  4. HYboxr9: TGAGGG at 227.

HYboxr arbitrary positive direction distal promoters

  1. HYboxr1: TGTGGG at 873, TGTGGG at 699.
  2. HYboxr3: TGTGGG at 3855, TGAGGG at 2864, TGAGGG at 2319, TGTGGG at 513.
  3. HYboxr5: TGAGGG at 3839, TGAGGG at 2381, TGTGGG at 1723, TGAGGG at 517.
  4. HYboxr7: TGAGGG at 1728, TGAGGG at 1189, TGAGGG at 631.
  5. HYboxr9: TGAGGG at 227.

HYboxr alternate positive direction distal promoters

  1. HYboxr2: TGAGGG at 2839, TGAGGG at 2396, TGTGGG at 2001, TGAGGG at 170.
  2. HYboxr4: TGTGGG at 2762, TGTGGG at 1825.
  3. HYboxr8: TGAGGG at 2864.
  4. HYboxr0ci: CCCACA at 3394, CCCTCA at 2874, CCCTCA at 588.

HY boxes analysis and results

"Deletion analysis by a series of 5′-deletion constructs identified the responsive region to RUNX-2 as being between −81 bp and −76 bp, containing a putative RUNX-2 binding sequence (TGAGGG), which is similar to that identified in the promoter region of human interleukin-3 (TGTGGG) (33)."[1] This suggests a consensus sequence of TG(A/T)GGG.

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 6 2 3 3
Randoms UTR arbitrary negative 0 10 0 0
Randoms UTR alternate negative 0 10 0 0
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 1 2 0.5 0.5
Randoms Core arbitrary positive 0 10 0 0
Randoms Core alternate positive 0 10 0 0
Reals Proximal negative 2 2 1 1
Randoms Proximal arbitrary negative 0 10 0 0
Randoms Proximal alternate negative 0 10 0 0
Reals Proximal positive 0 2 0 0
Randoms Proximal arbitrary positive 0 10 0 0
Randoms Proximal alternate positive 0 10 0 0
Reals Distal negative 2 2 1 1
Randoms Distal arbitrary negative 0 10 0 0
Randoms Distal alternate negative 0 10 0 0
Reals Distal positive 14 2 7 7
Randoms Distal arbitrary positive 0 10 0 0
Randoms Distal alternate positive 0 10 0 0

Comparison:

The occurrences of real responsive element consensus sequences are larger than the randoms. This suggests that the real responsive element consensus sequences are likely active or activable.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

See also

References

  1. 1.0 1.1 1.2 1.3 1.4 Akiro Higashikawa, Taku Saito, Toshiyuki Ikeda, Satoru Kamekura, Naohiro Kawamura, Akinori Kan, Yasushi Oshima, Shinsuke Ohba, Naoshi Ogata, Katsushi Takeshita, Kozo Nakamura, Ung-Il Chung, Hiroshi Kawaguchi (2009). "Identification of the core element responsive to runt-related transcription factor 2 in the promoter of human type x collagen gene". Arthritis & Rheumatism. 60 (1): 166–78. doi:10.1002/art.24243. PMID 19116917. Retrieved 2013-06-18. Unknown parameter |month= ignored (help)
  2. "RUNX2 runt-related transcription factor 2 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. 2013. Retrieved 2013-06-18. Unknown parameter |month= ignored (help)
  3. HGNC (June 9, 2013). "COL10A1 collagen, type X, alpha 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2013-06-18.

External links