TATA box human genes: Difference between revisions

Jump to navigation Jump to search
 
(199 intermediate revisions by the same user not shown)
Line 22: Line 22:
|doi=10.1146/annurev.biochem.72.121801.161520
|doi=10.1146/annurev.biochem.72.121801.161520
|pmid=12651739
|pmid=12651739
|accessdate=2012-05-07 }}</ref> approximately 24% of human genes contain a TATA box within the core promoter.<ref name=Yang>{{ cite journal
|accessdate=2012-05-07 }}</ref> approximately 24% of human genes contain a TATA box within the core promoter.<ref name=Yang2007>{{ cite journal
| doi = 10.1016/j.gene.2006.09.029
| doi = 10.1016/j.gene.2006.09.029
| author = C Yang, E Bolotin, T Jiang, FM Sladek, E Martinez
| author = C Yang, E Bolotin, T Jiang, FM Sladek, E Martinez
Line 37: Line 37:
{{main|Human genes}}
{{main|Human genes}}


"TATA-containing genes are more often highly regulated, such as by biotic or stress stimuli."<ref name=Yang>{{ cite journal
"TATA-containing genes are more often highly regulated, such as by biotic or stress stimuli."<ref name=Yang2006>{{ cite journal
|author=Chuhu Yang, Eugene Bolotin, Tao Jiang, Frances M. Sladek, and Ernest Martinez
|author=Chuhu Yang, Eugene Bolotin, Tao Jiang, Frances M. Sladek, and Ernest Martinez
|title=Prevalence of the Initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters
|title=Prevalence of the Initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters
Line 50: Line 50:
|doi=10.1016/j.gene.2006.09.029
|doi=10.1016/j.gene.2006.09.029
|pmid=17123746
|pmid=17123746
|accessdate=2024-06-07 }}</ref> Only "∼10% of these TATA-containing promoters have the canonical TATA box (TATAWAWR)."<ref name=Yang/>
|accessdate=2024-06-07 }}</ref> Only "∼10% of these TATA-containing promoters have the canonical TATA box (TATAWAWR)."<ref name=Yang2006/>


"SRF-regulated genes of the actin/cytoskeleton/contractile family tend to have a TATA box."<ref name=Xu>{{ cite journal
"SRF-regulated genes of the actin/cytoskeleton/contractile family tend to have a TATA box."<ref name=Xu>{{ cite journal
Line 96: Line 96:


===Gene ID: 19===
===Gene ID: 19===
"The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. With cholesterol as its substrate, this protein functions as a cholesteral efflux pump in the cellular lipid removal pathway. Mutations in both alleles of this gene cause Tangier disease and familial high-density lipoprotein (HDL) deficiency."<ref name=RefSeq19>{{ cite web
|author=RefSeq
|title=ABCA1 ATP binding cassette subfamily A member 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2019
|url=http://www.ncbi.nlm.nih.gov/gene/19
|accessdate=2024-06-19 }}</ref>


===Gene ID: 58===
===Gene ID: 58===
Line 109: Line 118:


===Gene ID: 60===
===Gene ID: 60===
"This gene encodes one of six different actin proteins. Actins are highly conserved proteins that are involved in cell motility, structure, integrity, and intercellular signaling. The encoded protein is a major constituent of the contractile apparatus and one of the two nonmuscle cytoskeletal actins that are ubiquitously expressed. Mutations in this gene cause Baraitser-Winter syndrome 1, which is characterized by intellectual disability with a distinctive facial appearance in human patients. Numerous pseudogenes of this gene have been identified throughout the human genome."<ref name=RefSeq60>{{ cite web
|author=RefSeq
|title=ACTB actin beta [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2017
|url=http://www.ncbi.nlm.nih.gov/gene/60
|accessdate=2024-06-19 }}</ref>


===Gene ID: 70===
===Gene ID: 70===
Line 133: Line 151:


===Gene ID: 173===
===Gene ID: 173===
"This gene is a member of the albumin gene family, which is comprised of four genes that localize to chromosome 4 in a tandem arrangement. These four genes encode structurally-related serum transport proteins that are known to be evolutionarily related. The protein encoded by this gene is regulated developmentally, expressed in the liver and secreted into the bloodstream."<ref name=RefSeq173>{{ cite web
|author=RefSeq
|title=AFM afamin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/173
|accessdate=2024-06-18 }}</ref>


===Gene ID: 174===
===Gene ID: 174===
"This gene encodes alpha-fetoprotein, a major plasma protein produced by the yolk sac and the liver during fetal life. Alpha-fetoprotein expression in adults is often associated with hepatocarcinoma and with teratoma, and has prognostic value for managing advanced gastric cancer. However, hereditary persistance of alpha-fetoprotein may also be found in individuals with no obvious pathology. The protein is thought to be the fetal counterpart of serum albumin, and the alpha-fetoprotein and albumin genes are present in tandem in the same transcriptional orientation on chromosome 4. Alpha-fetoprotein is found in monomeric as well as dimeric and trimeric forms, and binds copper, nickel, fatty acids and bilirubin. The level of alpha-fetoprotein in amniotic fluid is used to measure renal loss of protein to screen for spina bifida and anencephaly."<ref name=RefSeq174>{{ cite web
|author=RefSeq
|title=AFP alpha fetoprotein [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=October 2019
|url=http://www.ncbi.nlm.nih.gov/gene/174
|accessdate=2024-06-18 }}</ref>


===Gene ID: 183===
===Gene ID: 183===
Line 171: Line 207:
===Gene ID: 229===
===Gene ID: 229===


===Gene ID: 230===
"Fructose-1,6-bisphosphate aldolase (EC 4.1.2.13) is a tetrameric glycolytic enzyme that catalyzes the reversible conversion of fructose-1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. Vertebrates have 3 aldolase isozymes which are distinguished by their electrophoretic and catalytic properties. Differences indicate that aldolases A, B, and C are distinct proteins, the products of a family of related 'housekeeping' genes exhibiting developmentally regulated expression of the different isozymes. The developing embryo produces aldolase A, which is produced in even greater amounts in adult muscle where it can be as much as 5% of total cellular protein. In adult liver, kidney and intestine, aldolase A expression is repressed and aldolase B is produced. In brain and other nervous tissue, aldolase A and C are expressed about equally. There is a high degree of homology between aldolase A and C. Defects in ALDOB cause hereditary fructose intolerance."<ref name=RefSeq229>{{ cite web
 
===Gene ID: 249===
 
"This gene encodes a member of the alkaline phosphatase family of proteins. There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2, while the tissue non-specific form is located on chromosome 1. The product of this gene is a membrane bound glycosylated enzyme that is not expressed in any particular tissue and is, therefore, referred to as the tissue-nonspecific form of the enzyme. Alternative splicing results in multiple transcript variants, at least one of which encodes a preproprotein that is proteolytically processed to generate the mature enzyme. This enzyme may play a role in bone mineralization. Mutations in this gene have been linked to hypophosphatasia, a disorder that is characterized by hypercalcemia and skeletal defects."<ref name=RefSeq249>{{ cite web
|author=RefSeq
|author=RefSeq
|title=ALPL alkaline phosphatase, biomineralization associated [ Homo sapiens ]
|title=ALDOB aldolase, fructose-bisphosphate B [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2008
|url=http://www.ncbi.nlm.nih.gov/gene/229
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 230===
 
"This gene encodes a member of the class I fructose-biphosphate aldolase gene family. Expressed specifically in the hippocampus and Purkinje cells of the brain, the encoded protein is a glycolytic enzyme that catalyzes the reversible aldol cleavage of fructose-1,6-biphosphate and fructose 1-phosphate to dihydroxyacetone phosphate and either glyceraldehyde-3-phosphate or glyceraldehyde, respectively."<ref name=RefSeq230>{{ cite web
|author=RefSeq
|title=ALDOC aldolase, fructose-bisphosphate C [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/230
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 249===
 
"This gene encodes a member of the alkaline phosphatase family of proteins. There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2, while the tissue non-specific form is located on chromosome 1. The product of this gene is a membrane bound glycosylated enzyme that is not expressed in any particular tissue and is, therefore, referred to as the tissue-nonspecific form of the enzyme. Alternative splicing results in multiple transcript variants, at least one of which encodes a preproprotein that is proteolytically processed to generate the mature enzyme. This enzyme may play a role in bone mineralization. Mutations in this gene have been linked to hypophosphatasia, a disorder that is characterized by hypercalcemia and skeletal defects."<ref name=RefSeq249>{{ cite web
|author=RefSeq
|title=ALPL alkaline phosphatase, biomineralization associated [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
Line 183: Line 237:
|url=http://www.ncbi.nlm.nih.gov/gene/249
|url=http://www.ncbi.nlm.nih.gov/gene/249
|accessdate=2024-06-09 }}</ref> It has a TATA box (TATAAGG) from -31 to -25 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-09 }}</ref> It has a TATA box (TATAAGG) from -31 to -25 nts from the TSS.<ref name=Jin/>
===Gene ID: 262===
"This gene encodes an important intermediate enzyme in polyamine biosynthesis. The polyamines spermine, spermidine, and putrescine are low-molecular-weight aliphatic amines essential for cellular proliferation and tumor promotion. Multiple alternatively spliced transcript variants have been identified. Pseudogenes of this gene are found on chromosomes 5, 6, 10, X and Y."<ref name=RefSeq262>{{ cite web
|author=RefSeq
|title=AMD1 adenosylmethionine decarboxylase 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2013
|url=http://www.ncbi.nlm.nih.gov/gene/262
|accessdate=2024-06-20 }}</ref>


===Gene ID: 265===
===Gene ID: 265===
Line 230: Line 295:
===Gene ID: 301===
===Gene ID: 301===


===Gene ID: 302===
"This gene encodes a membrane-localized protein that binds phospholipids. This protein inhibits phospholipase A2 and has anti-inflammatory activity. Loss of function or expression of this gene has been detected in multiple tumors."<ref name=RefSeq301>{{ cite web
 
===Gene ID: 314===
 
"Copper amine oxidases catalyze the oxidative conversion of amines to aldehydes and ammonia in the presence of copper and quinone cofactor. This gene shows high sequence similarity to copper amine oxidases from various species ranging from bacteria to mammals. The protein contains several conserved motifs including the active site of amine oxidases and the histidine residues that likely bind copper. It may be a critical modulator of signal transmission in retina, possibly by degrading the biogenic amines dopamine, histamine, and putrescine. This gene may be a candidate gene for hereditary ocular diseases. Alternate splicing results in multiple transcript variants."<ref name=RefSeq314>{{ cite web
|author=RefSeq
|author=RefSeq
|title=AOC2 amine oxidase copper containing 2 [ Homo sapiens ]
|title=ALDOC aldolase, fructose-bisphosphate C [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2014
|url=http://www.ncbi.nlm.nih.gov/gene/301
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 302===
 
"This gene encodes a member of the annexin family. Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways. This protein functions as an autocrine factor which heightens osteoclast formation and bone resorption. This gene has three pseudogenes located on chromosomes 4, 9 and 10, respectively. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene. Annexin A2 expression has been found to correlate with resistance to treatment against various cancer forms."<ref name=RefSeq302>{{ cite web
|author=RefSeq
|title=ANXA2 annexin A2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2019
|url=http://www.ncbi.nlm.nih.gov/gene/302
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 314===
 
"Copper amine oxidases catalyze the oxidative conversion of amines to aldehydes and ammonia in the presence of copper and quinone cofactor. This gene shows high sequence similarity to copper amine oxidases from various species ranging from bacteria to mammals. The protein contains several conserved motifs including the active site of amine oxidases and the histidine residues that likely bind copper. It may be a critical modulator of signal transmission in retina, possibly by degrading the biogenic amines dopamine, histamine, and putrescine. This gene may be a candidate gene for hereditary ocular diseases. Alternate splicing results in multiple transcript variants."<ref name=RefSeq314>{{ cite web
|author=RefSeq
|title=AOC2 amine oxidase copper containing 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
Line 266: Line 349:


===Gene ID: 345===
===Gene ID: 345===
"This gene encodes a protein component of triglyceride (TG)-rich lipoproteins (TRLs) including very low density lipoproteins (VLDL), high density lipoproteins (HDL) and chylomicrons. The encoded protein plays a role in role in the metabolism of these TRLs through multiple modes. This protein has been shown to promote the secretion of VLDL1, inhibit lipoprotein lipase enzyme activity, and delay catabolism of TRL remnants. Mutations in this gene are associated with low plasma triglyceride levels and reduced risk of ischemic cardiovascular disease, and hyperalphalipoproteinemia, which is characterized by elevated levels of high density lipoprotein (HDL) and HDL cholesterol in human patients. This gene and other related genes comprise an apolipoprotein gene cluster on chromosome 11."<ref name=RefSeq345>{{ cite web
|author=RefSeq
|title=APOC3 apolipoprotein C3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2017
|url=http://www.ncbi.nlm.nih.gov/gene/345
|accessdate=2024-06-18 }}</ref>


===Gene ID: 358===
===Gene ID: 358===
Line 291: Line 383:
===Gene ID: 360===
===Gene ID: 360===


===Gene ID: 374===
"This gene encodes the water channel protein aquaporin 3. Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein, also known as aquaporin 0. Aquaporin 3 is localized at the basal lateral membranes of collecting duct cells in the kidney. In addition to its water channel function, aquaporin 3 has been found to facilitate the transport of nonionic small solutes such as urea and glycerol, but to a smaller degree. It has been suggested that water channels can be functionally heterogeneous and possess water and solute permeation mechanisms. Alternative splicing of this gene results in multiple transcript variants encoding different isoforms."<ref name=RefSeq360>{{ cite web
 
|author=RefSeq
|title=AQP3 aquaporin 3 (Gill blood group) [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2015
|url=http://www.ncbi.nlm.nih.gov/gene/360
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 374===
 
"The protein encoded by this gene is a member of the epidermal growth factor family. It is an autocrine growth factor as well as a mitogen for astrocytes, Schwann cells and fibroblasts. It is related to epidermal growth factor (EGF) and transforming growth factor alpha (TGF-alpha). The protein interacts with the EGF/TGF-alpha receptor to promote the growth of normal epithelial cells, and it inhibits the growth of certain aggressive carcinoma cell lines. It also functions in mammary gland, oocyte and bone tissue development. This gene is associated with a psoriasis-like skin phenotype, and is also associated with other pathological disorders, including various types of cancers and inflammatory conditions."<ref name=RefSeq374>{{ cite web
|author=RefSeq
|title=AREG amphiregulin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2014
|url=http://www.ncbi.nlm.nih.gov/gene/374
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 383===
===Gene ID: 383===
"Arginase catalyzes the hydrolysis of arginine to ornithine and urea. At least two isoforms of mammalian arginase exist (types I and II) which differ in their tissue distribution, subcellular localization, immunologic crossreactivity and physiologic function. The type I isoform encoded by this gene, is a cytosolic enzyme and expressed predominantly in the liver as a component of the urea cycle. Inherited deficiency of this enzyme results in argininemia, an autosomal recessive disorder characterized by hyperammonemia. Two transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq383>{{ cite web
|author=RefSeq
|title=ARG1 arginase 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2011
|url=http://www.ncbi.nlm.nih.gov/gene/383
|accessdate=2024-06-18 }}</ref>


===Gene ID: 390===
===Gene ID: 390===
"This gene encodes a protein which is a member of the small GTPase protein superfamily. The encoded protein binds only GTP but has no GTPase activity, and appears to act as a negative regulator of cytoskeletal organization leading to loss of adhesion. Multiple alternatively spliced variants, encoding the same protein, have been identified."<ref name=RefSeq390>{{ cite web
|author=RefSeq
|title=RND3 Rho family GTPase 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2011
|url=http://www.ncbi.nlm.nih.gov/gene/390
|accessdate=2024-06-18 }}</ref>


===Gene ID: 467===
===Gene ID: 467===
Line 309: Line 437:


===Gene ID: 468===
===Gene ID: 468===
"This gene encodes a transcription factor that was originally identified as a widely expressed mammalian DNA binding protein that could bind a tax-responsive enhancer element in the LTR of HTLV-1. The encoded protein was also isolated and characterized as the cAMP-response element binding protein 2 (CREB-2). The protein encoded by this gene belongs to a family of DNA-binding proteins that includes the AP-1 family of transcription factors, cAMP-response element binding proteins (CREBs) and CREB-like proteins. These transcription factors share a leucine zipper region that is involved in protein-protein interactions, located C-terminal to a stretch of basic amino acids that functions as a DNA binding domain. Two alternative transcripts encoding the same protein have been described. Two pseudogenes are located on the X chromosome at q28 in a region containing a large inverted duplication."<ref name=RefSeq468>{{ cite web
|author=RefSeq
|title=ATF4 activating transcription factor 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2011
|url=http://www.ncbi.nlm.nih.gov/gene/468
|accessdate=2024-06-18 }}</ref>


===Gene ID: 481===
===Gene ID: 481===
Line 322: Line 459:


===Gene ID: 482===
===Gene ID: 482===
"The protein encoded by this gene belongs to the family of Na+/K+ and H+/K+ ATPases beta chain proteins, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The beta subunit regulates, through assembly of alpha/beta heterodimers, the number of sodium pumps transported to the plasma membrane. The glycoprotein subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes a beta 2 subunit. Two transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq482>{{ cite web
|author=RefSeq
|title=ATP1B2 ATPase Na+/K+ transporting subunit beta 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2014
|url=http://www.ncbi.nlm.nih.gov/gene/482
|accessdate=2024-06-18 }}</ref>


===Gene ID: 496===
===Gene ID: 496===
"The protein encoded by this gene belongs to a family of P-type cation-transporting ATPases. The gastric H+, K+-ATPase is a heterodimer consisting of a high molecular weight catalytic alpha subunit and a smaller but heavily glycosylated beta subunit. This enzyme is a proton pump that catalyzes the hydrolysis of ATP coupled with the exchange of H(+) and K(+) ions across the plasma membrane. It is also responsible for gastric acid secretion. This gene encodes the beta subunit of the gastric H+, K+-ATPase."<ref name=RefSeq496>{{ cite web
|author=RefSeq
|title=ATP4B ATPase H+/K+ transporting subunit beta [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/496
|accessdate=2024-06-18 }}</ref>


===Gene ID: 515===
===Gene ID: 515===
Line 335: Line 490:
|url=http://www.ncbi.nlm.nih.gov/gene/515
|url=http://www.ncbi.nlm.nih.gov/gene/515
|accessdate=2024-06-09 }}</ref> ATO5PB aka ATP5F1.<ref name=RefSeq515/> It has a TATA box (TTTAAAA) from -34 to -28 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-09 }}</ref> ATO5PB aka ATP5F1.<ref name=RefSeq515/> It has a TATA box (TTTAAAA) from -34 to -28 nts from the TSS.<ref name=Jin/>
===Gene ID: 604===
"The protein encoded by this gene is a zinc finger transcription factor and contains an N-terminal POZ domain. This protein acts as a sequence-specific repressor of transcription, and has been shown to modulate the transcription of STAT-dependent IL-4 responses of B cells. This protein can interact with a variety of POZ-containing proteins that function as transcription corepressors. This gene is found to be frequently translocated and hypermutated in diffuse large-cell lymphoma (DLCL), and may be involved in the pathogenesis of DLCL. Alternatively spliced transcript variants encoding different protein isoforms have been found for this gene."<ref name=RefSeq604>{{ cite web
|author=RefSeq
|title=BCL6 BCL6 transcription repressor [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2015
|url=http://www.ncbi.nlm.nih.gov/gene/604
|accessdate=2024-06-20 }}</ref>


===Gene ID: 676===
===Gene ID: 676===
Line 348: Line 514:


===Gene ID: 677===
===Gene ID: 677===
"This gene is a member of the TIS11 family of early response genes, which are induced by various agonists such as the phorbol ester TPA and the polypeptide mitogen EGF. This gene is well conserved across species and has a promoter that contains motifs seen in other early-response genes. The encoded protein contains a distinguishing putative zinc finger domain with a repeating cys-his motif. This putative nuclear transcription factor most likely functions in regulating the response to growth factors. Alternatively spliced transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq677>{{ cite web
|author=RefSeq
|title=ZFP36L1 ZFP36 ring finger protein like 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2011
|url=http://www.ncbi.nlm.nih.gov/gene/677
|accessdate=2024-06-18 }}</ref>


===Gene ID: 694===
===Gene ID: 694===


===Gene ID: 759===
"This gene is a member of an anti-proliferative gene family that regulates cell growth and differentiation. Expression of this gene is highest in the G0/G1 phases of the cell cycle and downregulated when cells progressed through G1. The encoded protein interacts with several nuclear receptors, and functions as a coactivator of cell differentiation. This locus has been shown to be involved in a t(8;12)(q24;q22) chromosomal translocation in a case of B-cell chronic lymphocytic leukemia."<ref name=RefSeq694>{{ cite web
 
|author=RefSeq
===Gene ID: 760===
|title=BTG1 BTG anti-proliferation factor 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=October 2008
|url=http://www.ncbi.nlm.nih.gov/gene/694
|accessdate=2024-06-18 }}</ref>


===Gene ID: 762===
===Gene ID: 759===


===Gene ID: 767===
"Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva and gastric acid. They show extensive diversity in tissue distribution and in their subcellular localization. This CA1 gene is closely linked to the CA2 and CA3 genes on chromosome 8. It encodes a cytosolic protein that is found at the highest level in erythrocytes. Allelic variants of this gene have been described in some populations. Alternative splicing and the use of alternative promoters results in multiple transcript variants."<ref name=RefSeq759>{{ cite web
|author=RefSeq
|title=CA1 carbonic anhydrase 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=November 2016
|url=http://www.ncbi.nlm.nih.gov/gene/759
|accessdate=2024-06-18 }}</ref>


===Gene ID: 811===
===Gene ID: 760===


===Gene ID: 846===
"The protein encoded by this gene is one of several isozymes of carbonic anhydrase, which catalyzes reversible hydration of carbon dioxide. Defects in this enzyme are associated with osteopetrosis and renal tubular acidosis. Two transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq760>{{ cite web
|author=RefSeq
|title=CA2 carbonic anhydrase 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=June 2014
|url=http://www.ncbi.nlm.nih.gov/gene/760
|accessdate=2024-06-18 }}</ref>


===Gene ID: 919===
===Gene ID: 762===


"The protein encoded by this gene is T-cell receptor zeta, which together with T-cell receptor alpha/beta and gamma/delta heterodimers, and with CD3-gamma, -delta and -epsilon, forms the T-cell receptor-CD3 complex. The zeta chain plays an important role in coupling antigen recognition to several intracellular signal-transduction pathways. Low expression of the antigen results in impaired immune response. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene."<ref name=RefSeq919>{{ cite web
"Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid. They show extensive diversity in tissue distribution and in their subcellular localization. This gene encodes a glycosylphosphatidyl-inositol-anchored membrane isozyme expressed on the luminal surfaces of pulmonary (and certain other) capillaries and proximal renal tubules. Its exact function is not known; however, it may have a role in inherited renal abnormalities of bicarbonate transport."<ref name=RefSeq762>{{ cite web
|author=RefSeq
|author=RefSeq
|title=CD247 CD247 molecule [ Homo sapiens ]
|title=CA4 carbonic anhydrase 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=July 2008
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/919
|url=http://www.ncbi.nlm.nih.gov/gene/762
|accessdate=2024-06-09 }}</ref> It has a TATA box (AATAAAA) from -31 to -25 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1051===
===Gene ID: 767===


===Gene ID: 1101===
"The protein encoded by this gene was initially named CA-related protein because of sequence similarity to other known carbonic anhydrase genes. However, the gene product lacks carbonic anhydrase activity (i.e., the reversible hydration of carbon dioxide). The gene product continues to carry a carbonic anhydrase designation based on clear sequence identity to other members of the carbonic anhydrase gene family. The absence of CA8 gene transcription in the cerebellum of the lurcher mutant in mice with a neurologic defect suggests an important role for this acatalytic form. Mutations in this gene are associated with cerebellar ataxia, mental retardation, and dysequilibrium syndrome 3 (CMARQ3). Polymorphisms in this gene are associated with osteoporosis, and overexpression of this gene in osteosarcoma cells suggests an oncogenic role. Alternative splicing results in multiple transcript variants."<ref name=RefSeq767>{{ cite web
|author=RefSeq
|title=CA8 carbonic anhydrase 8 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=March 2016
|url=http://www.ncbi.nlm.nih.gov/gene/767
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1116===
===Gene ID: 794===


"Chitinases catalyze the hydrolysis of chitin, which is an abundant glycopolymer found in insect exoskeletons and fungal cell walls. The glycoside hydrolase 18 family of chitinases includes eight human family members. This gene encodes a glycoprotein member of the glycosyl hydrolase 18 family. The protein lacks chitinase activity and is secreted by activated macrophages, chondrocytes, neutrophils and synovial cells. The protein is thought to play a role in the process of inflammation and tissue remodeling."<ref name=RefSeq1116>{{ cite web
"This gene encodes an intracellular calcium-binding protein belonging to the troponin C superfamily. Members of this protein family have six EF-hand domains which bind calcium. This protein plays a role in diverse cellular functions, including message targeting and intracellular calcium buffering. It also functions as a modulator of neuronal excitability, and is a diagnostic marker for some human diseases, including Hirschsprung disease and some cancers. Alternative splicing results in multiple transcript variants."<ref name=RefSeq794>{{ cite web
|author=RefSeq
|author=RefSeq
|title=CHI3L1 chitinase 3 like 1 [ Homo sapiens ]
|title=CALB2 calbindin 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=September 2009
|date=June 2010
|url=http://www.ncbi.nlm.nih.gov/gene/1116
|url=http://www.ncbi.nlm.nih.gov/gene/794
|accessdate=2024-06-09 }}</ref> It has a TATA box (CATAAAA) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>


===Gene ID: 1152===
===Gene ID: 811===


===Gene ID: 1160===
"Calreticulin is a highly conserved chaperone protein which resides primarily in the endoplasmic reticulum, and is involved in a variety of cellular processes, among them, cell adhesion. Additionally, it functions in protein folding quality control and calcium homeostasis. Calreticulin is also found in the nucleus, suggesting that it may have a role in transcription regulation. Systemic lupus erythematosus is associated with increased autoantibody titers against calreticulin. Recurrent mutations in calreticulin have been linked to various neoplasms, including the myeloproliferative type."<ref name=RefSeq811>{{ cite web
|author=RefSeq
|title=CALR calreticulin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=May 2020
|url=http://www.ncbi.nlm.nih.gov/gene/811
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1180===
===Gene ID: 846===


===Gene ID: 1188===
"The protein encoded by this gene is a plasma membrane G protein-coupled receptor that senses small changes in circulating calcium concentration. The encoded protein couples this information to intracellular signaling pathways that modify parathyroid hormone secretion or renal cation handling, and thus this protein plays an essential role in maintaining mineral ion homeostasis. Mutations in this gene are a cause of familial hypocalciuric hypercalcemia, neonatal severe hyperparathyroidism, and autosomal dominant hypocalcemia."<ref name=RefSeq846>{{ cite web
 
"The protein encoded by this gene is a member of the family of voltage-gated chloride channels. Chloride channels have several functions, including the regulation of cell volume, membrane potential stabilization, signal transduction and transepithelial transport. This gene is expressed predominantly in the kidney and may be important for renal salt reabsorption. Mutations in this gene are associated with autosomal recessive Bartter syndrome type 3 (BS3). Alternatively spliced transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq1188>{{ cite web
|author=RefSeq
|author=RefSeq
|title=CLCNKB chloride voltage-gated channel Kb [ Homo sapiens ]
|title=CASR calcium sensing receptor [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=September 2009
|date=August 2017
|url=http://www.ncbi.nlm.nih.gov/gene/1188
|url=http://www.ncbi.nlm.nih.gov/gene/846
|accessdate=2024-06-10 }}</ref> It has a TATA box (CATAAAC) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 919===


===Gene ID: 1191===
"The protein encoded by this gene is T-cell receptor zeta, which together with T-cell receptor alpha/beta and gamma/delta heterodimers, and with CD3-gamma, -delta and -epsilon, forms the T-cell receptor-CD3 complex. The zeta chain plays an important role in coupling antigen recognition to several intracellular signal-transduction pathways. Low expression of the antigen results in impaired immune response. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene."<ref name=RefSeq919>{{ cite web
|author=RefSeq
|title=CD247 CD247 molecule [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/919
|accessdate=2024-06-09 }}</ref> It has a TATA box (AATAAAA) from -31 to -25 nts from the TSS.<ref name=Jin/>


===Gene ID: 1231===
===Gene ID: 1044===


===Gene ID: 1278===
"This gene is a member of the caudal-related homeobox transcription factor gene family. The encoded DNA-binding protein regulates intestine-specific gene expression and enterocyte differentiation. It has been shown to induce expression of the intestinal alkaline phosphatase gene, and inhibit beta-catenin/T-cell factor transcriptional activity."<ref name=RefSeq1044>{{ cite web
|author=RefSeq
|title=CDX1 caudal type homeobox 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1044
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1382===
===Gene ID: 1051===


"This gene encodes a member of the retinoic acid (RA, a form of vitamin A) binding protein family and lipocalin/cytosolic fatty-acid binding protein family. The protein is a cytosol-to-nuclear shuttling protein, which facilitates RA binding to its cognate receptor complex and transfer to the nucleus. It is involved in the retinoid signaling pathway, and is associated with increased circulating low-density lipoprotein cholesterol. Alternatively spliced transcript variants encoding the same protein have been found for this gene."<ref name=RefSeq1382>{{ cite web
"This intronless gene encodes a transcription factor that contains a basic leucine zipper (bZIP) domain. The encoded protein functions as a homodimer but can also form heterodimers with CCAAT/enhancer-binding proteins alpha, delta, and gamma. Activity of this protein is important in the regulation of genes involved in immune and inflammatory responses, among other processes. The use of alternative in-frame AUG start codons results in multiple protein isoforms, each with distinct biological functions."<ref name=RefSeq1051>{{ cite web
|author=RefSeq
|author=RefSeq
|title=CRABP2 cellular retinoic acid binding protein 2 [ Homo sapiens ]
|title=CEBPB CCAAT enhancer binding protein beta [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=December 2010
|date=October 2013
|url=http://www.ncbi.nlm.nih.gov/gene/1382
|url=http://www.ncbi.nlm.nih.gov/gene/1051
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -32 to -26 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1401===
===Gene ID: 1081===


===Gene ID: 1411===
"The four human glycoprotein hormones chorionic gonadotropin (CG), luteinizing hormone (LH), follicle stimulating hormone (FSH), and thyroid stimulating hormone (TSH) are dimers consisting of alpha and beta subunits that are associated noncovalently. The alpha subunits of these hormones are identical, however, their beta chains are unique and confer biological specificity. The protein encoded by this gene is the alpha subunit and belongs to the glycoprotein hormones alpha chain family. Two transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq1081>{{ cite web
|author=RefSeq
|title=CGA glycoprotein hormones, alpha polypeptide [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=November 2011
|url=http://www.ncbi.nlm.nih.gov/gene/1081
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1427===
===Gene ID: 1101===


===Gene ID: 1440===
"Chondroadherin is a cartilage matrix protein thought to mediate adhesion of isolated chondrocytes. The protein contains 11 leucine-rich repeats flanked by cysteine-rich regions. The chondroadherin messenger RNA is present in chondrocytes at all ages."<ref name=RefSeq1101>{{ cite web
|author=RefSeq
|title=CHAD chondroadherin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1101
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1543===
===Gene ID: 1116===


===Gene ID: 1548===
"Chitinases catalyze the hydrolysis of chitin, which is an abundant glycopolymer found in insect exoskeletons and fungal cell walls. The glycoside hydrolase 18 family of chitinases includes eight human family members. This gene encodes a glycoprotein member of the glycosyl hydrolase 18 family. The protein lacks chitinase activity and is secreted by activated macrophages, chondrocytes, neutrophils and synovial cells. The protein is thought to play a role in the process of inflammation and tissue remodeling."<ref name=RefSeq1116>{{ cite web
|author=RefSeq
|title=CHI3L1 chitinase 3 like 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2009
|url=http://www.ncbi.nlm.nih.gov/gene/1116
|accessdate=2024-06-09 }}</ref> It has a TATA box (CATAAAA) from -30 to -24 nts from the TSS and a TATA box (TATATAAA) from 16 to 23 nts from the TSS.<ref name=Jin/>


===Gene ID: 1553===
===Gene ID: 1152===


===Gene ID: 1576===
"The protein encoded by this gene is a cytoplasmic enzyme involved in energy homeostasis. The encoded protein reversibly catalyzes the transfer of phosphate between ATP and various phosphogens such as creatine phosphate. It acts as a homodimer in brain as well as in other tissues, and as a heterodimer with a similar muscle isozyme in heart. The encoded protein is a member of the ATP:guanido phosphotransferase protein family. A pseudogene of this gene has been characterized."<ref name=RefSeq1152>{{ cite web
|author=RefSeq
|title=CKB creatine kinase B [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1152
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1602===
===Gene ID: 1153===


"Nine elements were tested, representing a sampling of elements present in the two gene deserts and DACH introns, spread over a 1530-kb region surrounding the human DACH's TATA box."<ref name=Nobrega>{{ cite journal
"Enables mRNA 3'-UTR binding activity and small ribosomal subunit rRNA binding activity. Involved in mRNA stabilization; positive regulation of translation; and response to UV. Located in cytoplasm and nucleoplasm."<ref name=AllianceofGenomeResources1153>{{ cite web
|author=Marcelo A. Nobrega, Ivan Ovcharenko, Veena Afzal, and Edward M. Rubin
|author=Alliance of Genome Resources
|title=Scanning human gene deserts for long-range enhancers
|title=CIRBP cold inducible RNA binding protein [ Homo sapiens ]
|journal=Science
|date=October 2003
|volume=302
|issue=5644
|pages=413
|url=http://www.sciencemag.org/content/302/5644/413.short
|arxiv=
|bibcode=
|doi=10.1126/science.1088328
|pmid=14563999
|accessdate=2012-12-26 }}</ref>
 
Gene ID: 1602 is the human gene DACH1 dachshund homolog 1 also known as DACH.<ref name=HGNC1602>{{ cite web
|author=HGNC
|title=DACH1 dachshund homolog 1 (Drosophila) [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=December 20, 2012
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/1602
|url=http://www.ncbi.nlm.nih.gov/gene/1153
|accessdate=2012-12-26 }}</ref> DACH1 has three isoforms: a, b, and c.
|accessdate=2024-06-20 }}</ref>


"This gene encodes a chromatin-associated protein that associates with other DNA-binding transcription factors to regulate gene expression and cell fate determination during development. The protein contains a Ski domain that is highly conserved from Drosophila to human. Expression of this gene is lost in some forms of metastatic cancer, and is correlated with poor prognosis."<ref name=HGNC1602/>
===Gene ID: 1158===


===Gene ID: 1735===
"The protein encoded by this gene is a cytoplasmic enzyme involved in energy homeostasis and is an important serum marker for myocardial infarction. The encoded protein reversibly catalyzes the transfer of phosphate between ATP and various phosphogens such as creatine phosphate. It acts as a homodimer in striated muscle as well as in other tissues, and as a heterodimer with a similar brain isozyme in heart. The encoded protein is a member of the ATP:guanido phosphotransferase protein family."<ref name=RefSeq1158>{{ cite web
|author=RefSeq
|title=CKM creatine kinase, M-type [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1158
|accessdate=2024-06-20 }}</ref>


===Gene ID: 1805===
===Gene ID: 1160===


"Dermatopontin is an extracellular matrix protein with possible functions in cell-matrix interactions and matrix assembly. The protein is found in various tissues and many of its tyrosine residues are sulphated. Dermatopontin is postulated to modify the behavior of TGF-beta through interaction with decorin."<ref name=RefSeq1805>{{ cite web
"Mitochondrial creatine kinase (MtCK) is responsible for the transfer of high energy phosphate from mitochondria to the cytosolic carrier, creatine. It belongs to the creatine kinase isoenzyme family. It exists as two isoenzymes, sarcomeric MtCK and ubiquitous MtCK, encoded by separate genes. Mitochondrial creatine kinase occurs in two different oligomeric forms: dimers and octamers, in contrast to the exclusively dimeric cytosolic creatine kinase isoenzymes. Sarcomeric mitochondrial creatine kinase has 80% homology with the coding exons of ubiquitous mitochondrial creatine kinase. This gene contains sequences homologous to several motifs that are shared among some nuclear genes encoding mitochondrial proteins and thus may be essential for the coordinated activation of these genes during mitochondrial biogenesis. Three transcript variants encoding the same protein have been found for this gene."<ref name=RefSeq1160>{{ cite web
|author=RefSeq
|author=RefSeq
|title=DPT dermatopontin [ Homo sapiens ]
|title=CKMT2 creatine kinase, mitochondrial 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=July 2008
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1465
|url=http://www.ncbi.nlm.nih.gov/gene/1160
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -26 to -20 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 1180===


===Gene ID: 1811===
"The CLCN family of voltage-dependent chloride channel genes comprises nine members (CLCN1-7, Ka and Kb) which demonstrate quite diverse functional characteristics while sharing significant sequence homology. The protein encoded by this gene regulates the electric excitability of the skeletal muscle membrane. Mutations in this gene cause two forms of inherited human muscle disorders: recessive generalized myotonia congenita (Becker) and dominant myotonia (Thomsen). Alternative splicing results in multiple transcript variants."<ref name=RefSeq1180>{{ cite web
|author=RefSeq
|title=CLCN1 chloride voltage-gated channel 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=March 2012
|url=http://www.ncbi.nlm.nih.gov/gene/1180
|accessdate=2024-06-18 }}</ref>


The DRA gene (colon mucosa-associated gene) has a TATA box.<ref name=Suzuki>{{ cite journal
===Gene ID: 1188===
|author=Yutaka Suzuki, Tatsuhiko Tsunoda, Jun Sese, Hirotoshi Taira, Junko Mizushima-Sugano, Hiroko Hata, Toshio Ota, Takao Isogai, Toshihiro Tanaka, Yusuke Nakamura, Akira Suyama, Yoshiyuki Sakaki, Shinichi Morishita, Kousaku Okubo, and Sumio Sugano
|title=Identification and Characterization of the Potential Promoter Regions of 1031 Kinds of Human Genes
|journal=Genome Research
|doi=10.1101/gr.164001
|date=11 April 2001
|volume=11
|issue=5
|page=677-684 }}</ref>


===Gene ID: 1831===
"The protein encoded by this gene is a member of the family of voltage-gated chloride channels. Chloride channels have several functions, including the regulation of cell volume, membrane potential stabilization, signal transduction and transepithelial transport. This gene is expressed predominantly in the kidney and may be important for renal salt reabsorption. Mutations in this gene are associated with autosomal recessive Bartter syndrome type 3 (BS3). Alternatively spliced transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq1188>{{ cite web
|author=RefSeq
|title=CLCNKB chloride voltage-gated channel Kb [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2009
|url=http://www.ncbi.nlm.nih.gov/gene/1188
|accessdate=2024-06-10 }}</ref> It has a TATA box (CATAAAC) from -30 to -24 nts from the TSS.<ref name=Jin/>


===Gene ID: 1833===
===Gene ID: 1191===


===Gene ID: 1974===
"The protein encoded by this gene is a secreted chaperone that can under some stress conditions also be found in the cell cytosol. It has been suggested to be involved in several basic biological events such as cell death, tumor progression, and neurodegenerative disorders. Alternate splicing results in both coding and non-coding variants."<ref name=RefSeq1191>{{ cite web
|author=RefSeq
|title=CLU clusterin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=May 2011
|url=http://www.ncbi.nlm.nih.gov/gene/1191
|accessdate=2024-06-18 }}</ref>


===Gene ID: 1990===
===Gene ID: 1215===


===Gene ID: 1999===
"This gene encodes a chymotryptic serine proteinase that belongs to the peptidase family S1. It is expressed in mast cells and is thought to function in the degradation of the extracellular matrix, the regulation of submucosal gland secretion, and the generation of vasoactive peptides. In the heart and blood vessels, this protein, rather than angiotensin converting enzyme, is largely responsible for converting angiotensin I to the vasoactive peptide angiotensin II. Alternative splicing results in multiple variants."<ref name=RefSeq1215>{{ cite web
 
|author=RefSeq
"Enables DNA-binding transcription activator activity, RNA polymerase II-specific and sequence-specific double-stranded DNA binding activity. Involved in inflammatory response; negative regulation of transcription, DNA-templated; and positive regulation of transcription by RNA polymerase II. Located in Golgi apparatus; cytosol; and nucleoplasm."<ref name=Alliance of Genome Resources1999>{{ cite web
|title=CMA1 chymase 1 [ Homo sapiens ]
|author=Alliance of Genome Resources
|title=ELF3 E74 like ETS transcription factor 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=April 2022
|date=April 2015
|url=http://www.ncbi.nlm.nih.gov/gene/1999
|url=http://www.ncbi.nlm.nih.gov/gene/1215
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -31 to -25 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 1277===


===Gene ID: 2206===
"This gene encodes the pro-alpha1 chains of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIA, Ehlers-Danlos syndrome Classical type, Caffey Disease and idiopathic osteoporosis. Reciprocal translocations between chromosomes 17 and 22, where this gene and the gene for platelet-derived growth factor beta are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans, resulting from unregulated expression of the growth factor. Two transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene."<ref name=RefSeq1277>{{ cite web
|author=RefSeq
|title=COL1A1 collagen type I alpha 1 chain [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=February 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1277
|accessdate=2024-06-18 }}</ref>


===Gene ID: 2250===
===Gene ID: 1278===


===Gene ID: 2321===
"This gene encodes the pro-alpha2 chain of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIB, recessive Ehlers-Danlos syndrome Classical type, idiopathic osteoporosis, and atypical Marfan syndrome. Symptoms associated with mutations in this gene, however, tend to be less severe than mutations in the gene for the alpha1 chain of type I collagen (COL1A1) reflecting the different role of alpha2 chains in matrix integrity. Three transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene."<ref name=RefSeq1278>{{ cite web
|author=RefSeq
|title=COL1A2 collagen type I alpha 2 chain [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=February 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1278
|accessdate=2024-06-18 }}</ref>


===Gene ID: 2494===
===Gene ID: 1356===


"The protein encoded by this gene is a DNA-binding zinc finger transcription factor and is a member of the fushi tarazu factor-1 subfamily of orphan nuclear receptors. The encoded protein is involved in the expression of genes for hepatitis B virus and cholesterol biosynthesis, and may be an important regulator of embryonic development."<ref name=RefSeq2494>{{ cite web
"The protein encoded by this gene is a metalloprotein that binds most of the copper in plasma and is involved in the peroxidation of Fe(II)transferrin to Fe(III) transferrin. Mutations in this gene cause aceruloplasminemia, which results in iron accumulation and tissue damage, and is associated with diabetes and neurologic abnormalities. Two transcript variants, one protein-coding and the other not protein-coding, have been found for this gene."<ref name=RefSeq1356>{{ cite web
|author=RefSeq
|author=RefSeq
|title=NR5A2 nuclear receptor subfamily 5 group A member 2 [ Homo sapiens ]
|title=CP ceruloplasmin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=June 2016
|date=February 2012
|url=http://www.ncbi.nlm.nih.gov/gene/2494
|url=http://www.ncbi.nlm.nih.gov/gene/1356
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAACA) from -28 to -21 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>


===Gene ID: 2542===
===Gene ID: 1382===


===Gene ID: 2597===
"This gene encodes a member of the retinoic acid (RA, a form of vitamin A) binding protein family and lipocalin/cytosolic fatty-acid binding protein family. The protein is a cytosol-to-nuclear shuttling protein, which facilitates RA binding to its cognate receptor complex and transfer to the nucleus. It is involved in the retinoid signaling pathway, and is associated with increased circulating low-density lipoprotein cholesterol. Alternatively spliced transcript variants encoding the same protein have been found for this gene."<ref name=RefSeq1382>{{ cite web
 
|author=RefSeq
===Gene ID: 2652===
|title=CRABP2 cellular retinoic acid binding protein 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2010
|url=http://www.ncbi.nlm.nih.gov/gene/1382
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -32 to -26 nts from the TSS.<ref name=Jin/>


===Gene ID: 2669===
===Gene ID: 1392===


===Gene ID: 2752===
"This gene encodes a member of the corticotropin-releasing factor family. The encoded preproprotein is proteolytically processed to generate the mature neuropeptide hormone. In response to stress, this hormone is secreted by the paraventricular nucleus (PVN) of the hypothalamus, binds to corticotropin releasing hormone receptors and stimulates the release of adrenocorticotropic hormone from the pituitary gland. Marked reduction in this protein has been observed in association with Alzheimer's disease. Autosomal recessive hypothalamic corticotropin deficiency has multiple and potentially fatal metabolic consequences including hypoglycemia and hepatitis. In addition to production in the hypothalamus, this protein is also synthesized in peripheral tissues, such as T lymphocytes, and is highly expressed in the placenta. In the placenta it is a marker that determines the length of gestation and the timing of parturition and delivery. A rapid increase in circulating levels of the hormone occurs at the onset of parturition, suggesting that, in addition to its metabolic functions, this protein may act as a trigger for parturition."<ref name=RefSeq1392>{{ cite web
 
"The protein encoded by this gene belongs to the glutamine synthetase family. It catalyzes the synthesis of glutamine from glutamate and ammonia in an ATP-dependent reaction. This protein plays a role in ammonia and glutamate detoxification, acid-base homeostasis, cell signaling, and cell proliferation. Glutamine is an abundant amino acid, and is important to the biosynthesis of several amino acids, pyrimidines, and purines. Mutations in this gene are associated with congenital glutamine deficiency, and overexpression of this gene was observed in some primary liver cancer samples. There are six pseudogenes of this gene found on chromosomes 2, 5, 9, 11, and 12. Alternative splicing results in multiple transcript variants."<ref name=RefSeq2752>{{ cite web
|author=RefSeq
|author=RefSeq
|title=GLUL glutamate-ammonia ligase [ Homo sapiens ]
|title=CRH corticotropin releasing hormone [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=December 2014
|date=November 2015
|url=http://www.ncbi.nlm.nih.gov/gene/2752
|url=http://www.ncbi.nlm.nih.gov/gene/1392
|accessdate=2024-06-10 }}</ref> It has a TATA box (GATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>


===Gene ID: 2780===
===Gene ID: 1393===


"Transducin is a 3-subunit guanine nucleotide-binding protein (G protein) which stimulates the coupling of rhodopsin and cGMP-phoshodiesterase during visual impulses. The transducin alpha subunits in rods and cones are encoded by separate genes. This gene encodes the alpha subunit in cones."<ref name=RefSeq2780>{{ cite web
"Corticotropin-releasing hormone is a potent stimulator of synthesis and secretion of preopiomelanocortin-derived peptides. Although CRH concentrations in the human peripheral circulation are normally low, they increase throughout pregnancy and fall rapidly after parturition. Maternal plasma CRH probably originates from the placenta. Human plasma contains a CRH-binding protein which inactivates CRH and which may prevent inappropriate pituitary-adrenal stimulation in pregnancy."<ref name=RefSeq1393>{{ cite web
|author=RefSeq
|author=RefSeq
|title=GNAT2 G protein subunit alpha transducin 2 [ Homo sapiens ]
|title=CRHBP corticotropin releasing hormone binding protein [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=July 2008
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/2780
|url=http://www.ncbi.nlm.nih.gov/gene/1393
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -30 to -23 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>


===Gene ID: 2922===
===Gene ID: 1401===


===Gene ID: 2947===
"The protein encoded by this gene belongs to the pentraxin family which also includes serum amyloid P component protein and pentraxin 3. Pentraxins are involved in complement activation and amplification via communication with complement initiation pattern recognition molecules, but also complement regulation via recruitment of complement regulators. The encoded protein has a calcium dependent ligand binding domain with a distinctive flattened beta-jellyroll structure. It exists in two forms as either a pentamer in circulation or as a nonsoluble monomer in tissues. It is involved in several host defense related functions based on its ability to recognize foreign pathogens and damaged cells of the host and to initiate their elimination by interacting with humoral and cellular effector systems in the blood. Consequently, the level of this protein in plasma increases greatly during acute phase response to tissue injury, infection, or other inflammatory stimuli. Elevated expression of the encoded protein is associated with severe acute respiratory syndrome coronavirus 2 (SARS&#8208;CoV&#8208;2) infection."<ref name=RefSeq1401>{{ cite web
|author=RefSeq
|title=CRP C-reactive protein [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2020
|url=http://www.ncbi.nlm.nih.gov/gene/1401
|accessdate=2024-06-19 }}</ref>


===Gene ID: 2980===
===Gene ID: 1410===


"Predicted to enable guanylate cyclase activator activity. Predicted to be involved in positive regulation of guanylate cyclase activity and signal transduction. Predicted to be located in extracellular region."<ref name=AllianceofGenomeResources2980>{{ cite web
"Mammalian lens crystallins are divided into alpha, beta, and gamma families. Alpha crystallins are composed of two gene products: alpha-A and alpha-B, for acidic and basic, respectively. Alpha crystallins can be induced by heat shock and are members of the small heat shock protein (HSP20) family. They act as molecular chaperones although they do not renature proteins and release them in the fashion of a true chaperone; instead they hold them in large soluble aggregates. These heterogeneous aggregates consist of 30-40 subunits; the alpha-A and alpha-B subunits have a 3:1 ratio, respectively. Two additional functions of alpha crystallins are an autokinase activity and participation in the intracellular architecture. The encoded protein has been identified as a moonlighting protein based on its ability to perform mechanistically distinct functions. Alpha-A and alpha-B gene products are differentially expressed; alpha-A is preferentially restricted to the lens and alpha-B is expressed widely in many tissues and organs. Elevated expression of alpha-B crystallin occurs in many neurological diseases; a missense mutation cosegregated in a family with a desmin-related myopathy. Alternative splicing results in multiple transcript variants."<ref name=RefSeq1410>{{ cite web
|author=Alliance of Genome Resources
|author=RefSeq
|title=GUCA2A guanylate cyclase activator 2A [ Homo sapiens ]
|title=CRYAB crystallin alpha B [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=April 2022
|date=January 2019
|url=http://www.ncbi.nlm.nih.gov/gene/2980
|url=http://www.ncbi.nlm.nih.gov/gene/1410
|accessdate=2024-06-10 }}</ref> It has a TATA box (TTTAAAA) from -33 to -27 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>


===Gene ID: 2981===
===Gene ID: 1411===


"Predicted to enable guanylate cyclase activator activity. Predicted to be involved in positive regulation of guanylate cyclase activity and signal transduction. Predicted to be located in extracellular region."<ref name=RefSeq2981>{{ cite web
"Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Beta-crystallins, the most heterogeneous, differ by the presence of the C-terminal extension (present in the basic group, none in the acidic group). Beta-crystallins form aggregates of different sizes and are able to self-associate to form dimers or to form heterodimers with other beta-crystallins. This gene, a beta acidic group member, encodes two proteins (crystallin, beta A3 and crystallin, beta A1) from a single mRNA, the latter protein is 17 aa shorter than crystallin, beta A3 and is generated by use of an alternate translation initiation site. Deletion of exons 3 and 4 causes the autosomal dominant disease 'zonular cataract with sutural opacities'."<ref name=RefSeq1411>{{ cite web
|author=RefSeq
|author=RefSeq
|title=GUCA2B guanylate cyclase activator 2B [ Homo sapiens ]
|title=CRYBA1 crystallin beta A1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=November 2015
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/2981
|url=http://www.ncbi.nlm.nih.gov/gene/1411
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAGG) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>


===Gene ID: 3008===
===Gene ID: 1427===


===Gene ID: 3015===
"Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Gamma-crystallins are a homogeneous group of highly symmetrical, monomeric proteins typically lacking connecting peptides and terminal extensions. They are differentially regulated after early development. This gene encodes a protein initially considered to be a beta-crystallin but the encoded protein is monomeric and has greater sequence similarity to other gamma-crystallins. This gene encodes the most significant gamma-crystallin in adult eye lens tissue. Whether due to aging or mutations in specific genes, gamma-crystallins have been involved in cataract formation."<ref name=RefSeq1427>{{ cite web
|author=RefSeq
|title=CRYGS crystallin gamma S [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1427
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3133===
===Gene ID: 1440===
 
===Gene ID: 3158===


"The protein encoded by this gene belongs to the HMG-CoA synthase family. It is a mitochondrial enzyme that catalyzes the first reaction of ketogenesis, a metabolic pathway that provides lipid-derived energy for various organs during times of carbohydrate deprivation, such as fasting. Mutations in this gene are associated with HMG-CoA synthase deficiency. Alternatively spliced transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq3158>{{ cite web
"This gene encodes a member of the IL-6 superfamily of cytokines. The encoded cytokine controls the production, differentiation, and function of granulocytes. Granulocytes are a type of white blood cell that are part of the innate immune response. A modified form of this protein is commonly administered to manage chemotherapy-induced neutropenia. Alternatively spliced transcript variants have been described for this gene."<ref name=RefSeq1440>{{ cite web
|author=RefSeq
|author=RefSeq
|title=HMGCS2 3-hydroxy-3-methylglutaryl-CoA synthase 2 [ Homo sapiens ]
|title=CSF3 colony stimulating factor 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=October 2009
|date=May 2020
|url=http://www.ncbi.nlm.nih.gov/gene/3158
|url=http://www.ncbi.nlm.nih.gov/gene/1440
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-18 }}</ref>


===Gene ID: 3164===
===Gene ID: 1490===


===Gene ID: 3171===
"The protein encoded by this gene is a mitogen that is secreted by vascular endothelial cells. The encoded protein plays a role in chondrocyte proliferation and differentiation, cell adhesion in many cell types, and is related to platelet-derived growth factor. Certain polymorphisms in this gene have been linked with a higher incidence of systemic sclerosis."<ref name=RefSeq1490>{{ cite web
|author=RefSeq
|title=CCN2 cellular communication network factor 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=November 2009
|url=http://www.ncbi.nlm.nih.gov/gene/1490
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3182===
===Gene ID: 1543===


===Gene ID: 3206===
"This gene, CYP1A1, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme's endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. The gene has been associated with lung cancer risk. A related family member, CYP1A2, is located approximately 25 kb away from CYP1A1 on chromosome 15. Alternative splicing results in multiple transcript variants encoding distinct isoforms."<ref name=RefSeq1543>{{ cite web
|author=RefSeq
|title=CYP1A1 cytochrome P450 family 1 subfamily A member 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=January 2016
|url=http://www.ncbi.nlm.nih.gov/gene/1543
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3222===
===Gene ID: 1544===


===Gene ID: 3232===
"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The protein encoded by this gene localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme's endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. Other xenobiotic substrates for this enzyme include caffeine, aflatoxin B1, and acetaminophen. The transcript from this gene contains four Alu sequences flanked by direct repeats in the 3' untranslated region."<ref name=RefSeq1544>{{ cite web
|author=RefSeq
|title=CYP1A2 cytochrome P450 family 1 subfamily A member 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1544
|accessdate=2024-06-18 }}</ref>


===Gene ID: 3283===
===Gene ID: 1548===


"The protein encoded by this gene is an enzyme that catalyzes the oxidative conversion of delta-5-3-beta-hydroxysteroid precursors into delta-4-ketosteroids, which leads to the production of all classes of steroid hormones. The encoded protein also catalyzes the interconversion of 3-beta-hydroxy- and 3-keto-5-alpha-androstane steroids."<ref name=RefSeq3283>{{ cite web
"This gene, CYP2A6, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by phenobarbital. The enzyme is known to hydroxylate coumarin, and also metabolizes nicotine, aflatoxin B1, nitrosamines, and some pharmaceuticals. Individuals with certain allelic variants are said to have a poor metabolizer phenotype, meaning they do not efficiently metabolize coumarin or nicotine. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q. The gene was formerly referred to as CYP2A3; however, it has been renamed CYP2A6."<ref name=RefSeq1548>{{ cite web
|author=RefSeq
|author=RefSeq
|title=HSD3B1 hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 1 [ Homo sapiens ]
|title=CYP2A6 cytochrome P450 family 2 subfamily A member 6 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=June 2016
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3283
|url=http://www.ncbi.nlm.nih.gov/gene/1548
|accessdate=2024-06-10 }}</ref> It has a TATA box (CATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3284===
===Gene ID: 1553===


"The protein encoded by this gene is a bifunctional enzyme that catalyzes the oxidative conversion of delta(5)-ene-3-beta-hydroxy steroid, and the oxidative conversion of ketosteroids. It plays a crucial role in the biosynthesis of all classes of hormonal steroids. This gene is predominantly expressed in the adrenals and the gonads. Mutations in this gene are associated with 3-beta-hydroxysteroid dehydrogenase, type II, deficiency. Alternatively spliced transcript variants have been found for this gene."<ref name=RefSeq3284>{{ cite web
"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. Although its endogenous substrate has not been determined, it is known to metabolize 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone, a major nitrosamine specific to tobacco. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q."<ref name=RefSeq1553>{{ cite web
|author=RefSeq
|author=RefSeq
|title=HSD3B2 hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 2 [ Homo sapiens ]
|title=CYP2A13 cytochrome P450 family 2 subfamily A member 13 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=October 2009
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3284
|url=http://www.ncbi.nlm.nih.gov/gene/1553
|accessdate=2024-06-10 }}</ref> It has a TATA box (CATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3308===
===Gene ID: 1571===


The ''Drosophila hsp70'' has a TATA box containing promoter.<ref name=Burke>{{ cite journal
"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is induced by ethanol, the diabetic state, and starvation. The enzyme metabolizes both endogenous substrates, such as ethanol, acetone, and acetal, as well as exogenous substrates including benzene, carbon tetrachloride, ethylene glycol, and nitrosamines which are premutagens found in cigarette smoke. Due to its many substrates, this enzyme may be involved in such varied processes as gluconeogenesis, hepatic cirrhosis, diabetes, and cancer."<ref name=RefSeq1571>{{ cite web
| author = Thomas W. Burke and James T. Kadonaga
|author=RefSeq
| title = The downstream core promoter element, DPE, is conserved from ''Drosophila'' to humans and is recognized by TAF<sub>II</sub>60 of ''Drosophila''
|title=CYP2E1 cytochrome P450 family 2 subfamily E member 1 [ Homo sapiens ]
| date = November 15, 1997
|publisher=ncbi.nlm.nih
| journal = Genes & Development
|location=Bethsda, Maryland, USA
| volume = 11
|date=July 2008
| issue = 22
|url=http://www.ncbi.nlm.nih.gov/gene/1571
| pages = 3020–31
|accessdate=2024-06-18 }}</ref>
| url=http://genesdev.cshlp.org/content/11/22/3020.long
| doi = 10.1101/gad.11.22.3020
| pmid = 9367984
| pmc = 316699 }}</ref> This suggests that Gene ID: 3308 HSPA4 heat shock 70kDa protein 4 [''Homo sapiens''], also known as hsp70,<ref name=HGNC3308>{{ cite web
|author=HGNC
|title=HSPA4 heat shock 70kDa protein 4 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=February 3, 2013
|url=http://www.ncbi.nlm.nih.gov/gene/3308
|accessdate=2013-02-07 }}</ref> has a TATA box in its [[core promoter]].


===Gene ID: 3383===
===Gene ID: 1576===


===Gene ID: 3458===
"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by glucocorticoids and some pharmacological agents. This enzyme is involved in the metabolism of approximately half the drugs in use today, including acetaminophen, codeine, cyclosporin A, diazepam, erythromycin, and chloroquine. The enzyme also metabolizes some steroids and carcinogens. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. Previously another CYP3A gene, CYP3A3, was thought to exist; however, it is now thought that this sequence represents a transcript variant of CYP3A4. Alternatively spliced transcript variants encoding different isoforms have been identified."<ref name=RefSeq1576>{{ cite web
|author=RefSeq
|title=CYP3A4 cytochrome P450 family 3 subfamily A member 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2020
|url=http://www.ncbi.nlm.nih.gov/gene/1576
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3487===
===Gene ID: 1581===


===Gene ID: 3491===
"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by glucocorticoids and some pharmacological agents. This enzyme is involved in the metabolism of approximately half the drugs in use today, including acetaminophen, codeine, cyclosporin A, diazepam, erythromycin, and chloroquine. The enzyme also metabolizes some steroids and carcinogens. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. Previously another CYP3A gene, CYP3A3, was thought to exist; however, it is now thought that this sequence represents a transcript variant of CYP3A4. Alternatively spliced transcript variants encoding different isoforms have been identified."<ref name=RefSeq1581>{{ cite web
 
"The secreted protein encoded by this gene is growth factor-inducible and promotes the adhesion of endothelial cells. The encoded protein interacts with several integrins and with heparan sulfate proteoglycan. This protein also plays a role in cell proliferation, differentiation, angiogenesis, apoptosis, and extracellular matrix formation."<ref name=RefSeq3491>{{ cite web
|author=RefSeq
|author=RefSeq
|title=CCN1 cellular communication network factor 1 [ Homo sapiens ]
|title=CYP3A4 cytochrome P450 family 3 subfamily A member 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=September 2011
|date=August 2020
|url=http://www.ncbi.nlm.nih.gov/gene/3491
|url=http://www.ncbi.nlm.nih.gov/gene/1581
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -30 to -24 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3558===
===Gene ID: 1602===


===Gene ID: 3630===
"Nine elements were tested, representing a sampling of elements present in the two gene deserts and DACH introns, spread over a 1530-kb region surrounding the human DACH's TATA box."<ref name=Nobrega>{{ cite journal
|author=Marcelo A. Nobrega, Ivan Ovcharenko, Veena Afzal, and Edward M. Rubin
|title=Scanning human gene deserts for long-range enhancers
|journal=Science
|date=October 2003
|volume=302
|issue=5644
|pages=413
|url=http://www.sciencemag.org/content/302/5644/413.short
|arxiv=
|bibcode=
|doi=10.1126/science.1088328
|pmid=14563999
|accessdate=2012-12-26 }}</ref>


===Gene ID: 3640===
Gene ID: 1602 is the human gene DACH1 dachshund homolog 1 also known as DACH.<ref name=HGNC1602>{{ cite web
|author=HGNC
|title=DACH1 dachshund homolog 1 (Drosophila) [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 20, 2012
|url=http://www.ncbi.nlm.nih.gov/gene/1602
|accessdate=2012-12-26 }}</ref> DACH1 has three isoforms: a, b, and c.


===Gene ID: 3662===
"This gene encodes a chromatin-associated protein that associates with other DNA-binding transcription factors to regulate gene expression and cell fate determination during development. The protein contains a Ski domain that is highly conserved from Drosophila to human. Expression of this gene is lost in some forms of metastatic cancer, and is correlated with poor prognosis."<ref name=HGNC1602/>


===Gene ID: 3726===
===Gene ID: 1655===


===Gene ID: 3758===
"This gene encodes a member of the DEAD box family of RNA helicases that are involved in a variety of cellular processes as a result of its role as an adaptor molecule, promoting interactions with a large number of other factors. This protein is involved in pathways that include the alteration of RNA structures, plays a role as a coregulator of transcription, a regulator of splicing, and in the processing of small noncoding RNAs. Members of this family contain nine conserved motifs, including the conserved Asp-Glu-Ala-Asp (DEAD) motif, important to ATP binding and hydrolysis as well as RNA binding and unwinding activities. Dysregulation of this gene may play a role in cancer development. Alternative splicing results in multiple transcript variants."<ref name=RefSeq1655>{{ cite web
 
|author=RefSeq
===Gene ID: 3858===
|title=DDX5 DEAD-box helicase 5 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2017
|url=http://www.ncbi.nlm.nih.gov/gene/1655
|accessdate=2024-06-18 }}</ref>


===Gene ID: 3861===
===Gene ID: 1728===


===Gene ID: 3872===
"This gene is a member of the NAD(P)H dehydrogenase (quinone) family and encodes a cytoplasmic 2-electron reductase. This FAD-binding protein forms homodimers and reduces quinones to hydroquinones. This protein's enzymatic activity prevents the one electron reduction of quinones that results in the production of radical species. Mutations in this gene have been associated with tardive dyskinesia (TD), an increased risk of hematotoxicity after exposure to benzene, and susceptibility to various forms of cancer. Altered expression of this protein has been seen in many tumors and is also associated with Alzheimer's disease (AD). Alternate transcriptional splice variants, encoding different isoforms, have been characterized."<ref name=RefSeq1728>{{ cite web
 
===Gene ID: 3906===
 
===Gene ID: 3918===
 
"Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Laminins, composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively), have a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains. Each laminin chain is a multidomain protein encoded by a distinct gene. Several isoforms of each chain have been described. Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1. The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo. This gene encodes the gamma chain isoform laminin, gamma 2. The gamma 2 chain, formerly thought to be a truncated version of beta chain (B2t), is highly homologous to the gamma 1 chain; however, it lacks domain VI, and domains V, IV and III are shorter. It is expressed in several fetal tissues but differently from gamma 1, and is specifically localized to epithelial cells in skin, lung and kidney. The gamma 2 chain together with alpha 3 and beta 3 chains constitute laminin 5 (earlier known as kalinin), which is an integral part of the anchoring filaments that connect epithelial cells to the underlying basement membrane. The epithelium-specific expression of the gamma 2 chain implied its role as an epithelium attachment molecule, and mutations in this gene have been associated with junctional epidermolysis bullosa, a skin disease characterized by blisters due to disruption of the epidermal-dermal junction. Two transcript variants resulting from alternative splicing of the 3' terminal exon, and encoding different isoforms of gamma 2 chain, have been described. The two variants are differentially expressed in embryonic tissues, however, the biological significance of the two forms is not known. Transcript variants utilizing alternative polyA_signal have also been noted in literature."<ref name=RefSeq3918>{{ cite web
|author=RefSeq
|author=RefSeq
|title=LAMC2 laminin subunit gamma 2 [ Homo sapiens ]
|title=NQO1 NAD(P)H quinone dehydrogenase 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|location=Bethsda, Maryland, USA
|date=August 2011
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3918
|url=http://www.ncbi.nlm.nih.gov/gene/1728
|accessdate=2024-06-10 }}</ref> It has a TATA box (GATAAAA) from -33 to -27 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 1735===


===Gene ID: 3938===
"The protein encoded by this intronless gene belongs to the iodothyronine deiodinase family. It catalyzes the inactivation of thyroid hormone by inner ring deiodination of the prohormone thyroxine (T4) and the bioactive hormone 3,3',5-triiodothyronine (T3) to inactive metabolites, 3,3',5'-triiodothyronine (RT3) and 3,3'-diiodothyronine (T2), respectively. This enzyme is highly expressed in pregnant uterus, placenta, fetal and neonatal tissues, and thought to prevent premature exposure of developing fetal tissues to adult levels of thyroid hormones. It regulates circulating fetal thyroid hormone concentrations, and thus plays a critical role in mammalian development. Knockout mice lacking this gene exhibit abnormalities related to development and reproduction, and increased activity of this enzyme in infants with hemangiomas causes severe hypothyroidism. This protein is a selenoprotein, containing the rare selenocysteine (Sec) amino acid at its active site. Sec is encoded by the UGA codon, which normally signals translation termination. The 3' UTRs of selenoprotein mRNAs contain a conserved stem-loop structure, designated the Sec insertion sequence (SECIS) element, that is necessary for the recognition of UGA as a Sec codon rather than as a stop signal."<ref name=RefSeq1735>{{ cite web
|author=RefSeq
|title=DIO3 iodothyronine deiodinase 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=May 2016
|url=http://www.ncbi.nlm.nih.gov/gene/1735
|accessdate=2024-06-19 }}</ref>


===Gene ID: 3976===
===Gene ID: 1805===


===Gene ID: 3995===
"Dermatopontin is an extracellular matrix protein with possible functions in cell-matrix interactions and matrix assembly. The protein is found in various tissues and many of its tyrosine residues are sulphated. Dermatopontin is postulated to modify the behavior of TGF-beta through interaction with decorin."<ref name=RefSeq1805>{{ cite web
|author=RefSeq
|title=DPT dermatopontin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1465
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -26 to -20 nts from the TSS.<ref name=Jin/>


TATA box is TATAA.<ref name=Suzuki/>
===Gene ID: 1811===


The protein encoded by this FADS3 fatty acid desaturase 3 gene is "a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization."<ref name=HGNC3995>{{ cite web
The DRA gene (colon mucosa-associated gene) has a TATA box.<ref name=Suzuki>{{ cite journal
|author=HGNC:3576
|author=Yutaka Suzuki, Tatsuhiko Tsunoda, Jun Sese, Hirotoshi Taira, Junko Mizushima-Sugano, Hiroko Hata, Toshio Ota, Takao Isogai, Toshihiro Tanaka, Yusuke Nakamura, Akira Suyama, Yoshiyuki Sakaki, Shinichi Morishita, Kousaku Okubo, and Sumio Sugano
|title=FADS3 fatty acid desaturase 3 [ Homo sapiens ]
|title=Identification and Characterization of the Potential Promoter Regions of 1031 Kinds of Human Genes
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|journal=Genome Research
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|doi=10.1101/gr.164001
|date=June 6, 2024
|date=11 April 2001
|url=http://www.ncbi.nlm.nih.gov/gene/3995
|volume=11
|accessdate=2024-06-07 }}</ref>
|issue=5
|page=677-684 }}</ref>


===Gene ID: 4014===
"The protein encoded by this gene is a transmembrane glycoprotein that transports chloride ions across the cell membrane in exchange for bicarbonate ions. It is localized to the mucosa of the lower intestinal tract, particularly to the apical membrane of columnar epithelium and some goblet cells. The protein is essential for intestinal chloride absorption, and mutations in this gene have been associated with congenital chloride diarrhea."<ref name=RefSeq1811>{{ cite web
 
"This gene encodes loricrin, a major protein component of the cornified cell envelope found in terminally differentiated epidermal cells. Mutations in this gene are associated with Vohwinkel's syndrome and progressive symmetric erythrokeratoderma, both inherited skin diseases."<ref name=RefSeq4014>{{ cite web
|author=RefSeq
|author=RefSeq
|title=LORICRIN loricrin cornified envelope precursor protein [ Homo sapiens ]
|title=SLC26A3 solute carrier family 26 member 3 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=July 2008
|date=October 2008
|url=http://www.ncbi.nlm.nih.gov/gene/4014
|url=http://www.ncbi.nlm.nih.gov/gene/1811
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATATATAA) from -40 to -32 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-19 }}</ref>


===Gene ID: 4049===
===Gene ID: 1831===


===Gene ID: 4144===
"This gene encodes the anti-inflammatory protein glucocorticoid (GC)-induced leucine zipper. Expression of this gene stimulated by glucocorticoids and interleukin 10 and it appears to play a key role in the anti-inflammatory and immunosuppressive effects of this steroid. This protein has also been shown to inhibit pro-inflammatory molecules including nuclear factor &#954;B. Alternate splicing results in multiple transcript variants."<ref name=RefSeq1831>{{ cite web
|author=RefSeq
|title=TSC22D3 TSC22 domain family member 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=January 2016
|url=http://www.ncbi.nlm.nih.gov/gene/1831
|accessdate=2024-06-19 }}</ref>


===Gene ID: 4222===
===Gene ID: 1833===


===Gene ID: 4225===
"Dermatan sulfate proteoglycan 3 is a member of the small leucine-rich repeat proteoglycan family. This gene is composed of seven exons. It regulates fibrillogenesis by interacting with collagen fibrils and other extracellular matrix proteins."<ref name=RefSeq1833>{{ cite web
|author=RefSeq
|title=EPYC epiphycan [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1833
|accessdate=2024-06-19 }}</ref>


===Gene ID: 4284===
===Gene ID: 1893===


===Gene ID: 4316===
"This gene encodes a soluble protein that is involved in endochondral bone formation, angiogenesis, and tumor biology. It also interacts with a variety of extracellular and structural proteins, contributing to the maintenance of skin integrity and homeostasis. Mutations in this gene are associated with lipoid proteinosis disorder (also known as hyalinosis cutis et mucosae or Urbach-Wiethe disease) that is characterized by generalized thickening of skin, mucosae and certain viscera. Alternatively spliced transcript variants encoding distinct isoforms have been described for this gene."<ref name=RefSeq1833>{{ cite web
|author=RefSeq
|title=ECM1 extracellular matrix protein 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=February 2011
|url=http://www.ncbi.nlm.nih.gov/gene/1833
|accessdate=2024-06-20 }}</ref>


===Gene ID: 4321===
===Gene ID: 1906===


===Gene ID: 4327===
"This gene encodes a preproprotein that is proteolytically processed to generate a secreted peptide that belongs to the endothelin/sarafotoxin family. This peptide is a potent vasoconstrictor and its cognate receptors are therapeutic targets in the treatment of pulmonary arterial hypertension. Aberrant expression of this gene may promote tumorigenesis. Alternative splicing results in multiple transcript variants."<ref name=RefSeq1906>{{ cite web
|author=RefSeq
|title=EDN1 endothelin 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=October 2015
|url=http://www.ncbi.nlm.nih.gov/gene/1906
|accessdate=2024-06-20 }}</ref>


===Gene ID: 4357===
===Gene ID: 1915===


===Gene ID: 4501===
"This gene encodes an isoform of the alpha subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This isoform (alpha 1) is expressed in brain, placenta, lung, liver, kidney, and pancreas, and the other isoform (alpha 2) is expressed in brain, heart and skeletal muscle. This isoform is identified as an autoantigen in 66% of patients with Felty syndrome. This gene has been found to have multiple copies on many chromosomes, some of which, if not all, represent different pseudogenes."<ref name=RefSeq1915>{{ cite web
|author=RefSeq
|title=EEF1A1 eukaryotic translation elongation factor 1 alpha 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/1915
|accessdate=2024-06-20 }}</ref>


===Gene ID: 4582===
===Gene ID: 1938===


"This gene encodes a membrane-bound protein that is a member of the mucin family. Mucins are O-glycosylated proteins that play an essential role in forming protective mucous barriers on epithelial surfaces. These proteins also play a role in intracellular signaling. This protein is expressed on the apical surface of epithelial cells that line the mucosal surfaces of many different tissues including lung, breast stomach and pancreas. This protein is proteolytically cleaved into alpha and beta subunits that form a heterodimeric complex. The N-terminal alpha subunit functions in cell-adhesion and the C-terminal beta subunit is involved in cell signaling. Overexpression, aberrant intracellular localization, and changes in glycosylation of this protein have been associated with carcinomas. This gene is known to contain a highly polymorphic variable number tandem repeats (VNTR) domain. Alternate splicing results in multiple transcript variants."<ref name=RefSeq4582>{{ cite web
===Gene ID: 1961===
|author=RefSeq
|title=MUC1 mucin 1, cell surface associated [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=February 2011
|url=http://www.ncbi.nlm.nih.gov/gene/4582
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -24 to -18 nts from the TSS.<ref name=Jin/>


===Gene ID: 4618===
"Enables DNA-binding transcription activator activity, RNA polymerase II-specific and sequence-specific double-stranded DNA binding activity. Involved in positive regulation of transcription by RNA polymerase II. Predicted to be located in nucleoplasm. Predicted to be part of chromatin."<ref name=AllianceofGenomeResources1961>{{ cite web
|author=Alliance of Genome Resources
|title=EGR4 early growth response 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/1961
|accessdate=2024-06-20 }}</ref>


===Gene ID: 4624===
===Gene ID: 1974===


===Gene ID: 4629===
===Gene ID: 1990===


===Gene ID: 4638===
===Gene ID: 1999===


===Gene ID: 4653===
"Enables DNA-binding transcription activator activity, RNA polymerase II-specific and sequence-specific double-stranded DNA binding activity. Involved in inflammatory response; negative regulation of transcription, DNA-templated; and positive regulation of transcription by RNA polymerase II. Located in Golgi apparatus; cytosol; and nucleoplasm."<ref name=Alliance of Genome Resources1999>{{ cite web
 
|author=Alliance of Genome Resources
"MYOC encodes the protein myocilin, which is believed to have a role in cytoskeletal function. MYOC is expressed in many occular tissues, including the trabecular meshwork, and was revealed to be the trabecular meshwork glucocorticoid-inducible response protein (TIGR). The trabecular meshwork is a specialized eye tissue essential in regulating intraocular pressure, and mutations in MYOC have been identified as the cause of hereditary juvenile-onset open-angle glaucoma."<ref name=RefSeq4653>{{ cite web
|title=ELF3 E74 like ETS transcription factor 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/1999
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -31 to -25 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 2168===
 
===Gene ID: 2206===
 
===Gene ID: 2244===
 
"The protein encoded by this gene is the beta component of fibrinogen, a blood-borne glycoprotein comprised of three pairs of nonidentical polypeptide chains. Following vascular injury, fibrinogen is cleaved by thrombin to form fibrin which is the most abundant component of blood clots. In addition, various cleavage products of fibrinogen and fibrin regulate cell adhesion and spreading, display vasoconstrictor and chemotactic activities, and are mitogens for several cell types. Fibrinogen serves key roles in hemostasis and antimicrobial host defense. Mutations in this gene lead to several disorders, including afibrinogenemia, dysfibrinogenemia, hypodysfibrinogenemia and thrombotic tendency."<ref name=RefSeq2244>{{ cite web
|author=RefSeq
|author=RefSeq
|title=MYOC myocilin [ Homo sapiens ]
|title=FGB fibrinogen beta chain [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=July 2008
|date=August 2020
|url=http://www.ncbi.nlm.nih.gov/gene/4653
|url=http://www.ncbi.nlm.nih.gov/gene/2244
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATATATAAAC) from -31 to -21 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>


===Gene ID: 4741===
===Gene ID: 2250===


===Gene ID: 4747===
===Gene ID: 2321===


===Gene ID: 4843===
===Gene ID: 2353===


===Gene ID: 4878===
===Gene ID: 2494===


"The protein encoded by this gene belongs to the natriuretic peptide family. Natriuretic peptides are implicated in the control of extracellular fluid volume and electrolyte homeostasis. This protein is synthesized as a large precursor (containing a signal peptide), which is processed to release a peptide from the N-terminus with similarity to vasoactive peptide, cardiodilatin, and another peptide from the C-terminus with natriuretic-diuretic activity. Mutations in this gene have been associated with atrial fibrillation familial type 6. This gene is located adjacent to another member of the natriuretic family of peptides on chromosome 1."<ref name=RefSeq4878>{{ cite web
"The protein encoded by this gene is a DNA-binding zinc finger transcription factor and is a member of the fushi tarazu factor-1 subfamily of orphan nuclear receptors. The encoded protein is involved in the expression of genes for hepatitis B virus and cholesterol biosynthesis, and may be an important regulator of embryonic development."<ref name=RefSeq2494>{{ cite web
|author=RefSeq
|author=RefSeq
|title=NPPA natriuretic peptide A [ Homo sapiens ]
|title=NR5A2 nuclear receptor subfamily 5 group A member 2 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=October 2015
|date=June 2016
|url=http://www.ncbi.nlm.nih.gov/gene/4878
|url=http://www.ncbi.nlm.nih.gov/gene/2494
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAAAG) from -30 to -22 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAACA) from -28 to -21 nts from the TSS.<ref name=Jin/>


===Gene ID: 4946===
===Gene ID: 2538===


===Gene ID: 5004===
===Gene ID: 2542===


===Gene ID: 5005===
===Gene ID: 2597===


===Gene ID: 5016===
===Gene ID: 2641===


"This gene encodes a large, carbohydrate-rich, epithelial glycoprotein with numerous O-glycosylation sites located within threonine, serine, and proline-rich tandem repeats. The gene is similar to members of the mucin and the glycosyl hydrolase 18 gene families. Regulation of expression may be estrogen-dependent. Gene expression and protein secretion occur during late follicular development through early cleavage-stage embryonic development. The protein is secreted from non-ciliated oviductal epithelial cells and associates with ovulated oocytes, blastomeres, and spermatozoan acrosomal regions."<ref name=RefSeq5016>{{ cite web
"The protein encoded by this gene is actually a preproprotein that is cleaved into four distinct mature peptides. One of these, glucagon, is a pancreatic hormone that counteracts the glucose-lowering action of insulin by stimulating glycogenolysis and gluconeogenesis. Glucagon is a ligand for a specific G-protein linked receptor whose signalling pathway controls cell proliferation. Two of the other peptides are secreted from gut endocrine cells and promote nutrient absorption through distinct mechanisms. Finally, the fourth peptide is similar to glicentin, an active enteroglucagon."<ref name=RefSeq2641>{{ cite web
|author=RefSeq
|author=RefSeq
|title=OVGP1 oviductal glycoprotein 1 [ Homo sapiens ]
|title=GCG glucagon [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=July 2008
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/5016
|url=http://www.ncbi.nlm.nih.gov/gene/2641
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -25 to -19 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 2652===
 
===Gene ID: 2669===


===Gene ID: 5034===
===Gene ID: 2688===


===Gene ID: 5052===
===Gene ID: 2752===


"This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in cells, and may contribute to the antiviral activity of CD8(+) T-cells. This protein may have a proliferative effect and play a role in cancer development or progression. Four transcript variants encoding the same protein have been identified for this gene."<ref name=RefSeq5052>{{ cite web
"The protein encoded by this gene belongs to the glutamine synthetase family. It catalyzes the synthesis of glutamine from glutamate and ammonia in an ATP-dependent reaction. This protein plays a role in ammonia and glutamate detoxification, acid-base homeostasis, cell signaling, and cell proliferation. Glutamine is an abundant amino acid, and is important to the biosynthesis of several amino acids, pyrimidines, and purines. Mutations in this gene are associated with congenital glutamine deficiency, and overexpression of this gene was observed in some primary liver cancer samples. There are six pseudogenes of this gene found on chromosomes 2, 5, 9, 11, and 12. Alternative splicing results in multiple transcript variants."<ref name=RefSeq2752>{{ cite web
|author=RefSeq
|author=RefSeq
|title=PRDX1 peroxiredoxin 1 [ Homo sapiens ]
|title=GLUL glutamate-ammonia ligase [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=January 2011
|date=December 2014
|url=http://www.ncbi.nlm.nih.gov/gene/5052
|url=http://www.ncbi.nlm.nih.gov/gene/2752
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -31 to -25 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-10 }}</ref> It has a TATA box (GATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>


===Gene ID: 5069===
===Gene ID: 2780===


===Gene ID: 5132===
"Transducin is a 3-subunit guanine nucleotide-binding protein (G protein) which stimulates the coupling of rhodopsin and cGMP-phoshodiesterase during visual impulses. The transducin alpha subunits in rods and cones are encoded by separate genes. This gene encodes the alpha subunit in cones."<ref name=RefSeq2780>{{ cite web
 
|author=RefSeq
"This gene encodes a phosphoprotein, which is located in the outer and inner segments of the rod cells in the retina. This protein may participate in the regulation of visual phototransduction or in the integration of photoreceptor metabolism. It modulates the phototransduction cascade by interacting with the beta and gamma subunits of the retinal G-protein transducin. This gene is a potential candidate gene for retinitis pigmentosa and Usher syndrome type II. Alternatively spliced transcript variants encoding different isoforms have been identified."<ref name=RefSeq5132>{{ cite web
|title=GNAT2 G protein subunit alpha transducin 2 [ Homo sapiens ]
|author=RefSeq
|publisher=ncbi.nlm.nih
|title=PDC phosducin [ Homo sapiens ]
|location=Bethsda, Maryland, USA
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2008
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/5132
|url=http://www.ncbi.nlm.nih.gov/gene/2780
|accessdate=2024-06-10 }}</ref> It has a TATA box (TTTAAAT) from -32 to -26 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -30 to -23 nts from the TSS.<ref name=Jin/>


===Gene ID: 5155===
===Gene ID: 2814===


===Gene ID: 5224===
"Human platelet glycoprotein V (GP5) is a part of the Ib-V-IX system of surface glycoproteins that constitute the receptor for von Willebrand factor (VWF; MIM 613160) and mediate the adhesion of platelets to injured vascular surfaces in the arterial circulation, a critical initiating event in hemostasis. The main portion of the receptor is a heterodimer composed of 2 polypeptide chains, an alpha chain (GP1BA; MIM 606672) and a beta chain (GP1BB; MIM 138720), that are linked by disulfide bonds. The complete receptor complex includes noncovalent association of the alpha and beta subunits with platelet glycoprotein IX (GP9; MIM 173515) and GP5. Mutations in GP1BA, GP1BB, and GP9 have been shown to cause Bernard-Soulier syndrome (MIM 231200), a bleeding disorder (review by Lopez et al., 1998 [PubMed 9616133])."<ref name=OMIM2814>{{ cite web
|author=OMIM
|title=GP5 glycoprotein V platelet [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=November 2010
|url=http://www.ncbi.nlm.nih.gov/gene/2814
|accessdate=2024-06-20 }}</ref>


===Gene ID: 5275===
===Gene ID: 2922===


===Gene ID: 5360===
===Gene ID: 2947===


===Gene ID: 5449===
===Gene ID: 2980===


===Gene ID: 5478===
"Predicted to enable guanylate cyclase activator activity. Predicted to be involved in positive regulation of guanylate cyclase activity and signal transduction. Predicted to be located in extracellular region."<ref name=AllianceofGenomeResources2980>{{ cite web
 
|author=Alliance of Genome Resources
===Gene ID: 5610===
|title=GUCA2A guanylate cyclase activator 2A [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/2980
|accessdate=2024-06-10 }}</ref> It has a TATA box (TTTAAAA) from -33 to -27 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 2981===


===Gene ID: 5617===
"Predicted to enable guanylate cyclase activator activity. Predicted to be involved in positive regulation of guanylate cyclase activity and signal transduction. Predicted to be located in extracellular region."<ref name=RefSeq2981>{{ cite web
|author=RefSeq
|title=GUCA2B guanylate cyclase activator 2B [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=November 2015
|url=http://www.ncbi.nlm.nih.gov/gene/2981
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAGG) from -30 to -24 nts from the TSS.<ref name=Jin/>


===Gene ID: 5645===
===Gene ID: 3006===


===Gene ID: 5650===
"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6."<ref name=RefSeq3006>{{ cite web
|author=RefSeq
|title=H1-2 H1.2 linker histone, cluster member [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2015
|url=http://www.ncbi.nlm.nih.gov/gene/3006
|accessdate=2024-06-20 }}</ref>


===Gene ID: 5743===
===Gene ID: 3008===


"Prostaglandin-endoperoxide synthase (PTGS), also known as cyclooxygenase, is the key enzyme in prostaglandin biosynthesis, and acts both as a dioxygenase and as a peroxidase. There are two isozymes of PTGS: a constitutive PTGS1 and an inducible PTGS2, which differ in their regulation of expression and tissue distribution. This gene encodes the inducible isozyme. It is regulated by specific stimulatory events, suggesting that it is responsible for the prostanoid biosynthesis involved in inflammation and mitogenesis."<ref name=RefSeq5743>{{ cite web
"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6."<ref name=RefSeq3008>{{ cite web
|author=RefSeq
|author=RefSeq
|title=PTGS2 prostaglandin-endoperoxide synthase 2 [ Homo sapiens ]
|title=H1-4 H1.4 linker histone, cluster member [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=February 2009
|date=August 2015
|url=http://www.ncbi.nlm.nih.gov/gene/5743
|url=http://www.ncbi.nlm.nih.gov/gene/3008
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -31 to -25 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>


"[T]he human ... prostaglandin-endoperoxide-synthase-2 [gene contains] a canonical TATA box (nucleotide residues at positions -31 to -25 for the human gene)."<ref name=Kosaka>{{ cite journal
===Gene ID: 3009===
|author=Tetsuya Kosaka, Atsuro Miyata, Hayato Ihara, Shuntaro Hara, Tamiko Sugimoto, Osamu Takeda, Ei-ichi Takahashi, Tadashi Tanabe
|title=Characterization of the human gene (PTGS2) encoding prostaglandin‐endoperoxide synthase 2
|journal=European Journal of Biochemistry
|date=May 1994
|volume=221
|issue=3
|pages=889-97
|url=http://onlinelibrary.wiley.com/doi/10.1111/j.1432-1033.1994.tb18804.x/full
|arxiv=
|bibcode=
|doi=10.1111/j.1432-1033.1994.tb18804.x
|pmid=
|accessdate=2012-12-26 }}</ref> This is Gene ID: 5743.


===Gene ID: 5950===
"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3."<ref name=RefSeq3009>{{ cite web
|author=RefSeq
|title=H1-5 H1.5 linker histone, cluster member [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2015
|url=http://www.ncbi.nlm.nih.gov/gene/3009
|accessdate=2024-06-20 }}</ref>


===Gene ID: 5956===
===Gene ID: 3010===


===Gene ID: 5967===
"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6."<ref name=RefSeq3010>{{ cite web
|author=RefSeq
|title=H1-6 H1.6 linker histone, cluster member [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2015
|url=http://www.ncbi.nlm.nih.gov/gene/3010
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 3015===


===Gene ID: 5996===
===Gene ID: 3050===


"This gene encodes a member of the regulator of G-protein signalling family. This protein is located on the cytosolic side of the plasma membrane and contains a conserved, 120 amino acid motif called the RGS domain. The protein attenuates the signalling activity of G-proteins by binding to activated, GTP-bound G alpha subunits and acting as a GTPase activating protein (GAP), increasing the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal."<ref name=RefSeq5996>{{ cite web
"Zeta-globin is an alpha-like hemoglobin. The zeta-globin polypeptide is synthesized in the yolk sac of the early embryo, while alpha-globin is produced throughout fetal and adult life. The zeta-globin gene is a member of the human alpha-globin gene cluster that includes five functional genes and two pseudogenes. The order of genes is: 5' - zeta - pseudozeta - mu - pseudoalpha-1 - alpha-2 -alpha-1 - theta1 - 3'."<ref name=RefSeq3050>{{ cite web
|author=RefSeq
|author=RefSeq
|title=RGS1 regulator of G protein signaling 1 [ Homo sapiens ]
|title=HBZ hemoglobin subunit zeta [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=July 2008
|date=November 2009
|url=http://www.ncbi.nlm.nih.gov/gene/5996
|url=http://www.ncbi.nlm.nih.gov/gene/3050
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -28 to -22 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 3133===


===Gene ID: 5997===
===Gene ID: 3158===


"Regulator of G protein signaling (RGS) family members are regulatory molecules that act as GTPase activating proteins (GAPs) for G alpha subunits of heterotrimeric G proteins. RGS proteins are able to deactivate G protein subunits of the Gi alpha, Go alpha and Gq alpha subtypes. They drive G proteins into their inactive GDP-bound forms. Regulator of G protein signaling 2 belongs to this family. The protein acts as a mediator of myeloid differentiation and may play a role in leukemogenesis."<ref name=RefSeq5997>{{ cite web
"The protein encoded by this gene belongs to the HMG-CoA synthase family. It is a mitochondrial enzyme that catalyzes the first reaction of ketogenesis, a metabolic pathway that provides lipid-derived energy for various organs during times of carbohydrate deprivation, such as fasting. Mutations in this gene are associated with HMG-CoA synthase deficiency. Alternatively spliced transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq3158>{{ cite web
|author=RefSeq
|author=RefSeq
|title=RGS2 regulator of G protein signaling 2 [ Homo sapiens ]
|title=HMGCS2 3-hydroxy-3-methylglutaryl-CoA synthase 2 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=August 2009
|date=October 2009
|url=http://www.ncbi.nlm.nih.gov/gene/5997
|url=http://www.ncbi.nlm.nih.gov/gene/3158
|accessdate=2024-06-11 }}</ref> It has a TATA box (CATAAAT) from -28 to -22 nts from the TSS.<ref name=Jin/>
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 3164===


===Gene ID: 6046===
===Gene ID: 3171===


===Gene ID: 6121===
===Gene ID: 3182===


"The protein encoded by this gene is a component of the vitamin A visual cycle of the retina which supplies the 11-cis retinal chromophore of the photoreceptors opsin visual pigments. It is a member of the carotenoid cleavage oxygenase superfamily. All members of this superfamily are non-heme iron oxygenases with a seven-bladed propeller fold and oxidatively cleave carotenoid carbon:carbon double bonds. However, the protein encoded by this gene has acquired a divergent function that involves the concerted O-alkyl ester cleavage of its all-trans retinyl ester substrate and all-trans to 11-cis double bond isomerization of the retinyl moiety. As such, it performs the essential enzymatic isomerization step in the synthesis of 11-cis retinal. Mutations in this gene are associated with early-onset severe blinding disorders such as Leber congenital."<ref name=RefSeq6121>{{ cite web
===Gene ID: 3206===
|author=RefSeq
|title=RPE65 retinoid isomerohydrolase RPE65 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=October 2017
|url=http://www.ncbi.nlm.nih.gov/gene/6121
|accessdate=2024-06-11 }}</ref> It has a TATA box (CATAAAA) from -27 to -21 nts from the TSS.<ref name=Jin/>


===Gene ID: 6232===
===Gene ID: 3222===


"Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of four RNA species and approximately 80 structurally distinct proteins. This gene encodes a member of the S27e family of ribosomal proteins and component of the 40S subunit. The encoded protein contains a C4-type zinc finger domain that can bind to zinc and may bind to nucleic acid. Mutations in this gene have been identified in numerous melanoma patients and in at least one patient with Diamond-Blackfan anemia (DBA). Elevated expression of this gene has been observed in various human cancers. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome."<ref name=RefSeq6232>{{ cite web
===Gene ID: 3232===
|author=RefSeq
 
|title=RPS27 ribosomal protein S27 [ Homo sapiens ]
===Gene ID: 3273===
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2018
|url=http://www.ncbi.nlm.nih.gov/gene/6232
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATATAA) from -29 to -23 nts from the TSS.<ref name=Jin/>


===Gene ID: 6279===
===Gene ID: 3280===


"The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in the inhibition of casein kinase and as a cytokine. Altered expression of this protein is associated with the disease cystic fibrosis. Multiple transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq6279>{{ cite web
"This protein belongs to the basic helix-loop-helix family of transcription factors. It is a transcriptional repressor of genes that require a bHLH protein for their transcription. The protein has a particular type of basic domain that contains a helix interrupting protein that binds to the N-box rather than the canonical E-box."<ref name=RefSeq3280>{{ cite web
|author=RefSeq
|author=RefSeq
|title=S100A8 S100 calcium binding protein A8 [ Homo sapiens ]
|title=HES1 hes family bHLH transcription factor 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=January 2016
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/6279
|url=http://www.ncbi.nlm.nih.gov/gene/3280
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAAA) from -30 to -24 nts from the TSS, Code score: 1.00; Matrix score: 0.93.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>


===Gene ID: 6280===
===Gene ID: 3283===


===Gene ID: 6288===
"The protein encoded by this gene is an enzyme that catalyzes the oxidative conversion of delta-5-3-beta-hydroxysteroid precursors into delta-4-ketosteroids, which leads to the production of all classes of steroid hormones. The encoded protein also catalyzes the interconversion of 3-beta-hydroxy- and 3-keto-5-alpha-androstane steroids."<ref name=RefSeq3283>{{ cite web
 
|author=RefSeq
===Gene ID: 6289===
|title=HSD3B1 hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 1 [ Homo sapiens ]
 
|publisher=ncbi.nlm.nih
===Gene ID: 6349===
|location=Bethsda, Maryland, USA
|date=June 2016
|url=http://www.ncbi.nlm.nih.gov/gene/3283
|accessdate=2024-06-10 }}</ref> It has a TATA box (CATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>


===Gene ID: 6351===
===Gene ID: 3284===


===Gene ID: 6352===
"The protein encoded by this gene is a bifunctional enzyme that catalyzes the oxidative conversion of delta(5)-ene-3-beta-hydroxy steroid, and the oxidative conversion of ketosteroids. It plays a crucial role in the biosynthesis of all classes of hormonal steroids. This gene is predominantly expressed in the adrenals and the gonads. Mutations in this gene are associated with 3-beta-hydroxysteroid dehydrogenase, type II, deficiency. Alternatively spliced transcript variants have been found for this gene."<ref name=RefSeq3284>{{ cite web
|author=RefSeq
|title=HSD3B2 hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=October 2009
|url=http://www.ncbi.nlm.nih.gov/gene/3284
|accessdate=2024-06-10 }}</ref> It has a TATA box (CATAAAG) from -30 to -24 nts from the TSS.<ref name=Jin/>


===Gene ID: 6364===
===Gene ID: 3304===


===Gene ID: 6414===
===Gene ID: 3308===


===Gene ID: 6427===
The ''Drosophila hsp70'' has a TATA box containing promoter.<ref name=Burke>{{ cite journal
 
| author = Thomas W. Burke and James T. Kadonaga
===Gene ID: 6428===
| title = The downstream core promoter element, DPE, is conserved from ''Drosophila'' to humans and is recognized by TAF<sub>II</sub>60 of ''Drosophila''
 
| date = November 15, 1997
===Gene ID: 6435===
| journal = Genes & Development
| volume = 11
| issue = 22
| pages = 3020–31
| url=http://genesdev.cshlp.org/content/11/22/3020.long
| doi = 10.1101/gad.11.22.3020
| pmid = 9367984
| pmc = 316699 }}</ref> This suggests that Gene ID: 3308 HSPA4 heat shock 70kDa protein 4 [''Homo sapiens''], also known as hsp70,<ref name=HGNC3308>{{ cite web
|author=HGNC
|title=HSPA4 heat shock 70kDa protein 4 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=February 3, 2013
|url=http://www.ncbi.nlm.nih.gov/gene/3308
|accessdate=2013-02-07 }}</ref> has a TATA box in its [[core promoter]].


===Gene ID: 6436===
===Gene ID: 3309===


===Gene ID: 6446===
"The protein encoded by this gene is a member of the heat shock protein 70 (HSP70) family. This protein localizes to the lumen of the endoplasmic reticulum (ER) where it operates as a typical HSP70 chaperone involved in the folding and assembly of proteins in the ER and is a master regulator of ER homeostasis. During cellular stress, as during viral infection or tumorogenesis, this protein interacts with the transmembrane stress sensor proteins PERK (protein kinase R-like endoplasmic reticulum kinase), IRE1 (inositol-requiring kinase 1), and ATF6 (activating transcription factor 6) where it acts as a repressor of the unfolded protein response (UPR) and also plays a role in cellular apoptosis and senescence. Elevated expression and atypical translocation of this protein to the cell surface has been reported in viral infections and some types of cancer cells. At the cell surface this protein may facilitate viral attachment and entry to host cells. This gene is a therapeutic target for the treatment of coronavirus diseases and chemoresistant cancers."<ref name=RefSeq3309>{{ cite web
|author=RefSeq
|title=HSPA5 heat shock protein family A (Hsp70) member 5 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2020
|url=http://www.ncbi.nlm.nih.gov/gene/3309
|accessdate=2024-06-20 }}</ref>


===Gene ID: 6500===
===Gene ID: 3320===


===Gene ID: 6513===
"The protein encoded by this gene is an inducible molecular chaperone that functions as a homodimer. The encoded protein aids in the proper folding of specific target proteins by use of an ATPase activity that is modulated by co-chaperones. Two transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq3320>{{ cite web
|author=RefSeq
|title=HSP90AA1 heat shock protein 90 alpha family class A member 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=January 2012
|url=http://www.ncbi.nlm.nih.gov/gene/3320
|accessdate=2024-06-20 }}</ref>


===Gene ID: 6518===
===Gene ID: 3371===


"The protein encoded by this gene is a fructose transporter responsible for fructose uptake by the small intestine. The encoded protein also is necessary for the increase in blood pressure due to high dietary fructose consumption."<ref name=RefSeq6518>{{ cite web
"This gene encodes an extracellular matrix protein with a spatially and temporally restricted tissue distribution. This protein is homohexameric with disulfide-linked subunits, and contains multiple EGF-like and fibronectin type-III domains. It is implicated in guidance of migrating neurons as well as axons during development, synaptic plasticity, and neuronal regeneration."<ref name=RefSeq3371>{{ cite web
|author=RefSeq
|author=RefSeq
|title=SLC2A5 solute carrier family 2 member 5 [ Homo sapiens ]
|title=TNC tenascin C [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=June 2016
|date=July 2011
|url=http://www.ncbi.nlm.nih.gov/gene/6518
|url=http://www.ncbi.nlm.nih.gov/gene/3371
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAAA) from -33 to -27 nts from the TSS, Code score: 1.00; Matrix score: 0.92.<ref name=Jin/>
|accessdate=2024-06-20 }}</ref>


===Gene ID: 6519===
===Gene ID: 3375===


===Gene ID: 6548===
"This gene encodes a member of the calcitonin family of peptide hormones. This hormone is released from pancreatic beta cells following food intake to regulate blood glucose levels and act as a satiation signal. Human patients with type 1 and advanced type 2 diabetes exhibit reduced levels of the encoded hormone in blood and pancreas. This protein also exhibits a bactericidal, antimicrobial activity."<ref name=RefSeq3375>{{ cite web
|author=RefSeq
|title=IAPP islet amyloid polypeptide [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2016
|url=http://www.ncbi.nlm.nih.gov/gene/3375
|accessdate=2024-06-20 }}</ref>


"This gene encodes a Na+/H+ antiporter that is a member of the solute carrier family 9. The encoded protein is a plasma membrane transporter that is expressed in the kidney and intestine. This protein plays a central role in regulating pH homeostasis, cell migration and cell volume. This protein may also be involved in tumor growth."<ref name=RefSeq6548>{{ cite web
===Gene ID: 3383===
|author=RefSeq
|title=SLC9A1 solute carrier family 9 member A1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=September 2011
|url=http://www.ncbi.nlm.nih.gov/gene/6548
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAGT) from -32 to -26 nts from the TSS, Code score: 0.91; Matrix score: 0.84.<ref name=Jin/>


===Gene ID: 6566===
===Gene ID: 3397===


"The protein encoded by this gene is a proton-linked monocarboxylate transporter that catalyzes the movement of many monocarboxylates, such as lactate and pyruvate, across the plasma membrane. Mutations in this gene are associated with erythrocyte lactate transporter defect. Alternatively spliced transcript variants have been found for this gene."<ref name=RefSeq6566>{{ cite web
===Gene ID: 3458===
|author=RefSeq
|title=SLC16A1 solute carrier family 16 member 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=October 2009
|url=http://www.ncbi.nlm.nih.gov/gene/6566
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAGG) from -31 to -25 nts from the TSS, Code score: 0.91; Matrix score: 0.78.<ref name=Jin/>


===Gene ID: 6647===
===Gene ID: 3487===


===Gene ID: 6698===
===Gene ID: 3491===


"A structural constituent of skin epidermis. Involved in keratinocyte differentiation and peptide cross-linking. Located in cornified envelope."<ref name=AllianceofGenomeResources6698>{{ cite web
"The secreted protein encoded by this gene is growth factor-inducible and promotes the adhesion of endothelial cells. The encoded protein interacts with several integrins and with heparan sulfate proteoglycan. This protein also plays a role in cell proliferation, differentiation, angiogenesis, apoptosis, and extracellular matrix formation."<ref name=RefSeq3491>{{ cite web
|author=Alliance of Genome Resources
|author=RefSeq
|title=SPRR1A small proline rich protein 1A [ Homo sapiens ]
|title=CCN1 cellular communication network factor 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=ncbi.nlm.nih
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=Bethsda, Maryland, USA
|date=April 2022
|date=September 2011
|url=http://www.ncbi.nlm.nih.gov/gene/6698
|url=http://www.ncbi.nlm.nih.gov/gene/3491
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAAAG) from -30 to -23 nts from the TSS, Code score: 1.00; Matrix score: 0.91.<ref name=Jin/>
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -30 to -24 nts from the TSS.<ref name=Jin/>


===Gene ID: 6707===
===Gene ID: 3558===


===Gene ID: 6783===
===Gene ID: 3565===


===Gene ID: 6870===
"The protein encoded by this gene is a pleiotropic cytokine produced by activated T cells. This cytokine is a ligand for interleukin 4 receptor. The interleukin 4 receptor also binds to IL13, which may contribute to many overlapping functions of this cytokine and IL13. STAT6, a signal transducer and activator of transcription, has been shown to play a central role in mediating the immune regulatory signal of this cytokine. This gene, IL3, IL5, IL13, and CSF2 form a cytokine gene cluster on chromosome 5q, with this gene particularly close to IL13. This gene, IL13 and IL5 are found to be regulated coordinately by several long-range regulatory elements in an over 120 kilobase range on the chromosome. IL4 is considered an important cytokine for tissue repair, counterbalancing the effects of proinflammatory type 1 cytokines, however, it also promotes allergic airway inflammation. Moreover, IL-4, a type 2 cytokine, mediates and regulates a variety of human host responses such as allergic, anti-parasitic, wound healing, and acute inflammation. This cytokine has been reported to promote resolution of neutrophil-mediated acute lung injury. In an allergic response, IL-4 has an essential role in the production of allergen-specific immunoglobin (Ig) E. This pro-inflammatory cytokine has been observed to be increased in COVID-19 (Coronavirus disease 2019) patients, but is not necessarily associated with severe COVID-19 pathology. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported."<ref name=RefSeq3565>{{ cite web
|author=RefSeq
|title=IL4 interleukin 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2020
|url=http://www.ncbi.nlm.nih.gov/gene/3565
|accessdate=2024-06-20 }}</ref>


===Gene ID: 6916===
===Gene ID: 3593===


===Gene ID: 7037===
"This gene encodes a subunit of interleukin 12, a cytokine that acts on T and natural killer cells, and has a broad array of biological activities. Interleukin 12 is a disulfide-linked heterodimer composed of the 40 kD cytokine receptor like subunit encoded by this gene, and a 35 kD subunit encoded by IL12A. This cytokine is expressed by activated macrophages that serve as an essential inducer of Th1 cells development. This cytokine has been found to be important for sustaining a sufficient number of memory/effector Th1 cells to mediate long-term protection to an intracellular pathogen. Overexpression of this gene was observed in the central nervous system of patients with multiple sclerosis (MS), suggesting a role of this cytokine in the pathogenesis of the disease. The promoter polymorphism of this gene has been reported to be associated with the severity of atopic and non-atopic asthma in children."<ref name=RefSeq3593>{{ cite web
|author=RefSeq
|title=IL4 interleukin 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3593
|accessdate=2024-06-20 }}</ref>


===Gene ID: 7042===
===Gene ID: 3596===


===Gene ID: 7056===
"This gene encodes an immunoregulatory cytokine produced primarily by activated Th2 cells. This cytokine is involved in several stages of B-cell maturation and differentiation. It up-regulates CD23 and MHC class II expression, and promotes IgE isotype switching of B cells. This cytokine down-regulates macrophage activity, thereby inhibits the production of pro-inflammatory cytokines and chemokines. This cytokine is found to be critical to the pathogenesis of allergen-induced asthma but operates through mechanisms independent of IgE and eosinophils. This gene, IL3, IL5, IL4, and CSF2 form a cytokine gene cluster on chromosome 5q, with this gene particularly close to IL4."<ref name=RefSeq3596>{{ cite web
|author=RefSeq
|title=IL13 interleukin 13 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3596
|accessdate=2024-06-20 }}</ref>


===Gene ID: 7124===
===Gene ID: 3605===


===Gene ID: 7252===
===Gene ID: 3624===


===Gene ID: 7262===
===Gene ID: 3630===


===Gene ID: 7295===
"This gene encodes insulin, a peptide hormone that plays a vital role in the regulation of carbohydrate and lipid metabolism. After removal of the precursor signal peptide, proinsulin is post-translationally cleaved into three peptides: the B chain and A chain peptides, which are covalently linked via two disulfide bonds to form insulin, and C-peptide. Binding of insulin to the insulin receptor (INSR) stimulates glucose uptake. A multitude of mutant alleles with phenotypic effects have been identified, including insulin-dependent diabetes mellitus, permanent neonatal diabetes diabetes mellitus, maturity-onset diabetes of the young type 10 and hyperproinsulinemia. There is a read-through gene, INS-IGF2, which overlaps with this gene at the 5' region and with the IGF2 gene at the 3' region."<ref name=RefSeq3630>{{ cite web
|author=RefSeq
|title=INS insulin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=May 2020
|url=http://www.ncbi.nlm.nih.gov/gene/3630
|accessdate=2024-06-20 }}</ref>


===Gene ID: 7432===
===Gene ID: 3640===


===Gene ID: 7803===
===Gene ID: 3662===


===Gene ID: 7850===
===Gene ID: 3708===


===Gene ID: 8000===
"This gene encodes an intracellular receptor for inositol 1,4,5-trisphosphate. Upon stimulation by inositol 1,4,5-trisphosphate, this receptor mediates calcium release from the endoplasmic reticulum. Mutations in this gene cause spinocerebellar ataxia type 15, a disease associated with an heterogeneous group of cerebellar disorders. Multiple transcript variants have been identified for this gene."<ref name=RefSeq3708>{{ cite web
|author=RefSeq
|title=ITPR1 inositol 1,4,5-trisphosphate receptor type 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=November 2009
|url=http://www.ncbi.nlm.nih.gov/gene/3708
|accessdate=2024-06-20 }}</ref>


===Gene ID: 8431===
===Gene ID: 3726===


===Gene ID: 8942===
===Gene ID: 3758===


===Gene ID: 8991===
===Gene ID: 3848===


===Gene ID: 4327===
"The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the spinous and granular layers of the epidermis with family member KRT10 and mutations in these genes have been associated with bullous congenital ichthyosiform erythroderma. The type II cytokeratins are clustered in a region of chromosome 12q12-q13."<ref name=RefSeq3848>{{ cite web
|author=RefSeq
|title=KRT1 keratin 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3848
|accessdate=2024-06-20 }}</ref>


===Gene ID: 9001===
===Gene ID: 3858===


===Gene ID: 9518===
"This gene encodes a member of the type I (acidic) cytokeratin family, which belongs to the superfamily of intermediate filament (IF) proteins. Keratins are heteropolymeric structural proteins which form the intermediate filament. These filaments, along with actin microfilaments and microtubules, compose the cytoskeleton of epithelial cells. Mutations in this gene are associated with epidermolytic hyperkeratosis. This gene is located within a cluster of keratin family members on chromosome 17q21."<ref name=RefSeq3858>{{ cite web
|author=RefSeq
|title=KRT10 keratin 10 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3858
|accessdate=2024-06-20 }}</ref>


===Gene ID: 9643===
===Gene ID: 3859===


===Gene ID: 9768===
"KRT12 encodes the type I intermediate filament chain keratin 12, expressed in corneal epithelia. Mutations in this gene lead to Meesmann corneal dystrophy."<ref name=RefSeq3859>{{ cite web
|author=RefSeq
|title=KRT12 keratin 12 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/3859
|accessdate=2024-06-20 }}</ref>


===Gene ID: 10057===
===Gene ID: 3860===


===Gene ID: 10284===
===Gene ID: 3861===


===Gene ID: 10397===
===Gene ID: 3872===


===Gene ID: 10458===
===Gene ID: 3906===


===Gene ID: 10482===
===Gene ID: 3918===


===Gene ID: 10563===
"Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Laminins, composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively), have a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains. Each laminin chain is a multidomain protein encoded by a distinct gene. Several isoforms of each chain have been described. Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1. The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo. This gene encodes the gamma chain isoform laminin, gamma 2. The gamma 2 chain, formerly thought to be a truncated version of beta chain (B2t), is highly homologous to the gamma 1 chain; however, it lacks domain VI, and domains V, IV and III are shorter. It is expressed in several fetal tissues but differently from gamma 1, and is specifically localized to epithelial cells in skin, lung and kidney. The gamma 2 chain together with alpha 3 and beta 3 chains constitute laminin 5 (earlier known as kalinin), which is an integral part of the anchoring filaments that connect epithelial cells to the underlying basement membrane. The epithelium-specific expression of the gamma 2 chain implied its role as an epithelium attachment molecule, and mutations in this gene have been associated with junctional epidermolysis bullosa, a skin disease characterized by blisters due to disruption of the epidermal-dermal junction. Two transcript variants resulting from alternative splicing of the 3' terminal exon, and encoding different isoforms of gamma 2 chain, have been described. The two variants are differentially expressed in embryonic tissues, however, the biological significance of the two forms is not known. Transcript variants utilizing alternative polyA_signal have also been noted in literature."<ref name=RefSeq3918>{{ cite web
|author=RefSeq
|title=LAMC2 laminin subunit gamma 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2011
|url=http://www.ncbi.nlm.nih.gov/gene/3918
|accessdate=2024-06-10 }}</ref> It has a TATA box (GATAAAA) from -33 to -27 nts from the TSS.<ref name=Jin/>


===Gene ID: 10631===
===Gene ID: 3938===


===Gene ID: 10761===
===Gene ID: 3976===


===Gene ID: 10761===
===Gene ID: 3977===


===Gene ID: 10930===
"This gene encodes a protein that belongs to the type I cytokine receptor family. This protein combines with a high-affinity converter subunit, gp130, to form a receptor complex that mediates the action of the leukemia inhibitory factor, a polyfunctional cytokine that is involved in cellular differentiation, proliferation and survival in the adult and the embryo. Mutations in this gene cause Schwartz-Jampel syndrome type 2, a disease belonging to the group of the bent-bone dysplasias. A translocation that involves the promoter of this gene, t(5;8)(p13;q12) with the pleiomorphic adenoma gene 1, is associated with salivary gland pleiomorphic adenoma, a common type of benign epithelial tumor of the salivary gland. Multiple splice variants encoding two different isoforms have been found for this gene."<ref name=RefSeq3977>{{ cite web
|author=RefSeq
|title=LIFR LIF receptor subunit alpha [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=June 2018
|url=http://www.ncbi.nlm.nih.gov/gene/3977
|accessdate=2024-06-10 }}</ref>


===Gene ID: 10938===
===Gene ID: 3995===


===Gene ID: 11081===
TATA box is TATAA.<ref name=Suzuki/>


===Gene ID: 22928===
The protein encoded by this FADS3 fatty acid desaturase 3 gene is "a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization."<ref name=HGNC3995>{{ cite web
|author=HGNC:3576
|title=FADS3 fatty acid desaturase 3 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=June 6, 2024
|url=http://www.ncbi.nlm.nih.gov/gene/3995
|accessdate=2024-06-07 }}</ref>


===Gene ID: 22943===
===Gene ID: 4014===


===Gene ID: 25928===
"This gene encodes loricrin, a major protein component of the cornified cell envelope found in terminally differentiated epidermal cells. Mutations in this gene are associated with Vohwinkel's syndrome and progressive symmetric erythrokeratoderma, both inherited skin diseases."<ref name=RefSeq4014>{{ cite web
 
|author=RefSeq
===Gene ID: 26827===
|title=LORICRIN loricrin cornified envelope precursor protein [ Homo sapiens ]
 
RNU6-1 RNA, U6 small nuclear 1 [ Homo sapiens (human) ] is also known as U6 or U6-1.<ref name=HGNC26827>{{ cite web
|author=HGNC:10227
|title=RNU6-1 RNA, U6 small nuclear 1 [ Homo sapiens (human) ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=May 13, 2024
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/26827
|url=http://www.ncbi.nlm.nih.gov/gene/4014
|accessdate=2024-06-08 }}</ref> In the flanking region 5' to the gene[U6], there is a Hogness box sequence TATAAAT beginning at position -31 which is boxed in ...."<ref name=Ohshima>{{ cite journal
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATATATAA) from -40 to -32 nts from the TSS.<ref name=Jin/>
|author=Y Ohshima, N Okada, T Tani, Y Itoh, and M Itoh
|title=Nucleotide sequences of mouse genomic loci including a gene or pseudogene for U6 (4.8S) nuclear RNA
|journal=Nucleic Acids Research
|date=10 October 1981
|volume=9
|issue=19
|pages=5145-5158
|url=https://www.ncbi.nlm.nih.gov/pmc/articles/PMC327505/
|arxiv=
|bibcode=
|doi=10.1093/nar/9.19.5145
|pmid=6171774
|accessdate=2024-06-08 }}</ref>


===Gene ID: 26576===
===Gene ID: 4025===


===Gene ID: 26827===
===Gene ID: 4049===


===Gene ID: 27129===
===Gene ID: 4144===


===Gene ID: 27316===
===Gene ID: 4222===


===Gene ID: 51155===
===Gene ID: 4225===


===Gene ID: 51203===
===Gene ID: 4254===


===Gene ID: 51297===
===Gene ID: 4256===


===Gene ID: 51313===
===Gene ID: 4284===


===Gene ID: 51582===
"Major intrinsic protein is a member of the water-transporting aquaporins as well as the original member of the MIP family of channel proteins. The function of the fiber cell membrane protein encoded by this gene is undetermined, yet this protein is speculated to play a role in intracellular communication. The MIP protein is expressed in the ocular lens and is required for correct lens function. This gene has been mapped among aquaporins AQP2, AQP5, and AQP6, in a potential gene cluster at 12q13."<ref name=RefSeq4284>{{ cite web
|author=RefSeq
|title=MIP major intrinsic protein of lens fiber [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/4284
|accessdate=2024-06-20 }}</ref>


===Gene ID: 55118===
===Gene ID: 4316===


===Gene ID: 55504===
===Gene ID: 4319===


===Gene ID: 56642===
===Gene ID: 4321===


===Gene ID: 56895===
===Gene ID: 4322===


===Gene ID: 57126===
===Gene ID: 4327===


===Gene ID: 57152===
===Gene ID: 4357===


===Gene ID: 57579===
===Gene ID: 4435===


===Gene ID: 80740===
===Gene ID: 4501===


===Gene ID: 81606===
===Gene ID: 4504===


===Gene ID: 83638===
===Gene ID: 4582===


===Gene ID: 84223===
"This gene encodes a membrane-bound protein that is a member of the mucin family. Mucins are O-glycosylated proteins that play an essential role in forming protective mucous barriers on epithelial surfaces. These proteins also play a role in intracellular signaling. This protein is expressed on the apical surface of epithelial cells that line the mucosal surfaces of many different tissues including lung, breast stomach and pancreas. This protein is proteolytically cleaved into alpha and beta subunits that form a heterodimeric complex. The N-terminal alpha subunit functions in cell-adhesion and the C-terminal beta subunit is involved in cell signaling. Overexpression, aberrant intracellular localization, and changes in glycosylation of this protein have been associated with carcinomas. This gene is known to contain a highly polymorphic variable number tandem repeats (VNTR) domain. Alternate splicing results in multiple transcript variants."<ref name=RefSeq4582>{{ cite web
|author=RefSeq
|title=MUC1 mucin 1, cell surface associated [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=February 2011
|url=http://www.ncbi.nlm.nih.gov/gene/4582
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -24 to -18 nts from the TSS.<ref name=Jin/>


===Gene ID: 92736===
===Gene ID: 4609===


===Gene ID: 115265===
===Gene ID: 4616===


===Gene ID: 117156===
===Gene ID: 4618===


===Gene ID: 126364===
===Gene ID: 4624===


===Gene ID: 145957===
===Gene ID: 4629===


===Gene ID: 153020===
===Gene ID: 4632===


===Gene ID: 200504===
"Myosin is a hexameric ATPase cellular motor protein. It is composed of two heavy chains, two nonphosphorylatable alkali light chains, and two phosphorylatable regulatory light chains. This gene encodes a myosin alkali light chain expressed in fast skeletal muscle. Two transcript variants have been identified for this gene."<ref name=RefSeq4632>{{ cite web
|author=RefSeq
|title=MYL1 myosin light chain 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/4632
|accessdate=2024-06-10 }}</ref>


===Gene ID: 200539===
===Gene ID: 4638===
 
===Gene ID: 4653===
 
"MYOC encodes the protein myocilin, which is believed to have a role in cytoskeletal function. MYOC is expressed in many occular tissues, including the trabecular meshwork, and was revealed to be the trabecular meshwork glucocorticoid-inducible response protein (TIGR). The trabecular meshwork is a specialized eye tissue essential in regulating intraocular pressure, and mutations in MYOC have been identified as the cause of hereditary juvenile-onset open-angle glaucoma."<ref name=RefSeq4653>{{ cite web
|author=RefSeq
|title=MYOC myocilin [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/4653
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATATATAAAC) from -31 to -21 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 4741===
 
===Gene ID: 4744===
 
===Gene ID: 4747===
 
===Gene ID: 4842===
 
===Gene ID: 4843===
 
===Gene ID: 4856===
 
===Gene ID: 4869===
 
"The protein encoded by this gene is involved in several cellular processes, including centrosome duplication, protein chaperoning, and cell proliferation. The encoded phosphoprotein shuttles between the nucleolus, nucleus, and cytoplasm, chaperoning ribosomal proteins and core histones from the nucleus to the cytoplasm. This protein is also known to sequester the tumor suppressor ARF in the nucleolus, protecting it from degradation until it is needed. Mutations in this gene are associated with acute myeloid leukemia. Dozens of pseudogenes of this gene have been identified."<ref name=RefSeq869>{{ cite web
|author=RefSeq
|title=NPM1 nucleophosmin 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=August 2017
|url=http://www.ncbi.nlm.nih.gov/gene/4869
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 4878===
 
"The protein encoded by this gene belongs to the natriuretic peptide family. Natriuretic peptides are implicated in the control of extracellular fluid volume and electrolyte homeostasis. This protein is synthesized as a large precursor (containing a signal peptide), which is processed to release a peptide from the N-terminus with similarity to vasoactive peptide, cardiodilatin, and another peptide from the C-terminus with natriuretic-diuretic activity. Mutations in this gene have been associated with atrial fibrillation familial type 6. This gene is located adjacent to another member of the natriuretic family of peptides on chromosome 1."<ref name=RefSeq4878>{{ cite web
|author=RefSeq
|title=NPPA natriuretic peptide A [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=October 2015
|url=http://www.ncbi.nlm.nih.gov/gene/4878
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAAAG) from -30 to -22 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 4922===
 
===Gene ID: 4946===
 
===Gene ID: 4982===
 
===Gene ID: 5004===
 
===Gene ID: 5005===
 
===Gene ID: 5016===
 
"This gene encodes a large, carbohydrate-rich, epithelial glycoprotein with numerous O-glycosylation sites located within threonine, serine, and proline-rich tandem repeats. The gene is similar to members of the mucin and the glycosyl hydrolase 18 gene families. Regulation of expression may be estrogen-dependent. Gene expression and protein secretion occur during late follicular development through early cleavage-stage embryonic development. The protein is secreted from non-ciliated oviductal epithelial cells and associates with ovulated oocytes, blastomeres, and spermatozoan acrosomal regions."<ref name=RefSeq5016>{{ cite web
|author=RefSeq
|title=OVGP1 oviductal glycoprotein 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/5016
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -25 to -19 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 5033===
 
===Gene ID: 5034===
 
===Gene ID: 5052===
 
"This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in cells, and may contribute to the antiviral activity of CD8(+) T-cells. This protein may have a proliferative effect and play a role in cancer development or progression. Four transcript variants encoding the same protein have been identified for this gene."<ref name=RefSeq5052>{{ cite web
|author=RefSeq
|title=PRDX1 peroxiredoxin 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=January 2011
|url=http://www.ncbi.nlm.nih.gov/gene/5052
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -31 to -25 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 5054===
 
===Gene ID: 5055===
 
===Gene ID: 5068===
 
===Gene ID: 5069===
 
===Gene ID: 5079===
 
===Gene ID: 5132===
 
"This gene encodes a phosphoprotein, which is located in the outer and inner segments of the rod cells in the retina. This protein may participate in the regulation of visual phototransduction or in the integration of photoreceptor metabolism. It modulates the phototransduction cascade by interacting with the beta and gamma subunits of the retinal G-protein transducin. This gene is a potential candidate gene for retinitis pigmentosa and Usher syndrome type II. Alternatively spliced transcript variants encoding different isoforms have been identified."<ref name=RefSeq5132>{{ cite web
|author=RefSeq
|title=PDC phosducin [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/5132
|accessdate=2024-06-10 }}</ref> It has a TATA box (TTTAAAT) from -32 to -26 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 5155===
 
===Gene ID: 5224===
 
===Gene ID: 5225===
 
===Gene ID: 5275===
 
===Gene ID: 5360===
 
===Gene ID: 5406===
 
===Gene ID: 5408===
 
===Gene ID: 5443===
 
===Gene ID: 5449===
 
"This gene encodes a member of the POU family of transcription factors that regulate mammalian development. The protein regulates expression of several genes involved in pituitary development and hormone expression. Mutations in this genes result in combined pituitary hormone deficiency. Multiple transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq5449>{{ cite web
|author=RefSeq
|title=POU1F1 POU class 1 homeobox 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/5449
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 5478===
 
===Gene ID: 5514===
 
===Gene ID: 5553===
 
===Gene ID: 5610===
 
===Gene ID: 5617===
 
===Gene ID: 5645===
 
===Gene ID: 5650===
 
===Gene ID: 5741===
 
===Gene ID: 5743===
 
"Prostaglandin-endoperoxide synthase (PTGS), also known as cyclooxygenase, is the key enzyme in prostaglandin biosynthesis, and acts both as a dioxygenase and as a peroxidase. There are two isozymes of PTGS: a constitutive PTGS1 and an inducible PTGS2, which differ in their regulation of expression and tissue distribution. This gene encodes the inducible isozyme. It is regulated by specific stimulatory events, suggesting that it is responsible for the prostanoid biosynthesis involved in inflammation and mitogenesis."<ref name=RefSeq5743>{{ cite web
|author=RefSeq
|title=PTGS2 prostaglandin-endoperoxide synthase 2 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=February 2009
|url=http://www.ncbi.nlm.nih.gov/gene/5743
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAA) from -31 to -25 nts from the TSS.<ref name=Jin/>
 
"[T]he human ... prostaglandin-endoperoxide-synthase-2 [gene contains] a canonical TATA box (nucleotide residues at positions -31 to -25 for the human gene)."<ref name=Kosaka>{{ cite journal
|author=Tetsuya Kosaka, Atsuro Miyata, Hayato Ihara, Shuntaro Hara, Tamiko Sugimoto, Osamu Takeda, Ei-ichi Takahashi, Tadashi Tanabe
|title=Characterization of the human gene (PTGS2) encoding prostaglandin‐endoperoxide synthase 2
|journal=European Journal of Biochemistry
|date=May 1994
|volume=221
|issue=3
|pages=889-97
|url=http://onlinelibrary.wiley.com/doi/10.1111/j.1432-1033.1994.tb18804.x/full
|arxiv=
|bibcode=
|doi=10.1111/j.1432-1033.1994.tb18804.x
|pmid=
|accessdate=2012-12-26 }}</ref> This is Gene ID: 5743.
 
===Gene ID: 5744===
 
===Gene ID: 5950===
 
===Gene ID: 5956===
 
===Gene ID: 5967===
 
===Gene ID: 5996===
 
"This gene encodes a member of the regulator of G-protein signalling family. This protein is located on the cytosolic side of the plasma membrane and contains a conserved, 120 amino acid motif called the RGS domain. The protein attenuates the signalling activity of G-proteins by binding to activated, GTP-bound G alpha subunits and acting as a GTPase activating protein (GAP), increasing the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal."<ref name=RefSeq5996>{{ cite web
|author=RefSeq
|title=RGS1 regulator of G protein signaling 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/5996
|accessdate=2024-06-10 }}</ref> It has a TATA box (TATAAAG) from -28 to -22 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 5997===
 
"Regulator of G protein signaling (RGS) family members are regulatory molecules that act as GTPase activating proteins (GAPs) for G alpha subunits of heterotrimeric G proteins. RGS proteins are able to deactivate G protein subunits of the Gi alpha, Go alpha and Gq alpha subtypes. They drive G proteins into their inactive GDP-bound forms. Regulator of G protein signaling 2 belongs to this family. The protein acts as a mediator of myeloid differentiation and may play a role in leukemogenesis."<ref name=RefSeq5997>{{ cite web
|author=RefSeq
|title=RGS2 regulator of G protein signaling 2 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=August 2009
|url=http://www.ncbi.nlm.nih.gov/gene/5997
|accessdate=2024-06-11 }}</ref> It has a TATA box (CATAAAT) from -28 to -22 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 6046===
 
===Gene ID: 6121===
 
"The protein encoded by this gene is a component of the vitamin A visual cycle of the retina which supplies the 11-cis retinal chromophore of the photoreceptors opsin visual pigments. It is a member of the carotenoid cleavage oxygenase superfamily. All members of this superfamily are non-heme iron oxygenases with a seven-bladed propeller fold and oxidatively cleave carotenoid carbon:carbon double bonds. However, the protein encoded by this gene has acquired a divergent function that involves the concerted O-alkyl ester cleavage of its all-trans retinyl ester substrate and all-trans to 11-cis double bond isomerization of the retinyl moiety. As such, it performs the essential enzymatic isomerization step in the synthesis of 11-cis retinal. Mutations in this gene are associated with early-onset severe blinding disorders such as Leber congenital."<ref name=RefSeq6121>{{ cite web
|author=RefSeq
|title=RPE65 retinoid isomerohydrolase RPE65 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=October 2017
|url=http://www.ncbi.nlm.nih.gov/gene/6121
|accessdate=2024-06-11 }}</ref> It has a TATA box (CATAAAA) from -27 to -21 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 6175===
 
===Gene ID: 6187===
 
===Gene ID: 6224===
 
===Gene ID: 6232===
 
"Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of four RNA species and approximately 80 structurally distinct proteins. This gene encodes a member of the S27e family of ribosomal proteins and component of the 40S subunit. The encoded protein contains a C4-type zinc finger domain that can bind to zinc and may bind to nucleic acid. Mutations in this gene have been identified in numerous melanoma patients and in at least one patient with Diamond-Blackfan anemia (DBA). Elevated expression of this gene has been observed in various human cancers. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome."<ref name=RefSeq6232>{{ cite web
|author=RefSeq
|title=RPS27 ribosomal protein S27 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=July 2018
|url=http://www.ncbi.nlm.nih.gov/gene/6232
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATATAA) from -29 to -23 nts from the TSS.<ref name=Jin/>
 
===Gene ID: 6279===
 
"The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in the inhibition of casein kinase and as a cytokine. Altered expression of this protein is associated with the disease cystic fibrosis. Multiple transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq6279>{{ cite web
|author=RefSeq
|title=S100A8 S100 calcium binding protein A8 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=January 2016
|url=http://www.ncbi.nlm.nih.gov/gene/6279
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAAA) from -30 to -24 nts from the TSS, Code score: 1.00; Matrix score: 0.93.<ref name=Jin/>
 
===Gene ID: 6280===
 
===Gene ID: 6288===
 
===Gene ID: 6289===
 
===Gene ID: 6349===
 
===Gene ID: 6351===
 
===Gene ID: 6352===
 
===Gene ID: 6354===
 
===Gene ID: 6356===
 
===Gene ID: 6357===
 
===Gene ID: 6364===
 
"This antimicrobial gene belongs to the subfamily of small cytokine CC genes. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The protein encoded by this gene displays chemotactic activity for lymphocytes and can repress proliferation of myeloid progenitors. Two transcript variants encoding different isoforms have been found for this gene."<ref name=RefSeq5449>{{ cite web
|author=RefSeq
|title=CCL20 C-C motif chemokine ligand 20 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2014
|url=http://www.ncbi.nlm.nih.gov/gene/5449
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 6376===
 
===Gene ID: 6414===
 
===Gene ID: 6427===
 
===Gene ID: 6428===
 
===Gene ID: 6432===
 
===Gene ID: 6435===
 
===Gene ID: 6436===
 
===Gene ID: 6446===
 
===Gene ID: 6500===
 
===Gene ID: 6513===
 
===Gene ID: 6518===
 
"The protein encoded by this gene is a fructose transporter responsible for fructose uptake by the small intestine. The encoded protein also is necessary for the increase in blood pressure due to high dietary fructose consumption."<ref name=RefSeq6518>{{ cite web
|author=RefSeq
|title=SLC2A5 solute carrier family 2 member 5 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=June 2016
|url=http://www.ncbi.nlm.nih.gov/gene/6518
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAAA) from -33 to -27 nts from the TSS, Code score: 1.00; Matrix score: 0.92.<ref name=Jin/>
 
===Gene ID: 6519===
 
===Gene ID: 6548===
 
"This gene encodes a Na+/H+ antiporter that is a member of the solute carrier family 9. The encoded protein is a plasma membrane transporter that is expressed in the kidney and intestine. This protein plays a central role in regulating pH homeostasis, cell migration and cell volume. This protein may also be involved in tumor growth."<ref name=RefSeq6548>{{ cite web
|author=RefSeq
|title=SLC9A1 solute carrier family 9 member A1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=September 2011
|url=http://www.ncbi.nlm.nih.gov/gene/6548
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAGT) from -32 to -26 nts from the TSS, Code score: 0.91; Matrix score: 0.84.<ref name=Jin/>
 
===Gene ID: 6559===
 
===Gene ID: 6566===
 
"The protein encoded by this gene is a proton-linked monocarboxylate transporter that catalyzes the movement of many monocarboxylates, such as lactate and pyruvate, across the plasma membrane. Mutations in this gene are associated with erythrocyte lactate transporter defect. Alternatively spliced transcript variants have been found for this gene."<ref name=RefSeq6566>{{ cite web
|author=RefSeq
|title=SLC16A1 solute carrier family 16 member 1 [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=October 2009
|url=http://www.ncbi.nlm.nih.gov/gene/6566
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAGG) from -31 to -25 nts from the TSS, Code score: 0.91; Matrix score: 0.78.<ref name=Jin/>
 
===Gene ID: 6624===
 
===Gene ID: 6647===
 
===Gene ID: 6698===
 
"A structural constituent of skin epidermis. Involved in keratinocyte differentiation and peptide cross-linking. Located in cornified envelope."<ref name=AllianceofGenomeResources6698>{{ cite web
|author=Alliance of Genome Resources
|title=SPRR1A small proline rich protein 1A [ Homo sapiens ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/6698
|accessdate=2024-06-11 }}</ref> It has a TATA box (TATAAAAG) from -30 to -23 nts from the TSS, Code score: 1.00; Matrix score: 0.91.<ref name=Jin/>
 
===Gene ID: 6707===
 
===Gene ID: 6781===
 
===Gene ID: 6783===
 
===Gene ID: 6870===
 
===Gene ID: 6906===
 
===Gene ID: 6916===
 
===Gene ID: 6988===
 
===Gene ID: 7021===
 
===Gene ID: 7031===
 
===Gene ID: 7032===
 
"Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer and affect healing of the epithelium. The encoded protein inhibits gastric acid secretion. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21."<ref name=RefSeq7032>{{ cite web
|author=RefSeq
|title=TFF2 trefoil factor 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/7032
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 7037===
 
===Gene ID: 7038===
 
===Gene ID: 7042===
 
===Gene ID: 7043===
 
===Gene ID: 7056===
 
===Gene ID: 7124===
 
===Gene ID: 7167===
 
===Gene ID: 7252===
 
===Gene ID: 7262===
 
===Gene ID: 7276===
 
===Gene ID: 7280===
 
===Gene ID: 7295===
 
===Gene ID: 7306===
 
===Gene ID: 7316===
 
===Gene ID: 7369===
 
===Gene ID: 7432===
 
===Gene ID: 7803===
 
"This gene encodes a member of a small class of prenylated protein tyrosine phosphatases (PTPs), which contain a PTP domain and a characteristic C-terminal prenylation motif. The encoded protein is a cell signaling molecule that plays regulatory roles in a variety of cellular processes, including cell proliferation and migration. The protein may also be involved in cancer development and metastasis. This tyrosine phosphatase is a nuclear protein, but may associate with plasma membrane by means of its prenylation motif. Pseudogenes related to this gene are located on chromosomes 1, 2, 5, 7, 11 and X."<ref name=RefSeq7803>{{ cite web
|author=RefSeq
|title=PTP4A1 protein tyrosine phosphatase 4A1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=June 2013
|url=http://www.ncbi.nlm.nih.gov/gene/7803
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 7850===
 
===Gene ID: 7852===
 
===Gene ID: 8000===
 
===Gene ID: 8288===
 
===Gene ID: 8339===
 
===Gene ID: 8431===
 
===Gene ID: 8483===
 
===Gene ID: 8490===
 
===Gene ID: 8513===
 
===Gene ID: 8564===
 
===Gene ID: 8832===
 
===Gene ID: 8942===
 
===Gene ID: 8969===
 
"Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the histone microcluster on chromosome 6p21.33."<ref name=RefSeq8969>{{ cite web
|author=RefSeq
|title=H2AC11 H2A clustered histone 11 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2015
|url=http://www.ncbi.nlm.nih.gov/gene/8969
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 8970===
 
===Gene ID: 8991===
 
===Gene ID: 9001===
 
===Gene ID: 9421===
 
===Gene ID: 9510===
 
===Gene ID: 9518===
 
===Gene ID: 9607===
 
===Gene ID: 9643===
 
===Gene ID: 9709===
 
===Gene ID: 9768===
 
"Enables chromatin binding activity. Involved in several processes, including cellular macromolecule biosynthetic process; centrosome cycle; and response to UV. Located in centrosome; nucleus; and perinuclear region of cytoplasm."<ref name=AllianceofGenomeResources9768>{{ cite web
|author=Alliance of Genome Resources
|title=PCLAF PCNA clamp associated factor [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/9768
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 9921===
 
===Gene ID: 10057===
 
===Gene ID: 10215===
 
===Gene ID: 10284===
 
===Gene ID: 10350===
 
===Gene ID: 10397===
 
===Gene ID: 10458===
 
===Gene ID: 10482===
 
===Gene ID: 10563===
 
===Gene ID: 10631===
 
"This gene encodes a secreted extracellular matrix protein that functions in tissue development and regeneration, including wound healing, and ventricular remodeling following myocardial infarction. The encoded protein binds to integrins to support adhesion and migration of epithelial cells. This protein plays a role in cancer stem cell maintenance and metastasis. Mice lacking this gene exhibit cardiac valve disease, and skeletal and dental defects. Alternative splicing results in multiple transcript variants encoding different isoforms."<ref name=RefSeq10631>{{ cite web
|author=RefSeq
|title=POSTN periostin [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2015
|url=http://www.ncbi.nlm.nih.gov/gene/10631
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 10761===
 
===Gene ID: 10769===
 
===Gene ID: 10912===
 
===Gene ID: 10930===
 
===Gene ID: 10938===
 
===Gene ID: 11009===
 
===Gene ID: 11067===
 
===Gene ID: 11081===
 
===Gene ID: 11082===
 
===Gene ID: 11169===
 
===Gene ID: 12723===
 
===Gene ID: 22928===
 
===Gene ID: 22943===
 
===Gene ID: 23450===
 
===Gene ID: 23645===
 
===Gene ID: 25928===
 
===Gene ID: 26827===
 
RNU6-1 RNA, U6 small nuclear 1 [ Homo sapiens (human) ] is also known as U6 or U6-1.<ref name=HGNC26827>{{ cite web
|author=HGNC:10227
|title=RNU6-1 RNA, U6 small nuclear 1 [ Homo sapiens (human) ]
|publisher=National Center for Biotechnology Information, U.S. National Library of Medicine
|location=8600 Rockville Pike, Bethesda MD, 20894 USA
|date=May 13, 2024
|url=http://www.ncbi.nlm.nih.gov/gene/26827
|accessdate=2024-06-08 }}</ref> In the flanking region 5' to the gene[U6], there is a Hogness box sequence TATAAAT beginning at position -31 which is boxed in ...."<ref name=Ohshima>{{ cite journal
|author=Y Ohshima, N Okada, T Tani, Y Itoh, and M Itoh
|title=Nucleotide sequences of mouse genomic loci including a gene or pseudogene for U6 (4.8S) nuclear RNA
|journal=Nucleic Acids Research
|date=10 October 1981
|volume=9
|issue=19
|pages=5145-5158
|url=https://www.ncbi.nlm.nih.gov/pmc/articles/PMC327505/
|arxiv=
|bibcode=
|doi=10.1093/nar/9.19.5145
|pmid=6171774
|accessdate=2024-06-08 }}</ref>
 
===Gene ID: 26576===
 
===Gene ID: 26827===
 
===Gene ID: 27063===
 
===Gene ID: 27106===
 
===Gene ID: 27129===
 
===Gene ID: 27159===
 
===Gene ID: 27316===
 
===Gene ID: 50943===
 
===Gene ID: 51050===
 
===Gene ID: 51129===
 
===Gene ID: 51155===
 
===Gene ID: 51203===
 
===Gene ID: 51278===
 
===Gene ID: 51297===
 
===Gene ID: 51313===
 
===Gene ID: 51582===
 
"The protein encoded by this gene belongs to the antizyme inhibitor family, which plays a role in cell growth and proliferation by maintaining polyamine homeostasis within the cell. Antizyme inhibitors are homologs of ornithine decarboxylase (ODC, the key enzyme in polyamine biosynthesis) that have lost the ability to decarboxylase ornithine; however, retain the ability to bind to antizymes. Antizymes negatively regulate intracellular polyamine levels by binding to ODC and targeting it for degradation, as well as by inhibiting polyamine uptake. Antizyme inhibitors function as positive regulators of polyamine levels by sequestering antizymes and neutralizing their effect. This gene encodes antizyme inhibitor 1, the first member of this gene family that is ubiquitously expressed, and is localized in the nucleus and cytoplasm. Overexpression of antizyme inhibitor 1 gene has been associated with increased proliferation, cellular transformation and tumorigenesis. Gene knockout studies showed that homozygous mutant mice lacking functional antizyme inhibitor 1 gene died at birth with abnormal liver morphology. RNA editing of this gene, predominantly in the liver tissue, has been linked to the progression of hepatocellular carcinoma. Alternatively spliced transcript variants have been described for this gene."<ref name=RefSeq51582>{{ cite web
|author=RefSeq
|title=AZIN1 antizyme inhibitor 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2014
|url=http://www.ncbi.nlm.nih.gov/gene/51582
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 51738===
 
===Gene ID: 54106===
 
===Gene ID: 54567===
 
===Gene ID: 55118===
 
===Gene ID: 55504===
 
===Gene ID: 55603===
 
===Gene ID: 56642===
 
===Gene ID: 56675===
 
===Gene ID: 56829===
 
===Gene ID: 56895===
 
===Gene ID: 56987===
 
===Gene ID: 57126===
 
===Gene ID: 57152===
 
"The protein encoded by this gene is a member of the Ly6/uPAR family but lacks a GPI-anchoring signal sequence. It is thought that this secreted protein contains antitumor activity. Mutations in this gene have been associated with Mal de Meleda, a rare autosomal recessive skin disorder. This gene maps to the same chromosomal region as several members of the Ly6/uPAR family of glycoprotein receptors."<ref name=RefSeq57152>{{ cite web
|author=RefSeq
|title=SLURP1 secreted LY6/PLAUR domain containing 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/57152
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 57579===
 
===Gene ID: 57626===
 
===Gene ID: 57823===
 
"Enables identical protein binding activity. Predicted to be involved in adaptive immune response. Predicted to act upstream of or within regulation of natural killer cell activation. Located in endoplasmic reticulum."<ref name=AllianceofGenomeResources57823>{{ cite web
|author=Alliance of Genome Resources
|title=SLAMF7 SLAM family member 7 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/57823
|accessdate=2024-06-24 }}</ref>
 
===Gene ID: 64111===
 
"Predicted to enable signaling receptor binding activity. Involved in negative regulation of gonadotropin secretion. Predicted to be located in extracellular region."<ref name=AllianceofGenomeResources64111>{{ cite web
|author=Alliance of Genome Resources
|title=NPVF neuropeptide VF precursor [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/64111
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 65108===
 
"This gene encodes a member of the myristoylated alanine-rich C-kinase substrate (MARCKS) family. Members of this family play a role in cytoskeletal regulation, protein kinase C signaling and calmodulin signaling. The encoded protein affects the formation of adherens junction. Alternative splicing results in multiple transcript variants. Pseudogenes of this gene are located on the long arm of chromosomes 6 and 10."<ref name=RefSeq65108>{{ cite web
|author=RefSeq
|title=MARCKSL1 MARCKS like 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=June 2012
|url=http://www.ncbi.nlm.nih.gov/gene/65108
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 79733===
 
"This gene encodes a member of a family of transcription factors which regulate the expression of genes required for progression through the cell cycle. The encoded protein regulates progression from G1 to S phase by ensuring the nucleus divides at the proper time. Multiple alternatively spliced variants, encoding the same protein, have been identified."<ref name=RefSeq79733>{{ cite web
|author=RefSeq
|title=E2F8 E2F transcription factor 8 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=January 2012
|url=http://www.ncbi.nlm.nih.gov/gene/79733
|accessdate=2024-06-24 }}</ref>
 
===Gene ID: 80177===
 
"Predicted to act upstream of or within hematopoietic stem cell homeostasis. Located in nucleoplasm."<ref name=AllianceofGenomeResources80177>{{ cite web
|author=Alliance of Genome Resources
|title=MYCT1 MYC target 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/80177
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 80740===
 
"LY6G6C belongs to a cluster of leukocyte antigen-6 (LY6) genes located in the major histocompatibility complex (MHC) class III region on chromosome 6. Members of the LY6 superfamily typically contain 70 to 80 amino acids, including 8 to 10 cysteines. Most LY6 proteins are attached to the cell surface by a glycosylphosphatidylinositol (GPI) anchor that is directly involved in signal transduction (Mallya et al., 2002 [PubMed 12079290])."<ref name=OMIM80740>{{ cite web
|author=OMIM
|title=LY6G6C lymphocyte antigen 6 family member G6C [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=March 2008
|url=http://www.ncbi.nlm.nih.gov/gene/80740
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 81285===
 
"Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms."<ref name=RefSeq81285>{{ cite web
|author=RefSeq
|title=OR51E2 olfactory receptor family 51 subfamily E member 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/81285
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 81606===
 
"Involved in negative regulation of transcription, DNA-templated; positive regulation of transcription, DNA-templated; and regulation of MAPK cascade. Located in cytoplasm and nucleus. Part of protein-containing complex."<ref name=AllianceofGenomeResources81606>{{ cite web
|author=Alliance of Genome Resources
|title=LBH LBH regulator of WNT signaling pathway [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/81606
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 83638===
 
"Enables RNA binding activity."<ref name=AllianceofGenomeResources83638>{{ cite web
|author=Alliance of Genome Resources
|title=C11orf68 chromosome 11 open reading frame 68 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/83638
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 83998===
 
"Enables heparin binding activity and mannan binding activity. Predicted to act upstream of or within response to bacterium. Located in cytoplasm."<ref name=AllianceofGenomeResources83998>{{ cite web
|author=Alliance of Genome Resources
|title=REG4 regenerating family member 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/83998
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 84107===
 
"This gene encodes a member of the ZIC family of C2H2-type zinc finger proteins. Members of this family are important during development, and have been associated with X-linked visceral heterotaxy and holoprosencephaly type 5. This gene is closely linked to the gene encoding zinc finger protein of the cerebellum 1, a related family member on chromosome 3. Heterozygous deletion of these linked genes is involved in Dandy-Walker malformation, which is a congenital cerebellar malformation. Multiple transcript variants have been identified for this gene."<ref name=RefSeq84107>{{ cite web
|author=RefSeq
|title=ZIC4 Zic family member 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2009
|url=http://www.ncbi.nlm.nih.gov/gene/84107
|accessdate=2024-06-20 }}</ref>
 
===Gene ID: 84223===
 
"Enables Hsp70 protein binding activity and calmodulin binding activity. Predicted to be involved in sperm axoneme assembly. Located in cytoplasm."<ref name=AllianceofGenomeResources84223>{{ cite web
|author=Alliance of Genome Resources
|title=IQCG IQ motif containing G [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/84223
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 84328===
 
"Predicted to enable beta-catenin binding activity. Predicted to be involved in response to ionizing radiation."<ref name=AllianceofGenomeResources84328>{{ cite web
|author=Alliance of Genome Resources
|title=LZIC leucine zipper and CTNNBIP1 domain containing [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/84328
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 84419===
 
"This gene was first identified in a study of human esophageal squamous cell carcinoma tissues. Levels of both the message and protein are reduced in carcinoma samples. In adult human tissues, this gene is expressed in the the esophagus, stomach, small intestine, colon and placenta. Alternatively spliced transcript variants that encode the same protein have been identified."<ref name=RefSeq84419>{{ cite web
|author=RefSeq
|title=C15orf48 chromosome 15 open reading frame 48 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=June 2012
|url=http://www.ncbi.nlm.nih.gov/gene/84419
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 84790===
 
"Predicted to enable GTP binding activity. Predicted to be a structural constituent of cytoskeleton. Predicted to be involved in microtubule cytoskeleton organization and mitotic cell cycle. Located in microtubule cytoskeleton and nucleus."<ref name=AllianceofGenomeResources84790>{{ cite web
|author=Alliance of Genome Resources
|title=TUBA1C tubulin alpha 1c [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/84790
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 84889===
 
"This gene encodes a member of the solute carrier family 7. The encoded protein is a sodium-independent cationic amino acid transporter. Alternate splicing results in multiple transcripts that encoded the same protein."<ref name=RefSeq84889>{{ cite web
|author=RefSeq
|title=SLC7A3 solute carrier family 7 member 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=May 2010
|url=http://www.ncbi.nlm.nih.gov/gene/84889
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 85235===
 
"Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the histone microcluster on chromosome 6p21.33."<ref name=RefSeq85235>{{ cite web
|author=RefSeq
|title=H2AC12 H2A clustered histone 12 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2015
|url=http://www.ncbi.nlm.nih.gov/gene/85235
|accessdate=2024-06-23 }}</ref>
 
===Gene ID: 92736===
 
"Predicted to enable proton channel activity. Predicted to be involved in proton transmembrane transport. Predicted to be located in plasma membrane. Predicted to be integral component of membrane."<ref name=RefSeq92736>{{ cite web
|author=RefSeq
|title=OTOP2 otopetrin 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/92736
|accessdate=2024-06-22 }}</ref>
 
===Gene ID: 114899===
 
"Enables identical protein binding activity. Involved in several processes, including cellular triglyceride homeostasis; negative regulation of NIK/NF-kappaB signaling; and regulation of cytokine production. Acts upstream of or within negative regulation of gluconeogenesis. Located in extracellular exosome and membrane."<ref name=AllianceofGenomeResources114899>{{ cite web
|author=Alliance of Genome Resources
|title=C1QTNF3 C1q and TNF related 3 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/114899
|accessdate=2024-06-22 }}</ref>
 
===Gene ID: 115265===
 
"Predicted to be involved in negative regulation of signal transduction. Predicted to be located in cytoplasm."<ref name=AllianceofGenomeResources115265>{{ cite web
|author=Alliance of Genome Resources
|title=DDIT4L DNA damage inducible transcript 4 like [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/115265
|accessdate=2024-06-22 }}</ref>
 
===Gene ID: 116842===
 
"This gene encodes a cysteine-rich cationic antimicrobial peptide that is expressed predominantly in the liver. The mature peptide has activity against gram-positive bacteria and yeasts."<ref name=RefSeq116842>{{ cite web
|author=RefSeq
|title=LEAP2 liver enriched antimicrobial peptide 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=September 2014
|url=http://www.ncbi.nlm.nih.gov/gene/116842
|accessdate=2024-06-22 }}</ref>
 
===Gene ID: 117156===
 
"The protein encoded by this gene is a secreted lung surfactant protein and a downstream target of thyroid transcription factor. A single nucleotide polymorphism in the promoter of this gene results in susceptibility to asthma."<ref name=RefSeq117156>{{ cite web
|author=RefSeq
|title=Scgb3a2 secretoglobin, family 3A, member 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=March 2010
|url=http://www.ncbi.nlm.nih.gov/gene/117156
|accessdate=2024-06-21 }}</ref>
 
===Gene ID: 117158===
 
"Located in extracellular region. Is expressed in several structures, including mammary gland and respiratory system. Human ortholog(s) of this gene implicated in allergic rhinitis and asthma. Orthologous to human SCGB3A2 (secretoglobin family 3A member 2)."<ref name=AllianceofGenomeResources117158>{{ cite web
|author=Alliance of Genome Resources
|title=Scgb3a2 secretoglobin, family 3A, member 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/117158
|accessdate=2024-06-21 }}</ref>
 
===Gene ID: 126364===
 
"Predicted to be located in cytoplasm. Predicted to be integral component of membrane."<ref name=RefSeq126364>{{ cite web
|author=RefSeq
|title=LRRC25 leucine rich repeat containing 25 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/126364
|accessdate=2024-06-21 }}</ref>
 
===Gene ID: 126393===
 
"This locus encodes a heat shock protein. The encoded protein likely plays a role in smooth muscle relaxation."<ref name=RefSeq126393>{{ cite web
|author=RefSeq
|title=HSPB6 heat shock protein family B (small) member 6 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=January 2012
|url=http://www.ncbi.nlm.nih.gov/gene/126393
|accessdate=2024-06-21 }}</ref>
 
===Gene ID: 130120===
 
"This gene encodes a member of the regenerating islet-derived genes (REG)3 protein family. These proteins are secreted, C-type lectins with a carbohydrate recognition domain and N-terminal signal peptide. The protein encoded by this gene is an antimicrobial lectin with activity against Gram-positive bacteria. Alternative splicing results in multiple transcript variants encoding multiple isoforms."<ref name=RefSeq130120>{{ cite web
|author=RefSeq
|title=REG3G regenerating family member 3 gamma [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=November 2014
|url=http://www.ncbi.nlm.nih.gov/gene/130120
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 131377===
 
"This gene encodes a protein containing a BACK domain, a BTB/POZ domain, and 5 Kelch repeats, however, its exact function is not known. The gene and the multi-domain protein structure are conserved across different taxa, including primates, rodents, chicken and zebrafish."<ref name=RefSeq131377>{{ cite web
|author=RefSeq
|title=KLHL40 kelch like family member 40 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2012
|url=http://www.ncbi.nlm.nih.gov/gene/131377
|accessdate=2024-06-21 }}</ref>
 
===Gene ID: 145957===
 
"The neuregulins, including NRG4, activate type-1 growth factor receptors (see EGFR; MIM 131550) to initiating cell-to-cell signaling through tyrosine phosphorylation (Harari et al., 1999 [PubMed 10348342])."<ref name=OMIM145957>{{ cite web
|author=OMIM
|title=NRG4 neuregulin 4 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=March 2008
|url=http://www.ncbi.nlm.nih.gov/gene/145957
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 147183===
 
"This gene encodes a member of the type I (acidic) keratin family, which belongs to the superfamily of intermediate filament (IF) proteins. Keratins are heteropolymeric structural proteins which form the intermediate filament. These filaments, along with actin microfilaments and microtubules, compose the cytoskeleton of epithelial cells. The type I keratin genes are clustered in a region of chromosome 17q12-q21."<ref name=RefSeq147183>{{ cite web
|author=RefSeq
|title=KRT25 keratin 25 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2009
|url=http://www.ncbi.nlm.nih.gov/gene/147183
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 153020===
 
"Enables guanyl-nucleotide exchange factor activity. Predicted to be involved in regulation of catalytic activity and small GTPase mediated signal transduction. Predicted to be located in early endosome; late endosome; and midbody."<ref name=AllianceofGenomeResources153020>{{ cite web
|author=Alliance of Genome Resources
|title=RASGEF1B RasGEF domain family member 1B [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2020
|url=http://www.ncbi.nlm.nih.gov/gene/153020
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 200504===
 
"The secretory protein encoded by this gene is produced in gastric surface mucous cells, where it can bind trefoil factor family peptide 1 or gastrokine-1. This gene may be a tumor suppressor gene, as its expression is markedly decreased in gastric cancer tissues. The encoded protein interacts with gastrokine-1 and regulates homeostasis of the gastric mucosa."<ref name=RefSeq200504>{{ cite web
|author=RefSeq
|title=GKN2 gastrokine 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=December 2015
|url=http://www.ncbi.nlm.nih.gov/gene/200504
|accessdate=2024-06-18 }}</ref>
 
===Gene ID: 200539===
 
"This gene is a member of the muscle ankyrin repeat protein (MARP) family and encodes a protein with four tandem ankyrin-like repeats. The protein is localized to the nucleus, functioning as a transcriptional regulator. Expression of this protein is induced during recovery following starvation."<ref name=RefSeq200539>{{ cite web
|author=RefSeq
|title=ANKRD23 ankyrin repeat domain 23 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2008
|url=http://www.ncbi.nlm.nih.gov/gene/200539
|accessdate=2024-06-18 }}</ref>


===Gene ID: 342574===
===Gene ID: 342574===
"This gene encodes a member of the type I (acidic) keratin family, which belongs to the superfamily of intermediate filament (IF) proteins. Keratins are heteropolymeric structural proteins which form the intermediate filament. These filaments, along with actin microfilaments and microtubules, compose the cytoskeleton of epithelial cells. The type I keratin genes are clustered in a region of chromosome 17q12-q21."<ref name=RefSeq342574>{{ cite web
|author=RefSeq
|title=KRT27 keratin 27 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=July 2009
|url=http://www.ncbi.nlm.nih.gov/gene/342574
|accessdate=2024-06-18 }}</ref>
===Gene ID: 389125===
"Predicted to be involved in several processes, including chondrocyte differentiation; chondrocyte proliferation; and embryonic limb morphogenesis. Predicted to act upstream of or within positive regulation of chondrocyte differentiation; positive regulation of chondrocyte proliferation; and positive regulation of macromolecule metabolic process. Predicted to be located in nucleoplasm."<ref name=AllianceofGenomeResources389125>{{ cite web
|author=Alliance of Genome Resources
|title=MUSTN1 musculoskeletal, embryonic nuclear protein 1 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=April 2022
|url=http://www.ncbi.nlm.nih.gov/gene/389125
|accessdate=2024-06-21 }}</ref>
===Gene ID: 729230===
"The protein encoded by this gene is a receptor for monocyte chemoattractant protein-1, a chemokine which specifically mediates monocyte chemotaxis. Monocyte chemoattractant protein-1 is involved in monocyte infiltration in inflammatory diseases such as rheumatoid arthritis as well as in the inflammatory response against tumors. The encoded protein mediates agonist-dependent calcium mobilization and inhibition of adenylyl cyclase. This protein can also be a coreceptor with CD4 for HIV-1 infection. This gene is located in the chemokine receptor gene cluster region of chromosome 3."<ref name=RefSeq729230>{{ cite web
|author=RefSeq
|title=CCR2 C-C motif chemokine receptor 2 [ Homo sapiens ]
|publisher=ncbi.nlm.nih
|location=Bethsda, Maryland, USA
|date=August 2017
|url=http://www.ncbi.nlm.nih.gov/gene/729230
|accessdate=2024-06-18 }}</ref> This Gene ID replaces Gene ID: 1231.
==Families of TATA box genes==
# Actin family: Gene IDs: 58, 60, 70.
# Actin/cytoskeleton/contractile family: Gene IDs: 60.
# Activity against gram-positive bacteria and yeasts: Gene IDs: 116842.
# Adenylate kinase: Gene IDs: 203.
# Albumin gene family: Gene IDs: 173.
# Aldolases: Gene IDs: 229.
# Alkaline phosphatase family: Gene IDs: 249.
# Alpha-amylase family: Gene IDs: 279.
# Amelogenin family: Gene IDs: 265.
# Amylases: Gene IDs: 280.
# Annexin family: Gene IDs: 302.
# AP-1 family of transcription factors: Gene IDs: .
# Apolipoprotein (apo-) A-II: Gene IDs: 336.
# Apolipoprotein of chylomicrons and low density lipoproteins (LDL): Gene IDs: 338.
# Aquaporin family: Gene IDs: 360.
# Arginase type I: Gene IDs: 383.
# Axoneme assembly: Gene IDs: 84223.
# Binding activity: Gene IDs: 84223, 83998, 83638.
# Binds copper, nickel, fatty acids and bilirubin: Gene IDs: 174.
# Binds phospholipids: Gene IDs: 301.
# cAMP-response element binding family: Gene IDs: .
# Cell cycle transcription factor family: Gene IDs: 79733.
# Cellular triglyceride homeostasis: Gene IDs: 114899.
# Chondrocyte differentiation; chondrocyte proliferation: Gene IDs: 389125.
# Class I fructose-biphosphate aldolase family: Gene IDs: 230.
# Component of triglyceride (TG)-rich lipoproteins (TRLs) including very low density lipoproteins (VLDL), high density lipoproteins (HDL) and chylomicrons: Gene IDs: 345.
# Copper amine oxidases: Gene IDs: 314.
# Enables identical protein binding activity: Gene IDs: 57823.
# Enable signaling receptor binding activity: Gene IDs: 64111.
# Enables guanyl-nucleotide exchange factor activity: Gene IDs: 153020.
# Epidermal growth factor family: Gene IDs: 374.
# G-protein-coupled receptors (GPCR) large family: Gene IDs: 81285.
# GTP binding activity: Gene IDs: 84790.
# Heat shock: Gene IDs: 126393.
# Histone H1 family: Gene IDs: 3010.
# Histone H2A family: Gene IDs: 85235.
# 'Housekeeping' family: Gene IDs: 229.
# Identical protein binding activity: Gene IDs: 114899.
# Inhibit inflammatory cytokines to disrupt inflammatory cascades: Gene IDs: 2.
# Integral component of membrane: Gene IDs: 126364.
# Involved in adaptive immune response: Gene IDs: 57823.
# Kelch-like family: Gene IDs: 131377.
# Maintaining blood pressure, body fluid and electrolyte homeostasis: Gene IDs: 183.
# Mammalian activation transcription factor/cAMP responsive element-binding (CREB) family: Gene IDs: 467.
# Mammary gland and respiratory system: Gene IDs: 117158.
# Membranes: Gene IDs: 126364.
# MIP/aquaporin family: Gene IDs: 359.
# Mitochondrial solute carrier subfamily: Gene IDs: 292.
# Muscle ankyrin repeat protein (MARP) family: Gene IDs: 200539.
# Myristoylated alanine-rich C-kinase substrate (MARCKS) family: Gene IDs: 65108.
# Na+/K+ and H+/K+ ATPases beta chain family: Gene IDs: 481, 482.
# Negative regulation of gluconeogenesis: Gene IDs: 114899.
# Negative regulation of gonadotropin secretion: Gene IDs: 64111.
# Negative regulation of NIK/NF-kappaB signaling: Gene IDs: 114899.
# Negative regulation of signal transduction: Gene IDs: 115265.
# Neuregulins: Gene IDs: 145957.
# Polyamine biosynthesis: Gene IDs: 262.
# Protease inhibitor and cytokine transporter: Gene IDs: 2.
# Proton channel activity: Gene IDs: 92736.
# Proton transmembrane transport: Gene IDs: 92736.
# Receptor for monocyte chemoattractant protein-1: Gene IDs: 729230.
# Regenerating islet-derived genes (REG)3 family: Gene IDs:130120.
# Regulate host inflammation and immunity: Gene IDs: 84419.
# Regulation of cytokine production: Gene IDs: 114899.
# Regulation of transcription: Gene IDs: 81606.
# Regulator of aldosterone secretion: Gene IDs: 185.
# Secretoglobin family: Gene IDs: 117158.
# Secretory: Gene IDs: 117158, 200504.
# Small integral membrane protein with six bilayer spanning domains that functions as a water channel: Gene IDs: 358.
# Solute carrier family 7: Gene IDs: 84889.
# Stem cell homeostasis: Gene IDs: 80177.
# Structural constituent of cytoskeleton: Gene IDs: 84790.
# DNA-binding family tax-responsive enhancer element in the left-to-right (LTR) of the Human T-lymphotropic virus 1 (HTLV-1) transcription factor also characterized as the cAMP-response element binding protein 2 (CREB-2): Gene IDs: 468.
# Type I (acidic) keratin family: Gene IDs: 147183, 342574.
# Vasodilation, regulation of hormone secretion, promotion of angiogenesis, and antimicrobial activity: Gene IDs: 133.
# Water channel protein aquaporin 3: Gene IDs: 360.
# ZIC family of C2H2-type zinc fingers: Gene IDs: 84107.
# Superfamily of ATP-binding cassette (ABC) transporters: Gene ID: 19.
# Superfamily of extracellular matrix proteins: amelogenin family: Gene ID: 265.
# Superfamily of intermediate filament (IF) proteins: keratin family.
# Superfamily of small GTPase protein: Gene IDs: 390.


==Acknowledgements==
==Acknowledgements==

Latest revision as of 17:08, 26 June 2024

The TATA box (also called Goldberg-Hogness box)[1] is a DNA sequence (cis-regulatory element) found in the promoter region of genes in archaea and eukaryotes;[2] approximately 24% of human genes contain a TATA box within the core promoter.[3]

Human genes

"TATA-containing genes are more often highly regulated, such as by biotic or stress stimuli."[4] Only "∼10% of these TATA-containing promoters have the canonical TATA box (TATAWAWR)."[4]

"SRF-regulated genes of the actin/cytoskeleton/contractile family tend to have a TATA box."[5]

Different "TATA box sequences have different abilities to convey the activating signals of certain enhancers and activators in mammalian cells [...] and in yeast [...]."[5]

"SRF is a well established master regulator of the specific family of genes encoding the actin cytoskeleton and contractile apparatus [...], and we found that ~40% of the core promoters for these genes contain a TATA box, which is a significant enrichment compared to the low overall frequency of TATA-containing promoters in human and mouse genomes (...)."[5] "Global frequencies of core promoter types for human [9010 orthologous mouse-human promoter pairs with 1848 TATA-containing or 7162 TATA-less][6] genes with experimentally validated transcription start sites [are known from 2006]."[5] "The TATA box [...] has a consensus sequence of TATAWAAR [...]."[5] W = A or T and R = A or G. We "estimate that ~17% of promoters contain a TATA box".[6]

Gene ID: 2

"The protein encoded by this gene is a protease inhibitor and cytokine transporter. It uses a bait-and-trap mechanism to inhibit a broad spectrum of proteases, including trypsin, thrombin and collagenase. It can also inhibit inflammatory cytokines, and it thus disrupts inflammatory cascades. Mutations in this gene are a cause of alpha-2-macroglobulin deficiency. This gene is implicated in Alzheimer's disease (AD) due to its ability to mediate the clearance and degradation of A-beta, the major component of beta-amyloid deposits. A related pseudogene, which is also located on the p arm of chromosome 12, has been identified."[7]

Gene ID: 19

"The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. With cholesterol as its substrate, this protein functions as a cholesteral efflux pump in the cellular lipid removal pathway. Mutations in both alleles of this gene cause Tangier disease and familial high-density lipoprotein (HDL) deficiency."[8]

Gene ID: 58

"The product encoded by this gene belongs to the actin family of proteins, which are highly conserved proteins that play a role in cell motility, structure and integrity. Alpha, beta and gamma actin isoforms have been identified, with alpha actins being a major constituent of the contractile apparatus, while beta and gamma actins are involved in the regulation of cell motility. This actin is an alpha actin that is found in skeletal muscle. Mutations in this gene cause a variety of myopathies, including nemaline myopathy, congenital myopathy with excess of thin myofilaments, congenital myopathy with cores, and congenital myopathy with fiber-type disproportion, diseases that lead to muscle fiber defects with manifestations such as hypotonia."[9] It has a TATA box (TATAAAA) from -28 to -22 nts from the TSS.[6]

Gene ID: 60

"This gene encodes one of six different actin proteins. Actins are highly conserved proteins that are involved in cell motility, structure, integrity, and intercellular signaling. The encoded protein is a major constituent of the contractile apparatus and one of the two nonmuscle cytoskeletal actins that are ubiquitously expressed. Mutations in this gene cause Baraitser-Winter syndrome 1, which is characterized by intellectual disability with a distinctive facial appearance in human patients. Numerous pseudogenes of this gene have been identified throughout the human genome."[10]

Gene ID: 70

"Actins are highly conserved proteins that are involved in various types of cell motility. Polymerization of globular actin (G-actin) leads to a structural filament (F-actin) in the form of a two-stranded helix. Each actin can bind to four others. The protein encoded by this gene belongs to the actin family which is comprised of three main groups of actin isoforms, alpha, beta, and gamma. The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus. Defects in this gene have been associated with idiopathic dilated cardiomyopathy (IDC) and familial hypertrophic cardiomyopathy (FHC)."[11]

Gene ID: 133

"The protein encoded by this gene is a preprohormone which is cleaved to form two biologically active peptides, adrenomedullin and proadrenomedullin N-terminal 20 peptide. Adrenomedullin is a 52 aa peptide with several functions, including vasodilation, regulation of hormone secretion, promotion of angiogenesis, and antimicrobial activity. The antimicrobial activity is antibacterial, as the peptide has been shown to kill E. coli and S. aureus at low concentration."[12]

Gene ID: 173

"This gene is a member of the albumin gene family, which is comprised of four genes that localize to chromosome 4 in a tandem arrangement. These four genes encode structurally-related serum transport proteins that are known to be evolutionarily related. The protein encoded by this gene is regulated developmentally, expressed in the liver and secreted into the bloodstream."[13]

Gene ID: 174

"This gene encodes alpha-fetoprotein, a major plasma protein produced by the yolk sac and the liver during fetal life. Alpha-fetoprotein expression in adults is often associated with hepatocarcinoma and with teratoma, and has prognostic value for managing advanced gastric cancer. However, hereditary persistance of alpha-fetoprotein may also be found in individuals with no obvious pathology. The protein is thought to be the fetal counterpart of serum albumin, and the alpha-fetoprotein and albumin genes are present in tandem in the same transcriptional orientation on chromosome 4. Alpha-fetoprotein is found in monomeric as well as dimeric and trimeric forms, and binds copper, nickel, fatty acids and bilirubin. The level of alpha-fetoprotein in amniotic fluid is used to measure renal loss of protein to screen for spina bifida and anencephaly."[14]

Gene ID: 183

"The protein encoded by this gene, pre-angiotensinogen or angiotensinogen precursor, is expressed in the liver and is cleaved by the enzyme renin in response to lowered blood pressure. The resulting product, angiotensin I, is then cleaved by angiotensin converting enzyme (ACE) to generate the physiologically active enzyme angiotensin II. The protein is involved in maintaining blood pressure, body fluid and electrolyte homeostasis, and in the pathogenesis of essential hypertension and preeclampsia. Mutations in this gene are associated with susceptibility to essential hypertension, and can cause renal tubular dysgenesis, a severe disorder of renal tubular development. Defects in this gene have also been associated with non-familial structural atrial fibrillation, and inflammatory bowel disease."[15] It has a TATA box (TATAAAT) from -32 to -25 nts from the TSS.[6]

Gene ID: 185

"Angiotensin II is a potent vasopressor hormone and a primary regulator of aldosterone secretion. It is an important effector controlling blood pressure and volume in the cardiovascular system. It acts through at least two types of receptors. This gene encodes the type 1 receptor which is thought to mediate the major cardiovascular effects of angiotensin II. This gene may play a role in the generation of reperfusion arrhythmias following restoration of blood flow to ischemic or infarcted myocardium. It was previously thought that a related gene, denoted as AGTR1B, existed; however, it is now believed that there is only one type 1 receptor gene in humans. Alternative splicing of this gene results in multiple transcript variants."[16]

Gene ID: 203

"This gene encodes an adenylate kinase enzyme involved in energy metabolism and homeostasis of cellular adenine nucleotide ratios in different intracellular compartments. This gene is highly expressed in skeletal muscle, brain and erythrocytes. Certain mutations in this gene resulting in a functionally inadequate enzyme are associated with a rare genetic disorder causing nonspherocytic hemolytic anemia. Alternative splicing of this gene results in multiple transcript variants encoding different isoforms. This gene shares readthrough transcripts with the upstream ST6GALNAC6 gene."[17]

Gene ID: 229

"Fructose-1,6-bisphosphate aldolase (EC 4.1.2.13) is a tetrameric glycolytic enzyme that catalyzes the reversible conversion of fructose-1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. Vertebrates have 3 aldolase isozymes which are distinguished by their electrophoretic and catalytic properties. Differences indicate that aldolases A, B, and C are distinct proteins, the products of a family of related 'housekeeping' genes exhibiting developmentally regulated expression of the different isozymes. The developing embryo produces aldolase A, which is produced in even greater amounts in adult muscle where it can be as much as 5% of total cellular protein. In adult liver, kidney and intestine, aldolase A expression is repressed and aldolase B is produced. In brain and other nervous tissue, aldolase A and C are expressed about equally. There is a high degree of homology between aldolase A and C. Defects in ALDOB cause hereditary fructose intolerance."[18]

Gene ID: 230

"This gene encodes a member of the class I fructose-biphosphate aldolase gene family. Expressed specifically in the hippocampus and Purkinje cells of the brain, the encoded protein is a glycolytic enzyme that catalyzes the reversible aldol cleavage of fructose-1,6-biphosphate and fructose 1-phosphate to dihydroxyacetone phosphate and either glyceraldehyde-3-phosphate or glyceraldehyde, respectively."[19]

Gene ID: 249

"This gene encodes a member of the alkaline phosphatase family of proteins. There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2, while the tissue non-specific form is located on chromosome 1. The product of this gene is a membrane bound glycosylated enzyme that is not expressed in any particular tissue and is, therefore, referred to as the tissue-nonspecific form of the enzyme. Alternative splicing results in multiple transcript variants, at least one of which encodes a preproprotein that is proteolytically processed to generate the mature enzyme. This enzyme may play a role in bone mineralization. Mutations in this gene have been linked to hypophosphatasia, a disorder that is characterized by hypercalcemia and skeletal defects."[20] It has a TATA box (TATAAGG) from -31 to -25 nts from the TSS.[6]

Gene ID: 262

"This gene encodes an important intermediate enzyme in polyamine biosynthesis. The polyamines spermine, spermidine, and putrescine are low-molecular-weight aliphatic amines essential for cellular proliferation and tumor promotion. Multiple alternatively spliced transcript variants have been identified. Pseudogenes of this gene are found on chromosomes 5, 6, 10, X and Y."[21]

Gene ID: 265

"This gene encodes a member of the amelogenin family of extracellular matrix proteins. Amelogenins are involved in biomineralization during tooth enamel development. Mutations in this gene cause X-linked amelogenesis imperfecta. Alternative splicing results in multiple transcript variants encoding different isoforms."[22]

Gene ID: 279

"This gene encodes a member of the alpha-amylase family of proteins. Amylases are secreted proteins that hydrolyze 1,4-alpha-glucoside bonds in oligosaccharides and polysaccharides, catalyzing the first step in digestion of dietary starch and glycogen. This gene and several family members are present in a gene cluster on chromosome 1. This gene encodes an amylase isoenzyme produced by the pancreas."[23] It has a TATA box (TATAAAT) from -27 to -21 nts from the TSS.[6]

Gene ID: 280

"Amylases are secreted proteins that hydrolyze 1,4-alpha-glucoside bonds in oligosaccharides and polysaccharides, and thus catalyze the first step in digestion of dietary starch and glycogen. The human genome has a cluster of several amylase genes that are expressed at high levels in either salivary gland or pancreas. This gene encodes an amylase isoenzyme produced by the pancreas."[24] It has a TATA box (TATAAAT) from -30 to -24 nts from the TSS.[6]

Gene ID: 292

"This gene is a member of the mitochondrial carrier subfamily of solute carrier protein genes. The product of this gene functions as a gated pore that translocates ADP from the cytoplasm into the mitochondrial matrix and ATP from the mitochondrial matrix into the cytoplasm. The protein forms a homodimer embedded in the inner mitochondria membrane. Suppressed expression of this gene has been shown to induce apoptosis and inhibit tumor growth. The human genome contains several non-transcribed pseudogenes of this gene."[25]

Gene ID: 301

"This gene encodes a membrane-localized protein that binds phospholipids. This protein inhibits phospholipase A2 and has anti-inflammatory activity. Loss of function or expression of this gene has been detected in multiple tumors."[26]

Gene ID: 302

"This gene encodes a member of the annexin family. Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways. This protein functions as an autocrine factor which heightens osteoclast formation and bone resorption. This gene has three pseudogenes located on chromosomes 4, 9 and 10, respectively. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene. Annexin A2 expression has been found to correlate with resistance to treatment against various cancer forms."[27]

Gene ID: 314

"Copper amine oxidases catalyze the oxidative conversion of amines to aldehydes and ammonia in the presence of copper and quinone cofactor. This gene shows high sequence similarity to copper amine oxidases from various species ranging from bacteria to mammals. The protein contains several conserved motifs including the active site of amine oxidases and the histidine residues that likely bind copper. It may be a critical modulator of signal transmission in retina, possibly by degrading the biogenic amines dopamine, histamine, and putrescine. This gene may be a candidate gene for hereditary ocular diseases. Alternate splicing results in multiple transcript variants."[28]

Gene ID: 336

"This gene encodes apolipoprotein (apo-) A-II, which is the second most abundant protein of the high density lipoprotein particles. The protein is found in plasma as a monomer, homodimer, or heterodimer with apolipoprotein D. Defects in this gene may result in apolipoprotein A-II deficiency or hypercholesterolemia."[29] It has a TATA box (TATATAG) from -28 to -22 nts from the TSS.[6]

Gene ID: 338

"This gene product is the main apolipoprotein of chylomicrons and low density lipoproteins (LDL), and is the ligand for the LDL receptor. It occurs in plasma as two main isoforms, apoB-48 and apoB-100: the former is synthesized exclusively in the gut and the latter in the liver. The intestinal and the hepatic forms of apoB are encoded by a single gene from a single, very long mRNA. The two isoforms share a common N-terminal sequence. The shorter apoB-48 protein is produced after RNA editing of the apoB-100 transcript at residue 2180 (CAA->UAA), resulting in the creation of a stop codon, and early translation termination. Mutations in this gene or its regulatory region cause hypobetalipoproteinemia, normotriglyceridemic hypobetalipoproteinemia, and hypercholesterolemia due to ligand-defective apoB, diseases affecting plasma cholesterol and apoB levels."[30]

Gene ID: 345

"This gene encodes a protein component of triglyceride (TG)-rich lipoproteins (TRLs) including very low density lipoproteins (VLDL), high density lipoproteins (HDL) and chylomicrons. The encoded protein plays a role in role in the metabolism of these TRLs through multiple modes. This protein has been shown to promote the secretion of VLDL1, inhibit lipoprotein lipase enzyme activity, and delay catabolism of TRL remnants. Mutations in this gene are associated with low plasma triglyceride levels and reduced risk of ischemic cardiovascular disease, and hyperalphalipoproteinemia, which is characterized by elevated levels of high density lipoprotein (HDL) and HDL cholesterol in human patients. This gene and other related genes comprise an apolipoprotein gene cluster on chromosome 11."[31]

Gene ID: 358

"This gene encodes a small integral membrane protein with six bilayer spanning domains that functions as a water channel protein. This protein permits passive transport of water along an osmotic gradient. This gene is a possible candidate for disorders involving imbalance in ocular fluid movement."[32]

Gene ID: 359

"This gene encodes a water channel protein located in the kidney collecting tubule. It belongs to the MIP/aquaporin family, some members of which are clustered together on chromosome 12q13. Mutations in this gene have been linked to autosomal dominant and recessive forms of nephrogenic diabetes insipidus."[33]

Gene ID: 360

"This gene encodes the water channel protein aquaporin 3. Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein, also known as aquaporin 0. Aquaporin 3 is localized at the basal lateral membranes of collecting duct cells in the kidney. In addition to its water channel function, aquaporin 3 has been found to facilitate the transport of nonionic small solutes such as urea and glycerol, but to a smaller degree. It has been suggested that water channels can be functionally heterogeneous and possess water and solute permeation mechanisms. Alternative splicing of this gene results in multiple transcript variants encoding different isoforms."[34]

Gene ID: 374

"The protein encoded by this gene is a member of the epidermal growth factor family. It is an autocrine growth factor as well as a mitogen for astrocytes, Schwann cells and fibroblasts. It is related to epidermal growth factor (EGF) and transforming growth factor alpha (TGF-alpha). The protein interacts with the EGF/TGF-alpha receptor to promote the growth of normal epithelial cells, and it inhibits the growth of certain aggressive carcinoma cell lines. It also functions in mammary gland, oocyte and bone tissue development. This gene is associated with a psoriasis-like skin phenotype, and is also associated with other pathological disorders, including various types of cancers and inflammatory conditions."[35]

Gene ID: 383

"Arginase catalyzes the hydrolysis of arginine to ornithine and urea. At least two isoforms of mammalian arginase exist (types I and II) which differ in their tissue distribution, subcellular localization, immunologic crossreactivity and physiologic function. The type I isoform encoded by this gene, is a cytosolic enzyme and expressed predominantly in the liver as a component of the urea cycle. Inherited deficiency of this enzyme results in argininemia, an autosomal recessive disorder characterized by hyperammonemia. Two transcript variants encoding different isoforms have been found for this gene."[36]

Gene ID: 390

"This gene encodes a protein which is a member of the small GTPase protein superfamily. The encoded protein binds only GTP but has no GTPase activity, and appears to act as a negative regulator of cytoskeletal organization leading to loss of adhesion. Multiple alternatively spliced variants, encoding the same protein, have been identified."[37]

Gene ID: 467

"This gene encodes a member of the mammalian activation transcription factor/cAMP responsive element-binding (CREB) protein family of transcription factors. This gene is induced by a variety of signals, including many of those encountered by cancer cells, and is involved in the complex process of cellular stress response. Multiple transcript variants encoding different isoforms have been found for this gene. It is possible that alternative splicing of this gene may be physiologically important in the regulation of target genes."[38] It has a TATA box (TATAAAA) from -33 to -27 nts from the TSS.[6]

Gene ID: 468

"This gene encodes a transcription factor that was originally identified as a widely expressed mammalian DNA binding protein that could bind a tax-responsive enhancer element in the LTR of HTLV-1. The encoded protein was also isolated and characterized as the cAMP-response element binding protein 2 (CREB-2). The protein encoded by this gene belongs to a family of DNA-binding proteins that includes the AP-1 family of transcription factors, cAMP-response element binding proteins (CREBs) and CREB-like proteins. These transcription factors share a leucine zipper region that is involved in protein-protein interactions, located C-terminal to a stretch of basic amino acids that functions as a DNA binding domain. Two alternative transcripts encoding the same protein have been described. Two pseudogenes are located on the X chromosome at q28 in a region containing a large inverted duplication."[39]

Gene ID: 481

"The protein encoded by this gene belongs to the family of Na+/K+ and H+/K+ ATPases beta chain proteins, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The beta subunit regulates, through assembly of alpha/beta heterodimers, the number of sodium pumps transported to the plasma membrane. The glycoprotein subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes a beta 1 subunit. Alternatively spliced transcript variants encoding different isoforms have been described, but their biological validity is not known."[40] It has a TATA box (TATATAG) from -28 to -22 nts from the TSS.[6]

Gene ID: 482

"The protein encoded by this gene belongs to the family of Na+/K+ and H+/K+ ATPases beta chain proteins, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The beta subunit regulates, through assembly of alpha/beta heterodimers, the number of sodium pumps transported to the plasma membrane. The glycoprotein subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes a beta 2 subunit. Two transcript variants encoding different isoforms have been found for this gene."[41]

Gene ID: 496

"The protein encoded by this gene belongs to a family of P-type cation-transporting ATPases. The gastric H+, K+-ATPase is a heterodimer consisting of a high molecular weight catalytic alpha subunit and a smaller but heavily glycosylated beta subunit. This enzyme is a proton pump that catalyzes the hydrolysis of ATP coupled with the exchange of H(+) and K(+) ions across the plasma membrane. It is also responsible for gastric acid secretion. This gene encodes the beta subunit of the gastric H+, K+-ATPase."[42]

Gene ID: 515

"This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the b subunit of the proton channel."[43] ATO5PB aka ATP5F1.[43] It has a TATA box (TTTAAAA) from -34 to -28 nts from the TSS.[6]

Gene ID: 604

"The protein encoded by this gene is a zinc finger transcription factor and contains an N-terminal POZ domain. This protein acts as a sequence-specific repressor of transcription, and has been shown to modulate the transcription of STAT-dependent IL-4 responses of B cells. This protein can interact with a variety of POZ-containing proteins that function as transcription corepressors. This gene is found to be frequently translocated and hypermutated in diffuse large-cell lymphoma (DLCL), and may be involved in the pathogenesis of DLCL. Alternatively spliced transcript variants encoding different protein isoforms have been found for this gene."[44]

Gene ID: 676

"BRDT is similar to the RING3 protein family. It possesses 2 bromodomain motifs and a PEST sequence (a cluster of proline, glutamic acid, serine, and threonine residues), characteristic of proteins that undergo rapid intracellular degradation. The bromodomain is found in proteins that regulate transcription. Several transcript variants encoding multiple isoforms have been found for this gene."[45] It has a TATA box (TATAAAA) from -31 to -25 nts from the TSS.[6]

Gene ID: 677

"This gene is a member of the TIS11 family of early response genes, which are induced by various agonists such as the phorbol ester TPA and the polypeptide mitogen EGF. This gene is well conserved across species and has a promoter that contains motifs seen in other early-response genes. The encoded protein contains a distinguishing putative zinc finger domain with a repeating cys-his motif. This putative nuclear transcription factor most likely functions in regulating the response to growth factors. Alternatively spliced transcript variants encoding different isoforms have been found for this gene."[46]

Gene ID: 694

"This gene is a member of an anti-proliferative gene family that regulates cell growth and differentiation. Expression of this gene is highest in the G0/G1 phases of the cell cycle and downregulated when cells progressed through G1. The encoded protein interacts with several nuclear receptors, and functions as a coactivator of cell differentiation. This locus has been shown to be involved in a t(8;12)(q24;q22) chromosomal translocation in a case of B-cell chronic lymphocytic leukemia."[47]

Gene ID: 759

"Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva and gastric acid. They show extensive diversity in tissue distribution and in their subcellular localization. This CA1 gene is closely linked to the CA2 and CA3 genes on chromosome 8. It encodes a cytosolic protein that is found at the highest level in erythrocytes. Allelic variants of this gene have been described in some populations. Alternative splicing and the use of alternative promoters results in multiple transcript variants."[48]

Gene ID: 760

"The protein encoded by this gene is one of several isozymes of carbonic anhydrase, which catalyzes reversible hydration of carbon dioxide. Defects in this enzyme are associated with osteopetrosis and renal tubular acidosis. Two transcript variants encoding different isoforms have been found for this gene."[49]

Gene ID: 762

"Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid. They show extensive diversity in tissue distribution and in their subcellular localization. This gene encodes a glycosylphosphatidyl-inositol-anchored membrane isozyme expressed on the luminal surfaces of pulmonary (and certain other) capillaries and proximal renal tubules. Its exact function is not known; however, it may have a role in inherited renal abnormalities of bicarbonate transport."[50]

Gene ID: 767

"The protein encoded by this gene was initially named CA-related protein because of sequence similarity to other known carbonic anhydrase genes. However, the gene product lacks carbonic anhydrase activity (i.e., the reversible hydration of carbon dioxide). The gene product continues to carry a carbonic anhydrase designation based on clear sequence identity to other members of the carbonic anhydrase gene family. The absence of CA8 gene transcription in the cerebellum of the lurcher mutant in mice with a neurologic defect suggests an important role for this acatalytic form. Mutations in this gene are associated with cerebellar ataxia, mental retardation, and dysequilibrium syndrome 3 (CMARQ3). Polymorphisms in this gene are associated with osteoporosis, and overexpression of this gene in osteosarcoma cells suggests an oncogenic role. Alternative splicing results in multiple transcript variants."[51]

Gene ID: 794

"This gene encodes an intracellular calcium-binding protein belonging to the troponin C superfamily. Members of this protein family have six EF-hand domains which bind calcium. This protein plays a role in diverse cellular functions, including message targeting and intracellular calcium buffering. It also functions as a modulator of neuronal excitability, and is a diagnostic marker for some human diseases, including Hirschsprung disease and some cancers. Alternative splicing results in multiple transcript variants."[52]

Gene ID: 811

"Calreticulin is a highly conserved chaperone protein which resides primarily in the endoplasmic reticulum, and is involved in a variety of cellular processes, among them, cell adhesion. Additionally, it functions in protein folding quality control and calcium homeostasis. Calreticulin is also found in the nucleus, suggesting that it may have a role in transcription regulation. Systemic lupus erythematosus is associated with increased autoantibody titers against calreticulin. Recurrent mutations in calreticulin have been linked to various neoplasms, including the myeloproliferative type."[53]

Gene ID: 846

"The protein encoded by this gene is a plasma membrane G protein-coupled receptor that senses small changes in circulating calcium concentration. The encoded protein couples this information to intracellular signaling pathways that modify parathyroid hormone secretion or renal cation handling, and thus this protein plays an essential role in maintaining mineral ion homeostasis. Mutations in this gene are a cause of familial hypocalciuric hypercalcemia, neonatal severe hyperparathyroidism, and autosomal dominant hypocalcemia."[54]

Gene ID: 919

"The protein encoded by this gene is T-cell receptor zeta, which together with T-cell receptor alpha/beta and gamma/delta heterodimers, and with CD3-gamma, -delta and -epsilon, forms the T-cell receptor-CD3 complex. The zeta chain plays an important role in coupling antigen recognition to several intracellular signal-transduction pathways. Low expression of the antigen results in impaired immune response. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene."[55] It has a TATA box (AATAAAA) from -31 to -25 nts from the TSS.[6]

Gene ID: 1044

"This gene is a member of the caudal-related homeobox transcription factor gene family. The encoded DNA-binding protein regulates intestine-specific gene expression and enterocyte differentiation. It has been shown to induce expression of the intestinal alkaline phosphatase gene, and inhibit beta-catenin/T-cell factor transcriptional activity."[56]

Gene ID: 1051

"This intronless gene encodes a transcription factor that contains a basic leucine zipper (bZIP) domain. The encoded protein functions as a homodimer but can also form heterodimers with CCAAT/enhancer-binding proteins alpha, delta, and gamma. Activity of this protein is important in the regulation of genes involved in immune and inflammatory responses, among other processes. The use of alternative in-frame AUG start codons results in multiple protein isoforms, each with distinct biological functions."[57]

Gene ID: 1081

"The four human glycoprotein hormones chorionic gonadotropin (CG), luteinizing hormone (LH), follicle stimulating hormone (FSH), and thyroid stimulating hormone (TSH) are dimers consisting of alpha and beta subunits that are associated noncovalently. The alpha subunits of these hormones are identical, however, their beta chains are unique and confer biological specificity. The protein encoded by this gene is the alpha subunit and belongs to the glycoprotein hormones alpha chain family. Two transcript variants encoding different isoforms have been found for this gene."[58]

Gene ID: 1101

"Chondroadherin is a cartilage matrix protein thought to mediate adhesion of isolated chondrocytes. The protein contains 11 leucine-rich repeats flanked by cysteine-rich regions. The chondroadherin messenger RNA is present in chondrocytes at all ages."[59]

Gene ID: 1116

"Chitinases catalyze the hydrolysis of chitin, which is an abundant glycopolymer found in insect exoskeletons and fungal cell walls. The glycoside hydrolase 18 family of chitinases includes eight human family members. This gene encodes a glycoprotein member of the glycosyl hydrolase 18 family. The protein lacks chitinase activity and is secreted by activated macrophages, chondrocytes, neutrophils and synovial cells. The protein is thought to play a role in the process of inflammation and tissue remodeling."[60] It has a TATA box (CATAAAA) from -30 to -24 nts from the TSS and a TATA box (TATATAAA) from 16 to 23 nts from the TSS.[6]

Gene ID: 1152

"The protein encoded by this gene is a cytoplasmic enzyme involved in energy homeostasis. The encoded protein reversibly catalyzes the transfer of phosphate between ATP and various phosphogens such as creatine phosphate. It acts as a homodimer in brain as well as in other tissues, and as a heterodimer with a similar muscle isozyme in heart. The encoded protein is a member of the ATP:guanido phosphotransferase protein family. A pseudogene of this gene has been characterized."[61]

Gene ID: 1153

"Enables mRNA 3'-UTR binding activity and small ribosomal subunit rRNA binding activity. Involved in mRNA stabilization; positive regulation of translation; and response to UV. Located in cytoplasm and nucleoplasm."[62]

Gene ID: 1158

"The protein encoded by this gene is a cytoplasmic enzyme involved in energy homeostasis and is an important serum marker for myocardial infarction. The encoded protein reversibly catalyzes the transfer of phosphate between ATP and various phosphogens such as creatine phosphate. It acts as a homodimer in striated muscle as well as in other tissues, and as a heterodimer with a similar brain isozyme in heart. The encoded protein is a member of the ATP:guanido phosphotransferase protein family."[63]

Gene ID: 1160

"Mitochondrial creatine kinase (MtCK) is responsible for the transfer of high energy phosphate from mitochondria to the cytosolic carrier, creatine. It belongs to the creatine kinase isoenzyme family. It exists as two isoenzymes, sarcomeric MtCK and ubiquitous MtCK, encoded by separate genes. Mitochondrial creatine kinase occurs in two different oligomeric forms: dimers and octamers, in contrast to the exclusively dimeric cytosolic creatine kinase isoenzymes. Sarcomeric mitochondrial creatine kinase has 80% homology with the coding exons of ubiquitous mitochondrial creatine kinase. This gene contains sequences homologous to several motifs that are shared among some nuclear genes encoding mitochondrial proteins and thus may be essential for the coordinated activation of these genes during mitochondrial biogenesis. Three transcript variants encoding the same protein have been found for this gene."[64]

Gene ID: 1180

"The CLCN family of voltage-dependent chloride channel genes comprises nine members (CLCN1-7, Ka and Kb) which demonstrate quite diverse functional characteristics while sharing significant sequence homology. The protein encoded by this gene regulates the electric excitability of the skeletal muscle membrane. Mutations in this gene cause two forms of inherited human muscle disorders: recessive generalized myotonia congenita (Becker) and dominant myotonia (Thomsen). Alternative splicing results in multiple transcript variants."[65]

Gene ID: 1188

"The protein encoded by this gene is a member of the family of voltage-gated chloride channels. Chloride channels have several functions, including the regulation of cell volume, membrane potential stabilization, signal transduction and transepithelial transport. This gene is expressed predominantly in the kidney and may be important for renal salt reabsorption. Mutations in this gene are associated with autosomal recessive Bartter syndrome type 3 (BS3). Alternatively spliced transcript variants encoding different isoforms have been found for this gene."[66] It has a TATA box (CATAAAC) from -30 to -24 nts from the TSS.[6]

Gene ID: 1191

"The protein encoded by this gene is a secreted chaperone that can under some stress conditions also be found in the cell cytosol. It has been suggested to be involved in several basic biological events such as cell death, tumor progression, and neurodegenerative disorders. Alternate splicing results in both coding and non-coding variants."[67]

Gene ID: 1215

"This gene encodes a chymotryptic serine proteinase that belongs to the peptidase family S1. It is expressed in mast cells and is thought to function in the degradation of the extracellular matrix, the regulation of submucosal gland secretion, and the generation of vasoactive peptides. In the heart and blood vessels, this protein, rather than angiotensin converting enzyme, is largely responsible for converting angiotensin I to the vasoactive peptide angiotensin II. Alternative splicing results in multiple variants."[68]

Gene ID: 1277

"This gene encodes the pro-alpha1 chains of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIA, Ehlers-Danlos syndrome Classical type, Caffey Disease and idiopathic osteoporosis. Reciprocal translocations between chromosomes 17 and 22, where this gene and the gene for platelet-derived growth factor beta are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans, resulting from unregulated expression of the growth factor. Two transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene."[69]

Gene ID: 1278

"This gene encodes the pro-alpha2 chain of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIB, recessive Ehlers-Danlos syndrome Classical type, idiopathic osteoporosis, and atypical Marfan syndrome. Symptoms associated with mutations in this gene, however, tend to be less severe than mutations in the gene for the alpha1 chain of type I collagen (COL1A1) reflecting the different role of alpha2 chains in matrix integrity. Three transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene."[70]

Gene ID: 1356

"The protein encoded by this gene is a metalloprotein that binds most of the copper in plasma and is involved in the peroxidation of Fe(II)transferrin to Fe(III) transferrin. Mutations in this gene cause aceruloplasminemia, which results in iron accumulation and tissue damage, and is associated with diabetes and neurologic abnormalities. Two transcript variants, one protein-coding and the other not protein-coding, have been found for this gene."[71]

Gene ID: 1382

"This gene encodes a member of the retinoic acid (RA, a form of vitamin A) binding protein family and lipocalin/cytosolic fatty-acid binding protein family. The protein is a cytosol-to-nuclear shuttling protein, which facilitates RA binding to its cognate receptor complex and transfer to the nucleus. It is involved in the retinoid signaling pathway, and is associated with increased circulating low-density lipoprotein cholesterol. Alternatively spliced transcript variants encoding the same protein have been found for this gene."[72] It has a TATA box (TATAAAA) from -32 to -26 nts from the TSS.[6]

Gene ID: 1392

"This gene encodes a member of the corticotropin-releasing factor family. The encoded preproprotein is proteolytically processed to generate the mature neuropeptide hormone. In response to stress, this hormone is secreted by the paraventricular nucleus (PVN) of the hypothalamus, binds to corticotropin releasing hormone receptors and stimulates the release of adrenocorticotropic hormone from the pituitary gland. Marked reduction in this protein has been observed in association with Alzheimer's disease. Autosomal recessive hypothalamic corticotropin deficiency has multiple and potentially fatal metabolic consequences including hypoglycemia and hepatitis. In addition to production in the hypothalamus, this protein is also synthesized in peripheral tissues, such as T lymphocytes, and is highly expressed in the placenta. In the placenta it is a marker that determines the length of gestation and the timing of parturition and delivery. A rapid increase in circulating levels of the hormone occurs at the onset of parturition, suggesting that, in addition to its metabolic functions, this protein may act as a trigger for parturition."[73]

Gene ID: 1393

"Corticotropin-releasing hormone is a potent stimulator of synthesis and secretion of preopiomelanocortin-derived peptides. Although CRH concentrations in the human peripheral circulation are normally low, they increase throughout pregnancy and fall rapidly after parturition. Maternal plasma CRH probably originates from the placenta. Human plasma contains a CRH-binding protein which inactivates CRH and which may prevent inappropriate pituitary-adrenal stimulation in pregnancy."[74]

Gene ID: 1401

"The protein encoded by this gene belongs to the pentraxin family which also includes serum amyloid P component protein and pentraxin 3. Pentraxins are involved in complement activation and amplification via communication with complement initiation pattern recognition molecules, but also complement regulation via recruitment of complement regulators. The encoded protein has a calcium dependent ligand binding domain with a distinctive flattened beta-jellyroll structure. It exists in two forms as either a pentamer in circulation or as a nonsoluble monomer in tissues. It is involved in several host defense related functions based on its ability to recognize foreign pathogens and damaged cells of the host and to initiate their elimination by interacting with humoral and cellular effector systems in the blood. Consequently, the level of this protein in plasma increases greatly during acute phase response to tissue injury, infection, or other inflammatory stimuli. Elevated expression of the encoded protein is associated with severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) infection."[75]

Gene ID: 1410

"Mammalian lens crystallins are divided into alpha, beta, and gamma families. Alpha crystallins are composed of two gene products: alpha-A and alpha-B, for acidic and basic, respectively. Alpha crystallins can be induced by heat shock and are members of the small heat shock protein (HSP20) family. They act as molecular chaperones although they do not renature proteins and release them in the fashion of a true chaperone; instead they hold them in large soluble aggregates. These heterogeneous aggregates consist of 30-40 subunits; the alpha-A and alpha-B subunits have a 3:1 ratio, respectively. Two additional functions of alpha crystallins are an autokinase activity and participation in the intracellular architecture. The encoded protein has been identified as a moonlighting protein based on its ability to perform mechanistically distinct functions. Alpha-A and alpha-B gene products are differentially expressed; alpha-A is preferentially restricted to the lens and alpha-B is expressed widely in many tissues and organs. Elevated expression of alpha-B crystallin occurs in many neurological diseases; a missense mutation cosegregated in a family with a desmin-related myopathy. Alternative splicing results in multiple transcript variants."[76]

Gene ID: 1411

"Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Beta-crystallins, the most heterogeneous, differ by the presence of the C-terminal extension (present in the basic group, none in the acidic group). Beta-crystallins form aggregates of different sizes and are able to self-associate to form dimers or to form heterodimers with other beta-crystallins. This gene, a beta acidic group member, encodes two proteins (crystallin, beta A3 and crystallin, beta A1) from a single mRNA, the latter protein is 17 aa shorter than crystallin, beta A3 and is generated by use of an alternate translation initiation site. Deletion of exons 3 and 4 causes the autosomal dominant disease 'zonular cataract with sutural opacities'."[77]

Gene ID: 1427

"Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Gamma-crystallins are a homogeneous group of highly symmetrical, monomeric proteins typically lacking connecting peptides and terminal extensions. They are differentially regulated after early development. This gene encodes a protein initially considered to be a beta-crystallin but the encoded protein is monomeric and has greater sequence similarity to other gamma-crystallins. This gene encodes the most significant gamma-crystallin in adult eye lens tissue. Whether due to aging or mutations in specific genes, gamma-crystallins have been involved in cataract formation."[78]

Gene ID: 1440

"This gene encodes a member of the IL-6 superfamily of cytokines. The encoded cytokine controls the production, differentiation, and function of granulocytes. Granulocytes are a type of white blood cell that are part of the innate immune response. A modified form of this protein is commonly administered to manage chemotherapy-induced neutropenia. Alternatively spliced transcript variants have been described for this gene."[79]

Gene ID: 1490

"The protein encoded by this gene is a mitogen that is secreted by vascular endothelial cells. The encoded protein plays a role in chondrocyte proliferation and differentiation, cell adhesion in many cell types, and is related to platelet-derived growth factor. Certain polymorphisms in this gene have been linked with a higher incidence of systemic sclerosis."[80]

Gene ID: 1543

"This gene, CYP1A1, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme's endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. The gene has been associated with lung cancer risk. A related family member, CYP1A2, is located approximately 25 kb away from CYP1A1 on chromosome 15. Alternative splicing results in multiple transcript variants encoding distinct isoforms."[81]

Gene ID: 1544

"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The protein encoded by this gene localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme's endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. Other xenobiotic substrates for this enzyme include caffeine, aflatoxin B1, and acetaminophen. The transcript from this gene contains four Alu sequences flanked by direct repeats in the 3' untranslated region."[82]

Gene ID: 1548

"This gene, CYP2A6, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by phenobarbital. The enzyme is known to hydroxylate coumarin, and also metabolizes nicotine, aflatoxin B1, nitrosamines, and some pharmaceuticals. Individuals with certain allelic variants are said to have a poor metabolizer phenotype, meaning they do not efficiently metabolize coumarin or nicotine. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q. The gene was formerly referred to as CYP2A3; however, it has been renamed CYP2A6."[83]

Gene ID: 1553

"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. Although its endogenous substrate has not been determined, it is known to metabolize 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone, a major nitrosamine specific to tobacco. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q."[84]

Gene ID: 1571

"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is induced by ethanol, the diabetic state, and starvation. The enzyme metabolizes both endogenous substrates, such as ethanol, acetone, and acetal, as well as exogenous substrates including benzene, carbon tetrachloride, ethylene glycol, and nitrosamines which are premutagens found in cigarette smoke. Due to its many substrates, this enzyme may be involved in such varied processes as gluconeogenesis, hepatic cirrhosis, diabetes, and cancer."[85]

Gene ID: 1576

"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by glucocorticoids and some pharmacological agents. This enzyme is involved in the metabolism of approximately half the drugs in use today, including acetaminophen, codeine, cyclosporin A, diazepam, erythromycin, and chloroquine. The enzyme also metabolizes some steroids and carcinogens. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. Previously another CYP3A gene, CYP3A3, was thought to exist; however, it is now thought that this sequence represents a transcript variant of CYP3A4. Alternatively spliced transcript variants encoding different isoforms have been identified."[86]

Gene ID: 1581

"This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by glucocorticoids and some pharmacological agents. This enzyme is involved in the metabolism of approximately half the drugs in use today, including acetaminophen, codeine, cyclosporin A, diazepam, erythromycin, and chloroquine. The enzyme also metabolizes some steroids and carcinogens. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. Previously another CYP3A gene, CYP3A3, was thought to exist; however, it is now thought that this sequence represents a transcript variant of CYP3A4. Alternatively spliced transcript variants encoding different isoforms have been identified."[87]

Gene ID: 1602

"Nine elements were tested, representing a sampling of elements present in the two gene deserts and DACH introns, spread over a 1530-kb region surrounding the human DACH's TATA box."[88]

Gene ID: 1602 is the human gene DACH1 dachshund homolog 1 also known as DACH.[89] DACH1 has three isoforms: a, b, and c.

"This gene encodes a chromatin-associated protein that associates with other DNA-binding transcription factors to regulate gene expression and cell fate determination during development. The protein contains a Ski domain that is highly conserved from Drosophila to human. Expression of this gene is lost in some forms of metastatic cancer, and is correlated with poor prognosis."[89]

Gene ID: 1655

"This gene encodes a member of the DEAD box family of RNA helicases that are involved in a variety of cellular processes as a result of its role as an adaptor molecule, promoting interactions with a large number of other factors. This protein is involved in pathways that include the alteration of RNA structures, plays a role as a coregulator of transcription, a regulator of splicing, and in the processing of small noncoding RNAs. Members of this family contain nine conserved motifs, including the conserved Asp-Glu-Ala-Asp (DEAD) motif, important to ATP binding and hydrolysis as well as RNA binding and unwinding activities. Dysregulation of this gene may play a role in cancer development. Alternative splicing results in multiple transcript variants."[90]

Gene ID: 1728

"This gene is a member of the NAD(P)H dehydrogenase (quinone) family and encodes a cytoplasmic 2-electron reductase. This FAD-binding protein forms homodimers and reduces quinones to hydroquinones. This protein's enzymatic activity prevents the one electron reduction of quinones that results in the production of radical species. Mutations in this gene have been associated with tardive dyskinesia (TD), an increased risk of hematotoxicity after exposure to benzene, and susceptibility to various forms of cancer. Altered expression of this protein has been seen in many tumors and is also associated with Alzheimer's disease (AD). Alternate transcriptional splice variants, encoding different isoforms, have been characterized."[91]

Gene ID: 1735

"The protein encoded by this intronless gene belongs to the iodothyronine deiodinase family. It catalyzes the inactivation of thyroid hormone by inner ring deiodination of the prohormone thyroxine (T4) and the bioactive hormone 3,3',5-triiodothyronine (T3) to inactive metabolites, 3,3',5'-triiodothyronine (RT3) and 3,3'-diiodothyronine (T2), respectively. This enzyme is highly expressed in pregnant uterus, placenta, fetal and neonatal tissues, and thought to prevent premature exposure of developing fetal tissues to adult levels of thyroid hormones. It regulates circulating fetal thyroid hormone concentrations, and thus plays a critical role in mammalian development. Knockout mice lacking this gene exhibit abnormalities related to development and reproduction, and increased activity of this enzyme in infants with hemangiomas causes severe hypothyroidism. This protein is a selenoprotein, containing the rare selenocysteine (Sec) amino acid at its active site. Sec is encoded by the UGA codon, which normally signals translation termination. The 3' UTRs of selenoprotein mRNAs contain a conserved stem-loop structure, designated the Sec insertion sequence (SECIS) element, that is necessary for the recognition of UGA as a Sec codon rather than as a stop signal."[92]

Gene ID: 1805

"Dermatopontin is an extracellular matrix protein with possible functions in cell-matrix interactions and matrix assembly. The protein is found in various tissues and many of its tyrosine residues are sulphated. Dermatopontin is postulated to modify the behavior of TGF-beta through interaction with decorin."[93] It has a TATA box (TATAAAA) from -26 to -20 nts from the TSS.[6]

Gene ID: 1811

The DRA gene (colon mucosa-associated gene) has a TATA box.[94]

"The protein encoded by this gene is a transmembrane glycoprotein that transports chloride ions across the cell membrane in exchange for bicarbonate ions. It is localized to the mucosa of the lower intestinal tract, particularly to the apical membrane of columnar epithelium and some goblet cells. The protein is essential for intestinal chloride absorption, and mutations in this gene have been associated with congenital chloride diarrhea."[95]

Gene ID: 1831

"This gene encodes the anti-inflammatory protein glucocorticoid (GC)-induced leucine zipper. Expression of this gene stimulated by glucocorticoids and interleukin 10 and it appears to play a key role in the anti-inflammatory and immunosuppressive effects of this steroid. This protein has also been shown to inhibit pro-inflammatory molecules including nuclear factor κB. Alternate splicing results in multiple transcript variants."[96]

Gene ID: 1833

"Dermatan sulfate proteoglycan 3 is a member of the small leucine-rich repeat proteoglycan family. This gene is composed of seven exons. It regulates fibrillogenesis by interacting with collagen fibrils and other extracellular matrix proteins."[97]

Gene ID: 1893

"This gene encodes a soluble protein that is involved in endochondral bone formation, angiogenesis, and tumor biology. It also interacts with a variety of extracellular and structural proteins, contributing to the maintenance of skin integrity and homeostasis. Mutations in this gene are associated with lipoid proteinosis disorder (also known as hyalinosis cutis et mucosae or Urbach-Wiethe disease) that is characterized by generalized thickening of skin, mucosae and certain viscera. Alternatively spliced transcript variants encoding distinct isoforms have been described for this gene."[97]

Gene ID: 1906

"This gene encodes a preproprotein that is proteolytically processed to generate a secreted peptide that belongs to the endothelin/sarafotoxin family. This peptide is a potent vasoconstrictor and its cognate receptors are therapeutic targets in the treatment of pulmonary arterial hypertension. Aberrant expression of this gene may promote tumorigenesis. Alternative splicing results in multiple transcript variants."[98]

Gene ID: 1915

"This gene encodes an isoform of the alpha subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This isoform (alpha 1) is expressed in brain, placenta, lung, liver, kidney, and pancreas, and the other isoform (alpha 2) is expressed in brain, heart and skeletal muscle. This isoform is identified as an autoantigen in 66% of patients with Felty syndrome. This gene has been found to have multiple copies on many chromosomes, some of which, if not all, represent different pseudogenes."[99]

Gene ID: 1938

Gene ID: 1961

"Enables DNA-binding transcription activator activity, RNA polymerase II-specific and sequence-specific double-stranded DNA binding activity. Involved in positive regulation of transcription by RNA polymerase II. Predicted to be located in nucleoplasm. Predicted to be part of chromatin."[100]

Gene ID: 1974

Gene ID: 1990

Gene ID: 1999

"Enables DNA-binding transcription activator activity, RNA polymerase II-specific and sequence-specific double-stranded DNA binding activity. Involved in inflammatory response; negative regulation of transcription, DNA-templated; and positive regulation of transcription by RNA polymerase II. Located in Golgi apparatus; cytosol; and nucleoplasm." It has a TATA box (TATAAAG) from -31 to -25 nts from the TSS.[6]

Gene ID: 2168

Gene ID: 2206

Gene ID: 2244

"The protein encoded by this gene is the beta component of fibrinogen, a blood-borne glycoprotein comprised of three pairs of nonidentical polypeptide chains. Following vascular injury, fibrinogen is cleaved by thrombin to form fibrin which is the most abundant component of blood clots. In addition, various cleavage products of fibrinogen and fibrin regulate cell adhesion and spreading, display vasoconstrictor and chemotactic activities, and are mitogens for several cell types. Fibrinogen serves key roles in hemostasis and antimicrobial host defense. Mutations in this gene lead to several disorders, including afibrinogenemia, dysfibrinogenemia, hypodysfibrinogenemia and thrombotic tendency."[101]

Gene ID: 2250

Gene ID: 2321

Gene ID: 2353

Gene ID: 2494

"The protein encoded by this gene is a DNA-binding zinc finger transcription factor and is a member of the fushi tarazu factor-1 subfamily of orphan nuclear receptors. The encoded protein is involved in the expression of genes for hepatitis B virus and cholesterol biosynthesis, and may be an important regulator of embryonic development."[102] It has a TATA box (TATAACA) from -28 to -21 nts from the TSS.[6]

Gene ID: 2538

Gene ID: 2542

Gene ID: 2597

Gene ID: 2641

"The protein encoded by this gene is actually a preproprotein that is cleaved into four distinct mature peptides. One of these, glucagon, is a pancreatic hormone that counteracts the glucose-lowering action of insulin by stimulating glycogenolysis and gluconeogenesis. Glucagon is a ligand for a specific G-protein linked receptor whose signalling pathway controls cell proliferation. Two of the other peptides are secreted from gut endocrine cells and promote nutrient absorption through distinct mechanisms. Finally, the fourth peptide is similar to glicentin, an active enteroglucagon."[103]

Gene ID: 2652

Gene ID: 2669

Gene ID: 2688

Gene ID: 2752

"The protein encoded by this gene belongs to the glutamine synthetase family. It catalyzes the synthesis of glutamine from glutamate and ammonia in an ATP-dependent reaction. This protein plays a role in ammonia and glutamate detoxification, acid-base homeostasis, cell signaling, and cell proliferation. Glutamine is an abundant amino acid, and is important to the biosynthesis of several amino acids, pyrimidines, and purines. Mutations in this gene are associated with congenital glutamine deficiency, and overexpression of this gene was observed in some primary liver cancer samples. There are six pseudogenes of this gene found on chromosomes 2, 5, 9, 11, and 12. Alternative splicing results in multiple transcript variants."[104] It has a TATA box (GATAAAG) from -30 to -24 nts from the TSS.[6]

Gene ID: 2780

"Transducin is a 3-subunit guanine nucleotide-binding protein (G protein) which stimulates the coupling of rhodopsin and cGMP-phoshodiesterase during visual impulses. The transducin alpha subunits in rods and cones are encoded by separate genes. This gene encodes the alpha subunit in cones."[105] It has a TATA box (TATAAAG) from -30 to -23 nts from the TSS.[6]

Gene ID: 2814

"Human platelet glycoprotein V (GP5) is a part of the Ib-V-IX system of surface glycoproteins that constitute the receptor for von Willebrand factor (VWF; MIM 613160) and mediate the adhesion of platelets to injured vascular surfaces in the arterial circulation, a critical initiating event in hemostasis. The main portion of the receptor is a heterodimer composed of 2 polypeptide chains, an alpha chain (GP1BA; MIM 606672) and a beta chain (GP1BB; MIM 138720), that are linked by disulfide bonds. The complete receptor complex includes noncovalent association of the alpha and beta subunits with platelet glycoprotein IX (GP9; MIM 173515) and GP5. Mutations in GP1BA, GP1BB, and GP9 have been shown to cause Bernard-Soulier syndrome (MIM 231200), a bleeding disorder (review by Lopez et al., 1998 [PubMed 9616133])."[106]

Gene ID: 2922

Gene ID: 2947

Gene ID: 2980

"Predicted to enable guanylate cyclase activator activity. Predicted to be involved in positive regulation of guanylate cyclase activity and signal transduction. Predicted to be located in extracellular region."[107] It has a TATA box (TTTAAAA) from -33 to -27 nts from the TSS.[6]

Gene ID: 2981

"Predicted to enable guanylate cyclase activator activity. Predicted to be involved in positive regulation of guanylate cyclase activity and signal transduction. Predicted to be located in extracellular region."[108] It has a TATA box (TATAAGG) from -30 to -24 nts from the TSS.[6]

Gene ID: 3006

"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6."[109]

Gene ID: 3008

"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6."[110]

Gene ID: 3009

"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3."[111]

Gene ID: 3010

"Histones are basic nuclear proteins responsible for nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6."[112]

Gene ID: 3015

Gene ID: 3050

"Zeta-globin is an alpha-like hemoglobin. The zeta-globin polypeptide is synthesized in the yolk sac of the early embryo, while alpha-globin is produced throughout fetal and adult life. The zeta-globin gene is a member of the human alpha-globin gene cluster that includes five functional genes and two pseudogenes. The order of genes is: 5' - zeta - pseudozeta - mu - pseudoalpha-1 - alpha-2 -alpha-1 - theta1 - 3'."[113]

Gene ID: 3133

Gene ID: 3158

"The protein encoded by this gene belongs to the HMG-CoA synthase family. It is a mitochondrial enzyme that catalyzes the first reaction of ketogenesis, a metabolic pathway that provides lipid-derived energy for various organs during times of carbohydrate deprivation, such as fasting. Mutations in this gene are associated with HMG-CoA synthase deficiency. Alternatively spliced transcript variants encoding different isoforms have been found for this gene."[114] It has a TATA box (TATAAAG) from -30 to -24 nts from the TSS.[6]

Gene ID: 3164

Gene ID: 3171

Gene ID: 3182

Gene ID: 3206

Gene ID: 3222

Gene ID: 3232

Gene ID: 3273

Gene ID: 3280

"This protein belongs to the basic helix-loop-helix family of transcription factors. It is a transcriptional repressor of genes that require a bHLH protein for their transcription. The protein has a particular type of basic domain that contains a helix interrupting protein that binds to the N-box rather than the canonical E-box."[115]

Gene ID: 3283

"The protein encoded by this gene is an enzyme that catalyzes the oxidative conversion of delta-5-3-beta-hydroxysteroid precursors into delta-4-ketosteroids, which leads to the production of all classes of steroid hormones. The encoded protein also catalyzes the interconversion of 3-beta-hydroxy- and 3-keto-5-alpha-androstane steroids."[116] It has a TATA box (CATAAAG) from -30 to -24 nts from the TSS.[6]

Gene ID: 3284

"The protein encoded by this gene is a bifunctional enzyme that catalyzes the oxidative conversion of delta(5)-ene-3-beta-hydroxy steroid, and the oxidative conversion of ketosteroids. It plays a crucial role in the biosynthesis of all classes of hormonal steroids. This gene is predominantly expressed in the adrenals and the gonads. Mutations in this gene are associated with 3-beta-hydroxysteroid dehydrogenase, type II, deficiency. Alternatively spliced transcript variants have been found for this gene."[117] It has a TATA box (CATAAAG) from -30 to -24 nts from the TSS.[6]

Gene ID: 3304

Gene ID: 3308

The Drosophila hsp70 has a TATA box containing promoter.[118] This suggests that Gene ID: 3308 HSPA4 heat shock 70kDa protein 4 [Homo sapiens], also known as hsp70,[119] has a TATA box in its core promoter.

Gene ID: 3309

"The protein encoded by this gene is a member of the heat shock protein 70 (HSP70) family. This protein localizes to the lumen of the endoplasmic reticulum (ER) where it operates as a typical HSP70 chaperone involved in the folding and assembly of proteins in the ER and is a master regulator of ER homeostasis. During cellular stress, as during viral infection or tumorogenesis, this protein interacts with the transmembrane stress sensor proteins PERK (protein kinase R-like endoplasmic reticulum kinase), IRE1 (inositol-requiring kinase 1), and ATF6 (activating transcription factor 6) where it acts as a repressor of the unfolded protein response (UPR) and also plays a role in cellular apoptosis and senescence. Elevated expression and atypical translocation of this protein to the cell surface has been reported in viral infections and some types of cancer cells. At the cell surface this protein may facilitate viral attachment and entry to host cells. This gene is a therapeutic target for the treatment of coronavirus diseases and chemoresistant cancers."[120]

Gene ID: 3320

"The protein encoded by this gene is an inducible molecular chaperone that functions as a homodimer. The encoded protein aids in the proper folding of specific target proteins by use of an ATPase activity that is modulated by co-chaperones. Two transcript variants encoding different isoforms have been found for this gene."[121]

Gene ID: 3371

"This gene encodes an extracellular matrix protein with a spatially and temporally restricted tissue distribution. This protein is homohexameric with disulfide-linked subunits, and contains multiple EGF-like and fibronectin type-III domains. It is implicated in guidance of migrating neurons as well as axons during development, synaptic plasticity, and neuronal regeneration."[122]

Gene ID: 3375

"This gene encodes a member of the calcitonin family of peptide hormones. This hormone is released from pancreatic beta cells following food intake to regulate blood glucose levels and act as a satiation signal. Human patients with type 1 and advanced type 2 diabetes exhibit reduced levels of the encoded hormone in blood and pancreas. This protein also exhibits a bactericidal, antimicrobial activity."[123]

Gene ID: 3383

Gene ID: 3397

Gene ID: 3458

Gene ID: 3487

Gene ID: 3491

"The secreted protein encoded by this gene is growth factor-inducible and promotes the adhesion of endothelial cells. The encoded protein interacts with several integrins and with heparan sulfate proteoglycan. This protein also plays a role in cell proliferation, differentiation, angiogenesis, apoptosis, and extracellular matrix formation."[124] It has a TATA box (TATAAAA) from -30 to -24 nts from the TSS.[6]

Gene ID: 3558

Gene ID: 3565

"The protein encoded by this gene is a pleiotropic cytokine produced by activated T cells. This cytokine is a ligand for interleukin 4 receptor. The interleukin 4 receptor also binds to IL13, which may contribute to many overlapping functions of this cytokine and IL13. STAT6, a signal transducer and activator of transcription, has been shown to play a central role in mediating the immune regulatory signal of this cytokine. This gene, IL3, IL5, IL13, and CSF2 form a cytokine gene cluster on chromosome 5q, with this gene particularly close to IL13. This gene, IL13 and IL5 are found to be regulated coordinately by several long-range regulatory elements in an over 120 kilobase range on the chromosome. IL4 is considered an important cytokine for tissue repair, counterbalancing the effects of proinflammatory type 1 cytokines, however, it also promotes allergic airway inflammation. Moreover, IL-4, a type 2 cytokine, mediates and regulates a variety of human host responses such as allergic, anti-parasitic, wound healing, and acute inflammation. This cytokine has been reported to promote resolution of neutrophil-mediated acute lung injury. In an allergic response, IL-4 has an essential role in the production of allergen-specific immunoglobin (Ig) E. This pro-inflammatory cytokine has been observed to be increased in COVID-19 (Coronavirus disease 2019) patients, but is not necessarily associated with severe COVID-19 pathology. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported."[125]

Gene ID: 3593

"This gene encodes a subunit of interleukin 12, a cytokine that acts on T and natural killer cells, and has a broad array of biological activities. Interleukin 12 is a disulfide-linked heterodimer composed of the 40 kD cytokine receptor like subunit encoded by this gene, and a 35 kD subunit encoded by IL12A. This cytokine is expressed by activated macrophages that serve as an essential inducer of Th1 cells development. This cytokine has been found to be important for sustaining a sufficient number of memory/effector Th1 cells to mediate long-term protection to an intracellular pathogen. Overexpression of this gene was observed in the central nervous system of patients with multiple sclerosis (MS), suggesting a role of this cytokine in the pathogenesis of the disease. The promoter polymorphism of this gene has been reported to be associated with the severity of atopic and non-atopic asthma in children."[126]

Gene ID: 3596

"This gene encodes an immunoregulatory cytokine produced primarily by activated Th2 cells. This cytokine is involved in several stages of B-cell maturation and differentiation. It up-regulates CD23 and MHC class II expression, and promotes IgE isotype switching of B cells. This cytokine down-regulates macrophage activity, thereby inhibits the production of pro-inflammatory cytokines and chemokines. This cytokine is found to be critical to the pathogenesis of allergen-induced asthma but operates through mechanisms independent of IgE and eosinophils. This gene, IL3, IL5, IL4, and CSF2 form a cytokine gene cluster on chromosome 5q, with this gene particularly close to IL4."[127]

Gene ID: 3605

Gene ID: 3624

Gene ID: 3630

"This gene encodes insulin, a peptide hormone that plays a vital role in the regulation of carbohydrate and lipid metabolism. After removal of the precursor signal peptide, proinsulin is post-translationally cleaved into three peptides: the B chain and A chain peptides, which are covalently linked via two disulfide bonds to form insulin, and C-peptide. Binding of insulin to the insulin receptor (INSR) stimulates glucose uptake. A multitude of mutant alleles with phenotypic effects have been identified, including insulin-dependent diabetes mellitus, permanent neonatal diabetes diabetes mellitus, maturity-onset diabetes of the young type 10 and hyperproinsulinemia. There is a read-through gene, INS-IGF2, which overlaps with this gene at the 5' region and with the IGF2 gene at the 3' region."[128]

Gene ID: 3640

Gene ID: 3662

Gene ID: 3708

"This gene encodes an intracellular receptor for inositol 1,4,5-trisphosphate. Upon stimulation by inositol 1,4,5-trisphosphate, this receptor mediates calcium release from the endoplasmic reticulum. Mutations in this gene cause spinocerebellar ataxia type 15, a disease associated with an heterogeneous group of cerebellar disorders. Multiple transcript variants have been identified for this gene."[129]

Gene ID: 3726

Gene ID: 3758

Gene ID: 3848

"The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the spinous and granular layers of the epidermis with family member KRT10 and mutations in these genes have been associated with bullous congenital ichthyosiform erythroderma. The type II cytokeratins are clustered in a region of chromosome 12q12-q13."[130]

Gene ID: 3858

"This gene encodes a member of the type I (acidic) cytokeratin family, which belongs to the superfamily of intermediate filament (IF) proteins. Keratins are heteropolymeric structural proteins which form the intermediate filament. These filaments, along with actin microfilaments and microtubules, compose the cytoskeleton of epithelial cells. Mutations in this gene are associated with epidermolytic hyperkeratosis. This gene is located within a cluster of keratin family members on chromosome 17q21."[131]

Gene ID: 3859

"KRT12 encodes the type I intermediate filament chain keratin 12, expressed in corneal epithelia. Mutations in this gene lead to Meesmann corneal dystrophy."[132]

Gene ID: 3860

Gene ID: 3861

Gene ID: 3872

Gene ID: 3906

Gene ID: 3918

"Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Laminins, composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively), have a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains. Each laminin chain is a multidomain protein encoded by a distinct gene. Several isoforms of each chain have been described. Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1. The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo. This gene encodes the gamma chain isoform laminin, gamma 2. The gamma 2 chain, formerly thought to be a truncated version of beta chain (B2t), is highly homologous to the gamma 1 chain; however, it lacks domain VI, and domains V, IV and III are shorter. It is expressed in several fetal tissues but differently from gamma 1, and is specifically localized to epithelial cells in skin, lung and kidney. The gamma 2 chain together with alpha 3 and beta 3 chains constitute laminin 5 (earlier known as kalinin), which is an integral part of the anchoring filaments that connect epithelial cells to the underlying basement membrane. The epithelium-specific expression of the gamma 2 chain implied its role as an epithelium attachment molecule, and mutations in this gene have been associated with junctional epidermolysis bullosa, a skin disease characterized by blisters due to disruption of the epidermal-dermal junction. Two transcript variants resulting from alternative splicing of the 3' terminal exon, and encoding different isoforms of gamma 2 chain, have been described. The two variants are differentially expressed in embryonic tissues, however, the biological significance of the two forms is not known. Transcript variants utilizing alternative polyA_signal have also been noted in literature."[133] It has a TATA box (GATAAAA) from -33 to -27 nts from the TSS.[6]

Gene ID: 3938

Gene ID: 3976

Gene ID: 3977

"This gene encodes a protein that belongs to the type I cytokine receptor family. This protein combines with a high-affinity converter subunit, gp130, to form a receptor complex that mediates the action of the leukemia inhibitory factor, a polyfunctional cytokine that is involved in cellular differentiation, proliferation and survival in the adult and the embryo. Mutations in this gene cause Schwartz-Jampel syndrome type 2, a disease belonging to the group of the bent-bone dysplasias. A translocation that involves the promoter of this gene, t(5;8)(p13;q12) with the pleiomorphic adenoma gene 1, is associated with salivary gland pleiomorphic adenoma, a common type of benign epithelial tumor of the salivary gland. Multiple splice variants encoding two different isoforms have been found for this gene."[134]

Gene ID: 3995

TATA box is TATAA.[94]

The protein encoded by this FADS3 fatty acid desaturase 3 gene is "a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization."[135]

Gene ID: 4014

"This gene encodes loricrin, a major protein component of the cornified cell envelope found in terminally differentiated epidermal cells. Mutations in this gene are associated with Vohwinkel's syndrome and progressive symmetric erythrokeratoderma, both inherited skin diseases."[136] It has a TATA box (TATATATAA) from -40 to -32 nts from the TSS.[6]

Gene ID: 4025

Gene ID: 4049

Gene ID: 4144

Gene ID: 4222

Gene ID: 4225

Gene ID: 4254

Gene ID: 4256

Gene ID: 4284

"Major intrinsic protein is a member of the water-transporting aquaporins as well as the original member of the MIP family of channel proteins. The function of the fiber cell membrane protein encoded by this gene is undetermined, yet this protein is speculated to play a role in intracellular communication. The MIP protein is expressed in the ocular lens and is required for correct lens function. This gene has been mapped among aquaporins AQP2, AQP5, and AQP6, in a potential gene cluster at 12q13."[137]

Gene ID: 4316

Gene ID: 4319

Gene ID: 4321

Gene ID: 4322

Gene ID: 4327

Gene ID: 4357

Gene ID: 4435

Gene ID: 4501

Gene ID: 4504

Gene ID: 4582

"This gene encodes a membrane-bound protein that is a member of the mucin family. Mucins are O-glycosylated proteins that play an essential role in forming protective mucous barriers on epithelial surfaces. These proteins also play a role in intracellular signaling. This protein is expressed on the apical surface of epithelial cells that line the mucosal surfaces of many different tissues including lung, breast stomach and pancreas. This protein is proteolytically cleaved into alpha and beta subunits that form a heterodimeric complex. The N-terminal alpha subunit functions in cell-adhesion and the C-terminal beta subunit is involved in cell signaling. Overexpression, aberrant intracellular localization, and changes in glycosylation of this protein have been associated with carcinomas. This gene is known to contain a highly polymorphic variable number tandem repeats (VNTR) domain. Alternate splicing results in multiple transcript variants."[138] It has a TATA box (TATAAAG) from -24 to -18 nts from the TSS.[6]

Gene ID: 4609

Gene ID: 4616

Gene ID: 4618

Gene ID: 4624

Gene ID: 4629

Gene ID: 4632

"Myosin is a hexameric ATPase cellular motor protein. It is composed of two heavy chains, two nonphosphorylatable alkali light chains, and two phosphorylatable regulatory light chains. This gene encodes a myosin alkali light chain expressed in fast skeletal muscle. Two transcript variants have been identified for this gene."[139]

Gene ID: 4638

Gene ID: 4653

"MYOC encodes the protein myocilin, which is believed to have a role in cytoskeletal function. MYOC is expressed in many occular tissues, including the trabecular meshwork, and was revealed to be the trabecular meshwork glucocorticoid-inducible response protein (TIGR). The trabecular meshwork is a specialized eye tissue essential in regulating intraocular pressure, and mutations in MYOC have been identified as the cause of hereditary juvenile-onset open-angle glaucoma."[140] It has a TATA box (TATATATAAAC) from -31 to -21 nts from the TSS.[6]

Gene ID: 4741

Gene ID: 4744

Gene ID: 4747

Gene ID: 4842

Gene ID: 4843

Gene ID: 4856

Gene ID: 4869

"The protein encoded by this gene is involved in several cellular processes, including centrosome duplication, protein chaperoning, and cell proliferation. The encoded phosphoprotein shuttles between the nucleolus, nucleus, and cytoplasm, chaperoning ribosomal proteins and core histones from the nucleus to the cytoplasm. This protein is also known to sequester the tumor suppressor ARF in the nucleolus, protecting it from degradation until it is needed. Mutations in this gene are associated with acute myeloid leukemia. Dozens of pseudogenes of this gene have been identified."[141]

Gene ID: 4878

"The protein encoded by this gene belongs to the natriuretic peptide family. Natriuretic peptides are implicated in the control of extracellular fluid volume and electrolyte homeostasis. This protein is synthesized as a large precursor (containing a signal peptide), which is processed to release a peptide from the N-terminus with similarity to vasoactive peptide, cardiodilatin, and another peptide from the C-terminus with natriuretic-diuretic activity. Mutations in this gene have been associated with atrial fibrillation familial type 6. This gene is located adjacent to another member of the natriuretic family of peptides on chromosome 1."[142] It has a TATA box (TATAAAAAG) from -30 to -22 nts from the TSS.[6]

Gene ID: 4922

Gene ID: 4946

Gene ID: 4982

Gene ID: 5004

Gene ID: 5005

Gene ID: 5016

"This gene encodes a large, carbohydrate-rich, epithelial glycoprotein with numerous O-glycosylation sites located within threonine, serine, and proline-rich tandem repeats. The gene is similar to members of the mucin and the glycosyl hydrolase 18 gene families. Regulation of expression may be estrogen-dependent. Gene expression and protein secretion occur during late follicular development through early cleavage-stage embryonic development. The protein is secreted from non-ciliated oviductal epithelial cells and associates with ovulated oocytes, blastomeres, and spermatozoan acrosomal regions."[143] It has a TATA box (TATAAAG) from -25 to -19 nts from the TSS.[6]

Gene ID: 5033

Gene ID: 5034

Gene ID: 5052

"This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in cells, and may contribute to the antiviral activity of CD8(+) T-cells. This protein may have a proliferative effect and play a role in cancer development or progression. Four transcript variants encoding the same protein have been identified for this gene."[144] It has a TATA box (TATAAAG) from -31 to -25 nts from the TSS.[6]

Gene ID: 5054

Gene ID: 5055

Gene ID: 5068

Gene ID: 5069

Gene ID: 5079

Gene ID: 5132

"This gene encodes a phosphoprotein, which is located in the outer and inner segments of the rod cells in the retina. This protein may participate in the regulation of visual phototransduction or in the integration of photoreceptor metabolism. It modulates the phototransduction cascade by interacting with the beta and gamma subunits of the retinal G-protein transducin. This gene is a potential candidate gene for retinitis pigmentosa and Usher syndrome type II. Alternatively spliced transcript variants encoding different isoforms have been identified."[145] It has a TATA box (TTTAAAT) from -32 to -26 nts from the TSS.[6]

Gene ID: 5155

Gene ID: 5224

Gene ID: 5225

Gene ID: 5275

Gene ID: 5360

Gene ID: 5406

Gene ID: 5408

Gene ID: 5443

Gene ID: 5449

"This gene encodes a member of the POU family of transcription factors that regulate mammalian development. The protein regulates expression of several genes involved in pituitary development and hormone expression. Mutations in this genes result in combined pituitary hormone deficiency. Multiple transcript variants encoding different isoforms have been found for this gene."[146]

Gene ID: 5478

Gene ID: 5514

Gene ID: 5553

Gene ID: 5610

Gene ID: 5617

Gene ID: 5645

Gene ID: 5650

Gene ID: 5741

Gene ID: 5743

"Prostaglandin-endoperoxide synthase (PTGS), also known as cyclooxygenase, is the key enzyme in prostaglandin biosynthesis, and acts both as a dioxygenase and as a peroxidase. There are two isozymes of PTGS: a constitutive PTGS1 and an inducible PTGS2, which differ in their regulation of expression and tissue distribution. This gene encodes the inducible isozyme. It is regulated by specific stimulatory events, suggesting that it is responsible for the prostanoid biosynthesis involved in inflammation and mitogenesis."[147] It has a TATA box (TATAAAA) from -31 to -25 nts from the TSS.[6]

"[T]he human ... prostaglandin-endoperoxide-synthase-2 [gene contains] a canonical TATA box (nucleotide residues at positions -31 to -25 for the human gene)."[148] This is Gene ID: 5743.

Gene ID: 5744

Gene ID: 5950

Gene ID: 5956

Gene ID: 5967

Gene ID: 5996

"This gene encodes a member of the regulator of G-protein signalling family. This protein is located on the cytosolic side of the plasma membrane and contains a conserved, 120 amino acid motif called the RGS domain. The protein attenuates the signalling activity of G-proteins by binding to activated, GTP-bound G alpha subunits and acting as a GTPase activating protein (GAP), increasing the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal."[149] It has a TATA box (TATAAAG) from -28 to -22 nts from the TSS.[6]

Gene ID: 5997

"Regulator of G protein signaling (RGS) family members are regulatory molecules that act as GTPase activating proteins (GAPs) for G alpha subunits of heterotrimeric G proteins. RGS proteins are able to deactivate G protein subunits of the Gi alpha, Go alpha and Gq alpha subtypes. They drive G proteins into their inactive GDP-bound forms. Regulator of G protein signaling 2 belongs to this family. The protein acts as a mediator of myeloid differentiation and may play a role in leukemogenesis."[150] It has a TATA box (CATAAAT) from -28 to -22 nts from the TSS.[6]

Gene ID: 6046

Gene ID: 6121

"The protein encoded by this gene is a component of the vitamin A visual cycle of the retina which supplies the 11-cis retinal chromophore of the photoreceptors opsin visual pigments. It is a member of the carotenoid cleavage oxygenase superfamily. All members of this superfamily are non-heme iron oxygenases with a seven-bladed propeller fold and oxidatively cleave carotenoid carbon:carbon double bonds. However, the protein encoded by this gene has acquired a divergent function that involves the concerted O-alkyl ester cleavage of its all-trans retinyl ester substrate and all-trans to 11-cis double bond isomerization of the retinyl moiety. As such, it performs the essential enzymatic isomerization step in the synthesis of 11-cis retinal. Mutations in this gene are associated with early-onset severe blinding disorders such as Leber congenital."[151] It has a TATA box (CATAAAA) from -27 to -21 nts from the TSS.[6]

Gene ID: 6175

Gene ID: 6187

Gene ID: 6224

Gene ID: 6232

"Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of four RNA species and approximately 80 structurally distinct proteins. This gene encodes a member of the S27e family of ribosomal proteins and component of the 40S subunit. The encoded protein contains a C4-type zinc finger domain that can bind to zinc and may bind to nucleic acid. Mutations in this gene have been identified in numerous melanoma patients and in at least one patient with Diamond-Blackfan anemia (DBA). Elevated expression of this gene has been observed in various human cancers. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome."[152] It has a TATA box (TATATAA) from -29 to -23 nts from the TSS.[6]

Gene ID: 6279

"The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in the inhibition of casein kinase and as a cytokine. Altered expression of this protein is associated with the disease cystic fibrosis. Multiple transcript variants encoding different isoforms have been found for this gene."[153] It has a TATA box (TATAAAA) from -30 to -24 nts from the TSS, Code score: 1.00; Matrix score: 0.93.[6]

Gene ID: 6280

Gene ID: 6288

Gene ID: 6289

Gene ID: 6349

Gene ID: 6351

Gene ID: 6352

Gene ID: 6354

Gene ID: 6356

Gene ID: 6357

Gene ID: 6364

"This antimicrobial gene belongs to the subfamily of small cytokine CC genes. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The protein encoded by this gene displays chemotactic activity for lymphocytes and can repress proliferation of myeloid progenitors. Two transcript variants encoding different isoforms have been found for this gene."[146]

Gene ID: 6376

Gene ID: 6414

Gene ID: 6427

Gene ID: 6428

Gene ID: 6432

Gene ID: 6435

Gene ID: 6436

Gene ID: 6446

Gene ID: 6500

Gene ID: 6513

Gene ID: 6518

"The protein encoded by this gene is a fructose transporter responsible for fructose uptake by the small intestine. The encoded protein also is necessary for the increase in blood pressure due to high dietary fructose consumption."[154] It has a TATA box (TATAAAA) from -33 to -27 nts from the TSS, Code score: 1.00; Matrix score: 0.92.[6]

Gene ID: 6519

Gene ID: 6548

"This gene encodes a Na+/H+ antiporter that is a member of the solute carrier family 9. The encoded protein is a plasma membrane transporter that is expressed in the kidney and intestine. This protein plays a central role in regulating pH homeostasis, cell migration and cell volume. This protein may also be involved in tumor growth."[155] It has a TATA box (TATAAGT) from -32 to -26 nts from the TSS, Code score: 0.91; Matrix score: 0.84.[6]

Gene ID: 6559

Gene ID: 6566

"The protein encoded by this gene is a proton-linked monocarboxylate transporter that catalyzes the movement of many monocarboxylates, such as lactate and pyruvate, across the plasma membrane. Mutations in this gene are associated with erythrocyte lactate transporter defect. Alternatively spliced transcript variants have been found for this gene."[156] It has a TATA box (TATAAGG) from -31 to -25 nts from the TSS, Code score: 0.91; Matrix score: 0.78.[6]

Gene ID: 6624

Gene ID: 6647

Gene ID: 6698

"A structural constituent of skin epidermis. Involved in keratinocyte differentiation and peptide cross-linking. Located in cornified envelope."[157] It has a TATA box (TATAAAAG) from -30 to -23 nts from the TSS, Code score: 1.00; Matrix score: 0.91.[6]

Gene ID: 6707

Gene ID: 6781

Gene ID: 6783

Gene ID: 6870

Gene ID: 6906

Gene ID: 6916

Gene ID: 6988

Gene ID: 7021

Gene ID: 7031

Gene ID: 7032

"Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer and affect healing of the epithelium. The encoded protein inhibits gastric acid secretion. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21."[158]

Gene ID: 7037

Gene ID: 7038

Gene ID: 7042

Gene ID: 7043

Gene ID: 7056

Gene ID: 7124

Gene ID: 7167

Gene ID: 7252

Gene ID: 7262

Gene ID: 7276

Gene ID: 7280

Gene ID: 7295

Gene ID: 7306

Gene ID: 7316

Gene ID: 7369

Gene ID: 7432

Gene ID: 7803

"This gene encodes a member of a small class of prenylated protein tyrosine phosphatases (PTPs), which contain a PTP domain and a characteristic C-terminal prenylation motif. The encoded protein is a cell signaling molecule that plays regulatory roles in a variety of cellular processes, including cell proliferation and migration. The protein may also be involved in cancer development and metastasis. This tyrosine phosphatase is a nuclear protein, but may associate with plasma membrane by means of its prenylation motif. Pseudogenes related to this gene are located on chromosomes 1, 2, 5, 7, 11 and X."[159]

Gene ID: 7850

Gene ID: 7852

Gene ID: 8000

Gene ID: 8288

Gene ID: 8339

Gene ID: 8431

Gene ID: 8483

Gene ID: 8490

Gene ID: 8513

Gene ID: 8564

Gene ID: 8832

Gene ID: 8942

Gene ID: 8969

"Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the histone microcluster on chromosome 6p21.33."[160]

Gene ID: 8970

Gene ID: 8991

Gene ID: 9001

Gene ID: 9421

Gene ID: 9510

Gene ID: 9518

Gene ID: 9607

Gene ID: 9643

Gene ID: 9709

Gene ID: 9768

"Enables chromatin binding activity. Involved in several processes, including cellular macromolecule biosynthetic process; centrosome cycle; and response to UV. Located in centrosome; nucleus; and perinuclear region of cytoplasm."[161]

Gene ID: 9921

Gene ID: 10057

Gene ID: 10215

Gene ID: 10284

Gene ID: 10350

Gene ID: 10397

Gene ID: 10458

Gene ID: 10482

Gene ID: 10563

Gene ID: 10631

"This gene encodes a secreted extracellular matrix protein that functions in tissue development and regeneration, including wound healing, and ventricular remodeling following myocardial infarction. The encoded protein binds to integrins to support adhesion and migration of epithelial cells. This protein plays a role in cancer stem cell maintenance and metastasis. Mice lacking this gene exhibit cardiac valve disease, and skeletal and dental defects. Alternative splicing results in multiple transcript variants encoding different isoforms."[162]

Gene ID: 10761

Gene ID: 10769

Gene ID: 10912

Gene ID: 10930

Gene ID: 10938

Gene ID: 11009

Gene ID: 11067

Gene ID: 11081

Gene ID: 11082

Gene ID: 11169

Gene ID: 12723

Gene ID: 22928

Gene ID: 22943

Gene ID: 23450

Gene ID: 23645

Gene ID: 25928

Gene ID: 26827

RNU6-1 RNA, U6 small nuclear 1 [ Homo sapiens (human) ] is also known as U6 or U6-1.[163] In the flanking region 5' to the gene[U6], there is a Hogness box sequence TATAAAT beginning at position -31 which is boxed in ...."[164]

Gene ID: 26576

Gene ID: 26827

Gene ID: 27063

Gene ID: 27106

Gene ID: 27129

Gene ID: 27159

Gene ID: 27316

Gene ID: 50943

Gene ID: 51050

Gene ID: 51129

Gene ID: 51155

Gene ID: 51203

Gene ID: 51278

Gene ID: 51297

Gene ID: 51313

Gene ID: 51582

"The protein encoded by this gene belongs to the antizyme inhibitor family, which plays a role in cell growth and proliferation by maintaining polyamine homeostasis within the cell. Antizyme inhibitors are homologs of ornithine decarboxylase (ODC, the key enzyme in polyamine biosynthesis) that have lost the ability to decarboxylase ornithine; however, retain the ability to bind to antizymes. Antizymes negatively regulate intracellular polyamine levels by binding to ODC and targeting it for degradation, as well as by inhibiting polyamine uptake. Antizyme inhibitors function as positive regulators of polyamine levels by sequestering antizymes and neutralizing their effect. This gene encodes antizyme inhibitor 1, the first member of this gene family that is ubiquitously expressed, and is localized in the nucleus and cytoplasm. Overexpression of antizyme inhibitor 1 gene has been associated with increased proliferation, cellular transformation and tumorigenesis. Gene knockout studies showed that homozygous mutant mice lacking functional antizyme inhibitor 1 gene died at birth with abnormal liver morphology. RNA editing of this gene, predominantly in the liver tissue, has been linked to the progression of hepatocellular carcinoma. Alternatively spliced transcript variants have been described for this gene."[165]

Gene ID: 51738

Gene ID: 54106

Gene ID: 54567

Gene ID: 55118

Gene ID: 55504

Gene ID: 55603

Gene ID: 56642

Gene ID: 56675

Gene ID: 56829

Gene ID: 56895

Gene ID: 56987

Gene ID: 57126

Gene ID: 57152

"The protein encoded by this gene is a member of the Ly6/uPAR family but lacks a GPI-anchoring signal sequence. It is thought that this secreted protein contains antitumor activity. Mutations in this gene have been associated with Mal de Meleda, a rare autosomal recessive skin disorder. This gene maps to the same chromosomal region as several members of the Ly6/uPAR family of glycoprotein receptors."[166]

Gene ID: 57579

Gene ID: 57626

Gene ID: 57823

"Enables identical protein binding activity. Predicted to be involved in adaptive immune response. Predicted to act upstream of or within regulation of natural killer cell activation. Located in endoplasmic reticulum."[167]

Gene ID: 64111

"Predicted to enable signaling receptor binding activity. Involved in negative regulation of gonadotropin secretion. Predicted to be located in extracellular region."[168]

Gene ID: 65108

"This gene encodes a member of the myristoylated alanine-rich C-kinase substrate (MARCKS) family. Members of this family play a role in cytoskeletal regulation, protein kinase C signaling and calmodulin signaling. The encoded protein affects the formation of adherens junction. Alternative splicing results in multiple transcript variants. Pseudogenes of this gene are located on the long arm of chromosomes 6 and 10."[169]

Gene ID: 79733

"This gene encodes a member of a family of transcription factors which regulate the expression of genes required for progression through the cell cycle. The encoded protein regulates progression from G1 to S phase by ensuring the nucleus divides at the proper time. Multiple alternatively spliced variants, encoding the same protein, have been identified."[170]

Gene ID: 80177

"Predicted to act upstream of or within hematopoietic stem cell homeostasis. Located in nucleoplasm."[171]

Gene ID: 80740

"LY6G6C belongs to a cluster of leukocyte antigen-6 (LY6) genes located in the major histocompatibility complex (MHC) class III region on chromosome 6. Members of the LY6 superfamily typically contain 70 to 80 amino acids, including 8 to 10 cysteines. Most LY6 proteins are attached to the cell surface by a glycosylphosphatidylinositol (GPI) anchor that is directly involved in signal transduction (Mallya et al., 2002 [PubMed 12079290])."[172]

Gene ID: 81285

"Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms."[173]

Gene ID: 81606

"Involved in negative regulation of transcription, DNA-templated; positive regulation of transcription, DNA-templated; and regulation of MAPK cascade. Located in cytoplasm and nucleus. Part of protein-containing complex."[174]

Gene ID: 83638

"Enables RNA binding activity."[175]

Gene ID: 83998

"Enables heparin binding activity and mannan binding activity. Predicted to act upstream of or within response to bacterium. Located in cytoplasm."[176]

Gene ID: 84107

"This gene encodes a member of the ZIC family of C2H2-type zinc finger proteins. Members of this family are important during development, and have been associated with X-linked visceral heterotaxy and holoprosencephaly type 5. This gene is closely linked to the gene encoding zinc finger protein of the cerebellum 1, a related family member on chromosome 3. Heterozygous deletion of these linked genes is involved in Dandy-Walker malformation, which is a congenital cerebellar malformation. Multiple transcript variants have been identified for this gene."[177]

Gene ID: 84223

"Enables Hsp70 protein binding activity and calmodulin binding activity. Predicted to be involved in sperm axoneme assembly. Located in cytoplasm."[178]

Gene ID: 84328

"Predicted to enable beta-catenin binding activity. Predicted to be involved in response to ionizing radiation."[179]

Gene ID: 84419

"This gene was first identified in a study of human esophageal squamous cell carcinoma tissues. Levels of both the message and protein are reduced in carcinoma samples. In adult human tissues, this gene is expressed in the the esophagus, stomach, small intestine, colon and placenta. Alternatively spliced transcript variants that encode the same protein have been identified."[180]

Gene ID: 84790

"Predicted to enable GTP binding activity. Predicted to be a structural constituent of cytoskeleton. Predicted to be involved in microtubule cytoskeleton organization and mitotic cell cycle. Located in microtubule cytoskeleton and nucleus."[181]

Gene ID: 84889

"This gene encodes a member of the solute carrier family 7. The encoded protein is a sodium-independent cationic amino acid transporter. Alternate splicing results in multiple transcripts that encoded the same protein."[182]

Gene ID: 85235

"Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the histone microcluster on chromosome 6p21.33."[183]

Gene ID: 92736

"Predicted to enable proton channel activity. Predicted to be involved in proton transmembrane transport. Predicted to be located in plasma membrane. Predicted to be integral component of membrane."[184]

Gene ID: 114899

"Enables identical protein binding activity. Involved in several processes, including cellular triglyceride homeostasis; negative regulation of NIK/NF-kappaB signaling; and regulation of cytokine production. Acts upstream of or within negative regulation of gluconeogenesis. Located in extracellular exosome and membrane."[185]

Gene ID: 115265

"Predicted to be involved in negative regulation of signal transduction. Predicted to be located in cytoplasm."[186]

Gene ID: 116842

"This gene encodes a cysteine-rich cationic antimicrobial peptide that is expressed predominantly in the liver. The mature peptide has activity against gram-positive bacteria and yeasts."[187]

Gene ID: 117156

"The protein encoded by this gene is a secreted lung surfactant protein and a downstream target of thyroid transcription factor. A single nucleotide polymorphism in the promoter of this gene results in susceptibility to asthma."[188]

Gene ID: 117158

"Located in extracellular region. Is expressed in several structures, including mammary gland and respiratory system. Human ortholog(s) of this gene implicated in allergic rhinitis and asthma. Orthologous to human SCGB3A2 (secretoglobin family 3A member 2)."[189]

Gene ID: 126364

"Predicted to be located in cytoplasm. Predicted to be integral component of membrane."[190]

Gene ID: 126393

"This locus encodes a heat shock protein. The encoded protein likely plays a role in smooth muscle relaxation."[191]

Gene ID: 130120

"This gene encodes a member of the regenerating islet-derived genes (REG)3 protein family. These proteins are secreted, C-type lectins with a carbohydrate recognition domain and N-terminal signal peptide. The protein encoded by this gene is an antimicrobial lectin with activity against Gram-positive bacteria. Alternative splicing results in multiple transcript variants encoding multiple isoforms."[192]

Gene ID: 131377

"This gene encodes a protein containing a BACK domain, a BTB/POZ domain, and 5 Kelch repeats, however, its exact function is not known. The gene and the multi-domain protein structure are conserved across different taxa, including primates, rodents, chicken and zebrafish."[193]

Gene ID: 145957

"The neuregulins, including NRG4, activate type-1 growth factor receptors (see EGFR; MIM 131550) to initiating cell-to-cell signaling through tyrosine phosphorylation (Harari et al., 1999 [PubMed 10348342])."[194]

Gene ID: 147183

"This gene encodes a member of the type I (acidic) keratin family, which belongs to the superfamily of intermediate filament (IF) proteins. Keratins are heteropolymeric structural proteins which form the intermediate filament. These filaments, along with actin microfilaments and microtubules, compose the cytoskeleton of epithelial cells. The type I keratin genes are clustered in a region of chromosome 17q12-q21."[195]

Gene ID: 153020

"Enables guanyl-nucleotide exchange factor activity. Predicted to be involved in regulation of catalytic activity and small GTPase mediated signal transduction. Predicted to be located in early endosome; late endosome; and midbody."[196]

Gene ID: 200504

"The secretory protein encoded by this gene is produced in gastric surface mucous cells, where it can bind trefoil factor family peptide 1 or gastrokine-1. This gene may be a tumor suppressor gene, as its expression is markedly decreased in gastric cancer tissues. The encoded protein interacts with gastrokine-1 and regulates homeostasis of the gastric mucosa."[197]

Gene ID: 200539

"This gene is a member of the muscle ankyrin repeat protein (MARP) family and encodes a protein with four tandem ankyrin-like repeats. The protein is localized to the nucleus, functioning as a transcriptional regulator. Expression of this protein is induced during recovery following starvation."[198]

Gene ID: 342574

"This gene encodes a member of the type I (acidic) keratin family, which belongs to the superfamily of intermediate filament (IF) proteins. Keratins are heteropolymeric structural proteins which form the intermediate filament. These filaments, along with actin microfilaments and microtubules, compose the cytoskeleton of epithelial cells. The type I keratin genes are clustered in a region of chromosome 17q12-q21."[199]

Gene ID: 389125

"Predicted to be involved in several processes, including chondrocyte differentiation; chondrocyte proliferation; and embryonic limb morphogenesis. Predicted to act upstream of or within positive regulation of chondrocyte differentiation; positive regulation of chondrocyte proliferation; and positive regulation of macromolecule metabolic process. Predicted to be located in nucleoplasm."[200]

Gene ID: 729230

"The protein encoded by this gene is a receptor for monocyte chemoattractant protein-1, a chemokine which specifically mediates monocyte chemotaxis. Monocyte chemoattractant protein-1 is involved in monocyte infiltration in inflammatory diseases such as rheumatoid arthritis as well as in the inflammatory response against tumors. The encoded protein mediates agonist-dependent calcium mobilization and inhibition of adenylyl cyclase. This protein can also be a coreceptor with CD4 for HIV-1 infection. This gene is located in the chemokine receptor gene cluster region of chromosome 3."[201] This Gene ID replaces Gene ID: 1231.

Families of TATA box genes

  1. Actin family: Gene IDs: 58, 60, 70.
  2. Actin/cytoskeleton/contractile family: Gene IDs: 60.
  3. Activity against gram-positive bacteria and yeasts: Gene IDs: 116842.
  4. Adenylate kinase: Gene IDs: 203.
  5. Albumin gene family: Gene IDs: 173.
  6. Aldolases: Gene IDs: 229.
  7. Alkaline phosphatase family: Gene IDs: 249.
  8. Alpha-amylase family: Gene IDs: 279.
  9. Amelogenin family: Gene IDs: 265.
  10. Amylases: Gene IDs: 280.
  11. Annexin family: Gene IDs: 302.
  12. AP-1 family of transcription factors: Gene IDs: .
  13. Apolipoprotein (apo-) A-II: Gene IDs: 336.
  14. Apolipoprotein of chylomicrons and low density lipoproteins (LDL): Gene IDs: 338.
  15. Aquaporin family: Gene IDs: 360.
  16. Arginase type I: Gene IDs: 383.
  17. Axoneme assembly: Gene IDs: 84223.
  18. Binding activity: Gene IDs: 84223, 83998, 83638.
  19. Binds copper, nickel, fatty acids and bilirubin: Gene IDs: 174.
  20. Binds phospholipids: Gene IDs: 301.
  21. cAMP-response element binding family: Gene IDs: .
  22. Cell cycle transcription factor family: Gene IDs: 79733.
  23. Cellular triglyceride homeostasis: Gene IDs: 114899.
  24. Chondrocyte differentiation; chondrocyte proliferation: Gene IDs: 389125.
  25. Class I fructose-biphosphate aldolase family: Gene IDs: 230.
  26. Component of triglyceride (TG)-rich lipoproteins (TRLs) including very low density lipoproteins (VLDL), high density lipoproteins (HDL) and chylomicrons: Gene IDs: 345.
  27. Copper amine oxidases: Gene IDs: 314.
  28. Enables identical protein binding activity: Gene IDs: 57823.
  29. Enable signaling receptor binding activity: Gene IDs: 64111.
  30. Enables guanyl-nucleotide exchange factor activity: Gene IDs: 153020.
  31. Epidermal growth factor family: Gene IDs: 374.
  32. G-protein-coupled receptors (GPCR) large family: Gene IDs: 81285.
  33. GTP binding activity: Gene IDs: 84790.
  34. Heat shock: Gene IDs: 126393.
  35. Histone H1 family: Gene IDs: 3010.
  36. Histone H2A family: Gene IDs: 85235.
  37. 'Housekeeping' family: Gene IDs: 229.
  38. Identical protein binding activity: Gene IDs: 114899.
  39. Inhibit inflammatory cytokines to disrupt inflammatory cascades: Gene IDs: 2.
  40. Integral component of membrane: Gene IDs: 126364.
  41. Involved in adaptive immune response: Gene IDs: 57823.
  42. Kelch-like family: Gene IDs: 131377.
  43. Maintaining blood pressure, body fluid and electrolyte homeostasis: Gene IDs: 183.
  44. Mammalian activation transcription factor/cAMP responsive element-binding (CREB) family: Gene IDs: 467.
  45. Mammary gland and respiratory system: Gene IDs: 117158.
  46. Membranes: Gene IDs: 126364.
  47. MIP/aquaporin family: Gene IDs: 359.
  48. Mitochondrial solute carrier subfamily: Gene IDs: 292.
  49. Muscle ankyrin repeat protein (MARP) family: Gene IDs: 200539.
  50. Myristoylated alanine-rich C-kinase substrate (MARCKS) family: Gene IDs: 65108.
  51. Na+/K+ and H+/K+ ATPases beta chain family: Gene IDs: 481, 482.
  52. Negative regulation of gluconeogenesis: Gene IDs: 114899.
  53. Negative regulation of gonadotropin secretion: Gene IDs: 64111.
  54. Negative regulation of NIK/NF-kappaB signaling: Gene IDs: 114899.
  55. Negative regulation of signal transduction: Gene IDs: 115265.
  56. Neuregulins: Gene IDs: 145957.
  57. Polyamine biosynthesis: Gene IDs: 262.
  58. Protease inhibitor and cytokine transporter: Gene IDs: 2.
  59. Proton channel activity: Gene IDs: 92736.
  60. Proton transmembrane transport: Gene IDs: 92736.
  61. Receptor for monocyte chemoattractant protein-1: Gene IDs: 729230.
  62. Regenerating islet-derived genes (REG)3 family: Gene IDs:130120.
  63. Regulate host inflammation and immunity: Gene IDs: 84419.
  64. Regulation of cytokine production: Gene IDs: 114899.
  65. Regulation of transcription: Gene IDs: 81606.
  66. Regulator of aldosterone secretion: Gene IDs: 185.
  67. Secretoglobin family: Gene IDs: 117158.
  68. Secretory: Gene IDs: 117158, 200504.
  69. Small integral membrane protein with six bilayer spanning domains that functions as a water channel: Gene IDs: 358.
  70. Solute carrier family 7: Gene IDs: 84889.
  71. Stem cell homeostasis: Gene IDs: 80177.
  72. Structural constituent of cytoskeleton: Gene IDs: 84790.
  73. DNA-binding family tax-responsive enhancer element in the left-to-right (LTR) of the Human T-lymphotropic virus 1 (HTLV-1) transcription factor also characterized as the cAMP-response element binding protein 2 (CREB-2): Gene IDs: 468.
  74. Type I (acidic) keratin family: Gene IDs: 147183, 342574.
  75. Vasodilation, regulation of hormone secretion, promotion of angiogenesis, and antimicrobial activity: Gene IDs: 133.
  76. Water channel protein aquaporin 3: Gene IDs: 360.
  77. ZIC family of C2H2-type zinc fingers: Gene IDs: 84107.
  78. Superfamily of ATP-binding cassette (ABC) transporters: Gene ID: 19.
  79. Superfamily of extracellular matrix proteins: amelogenin family: Gene ID: 265.
  80. Superfamily of intermediate filament (IF) proteins: keratin family.
  81. Superfamily of small GTPase protein: Gene IDs: 390.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

References

  1. R. P. Lifton, M. L. Goldberg, R. W. Karp, and D. S. Hogness (1978). "The organization of the histone genes in Drosophila melanogaster: functional and evolutionary implications". Cold Spring Harbor Symposia on Quantitative Biology. 42: 1047–51. doi:10.1101/SQB.1978.042.01.105. PMID 98262.
  2. Stephen T. Smale and James T. Kadonaga (July 2003). "The RNA Polymerase II Core Promoter" (PDF). Annual Review of Biochemistry. 72 (1): 449–79. doi:10.1146/annurev.biochem.72.121801.161520. PMID 12651739. Retrieved 2012-05-07.
  3. C Yang, E Bolotin, T Jiang, FM Sladek, E Martinez (March 2007). "Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene. 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMID 17123746.
  4. 4.0 4.1 Chuhu Yang, Eugene Bolotin, Tao Jiang, Frances M. Sladek, and Ernest Martinez (10 October 2006). "Prevalence of the Initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene. 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMID 17123746. Retrieved 2024-06-07.
  5. 5.0 5.1 5.2 5.3 5.4 Muyu Xu, Elsie Gonzalez-Hurtado, and Ernest Martinez (April 2016). "Core promoter-specific gene regulation: TATA box selectivity and Initiator-dependent bi-directionality of serum response factor-activated transcription". Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms. 1859 (4): 553–563. doi:10.1016/j.bbagrm.2016.01.005. Retrieved 2024-06-08.
  6. 6.00 6.01 6.02 6.03 6.04 6.05 6.06 6.07 6.08 6.09 6.10 6.11 6.12 6.13 6.14 6.15 6.16 6.17 6.18 6.19 6.20 6.21 6.22 6.23 6.24 6.25 6.26 6.27 6.28 6.29 6.30 6.31 6.32 6.33 6.34 6.35 6.36 6.37 6.38 6.39 6.40 6.41 6.42 6.43 6.44 Victor X Jin, Gregory AC Singer, Francisco J Agosto-Pérez, Sandya Liyanarachchi, and Ramana V Davuluri (2006). "Genome-wide analysis of core promoter elements from conserved human and mouse orthologous pairs". BMC Bioinformatics. 7: 114. doi:10.1186/1471-2105-7-114. Retrieved 2024-06-09.
  7. RefSeq (November 2016). "A2M alpha-2-macroglobulin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-14.
  8. RefSeq (September 2019). "ABCA1 ATP binding cassette subfamily A member 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  9. RefSeq (September 2019). "ACTA1 actin alpha 1, skeletal muscle [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  10. RefSeq (August 2017). "ACTB actin beta [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  11. RefSeq (July 2008). "ACTC1 actin alpha cardiac muscle 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-14.
  12. RefSeq (August 2014). "ADM adrenomedullin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  13. RefSeq (July 2008). "AFM afamin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  14. RefSeq (October 2019). "AFP alpha fetoprotein [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  15. RefSeq (November 2019). "AGT angiotensinogen [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  16. RefSeq (August 2020). "AGTR1 angiotensin II receptor type 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  17. RefSeq (January 2022). "AK1 adenylate kinase 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  18. RefSeq (December 2008). "ALDOB aldolase, fructose-bisphosphate B [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  19. RefSeq (July 2008). "ALDOC aldolase, fructose-bisphosphate C [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  20. RefSeq (October 2015). "ALPL alkaline phosphatase, biomineralization associated [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  21. RefSeq (December 2013). "AMD1 adenosylmethionine decarboxylase 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  22. RefSeq (July 2008). "AMELX amelogenin X-linked [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  23. RefSeq (January 2015). "AMY2A amylase alpha 2A [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  24. RefSeq (June 2013). "AMY2B amylase alpha 2B [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  25. RefSeq (June 2013). "SLC25A5 solute carrier family 25 member 5 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  26. RefSeq (December 2014). "ALDOC aldolase, fructose-bisphosphate C [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  27. RefSeq (December 2019). "ANXA2 annexin A2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  28. RefSeq (July 2008). "AOC2 amine oxidase copper containing 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  29. RefSeq (July 2008). "APOA2 apolipoprotein A2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  30. RefSeq (December 2019). "APOB apolipoprotein B [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  31. RefSeq (September 2017). "APOC3 apolipoprotein C3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  32. RefSeq (August 2016). "AQP1 aquaporin 1 (Colton blood group) [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  33. RefSeq (October 2008). "AQP2 aquaporin 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  34. RefSeq (December 2015). "AQP3 aquaporin 3 (Gill blood group) [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  35. RefSeq (April 2014). "AREG amphiregulin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  36. RefSeq (September 2011). "ARG1 arginase 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  37. RefSeq (December 2011). "RND3 Rho family GTPase 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  38. RefSeq (April 2011). "ATF3 activating transcription factor 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  39. RefSeq (September 2011). "ATF4 activating transcription factor 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  40. RefSeq (March 2010). "ATP1B1 ATPase Na+/K+ transporting subunit beta 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  41. RefSeq (December 2014). "ATP1B2 ATPase Na+/K+ transporting subunit beta 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  42. RefSeq (July 2008). "ATP4B ATPase H+/K+ transporting subunit beta [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  43. 43.0 43.1 RefSeq (July 2008). "ATP5PB ATP synthase peripheral stalk-membrane subunit b [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  44. RefSeq (August 2015). "BCL6 BCL6 transcription repressor [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  45. RefSeq (June 2011). "BRDT bromodomain testis associated [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  46. RefSeq (September 2011). "ZFP36L1 ZFP36 ring finger protein like 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  47. RefSeq (October 2008). "BTG1 BTG anti-proliferation factor 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  48. RefSeq (November 2016). "CA1 carbonic anhydrase 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  49. RefSeq (June 2014). "CA2 carbonic anhydrase 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  50. RefSeq (July 2008). "CA4 carbonic anhydrase 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  51. RefSeq (March 2016). "CA8 carbonic anhydrase 8 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  52. RefSeq (June 2010). "CALB2 calbindin 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  53. RefSeq (May 2020). "CALR calreticulin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  54. RefSeq (August 2017). "CASR calcium sensing receptor [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  55. RefSeq (July 2008). "CD247 CD247 molecule [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  56. RefSeq (July 2008). "CDX1 caudal type homeobox 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  57. RefSeq (October 2013). "CEBPB CCAAT enhancer binding protein beta [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  58. RefSeq (November 2011). "CGA glycoprotein hormones, alpha polypeptide [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  59. RefSeq (July 2008). "CHAD chondroadherin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  60. RefSeq (September 2009). "CHI3L1 chitinase 3 like 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-09.
  61. RefSeq (July 2008). "CKB creatine kinase B [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  62. Alliance of Genome Resources (April 2022). "CIRBP cold inducible RNA binding protein [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  63. RefSeq (July 2008). "CKM creatine kinase, M-type [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  64. RefSeq (July 2008). "CKMT2 creatine kinase, mitochondrial 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  65. RefSeq (March 2012). "CLCN1 chloride voltage-gated channel 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  66. RefSeq (September 2009). "CLCNKB chloride voltage-gated channel Kb [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  67. RefSeq (May 2011). "CLU clusterin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  68. RefSeq (April 2015). "CMA1 chymase 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  69. RefSeq (February 2008). "COL1A1 collagen type I alpha 1 chain [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  70. RefSeq (February 2008). "COL1A2 collagen type I alpha 2 chain [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  71. RefSeq (February 2012). "CP ceruloplasmin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  72. RefSeq (December 2010). "CRABP2 cellular retinoic acid binding protein 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  73. RefSeq (November 2015). "CRH corticotropin releasing hormone [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  74. RefSeq (July 2008). "CRHBP corticotropin releasing hormone binding protein [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  75. RefSeq (August 2020). "CRP C-reactive protein [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  76. RefSeq (January 2019). "CRYAB crystallin alpha B [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  77. RefSeq (July 2008). "CRYBA1 crystallin beta A1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  78. RefSeq (July 2008). "CRYGS crystallin gamma S [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  79. RefSeq (May 2020). "CSF3 colony stimulating factor 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  80. RefSeq (November 2009). "CCN2 cellular communication network factor 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  81. RefSeq (January 2016). "CYP1A1 cytochrome P450 family 1 subfamily A member 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  82. RefSeq (July 2008). "CYP1A2 cytochrome P450 family 1 subfamily A member 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  83. RefSeq (July 2008). "CYP2A6 cytochrome P450 family 2 subfamily A member 6 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  84. RefSeq (July 2008). "CYP2A13 cytochrome P450 family 2 subfamily A member 13 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  85. RefSeq (July 2008). "CYP2E1 cytochrome P450 family 2 subfamily E member 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  86. RefSeq (August 2020). "CYP3A4 cytochrome P450 family 3 subfamily A member 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  87. RefSeq (August 2020). "CYP3A4 cytochrome P450 family 3 subfamily A member 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  88. Marcelo A. Nobrega, Ivan Ovcharenko, Veena Afzal, and Edward M. Rubin (October 2003). "Scanning human gene deserts for long-range enhancers". Science. 302 (5644): 413. doi:10.1126/science.1088328. PMID 14563999. Retrieved 2012-12-26.
  89. 89.0 89.1 HGNC (December 20, 2012). "DACH1 dachshund homolog 1 (Drosophila) [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2012-12-26.
  90. RefSeq (September 2017). "DDX5 DEAD-box helicase 5 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  91. RefSeq (July 2008). "NQO1 NAD(P)H quinone dehydrogenase 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  92. RefSeq (May 2016). "DIO3 iodothyronine deiodinase 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  93. RefSeq (July 2008). "DPT dermatopontin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  94. 94.0 94.1 Yutaka Suzuki, Tatsuhiko Tsunoda, Jun Sese, Hirotoshi Taira, Junko Mizushima-Sugano, Hiroko Hata, Toshio Ota, Takao Isogai, Toshihiro Tanaka, Yusuke Nakamura, Akira Suyama, Yoshiyuki Sakaki, Shinichi Morishita, Kousaku Okubo, and Sumio Sugano (11 April 2001). "Identification and Characterization of the Potential Promoter Regions of 1031 Kinds of Human Genes". Genome Research. 11 (5): 677-684. doi:10.1101/gr.164001.
  95. RefSeq (October 2008). "SLC26A3 solute carrier family 26 member 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  96. RefSeq (January 2016). "TSC22D3 TSC22 domain family member 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  97. 97.0 97.1 RefSeq (July 2008). "EPYC epiphycan [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-19.
  98. RefSeq (October 2015). "EDN1 endothelin 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  99. RefSeq (July 2008). "EEF1A1 eukaryotic translation elongation factor 1 alpha 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  100. Alliance of Genome Resources (April 2022). "EGR4 early growth response 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  101. RefSeq (August 2020). "FGB fibrinogen beta chain [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  102. RefSeq (June 2016). "NR5A2 nuclear receptor subfamily 5 group A member 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  103. RefSeq (July 2008). "GCG glucagon [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  104. RefSeq (December 2014). "GLUL glutamate-ammonia ligase [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  105. RefSeq (July 2008). "GNAT2 G protein subunit alpha transducin 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  106. OMIM (November 2010). "GP5 glycoprotein V platelet [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  107. Alliance of Genome Resources (April 2022). "GUCA2A guanylate cyclase activator 2A [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  108. RefSeq (November 2015). "GUCA2B guanylate cyclase activator 2B [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  109. RefSeq (August 2015). "H1-2 H1.2 linker histone, cluster member [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  110. RefSeq (August 2015). "H1-4 H1.4 linker histone, cluster member [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  111. RefSeq (August 2015). "H1-5 H1.5 linker histone, cluster member [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  112. RefSeq (August 2015). "H1-6 H1.6 linker histone, cluster member [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  113. RefSeq (November 2009). "HBZ hemoglobin subunit zeta [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  114. RefSeq (October 2009). "HMGCS2 3-hydroxy-3-methylglutaryl-CoA synthase 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  115. RefSeq (July 2008). "HES1 hes family bHLH transcription factor 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  116. RefSeq (June 2016). "HSD3B1 hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  117. RefSeq (October 2009). "HSD3B2 hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid delta-isomerase 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  118. Thomas W. Burke and James T. Kadonaga (November 15, 1997). "The downstream core promoter element, DPE, is conserved from Drosophila to humans and is recognized by TAFII60 of Drosophila". Genes & Development. 11 (22): 3020–31. doi:10.1101/gad.11.22.3020. PMC 316699. PMID 9367984.
  119. HGNC (February 3, 2013). "HSPA4 heat shock 70kDa protein 4 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2013-02-07.
  120. RefSeq (July 2020). "HSPA5 heat shock protein family A (Hsp70) member 5 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  121. RefSeq (January 2012). "HSP90AA1 heat shock protein 90 alpha family class A member 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  122. RefSeq (July 2011). "TNC tenascin C [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  123. RefSeq (July 2016). "IAPP islet amyloid polypeptide [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  124. RefSeq (September 2011). "CCN1 cellular communication network factor 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  125. RefSeq (August 2020). "IL4 interleukin 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  126. RefSeq (July 2008). "IL4 interleukin 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  127. RefSeq (July 2008). "IL13 interleukin 13 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  128. RefSeq (May 2020). "INS insulin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  129. RefSeq (November 2009). "ITPR1 inositol 1,4,5-trisphosphate receptor type 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  130. RefSeq (July 2008). "KRT1 keratin 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  131. RefSeq (July 2008). "KRT10 keratin 10 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  132. RefSeq (July 2008). "KRT12 keratin 12 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  133. RefSeq (August 2011). "LAMC2 laminin subunit gamma 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-10.
  134. RefSeq (June 2018). "LIFR LIF receptor subunit alpha [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  135. HGNC:3576 (June 6, 2024). "FADS3 fatty acid desaturase 3 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-07.
  136. RefSeq (July 2008). "LORICRIN loricrin cornified envelope precursor protein [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  137. RefSeq (July 2008). "MIP major intrinsic protein of lens fiber [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  138. RefSeq (February 2011). "MUC1 mucin 1, cell surface associated [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  139. RefSeq (July 2008). "MYL1 myosin light chain 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  140. RefSeq (July 2008). "MYOC myocilin [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  141. RefSeq (August 2017). "NPM1 nucleophosmin 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-20.
  142. RefSeq (October 2015). "NPPA natriuretic peptide A [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  143. RefSeq (July 2008). "OVGP1 oviductal glycoprotein 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  144. RefSeq (January 2011). "PRDX1 peroxiredoxin 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  145. RefSeq (July 2008). "PDC phosducin [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  146. 146.0 146.1 RefSeq (July 2008). "POU1F1 POU class 1 homeobox 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  147. RefSeq (February 2009). "PTGS2 prostaglandin-endoperoxide synthase 2 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  148. Tetsuya Kosaka, Atsuro Miyata, Hayato Ihara, Shuntaro Hara, Tamiko Sugimoto, Osamu Takeda, Ei-ichi Takahashi, Tadashi Tanabe (May 1994). "Characterization of the human gene (PTGS2) encoding prostaglandin‐endoperoxide synthase 2". European Journal of Biochemistry. 221 (3): 889–97. doi:10.1111/j.1432-1033.1994.tb18804.x. Retrieved 2012-12-26.
  149. RefSeq (July 2008). "RGS1 regulator of G protein signaling 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-10.
  150. RefSeq (August 2009). "RGS2 regulator of G protein signaling 2 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  151. RefSeq (October 2017). "RPE65 retinoid isomerohydrolase RPE65 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  152. RefSeq (July 2018). "RPS27 ribosomal protein S27 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  153. RefSeq (January 2016). "S100A8 S100 calcium binding protein A8 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  154. RefSeq (June 2016). "SLC2A5 solute carrier family 2 member 5 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  155. RefSeq (September 2011). "SLC9A1 solute carrier family 9 member A1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  156. RefSeq (October 2009). "SLC16A1 solute carrier family 16 member 1 [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  157. Alliance of Genome Resources (April 2022). "SPRR1A small proline rich protein 1A [ Homo sapiens ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-11.
  158. RefSeq (July 2008). "TFF2 trefoil factor 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  159. RefSeq (June 2013). "PTP4A1 protein tyrosine phosphatase 4A1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  160. RefSeq (August 2015). "H2AC11 H2A clustered histone 11 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  161. Alliance of Genome Resources (April 2022). "PCLAF PCNA clamp associated factor [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  162. RefSeq (September 2015). "POSTN periostin [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  163. HGNC:10227 (May 13, 2024). "RNU6-1 RNA, U6 small nuclear 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2024-06-08.
  164. Y Ohshima, N Okada, T Tani, Y Itoh, and M Itoh (10 October 1981). "Nucleotide sequences of mouse genomic loci including a gene or pseudogene for U6 (4.8S) nuclear RNA". Nucleic Acids Research. 9 (19): 5145–5158. doi:10.1093/nar/9.19.5145. PMID 6171774. Retrieved 2024-06-08.
  165. RefSeq (September 2014). "AZIN1 antizyme inhibitor 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  166. RefSeq (July 2008). "SLURP1 secreted LY6/PLAUR domain containing 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  167. Alliance of Genome Resources (April 2022). "SLAMF7 SLAM family member 7 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-24.
  168. Alliance of Genome Resources (April 2022). "NPVF neuropeptide VF precursor [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  169. RefSeq (June 2012). "MARCKSL1 MARCKS like 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  170. RefSeq (January 2012). "E2F8 E2F transcription factor 8 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-24.
  171. Alliance of Genome Resources (April 2022). "MYCT1 MYC target 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  172. OMIM (March 2008). "LY6G6C lymphocyte antigen 6 family member G6C [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  173. RefSeq (July 2008). "OR51E2 olfactory receptor family 51 subfamily E member 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  174. Alliance of Genome Resources (April 2022). "LBH LBH regulator of WNT signaling pathway [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  175. Alliance of Genome Resources (April 2022). "C11orf68 chromosome 11 open reading frame 68 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  176. Alliance of Genome Resources (April 2022). "REG4 regenerating family member 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  177. RefSeq (December 2009). "ZIC4 Zic family member 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-20.
  178. Alliance of Genome Resources (April 2022). "IQCG IQ motif containing G [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  179. Alliance of Genome Resources (April 2022). "LZIC leucine zipper and CTNNBIP1 domain containing [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  180. RefSeq (June 2012). "C15orf48 chromosome 15 open reading frame 48 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  181. Alliance of Genome Resources (April 2022). "TUBA1C tubulin alpha 1c [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  182. RefSeq (May 2010). "SLC7A3 solute carrier family 7 member 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  183. RefSeq (August 2015). "H2AC12 H2A clustered histone 12 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-23.
  184. RefSeq (April 2022). "OTOP2 otopetrin 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-22.
  185. Alliance of Genome Resources (April 2022). "C1QTNF3 C1q and TNF related 3 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-22.
  186. Alliance of Genome Resources (April 2022). "DDIT4L DNA damage inducible transcript 4 like [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-22.
  187. RefSeq (September 2014). "LEAP2 liver enriched antimicrobial peptide 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-22.
  188. RefSeq (March 2010). "Scgb3a2 secretoglobin, family 3A, member 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-21.
  189. Alliance of Genome Resources (April 2022). "Scgb3a2 secretoglobin, family 3A, member 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-21.
  190. RefSeq (April 2022). "LRRC25 leucine rich repeat containing 25 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-21.
  191. RefSeq (January 2012). "HSPB6 heat shock protein family B (small) member 6 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-21.
  192. RefSeq (November 2014). "REG3G regenerating family member 3 gamma [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  193. RefSeq (December 2012). "KLHL40 kelch like family member 40 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-21.
  194. OMIM (March 2008). "NRG4 neuregulin 4 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  195. RefSeq (July 2009). "KRT25 keratin 25 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  196. Alliance of Genome Resources (April 2020). "RASGEF1B RasGEF domain family member 1B [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  197. RefSeq (December 2015). "GKN2 gastrokine 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  198. RefSeq (July 2008). "ANKRD23 ankyrin repeat domain 23 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  199. RefSeq (July 2009). "KRT27 keratin 27 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.
  200. Alliance of Genome Resources (April 2022). "MUSTN1 musculoskeletal, embryonic nuclear protein 1 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-21.
  201. RefSeq (August 2017). "CCR2 C-C motif chemokine receptor 2 [ Homo sapiens ]". Bethsda, Maryland, USA: ncbi.nlm.nih. Retrieved 2024-06-18.

External links