CArG box gene transcriptions
Editor-In-Chief: Henry A. Hoff
CArG boxes are present in the promoters of smooth muscle cell genes. Template:TOCright
Boxes
Def. "a repeating sequence of nucleotides that forms a transcription or a regulatory signal"[1] is called a box.
Consensus sequences
"CArG box [CC(A/T)6GG] DNA [consensus] sequences present within the promoters of SMC genes play a pivotal role in controlling their transcription".[2]
The consensus sequence of CC(A/T)6GG is confirmed.[3]
"MADS-box proteins bind to a consensus sequence, the CArG box, that has the core motif CC(A/T)6GG (15)."[4]
"Of the [Flowering Locus C] FLC binding sites, 69% contained at least one CArG-box motif with the core consensus sequence CCAAAAAT(G/A)G and an AAA extension at the 3′ end [...]."[4]
Three "other MADS-box flowering-time regulators, SOC1, SVP, and AGAMOUS-LIKE 24 (AGL24), bind to two different CArG-box motifs at 502 bp (CTAAATATGG) and 287 bp (CAATAATTGG) upstream of the translation start in the SEP3 gene (24), consistent with different specificities for the different MADS-box proteins."[4] These together with the core motif CC(A/T)6GG (15) suggest a more general CArG-box motif of (C(C/A/T)(A/T)6(A/G)G).
Smooth muscle cells
"Serum response factor (SRF) controls [ smooth muscle cell ] SMC gene transcription via binding to CArG box DNA sequences found within genes that exhibit SMC-restricted expression."[2]
"SMC genes examined in this study display SMC-specific histone modifications at the 5′-CArG boxes."[2]
"The SRF-CArG association is required for transcriptional activation of SMC genes [...] the SMC genes examined in this study display SMC-specific histone modifications at the 5′-CArG boxes. [...] enrichment of H4 and H3 acetylation [...] were relatively low from positions –2,800 to –1,600 in the 5′ region. However, at position –1,600 to –1,200, there was a sharp rise in these modifications, which was increased even further at +400 in the coding region. We observed similar patterns for H3K4dMe and H3 Lys79 di-methylation [...]. SRF, TFIID, and RNA polymerase II displayed enrichments that were consistent with the positions of the CArG boxes, TATA box, and coding region, respectively".[2]
The CArG boxes occur between -400 and -200 nts, between the E boxes and the TC elements.[2]
"Functionally important CArG boxes have been identified in transcriptional regulatory elements controlling expression of sets of myogenic contractile and cytoskeletal proteins (reviewed elsewhere8,25). Of note, in cardiac and skeletal muscle cells, functionally important CArG boxes have been identified in transcriptional regulatory element controlling a relatively limited subset of myofibrillar proteins.26"[5]
"In the nucleus, MRTFs physically associate with SRF, facilitating the binding of SRF to single or dual CArG boxes, activating transcription of genes encoding cytoskeletal and myogenic proteins [...].39,40,53,55,56"[5]
"The binding of SRF to SMC CArG boxes is associated with specific alterations in chromatin structure including the methylation and acetylation of histones.76,79"[5]
"Both PDGF-BB and KLF-4 inhibit SRF binding to CArG boxes downregulating transcription of SMC contractile genes.92"[5]
Gene transcriptions
"SMC-restricted binding of SRF to murine SMC gene CArG box chromatin is associated with patterns of posttranslational histone modifications within this chromatin that are specific to the SMC lineage in culture and in vivo, including methylation and acetylation to histone H3 and H4 residues."[2]
"Ca2+/calmodulin-dependent protein kinase IV activates cysteine-rich protein 1 through adjacent CRE and CArG elements."[6]
"Smooth muscle-specific transcription is controlled by a multitude of transcriptional regulators that cooperate to drive expression in a temporospatial manner. Previous analysis of the cysteine-rich protein 1 (CRP1/Csrp) gene revealed an intronic enhancer that is sufficient for expression in arterial smooth muscle cells and requires a serum response factor-binding CArG element for activity. The presence of a CArG box in smooth muscle regulatory regions is practically invariant; however, it stands to reason that additional elements contribute to the modulation of transcription in concert with the CArG."[6]
A "conserved cAMP response element (CRE) [...] binds the cAMP response element-binding protein (CREB) and is activated by Ca2+/calmodulin-dependent protein kinase IV (CaMKIV), but not by CaMKII."[6]
"CaMKIV stimulates CRP1 expression not only through the CRE but also through the CArG box."[6]
A "conserved cyclic AMP-response element (CRE) within the CRP1 gene is critical for enhancer activity. The CRE is an 8-bp motif with the consensus sequence TGACGTCA (34). [...] CRE serves as a transcriptional conduit for cyclic AMP-stimulated processes, but it also responds to a variety of other stimuli, including intracellular Ca2+ through the activation of Ca2+/calmodulin-dependent protein kinases (30, 31, 49). The primary factors that bind CRE are the cAMP element-binding protein (CREB) and the related proteins activating transcription factor (ATF)-1 and CRE modulator (CREM). The activity of CREB on the CRE is dependent largely on phosphorylation of a Ser133 residue. This phosphorylation event transforms CREB into a potent transcriptional activator and facilitates interactions with additional regulators, namely, CREB-binding protein (CBP) (31, 49). With respect to function, CREB has been implicated in governing a host of cellular processes and adaptive responses, including differentiation, metabolic changes, cell survival, and proliferation (3, 18, 19, 28, 37, 44, 53)."[6]
The "utilization of two conserved binding sites within the CRP1-5.0 enhancer: a newly identified CRE and the CArG box [...] might serve to amplify a response. Given that these two elements are separated by only 14 bp, CREB and SRF could cooperate by assisting in the recruitment of CBP, both of which bind to CBP’s NH2 terminus."[6]
CArG box samplings
Testing the more general 3'-C(C/A/T)(A/T)6(A/G)G-5':
- Negative strand, negative direction (from ZSCAN22 to A1BG) is SuccessablesCArG--.bas, looking for C(C/A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G)G: 0.
- Negative strand, positive direction (from ZNF497 to A1BG) is SuccessablesCArG-+.bas, looking for C(C/A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G)G: 0.
- Positive strand, negative direction is SuccessablesCArG+-.bas, looking for C(C/A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G)G: 2, CATTAAAAGG at 3441, CAAAAAAAAG at 1399.
- positive strand, positive direction is SuccessablesCArG++.bas, looking for C(C/A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G)G: 0.
- inverse complement, negative strand, negative direction is SuccessablesCArGci--.bas, looking for C(C/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G/T)G: 0
- inverse complement, negative strand, positive direction is SuccessablesCArGci-+.bas, looking for C(C/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G/T)G: 0.
- inverse complement, positive strand, negative direction is SuccessablesCArGci+-.bas, looking for C(C/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G/T)G: 0.
- inverse complement, positive strand, positive direction is SuccessablesCArGci++.bas, looking for C(C/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/T)(A/G/T)G: 0.
CArG (4560-2846) UTRs
- Positive strand, negative direction: CATTAAAAGG at 3441.
CArG negative direction (2596-1) distal promoters
- Positive strand, negative direction: CAAAAAAAAG at 1399.
CArG box random dataset samplings
- CArGr0: 1, CTTATATTAG at 2375.
- CArGr1: 0.
- CArGr2: 3, CAATATAAAG at 3857, CTAAATTTAG at 2521, CTAAAAATGG at 187.
- CArGr3: 3, CATATTATGG at 3892, CTTTTTTTGG at 1916, CAAAAAAAAG at 552.
- CArGr4: 2, CCTATTAAGG at 2936, CCTTTTTAAG at 1736.
- CArGr5: 1, CCTTATAAGG at 3585.
- CArGr6: 1, CAAAATTTGG at 4527.
- CArGr7: 2, CATTATTAAG at 1655, CAAATATAAG at 1414.
- CArGr8: 3, CATATTAAAG at 881, CTATAAAAAG at 709, CCTATTTTAG at 168.
- CArGr9: 3, CTTATTAAGG at 3505, CTTTATTTAG at 969, CTAAAAATGG at 332.
- CArGr0ci: 0.
- CArGr1ci: 2, CCAATTTTTG at 791, CCATTATATG at 340.
- CArGr2ci: 2, CTAAATTTAG at 2521, CTAAAAATGG at 187.
- CArGr3ci: 1, CTTTTTTTGG at 1916.
- CArGr4ci: 4, CCTATTAAGG at 2936, CCTTTTTAAG at 1736, CTTTTTATTG at 1409, CTTTTTATTG at 1350.
- CArGr5ci: 2, CTATTTTTTG at 3707, CCTTATAAGG at 3585.
- CArGr6ci: 1, CCATAAAATG at 3698.
- CArGr7ci: 0.
- CArGr8ci: 2, CTATAAAAAG at 709, CCTATTTTAG at 168.
- CArGr9ci: 5, CTTATTAAGG at 3505, CTATAAAATG at 2624, CTTTATTTAG at 969, CCTTATTTTG at 468, CTAAAAATGG at 332.
CArGr arbitrary (evens) (4560-2846) UTRs
- CArGr2: CAATATAAAG at 3857.
- CArGr4: CCTATTAAGG at 2936.
- CArGr6: CAAAATTTGG at 4527.
- CArGr6ci: CCATAAAATG at 3698.
CArGr alternate (odds) (4560-2846) UTRs
- CArGr3: CATATTATGG at 3892.
- CArGr5: CCTTATAAGG at 3585.
- CArGr9: CTTATTAAGG at 3505.
- CArGr5ci: CTATTTTTTG at 3707.
CArGr alternate negative direction (odds) (2811-2596) proximal promoters
- CArGr9ci: CTATAAAATG at 2624.
CArGr arbitrary negative direction (evens) (2596-1) distal promoters
- CArGr0: CTTATATTAG at 2375.
- CArGr2: CTAAATTTAG at 2521, CTAAAAATGG at 187.
- CArGr4: CCTTTTTAAG at 1736.
- CArGr8: CATATTAAAG at 881, CTATAAAAAG at 709, CCTATTTTAG at 168.
- CArGr4ci: CTTTTTATTG at 1409, CTTTTTATTG at 1350.
CArGr alternate negative direction (odds) (2596-1) distal promoters
- CArGr3: CTTTTTTTGG at 1916, CAAAAAAAAG at 552.
- CArGr7: CATTATTAAG at 1655, CAAATATAAG at 1414.
- CArGr9: CTTTATTTAG at 969, CTAAAAATGG at 332.
- CArGr1ci: CCAATTTTTG at 791, CCATTATATG at 340.
- CArGr5ci: CCTTATAAGG at 3585.
- CArGr9ci: CCTTATTTTG at 468.
CArGr arbitrary positive direction (odds) (4050-1) distal promoters
- CArGr3: CATATTATGG at 3892, CTTTTTTTGG at 1916, CAAAAAAAAG at 552.
- CArGr5: CCTTATAAGG at 3585.
- CArGr7: CATTATTAAG at 1655, CAAATATAAG at 1414.
- CArGr9: CTTATTAAGG at 3505, CTTTATTTAG at 969, CTAAAAATGG at 332.
- CArGr1ci: CCAATTTTTG at 791, CCATTATATG at 340.
- CArGr5ci: CTATTTTTTG at 3707.
- CArGr9ci: CTATAAAATG at 2624, CCTTATTTTG at 468.
CArGr alternate positive direction (evens) (4050-1) distal promoters
- CArGr0: CTTATATTAG at 2375.
- CArGr2: CAATATAAAG at 3857, CTAAATTTAG at 2521, CTAAAAATGG at 187.
- CArGr4: CCTATTAAGG at 2936, CCTTTTTAAG at 1736.
- CArGr8: CATATTAAAG at 881, CTATAAAAAG at 709, CCTATTTTAG at 168.
- CArGr4ci: CTTTTTATTG at 1409, CTTTTTATTG at 1350.
- CArGr6ci: CCATAAAATG at 3698.
Actins
"Positively acting, rate-limiting regulatory factors that influence tissue-specific expression of the human cardiac α-actin gene in a mouse muscle cell line are shown by in vivo competition and gel mobility-shift assays to bind to upstream regions of its promoter but to neither vector DNA nor a β-globin promoter. Although the two binding regions are distinctly separated, each corresponds to a cis region required for muscle-specific transcriptional stimulation, and each contains a core CC(A+T-rich)6GG sequence (designated CArG box), which is found in the promoter regions of several muscle-associated genes. Each site has an apparently different binding affinity for trans-acting factors, which may explain the different transcriptional stimulation activities of the two cis regions. [The] two CArG box regions are responsible for muscle-specific transcriptional activity of the cardiac α-actin gene through a mechanism that involves their binding of a positive trans-acting factor in muscle cells."[7]
"SRF binds to an A/T-rich sequence (CCWWWWWWGG) that has been designated as the CArG box.10–12 CArG boxes were originally identified in transcriptional regulatory elements controlling expression of a set of growth- or serum-responsive genes including c-fos and egr-1.13,14 Subsequently, CArG boxes were identified in transcriptional regulatory elements controlling expression of a subset of genes encoding myogenic contractile and cytoskeletal proteins including α-cardiac actin, smooth muscle (SM)-α-actin, α-skeletal actin, and SM22α.15–19"[5]
Early growth responses
"Exposure of human HL-525 cells to x-rays was associated with increases in EGR1 mRNA levels. Nuclear run-on assays showed that this effect is related at least in part to activation of EGR1 gene transcription. Sequences responsive to ionizing radiation-induced signals were determined by deletion analysis of the EGR1 promoter. The results demonstrate that x-ray inducibility of the EGR1 gene is conferred by a region containing six serum response or CC(A+T-rich)6GG (CArG) motifs. Further analysis confirmed that the region encompassing the three distal or upstream CArG elements is functional in the x-ray response. Moreover, this region conferred x-ray inducibility to a minimal thymidine kinase gene promoter. Taken together, these results indicate that ionizing radiation induces EGR1 transcription through CArG elements."[8]
Myocardins
The "promyogenic SRF [SRF GeneID: 6722] coactivator myocardin [MYOCD GeneID: 93649] increased SRF association with methylated histones and CArG box chromatin during activation of SMC gene expression. [...] myocardin/SRF complexes physically interact with H3K4dMe and that the interaction of SRF with CArG box chromatin and H3K4dMe is sensitive to expression levels of myocardin."[2]
Kruppel-like factor 4
The "myogenic repressor Kruppel-like factor 4 recruited histone H4 deacetylase activity to SMC genes and blocked SRF association with methylated histones and CArG box chromatin during repression of SMC gene expression. [...] deacetylation of histone H4 coupled with loss of SRF binding during suppression of SMC differentiation in response to vascular injury. [...] KLF4 can bind to evolutionarily conserved TGF-β [control element] (TCE) DNA sequences adjacent to CArG boxes of SM gene promoters"[2]
Epigenomes
"SMC-selective epigenetic control of SRF binding to chromatin plays a key role in regulation of SMC gene expression in response to pathophysiological stimuli in vivo."[2]
Histone modifications in SMCs include H3K4dMe, H3 Lys79 di-methylation, H3 Lys9 acetylation, H4Ac, and SRF binding.[2]
MADS boxes
"RIN [Ripening Inhibitor] binds to DNA sequences known as the CA/T-rich-G (CArG) box, which is the general target of MADS box proteins (Ito et al., 2008)."[9]
Human genes
An "interaction between serum response factor (SRF)1 and the CArG box has been identified as a core machinery in the transcription of several muscle-specific genes, including the skeletal 𝛂-actin (8), caldesmon (9), cardiac 𝛂-actin (10), 𝛂1 integrin (11), SM22𝛂 (12), smooth muscle myosin heavy chain (13), smooth muscle 𝛂-actin (14), calponin (15), atrial natriuretic factor (16), and 𝛃-tropomyosin (17) genes."[10]
Actin genes
Gene ID: 58 is ACTA1 actin alpha 1, skeletal muscle. "The product encoded by this gene belongs to the actin family of proteins, which are highly conserved proteins that play a role in cell motility, structure and integrity. Alpha, beta and gamma actin isoforms have been identified, with alpha actins being a major constituent of the contractile apparatus, while beta and gamma actins are involved in the regulation of cell motility. This actin is an alpha actin that is found in skeletal muscle. Mutations in this gene cause a variety of myopathies, including nemaline myopathy, congenital myopathy with excess of thin myofilaments, congenital myopathy with cores, and congenital myopathy with fiber-type disproportion, diseases that lead to muscle fiber defects with manifestations such as hypotonia."[11]
Gene ID: 59 is ACTA2 actin alpha 2, smooth muscle. "This gene encodes one of six different actin proteins. Actins are highly conserved proteins that are involved in cell motility, structure, integrity, and intercellular signaling. The encoded protein is a smooth muscle actin that is involved in vascular contractility and blood pressure homeostasis. Mutations in this gene cause a variety of vascular diseases, such as thoracic aortic disease, coronary artery disease, stroke, and Moyamoya disease, as well as multisystemic smooth muscle dysfunction syndrome."[12]
- NP_001135417.1 actin, aortic smooth muscle. Transcript Variant: This variant (1) represents the longest transcript. Variants 1 and 2 encode the same protein. Variants 1-3 encode the same protein.[12]
- NP_001307784.1 actin, aortic smooth muscle. Transcript Variant: This variant (3) differs in the 5' UTR compared to variant 1. Variants 1-3 encode the same protein.[12]
- NP_001604.1 actin, aortic smooth muscle. Transcript Variant: This variant (2) differs in the 5' UTR compared to variant 1. Variants 1-3 encode the same protein.[12]
Gene ID: 70 is ACTC1 actin alpha cardiac muscle 1. "Actins are highly conserved proteins that are involved in various types of cell motility. Polymerization of globular actin (G-actin) leads to a structural filament (F-actin) in the form of a two-stranded helix. Each actin can bind to four others. The protein encoded by this gene belongs to the actin family which is comprised of three main groups of actin isoforms, alpha, beta, and gamma. The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus. Defects in this gene have been associated with idiopathic dilated cardiomyopathy (IDC) and familial hypertrophic cardiomyopathy (FHC)."[13]
Gene ID: 800 is CALD1 caldesmon 1. "This gene encodes a calmodulin- and actin-binding protein that plays an essential role in the regulation of smooth muscle and nonmuscle contraction. The conserved domain of this protein possesses the binding activities to Ca(2+)-calmodulin, actin, tropomyosin, myosin, and phospholipids. This protein is a potent inhibitor of the actin-tropomyosin activated myosin MgATPase, and serves as a mediating factor for Ca(2+)-dependent inhibition of smooth muscle contraction. Alternative splicing of this gene results in multiple transcript variants encoding distinct isoforms."[14]
- NP_004333.1 caldesmon isoform 2: Transcript Variant: This variant (2) uses an alternate in-frame splice site and lacks an alternate in-frame exon in the central coding region, compared to variant 1. It is mainly expressed in non-muscle tissues or cells. The resulting isoform (2, also known as WI-38 l-CaD II) lacks an internal region, compared to isoform 1. pfam02029 Location:267 → 525, Caldesmon; Caldesmon.[14]
- NP_149129.2 caldesmon isoform 1: Transcript Variant: This variant (1) encodes the longest isoform (1, also known as aorta h-CaD). It is predominantly expressed in smooth muscle tissues. pfam02029, Location:517 → 780, Caldesmon; Caldesmon.[14]
- NP_149130.1 caldesmon isoform 4: Transcript Variant: This variant (4) differs in the 5' UTR and 5' coding region, and uses an alternate in-frame splice site in the central coding region, compared to variant 1. It is mainly expressed in non-muscle tissues or cells. The resulting isoform (4, also known as HeLa l-CaD I) contains a distinct N-terminus and is shorter than isoform 1. pfam02029 Location:282 → 545, Caldesmon; Caldesmon.[14]
- NP_149131.1 caldesmon isoform 5: Transcript Variant: This variant (5) differs in the 5' UTR and 5' coding region, and uses an alternate in-frame splice site and lacks an alternate in-frame exon in the central coding region, compared to variant 1. It is mainly expressed in non-muscle tissues or cells. The resulting isoform (5, also known as HeLa l-CaD II) has a distinct N-terminus and is shorter than isoform 1. pfam02029 Location:256 → 519, Caldesmon; Caldesmon.[14]
- NP_149347.2 caldesmon isoform 3: Transcript Variant: This variant (3) uses two alternate in-frame splice sites in the central coding region, compared to variant 1. It is mainly expressed in non-muscle tissues or cells. The resulting isoform (3, also known as WI-38 l-CaD I) is shorter than isoform 1. pfam02029 Location:288 → 550, Caldesmon; Caldesmon.[14]
- XP_016868139.1 caldesmon isoform X1: pfam02029 Location:517 → 779, Caldesmon; Caldesmon.[14]
- XP_024302710.1 caldesmon isoform X2: pfam02029 Location:293 → 551, Caldesmon; Caldesmon.[14]
- XP_024302711.1 caldesmon isoform X4: pfam02029 Location:267 → 524, Caldesmon; Caldesmon.[14]
- XP_016868141.1 caldesmon isoform X3: pfam02029 Location:267 → 525, Caldesmon; Caldesmon.[14]
- XP_016868143.1 caldesmon isoform X5.[14]
- XR_002956488.1 RNA Sequence.[14]
- XR_001744877.2 RNA Sequence.[14]
- XR_002956489.1 RNA Sequence.[14]
- XR_002956490.1 RNA Sequence.[14]
- XR_001744880.2 RNA Sequence.[14]
- XR_927535.2 RNA Sequence.[14]
- XR_927541.2 RNA Sequence.[14]
- XR_927537.3 RNA Sequence.[14]
- XR_001744879.2 RNA Sequence.[14]
- XR_927542.3 RNA Sequence.[14]
- XR_001744881.2 RNA Sequence.[14]
Gene ID: 6876 is TAGLN transgelin aka SM22; SMCC; TAGLN1; WS3-10; SM22-alpha. "This gene encodes a shape change and transformation sensitive actin-binding protein which belongs to the calponin family. It is ubiquitously expressed in vascular and visceral smooth muscle, and is an early marker of smooth muscle differentiation. The encoded protein is thought to be involved in calcium-independent smooth muscle contraction. It acts as a tumor suppressor, and the loss of its expression is an early event in cell transformation and the development of some tumors, coinciding with cellular plasticity. The encoded protein has a domain architecture consisting of an N-terminal calponin homology (CH) domain and a C-terminal calponin-like (CLIK) domain. Mice with a knockout of the orthologous gene are viable and fertile but their vascular smooth muscle cells exhibit alterations in the distribution of the actin filament and changes in cytoskeletal organization."[15]
- NP_001001522.1 transgelin. Transcript Variant: This variant (1) represents the longer transcript. Variants 1 and 2 both encode the same protein. cd00014 Location:25 → 137, CH; Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav). pfam00402 Location:175 → 198 Calponin; Calponin family repeat.[15]
- NP_003177.2 transgelin. Transcript Variant: This variant (2) differs in the 5' UTR compared to variant 1. Variants 1 and 2 both encode the same protein. cd00014 Location:25 → 137, CH; Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav). pfam00402 Location:175 → 198 Calponin; Calponin family repeat.[15]
Atrial natriuretic factor genes
Gene ID: 4878 is NPPA natriuretic peptide A. "The protein encoded by this gene belongs to the natriuretic peptide family. Natriuretic peptides are implicated in the control of extracellular fluid volume and electrolyte homeostasis. This protein is synthesized as a large precursor (containing a signal peptide), which is processed to release a peptide from the N-terminus with similarity to vasoactive peptide, cardiodilatin, and another peptide from the C-terminus with natriuretic-diuretic activity. Mutations in this gene have been associated with atrial fibrillation familial type 6. This gene is located adjacent to another member of the natriuretic family of peptides on chromosome 1."[16]
Gene ID: 4879 is NPPB natriuretic peptide B. "This gene is a member of the natriuretic peptide family and encodes a secreted protein which functions as a cardiac hormone. The protein undergoes two cleavage events, one within the cell and a second after secretion into the blood. The protein's biological actions include natriuresis, diuresis, vasorelaxation, inhibition of renin and aldosterone secretion, and a key role in cardiovascular homeostasis. A high concentration of this protein in the bloodstream is indicative of heart failure. The protein also acts as an antimicrobial peptide with antibacterial and antifungal activity. Mutations in this gene have been associated with postmenopausal osteoporosis."[17]
Gene ID: 4880 is NPPC natriuretic peptide C. "This gene encodes a preproprotein that is proteolytically processed to generate multiple protein products. These products include the cardiac natriuretic peptides CNP-53, CNP-29 and CNP-22, which belong to the natriuretic family of peptides. The encoded peptides exhibit vasorelaxation activity in laboratory animals and elevated levels of CNP-22 have been observed in the plasma of chronic heart failure patients."[18]
Calponin genes
Gene ID: 1264 is CNN1 calponin 1.[19]
- NP_001290.2 calponin-1 isoform 1. Transcript Variant: This variant (1) represents the longer isoform (1).[19]
- NP_001295270.1 calponin-1 isoform 2. Transcript Variant: This variant (2) differs in the 5' UTR and uses an alternate exon in the 5' coding region, which results in use of a downstream start codon compared to variant 1. It encodes isoform 2, which has a shorter N-terminus than isoform 1. Variants 2 and 3 encode the same isoform.[19]
- NP_001295271.1 calponin-1 isoform 2. Transcript Variant: This variant (3) differs in the 5' UTR and uses an alternate exon in the 5' coding region, which results in use of a downstream start codon compared to variant 1. It encodes isoform 2, which has a shorter N-terminus than isoform 1. Variants 2 and 3 encode the same isoform.[19]
Gene ID: 1265 is CNN2 calponin 2.[20] "The protein encoded by this gene, which can bind actin, calmodulin, troponin C, and tropomyosin, may function in the structural organization of actin filaments. The encoded protein could play a role in smooth muscle contraction and cell adhesion. Several pseudogenes of this gene have been identified, and are present on chromosomes 1, 2, 3, 6, 9, 11, 13, 15, 16, 21 and 22. Alternative splicing results in multiple transcript variants encoding different isoforms."[20]
- NP_001290428.1 calponin-2 isoform c. Transcript Variant: This variant (3) uses two alternate in-frame splice sites in the central coding region, compared to variant 4. The encoded isoform (c) is shorter than isoform d.[20]
- NP_001290430.1 calponin-2 isoform d. Transcript Variant: This variant (4) represents the longest transcript and encodes the longest isoform (d).[20]
- NP_004359.1 calponin-2 isoform a. Transcript Variant: This variant (1) uses an alternate in-frame splice site in the central coding region, compared to variant 4. The encoded isoform (a) is shorter than isoform d.[20]
- NP_958434.1 calponin-2 isoform b. Transcript Variant: This variant (2) lacks an alternate in-frame exon in the 3' coding region, compared to variant 4. The encoded isoform (b) is shorter than isoform d.[20]
Gene ID: 1266 is CNN3 calponin 3, acidic.[21] "This gene encodes a protein with a markedly acidic C terminus; the basic N-terminus is highly homologous to the N-terminus of a related gene, CNN1. Members of the CNN gene family all contain similar tandemly repeated motifs. This encoded protein is associated with the cytoskeleton but is not involved in contraction."[21]
- NP_001272984.1 calponin-3 isoform 2. Transcript Variant: This variant (2) lacks an in-frame exon in the central coding region compared to variant 1. The encoded isoform (2) is shorter than isoform 1.[21]
- NP_001272985.1 calponin-3 isoform 3. Transcript Variant: This variant (3) differs in the 5' UTR and lacks a portion of the 5' coding region compared to variant 1. These differences cause translation initiation at a downstream start codon compared to variant 1. The encoded isoform (3) has a shorter N-terminus compared to isoform 1.[21]
- NP_001830.1 calponin-3 isoform 1. Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1).[21]
Fos genes
Gene ID: 2353 is FOS Fos proto-oncogene, AP-1 transcription factor subunit. "The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation. In some cases, expression of the FOS gene has also been associated with apoptotic cell death."[22] "Serum response factor and the (CC(A/T)6GG) (CArG) box interact to promote the transcription of c-fos and muscle genes".[10]
- NP_005243.1 proto-oncogene c-Fos, cd14721 Location:147 → 200, bZIP_Fos; Basic leucine zipper (bZIP) domain of the oncogene Fos (Fos): a DNA-binding and dimerization domain.[22]
Integrin genes
Gene ID: 3672 is ITGA1 integrin subunit alpha 1, aka VLA1; CD49a. "This gene encodes the alpha 1 subunit of integrin receptors. This protein heterodimerizes with the beta 1 subunit to form a cell-surface receptor for collagen and laminin. The heterodimeric receptor is involved in cell-cell adhesion and may play a role in inflammation and fibrosis. The alpha 1 subunit contains an inserted (I) von Willebrand factor type I domain which is thought to be involved in collagen binding."[23]
- NP_852478.1 integrin alpha-1 precursor, smart00191 Location:568 → 621, Int_alpha; Integrin alpha (beta-propellor repeats), cd01469 Location:171 → 351, vWA_integrins_alpha_subunit; Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote cell survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins. The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions. pfam08441 Location:664 → 1056, Integrin_alpha2; Integrin alpha."[23]
MADS box genes
Gene ID: 4205 is MEF2A myocyte enhancer factor 2A. "The protein encoded by this gene is a DNA-binding transcription factor that activates many muscle-specific, growth factor-induced, and stress-induced genes. The encoded protein can act as a homodimer or as a heterodimer and is involved in several cellular processes, including muscle development, neuronal differentiation, cell growth control, and apoptosis. Defects in this gene could be a cause of autosomal dominant coronary artery disease 1 with myocardial infarction (ADCAD1). Several transcript variants encoding different isoforms have been found for this gene."[24]
- NP_001124398.1 myocyte-specific enhancer factor 2A isoform 2. Transcript Variant: This variant (2) lacks an in-frame coding exon and a 5' non-coding exon, compared to transcript variant 6. These differences result in a shorter isoform (2), compared to isoform 5. Variants 2 and 5 both encode isoform 2. MEF2 (myocyte enhancer factor 2)-like/Type II subfamily of MADS ( MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptional regulators. Binds DNA and exists as hetero and homo-dimers. Differs from SRF-like/Type I subgroup mainly in position of the alpha helix responsible for the dimerization interface. Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001124399.1 myocyte-specific enhancer factor 2A isoform 3. Transcript Variant: This variant (3) lacks an in-frame 5' coding exon and a 5' non-coding exon, compared to transcript variant 6. These differences result in a shorter isoform (3), compared to isoform 5. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001124400.1 myocyte-specific enhancer factor 2A isoform 4. Transcript Variant: This variant (4) lacks multiple, in-frame coding exons, compared to transcript variant 6. These differences result in a shorter isoform (4), compared to isoform 5. The 5' UTR of this transcript variant is undefined. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001165365.1 myocyte-specific enhancer factor 2A isoform 2. Transcript Variant: This variant (5) lacks an in-frame coding exon and contains an additional 5' non-coding exon, compared to transcript variant 6. These differences result in a shorter isoform (2), compared to isoform 5. Variants 2 and 5 both encode isoform 2. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001306135.1 myocyte-specific enhancer factor 2A isoform 5. Transcript Variant: This variant (6) encodes the longest isoform (5). MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001339543.1 myocyte-specific enhancer factor 2A isoform 2. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001339544.1 myocyte-specific enhancer factor 2A isoform 2. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001339545.1 myocyte-specific enhancer factor 2A isoform 5. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001339546.1 myocyte-specific enhancer factor 2A isoform 1. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001339547.1 myocyte-specific enhancer factor 2A isoform 6. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352130.1 myocyte-specific enhancer factor 2A isoform 7. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352131.1 myocyte-specific enhancer factor 2A isoform 7. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352132.1 myocyte-specific enhancer factor 2A isoform 8. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352133.1 myocyte-specific enhancer factor 2A isoform 8. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352134.1 myocyte-specific enhancer factor 2A isoform 5. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352135.1 myocyte-specific enhancer factor 2A isoform 6. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352136.1 myocyte-specific enhancer factor 2A isoform 1. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352137.1 myocyte-specific enhancer factor 2A isoform 2. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352138.1 myocyte-specific enhancer factor 2A isoform 3. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352139.1 myocyte-specific enhancer factor 2A isoform 10. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_001352140.1 myocyte-specific enhancer factor 2A isoform 11. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
- NP_005578.2 myocyte-specific enhancer factor 2A isoform 1. Transcript Variant: This variant (1) lacks multiple, in-frame coding exons and uses an alternate coding exon, compared to transcript variant 6. These differences result in a shorter isoform (1), compared to isoform 5. MADS: MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptonal regulators. Binds DNA and exists as hetero and homo-dimers. Composed of 2 main subgroups: SRF-like/Type I and MEF2-like (myocyte enhancer factor 2)/ Type II. These subgroups differ mainly in position of the alpha 2 helix responsible for the dimerization interface; Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[24]
Gene ID: 4207 is BORCS8-MEF2B BORCS8-MEF2B readthrough, aka MEF2B myocyte enhancer factor 2B. "This gene represents numerous read-through transcripts that span GeneID:729991 and 100271849. Many read-through transcripts are predicted to be nonsense-mediated decay (NMD) candidates, and are thought to be non-coding. Some transcripts are predicted to be capable of translation reinitiation at a downstream AUG, resulting in expression of at least one isoform of myocyte enhancer factor 2B (MEF2B) from this read-through locus. At least one additional MEF2B variant and isoform can be expressed from a downstream promoter, and is annotated on GeneID:100271849."[25]
- NP_005910.1 myocyte-specific enhancer factor 2B isoform b. Transcript Variant: This variant (1) lacks two alternate exons in the 5' region and one alternate exon in the 3' region, compared to variant 2. This variant is thought to be protein coding because translation can reinitiate at the downstream AUG, resulting in expression of an isoform of MEF2B (geneID:100271849). Isoform b has a shorter and distinct C-terminus, compared to MEF2A isoform a (NP_001139257.1). MEF2 (myocyte enhancer factor 2)-like/Type II subfamily of MADS ( MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptional regulators. Binds DNA and exists as hetero and homo-dimers. Differs from SRF-like/Type I subgroup mainly in position of the alpha helix responsible for the dimerization interface. Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[25]
- NR_027307.2 RNA Sequence. Transcript Variant: This variant (2) represents the longest transcript. This variant is represented as non-coding because the use of the 5'-most translational start codon, as used in NM_001145784.1, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).[25]
- NR_027308.2 RNA Sequence. Transcript Variant: This variant (3) lacks an alternate exon in the 3' region, compared to variant 2. This variant is represented as non-coding because the use of the 5'-most translational start codon, as used in NM_001145784.1, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).[25]
Gene ID: 4208 is MEF2C myocyte enhancer factor 2C. "This locus encodes a member of the MADS box transcription enhancer factor 2 (MEF2) family of proteins, which play a role in myogenesis. The encoded protein, MEF2 polypeptide C, has both trans-activating and DNA binding activities. This protein may play a role in maintaining the differentiated state of muscle cells. Mutations and deletions at this locus have been associated with severe cognitive disability, stereotypic movements, epilepsy, and cerebral malformation. Alternatively spliced transcript variants have been described."[26]
Gene ID: 4209 is MEF2D myocyte enhancer factor 2D. "This gene is a member of the myocyte-specific enhancer factor 2 (MEF2) family of transcription factors. Members of this family are involved in control of muscle and neuronal cell differentiation and development, and are regulated by class II histone deacetylases. Fusions of the encoded protein with Deleted in Azoospermia-Associated Protein 1 (DAZAP1) due to a translocation have been found in an acute lymphoblastic leukemia cell line, suggesting a role in leukemogenesis. The encoded protein may also be involved in Parkinson disease and myotonic dystrophy. Alternative splicing results in multiple transcript variants."[27]
- NP_001258558.1 myocyte-specific enhancer factor 2D isoform 2. Transcript Variant: This variant (2) contains an alternate exon and splice site in the 5' UTR, and lacks an internal in-frame exon in the coding region, compared to variant 1. The resulting isoform (2, also known as hMEF2Da0), is shorter than isoform 1. MEF2 (myocyte enhancer factor 2)-like/Type II subfamily of MADS ( MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptional regulators. Binds DNA and exists as hetero and homo-dimers. Differs from SRF-like/Type I subgroup mainly in position of the alpha helix responsible for the dimerization interface. Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[27]
- NP_005911.1 myocyte-specific enhancer factor 2D isoform 1. Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (1, also known as hMEF2Dab). MEF2 (myocyte enhancer factor 2)-like/Type II subfamily of MADS ( MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptional regulators. Binds DNA and exists as hetero and homo-dimers. Differs from SRF-like/Type I subgroup mainly in position of the alpha helix responsible for the dimerization interface. Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[27]
Gene ID: 23523 is CABIN1 calcineurin binding protein 1. "Calcineurin plays an important role in the T-cell receptor-mediated signal transduction pathway. The protein encoded by this gene binds specifically to the activated form of calcineurin and inhibits calcineurin-mediated signal transduction. The encoded protein is found in the nucleus and contains a leucine zipper domain as well as several PEST motifs, sequences which confer targeted degradation to those proteins which contain them. Alternative splicing results in multiple transcript variants encoding two different isoforms."[28]
- NP_001186210.1 calcineurin-binding protein cabin-1 isoform a. Transcript Variant: This variant (1) represents the longest transcript and encodes the longer isoform (a). Both variants 1 and 2 encode the same isoform (a). MEF2 binding.[28]
- NP_001188358.1 calcineurin-binding protein cabin-1 isoform b. Transcript Variant: This variant (3) differs in the 5' UTR and lacks an alternate in-frame exon compared to variant 1. The resulting isoform (b) has the same N- and C-termini but is shorter compared to isoform a. Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1.[28]
- NP_036427.1 calcineurin-binding protein cabin-1 isoform a. Transcript Variant: This variant (2) differs in the 5' UTR compared to variant 1. Both variants 1 and 2 encode the same isoform (a). MEF2 binding.[28]
Gene ID: 100271849 is MEF2B myocyte enhancer factor 2B. "The product of this gene is a member of the MADS/MEF2 family of DNA binding proteins. The protein is thought to regulate gene expression, including expression of the smooth muscle myosin heavy chain gene. This region undergoes considerable alternative splicing, with transcripts supporting two non-overlapping loci (GeneID 729991 and 100271849) as well as numerous read-through transcripts that span both loci (annotated as GeneID 4207). Several isoforms of this protein are expressed from either this locus or from some of the read-through transcripts annotated on GeneID 4207."[29]
- NP_001139257.1 myocyte-specific enhancer factor 2B isoform 1. MEF2 (myocyte enhancer factor 2)-like/Type II subfamily of MADS ( MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptional regulators. Binds DNA and exists as hetero and homo-dimers. Differs from SRF-like/Type I subgroup mainly in position of the alpha helix responsible for the dimerization interface. Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[29]
- NP_001354211.1 myocyte-specific enhancer factor 2B isoform 2. MEF2 (myocyte enhancer factor 2)-like/Type II subfamily of MADS ( MCM1, Agamous, Deficiens, and SRF (serum response factor) box family of eukaryotic transcriptional regulators. Binds DNA and exists as hetero and homo-dimers. Differs from SRF-like/Type I subgroup mainly in position of the alpha helix responsible for the dimerization interface. Important in homeotic regulation in plants and in immediate-early development in animals. Also found in fungi.[29]
Myosin heavy chains
Gene ID: 4619 is MYH1 myosin heavy chain 1. "Myosin is a major contractile protein which converts chemical energy into mechanical energy through the hydrolysis of ATP. Myosin is a hexameric protein composed of a pair of myosin heavy chains (MYH) and two pairs of nonidentical light chains. Myosin heavy chains are encoded by a multigene family. In mammals at least 10 different myosin heavy chain (MYH) isoforms have been described from striated, smooth, and nonmuscle cells. These isoforms show expression that is spatially and temporally regulated during development."[30]
Gene ID: 4620 is MYH2 myosin heavy chain 2. "Myosins are actin-based motor proteins that function in the generation of mechanical force in eukaryotic cells. Muscle myosins are heterohexamers composed of 2 myosin heavy chains and 2 pairs of nonidentical myosin light chains. This gene encodes a member of the class II or conventional myosin heavy chains, and functions in skeletal muscle contraction. This gene is found in a cluster of myosin heavy chain genes on chromosome 17. A mutation in this gene results in inclusion body myopathy-3. Multiple alternatively spliced variants, encoding the same protein, have been identified."[31]
- NP_001093582.1 myosin-2. Transcript Variant: This variant (2) differs in the 5' UTR compared to variant 1. Both variants encode the same protein.[31]
- NP_060004.3 myosin-2. Transcript Variant: This variant (1) differs in the 5' UTR compared to variant 2. Both variants encode the same protein.[31]
Gene ID: 4621 is MYH3 myosin heavy chain 3. "Myosin is a major contractile protein which converts chemical energy into mechanical energy through the hydrolysis of ATP. Myosin is a hexameric protein composed of a pair of myosin heavy chains (MYH) and two pairs of nonidentical light chains. This gene is a member of the MYH family and encodes a protein with an IQ domain and a myosin head-like domain. Mutations in this gene have been associated with two congenital contracture (arthrogryposis) syndromes, Freeman-Sheldon syndrome and Sheldon-Hall syndrome."[32]
Gene ID: 4622 is MYH4 myosin heavy chain 4.[33]
Gene ID: 4624 is MYH6 myosin heavy chain 6. "Cardiac muscle myosin is a hexamer consisting of two heavy chain subunits, two light chain subunits, and two regulatory subunits. This gene encodes the alpha heavy chain subunit of cardiac myosin. The gene is located approximately 4kb downstream of the gene encoding the beta heavy chain subunit of cardiac myosin. Mutations in this gene cause familial hypertrophic cardiomyopathy and atrial septal defect 3."[34]
Gene ID: 4625 is MYH7 myosin heavy chain 7. "Muscle myosin is a hexameric protein containing 2 heavy chain subunits, 2 alkali light chain subunits, and 2 regulatory light chain subunits. This gene encodes the beta (or slow) heavy chain subunit of cardiac myosin. It is expressed predominantly in normal human ventricle. It is also expressed in skeletal muscle tissues rich in slow-twitch type I muscle fibers. Changes in the relative abundance of this protein and the alpha (or fast) heavy subunit of cardiac myosin correlate with the contractile velocity of cardiac muscle. Its expression is also altered during thyroid hormone depletion and hemodynamic overloading. Mutations in this gene are associated with familial hypertrophic cardiomyopathy, myosin storage myopathy, dilated cardiomyopathy, and Laing early-onset distal myopathy."[35]
Gene ID: 4626 is MYH8 myosin heavy chain 8. "Myosins are actin-based motor proteins that function in the generation of mechanical force in eukaryotic cells. Muscle myosins are heterohexamers composed of 2 myosin heavy chains and 2 pairs of nonidentical myosin light chains. This gene encodes a member of the class II or conventional myosin heavy chains, and functions in skeletal muscle contraction. This gene is predominantly expressed in fetal skeletal muscle. This gene is found in a cluster of myosin heavy chain genes on chromosome 17. A mutation in this gene results in trismus-pseudocamptodactyly syndrome."[36]
Gene ID: 4627 is MYH9 myosin heavy chain 9. "This gene encodes a conventional non-muscle myosin; this protein should not be confused with the unconventional myosin-9a or 9b (MYO9A or MYO9B). The encoded protein is a myosin IIA heavy chain that contains an IQ domain and a myosin head-like domain which is involved in several important functions, including cytokinesis, cell motility and maintenance of cell shape. Defects in this gene have been associated with non-syndromic sensorineural deafness autosomal dominant type 17, Epstein syndrome, Alport syndrome with macrothrombocytopenia, Sebastian syndrome, Fechtner syndrome and macrothrombocytopenia with progressive sensorineural deafness."[37]
Gene ID: 4628 is MYH10 myosin heavy chain 10. "This gene encodes a member of the myosin superfamily. The protein represents a conventional non-muscle myosin; it should not be confused with the unconventional myosin-10 (MYO10). Myosins are actin-dependent motor proteins with diverse functions including regulation of cytokinesis, cell motility, and cell polarity. Mutations in this gene have been associated with May-Hegglin anomaly and developmental defects in brain and heart. Multiple transcript variants encoding different isoforms have been found for this gene."[38]
- NP_001242941.1 myosin-10 isoform 1. Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1).[38]
- NP_001243024.1 myosin-10 isoform 3. Transcript Variant: This variant (3) uses an alternate in-frame splice site in the 5' coding region, and lacks an alternate in-frame exon in the central coding region, compared to variant 1. The encoded isoform (3) is shorter than isoform 1.[38]
- NP_001362195.1 myosin-10 isoform 4.[38]
- NP_005955.3 myosin-10 isoform 2. Transcript Variant: This variant (2) lacks an alternate in-frame exon in both the 5' and central coding regions, compared to variant 1. The encoded isoform (2) is shorter than isoform 1.[38]
Gene ID: 4629 is MYH11 myosin heavy chain 11. "The protein encoded by this gene is a smooth muscle myosin belonging to the myosin heavy chain family. The gene product is a subunit of a hexameric protein that consists of two heavy chain subunits and two pairs of non-identical light chain subunits. It functions as a major contractile protein, converting chemical energy into mechanical energy through the hydrolysis of ATP. The gene encoding a human ortholog of rat NUDE1 is transcribed from the reverse strand of this gene, and its 3' end overlaps with that of the latter. The pericentric inversion of chromosome 16 [inv(16)(p13q22)] produces a chimeric transcript that encodes a protein consisting of the first 165 residues from the N terminus of core-binding factor beta in a fusion with the C-terminal portion of the smooth muscle myosin heavy chain. This chromosomal rearrangement is associated with acute myeloid leukemia of the M4Eo subtype. Alternative splicing generates isoforms that are differentially expressed, with ratios changing during muscle cell maturation. Alternatively spliced transcript variants encoding different isoforms have been identified."[39]
- NP_001035202.1 myosin-11 isoform SM2B. Transcript Variant: This variant (SM2B) represents the longer transcript. It encodes the isoform SM2B.[39]
- NP_001035203.1 myosin-11 isoform SM1B. Transcript Variant: This variant (SM1B) lacks a segment in the coding region, which leads to a frameshift, compared to variant SM2B. The encoded isoform (SM1B) is longer and varies in the carboxyl terminus, compared to isoform SM2B.[39]
- NP_002465.1 myosin-11 isoform SM1A. Transcript Variant: This variant (SM1A) lacks two segments in the coding region, compared to variant SM2B. The encoded isoform (SM1A) is shorter and varies in the carboxyl terminus, compared to isoform SM2B.[39]
- NP_074035.1 myosin-11 isoform SM2A. Transcript Variant: This variant (SM2A) lacks an in-frame segment of the coding region, compared to variant SM2B. It encodes a shorter isoform (SM2A), that is missing an internal segment compared to isoform SM2B.[39]
- XP_016878739.1 myosin-11 isoform X1.
- XP_011520804.1 myosin-11 isoform X2.
Gene ID: 4644 is MYO5A myosin VA aka MYH12. "This gene is one of three myosin V heavy-chain genes, belonging to the myosin gene superfamily. Myosin V is a class of actin-based motor proteins involved in cytoplasmic vesicle transport and anchorage, spindle-pole alignment and mRNA translocation. The protein encoded by this gene is abundant in melanocytes and nerve cells. Mutations in this gene cause Griscelli syndrome type-1 (GS1), Griscelli syndrome type-3 (GS3) and neuroectodermal melanolysosomal disease, or Elejalde disease. Multiple alternatively spliced transcript variants encoding different isoforms have been reported, but the full-length nature of some variants has not been determined."[40]
- NP_000250.3 unconventional myosin-Va isoform 1. Transcript Variant: This variant (1) encodes the longer isoform (1).[40]
- NP_001135967.2 unconventional myosin-Va isoform 2. Transcript Variant: This variant (2) lacks an in-frame exon in the CDS, resulting in a shorter isoform (2), as compared to variant 1.[40]
Gene ID: 8735 is MYH13 myosin heavy chain 13.[41]
Gene ID: 22989 is MYH15 myosin heavy chain 15.[42]
- NP_055796.1 myosin-15 precursor.[42]
Gene ID: 57644 is MYH7B myosin heavy chain 7B. "The myosin II molecule is a multi-subunit complex consisting of two heavy chains and four light chains. This gene encodes a heavy chain of myosin II, which is a member of the motor-domain superfamily. The heavy chain includes a globular motor domain, which catalyzes ATP hydrolysis and interacts with actin, and a tail domain in which heptad repeat sequences promote dimerization by interacting to form a rod-like alpha-helical coiled coil. This heavy chain subunit is a slow-twitch myosin. Alternatively spliced transcript variants have been found, but the full-length nature of these variants is not determined."[43]
Gene ID: 79784 is MYH14 myosin heavy chain 14. "This gene encodes a member of the myosin superfamily. The protein represents a conventional non-muscle myosin; it should not be confused with the unconventional myosin-14 (MYO14). Myosins are actin-dependent motor proteins with diverse functions including regulation of cytokinesis, cell motility, and cell polarity. Mutations in this gene result in one form of autosomal dominant hearing impairment. Multiple transcript variants encoding different isoforms have been found for this gene."[44]
- NP_001070654.1 myosin-14 isoform 1. Transcript Variant: This variant (1) lacks an alternate in-frame exon in the 5' coding region, compared to variant 3. The resulting isoform (1) lacks an internal segment in the motor domain, compared to isoform 3.[44]
- NP_001139281.1 myosin-14 isoform 3. Transcript Variant: This variant (3) represents the longest transcript and encodes the longest isoform (3).[44]
- NP_079005.3 myosin-14 isoform 2. Transcript Variant: This variant (2) lacks two alternate in-frame exons in the 5' coding region, compared to variant 3. The resulting isoform (2) lacks two separate segments in the motor domain, compared to isoform 3.[44]
- XP_011525622.1 myosin-14 isoform X1.[44]
- XP_011525623.1 myosin-14 isoform X2.[44]
- XP_006723449.1 myosin-14 isoform X3.[44]
- XP_024307489.1 myosin-14 isoform X4.[44]
- XP_011525625.1 myosin-14 isoform X3.[44]
Smooth muscle kinase genes
Analysis "of the smMLCK promoter revealed that a single CArG box is required for basal promoter activity in smooth muscle and nonmuscle cell types. The smooth and cardiac muscle restricted serum response factor (SRF) coactivator myocardin robustly induced smMLCK expression in 10T1/2 cells, although it increased the activity of the proximal smMLCK promoter only twofold in reporter gene assays. In contrast to SRF and myocardin, GATA-6 repressed the activity of the smMLCK promoter and inhibited smMLCK protein expression in vascular smooth muscle cells. Altogether, these studies indicate that expression of the 130-kDa smMLCK is regulated by a CArG-dependent promoter located within an intron of the mouse mylk gene."[45] The transcription factor binding motifs in the smooth muscle MLCK proximal promoter are -166 CCTTATAAGG (CArG), -141 CCGATATA (GATA), -101 CAAT, -87 ATAAAC (Fox), -74 GGCCGGCCCC (Sp1), +6 ACCCAGCCCC (Sp1), and +60 GGGGGCGGGA (Sp1), with transcription start sites at G+1, +47 A, +54 A, and +118 A.[45]
Gene ID: 4638 is MYLK myosin light chain kinase aka KRP; AAT7; MLCK; MLCK1; MMIHS; MYLK1; smMLCK; MLCK108; MLCK210; MSTP083. "This gene, a muscle member of the immunoglobulin gene superfamily, encodes myosin light chain kinase which is a calcium/calmodulin dependent enzyme. This kinase phosphorylates myosin regulatory light chains to facilitate myosin interaction with actin filaments to produce contractile activity. This gene encodes both smooth muscle and nonmuscle isoforms. In addition, using a separate promoter in an intron in the 3' region, it encodes telokin, a small protein identical in sequence to the C-terminus of myosin light chain kinase, that is independently expressed in smooth muscle and functions to stabilize unphosphorylated myosin filaments. A pseudogene is located on the p arm of chromosome 3. Four transcript variants that produce four isoforms of the calcium/calmodulin dependent enzyme have been identified as well as two transcripts that produce two isoforms of telokin. Additional variants have been identified but lack full length transcripts."[46]
- NP_001308238.1 myosin light chain kinase, smooth muscle isoform 9.[46]
- NP_444253.3 myosin light chain kinase, smooth muscle isoform 1. Transcript Variant: This variant (1) is the full-length transcript and encodes the full-length nonmuscle isoform.[46]
- NP_444254.3 myosin light chain kinase, smooth muscle isoform 2. Transcript Variant: This variant (2) does not utilize exon 11, compared to variant 1, resulting in a shorter protein (isoform 2), compared to isoform 1.[46]
- NP_444255.3 myosin light chain kinase, smooth muscle isoform 3A. Transcript Variant: This variant (3A) does not utilize exon 30, compared to variant 1, resulting in a shorter protein (isoform 3A), compared to isoform 1.[46]
- NP_444256.3 myosin light chain kinase, smooth muscle isoform 3B. Transcript Variant: This variant (3B) does not utilize exons 11 and 30, compared to variant 1, resulting in a shorter protein (isoform 3B), compared to isoform 1.[46]
- NP_444259.1 myosin light chain kinase, smooth muscle isoform 7. Transcript Variant: This variant (7) encodes the shorter isoform of kinase related protein, telokin. The first exon corresponds to intron 30 and the remainder of the transcript corresponds to the last two exons of the gene. It is shorter than variant 8 by one codon at the splicing junction between the first two exons.[46]
- NP_444260.1 myosin light chain kinase, smooth muscle isoform 8. ranscript Variant: This variant (8) encodes the longer isoform of kinase related protein, telokin. It is longer than variant 7 by one codon at the splicing junction between the first two exons.[46]
Hypotheses
- A1BG is not transcribed using a CArG box.
- A CArG box on either side of A1BG may show that it is actively used to transcribe A1BG.
Results
There is a more general CArG box, 3'-CATTAAAAGG-5', at 3441 from ZSCAN22, or -1019 nts from the TSS of A1BG in the distal promoter.
A second more general CArG box, 3'-CAAAAAAAAG-5', at 1399 from ZSCAN22, or -3061 nts from the A1BG TSS may be a CArG box for ZSCAN22 in the negative direction on the positive strand in the distal promoter.
CArG box analysis and results
"CArG box [CC(A/T)6GG][3] DNA [consensus] sequences present within the promoters of SMC genes play a pivotal role in controlling their transcription".[2]
"MADS-box proteins bind to a consensus sequence, the CArG box, that has the core motif CC(A/T)6GG (15)."[4]
"Of the [Flowering Locus C] FLC binding sites, 69% contained at least one CArG-box motif with the core consensus sequence CCAAAAAT(G/A)G and an AAA extension at the 3′ end [...]."[4]
Three "other MADS-box flowering-time regulators, SOC1, SVP, and AGAMOUS-LIKE 24 (AGL24), bind to two different CArG-box motifs at 502 bp (CTAAATATGG) and 287 bp (CAATAATTGG) upstream of the translation start in the SEP3 gene (24), consistent with different specificities for the different MADS-box proteins."[4]
These together with the core motif CC(A/T)6GG suggest a more general CArG-box motif of (C(C/A/T)(A/T)6(A/G)G).
The real promoters for A1BG have only two CArG boxes, one within the UTR: CATTAAAAGG at 3441 (a direct with an occurrence of 0.5) and CAAAAAAAAG at 1399 (also a direct with an occurrence of 0.5) within the distal promoter, with 2032 nucleotides between the ending of the first at 1399 and the beginning of the second at 3431. No CArGs occur in the positive direction.
The random datasets have five in the UTR for an occurrence of 0.5 (three directs for 0.6 and two inverse complements for 0.4) but thirty-two in the distal promoters for an occurrence of 1.6, with an occurrence of 1.4 in the arbitrary negative direction and 1.9 in the arbitrary positive direction.
Although the occurrences in the UTR for real and random suggest the one is random, had the complement inverses not occurred as in the reals, the random probability for direct only would have been 0.3. This in turn suggests the occurrence in the UTR is likely active or activable. The much higher occurrences for the distal promoters also suggests that the real occurrence in the distal promoter in the negative direction is also likely active or activable. In the direct negative direction only the random datasets had an occurrence of 1.4.
Reals or randoms | Promoters | direction | Numbers | Strands | Occurrences | Averages (± 0.1) |
---|---|---|---|---|---|---|
Reals | UTR | negative | 1 | 2 | 0.5 | 0.5 |
Randoms | UTR | arbitrary negative | 4 | 10 | 0.4 | 0.4 |
Randoms | UTR | alternate negative | 4 | 10 | 0.4 | 0.4 |
Reals | Core | negative | 0 | 2 | 0 | 0 |
Randoms | Core | arbitrary negative | 0 | 10 | 0 | 0 |
Randoms | Core | alternate negative | 0 | 10 | 0 | 0 |
Reals | Core | positive | 0 | 2 | 0 | 0 |
Randoms | Core | arbitrary positive | 0 | 10 | 0 | 0 |
Randoms | Core | alternate positive | 0 | 10 | 0 | 0 |
Reals | Proximal | negative | 0 | 2 | 0 | 0 |
Randoms | Proximal | arbitrary negative | 0 | 10 | 0 | 0.05 |
Randoms | Proximal | alternate negative | 1 | 10 | 0.1 | 0.05 |
Reals | Proximal | positive | 0 | 2 | 0 | 0 |
Randoms | Proximal | arbitrary positive | 0 | 10 | 0 | 0 |
Randoms | Proximal | alternate positive | 0 | 10 | 0 | 0 |
Reals | Distal | negative | 1 | 2 | 0.5 | 0.5 |
Randoms | Distal | arbitrary negative | 9 | 10 | 0.9 | 0.95 |
Randoms | Distal | alternate negative | 10 | 10 | 1.0 | 0.95 |
Reals | Distal | positive | 0 | 2 | 0 | 0 |
Randoms | Distal | arbitrary positive | 14 | 10 | 1.4 | 1.3 |
Randoms | Distal | alternate positive | 12 | 10 | 1.2 | 1.3 |
Comparison:
The occurrences of real CArG UTRs are greater than the randoms and the negative direction distals are less than the randoms. This suggests that the real CArGs are likely active or activable.
Acknowledgements
The content on this page was first contributed by: Henry A. Hoff.
Initial content for this page in some instances came from Wikiversity.
See also
References
- ↑ 74.100.224.95 (10 January 2010). "Box (disambiguation)". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 2013-06-15.
- ↑ 2.00 2.01 2.02 2.03 2.04 2.05 2.06 2.07 2.08 2.09 2.10 Oliver G. McDonald, Brian R. Wamhoff, Mark H. Hoofnagle, and Gary K. Owens (January 4, 2006). "Control of SRF binding to CArG box chromatin regulates smooth muscle gene expression in vivo". The Journal of Clinical Investigation. 116 (1): 36–48. Retrieved 2014-06-05.
- ↑ 3.0 3.1 Shinji Kamada and Takeshi Miwa (1 October 1992). "A protein binding to CArG box motifs and to single-stranded DNA functions as a transcriptional repressor". Gene. 119 (2): 229–236. doi:10.1016/0378-1119(92)90276-U. Retrieved 2017-09-17.
- ↑ 4.0 4.1 4.2 4.3 4.4 4.5 Weiwei Deng, Hua Ying, Chris A. Helliwell, Jennifer M. Taylor, W. James Peacock, and Elizabeth S. Dennis (19 April 2011). "FLOWERING LOCUS C (FLC) regulates development pathways throughout the life cycle of Arabidopsis". Proceedings of the National Academy of Sciences United States of America. 108 (16): 6680–6685. doi:10.1073/pnas.1103175108. Retrieved 2017-09-17.
- ↑ 5.0 5.1 5.2 5.3 5.4 Michael S. Parmacek (16 March 2007). "Myocardin-Related Transcription Factors : Critical Coactivators Regulating Cardiovascular Development and Adaptation" (PDF). Circulation Research. 100 (5): 633–644. doi:10.1161/01.RES.0000259563.61091.e8. Retrieved 2017-09-19.
- ↑ 6.0 6.1 6.2 6.3 6.4 6.5 Ida Najwer and Brenda Lilly (25 May 2005). "Ca2+/calmodulin-dependent protein kinase IV activates cysteine-rich protein 1 through adjacent CRE and CArG elements" (PDF). American Journal of Physiology-Cell Physiology. 289 (4): C785–C793. doi:10.1152/ajpcell.00098.2005. PMID 15917302. Retrieved 8 December 2019.
- ↑ Takeshi Miwa, Linda M. Boxer, and Larry Kedes (October 1987). "CArG boxes in the human cardiac α-actin gene are core binding sites for positive trans-acting regulatory factors" (PDF). Proceedings of the National Academy of Sciences USA. 84 (19): 6702–6706. Retrieved 2017-09-18.
- ↑ Rakesh Datta, Eric Rubin, Vikas Sukhatme, Sajjad Qureshi, Dennis Hallahan, Ralph R. Weichselbaum, and Donald W. Kufe (November 1992). "Ionizing radiation activates transcription of the EGR1 gene via CArG elements" (PDF). Proceedings of the National Academy of Sciences USA. 89 (21): 10149–10153. Retrieved 2017-09-18.
- ↑ Masaki Fujisawa, Toshitsugu Nakano, Yoko Shima and Yasuhiro Ito (5 February 2013). "A large-scale identification of direct targets of the tomato MADS box transcription factor RIPENING INHIBITOR reveals the regulation of fruit ripening". The Plant Cell. 25 (2): 371–86. doi:10.1105/tpc.112.108118. Retrieved 2017-02-19.
- ↑ 10.0 10.1 Wataru Nishida, Mako Nakamura, Syunsuke Mori, Masanori Takahashi, Yasuyuki Ohkawa, Satoko Tadokoro, Kenji Yoshida, Kunio Hiwada, Ken’ichiro Hayashi, and Kenji Sobue (1 March 2002). "A Triad of Serum Response Factor and the GATA and NK Families Governs the Transcription of Smooth and Cardiac Muscle Genes" (PDF). The Journal of Biological Chemistry. 277 (9): 7308–7317. doi:10.1074/jbc.M111824200. Retrieved 11 January 2020.
- ↑ RefSeq (September 2019). "ACTA1 actin alpha 1, skeletal muscle [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ 12.0 12.1 12.2 12.3 RefSeq (September 2017). "ACTA2 actin alpha 2, smooth muscle [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ RefSeq (July 2008). "ACTC1 actin alpha cardiac muscle 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ 14.00 14.01 14.02 14.03 14.04 14.05 14.06 14.07 14.08 14.09 14.10 14.11 14.12 14.13 14.14 14.15 14.16 14.17 14.18 14.19 14.20 14.21 RefSeq (July 2008). "CALD1 caldesmon 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ 15.0 15.1 15.2 RefSeq (August 2017). "TAGLN transgelin [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ RefSeq (October 2015). "NPPA natriuretic peptide A [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (November 2014). "NPPB natriuretic peptide B [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (October 2015). "NPPC natriuretic peptide C [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 19.0 19.1 19.2 19.3 HGNC (21 December 2019). "CNN1 calponin 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 20.0 20.1 20.2 20.3 20.4 20.5 RefSeq (January 2015). "CNN2 calponin 2 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 21.0 21.1 21.2 21.3 21.4 RefSeq (July 2008). "CNN3 calponin 3 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 22.0 22.1 RefSeq (July 2008). "FOS Fos proto-oncogene, AP-1 transcription factor subunit [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ 23.0 23.1 RefSeq (July 2008). "ITGA1 integrin subunit alpha 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ 24.00 24.01 24.02 24.03 24.04 24.05 24.06 24.07 24.08 24.09 24.10 24.11 24.12 24.13 24.14 24.15 24.16 24.17 24.18 24.19 24.20 24.21 24.22 RefSeq (January 2010). "GATA1 GATA binding protein 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 25.0 25.1 25.2 25.3 RefSeq (October 2010). "BORCS8-MEF2B BORCS8-MEF2B readthrough [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (July 2010). "MEF2C myocyte enhancer factor 2C [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 27.0 27.1 27.2 RefSeq (October 2012). "MEF2D myocyte enhancer factor 2D [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 28.0 28.1 28.2 28.3 RefSeq (January 2011). "CABIN1 calcineurin binding protein 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 29.0 29.1 29.2 RefSeq (January 2014). "MEF2B myocyte enhancer factor 2B [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (July 2008). "MYH1 myosin heavy chain 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 31.0 31.1 31.2 RefSeq (July 2008). "MYH2 myosin heavy chain 2 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (July 2008). "MYH3 myosin heavy chain 3 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (21 December 2019). "MYH4 myosin heavy chain 4 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (February 2017). "MYH6 myosin heavy chain 6 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (July 2008). "MYH7 myosin heavy chain 7 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (September 2009). "MYH8 myosin heavy chain 8 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (December 2011). "MYH9 myosin heavy chain 9 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 38.0 38.1 38.2 38.3 38.4 RefSeq (December 2011). "MYH10 myosin heavy chain 10 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 39.0 39.1 39.2 39.3 39.4 RefSeq (July 2008). "MYH11 myosin heavy chain 11 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ 40.0 40.1 40.2 RefSeq (December 2008). "MYO5A myosin VA [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 11 January 2020.
- ↑ HGNC (21 December 2019). "MYH13 myosin heavy chain 13 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 42.0 42.1 HGNC (21 December 2019). "MYH15 myosin heavy chain 15 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ RefSeq (March 2010). "MYH7B myosin heavy chain 7B [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 44.0 44.1 44.2 44.3 44.4 44.5 44.6 44.7 44.8 RefSeq (December 2011). "MYH14 myosin heavy chain 14 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 12 January 2020.
- ↑ 45.0 45.1 Feng Yin, April M. Hoggatt, Jiliang Zhou, and B. Paul Herring (1 June 2006). "130-kDa smooth muscle myosin light chain kinase is transcribed from a CArG-dependent, internal promoter within the mouse mylk gene" (PDF). American Journal Physiology-Cell Physiology. 290 (6): C1599–C1609. doi:10.1152/ajpcell.00289.2005. Retrieved 13 January 2020.
- ↑ 46.0 46.1 46.2 46.3 46.4 46.5 46.6 46.7 RefSeq (July 2008). "MYLK myosin light chain kinase [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 13 January 2020.