C4orf21

Jump to navigation Jump to search
VALUE_ERROR (nil)
Identifiers
Aliases
External IDsGeneCards: [1]
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

n/a

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC)n/an/a
PubMed searchn/an/a
Wikidata
View/Edit Human

C4orf21 (Chromosome 4 open reading frame 21) is a protein in humans that is encoded by the C4orf21 gene that has uncharacterised function and a weight of 236.6 kDa.[1] The encoded protein of this gene has been linked with alcohol dependence.[2] This gene shows relatively low expression in most human tissues, with increased expression in situations of chemical dependence. C4orf21 is orthologous to nearly all kingdoms of Eukarya. Functional domains of this protein link it to a series of helicases, most notably the AAA_12 and AAA_11 domains.

Gene

The entire gene is 97,663 base pairs long and has an unprocessed mRNA that is 6,740 nucleotides in length. It consists of 28 exons that encode for a 2104 amino acid protein. 12 splice variants exist for C4orf21.

File:Chromosomal position of c4orf21 gene.png
Human chromosomal position of c4orf21 gene on the long arm of chromosome 4

Locus

C4orf21 is located on the fourth chromosome on the 4q25 position near the LARP7 gene. It is encoded for on the minus strand.

Homology and evolution

Homologous domains

C4orf21 contains a DUF2439 domain (domain of unknown function), zf-GRF domain, and AAA_11 and an AAA_12 domain (ATPases associated with diverse cellular activities). DUF domains are involved in telomere maintenance and meiotic segregation. AAA_11 and AAA_12 contain a P-loop motif which are involved in conjugative transfer proteins. Other helicase domains are also present in c4orf21 orthologs.

Paralogs

There are 9 moderately-related proteins in humans that are paralogous to the ATP-dependent helicase containing domains in the C-terminus of c4orf21 after the 1612th amino acid. A majority of these proteins are in the RNA helicase family. There are no known paralogs to the large N-terminal portion of the protein.

Sequence identity of helicase domain in paralogs
Paralogous Protein Protein Name Amino Acid Identity Amino Acid Similarity
UPF1 regulator of nonsense transcripts 1 32% 51%
IGHMBP2 immunoglobulin helicase μ-binding protein 2 30% 47%
MOV10 Moloney Leukemia Virus 10 30% 47%
SETX senataxin 29% 43%
ZNFX1 zinc finger, NFX1-type containing 1 28% 47%
DNA2 DNA replication ATP-dependent helicase/nuclease 26% 44%
PPARG peroxisome proliferator-activated receptor gamma 26% 43%
HELZ helicase with zinc finger domain 25% 42%
AQR intron-binding protein Aquarius 24% 48%
File:Unrooted Phylogenetic Tree of RNA Helicase Domain in c4orf21 Paralogs.jpg
Unrooted phylogenetic tree of proteins that are paralogous to the helicase domain containing portion of c4orf21

Orthologs

Complete orthologs of the c4orf21 gene are found in mammalia. The helicase domain containing C-terminus portion of the gene is conserved across Eukarya.

Protein

Primary sequence

C4orf21 is 236.6 kDa.

File:Amino Acid Composition of c4orf21.png
Amino Acid composition of c4orf21.

Post-translational modifications

C4orf21 has experimentally determined phosphorylation sites at the Y38, S137, S140, S325, and S864 positions.

File:Post-translational modification sites of c4orf21.png
Experimentally determined post-translational modification sites in c4orf21

Secondary structure

A weak transmembrane domain is predicted in the TMHMM server with one loop in the C-terminus of the protein prior to the helicase core. This domain contains both ends outside of a membrane.

Tertiary domains and quaternary structure

C4orf21 has related structures to Upf1, a paralog. These structures have the capability to bind zinc ions and mRNA.

File:C4orf21 model as proposed by Phyre 2.0. Regulator of nonsense transcripts. Hydrolase..png
Structure of C4ORF21 based upon UPF1 model. Image colored in rainbow from N to C terminus. This structure is based upon the crystal structure of the complex between 2 human nonsense mediated decay factors, upf1 and upf2, orthorhombic form.

Function and biochemistry

The function of c4orf21 is unknown. Given this, the paralogs to the helicase core of the gene are associated with translation, transcription, nonsense-mediated mRNA decay, RNA decay, miRNA processing, RISC assembly, and pre-mRNA splicing.[3] These paralogs operate under a SPF1 RNA helicase motif.[4]

Mov10, a paralog, and probable RNA helicase is required for RNA-mediated gene silencing by the RNA-induced silencing complex (RISC). It is also required for both miRNA-mediated translational repression and miRNA-mediated cleavage of complementary mRNAs by RISC, and for RNA-directed transcription and replication of the human hepatitis delta virus (HDV). Mov10 nteracts with small capped HDV RNAs derived from genomic hairpin structures that mark the initiation sites of RNA-dependent HDV RNA transcription.

Expression

Expression is relatively low for c4orf21 compared to other proteins. Expression of c4orf21 is slightly elevated compared to its average expression in tissue in the hematopoietic and lymphatic systems, and is above average in the brain also. Lower averages exist in liver, pharynx, and skin tissue.[5]

Transcription factor interactions

The transcriptional start site for c4orf21 aligns best with ATF, CREB, deltaCREB, E2F, and E2F-1 transcription factor binding sites.

Interacting proteins

C4orf21 shows predicted protein interaction with its AQR, DNA2, IGHMBP2, LOC91431, and SETX paralogs.[6]

Clinical significance

C4orf21 has been previously linked to alcohol dependence[2] (where genes linked to this disorder are also linked to alcoholism and other psychological and personality disorder).[7] Given this, expression of the gene in the liver and brain are particularly interesting. Upon examination of variable GEO profiles, there were many related to Hepatitis and other disorders of the liver. The best correlative studies were those in relation to liver transplant failure.[8][9] The link to alcohol dependence provides a strong connection to dependence to other chemical substances such as nicotine through analysis of lymphoblast cells. C4orf21 showed significantly increased expression in those who were nicotine dependent versus a control group of non-smokers.[9][10] Upregulation of c4orf21 was also present in certain cancer expression data sets.

A paralog of c4orf21 was found to inhibit HIV-1 Replication at multiple stages. Mov10 is involved in the biological processes of RNA-mediated gene silencing, transcription, transcription regulation and has hydrolase and helicase activity through ATP and RNA binding.[11]

References

  1. "Entrez Gene: Chromosome 4 open reading frame 21".
  2. 2.0 2.1 Kalsi G, Kuo PH, Aliev F, Alexander J, McMichael O, Patterson DG, Walsh D, Zhao Z, Schuckit M, Nurnberger J, Edenberg H, Kramer J, Hesselbrock V, Tischfield JA, Vladimirov V, Prescott CA, Dick DM, Kendler KS, Riley BP (Jun 2010). "A systematic gene-based screen of chr4q22-q32 identifies association of a novel susceptibility gene, DKK2, with the quantitative trait of alcohol dependence symptom counts". Human Molecular Genetics. 19 (12): 2497–506. doi:10.1093/hmg/ddq112. PMC 2876884. PMID 20332099.
  3. Jankowsky E (Jan 2011). "RNA helicases at work: binding and rearranging". Trends in Biochemical Sciences. 36 (1): 19–29. doi:10.1016/j.tibs.2010.07.008. PMC 3017212. PMID 20813532.
  4. Fairman-Williams ME, Guenther UP, Jankowsky E (Jun 2010). "SF1 and SF2 helicases: family matters". Current Opinion in Structural Biology. 20 (3): 313–24. doi:10.1016/j.sbi.2010.03.011. PMC 2916977. PMID 20456941.
  5. "c4orf21". Expression Atlas. Retrieved 16 May 2013.
  6. Anon. "Predicted protein interactions between paralogs and c4orf21". C4orf21 Gene - GeneCards. Retrieved 16 May 2013.
  7. "Genetic disorders linked to alcohol dependence as given be relative correlation distance". Alcohol Dependence Disease: Drugs, Articles, Genes, Clinical Trials - Malacards. Retrieved 16 May 2013.
  8. Nissim O, Melis M, Diaz G, Kleiner DE, Tice A, Fantola G, Zamboni F, Mishra L, Farci P (2012). "Liver regeneration signature in hepatitis B virus (HBV)-associated acute liver failure identified by gene expression profiling". PLOS ONE. 7 (11): e49611. doi:10.1371/journal.pone.0049611. PMC 3504149. PMID 23185381.
  9. 9.0 9.1 Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A (Jan 2013). "NCBI GEO: archive for functional genomics data sets--update". Nucleic Acids Research. 41 (Database issue): D991–5. doi:10.1093/nar/gks1193. PMC 3531084. PMID 23193258.
  10. Philibert RA, Ryu GY, Yoon JG, Sandhu H, Hollenbeck N, Gunter T, Barkhurst A, Adams W, Madan A (Jul 2007). "Transcriptional profiling of subjects from the Iowa adoption studies". American Journal of Medical Genetics Part B. 144B (5): 683–90. doi:10.1002/ajmg.b.30512. PMID 17342724.
  11. Burdick R, Smith JL, Chaipan C, Friew Y, Chen J, Venkatachari NJ, Delviks-Frankenberry KA, Hu WS, Pathak VK (Oct 2010). "P body-associated protein Mov10 inhibits HIV-1 replication at multiple stages". Journal of Virology. 84 (19): 10241–53. doi:10.1128/JVI.00585-10. PMC 2937795. PMID 20668078.

External links

Further reading