Uncharacterized protein KIAA1109 is a protein that in humans is encoded by the KIAA1109 gene.[5][6][7]
KIAA1109 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | KIAA1109, 4932438A13Rik, 4732443H21, B830039D19Rik, D630029K19Rik, FSA, Kiaa1109, Tweek, ALKKUCS | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 611565; MGI: 2444631; HomoloGene: 52105; GeneCards: KIAA1109; OMA:KIAA1109 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
This protein has a function that is not yet understood. KIAA1109 has 3 aliases, FSA (fragile site-associated) protein, MGC110967 and DKFZp781P0474.[8]
An ortholog of KIAA1109 has been found in Drosophila, responsible for regulating synaptic vesicle recycling and certain brain related functions.[9] Drosophila exhibiting mutated KIAA1109 were unable to walk or stand upright for long periods and exhibited seizures, reminiscent of severe neurological defects. This led to it being given the name "Tweek", after a character in South Park.[10]
Gene
editLocation
editKIAA1109 is found on the long arm of chromosome 4 (4q27), with the genomic sequence starting at 118,818,167 bp and ending at 119,010,362 bp[11]
Gene Neighborhood
editThe gene neighborhood of KIAA1109 involves 4 other genes. KIAA1109 is a part of the KIAA1109/Tenr/IL2/IL21 gene region. This region consists of the three genes to the right of KIAA1109; ADAD1, IL2 and IL21.[12] Another gene located in the neighborhood of KIAA1109 is TRPC3. This gene is to the left of KIAA1109 on the opposite side of the genes described above.[8]
Expression
editAccording to data on NCBI's EST Abundance Profile page for KIAA1109, the gene is expressed in many different tissues in humans. Human expression is seen most predominately in parathyroid, muscle, ear, eye, mammary gland, lymph node, thymus in addition to 27 other tissues. KIAA1109 is also expressed in various disease states including 12 different tumors as well as bladder carcinoma, chondrosarcoma, glioma, leukemia, lymphoma, non-neoplasia, retinoblastoma tissues.[13] KIAA1109 is expressed in all stages of development from embryoid body to adult, except in infants. No expression of my gene is seen during the infant stage of development.[13]
Promoter
editAccording to Genomatix's ElDorado program the promoter region of KIAA1109 is predicted to be 601 base pairs in length. The promoter region starts 500 base pairs upstream of the 5’ UTR of KIAA1109 mRNA transcript and contains part of this 5’ UTR.[14]
Homology
editKIAA1109 is conserved throughout many species. Orthologs have been found in many mammals and other vertebrates. More distant homologs have been identified in animals such as insects. See the mRNA and protein conservation sections below for more details. No human paralogs for KIAA1109 have been identified.[15]
mRNA
editSplice Variants
editKIAA1109 has 13 mRNA splice variants and 6 unspliced variants. Variant A is the longest and most commonly occurring variant of the gene[16] and is the subject of this article. KIAA1109 variant A is made up of 84 exons and is 15,592 base pairs in length.[12] The accession number for this nucleotide is NM_015312.3.
Conservation
editThe mRNA sequence of KIAA1109 is highly conserved throughout mammals. The mRNA sequence identity to mammals is no less than 81.9% (in platypus) and ranging up to 99.5% (in chimpanzees).[17] Birds also show fairly high conservation with mRNA sequence identities around 78% in zebra finches. The table blow shows information on the mRNA orthologs.
Genus and species name | Common name | mRNA accession number[18] | Sequence length (bp)[18] | Sequence identity to human mRNA[17] |
---|---|---|---|---|
Homo sapiens | Human | NM_015312.3 | 15592 | |
Pan troglodytes | Chimpanzee | XM_517422.2 | 15578 | 99.5% |
Macaca mulatta | Rhesus macaque | XM_001102884.2 | 15529 | 97.9% |
Callithris jacchus | Marmoset | XM_002745433.1 | 15566 | 97.2% |
Equus caballus | Horse | XM_001915982.1 | 15589 | 93.8% |
Ailuropoda melanoleuca | Giant panda | XM_002923821.1 | 15018 | 91% |
Oryctolagus cuniculus | Rabbit | XM_002717235.1 | 15015 | 90.7% |
Mus musculus | Mouse | NM_172679.2 | 15883 | 88.2% |
Monodelphis domestica | Opossum | XM_001370569.1 | 15048 | 82% |
Ornithorhynchus anatinus | Platypus | XM_001513933.1 | 15039 | 81.9% |
Taeniopygia guttata | Zebra finch | XM_002188249.1 | 15489 | 18.9% |
Gallus gallus | Chicken | XM_420625.2 | 15123 | 78.5% |
Tribolium castaneum | Red flour beetle | XM_967081.2 | 13797 | 48.6% |
Protein
editGeneral Properties
editKIAA1109 protein is 5005 amino acids in length,[19] and has a predicted molecular weight of 555519.38 daltons.[20] The isoelectirc point of KIAA1109 protein is predicted to be 6.12.[21]
Composition
editThe amino acid composition of KIAA1109 protein showed amino acid frequencies within 1.5% of that of normal human proteins for all but Alanine, Serine and Threonine. Alanine has a lower frequency in KIAA1109 than in that of a normal human protein while Serine and Threonine both have a higher frequency in KIAA1109 than in the average human protein.[22]
Conservation
editThe amino acid sequence of KIAA1109 is highly conserved throughout mammals. The protein identity ranges from 93.2% in Opossum to 99.8% in Chimpanzees and protein similarity is no less than 97% in all mammals included. Birds continue to show fairly high conservation with protein identities around 90% and proteins similarities at a high 96%. While conservation is still high the lower numbers may be due to small truncations on either, the 5’ and 3’ ends of these sequences.[15]
As we move to the more distant species of zebra fish and then the red four beetle and carpenter ant the conservations drops. In the insects the protein identities are down to around 34%.[15]
Genus and species name | Common name | Protein accession number[18] | Sequence length (amino acids)[18] | Sequence identity to human protein[15] | Sequence similarity to human protein[15] |
---|---|---|---|---|---|
Homo sapiens | Human | NP_056127.2 | 5005 | ||
Pan troglodytes | Chimpanzee | XP_517422.2 | 5005 | 99.8% | 99.8% |
Macaca mulatta | Rhesus macaque | XP_001102884.1 | 5007 | 99.2% | 99% |
Callithris jacchus | Marmoset | XP_002745479.1 | 5004 | 98.9% | 99% |
Equus caballus | Horse | XP_001916017.1 | 5006 | 98% | 99% |
Ailuropoda melanoleuca | Giant panda | XP_002923867.1 | 5005 | 98.1% | 99% |
Oryctolagus cuniculus | Rabbit | XP_002717281.1 | 5004 | 97.8% | 99% |
Mus musculus | Mouse | NP_766267.2 | 5005 | 96.7% | 99% |
Canis familiaris | Dog | XP_540963.2 | 4944 | 96.4% | 99% |
Monodelphis domestica | Opossum | XP_001370606.1 | 5015 | 93.2% | 97% |
Ornithorhynchus anatinus | Platypus | XP_001513983.1 | 5012 | 93.3% | 97% |
Taeniopygia guttata | Zebra finch | XP_002188285.1 | 4999 | 90.7% | 96% |
Gallus gallus | Chicken | XP_420625.2 | 5040 | 89.9% | 96% |
Danio rerio | Zebra fish | NP_001139056.1 | 4922 | 74.2% | 84% |
Tribolium castaneum | Red flour beetle | XP_972174.2 | 4598 | 34.8% | 49% |
Camponotus floridanus | Carpenter ant | EFN75044.1 | 4979 | 34.3 |
Conserved Domains
editNCBI conserved domains search identified two domains in KIAA1109. The first is the fragile site associated C-terminus, which is said to be linked to celiac disease susceptibility according to genome-wide-association studies and may also be associated with polycystic kidney disease.[23] The second conserved region identified by NCBI in KIAA1109 is an uncharacterized conserved protein (DUF2246), whose function is unknown and is conserved in various species from humans to worms.[24]
Post Translation Modifications
editKIAA1109 is predicted to undergo various types of post translational modifications including glycate, N-glycosylation, O-GlcNAc, O Glycosylation, Sulfonation and Phosphorylation.[25]
Subcellular Localization
editKIAA1109 contains one transmembrane domain from amino acids 26–46.[19] No signal peptides, mitochondrial targeting sequences or chloroplast peptides were predicted for my protein and it is therefore not predicted to localize to secretory pathway, mitochondria or chloroplast.[26]
Interacting Proteins
editMADH2 and Beta-catenin were both found to have a physical interaction with my protein as detached by display technonloy by Miyamoto-Sato et al. 2010.[27][28]
References
edit- ^ a b c GRCh38: Ensembl release 89: ENSG00000138688 – Ensembl, May 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000037270 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ Kikuno R, Nagase T, Ishikawa K, Hirosawa M, Miyajima N, Tanaka A, Kotani H, Nomura N, Ohara O (June 1999). "Prediction of the coding sequences of unidentified human genes. XIV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro". DNA Res. 6 (3): 197–205. doi:10.1093/dnares/6.3.197. PMID 10470851.
- ^ Kuo MT, Wei Y, Yang X, Tatebe S, Liu J, Troncoso P, Sahin A, Ro JY, Hamilton SR, Savaraj N (Jan 2006). "Association of fragile site-associated (FSA) gene expression with epithelial differentiation and tumor development". Biochem Biophys Res Commun. 340 (3): 887–93. doi:10.1016/j.bbrc.2005.12.088. PMID 16386706.
- ^ "Entrez Gene: KIAA1109 KIAA1109".
- ^ a b "NCBI, National Center for Biotechnology". Retrieved 5 Feb 2011.
- ^ Kumar, Kishore; Bellad, Anikha; Prasad, Pramada; Girimaji, Satish Chandra; Muthusamy, Babylakshmi (2020-06-26). "KIAA1109 gene mutation in surviving patients with Alkuraya-Kučinskas syndrome: a review of literature". BMC Medical Genetics. 21 (1): 136. doi:10.1186/s12881-020-01074-2. ISSN 1471-2350. PMC 7318400. PMID 32590954.
- ^ Verstreken, Patrik; Ohyama, Tomoko; Haueter, Claire; Habets, Ron L.P.; Lin, Yong Q.; Swan, Laura E.; Ly, Cindy V.; Venken, Koen J. T.; De Camilli, Pietro; Bellen, Hugo J. (2009-07-30). "Tweek, an evolutionary conserved proteinis required for synaptic vesicle recycling". Neuron. 63 (2): 203–215. doi:10.1016/j.neuron.2009.06.017. ISSN 0896-6273. PMC 2759194. PMID 19640479.
- ^ "Genecards". Retrieved 9 May 2011.
- ^ a b "NCBI, National Center for Biotechnology". Retrieved 5 Feb 2011.
- ^ a b "NCBI, UniGene. EST Profile".
- ^ "Genomatix, ElDorado". Retrieved 3 April 2011.
- ^ a b c d e "BLAST, NCBI". Retrieved 10 March 2011.
- ^ "AceView, NCBI". Retrieved 2 April 2011.
- ^ a b "ALIGN, SDSC Biology Workbench". Retrieved 12 March 2011.
- ^ a b c d "NCBI National Center for Biotechnology Information". Retrieved 9 May 2011.
- ^ a b "Protein, NCBI". Retrieved 18 March 2011.
- ^ "AASTATS, SDSC Biology WorkBench". Retrieved 23 April 2011.[permanent dead link ]
- ^ "PI/Mw, ExPasy". Archived from the original on 2003-10-03. Retrieved 6 May 2011.
- ^ "CLC Protein Workbench 5.5.5". Retrieved 6 May 2011.
- ^ "Conserved Domains, NCBI". Retrieved 9 May 2011.
- ^ "Conserved Domains, NCBI". Retrieved 9 May 2011.
- ^ "ExPasy Tools". Retrieved 21 April 2011.
- ^ "ChloroP, MITOPROT, Signal P and PTS1. ExPasy". Retrieved 21 April 2011.
- ^ Miyamoto-Sato E, Fujimori S, Ishizaka M, Hirai N, Masuoka K, Saito R, Ozawa Y, Hino K, Washio T, Tomita M, Yamashita T, Oshikubo T, Akasaka H, Sugiyama J, Matsumoto Y, Yanagawa H (February 2010). "A comprehensive resource of interacting protein regions for refining human transcription factor networks". PLOS ONE. 5 (2): e9289. Bibcode:2010PLoSO...5.9289M. doi:10.1371/journal.pone.0009289. PMC 2827538. PMID 20195357.
- ^ "9 binary interactions found for search term KIAA1109". IntAct Molecular Interaction Database. EMBL-EBI. Retrieved 2018-08-25.
Further reading
edit- Maruyama K, Sugano S (1994). "Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides". Gene. 138 (1–2): 171–4. doi:10.1016/0378-1119(94)90802-8. PMID 8125298.
- Suzuki Y, Yoshitomo-Nakagawa K, Maruyama K, et al. (1997). "Construction and characterization of a full length-enriched and a 5'-end-enriched cDNA library". Gene. 200 (1–2): 149–56. doi:10.1016/S0378-1119(97)00411-3. PMID 9373149.
- Nagase T, Kikuno R, Ishikawa KI, et al. (2000). "Prediction of the coding sequences of unidentified human genes. XVI. The complete sequences of 150 new cDNA clones from brain which code for large proteins in vitro". DNA Res. 7 (1): 65–73. doi:10.1093/dnares/7.1.65. PMID 10718198.
- Wei Y, Lin-Lee YC, Yang X, et al. (2006). "Molecular cloning of Chinese hamster 1q31 chromosomal fragile site DNA that is important to mdr1 gene amplification reveals a novel gene whose expression is associated with spermatocyte and adipocyte differentiation". Gene. 372: 44–52. doi:10.1016/j.gene.2005.12.024. PMID 16545529.
- van Heel DA, Franke L, Hunt KA, et al. (2007). "A genome-wide association study for celiac disease identifies risk variants in the region harboring IL2 and IL21". Nat. Genet. 39 (7): 827–9. doi:10.1038/ng2058. PMC 2274985. PMID 17558408.