YLPM1
YLP-motivni protein 1 je protein koji je kod ljudi kodiran genom YLPM1. Genski lokus YLP, nalazi se na dugom (q) kraku hromosoma 14, sekvenca 14q24.3.[5][6]
Projekt Zbirka gena sisara (MGC) Nacionalnog instituta za zdravlje osmišljen je za generiranje i sekvenciranje javno dostupnog izvora cDNK koji sadrži potpuni otvoreni okvir čitanja (ORF) za svaki gen čovjeka i miša. Projekt je u početku koristio slučajnu strategiju za selekciju klonova iz velikog broja biblioteka cDNK iz različitih tkiva. Klonovi kandidati su izabrani na osnovu 5'-EST sekvenci, a zatim su potpuno sekvencirani do velike tačnosti i analizirani algoritmima razvijenim za ovaj projekt. Postoji više od 11.000 ljudskih i 10.000 mišjih gena, predstavljeno u MGC-u od najmanje jednog klona s punim ORF-om. Pristup nasumičnom selekcijom sada dostiže tačku zasićenja, a prijelaz na protokole usmjerene na nedostajuće transkripte, sada je potreban za kompletiranje zbirki miševa i ljudi. Usporedba sekvence MGC klonova sa referentnim sekvencama genoma otkriva da je većina klonova cDNK vrlo visokog kvaliteta sekvence, iako je vjerovatno da neke cDNK mogu nositi pogrešne varijante kao posljedicu eksperimentalnog artefakta, poput PCR-a, kloniranja ili greške obrnute transkripcije. Nedavno je projektu dodana komponenta pacovske cDNK, a tekući projekti cDNK žabe (ksenopusa) i ribe zebrica (rod Danio) prošireni su, kako bi se iskoristili prednosti visokopropusnog MGC kanala.[7]
Aminokiselinska sekvenca
[uredi | uredi izvor]Dužina polipeptidnog lanca je 1.951 aminokiselina, a molekulska težina 219 985 Da.[8].
10 | 20 | 30 | 40 | 50 | ||||
---|---|---|---|---|---|---|---|---|
MYPNWGRYGG | SSHYPPPPVP | PPPPVALPEA | SPGPGYSSST | TPAAPSSSGF | ||||
MSFREQHLAQ | LQQLQQMHQK | QMQCVLQPHH | LPPPPLPPPP | VMPGGGYGDW | ||||
QPPPPPMPPP | PGPALSYQKQ | QQYKHQMLHH | QRDGPPGLVP | MELESPPESP | ||||
PVPPGSYMPP | SQSYMPPPQP | PPSYYPPTSS | QPYLPPAQPS | PSQSPPSQSY | ||||
LAPTPSYSSS | SSSSQSYLSH | SQSYLPSSQA | SPSRPSQGHS | KSQLLAPPPP | ||||
SAPPGNKTTV | QQEPLESGAK | NKSTEQQQAA | PEPDPSTMTP | QEQQQYWYRQ | ||||
HLLSLQQRTK | VHLPGHKKGP | VVAKDTPEPV | KEEVTVPATS | QVPESPSSEE | ||||
PPLPPPNEEV | PPPLPPEEPQ | SEDPEEDARL | KQLQAAAAHW | QQHQQHRVGF | ||||
QYQGIMQKHT | QLQQILQQYQ | QIIQPPPHIQ | ATTPPPGIPP | PGVPQGIPPQ | ||||
LTAAPVPPAS | SSQSSQVPEK | PRPALLPTPV | SFGSAPPTTY | HPPLQSAGPS | ||||
EQVNSKAPLS | KSALPYSSFS | SDQGLGESSA | APSQPITAVK | DMPVRSGGLL | ||||
PDPPRSSYLE | SPRGPRFDGP | RRFEDLGSRC | EGPRPKGPRF | EGNRPDGPRP | ||||
RYEGHPAEGT | KSKWGMIPRG | PASQFYITPS | TSLSPRQSGP | QWKGPKPAFG | ||||
QQHQQQPKSQ | AEPLSGNKEP | LADTSSNQQK | NFKMQSAAFS | IAADVKDVKA | ||||
AQSNENLSDS | QQEPPKSEVS | EGPVEPSNWD | QNVQSMETQI | DKAQAVTQPV | ||||
PLANKPVPAQ | STFPSKTGGM | EGGTAVATSS | LTADNDFKPV | GIGLPHSENN | ||||
QDKGLPRPDN | RDNRLEGNRG | NSSSYRGPGQ | SRMEDTRDKG | LVNRGRGQAI | ||||
SRGPGLVKQE | DFRDKMMGRR | EDSREKMNRG | EGSRDRGLVR | PGSSREKVPG | ||||
GLQGSQDRGA | AGSRERGPPR | RAGSQERGPL | RRAGSRERIP | PRRAGSRERG | ||||
PPRGPGSRER | GLGRSDFGRD | RGPFRPEPGD | GGEKMYPYHR | DEPPRAPWNH | ||||
GEERGHEEFP | LDGRNAPMER | ERLDDWDRER | YWRECERDYQ | DDTLELYNRE | ||||
DRFSAPPSRS | HDGDRRGPWW | DDWERDQDMD | EDYNREMERD | MDRDVDRISR | ||||
PMDMYDRSLD | NEWDRDYGRP | LDEQESQFRE | RDIPSLPPLP | PLPPLPPLDR | ||||
YRDDRWREER | NREHGYDRDF | RDRGELRIRE | YPERGDTWRE | KRDYVPDRMD | ||||
WERERLSDRW | YPSDVDRHSP | MAEHMPSSHH | SSEMMGSDAS | LDSDQGLGGV | ||||
MVLSQRQHEI | ILKAAQELKM | LREQKEQLQK | MKDFGSEPQM | ADHLPPQESR | ||||
LQNTSSRPGM | YPPPGSYRPP | PPMGKPPGSI | VRPSAPPARS | SVPVTRPPVP | ||||
IPPPPPPPPL | PPPPPVIKPQ | TSAVEQERWD | EDSFYGLWDT | NDEQGLNSEF | ||||
KSETAAIPSA | PVLPPPPVHS | SIPPPGPVPM | GMPPMSKPPP | VQQTVDYGHG | ||||
RDISTNKVEQ | IPYGERITLR | PDPLPERSTF | ETEHAGQRDR | YDRERDREPY | ||||
FDRQSNVIAD | HRDFKRDRET | HRDRDRDRGV | IDYDRDRFDR | ERRPRDDRAQ | ||||
SYRDKKDHSS | SRRGGFDRPS | YDRKSDRPVY | EGPSMFGGER | RTYPEERMPL | ||||
PAPSLSHQPP | PAPRVEKKPE | SKNVDDILKP | PGRESRPERI | VVIMRGLPGS | ||||
GKTHVAKLIR | DKEVEFGGPA | PRVLSLDDYF | ITEVEKEEKD | PDSGKKVKKK | ||||
VMEYEYEAEM | EETYRTSMFK | TFKKTLDDGF | FPFIILDAIN | DRVRHFDQFW | ||||
SAAKTKGFEV | YLAEMSADNQ | TCGKRNIHGR | KLKEINKMAD | HWETAPRHMM | ||||
RLDIRSLLQD | AAIEEVEMED | FDANIEEQKE | EKKDAEEEES | ELGYIPKSKW | ||||
EMDTSEAKLD | KLDGLRTGTK | RKRDWEAIAS | RMEDYLQLPD | DYDTRASEPG | ||||
KKRVRWADLE | EKKDADRKRA | IGFVVGQTDW | EKITDESGHL | AEKALNRTKY | ||||
I |
- Simboli
C: Cistein
D: Asparaginska kiselina
E: Glutaminska kiselina
F: Fenilalanin
G: Glicin
H: Histidin
I: Izoleucin
K: Lizin
L: Leucin
M: Metionin
N: Asparagin
P: Prolin
Q: Glutamin
R: Arginin
S: Serin
T: Treonin
V: Valin
W: Triptofan
Y: Tirozin
Reference
[uredi | uredi izvor]- ^ a b c GRCh38: Ensembl release 89: ENSG00000119596 - Ensembl, maj 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000021244 - Ensembl, maj 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ Sherrington R, Rogaev EI, Liang Y, Rogaeva EA, Levesque G, Ikeda M, Chi H, Lin C, Li G, Holman K, et al. (Aug 1995). "Cloning of a gene bearing missense mutations in early-onset familial Alzheimer's disease". Nature. 375 (6534): 754–60. doi:10.1038/375754a0. PMID 7596406.
- ^ "Entrez Gene: YLPM1 YLP motif containing 1".
- ^ Daniela S Gerhard et al. (2004): The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res, 14(10B):2121-7, doi: 10.1101/gr.2596504, pmid: 15489334, pmcid: pmc528928, doi: 10.1101/gr.2596504
- ^ "UniProt, P49750". Pristupljeno 11. 9. 2017.
Dopunska literatura
[uredi | uredi izvor]- Strausberg RL, Feingold EA, Grouse LH, et al. (2003). "Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences". Proc. Natl. Acad. Sci. U.S.A. 99 (26): 16899–903. doi:10.1073/pnas.242603899. PMC 139241. PMID 12477932.
- Heilig R, Eckenberg R, Petit JL, et al. (2003). "The DNA sequence and analysis of human chromosome 14". Nature. 421 (6923): 601–7. doi:10.1038/nature01348. PMID 12508121.
- Ota T, Suzuki Y, Nishikawa T, et al. (2004). "Complete sequencing and characterization of 21,243 full-length human cDNAs". Nat. Genet. 36 (1): 40–5. doi:10.1038/ng1285. PMID 14702039.
- Beausoleil SA, Jedrychowski M, Schwartz D, et al. (2004). "Large-scale characterization of HeLa cell nuclear phosphoproteins". Proc. Natl. Acad. Sci. U.S.A. 101 (33): 12130–5. doi:10.1073/pnas.0404720101. PMC 514446. PMID 15302935.
- Gerhard DS, Wagner L, Feingold EA, et al. (2004). "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)". Genome Res. 14 (10B): 2121–7. doi:10.1101/gr.2596504. PMC 528928. PMID 15489334.
- Armstrong L, Lako M, van Herpe I, et al. (2005). "A role for nucleoprotein Zap3 in the reduction of telomerase activity during embryonic stem cell differentiation". Mech. Dev. 121 (12): 1509–22. doi:10.1016/j.mod.2004.07.005. PMID 15511642.
- Kimura K, Wakamatsu A, Suzuki Y, et al. (2006). "Diversification of transcriptional modulation: large-scale identification and characterization of putative alternative promoters of human genes". Genome Res. 16 (1): 55–65. doi:10.1101/gr.4039406. PMC 1356129. PMID 16344560.
- Olsen JV, Blagoev B, Gnad F, et al. (2006). "Global, in vivo, and site-specific phosphorylation dynamics in signaling networks". Cell. 127 (3): 635–48. doi:10.1016/j.cell.2006.09.026. PMID 17081983.
- Ulke-Lemée A, Trinkle-Mulcahy L, Chaulk S, et al. (2007). "The nuclear PP1 interacting protein ZAP3 (ZAP) is a putative nucleoside kinase that complexes with SAM68, CIA, NF110/45, and HNRNP-G". Biochim. Biophys. Acta. 1774 (10): 1339–50. doi:10.1016/j.bbapap.2007.07.015. PMID 17890166.