IPI:IPI00019848.2
Protein Information
Protein | IPI:IPI00019848.2 |
---|---|
Gene Symbol | HCFC1 |
Protein Description | Isoform 1 of Host cell factor |
Sequence Length | 2035 |
Molecular Weight | 208602.172 Da |
Sequence | MASAVSPANL PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP GSGVVAWDIP ITYGVLPPPR ESHTAVVYTE KDNKKSKLVI YGGMSGCRLG DLWTLDIDTL TWNKPSLSGV APLPRSLHSA TTIGNKMYVF GGWVPLVMDD VKVATHEKEW KCTNTLACLN LDTMAWETIL MDTLEDNIPR ARAGHCAVAI NTRLYIWSGR DGYRKAWNNQ VCCKDLWYLE TEKPPPPARV QLVRANTNSL EVSWGAVATA DSYLLQLQKY DIPATAATAT SPTPNPVPSV PANPPKSPAP AAAAPAVQPL TQVGITLLPQ AAPAPPTTTT IQVLPTVPGS SISVPTAART QGVPAVLKVT GPQATTGTPL VTMRPASQAG KAPVTVTSLP AGVRMVVPTQ SAQGTVIGSS PQMSGMAALA AAAAATQKIP PSSAPTVLSV PAGTTIVKTM AVTPGTTTLP ATVKVASSPV MVSNPATRML KTAAAQVGTS VSSATNTSTR PIITVHKSGT VTVAQQAQVV TTVVGGVTKT ITLVKSPISV PGGSALISNL GKVMSVVQTK PVQTSAVTGQ ASTGPVTQII QTKGPLPAGT ILKLVTSADG KPTTIITTTQ ASGAGTKPTI LGISSVSPST TKPGTTTIIK TIPMSAIITQ AGATGVTSSP GIKSPITIIT TKVMTSGTGA PAKIITAVPK IATGHGQQGV TQVVLKGAPG QPGTILRTVP MGGVRLVTPV TVSAVKPAVT TLVVKGTTGV TTLGTVTGTV STSLAGAGGH STSASLATPI TTLGTIATLS SQVINPTAIT VSAAQTTLTA AGGLTTPTIT MQPVSQPTQV TLITAPSGVE AQPVHDLPVS ILASPTTEQP TATVTIADSG QGDVQPGTVT LVCSNPPCET HETGTTNTAT TTVVANLGGH PQPTQVQFVC DRQEAAASLV TSTVGQQNGS VVRVCSNPPC ETHETGTTNT ATTATSNMAG QHGCSNPPCE THETGTTNTA TTAMSSVGAN HQRDARRACA AGTPAVIRIS VATGALEAAQ GSKSQCQTRQ TSATSTTMTV MATGAPCSAG PLLGPSMARE PGGRSPAFVQ LAPLSSKVRL SSPSIKDLPA GRHSHAVSTA AMTRSSVGAG EPRMAPVCES LQGGSPSTTV TVTALEALLC PSATVTQVCS NPPCETHETG TTNTATTSNA GSAQRVCSNP PCETHETGTT HTATTATSNG GTGQPEGGQQ PPAGRPCETH QTTSTGTTMS VSVGALLPDA TSSHRTVESG LEVAAAPSVT PQAGTALLAP FPTQRVCSNP PCETHETGTT HTATTVTSNM SSNQDPPPAA SDQGEVESTQ GDSVNITSSS AITTTVSSTL TRAVTTVTQS TPVPGPSVPP PEELQVSPGP RQQLPPRQLL QSASTALMGE SAEVLSASQT PELPAAVDLS STGEPSSGQE SAGSAVVATV VVQPPPPTQS EVDQLSLPQE LMAEAQAGTT TLMVTGLTPE ELAVTAAAEA AAQAAATEEA QALAIQAVLQ AAQQAVMGTG EPMDTSEAAA TVTQAELGHL SAEGQEGQAT TIPIVLTQQE LAALVQQQQL QEAQAQQQHH HLPTEALAPA DSLNDPAIES NCLNELAGTV PSTVALLPST ATESLAPSNT FVAPQPVVVA SPAKLQAAAT LTEVANGIES LGVKPDLPPP 
PSKAPMKKEN QWFDVGVIKG TNVMVTHYFL PPDDAVPSDD DLGTVPDYNQ LKKQELQPGT AYKFRVAGIN ACGRGPFSEI SAFKTCLPGF PGAPCAIKIS KSPDGAHLTW EPPSVTSGKI IEYSVYLAIQ SSQAGGELKS STPAQLAFMR VYCGPSPSCL VQSSSLSNAH IDYTTKPAII FRIAARNEKG YGPATQVRWL QETSKDSSGT KPANKRPMSS PEMKSAPKKS KADGQ |
Modification Site Information
Site Position | 363 |
---|---|
MS/MS spectra | 2 |
Best localized sequence | K.DLWYLETEK#PPPPAR.V |
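The site position can be cross-checked against the best localized sequence. The sketch below is illustrative only: it assumes the common flanked-peptide notation (`previous residue.peptide.next residue`) and assumes that `#` marks the residue immediately before it as the modified one. The function names `parse_flanked_peptide` and `locate_sites` are hypothetical, and only residues 351-370 of the sequence above are used to keep the example short.

```python
def parse_flanked_peptide(annotated):
    """Split e.g. 'K.DLWYLETEK#PPPPAR.V' into its parts.

    Returns (previous residue, bare peptide, 0-based offsets of
    modified residues within the peptide, next residue).
    """
    prev_res, core, next_res = annotated.split(".")
    residues = []
    mod_offsets = []
    for ch in core:
        if ch == "#":
            # Assumption: '#' flags the residue immediately before it.
            mod_offsets.append(len(residues) - 1)
        else:
            residues.append(ch)
    return prev_res, "".join(residues), mod_offsets, next_res


def locate_sites(fragment, fragment_start, annotated):
    """Return 1-based protein positions of the modified residues.

    fragment       -- a stretch of the protein sequence
    fragment_start -- 1-based protein position of fragment[0]
    """
    prev_res, peptide, mod_offsets, next_res = parse_flanked_peptide(annotated)
    # Match the peptide together with its flanking residues.
    idx = fragment.find(prev_res + peptide + next_res)
    if idx < 0:
        raise ValueError("peptide with flanking residues not found")
    peptide_start = fragment_start + idx + len(prev_res)
    return [peptide_start + off for off in mod_offsets]


# Residues 351-370 of the sequence listed in the Protein Information table.
FRAGMENT = "VCCKDLWYLETEKPPPPARV"
FRAGMENT_START = 351

sites = locate_sites(FRAGMENT, FRAGMENT_START, "K.DLWYLETEK#PPPPAR.V")
print(sites)  # [363]
```

Under these assumptions the marked lysine falls at position 363, which agrees with the Site Position reported above.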