IPI:IPI00414482.2

Protein Information

ProteinIPI:IPI00414482.2
Gene SymbolGTF3C1
Protein DescriptionIsoform 1 of General transcription factor 3C polypeptide 1
Sequence Length2130
Mwt240994.612 Da
Sequence
MDQGPWRCVA HAPRAPPTEV AMDALESLLD EVALEGLDGL CLPALWSRLE 
TRVPPFPLPL EPCTQEFLWR ALATHPGISF YEEPRERPDL QLQDRYEEID 
LETGILESRR DPVALEDVYP IHMILENKDG IQGSCRYFKE RKNITNDIRT 
KSLQPRCTMV EAFDRWGKKL IIVASQAMRY RALIGQEGDP DLKLPDFSYC 
ILERLGRSRW QGELQRDLHT TAFKVDAGKL HYHRKILNKN GLITMQSHVI 
RLPTGAQQHS ILLLLNRFHV DRRSKYDILM EKLSVMLSTR TNHIETLGKL 
REELGLCERT FKRLYQYMLN AGLAKVVSLR LQEIHPECGP CKTKKGTDVM 
VRCLKLLKEF KRNDHDDDED EEVISKTVPP VDIVFERDML TQTYDLIERR 
GTKGISQAEI RVAMNVGKLE ARMLCRLLQR FKVVKGFMED EGRQRTTKYI 
SCVFAEESDL SRQYQREKAR SELLTTVSLA SMQEESLLPE GEDTFLSESD 
SEEERSSSKR RGRGSQKDTR ASANLRPKTQ PHHSTPTKGG WKVVNLHPLK 
KQPPSFPGAA EERACQSLAS RDSLLDTSSV SEPNVSFVSH CADSNSGDIA 
VIEEVRMENP KESSSSLKTG RHSSGQDKPH ETYRLLKRRN LIIEAVTNLR 
LIESLFTIQK MIMDQEKQEG VSTKCCKKSI VRLVRNLSEE GLLRLYRTTV 
IQDGIKKKVD LVVHPSMDQN DPLVRSAIEQ VRFRISNSST ANRVKTSQPP 
VPQGEAEEDS QGKEGPSGSG DSQLSASSRS ESGRMKKSDN KMGITPLRNY 
HPIVVPGLGR SLGFLPKMPR LRVVHMFLWY LIYGHPASNT VEKPSFISER 
RTIKQESGRA GVRPSSSGSA WEACSEAPSK GSQDGVTWEA EVELATETVY 
VDDASWMRYI PPIPVHRDFG FGWALVSDIL LCLPLSIFIQ IVQVSYKVDN 
LEEFLNDPLK KHTLIRFLPR PIRQQLLYKR RYIFSVVENL QRLCYMGLLQ 
FGPTEKFQDK DQVFIFLKKN AVIVDTTICD PHYNLARSSR PFERRLYVLN 
SMQDVENYWF DLQCVCLNTP LGVVRCPRVR KNSSTDQGSD EEGSLQKEQE 
SAMDKHNLER KCAMLEYTTG SREVVDEGLI PGDGLGAAGL DSSFYGHLKR 
NWIWTSYIIN QAKKENTAAE NGLTVRLQTF LSKRPMPLSA RGNSRLNIWG 
EARVGSELCA GWEEQFEVDR EPSLDRNRRV RGGKSQKRKR LKKDPGKKIK 
RKKKGEFPGE KSKRLRYHDE ADQSALQRMT RLRVTWSMQE DGLLVLCRIA 
SNVLNTKVKG PFVTWQVVRD ILHATFEESL DKTSHSVGRR ARYIVKNPQA 
YLNYKVCLAE VYQDKALVGD FMNRRGDYDD PKVCANEFKE FVEKLKEKFS 
SALRNSNLEI PDTLQELFAR YRVLAIGDEK DQTRKEDELN SVDDIHFLVL 
QNLIQSTLAL SDSQMKSYQS FQTFRLYREY KDHVLVKAFM ECQKRSLVNR 
RRVNHTLGPK KNRALPFVPM SYQLSQTYYR IFTWRFPSTI CTESFQFLDR 
MRAAGKLDQP DRFSFKDQDN NEPTNDMVAF SLDGPGGNCV AVLTLFSLGL 
ISVDVRIPEQ IIVVDSSMVE NEVIKSLGKD GSLEDDEDEE DDLDEGVGGK 
RRSMEVKPAQ ASHTNYLLMR GYYSPGIVST RNLNPNDSIV VNSCQMKFQL 
RCTPVPARLR PAAAPLEELT MGTSCLPDTF TKLINPQENT CSLEEFVLQL 
ELSGYSPEDL TAALEILEAI IATGCFGIDK EELRRRFSAL EKAGGGRTRT 
FADCIQALLE QHQVLEVGGN TARLVAMGSA WPWLLHSVRL KDREDADIQR 
EDPQARPLEG SSSEDSPPEG QAPPSHSPRG TKRRASWASE NGETDAEGTQ 
MTPAKRPALQ DSNLAPSLGP GAEDGAEAQA PSPPPALEDT AAAGAAQEDQ 
EGVGEFSSPG QEQLSGQAQP PEGSEDPRGF TESFGAANIS QAARERDCES 
VCFIGRPWRV VDGHLNLPVC KGMMEAMLYH IMTRPGIPES SLLRHYQGVL 
QPVAVLELLQ GLESLGCIRK RWLRKPRPVS LFSTPVVEEV EVPSSLDESP 
MAFYEPTLDC TLRLGRVFPH EVNWNKWIHL

Modification Site Information

Site Position 418
MS/MS spectra 1 [show]
Best localized sequence R.VAMNVGK#LEAR@.M
Matching Proteins
Site Position 435
MS/MS spectra 1 [show]
Best localized sequence K.VVK#GFMEDEGR@.Q
Matching Proteins
Site Position 667
MS/MS spectra 2 [show]
Best localized sequence K.MIMDQEK#QEGVSTK.C
Matching Proteins
Site Position 706
MS/MS spectra 7 [show]
Best localized sequence R.TTVIQDGIK#K.K
Matching Proteins
Site Position 745
MS/MS spectra 283 [show]
Best localized sequence R.VK#TSQPPVPQGEAEEDSQGK.E
Matching Proteins
Site Position 1149
MS/MS spectra 1 [show]
Best localized sequence R.EVVDEGLIPGDGLGAAGLDSSFYGHLK#R.N
Matching Proteins
Site Position 1307
MS/MS spectra 1 [show]
Best localized sequence R.IASNVLNTK#VK.G
Matching Proteins
Site Position 1332
MS/MS spectra 1 [show]
Best localized sequence R.DILHATFEESLDK#TSHSVGR.R
Matching Proteins
Site Position 1556
MS/MS spectra 35 [show]
Best localized sequence R.AAGK#LDQPDR@.F
Matching Proteins
Site Position 1629
MS/MS spectra 65 [show]
Best localized sequence K.SLGK#DGSLEDDEDEEDDLDEGVGGK.R
Matching Proteins
Site Position 1650
MS/MS spectra 59 [show]
Best localized sequence K.DGSLEDDEDEEDDLDEGVGGK#R@.R
Matching Proteins
Site Position 1792
MS/MS spectra 2 [show]
Best localized sequence R.FSALEK#AGGGR.T
Matching Proteins
Site Position 1841
MS/MS spectra 3 [show]
Best localized sequence R.LK#DR@EDADIQR@.E
Matching Proteins