Talk:Protein Function

From MDWiki
Revision as of 04:33, 15 May 2007 by S4079195 (talk | contribs)
Jump to navigationJump to search

Information From 8th May 2007

MIF4G is the middle domain of eukaryotic initiation factor 4G (eIF4G). It also occurs in NMD2p (non-sense mediated mRNA decay protein - it is involved in the non-sense mediated decay of mRNAs containing premature stop codons) and in CBP80 (Cap binding protein).

The protein binds eIF4A, eIF3, RNA and DNA Therefore part of function is to bind to RNA

Possibly located in the cytoplasm - See link to LOCATE. Mouse protein of similar seuqence in this location.

MIF4G starts residue 28 Ends 240 (mouse)

It is soluble and non-secreted.

PA74324.2 Riken cDNA 2310075612 Rik Protein - AAH26740, AAH55812(mouse), AAH33759(human)

AAH55812 - Rik Protein Mouse. Present in the cerebellum, Striatum, Eye, Wholebrain, Liver, Hippocampus, Hematopoietic Stem Cells and Kidney Accession No: BC055812.1

Performed a MultiLoc prediction that determines location of the protein based on Amino Acid sequence and the presence etc of a N-termial targeting sequence. There is a 0.93 Probability that the protein is cytoplasmic. Now I have to find specific location, what the protein binds to and the structure of what it binds to. If i can identify the structure of the binding domain then I can predict to some extent the structure or a very small peice of the structure ie active site and can use this to perform function based analysis?


ProFunc Analysis:

http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/profunc/GetResults.pl?source=profunc&user_id=ay65&code=075103

Showed that the domain contains an ARM repeat. Further research into this will be done. Eliza found the same thing


Obtained Sequences

Human - Protein Sequ

mgepsreeyk iqsfdaetqq llktalkvac fetedgeysv cqrsysncsr lmpsrcntqy

rdpgavdlek vanvivdhsl qdcvfskeag rmcyaiiqae skqagqsvfr rgllnrlqqe

yqareqlrar slqgwvcyvt ficnifdylr vnnmpmmalv npvydclfrl aqpdslskee

evdclvlqlh rvgeqlekmn gqrmdelfvl irdgfllptg lsslaqllll eiiefraagw

kttpaahkyy ysevsd


>AAH26740 ARM repeat, position: 13-208 (Mouse)

SFDAQTQQLLKTALKDPGAVDLERVANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQKEYDAREQ

LRACSLQGWVCYVTFICNIFDYLRVNNMPMMALVNPVYDCLFQLAQPESLSREEEVDCLVLQLHRVGEQLEKMNGQRMDE

LFILIRDGFLLPTDLSSLARLLLLEMIEFRAAGWK


Mouse - Protein

mseasrddyk iqsfdaetqq llktalkdps avdlervanv ivdhslqdcv fskeagrmcy

aiiqaeskqa gqsvfrrgll nrlqkeydar eqlracslqg wvcyvtficn ifdylrvnnm

pmmalvnpvy dclfqlaqpe slsreeevdc lvlqlhrvge qlekmngqrm delfilirdg

fllptdlssl arllllemie fraagwkttp aahkyyysev sd


FASTA - Human

>gi|21707112|gb|AAH33759.1| MIF4G domain containing [Homo sapiens]

MGEPSREEYKIQSFDAETQQLLKTALKVACFETEDGEYSVCQRSYSNCSRLMPSRCNTQYRDPGAVDLEK

VANVIVDHSLQDCVFSKEAGRMCYAIIQAESKQAGQSVFRRGLLNRLQQEYQAREQLRARSLQGWVCYVT

FICNIFDYLRVNNMPMMALVNPVYDCLFRLAQPDSLSKEEEVDCLVLQLHRVGEQLEKMNGQRMDELFVL

IRDGFLLPTGLSSLAQLLLLEIIEFRAAGWKTTPAAHKYYYSEVSD