LOC56985 Evolution Main: Difference between revisions
No edit summary |
No edit summary |
||
Line 45: | Line 45: | ||
and weak groups are defined as strong score 0.5 and weak score =<0.5 respectively." (Thompson, J.D. et. al. 1997) | and weak groups are defined as strong score 0.5 and weak score =<0.5 respectively." (Thompson, J.D. et. al. 1997) | ||
Notice the cluster of residues that are invariable (ie '*') across all species. It is likely that these particular residues are important for the [[LOC56985 Structure Main|Structure]] of this protein. | |||
Revision as of 22:31, 3 June 2008
BLASTP Search Results
The initial top 60 blast search results, given in FASTA format, can be found in the link below.
For the purpose of creating a phylogenetic tree, sequences from the same species were removed and any sequences containing large gaps in the intial CLUSTAL alignment were also removed. The edited version of the blast search results used for the final clustal alignment can be seen below. Please note that {H} stands for hypothetical protein and {P} stands for predicted protein.
CLUSTALX Alignment
The results for the CLUSTALX alignment can be viewed here.
"The line above the ruler is used to mark strongly conserved positions. Three characters ('*', ':' and '.') are used: '*' indicates positions which have a single, fully conserved residue
':' indicates that one of the following 'strong' groups is fully conserved:- STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW
'.' indicates that one of the following 'weaker' groups is fully conserved:- CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, FVLIM, HFY
These are all the positively scoring groups that occur in the Gonnet Pam250 matrix. The strong and weak groups are defined as strong score 0.5 and weak score =<0.5 respectively." (Thompson, J.D. et. al. 1997)
Notice the cluster of residues that are invariable (ie '*') across all species. It is likely that these particular residues are important for the Structure of this protein.