LOC56985 Evolution Main: Difference between revisions
No edit summary |
No edit summary |
||
Line 14: | Line 14: | ||
The results for the CLUSTALX alignment can be viewed [[clustal alignment| here]]. | The results for the CLUSTALX alignment can be viewed [[clustal alignment| here]]. | ||
"The line above the ruler is used to mark strongly conserved positions. Three characters ('*', ':' and | "The line above the ruler is used to mark strongly conserved positions. Three characters ('*', ':' and '.') are used: | ||
'.') are used: | |||
'*' indicates positions which have a single, fully conserved residue | '*' indicates positions which have a single, fully conserved residue | ||
Revision as of 22:33, 3 June 2008
BLASTP Search Results
The initial top 60 blast search results, given in FASTA format, can be found in the link below.
For the purpose of creating a phylogenetic tree, sequences from the same species were removed and any sequences containing large gaps in the intial CLUSTAL alignment were also removed. The edited version of the blast search results used for the final clustal alignment can be seen below. Please note that {H} stands for hypothetical protein and {P} stands for predicted protein.
CLUSTALX Alignment
The results for the CLUSTALX alignment can be viewed here.
"The line above the ruler is used to mark strongly conserved positions. Three characters ('*', ':' and '.') are used:
'*' indicates positions which have a single, fully conserved residue
':' indicates that one of the following 'strong' groups is fully conserved:- STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW
'.' indicates that one of the following 'weaker' groups is fully conserved:- CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, FVLIM, HFY
These are all the positively scoring groups that occur in the Gonnet Pam250 matrix. The strong and weak groups are defined as strong score 0.5 and weak score =<0.5 respectively." (Thompson, J.D. et. al. 1997)
Notice the cluster of residues that are invariable (ie '*') across all species. It is likely that these particular residues are important for the Structure of this protein.