Evolution ERp18
This page discusses the evolution of the target protein, Endoplasmic reticulum thioredoxin superfamily member.
Introduction
importance of evolution
Methods
To generate a collection of sequences which were, apparently, related to the target protein (ERp18), a PSI-Blast search was conducted. PSI-BLAST is advantageous as it uses an iterative approach whereby selected, relavent results from previous searches are used to inform the next search operation. The PSI-BLAST method was particularly useful in this circumstance since ERp18 is part of a superfamily of proteins and so has many homologues which have high identity scores and low e-vaulues but are actually different proteins.
The collected sequences were analysed using the MEGA4 suite. This program packages allows:
- multiple sequence alignment,
- phylogenetic tree generation,
- bootstrapping, and,
- viewing and editing of phylogenetic trees
All of which were important to this assignment.
The Dayhoff Matrix was used to generate phylogenetic trees throughout this analysis
Results
Sequence Collection
The defining motif for a protein in the thioredoxin protein superfamily is the CXXC motif, with the two cystines representing the catalytic residues. It has been suggested in literature the the 'wild card' proteins between the cystines, in part, dictact the specific molecular function (see Function ERp18).
This statement is supported in part, by examining the evolution of proteins which have high identities to the target protein in different organisms. Examining the alignment shows the proteins are varied in the critical CXXC residues however they are likley to be related.
Figure 1 shows the CLUSTAL-W multiple sequence alignment for a selection of proteins which are annotated as 'Thioredoxin' or 'Thioredoxin Like' proteins. Examination of the key functional residues shows that there are at least three classes in the family, containing the CGAC, CHWC, CHHS motifs. CHHS clearly does not conform to the requirement for being a thioredoxin protein as it does not contain the CXXC motif, yet is has the stated annotation. The title 'Thioredoxin-like' may refer to the overall similarity in the sequences rather than the spefic domain.
Another observation from figure 1, is the convervation of the CXXC motif between higher organisms (Homo Sapiens, Salmo Salar, etc) and bacteria and archea. Bacteria and Archea contain the motif 'CHWC', in place of 'CGAC' which is seen in higher organisms. This suggests that the ERp18 protein has evolved from bacteria or archea. It is unlikley that lateral gene transfer has occured, however, it cannot be ruled out based on this evidence.
Phylogenetic Trees
[[Image:tree_cml.jpg]|Fig. 3]
add other zoomed in trees
Discussion
related to what organisms?
distant from other orgaisms?
important enzyme?