Evolution ERp18


This page discusses the evolution of the target protein, endoplasmic reticulum thioredoxin superfamily member (ERp18).

Introduction

importance of evolution


Methods

To generate a collection of sequences apparently related to the target protein (ERp18), a PSI-BLAST search was conducted. PSI-BLAST is advantageous because it is iterative: selected, relevant results from each search are used to inform the next search. The method was particularly useful here because ERp18 belongs to a protein superfamily and so has many homologues that have high identity scores and low E-values but are actually different proteins.


The collected sequences were analysed using the MEGA4 suite. This program package allows:

  • multiple sequence alignment,
  • phylogenetic tree generation,
  • bootstrapping, and
  • viewing and editing of phylogenetic trees.

All of these were important to this assignment.


The Dayhoff Matrix was used to generate phylogenetic trees throughout this analysis. All bootstrap exercises used 1000 replicates.
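
The bootstrap step can be illustrated with a minimal Python sketch. It is not MEGA4's implementation: it only shows the general idea of resampling alignment columns with replacement to build 1000 pseudo-alignments, each of which would then be used to rebuild a tree. The function name `bootstrap_alignment`, the fixed seed and the toy sequences are all assumptions made for illustration.

```python
import random

def bootstrap_alignment(alignment, n_replicates=1000, seed=0):
    """Resample alignment columns with replacement.

    `alignment` maps a sequence name to an aligned sequence; all
    sequences must have the same length. Returns a list of
    pseudo-alignments of the same shape as the input.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n_cols = len(next(iter(alignment.values())))
    replicates = []
    for _ in range(n_replicates):
        # Draw n_cols column indices, with replacement.
        cols = [rng.randrange(n_cols) for _ in range(n_cols)]
        replicates.append(
            {name: "".join(seq[c] for c in cols) for name, seq in alignment.items()}
        )
    return replicates

# Hypothetical toy alignment, not real ERp18 data.
toy_alignment = {
    "seq_a": "MKCGACLL",
    "seq_b": "MKCHWCLL",
    "seq_c": "MRCHHSLV",
}

replicates = bootstrap_alignment(toy_alignment, n_replicates=1000)
print(len(replicates), replicates[0])
```

The support value reported on a branch is then the fraction of replicate trees in which that grouping reappears.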

Results

The defining motif for a protein in the thioredoxin superfamily is the CXXC motif, in which the two cysteines are the catalytic residues. It has been suggested in the literature that the 'wild card' residues between the cysteines dictate, in part, the specific molecular function (see Function ERp18).
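
As a small illustration of what 'CXXC' means in practice, the following Python sketch scans a sequence for a cysteine, any two residues, then a second cysteine. The function name `find_cxxc` and the two short sequences are hypothetical and are not taken from the alignment used on this page.

```python
import re

# C, any two residues, C. The lookahead allows overlapping matches
# (for example in "CAACAAC") to be reported as well.
CXXC = re.compile(r"(?=(C..C))")

def find_cxxc(sequence):
    """Return (0-based position, motif) for every CXXC occurrence."""
    return [(m.start(), m.group(1)) for m in CXXC.finditer(sequence.upper())]

# Hypothetical fragments, not real ERp18 sequences.
with_motif = find_cxxc("MKVLCGACAHL")
without_motif = find_cxxc("MKVLCHHSAHL")
print(with_motif)     # [(4, 'CGAC')]
print(without_motif)  # []
```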

This statement is supported, in part, by examining the evolution of proteins in different organisms that have high identity to the target protein. The alignment shows that the proteins vary in the critical CXXC residues; however, they are likely to be related.


Fig. 1 Evolutionary analysis of different Thioredoxin proteins showing different subfamilies

Figure 1 shows the CLUSTAL-W multiple sequence alignment for a selection of proteins annotated as 'Thioredoxin' or 'Thioredoxin Like' proteins. Examination of the key functional residues shows that there are at least three classes in the family, containing the CGAC, CHWC and CHHS motifs. CHHS clearly does not conform to the requirement for a thioredoxin protein, as it lacks the CXXC motif, yet it has the stated annotation. The title 'Thioredoxin-like' may refer to the overall similarity of the sequences rather than to the specific domain.
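
The grouping described above can be sketched in Python: take the four residues at the motif position in each aligned sequence, group the sequences by that string, and flag whether each group satisfies CXXC. The sequence names, the toy sequences and the motif start column are assumptions for illustration only; they are not the sequences shown in Figure 1.

```python
from collections import defaultdict

def group_by_motif(alignment, start, length=4):
    """Group sequence names by the residues found at the motif columns."""
    groups = defaultdict(list)
    for name, seq in alignment.items():
        groups[seq[start:start + length]].append(name)
    return dict(groups)

def is_cxxc(motif):
    """True if the four-residue motif starts and ends with cysteine."""
    return len(motif) == 4 and motif[0] == "C" and motif[3] == "C"

# Hypothetical aligned fragments; the motif is assumed to start at column 2.
toy_alignment = {
    "higher_organism_1": "MKCGACLL",
    "higher_organism_2": "MRCGACLV",
    "bacterium_1": "MKCHWCLL",
    "thioredoxin_like_1": "MRCHHSLV",
}

groups = group_by_motif(toy_alignment, start=2)
for motif, names in groups.items():
    print(motif, "CXXC" if is_cxxc(motif) else "not CXXC", names)
```

In this toy input the CHHS group is the only one flagged as not CXXC, mirroring the observation made about the annotated 'Thioredoxin Like' sequences.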

Another observation from Figure 1 is the conservation of the CXXC motif between higher organisms (Homo sapiens, Salmo salar, etc.) and bacteria and archaea. Bacteria and archaea contain the motif 'CHWC' in place of the 'CGAC' seen in higher organisms. This suggests that the ERp18 protein has evolved from bacteria or archaea. It is unlikely that lateral gene transfer has occurred; however, it cannot be ruled out based on this evidence.

Figure 2 shows in detail the relationship between the three apparent classes of proteins. In both groups of 'higher' organisms a similar trend can be seen, in which the proteins of more complex organisms are further from the common ancestor.

Fig. 2 Unrooted phylogenetic tree showing homologies of different proteins labelled as 'Thioredoxins'

Unfortunately, a detailed search for related ERp18 proteins returned only a small number of sequences. However, there are enough to form a concise phylogenetic tree.

Fig. 3
Fig. 4

Phylogenetic Trees

[[Image:tree_cml.jpg|thumb|600px|Fig. 3]]

add other zoomed in trees


Discussion

related to what organisms?

distant from other organisms?

important enzyme?