Evolution.

From MDWiki
Jump to navigationJump to search

Multiple sequence alignments:

File:SelNonredundantMSA.pdf

An alignment of a blast search of the NCBI's nonredundant database. Some of the sequences are protein or translated cDNA, but many come from genome annotations, with no functional information. The sources are both prokaryotes and eukaryotes.

This MSA shows that the sequence has been well conserved - some of the residues almost 100% conserved over a wide variety of organisms. However, if you use these to make other inferences, keep in mind that some of these sequences may not code for ssu72 (or anything else)... I've removed the shorter sequences. (H/V)C(X5)R(S/T) is very well conserved.

File:SwissProtMSA2.pdf

An alignment from a blast search of SwissProt. The data is higher quality and less redundant.

The (H/V)C(X5)R(S/T) is conserved in all the SwissProt sequences, except for Cryptococcus neoformans ('Cryne'). I don't know yet why it isn't conserved in C. neoformans; the sequence seems fine otherwise and comes up where it should in the tree. The sequence comes from gene prediction; so maybe it's just too short to include that conserved region.

3fdfTree2.jpg

Tree, from SwissProt hits. Forget how I described the tree before: it splits into fungi and animals - and one slime mould. Drosophila is in with the animals, but fairly distant from human, as you'd expect.

File:SwissProtTax.txt

The taxonomy of the hits from SwissProt. This confirms that the tree is showing the expected phylogeny.

DrosophilaVsHuman.png

A dot matrix alignment between Drosophila (x-axis) and Human (y-axis) ssu72. The proteins are virtually identical.

File:SSMPhosphatases.pdf

Alignment of 3fdf (bottom sequence) to 7 phosphatases with similar secondary structure (top sequences). Conserved regions at 10 and 100, and occasional conserved residues. (Note phosphatases align much better with each other than with 3fdf)