Sequence.: Difference between revisions

From MDWiki
Jump to navigationJump to search
No edit summary
No edit summary
Line 1: Line 1:
Multiple sequence alignments:
Multiple sequence alignments:


NonredundantMSA is an alignment a blast search of the nonredundant database. Some of these sequences are cDNA/protein, but many are genome annotations with unknown function. For some organisms, a few variants of the sequence are included, so it's redundant in that sense. This has both prokaryotic and eukaryotic sources.
[[Image:NonredundantMSA.pdf]]
NonredundantMSAtop is the alignment repeated without the less similar sequences (mostly without the prokaryotic ones).
These MSAs show that the protein has been well conserved. They also show that there are some residues which have been almost 100% conserved in ~180 sequences. I will use these for my tree, but if you use these to make other inferences, keep in mind that some of these genes may not code for ssu72...


[[Image:NonredundantMSA.pdf]]
An alignment of a blast search of the NCBI's nonredundant database. Some of the sequences are protein or translated cDNA, but many come from genome annotations, with no functional information. For some organisms, a few variants of the sequence are included. The sources are both prokaryotes and eukaryotes.
(~180 sequences)


[[Image:TopNonredundantMSA.pdf]]
[[Image:TopNonredundantMSA.pdf]]


A realignment of the most similar sequences from above.
(155 sequences)


SwissProtMSA is an alignment from a blast search of SwissProt, so there is functional information for each sequence, and the data is higher quality. There are 24 sequences: 21 ssu72 from a variety of different organisms and 3 ssu72 homologues from human.  
These two MSAs show that the sequence has been well conserved - some of the residues almost 100% conserved over a wide variety of organisms. I will use these for the phylogenetic tree. However, if you use these to make other inferences, keep in mind that some of these sequences may not code for ssu72 (or anything else)...
Since these are ssu72 sequences, this is the more useful MSA for noting conserved regions.


[[Image:SwissProtMSA.pdf]]
[[Image:SwissProtMSA.pdf]]
An alignment from a blast search of SwissProt. The data is higher quality and less redundant, and there is functional information for each sequence. Of the 24 sequences, 3 are human ssu72 homologues, and the others are ssu72 from various organisms.
This MSA shows the conserved regions for ssu72. Since these are ssu72 sequences, I think the residues that show up as conserved here are probably important for structure/function.

Revision as of 12:15, 6 June 2009

Multiple sequence alignments:

File:NonredundantMSA.pdf

An alignment of a blast search of the NCBI's nonredundant database. Some of the sequences are protein or translated cDNA, but many come from genome annotations, with no functional information. For some organisms, a few variants of the sequence are included. The sources are both prokaryotes and eukaryotes. (~180 sequences)

File:TopNonredundantMSA.pdf

A realignment of the most similar sequences from above. (155 sequences)

These two MSAs show that the sequence has been well conserved - some of the residues almost 100% conserved over a wide variety of organisms. I will use these for the phylogenetic tree. However, if you use these to make other inferences, keep in mind that some of these sequences may not code for ssu72 (or anything else)...

File:SwissProtMSA.pdf

An alignment from a blast search of SwissProt. The data is higher quality and less redundant, and there is functional information for each sequence. Of the 24 sequences, 3 are human ssu72 homologues, and the others are ssu72 from various organisms. This MSA shows the conserved regions for ssu72. Since these are ssu72 sequences, I think the residues that show up as conserved here are probably important for structure/function.