ATP binding domain 4 Evolution: Difference between revisions

From MDWiki
Jump to navigationJump to search
(New page: Image:Untitled.bmp)
 
No edit summary
Line 1: Line 1:
[[Image:Untitled.bmp]]
[[METHODS]]
 
The protein that is used in this research is a putative n-type ATP pyrophosphatase from ''Pyrococcus furiosus''. This protein is predicted to be similar to ATP binding domain 4. From ‘Target Blast and Symatlas Table’, a Blastp search against the NCBI non-redundant protein database can be conducted for both sequences from human and mouse  ATP binding domain 4. Sequences with small E value were selected and Fasta format of selected sequences were retrieved. Using these sequences, a multiple sequence alignment was conducted by using ClustalX. A phylogenetic tree can be constructed based on the best multiple sequence alignment so that the evolutionary relationship between species can be observed.
 
[[RESULTS]]
 
From the Blastp search, it was found that the E value for both query sequence i.e; 1ru8A and the human ortholog, are very low. The highest E-value for 1ru8A and the human ortholog are 2e-25 and 1e-49 respectively which are very low.  Therefore, all 200 sequences were aligned by using ClustalX. From the multiple sequence alignment, it was found that only one conserved region presence in the sequences. Therefore, only extremely low E values were re-selected and unrelated sequences were ignored and deleted. Based on these new re-selected sequences, it was found that there are 7 conserved residues.
 
[[DISCUSSION]]
 
From the multiple sequence alignment, there are seven conserved regions found in all species from Domain Eukaryote and Archaea (based on Blastp search) but only three conserved residues which are significant to the function and structure of the protein since these residues are identified to be conserved amino acid sequence motif for P-loop of nucleotide binding domains which might be important in phosphate binding. These conserved residues are Serine-103, Glysine-104 and Glysine-105. Since this motif presence in uncharacterized ATP pyrophosphatase domain, the motif is called PP motif and can be written as S-G (2)-K-D-[GS].  This PP-loop motif is a modified version of the P-loop of nucleotide binding domain that is involved in phosphate binding. However, based on the multiple sequence alignment, the amino acids for PP-loop that are conserved across all species in eukaryote and archaea are Glysine-104 and Glysine-105. Substitution of Serine with Threonine in Pyrobaculum arsenaticum, Pyrobaculum calidifontis, Thermoproteus neutrophilis, Pyrobaculum islandicum and Staphlothermus marinus, are conserved since both amino acids are polar uncharged amino acid, suggesting that this residue is also important. However, this residue is not conserved in ATPBD4 in Homo sapiens which is only encoded by 259 amino acid sequences and the sequence started from Glysince-104.  Other sequences which are conserved across all species are Lysine-106 and Aspartic acid-107 where Aspartic acid-107 is also a part of PP motif. In fact, amino acid in residue 108 is also one of the important residues in PP motif. This residue is conserved across all species except for Theileria parva and Theileria annulata where Serine is substituted with Glysine.
Blastp search revealed the very low E value from many sequences from different species. These sequences appear to be encoding different group of proteins such as ATP pyrophosphatase, ATP-binding protein, ATPase and endoribonuclease. Although these proteins are responsible for different functions, the similarities of sequence between the proteins across species are very high (as indicated by E value) and the sequences which are conserved are a part of PP motif, suggesting that these proteins descend from a common ancestral sequence and therefore are paralogs. Such phenomenon could be as the result of duplication of genes within a genome.
The phylogeny tree and boostrap revealed that PP-loop motif from ATP binding domain 4 and other related proteins are found in species of Archaea and Eukaryotes suggesting that this motif is highly conserved throughout evolution. However, Bacteria found to be lacking of this conserved sequences. The PP-loop motif it can be suggested that the evolutionary model is hold since there is no evidence of the occurrence of lateral gene transfer.  Members of domain eukarya are nicely cluster at one side without any presence of other species from different domain i.e: Members from different domains are well-seperated.

Revision as of 23:05, 30 May 2009

METHODS

The protein that is used in this research is a putative n-type ATP pyrophosphatase from Pyrococcus furiosus. This protein is predicted to be similar to ATP binding domain 4. From ‘Target Blast and Symatlas Table’, a Blastp search against the NCBI non-redundant protein database can be conducted for both sequences from human and mouse ATP binding domain 4. Sequences with small E value were selected and Fasta format of selected sequences were retrieved. Using these sequences, a multiple sequence alignment was conducted by using ClustalX. A phylogenetic tree can be constructed based on the best multiple sequence alignment so that the evolutionary relationship between species can be observed.

RESULTS

From the Blastp search, it was found that the E value for both query sequence i.e; 1ru8A and the human ortholog, are very low. The highest E-value for 1ru8A and the human ortholog are 2e-25 and 1e-49 respectively which are very low. Therefore, all 200 sequences were aligned by using ClustalX. From the multiple sequence alignment, it was found that only one conserved region presence in the sequences. Therefore, only extremely low E values were re-selected and unrelated sequences were ignored and deleted. Based on these new re-selected sequences, it was found that there are 7 conserved residues.

DISCUSSION

From the multiple sequence alignment, there are seven conserved regions found in all species from Domain Eukaryote and Archaea (based on Blastp search) but only three conserved residues which are significant to the function and structure of the protein since these residues are identified to be conserved amino acid sequence motif for P-loop of nucleotide binding domains which might be important in phosphate binding. These conserved residues are Serine-103, Glysine-104 and Glysine-105. Since this motif presence in uncharacterized ATP pyrophosphatase domain, the motif is called PP motif and can be written as S-G (2)-K-D-[GS]. This PP-loop motif is a modified version of the P-loop of nucleotide binding domain that is involved in phosphate binding. However, based on the multiple sequence alignment, the amino acids for PP-loop that are conserved across all species in eukaryote and archaea are Glysine-104 and Glysine-105. Substitution of Serine with Threonine in Pyrobaculum arsenaticum, Pyrobaculum calidifontis, Thermoproteus neutrophilis, Pyrobaculum islandicum and Staphlothermus marinus, are conserved since both amino acids are polar uncharged amino acid, suggesting that this residue is also important. However, this residue is not conserved in ATPBD4 in Homo sapiens which is only encoded by 259 amino acid sequences and the sequence started from Glysince-104. Other sequences which are conserved across all species are Lysine-106 and Aspartic acid-107 where Aspartic acid-107 is also a part of PP motif. In fact, amino acid in residue 108 is also one of the important residues in PP motif. This residue is conserved across all species except for Theileria parva and Theileria annulata where Serine is substituted with Glysine. Blastp search revealed the very low E value from many sequences from different species. These sequences appear to be encoding different group of proteins such as ATP pyrophosphatase, ATP-binding protein, ATPase and endoribonuclease. Although these proteins are responsible for different functions, the similarities of sequence between the proteins across species are very high (as indicated by E value) and the sequences which are conserved are a part of PP motif, suggesting that these proteins descend from a common ancestral sequence and therefore are paralogs. Such phenomenon could be as the result of duplication of genes within a genome. The phylogeny tree and boostrap revealed that PP-loop motif from ATP binding domain 4 and other related proteins are found in species of Archaea and Eukaryotes suggesting that this motif is highly conserved throughout evolution. However, Bacteria found to be lacking of this conserved sequences. The PP-loop motif it can be suggested that the evolutionary model is hold since there is no evidence of the occurrence of lateral gene transfer. Members of domain eukarya are nicely cluster at one side without any presence of other species from different domain i.e: Members from different domains are well-seperated.