LOC56985 results

From MDWiki
Jump to navigationJump to search

Evolution

The ClustalX multiple sequence alignment showed various residues that were conserved throughout all species, these are indicated by a '*' above the sequences. These residues are likely play an important role in the structure of the protein.

Figure 1: Sequence alignment of high similarity matches to LOC56985 determined using BLASTp. Three highly conserved regions are shown including the GNHE/D region common to all phosphoresterases. This picture only shows part of the multiple sequence alignment attained through ClustalX. To see the full alingnment click here.


Protein Tree


Figure 2: Phylogenetic tree generated using the TreeView program from the above ClustalX alignment. The taxonomic groupings appear in a logical arrangement with eukaryotes, plants and bacteria grouped in the same lineages. The exception is Trypanosoma cruzi, a eukaryotic protozoa grouped in the bacterial lineage. Key: Orange = bacteria, green = plants and blue = vertebrates.



The protein tree produced from the aligned sequences is analogous with traditional phylogenetic trees, with the exception of the Trypanosoma. Trypanosoma is a parasitic protozoa, which means that it is an eukaryote. In this tree it has been classified along with the bacteria. The bootstrap value for its position in the tree is relatively high at almost 75%, suggesting that perhaps Trypanosoma has acquired this protein by means of lateral transmission. Apart from that single exemplified organism, given the similarity of this protein tree with traditional taxonomy groupings, it seems likely that the usual mode of inheritance for this protein is through vertical transmission.

Bootstrapping values are considerably high, with most branches having a value of greater than 60% and only one branch in the alphaproteobacteria group having less than a 50% confidence.


Structure

Figure 3: Table showing the top 20 structural homologs of the Danio rerio gene ortholog. Match 1 is the Danio rerio protein (2nxf-A); 2dxl-A is a phosphorhydrolase from Enterbacter aerogenes, followed by purple acid phosphatases from human (1war-A), Norwegian rat (1qhw-A) and pig (1ute-A). The remainder of the top homologs shown here are predominantly from Kidney Bean(Phaseolus vulgaris) purple acid phosphatase, which are assigned either 1kbp, 3kbp and 4kbp. This protein is a mutimer and has therefore returned results from all of its chains (designated A to D)

A structural comparison search of the Danio rerio ortholog (2nxf) showed that the gene product had its greatest similarity to a phosphorhydrolase and to various purple acid phosphatases (figure 3). All of these protein matches showed little sequence similarity to the 2nxf however the Z-score of these proteins were significant based on the guide given on the Dali server that Z-scores below 2.5 are insignificant matches. The top hit, 2dxl-A, showed a Z-score of 24.1; other members in the top five matches again showed Z-scores of above 20. Dali works by fitting the two structures together and determining how far apart each of the atoms are from one another to give a root mean square deviation (rmsd), even in the lowest of the top 20 matches the rmsd is 3.0 or below.

Figure 4: Secondary structure aligned with sequence of 2NXF annotated with the pfam domain

Pfam analysis shows a common domain to all these proteins as being a calcineurin-like phosphoesterase (PF00149). SCOP classification (Murzin et al 1995) of the protein shows it to be an Alpha and Beta (a + b) protein with a 4-layer sandwich fold of alpha/beta/beta/alpha, in the family of puple acid phosphatases (56301). The secondary structure assignment (Kabsch & Sander, 1983) show that the protein contains 20 beta-sheets, 12 alpha-helices and a single disulfide bond between Cys272 and Cys234 (Figure 4); the Pfam domain is shown to stretch from Phe6 to Gly269.

CASTp predicts that the protein contains a number of cavities the largest of which has a volume of 2043.1cubic angstrom covering an area of 950.8squared angstrom. This cavity is very large compared to the others predicted by CASTp. PyMOL analysis of this pocket shows that it is the binding site for the PO4 ligand and contains two Zn atoms buried in the pocket (Figure 5). This site shows three identical residues across the top five protein matches by Dali. This includes the human homolog as well as the Enterbacter aerogenes protein. The sequence of residues is GNH at positions 95 to 97, is unchanged, with a highly conserved acidic amino acid at position 98 down stream of this pattern. The ribbon view (Figure 6) shows that the ligands are positioned at the mid-point sandwiched between the two halves of the protein, above the inner beta-sheet centre of the protein. What only becomes apparent by looking at the surface view of the protein is that this cavity is bridged across the top to form a circle or donut shaped cavity (Figure 5), this may have implications for the type of substrate that can bind to this cavity. A close inspection of the binding site reveals that the GNH pattern appears to be directly involved with ligand binding (Figure 7). The binding site and interactions for the phosphate ion consists of Asp13, Gyn15, Asp60, Asp96, His97, His267 as well as interactions with the two Zn ions. Two residues, Asp96 and His97 form part of the GNH pattern seen in this class of proteins, Gly95 or the acidic amino acid at position 97 does not appear to interact directly with either the Zinc cofactors or the PO4 ligand, it may be that they are important in positioning the other two residues for their bonding interactions or creating the correct fold for the protein.


Figure 7: Binding site interactions with the PO4 ligand
Figure 5: The surface view of the protein showing the metal binding site in a large cavity on the protein.This cavity is bridged so that it appears to be a donut shape, this formation is not picked up in the ribbon view of 2NXF which shows a clear gap in the secondary structure of the protein. Many other smaller cavities can also be seen in the protein. Two Zinc atoms are shown as red spheres interacting with the phosphate ion
Figure 6:Ribbon view of 2NXF showing the 4-layer alpha/beta sandwich structure described by the SCOP classification. The structure shows that the outer layers of the protein are dominated by alpha helices (red coils), while the center of the protein consists of two distinct layers of beta sheets (yellow arrows).


Abstract | Introduction | Results | Discussion | Conclusion | Method | References

Back To Main hypothetical protein LOC56985 page