Pyridoxal Phosphatase Results: Difference between revisions

From MDWiki
Jump to navigationJump to search
Line 149: Line 149:
'''''General Annotation'''''
'''''General Annotation'''''


[[Image:Function.jpg|left|thumb|1000px|Fig 17. General annotation of Pyrixol Phosphotase. Picture courtesy from UniProt]]<BR>
[[Image:Function.jpg|left|thumb|1000px|Fig 17. General annotation of Pyrixol Phosphotase. Picture courtesy of UniProt]]<BR>
<BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR><BR>


===Sequence homology===
===Sequence homology===

Revision as of 16:54, 9 June 2008

Evolution

Basic Local Alignment Search Tool (BLAST)

The BLAST search resulted in a list of organisms to which the target sequence (2cfsA) is related. Complete BLAST results for human pyridoxal phosphatase

Multiple Sequence Alignment

Multiple sequence alignment of the sequences of pyridoxal phosphatase from sample organisms extracted from the blast search resulted in several conserved residues identified as well as some conservative and semi-conservative substitutions.

Fig 1. Multiple sequence alignment (MSA) showing the conserved residues for Pyridoxal phosphatase.

Pyridoxal Phosphatase Multiple Sequence Alignment

It was found that within the multiple sequence alignment, there was only conservation within the 810 to 890 nucleotide region. Outside of that region, there are blocks of similar sequences for some organisms but not for all.

The conserved residues found in the multiple sequence alignment (MSA) correspond to Lys-213, Asp-238, Arg-239 and Gly-260 in the human 2cfsA protein. Two other residues, Val-200 and Leu-245 have conservative substitutions, while Gly-212, Met-235 and Val-236 have semi-conservative substitutions.


Phylogeny Tree

Calculation of relationships between the different pyridoxal phosphatase proteins from the organisms allowed the generation of phylogeny trees. The trees show the divergence of the sequences of the pyridoxal phosphatase-related proteins in various organisms.

Fig 2. Rectangular cladogram generated from the BLAST and Multiple Sequence Alignment after boot-strapping.

Pyridoxal Phosphatase Phylogeny Diagram

Fig 3. Radial cladogram of pyridoxal phoshpatase phylogeny showing the domains of the organisms.

The radial cladogram above, showed that organisms from the same Kingdom tended to clump together in one branch of the tree. Four Kingdoms of organisms with related pyridoxal phosphatase proteins were found; Prokaryota, Fungi, Plantae and Animalia.

Structure


General Structure of Pyridoxal Phosphatase & 2cfsA

Fig 4. Courtesy (and edited) of Almo et al., 2007. Structure of 2cftA - Human Pyridoxal 5'-Phosphate Phosphatase with its substrate. As mentioned in the "Introduction" section, the core domain consists of beta sheets sandwiched by alpha helices. 2cftA is similar to 2cfsA, with the differences being the presence of a PLP ligand and the type of metal ions (2cfsA has magnesium ions, 2cftA has calcium ions). Notice the similarities between the structure of 2cftA and the PyMOL-generated image of 2cfsA (Fig 5.).
Fig 5. The three-dimensional structure of 2cfsA, as generated by PyMOL

Red - Helix
Yellow - Sheet
Green - Loop
Green Dots - Water molecules






























PDB

Based on the information provided in the website, Pyridoxal Phosphatase has the following characteristics:

  • Isolated from Homo Sapiens and expressed in Escherichia Coli.
  • Structure is similar to the Pyridoxal Phosphate Phospatase protein
  • 1 (A) Chain
  • Magnesium ions (x2)
  • Resolution of 2.40 angstroms. This means that the number of sidechains in the wrong rotamer is smaller as compared to proteins of a higher resolution (>2.5 angstroms). Other characteristics of proteins of similar resolutions are: (1) many small detectable errors, (2) correct folding, (3) fewer number of errors in the surface loops and (4) visible water molecules and small ligands.

http://www.rcsb.org/pdb/explore/explore.do?structureId=2cfs

DALI

Fig 6. Significant (and rejected) hits as generated from the DALI database

A total of 176 hits were generated, of which only the first 11 (as shown in Fig. 6) were predicted to be significant. The others were rejected on account that their respective lali scores (refer to the red, boxed section) were less than half of Pyridoxal Phosphatase's (nres: 296). The lali value is important as this refers to the number of structurally equivalent residues between the hit protein and the protein-of-interest. If two proteins are not structurally similar/related, chances are they may not have related functions.

http://ekhidna.biocenter.helsinki.fi/dali_server/results/20080506-034-babd470eae557046b605d8f91e8a16e5

It was noted that none of the hits actually matched 2cfsA. The closest was a protein belonging to the Pyridoxal Phosphate Phosphatase family (PDB ID 2oycA) which based on the results, was predicted to be highly similar to 2cfsA. Table 1 highlights the similarities between 2cfsA and 2oycA.

Table 1 - 2oycA, as compared against 2cfsA

















Using the PyMOL software, 2oycA was superimposed against 2cfsA and both structures, as shown in Fig. 7, are structurally similar.

Fig 7. Superimposition of 2oycA against 2cfsA, as performed on PyMOL























The results obtained when 2cfsA was re-run against the DALI database

Towards the end of the study, 2cfsA was re-run against the DALI database. The rationale behind doing so was that the initial results did not yield 2cfsA as a protein hit, much less the query protein.

As opposed to the first run, 515 hits were generated. The top 25 protein hits were deemed significant, with the 26th protein hit onwards rejected for the same reason as that of the first batch - that their lali values were too low (less than half of 2cfsA's) for them to be considered as structurally significant to 2cfsA. More importantly was the observation that the DALI database identified 2cfsA as the query protein (unlike the first run). In addition, protein hit numbers 2 (2cftA) - 6 (2oycA) were ALL observed to belong to the Pyridoxal Phosphate Phosphatase family, as was 2cfsA. Based on this, it was easier to narrow down the potential research candidates in terms of structural similarities. Due to time constraint however, not all the protein hits were researched on.







PDBsum

Fig 8. Three different views of 2cfsA, courtesy of PDBsum

Fig. 8 offers three different views of 2cfsA. The purple chains represent the amino acid chain of 293 amino acids while the green spheres represent the magnesium ions (x2).

By clicking the "Protein chain" link, the user was re-directed to a website containing information pertaining to the secondary structures of both 2cfsA and 2oycA.







Secondary Structures of 2cfsA (L, http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl) and 2oycA (R). Secondary Structure 2cfsA.jpgSecondary Structure 2oycA.jpg

Topology diagrams of 2cfsA (Fig 9.1. http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=2cfs&template=protein.html&r=wiring&l=1&chain=A) and 2oycA (Fig 9.2). The topology diagram is a simplified version of the secondary structure information as provided above, and is an indication of the location of the alpha helices (represented by the red cylinders) and beta strands (represented by the pink arrows).


Fig 9.1. Topology Diagram of 2cfsA
Fig 9.2. Topology diagram of 2oycA




















Cleft Analyses of 2cfsA (Fig 10.1. http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=2cfs&template=clefts.html&pdbcode=2cfs&r=speedfill) and 2oycA (Fig 10.2).

Fig 10.1. Cleft Analysis of 2cfsA
Fig 10.2. Cleft Analysis of 2oycA
























Cleft Analysis via PyMol

Fig 11.1. Front View of 2cfsA & its potential binding sites, as generated using PyMOL
Fig 11.2. Front View of 2oycA & its potential binding sites, as generated using PyMOL


















Fig 11.3. Back View of 2cfsA & its potential binding sites, as generated using PyMOL
Fig 11.4. Back View of 2oycA & its potential binding sites, as generated using PyMOL





















PROFUNC (2cfsA)

Related Protein Sequences in the PDB (SAS)
SAS Aligned Sequences.jpg

Matches to existing PDB Structures
PROFUNC Match To Existing PDB Stuctures.jpg

Secondary Structure Matching (SSM)
PROFUNC SSM.jpg

Nest Analysis
PROFUNC Nest Analysis.jpg

Summary of Predicted Function
Summary of Predicted Function.jpg

Based on the results obtained via the local alignment method (Smith-Waterman method), it was noted that 2fcsA and 2oycA were indeed similar in terms of sequence and structure. However, contrary to the implications of the DALI-generated results, 2oycA was no longer the most closely-related protein to 2cfsA. That role now belonged to 2cftA. Based on the results generated by Profunc, 2cftA's sequence and structure, as provided by PDB, were identical to that of 2cfsA. The secondary structure matching, however, showed that 2cfsA and 2cftA had differing (albeit minimal) Q scores (1.000 and 0.981 respectively). 2cfsA was described as a "Crystal structure of human pyridoxal 5'-phosphate phosphatase", while 2cftA was described as a "Crystal structure of human pyridoxal 5'-phosphate phosphatase with its substrate". 2oycA, originally the most closely-related protein to 2cfsA, was rated 4 tiers below 2cftA (second in the scoring table).

PROFUNC (2oycA)

A search on 2oycA was also carried out via PROFUNC, and consistency in the results confirmed that 2cfsA and 2oycA were indeed structurally similar. This lends weight to the original hypothesis that they could be functionally related. In fact, based on the Nest Analysis method, both 2cfsA and 2oycA were observed to be sharing a large number of clefts. The result of the Nest Analysis (2oycA) are as shown below. Notice the similarities between the active sites of both 2oycA and 2cfsA. The reader may need to refer back to the Nest Analysis results for 2cfsA.

2oyc NEST ANALYSIS.jpg

In addition, the predicted function of 2oycA was similar to 2cfsA. Based on the Summary of Predicted Function, protein activity is intracellular, involves metabolism on a biological level, and tends to take on the role of a catalyst biochemically. It could also be involved in other biochemical processes such as hydrolase activity, phosphoric monoester hydrolase activity, hydrolase activity (acting on ester bonds) and phosphoric ester hydrolase activity.

Visualizing the potential catalytic site of 2cfsA

Fig 12. The LIGPLOT of interactions involving the PLP ligand in 2cftA. This was indicated to be the location of 2cftA's active site. The Mg 1296(A) ion of 2cfsA is located in the same position as the calcium ion of 2cftA. Therefore, there could be a possibility that the active site of 2cfsA could be at Mg 1296(A).
Fig 13. The LIGPLOT of interactions involving the Mg 1296(A) ion in 2cfsA. Notice the similarity of its location with relation to the calcium ion in 2cftA
Fig 14. The LIGPLOT of interactions involving the Mg 1297(A) ion in 2cfsA. While it does not seem to share any similarities with the PLP ligand in 2cftA, the information provided by the LIGPLOT will be useful when visualizing the catalytic site of 2cfsA via PyMOL.



















Fig 15. The catalytic site of 2cftA, with its PLP ligand and inhibitory calcium ion. As illustrated, the calcium ion is hepta-coordinated and participates in a bidentate interaction with the active site nucleophile Asp-25, resulting in a loss of activity. Diagram and caption courtesy of Almo et al., 2007.
Fig 16. The three-dimensional visualization of the catalytic site of 2cfsA. This was generated based on the location of the residues constituting the catalytic site, as provided in Figs 13 and 14.


























Based on the above-mentioned results, it was concluded that 2cfsA, 2cftA and 2oycA were structurally similar, and that further functional studies could indeed uncover the function of the protein-of-interest: 2cfsA

Function

In order to predict the function of Pyrixol Phosphotase, sequence homology, structural similarities, neighbouring genes, protein-protein interactions were analysed.

General Annotation

Fig 17. General annotation of Pyrixol Phosphotase. Picture courtesy of UniProt


























Sequence homology

To check for function among similar homologs, 2cfs_A sequence was run against FASTA, Pfam and PSI Blast non-redundant database. A search using Pfam and BLAST revealed that 2cfs_A belonged to the superfamily, Haloacid dehalogenase-like hydrolase. It is noted that the top 15 search results from BLAST database all belong to the same superfamily, HAD-like hydrolase.

Fig 18. Results from FASTA showing five highly related enzymes to 2cfs_A


http://blast.ncbi.nlm.nih.gov/Blast.cgi

http://pfam.sanger.ac.uk/search/sequence/results?jobId=A74388CE-361C-11DD-9F74-DFE079E860E8


Structural Similarities

Fig 18. Search results using Interpro for similar protein structure based on motifs.


A search based on motifs resulted in six hits. Five of which are hydrolases and one with function unknown. Proteins with similar motifs does not suggest any other function for 2cfs_A other than being a hydrolase.


Location of Gene on Genome

Fig 21. Gene location results from Profunc showing two neighbouring genes of 2cfs_A with very different sequence but similar function.



Protein-Protein Interactions

String.png
Fig 19. Diagram on protein-protein interaction of 2cfs_A.


http://string.embl.de/newstring_cgi/show_network_section.pl?taskId=ph89gQxcLucL&allnodes=1

back to Pyridoxal Phosphatase main page