LOC56985 method: Difference between revisions

From MDWiki
Jump to navigationJump to search
mNo edit summary
Line 20: Line 20:
'''Multiple sequence alignment and identification of the metallophosphoesterase signature sequence'''
'''Multiple sequence alignment and identification of the metallophosphoesterase signature sequence'''


The above mentioned muliple sequence alignments were analysed for the metallometallophosphoesterase signature sequence DX[H/X]-(X)n-GDXX[D/X]-(X)n-GNH[D/E]-(X)n-[G/X]H-(X)n-GHX[H/X] and any other conserved regions.  
The above mentioned muliple sequence alignments were analysed for the metallophosphoesterase signature motif DX[H/X]-(X)n-GDXX[D/X]-(X)n-GNH[D/E]-(X)n-[G/X]H-(X)n-GHX[H/X] and any other conserved regions.  





Revision as of 00:58, 8 June 2008

Evolution

BLAST

A sequence similarity search was performed using the human orthologue of the protein 2nfx, the hypothetical protein LOC56985. This was performed through the use of blast program, BLASTP, (Altschul, Gish et al, 1990) that had been saved onto a CD in late April. Thus results could only have come from data present within the database before that day. That is the results may have been very similar but may not exactly match results achieved through a BLAST search on the NCBI (National Centre for Biotechnology Information) website. Only the highest 60 results (based on their E values) from the initial sequence search were used. For the purpose of creating a phylogenetic tree, any double up of species were removed, unless considered significant.

CLUSTAL

The selected sequences were aligned by the use of the CLUSTALX program (Larkin, et al., 2007), which is a global alignment program. From here any sequences that had extremely large gaps were also deleted. When a reasonable alignment was produced, the aligned sequences were used to form a phylogenetic tree. The tree was also bootstrapped and trees and bootstrap values were visualised using the program TreeView (Page, 1996).

Structure

The 2nxf structure was submitted to the DALI server (Holm & Sander, 1996) to search for structurally similar protiens among all solved structures deposited in the Protein Data Bank (PDB). These results were then analysed using ClustalX (Larkin, et al., 2007) for any conserved regions in the sequence that may provide vital information about the binding sites or sequences essential to the fold of the protein. CASTp (Dundas, Ouyang et al, 2006) and Pymol (DeLano Scientific LLC, 2007) were used for visualising the protein’s secondary and tertiary structure as well as the possible bindings sites. Both CATH (Orengo, Michie, Jones, Jones, Swindells, & Thornton, 1997) and SCOP (Murzin, Brenner et al, 1995) databases were used to identify and characterise the structure of the protein; LIGPLOT (Wallace, Laskowski, & Thornton, 1995) was used to generate schematic diagrams of the ligand-protein interactions at the binding site.

Function

Domain and Superfamily/ family identification

The Superfamily database (http://supfam.mrc-lmb.cam.ac.uk/SUPERFAMILY/hmm.html) and the Pfam domain database (http://pfam.sanger.ac.uk/) were searched using the LOC56985 sequence (default settings).

Multiple sequence alignment and identification of the metallophosphoesterase signature sequence

The above mentioned muliple sequence alignments were analysed for the metallophosphoesterase signature motif DX[H/X]-(X)n-GDXX[D/X]-(X)n-GNH[D/E]-(X)n-[G/X]H-(X)n-GHX[H/X] and any other conserved regions.



Abstract | Introduction | Results | Discussion | Conclusion | Method | References

Back To Main hypothetical protein LOC56985 page