C1orf41 Results: Difference between revisions

From MDWiki
Jump to navigationJump to search
No edit summary
No edit summary
Line 113: Line 113:


The sequences from platypus, urchin_156, urchin_159 and g.alga_fap were excluded from further investigation. This was due to poor alignment of platypus, urchin_156 and urchin_159 with the other organisms. In addition, the sequence for g.alga_fap was removed as it was a truncated sequence of g.alga_itp25.  
The sequences from platypus, urchin_156, urchin_159 and g.alga_fap were excluded from further investigation. This was due to poor alignment of platypus, urchin_156 and urchin_159 with the other organisms. In addition, the sequence for g.alga_fap was removed as it was a truncated sequence of g.alga_itp25.  


One of the characteristic of a small heat shock protein is that they have a conserved alpha B crystallin domain which can also be observed in c1orf41 as shown in figure 16 below:
One of the characteristic of a small heat shock protein is that they have a conserved alpha B crystallin domain which can also be observed in c1orf41 as shown in figure 16 below:
Line 118: Line 119:
   
   


The residues of the possible binding site were highlighted in black in figure 16 below. However, these residues were only conserved in terrestrial eukaryotes and some marine eukaryotes. Changes in the conserved amino acids may lead to higher binding affinity to ligand or different folding of the protein leading to different function.
The residues of the possible binding site were highlighted in black in figure 17 below. However, these residues were only conserved in terrestrial eukaryotes and some marine eukaryotes. Changes in the conserved amino acids may lead to higher binding affinity to ligand or different folding of the protein leading to different function.





Revision as of 09:50, 11 June 2009

Structure

Protein Structure'

PDB ID: 1TVG Information from the PDB stated that x-ray diffraction was sued to solve the structure of this protein with an R value of 0.215 and at 1.6Å. Two ligands were present in the crystal structure, a calcium (II) ion and a samarium (III) ion. This protein has 153 residues and its secondary structure consists of two helices and nine beta sheet strands.

Scop result

Scop.png


PDB sum 2.PNG

Figure 1: Above is the representation of c1orf41 secondary structure as presented in PDBsum.


Tertiary structure

1tvg ani.gif

Figure 2: This cartoon representation of c1orf41 shows that this protein is monomeric, consisting of a single domain. The fold of the protein is a jelly roll barrel where four pairs of anti-parallel beta strands are organised to form a barrel-like structure.


1tvg surface.png

Figure 3: Surface representation of c1orf41. The Ca (II) ion is located within a pocket of the protein. Sm (III) interacts with Asp92 on the protein but its presence was probably due to the method used in solving the phase problem during structure solution using x-ray crystallogrophy.


Charge surface ani.gif

Figure 4: Electrostatic charge distribution on the surface of c1orf41.



Structure analysis


1)Structure similarities

DALI was used to find proteins that are similar in structure to c1orf41. The results showed that sialidases, alpha-N-acetylglucosaminidases and galactose oxidases have similar structure to this protein.

Picture 1.png

Figure 5: DALI result showing the first 35 proteins that have similar structures to c1orf41. Even though the proteins are similar in structure they have very little sequence identity to c1orf41. 1xpw is c1orf41 but the structure was solved by NMR.


2)Domain Classification

1TVG pfam.png

Figure 6:Pfam result indicated that our protein has a F5/8 type C domain, also known as the discoidin domain which is apart of galactose binding domain super family.

Closer inspection of several of the other proteins from the DALI result showed that they also have discoidin domain in their structures. It was probably based on this domain that the DALI result was obtained.


3)Possible ligand binding sites

Observations and comparisons of several proteins from the DALI result with our protein showed that there is a similar position on each proteins where a metal ion is located.


1tvg ligand1.png

Figure 7: This figure shows the position where calcium (II) ion (yellow) is located in within a loop of c1orf41 structure. This loop where metal ion is coordinated to was also observed in several proteins from the DALI result.


1gog Na.png

2bzd discoidin2.png

Figure 8: Shown at the top left is the discoidin domain of galactose oxidase with a sodium ion (purple) located within a loop. The bottom figure is bacterial sialidase discoidin domain with a sodium ion (purple) and a beta-D-galactose molecule. The sodium ion is also located i within a loop.


1tvg castp3.PNG

Figure 9: CastP also identified a surface cleft that is located at the loop position.


1tvg ca res2.PNG

Figure 10: Pymol figure displaying possible metal binding position on c1orf41. Orange is the residues that may be involved in coordinating the metal and yellow is the calcium ion. The residues are based on CastP result.


Nest analysis by Profunc suggested two other ligand binding sites. Nests are structural motifs that commonly found in functionally important regions of protein structures.

Profunc nest.PNG

Figure 11: Nest 1 and has scores higher than 2. So, they are more likely to be functional. His54 and Lys55 NH atoms are accessible from a large surface cleft of the protein. The cleft is also deep indicating that the nest is functionally important. The residues of Nest 2 are part of the loop that coordinates the Ca (II)ion.


1tvg nests2.png

Figure 12: Pymol representation of Nest 1 and 2.


Surface nest3.PNG

Figure 13: Arrows indicating the pockets of the nests that may be accessible to ligands.



Evolution

Results from Blast showed that c1orf41 have high homology to small heat shock proteins which gave us a hint that the protein may be a heat shock protein (Figure 14). However, initial multiple sequence alignment showed many gaps and few conserved regions (Figure 15).

Figure 14: Homologues of c1orf41 from BLAST.


Figure 15:Initial multiple sequence alignment of the 27 proteins obtained from Blast


The sequences from platypus, urchin_156, urchin_159 and g.alga_fap were excluded from further investigation. This was due to poor alignment of platypus, urchin_156 and urchin_159 with the other organisms. In addition, the sequence for g.alga_fap was removed as it was a truncated sequence of g.alga_itp25.


One of the characteristic of a small heat shock protein is that they have a conserved alpha B crystallin domain which can also be observed in c1orf41 as shown in figure 16 below:


The residues of the possible binding site were highlighted in black in figure 17 below. However, these residues were only conserved in terrestrial eukaryotes and some marine eukaryotes. Changes in the conserved amino acids may lead to higher binding affinity to ligand or different folding of the protein leading to different function.


Figure 17:Residues involved in the possible binding site of c1orf41



After obtaining multiple sequence alignment, phylogenetic tree was construct based on the result from multiple sequence alignment. Bootstrap values are shown on each branch.

Treecolour.png

The unrooted tree above gave information on the evolutionary relationship between different organisms. This protein seems to have evolved from marine eukaryotes (red) to amphibians (blue) and then to terrestrial eukaryotes (green).

Tree.png


The bootstrap value shows the reliability of the branch. Higher bootstrap value show higher reliability.





Back to main page