Strings 2008 2 4: Difference between revisions

From MDWiki
Jump to navigationJump to search
No edit summary
No edit summary
 
(9 intermediate revisions by the same user not shown)
Line 4: Line 4:


=== Computing of a dot plot ===
=== Computing of a dot plot ===
The two sequences correspond to the two dimensions of the matrix and each matrix element represents the outcome of the comparison between the two corresponding characters in the sequences. Identical character pairings are visualised by a black pixel while different characters are coloured white.
The similarity between two sequences are represented by comparison matrix. Each element in the matrix compares the character in one sequence with the character of the other sequence. Identical character pairings are visualised by a black pixel while different characters are coloured white.


The figure below shows an example of a dot plot comparing two DNA sequencess.
The table below illustrates an example of a dot plot comparing DNA fragments of [http://en.wikipedia.org/wiki/Hemoglobin hemoglobin] from a person with [http://en.wikipedia.org/wiki/Sickle-cell_disease sickle-cell disease] and a healthy person. (This example can be generated interactively in class. It also is a nice example to show the effects of [http://web.txwes.edu/biology/macclark/genetics/genepages/cheryl/hemoglobin_mutants.html different mutations] on the transcriptional outcome.)


INSERT FIGURE HERE
 
{| class="wikitable" style="text-align:center"
|+Multiplication table
|-
! × !! C !! T !! G !! A !! C !! T !! C !! C !! T !! G !! A !! G !! G !! A !! G !! A !! A !! G !! T !! C !! T !! G !! C !! C
|-
! C
| bgcolor="black"|X || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || || || || || || ||bgcolor="black"|X || || ||bgcolor="black"|X ||bgcolor="black"|X
|-
! T
|  ||bgcolor="black"|X || || || ||bgcolor="black"|X || || ||bgcolor="black"|X || ||bgcolor="black"|X || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X || || ||
|-
! G
|  || ||bgcolor="black"|X || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || ||bgcolor="black"|X || || ||bgcolor="black"|X || || || ||bgcolor="black"|X || ||
|-
! A
|  || || ||bgcolor="black"|X || || || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || ||
|-
! C
| bgcolor="black"|X || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || || || || || || ||bgcolor="black"|X || || ||bgcolor="black"|X ||bgcolor="black"|X
|-
! T
|  ||bgcolor="black"|X || || || ||bgcolor="black"|X || || ||bgcolor="black"|X || ||bgcolor="black"|X || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X || || ||
|-
! C
| bgcolor="black"|X || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || || || || || || ||bgcolor="black"|X || || ||bgcolor="black"|X ||bgcolor="black"|X
|-
! C
| bgcolor="black"|X || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || || || || || || ||bgcolor="black"|X || || ||bgcolor="black"|X ||bgcolor="black"|X
|-
! T
|  ||bgcolor="black"|X || || || ||bgcolor="black"|X || || ||bgcolor="black"|X || ||bgcolor="black"|X || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X || || ||
|-
! G
|  || ||bgcolor="black"|X || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || ||bgcolor="black"|X || || ||bgcolor="black"|X || || || ||bgcolor="black"|X || ||
|-
! T
|  || || ||bgcolor="black"|X || || || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || ||
|-
! G
|  || ||bgcolor="black"|X || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || ||bgcolor="black"|X || || ||bgcolor="black"|X || || || ||bgcolor="black"|X || ||
|-
! G
|  || ||bgcolor="black"|X || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || ||bgcolor="black"|X || || ||bgcolor="black"|X || || || ||bgcolor="black"|X || ||
|-
! A
|  || || ||bgcolor="black"|X || || || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || ||
|-
! G
|  || ||bgcolor="black"|X || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || ||bgcolor="black"|X || || ||bgcolor="black"|X || || || ||bgcolor="black"|X || ||
|-
! A
|  || || ||bgcolor="black"|X || || || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || ||
|-
! A
|  || || ||bgcolor="black"|X || || || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || ||
|-
! G
|  || ||bgcolor="black"|X || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || ||bgcolor="black"|X || || ||bgcolor="black"|X || || || ||bgcolor="black"|X || ||
|-
! T
|  ||bgcolor="black"|X || || || ||bgcolor="black"|X || || ||bgcolor="black"|X || ||bgcolor="black"|X || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X || || ||
|-
! C
| bgcolor="black"|X || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || || || || || || ||bgcolor="black"|X || || ||bgcolor="black"|X ||bgcolor="black"|X
|-
! T
|  ||bgcolor="black"|X || || || ||bgcolor="black"|X || || ||bgcolor="black"|X || ||bgcolor="black"|X || || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X || || ||
|-
! G
|  || ||bgcolor="black"|X || || || || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || ||bgcolor="black"|X || || ||bgcolor="black"|X || || || ||bgcolor="black"|X || ||
|-
! C
| bgcolor="black"|X || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || || || || || || ||bgcolor="black"|X || || ||bgcolor="black"|X ||bgcolor="black"|X
|-
! C
| bgcolor="black"|X || || || ||bgcolor="black"|X || ||bgcolor="black"|X ||bgcolor="black"|X || || || || || || || || || || || ||bgcolor="black"|X || || ||bgcolor="black"|X ||bgcolor="black"|X
|}




Line 17: Line 94:
* Sketch a dot plot of two sequences which are identical except that a segment is deleted from the middle of one sequence and not the other.
* Sketch a dot plot of two sequences which are identical except that a segment is deleted from the middle of one sequence and not the other.
|}
|}
Further (very advanced) reading for the Mathematics and Physics inclined
[http://en.wikipedia.org/wiki/Recurrence_plot Recurrence plot of phase space trajectories]
[[Strings_2008_2_5 | goto Similarity of strings]]
--[[User:ThomasHuber|ThomasHuber]] 14:02, 10 January 2008 (EST)

Latest revision as of 02:42, 17 January 2008

Visual comparison of strings

Humans are very good at identifying patterns visually. One of the first ways to compare biological sequences was to generate identity matrices and visualise them in a so called dot plot.

Computing of a dot plot

The similarity between two sequences are represented by comparison matrix. Each element in the matrix compares the character in one sequence with the character of the other sequence. Identical character pairings are visualised by a black pixel while different characters are coloured white.

The table below illustrates an example of a dot plot comparing DNA fragments of hemoglobin from a person with sickle-cell disease and a healthy person. (This example can be generated interactively in class. It also is a nice example to show the effects of different mutations on the transcriptional outcome.)


Multiplication table
× C T G A C T C C T G A G G A G A A G T C T G C C
C X X X X X X X
T X X X X X X
G X X X X X X X
A X X X X
C X X X X X X X
T X X X X X X
C X X X X X X X
C X X X X X X X
T X X X X X X
G X X X X X X X
T X X X X
G X X X X X X X
G X X X X X X X
A X X X X
G X X X X X X X
A X X X X
A X X X X
G X X X X X X X
T X X X X X X
C X X X X X X X
T X X X X X X
G X X X X X X X
C X X X X X X X
C X X X X X X X


Discovery questions:
  • Suppose that two sequences are identical except that a segment is inverted in one sequence, relative to the other sequence. Explain how such an inversion would look like in a dot plot.
  • What would be the value of using a dot plot to compare a sequence to a second sequence, as well as the reverse compliment of the second sequence?
  • Sketch a dot plot of two sequences which are identical except that a segment is deleted from the middle of one sequence and not the other.


Further (very advanced) reading for the Mathematics and Physics inclined

Recurrence plot of phase space trajectories



goto Similarity of strings


--ThomasHuber 14:02, 10 January 2008 (EST)