Counting RNA Secondary Structures of Arbitrary Pseudoknot Type

RNA Structures Counting k-nc arch diagrams Counting RNA 2nd structures Counting RNA Secondary Structures of Arbitrary Pseudoknot Type Review of Jin...
Author: Cecily Lang
4 downloads 0 Views 605KB Size
RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting RNA Secondary Structures of Arbitrary Pseudoknot Type Review of Jin, Qin, & Reidys (2008) Bull. Math. Biol. 70

Berton A. Earnshaw Department of Mathematics, University of Utah Salt Lake City, Utah 84112

February 26, 2008

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

A single strand of RNA

http://ilab.cs.ucsb.edu/projects/helly/rna.jpg

Primary structure: sequence of bases (A,G,U,C) Secondary structure: pairing of bases Watson-Crick pairs: A-U, G-C (less often U-G)

Tertiary structure: resulting 3D molecule Different tertiary structures ⇒ different enzymatic properties

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

A single strand of RNA: An example

Primary structure: AACCAUGUGGUACUUGAUGGCGAC

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

A single strand of RNA: An example

Primary structure: AACCAUGUGGUACUUGAUGGCGAC Secondary structure:

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

A single strand of RNA: An example

Primary structure: AACCAUGUGGUACUUGAUGGCGAC Secondary structure:

Tertiary structure: extremely difficult to predict (probably NP-hard)

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

RNA secondary structure as k-noncrossing arch diagram k-noncrossing arch diagram of order n graph on vertex set {1, . . . , n} all vertices have degree ≤ 1 there do not exist k arches {i1 , j1 }, . . . , {ik , jk } such that i1 < · · · < ik < j1 < · · · < jk

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

RNA secondary structure as k-noncrossing arch diagram k-noncrossing arch diagram of order n graph on vertex set {1, . . . , n} all vertices have degree ≤ 1 there do not exist k arches {i1 , j1 }, . . . , {ik , jk } such that i1 < · · · < ik < j1 < · · · < jk

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

RNA secondary structure as k-noncrossing arch diagram k-noncrossing arch diagram of order n graph on vertex set {1, . . . , n} all vertices have degree ≤ 1 there do not exist k arches {i1 , j1 }, . . . , {ik , jk } such that i1 < · · · < ik < j1 < · · · < jk

RNA secondary structure of n bases, pseudoknot type k − 2 k-noncrossing (but not k − 1) arch diagram of order n no 1-arches {i, i + 1} “abstract” secondary structure (no primary structure)

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting RNA secondary structures

Establish bijection between k-noncrossing arch diagrams and certain walks in Zk−1 Count walks via reflection principle (Weyl groups) Enumerate restricted walks (RNA secondary structures)

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

k-noncrossing arch diagrams and walks in Weyl chamber

Walk in Zm of length n sequence of vectors x0 , x1 , . . . , xn ∈ Zm s.t. |xi +1 − xi | = 0 or 1

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

k-noncrossing arch diagrams and walks in Weyl chamber

Walk in Zm of length n sequence of vectors x0 , x1 , . . . , xn ∈ Zm s.t. |xi +1 − xi | = 0 or 1

Weyl chamber subset of vectors x = (x1 , . . . , xm ) ∈ Zm s.t. x1 > · · · > xm > 0

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

k-noncrossing arch diagrams and walks in Weyl chamber

Walk in Zm of length n sequence of vectors x0 , x1 , . . . , xn ∈ Zm s.t. |xi +1 − xi | = 0 or 1

Weyl chamber subset of vectors x = (x1 , . . . , xm ) ∈ Zm s.t. x1 > · · · > xm > 0

Theorem (Chen et al. (2007) Trans. Am. Math. Soc. 359) There exists a bijection between k-noncrossing arch diagrams of order n and walks of length n in Zk−1 which start and end at a = (k − 1, k − 2, . . . , 1) and remain in the Weyl chamber.

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof: Oscillating Young diagrams, RSK algorithm Young diagram collection of squares µ arranged in left-justified rows number of squares in each row weakly decreasing

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof: Oscillating Young diagrams, RSK algorithm Young diagram collection of squares µ arranged in left-justified rows number of squares in each row weakly decreasing

Oscillating Young diagrams sequence of Young diagrams ∅ = µ0 , µ1 , . . . , µn = ∅ µi and µi +1 differ by at most one square

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof: Oscillating Young diagrams, RSK algorithm Young diagram collection of squares µ arranged in left-justified rows number of squares in each row weakly decreasing

Oscillating Young diagrams sequence of Young diagrams ∅ = µ0 , µ1 , . . . , µn = ∅ µi and µi +1 differ by at most one square

Young tableau filling of Young diagram with positive integers numbers weakly increasing in each row numbers strictly decreasing in each column

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof: Oscillating Young diagrams, RSK algorithm Young diagram collection of squares µ arranged in left-justified rows number of squares in each row weakly decreasing

Oscillating Young diagrams sequence of Young diagrams ∅ = µ0 , µ1 , . . . , µn = ∅ µi and µi +1 differ by at most one square

Young tableau filling of Young diagram with positive integers numbers weakly increasing in each row numbers strictly decreasing in each column

RSK algorithm method for creating sequences of Young tableaux

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof: The bijection

(4,3,2,1),(5,3,2,1),(6,3,2,1),(6,4,2,1),(6,4,2,1),(6,4,2,1), (6,4,3,1),(6,4,3,1),(5,4,3,1),(5,4,3,2),(6,4,3,2),(6,4,3,1), . (6,4,3,1),(6,4,2,1),(6,4,2,1),(6,3,2,1),(5,3,2,1),(4,3,2,1)

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting walks in Weyl chamber: Weyl group Set ∆m = {em } ∪ {ej−1 − ej | j = 2, .., m} Each α ∈ ∆m called a (simple) root Hyperplane Pα normal to α ∈ ∆m called a wall Weyl chamber ⊆ region of Rm bounded by walls

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting walks in Weyl chamber: Weyl group Set ∆m = {em } ∪ {ej−1 − ej | j = 2, .., m} Each α ∈ ∆m called a (simple) root Hyperplane Pα normal to α ∈ ∆m called a wall Weyl chamber ⊆ region of Rm bounded by walls

∆1 = {1}, P1 = {0} ∆2 = {(0, 1), (1, −1)}, P(0,1) = h(1, 0)i, P(1,−1) = h(1, 1)i

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting walks in Weyl chamber: Weyl group Set ∆m = {em } ∪ {ej−1 − ej | j = 2, .., m} Each α ∈ ∆m called a (simple) root Hyperplane Pα normal to α ∈ ∆m called a wall Weyl chamber ⊆ region of Rm bounded by walls

∆1 = {1}, P1 = {0} ∆2 = {(0, 1), (1, −1)}, P(0,1) = h(1, 0)i, P(1,−1) = h(1, 1)i Weyl group Bm : generated by reflections through walls D E α·x Bm = x 7→ x − 2 α | α ∈ ∆m α·α B1 ∼ = Z2 , B2 ∼ = D4

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting walks in Weyl chamber: Reflection principle wn (x, y) = # walks x → y of length n wn+ (x, y) = # walks x → y of length n remaining in Weyl chamber

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting walks in Weyl chamber: Reflection principle wn (x, y) = # walks x → y of length n wn+ (x, y) = # walks x → y of length n remaining in Weyl chamber Theorem (Gessel & Zeilberger (1992) Proc. Am. Math. Soc. 115) If x, y ∈ Zk−1 are in the Weyl chamber, then X sgn(β)wn (β(x), y). wn+ (x, y) = β∈Bk−1

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting walks in Weyl chamber: Reflection principle wn (x, y) = # walks x → y of length n wn+ (x, y) = # walks x → y of length n remaining in Weyl chamber Theorem (Gessel & Zeilberger (1992) Proc. Am. Math. Soc. 115) If x, y ∈ Zk−1 are in the Weyl chamber, then X sgn(β)wn (β(x), y). wn+ (x, y) = β∈Bk−1

Theorem (Grabiner & Magyar (1993) J. Algebr. Comb. 2) If x = (x1 , . . . , xk−1 ), y = (y1 , . . . , yk−1 ) are in the Weyl chamber, ∞ X n=0

wn+ (x, y)

k−1 xn = ex det[Ixi −yj (2x) − Ixi +yj (2x)] i ,j=1 n!

P 2r +j /(j!(r + j)!) is hyperbolic Bessel where Ir (2x) = ∞ j=0 x function of 1st kind of order r .

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting k-noncrossing arch diagrams Set fk (n, l ) = # k-nc arch diagrams of order n with l isolated nodes

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting k-noncrossing arch diagrams Set fk (n, l ) = # k-nc arch diagrams of order n with l isolated nodes With a = (k − 1, k − 2, . . . , 1), we have shown that n X fk (n, l ) wn+ (a, a) = l=0

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting k-noncrossing arch diagrams Set fk (n, l ) = # k-nc arch diagrams of order n with l isolated nodes With a = (k − 1, k − 2, . . . , 1), we have shown that n X fk (n, l ) wn+ (a, a) = l=0

∞ X n X n=1 l=0

fk (n, l )

xn n!

= ex det[Ii −j (2x) − Ii +j (2x)]|ik−1 ,j=1

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting k-noncrossing arch diagrams Set fk (n, l ) = # k-nc arch diagrams of order n with l isolated nodes With a = (k − 1, k − 2, . . . , 1), we have shown that n X fk (n, l ) wn+ (a, a) = l=0

∞ X n X n=1 l=0

fk (n, l )

xn n!

= ex det[Ii −j (2x) − Ii +j (2x)]|ik−1 ,j=1

     n n 2 f2 (n, l ) = C n−l , f3 (n, l ) = C n−l C n−l − C n−l +1 2 2 2 l l 2   2m 1 Cm = , mth Catalan number m+1 m

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting k-noncrossing RNA secondary structures Set Sk (n, l ) = # k-nc RNA structures of n bases with l isolated nodes n X Sk (n, l ) Sk (n) = # k-nc RNA structures of n bases = l=0

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Counting k-noncrossing RNA secondary structures Set Sk (n, l ) = # k-nc RNA structures of n bases with l isolated nodes n X Sk (n, l ) Sk (n) = # k-nc RNA structures of n bases = l=0

Theorem (Jin, Qin & Reidys, 2008) (n−l)/2

Sk (n, l ) =

X

b

(−1)

b=0

⌊n/2⌋

Sk (n) =

X b=0

b

(−1)





 n−b fk (n − 2b, l ) b

n−b b

 n−2b X l=0

fk (n − 2b, l )

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof Set Gk (n, l , j) = # k-nc arch diagrams of order n with l isolated nodes, j 1-arches (n−l)/2

Fk (x) =

X j=0

Gk (n, l , j)x j

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof Set Gk (n, l , j) = # k-nc arch diagrams of order n with l isolated nodes, j 1-arches (n−l)/2

Fk (x) =

X

Gk (n, l , j)x j

j=0

Note

  (n−l)/2   (b) X Fk (1) j n−b = Gk (n, l , j) = fk (n − 2b, l ) b! b b j=b

Both count (with multiplicity) all k-nc arch diagrams with l isolated nodes constructed by:  specifying b 1-arches (can be done in n−b ways) b filling n − 2b remaining nodes with k-nc arch diagram having l isolated nodes (can be done in fk (n − 2b, l) ways)

Each of Gk (n, l , j) arch diagrams counted

j b

times

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof Taylor expanding Fk about x = 1 gives (n−l)/2

Fk (x) =

X F (b) (1) (x − 1)b b! b=0

=

(n−l)/2 

X b=0

 n−b fk (n − 2b, l )(x − 1)b b

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof Taylor expanding Fk about x = 1 gives (n−l)/2

Fk (x) =

X F (b) (1) (x − 1)b b! b=0

=

(n−l)/2 

X b=0

 n−b fk (n − 2b, l )(x − 1)b b

Therefore Sk (n, l ) = Gk (n, l , 0) = Fk (0)   (n−l)/2 X b n−b = (−1) fk (n − 2b, l ) b b=0

RNA Structures

Counting k-nc arch diagrams

Counting RNA 2nd structures

Idea of proof Taylor expanding Fk about x = 1 gives (n−l)/2

X F (b) (1) (x − 1)b b!

Fk (x) =

b=0

=

(n−l)/2 

X b=0

 n−b fk (n − 2b, l )(x − 1)b b

Therefore Sk (n, l ) = Gk (n, l , 0) = Fk (0)   (n−l)/2 X b n−b = (−1) fk (n − 2b, l ) b b=0

Table 1 The rst 15 numbers of 3-noncrossing RNA structures n

1 2 3 4 5

6

7

8

9

10

11

12

13

14

15

S3 (n) 1 1 2 5 13 36 105 321 1018 3334 11216 38635 135835 486337 1769500

RNA Structures

Counting k-nc arch diagrams

The end

Thank you!

Counting RNA 2nd structures

Suggest Documents