- Martin Mann, Mostafa M Mohamed, Syed M Ali, and Rolf Backofen
Interactive implementations of thermodynamics-based RNA structure and RNA-RNA interaction prediction approaches for example-driven teaching
PLOS Computational Biology, 14 (8), e1006341, 2018. - Martin Raden, Syed M Ali, Omer S Alkhnbashi, Anke Busch, Fabrizio Costa, Jason A Davis, Florian Eggenhofer, Rick Gelhausen, Jens Georg, Steffen Heyne, Michael Hiller, Kousik Kundu, Robert Kleinkauf, Steffen C Lott, Mostafa M Mohamed, Alexander Mattheis, Milad Miladi, Andreas S Richter, Sebastian Will, Joachim Wolff, Patrick R Wright, and Rolf Backofen
Freiburg RNA tools: a central online resource for RNA-focused research and teaching
Nucleic Acids Research, 46(W1), W25-W29, 2018.
Teaching - counting : number of structures source at github@BackofenLab/RNA-Playground
The number of nested secondary structures that can be formed by an RNA sequence grows
exponentially with sequence length $n$ (about $\sim 2.3^{n}$, see
(Hofacker et al., 1997)).
Here, we use a variant of the algorithm by Michael S. Waterman and Temple F. Smith (1970) to compute the exact number of nested structures a sequence can form. The approach applies dynamic programming, i.e. tabularizes results for subproblems. To this end, it populates a matrix $C$, where an entry $C_{i,j}$ provides the exact number of admissible structures for the subsequence from position $i$ to $j$. The overall number of admissible structures for the sequences can be found in the upper right corner $C_{1,n}$, where $n$ denotes the sequence's length and Watson-Crick as well as GU base pairs are considered complementary.
Here, we use a variant of the algorithm by Michael S. Waterman and Temple F. Smith (1970) to compute the exact number of nested structures a sequence can form. The approach applies dynamic programming, i.e. tabularizes results for subproblems. To this end, it populates a matrix $C$, where an entry $C_{i,j}$ provides the exact number of admissible structures for the subsequence from position $i$ to $j$. The overall number of admissible structures for the sequences can be found in the upper right corner $C_{1,n}$, where $n$ denotes the sequence's length and Watson-Crick as well as GU base pairs are considered complementary.
RNA sequence $S$:
Minimal loop length $l$:
Recursion for counting the number of possible structures:
$C$ | ||
---|---|---|