Semiautomated improvement of RNA alignments

Research output: Contribution to journalJournal articleResearchpeer-review

  • Ebbe Sloth Andersen
  • Allan Lind-Thomsen
  • Bjarne Knudsen
  • Susie E. Kristensen
  • Havgaard, Jakob Hull
  • Elfar Torarinsson
  • Niels Larsen
  • Christian Zwieb
  • Peter Sestoft
  • Jørgen Kjems
  • Gorodkin, Jan
We have developed a semiautomated RNA sequence editor (SARSE) that integrates tools for analyzing RNA alignments. The editor highlights different properties of the alignment by color, and its integrated analysis tools prevent the introduction of errors when doing alignment editing. SARSE readily connects to external tools to provide a flexible semiautomatic editing environment. A new method, Pcluster, is introduced for dividing the sequences of an RNA alignment into subgroups with secondary structure differences. Pcluster was used to evaluate 574 seed alignments obtained from the Rfam database and we identified 71 alignments with significant prediction of inconsistent base pairs and 102 alignments with significant prediction of novel base pairs. Four RNA families were used to illustrate how SARSE can be used to manually or automatically correct the inconsistent base pairs detected by Pcluster: the mir-399 RNA, vertebrate telomase RNA (vert-TR), bacterial transfer-messenger RNA (tmRNA), and the signal recognition particle (SRP) RNA. The general use of the method is illustrated by the ability to accommodate pseudoknots and handle even large and divergent RNA families. The open architecture of the SARSE editor makes it a flexible tool to improve all RNA alignments with relatively little human intervention. Online documentation and software are available at (http://sarse.ku.dk).
Original languageEnglish
JournalRNA: A publication of the RNA Society
Volume13
Issue number11
Pages (from-to)1850-1859
Number of pages10
ISSN1355-8382
DOIs
Publication statusPublished - 2007

ID: 8096836