Veterinärmedizinische Universität Wien Forschungsinformationssystem VetDoc

Grafischer Link zur Startseite der Vetmeduni Vienna

Gewählte Publikation:

Open Access Logo

Publikationstyp: Zeitschriftenaufsatz
Dokumenttyp: Originalarbeit

Jahr: 2012

AutorInnen: Palmieri, N; Nolte, V; Suvorov, A; Kosiol, C; Schlötterer, C

Titel: Evaluation of different reference based annotation strategies using RNA-Seq - a case study in Drososphila pseudoobscura.

Quelle: PLoS One. 2012; 7(10):e46415

Autor/innen der Vetmeduni Vienna:

Kosiol Carolin
Nolte Viola
Palmieri Nicola
Schlötterer Christian

Beteiligte Vetmed-Organisationseinheiten
Institut für Populationsgenetik

Zugehörige(s) Projekt(e): Evolution und Genexpression in Drosophila

RNA-Seq is a powerful tool for the annotation of genomes, in particular for the identification of isoforms and UTRs. Nevertheless, several software tools exist and no standard strategy to obtain a reliable annotation is yet established. We tested different combinations of the most commonly used reference-based alignment tools (TopHat, GSNAP) in combination with two frequently used reference-based assemblers (Cufflinks, Scripture) and evaluated the potential of RNA-Seq to improve the annotation of Drosophila pseudoobscura. While GSNAP maps a higher proportion of reads, TopHat resulted in a more accurate annotation when used in combination with Cufflinks. Scripture had the lowest sensitivity. Interestingly, after subsampling to the same coverage for GSNAP and TopHat, we find that both mappers have similar performance, implying that the advantage of TopHat is mainly an artifact of the lower coverage. Overall, we observed a low concordance among the different approaches tested both at junction and isoform levels. Using data from both sexes of two adult strains of D. pseudoobscura we detected alternative splicing for about 30% of the FlyBase multiple-exon genes. Moreover, we extended the boundaries for 6523 genes (about 40%). We annotated 669 new genes, 45% of them with splicing evidence. Most of the new genes are located on unassembled contigs, reflecting their incomplete annotation. Finally, we identified 99 additional new genes that are not represented in the current genome contigs of D. pseudoobscura, probably due to location in genomic regions that are difficult to assemble (e.g. heterochromatic regions).

Keywords Pubmed: Animals
Sequence Analysis, RNA/methods*

© Veterinärmedizinische Universität Wien Hilfe und DownloadsErklärung zur Barrierefreiheit