De novo assembly of Dekkera bruxellensis: a multi technology approach using short and long-read sequencing and optical mapping
Stockholm Univ, Sci Life Lab, Dept Biochem & Biophys, S-17121 Solna, Sweden.
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Immunology, Genetics and Pathology. Uppsala University, Science for Life Laboratory, SciLifeLab.
Swedish Univ Agr Sci, Dept Microbiol, SE-75007 Uppsala, Sweden.
Stockholm Univ, Sci Life Lab, Dept Biochem & Biophys, S-17121 Solna, Sweden.
2015 (English). In: GigaScience, ISSN 2047-217X, E-ISSN 2047-217X, Vol. 4, 56. Article in journal (Refereed), Published, Text
Abstract [en]

Background: It remains a challenge to perform de novo assembly using next-generation sequencing (NGS). Despite the availability of multiple sequencing technologies and tools (e.g., assemblers) it is still difficult to assemble new genomes at chromosome resolution (i.e., one sequence per chromosome). Obtaining high quality draft assemblies is extremely important in the case of yeast genomes to better characterise major events in their evolutionary history. The aim of this work is two-fold: on the one hand we want to show how combining different and somewhat complementary technologies is key to improving assembly quality and correctness, and on the other hand we present a de novo assembly pipeline we believe to be beneficial to core facility bioinformaticians. To demonstrate both the effectiveness of combining technologies and the simplicity of the pipeline, here we present the results obtained using the Dekkera bruxellensis genome.

Methods: In this work we used short-read Illumina data and long-read PacBio data combined with the extreme long-range information from OpGen optical maps in the task of de novo genome assembly and finishing. Moreover, we developed NouGAT, a semi-automated pipeline for read-preprocessing, de novo assembly and assembly evaluation, which was instrumental for this work.

Results: We obtained a high quality draft assembly of a yeast genome, resolved on a chromosomal level. Furthermore, this assembly was corrected for mis-assembly errors as demonstrated by resolving a large collapsed repeat and by receiving higher scores by assembly evaluation tools. With the inclusion of PacBio data we were able to fill about 5 % of the optical mapped genome not covered by the Illumina data.

Place, publisher, year, edition, pages
2015. Vol. 4, 56
National Category
Medical Genetics
URN: urn:nbn:se:uu:diva-271435
DOI: 10.1186/s13742-015-0094-1
ISI: 000365669400002
PubMedID: 26617983
OAI: oai:DiVA.org:uu-271435
DiVA: diva2:891963
Available from: 2016-01-08. Created: 2016-01-08. Last updated: 2016-04-12. Bibliographically approved.

By author/editor
Bunikis, Ignas; Pettersson, Olga Vinnere