\(\textit {De novo}\) assembly of a 40 Mb eukaryotic genome from short sequence reads

  • Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, \(\textit {de novo}\) assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test \(\textit {de novo}\) genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the \(\textit {Sordaria macrospora}\) genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ~4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with \(\it Neurospora\) genomes increased the N50 to 498 kb. The \(\textit {S.macrospora}\) genome contains even fewer repeat regions than its closest sequenced relative, \(\textit {Neurospora crassa.}\) Comparison with genomes of other fungi showed that \(\textit {S. macrospora}\), a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, \(\textit {S. macrospora}\) contains more polyketide biosynthesis genes than \(\textit {N. crassa}\). Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, \(\textit {de novo}\) assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology.

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Minou NowrousianORCiDGND, Jason E. StajichGND, Meiling ChuGND, Ines EnghORCiDGND, Eric EspagneGND, Karen HallidayGND, Jens KamerewerdGND, Frank KempkenGND, Birgit KnabGND, Hsiao-Che KuoGND, Heinz D. OsiewaczGND, Stefanie PöggelerORCiDGND, Nick D. ReadGND, Stephan SeilerGND, Kristina M. SmithGND, Denise ZicklerGND, Ulrich KückORCiDGND, Michael FreitagGND
URN:urn:nbn:de:hbz:294-71540
DOI:https://doi.org/10.1371/journal.pgen.1000891
Parent Title (English):PLoS Genetics
Subtitle (English):\(\textit {Sordaria macrospora}\), a model organism for fungal morphogenesis
Publisher:Public Library of Science
Place of publication:San Francisco
Document Type:Article
Language:English
Date of Publication (online):2020/05/13
Date of first Publication:2010/04/08
Publishing Institution:Ruhr-Universität Bochum, Universitätsbibliothek
Tag:Comparative genomics; Fungal genetics; Fungal genomics; Fungi; Neurospora crassa; Phylogenetic analysis; Repeated sequences; Sequence assembly tools
Volume:6
Issue:4, Artikel e1000891
First Page:e1000891-1
Last Page:e1000891-22
Institutes/Facilities:Lehrstuhl für Allgemeine und Molekulare Botanik
Protein Research Department
open_access (DINI-Set):open_access
Licence (English):License LogoCreative Commons - CC BY 4.0 - Attribution 4.0 International