PhyloPythiaS+: a self-training method for the rapid reconstruction of low-ranking taxonomic bins from metagenomes.
Average rating
Cast your vote
You can rate an item by clicking the amount of stars they wish to award to this item.
When enough users have cast their vote on this item, the average rating will also be shown.
Star rating
Your vote was cast
Thank you for your feedback
Thank you for your feedback
Issue Date
2016
Metadata
Show full item recordAbstract
Background. Metagenomics is an approach for characterizing environmental microbial communities in situ, it allows their functional and taxonomic characterization and to recover sequences from uncultured taxa. This is often achieved by a combination of sequence assembly and binning, where sequences are grouped into 'bins' representing taxa of the underlying microbial community. Assignment to low-ranking taxonomic bins is an important challenge for binning methods as is scalability to Gb-sized datasets generated with deep sequencing techniques. One of the best available methods for species bins recovery from deep-branching phyla is the expert-trained PhyloPythiaS package, where a human expert decides on the taxa to incorporate in the model and identifies 'training' sequences based on marker genes directly from the sample. Due to the manual effort involved, this approach does not scale to multiple metagenome samples and requires substantial expertise, which researchers who are new to the area do not have. Results. We have developed PhyloPythiaS+, a successor to our PhyloPythia(S) software. The new (+) component performs the work previously done by the human expert. PhyloPythiaS+ also includes a new k-mer counting algorithm, which accelerated the simultaneous counting of 4-6-mers used for taxonomic binning 100-fold and reduced the overall execution time of the software by a factor of three. Our software allows to analyze Gb-sized metagenomes with inexpensive hardware, and to recover species or genera-level bins with low error rates in a fully automated fashion. PhyloPythiaS+ was compared to MEGAN, taxator-tk, Kraken and the generic PhyloPythiaS model. The results showed that PhyloPythiaS+ performs especially well for samples originating from novel environments in comparison to the other methods. Availability. PhyloPythiaS+ in a virtual machine is available for installation under Windows, Unix systems or OS X on: https://github.com/algbioi/ppsp/wiki.Citation
PhyloPythiaS+: a self-training method for the rapid reconstruction of low-ranking taxonomic bins from metagenomes. 2016, 4:e1603 PeerJAffiliation
Helmholtz Centre for infection research, Inhoffenstr. 7, D-38124 Braunschweig, Germany.Journal
PeerJPubMed ID
26870609Type
ArticleLanguage
enISSN
2167-8359ae974a485f413a2113503eed53cd6c53
10.7717/peerj.1603
Scopus Count
The following license files are associated with this item:
Related articles
- Taxator-tk: precise taxonomic assignment of metagenomes by fast approximation of evolutionary neighborhoods.
- Authors: Dröge J, Gregor I, McHardy AC
- Issue date: 2015 Mar 15
- Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.
- Authors: Papudeshi B, Haggerty JM, Doane M, Morris MM, Walsh K, Beattie DT, Pande D, Zaeri P, Silva GGZ, Thompson F, Edwards RA, Dinsdale EA
- Issue date: 2017 Nov 28
- The PhyloPythiaS web server for taxonomic assignment of metagenome sequences.
- Authors: Patil KR, Roune L, McHardy AC
- Issue date: 2012
- Large-scale machine learning for metagenomics sequence classification.
- Authors: Vervier K, Mahé P, Tournoud M, Veyrieras JB, Vert JP
- Issue date: 2016 Apr 1
- CAMISIM: simulating metagenomes and microbial communities.
- Authors: Fritz A, Hofmann P, Majda S, Dahms E, Dröge J, Fiedler J, Lesker TR, Belmann P, DeMaere MZ, Darling AE, Sczyrba A, Bremges A, McHardy AC
- Issue date: 2019 Feb 8