Machine learning identifies signatures of host adaptation in the bacterial pathogen Salmonella enterica.
Cast your vote
You can rate an item by clicking the amount of stars they wish to award to this item.
When enough users have cast their vote on this item, the average rating will also be shown.
Your vote was cast
Thank you for your feedback
Thank you for your feedback
MetadataShow full item record
AbstractEmerging pathogens are a major threat to public health, however understanding how pathogens adapt to new niches remains a challenge. New methods are urgently required to provide functional insights into pathogens from the massive genomic data sets now being generated from routine pathogen surveillance for epidemiological purposes. Here, we measure the burden of atypical mutations in protein coding genes across independently evolved Salmonella enterica lineages, and use these as input to train a random forest classifier to identify strains associated with extraintestinal disease. Members of the species fall along a continuum, from pathovars which cause gastrointestinal infection and low mortality, associated with a broad host-range, to those that cause invasive infection and high mortality, associated with a narrowed host range. Our random forest classifier learned to perfectly discriminate long-established gastrointestinal and invasive serovars of Salmonella. Additionally, it was able to discriminate recently emerged Salmonella Enteritidis and Typhimurium lineages associated with invasive disease in immunocompromised populations in sub-Saharan Africa, and within-host adaptation to invasive infection. We dissect the architecture of the model to identify the genes that were most informative of phenotype, revealing a common theme of degradation of metabolic pathways in extraintestinal lineages. This approach accurately identifies patterns of gene degradation and diversifying selection specific to invasive serovars that have been captured by more labour-intensive investigations, but can be readily scaled to larger analyses.
AffiliationHIRI, Helmoltz-Institut für RNA-basierteInfektionsforschung, Josef-Schneider-Strasse 2, 97080 Würzburg, Germany.
The following license files are associated with this item:
- Creative Commons
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 3.0 United States
- Genome and transcriptome adaptation accompanying emergence of the definitive type 2 host-restricted Salmonella enterica serovar Typhimurium pathovar.
- Authors: Kingsley RA, Kay S, Connor T, Barquist L, Sait L, Holt KE, Sivaraman K, Wileman T, Goulding D, Clare S, Hale C, Seshasayee A, Harris S, Thomson NR, Gardner P, Rabsch W, Wigley P, Humphrey T, Parkhill J, Dougan G
- Issue date: 2013 Aug 27
- Genomic Analysis of Salmonella enterica Serovar Typhimurium Characterizes Strain Diversity for Recent U.S. Salmonellosis Cases and Identifies Mutations Linked to Loss of Fitness under Nitrosative and Oxidative Stress.
- Authors: Hayden HS, Matamouros S, Hager KR, Brittnacher MJ, Rohmer L, Radey MC, Weiss EJ, Kim KB, Jacobs MA, Sims-Day EH, Yue M, Zaidi MB, Schifferli DM, Manning SD, Walson JL, Miller SI
- Issue date: 2016 Mar 8
- Host-pathogen interaction in invasive Salmonellosis.
- Authors: de Jong HK, Parry CM, van der Poll T, Wiersinga WJ
- Issue date: 2012
- Association between phylogeny, virulence potential and serovars of Salmonella enterica.
- Authors: Litrup E, Torpdahl M, Malorny B, Huehn S, Christensen H, Nielsen EM
- Issue date: 2010 Oct
- High-Resolution Identification of Multiple Salmonella Serovars in a Single Sample by Using CRISPR-SeroSeq.
- Authors: Thompson CP, Doak AN, Amirani N, Schroeder EA, Wright J, Kariyawasam S, Lamendella R, Shariat NW
- Issue date: 2018 Nov 1