Bayesian identification of bacterial strains from sequencing data

dc.contributor.author Ben Pascoe
dc.contributor.author Sion C. Bayliss
dc.contributor.author Edward J. Feil
dc.contributor.author Matthew D. Hitchings
dc.contributor.author Guillaume Méric
dc.contributor.author Samuel K. Sheppard
dc.contributor.author Aravind Sankar
dc.contributor.author Brandon Malone
dc.contributor.author Jukka Corander
dc.contributor.author Antti Honkela
dc.control.author Brandon Malone
dc.control.author Jukka Corander
dc.date.accessioned 2025-06-14T01:44:54Z
dc.date.available 2025-06-14T01:44:54Z
dc.date.issued 2016-08-25
dc.description.abstract <jats:p>Rapidly assaying the diversity of a bacterial species present in a sample obtained from a hospital patient or an environmental source has become possible after recent technological advances in DNA sequencing. For several applications it is important to accurately identify the presence and estimate relative abundances of the target organisms from short sequence reads obtained from a sample. This task is particularly challenging when the set of interest includes very closely related organisms, such as different strains of pathogenic bacteria, which can vary considerably in terms of virulence, resistance and spread. Using advanced Bayesian statistical modelling and computation techniques we introduce a novel pipeline for bacterial identification that is shown to outperform the currently leading pipeline for this purpose. Our approach enables fast and accurate sequence-based identification of bacterial strains while using only modest computational resources. Hence it provides a useful tool for a wide spectrum of applications, including rapid clinical diagnostics to distinguish among closely related strains causing nosocomial infections. The software implementation is available at <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/PROBIC/BIB" xlink:type="simple">https://github.com/PROBIC/BIB</jats:ext-link>.</jats:p>
dc.description.volume 2
dc.identifier.arxiv http://arxiv.org/abs/1511.06546
dc.identifier.doi 10.1099/mgen.0.000075
dc.identifier.doi 10.48550/arxiv.1511.06546
dc.identifier.handle 10138/231150
dc.identifier.issn 2057-5858
dc.identifier.openaire doi_dedup___
dc.identifier.pmc PMC5320594
dc.identifier.pmid 28348870
dc.identifier.uri https://ror.circle-u.eu/handle/123456789/316106
dc.openaire.affiliation University of Oslo
dc.openaire.collaboration 1
dc.publisher Microbiology Society
dc.rights OPEN
dc.rights.license http://creativecommons.org/licenses/by/4.0/http://creativecommons.org/licenses/by/4.0/
dc.source Microbial Genomics
dc.subject staphylococcus aureus
dc.subject DNA, Bacterial
dc.subject FOS: Computer and information sciences
dc.subject strain identification
dc.subject Quantitative Biology - Quantitative Methods
dc.subject Statistics - Applications
dc.subject Humans
dc.subject Quantitative Biology - Genomics
dc.subject Applications (stat.AP)
dc.subject Plant biology, microbiology, virology
dc.subject Quantitative Methods (q-bio.QM)
dc.subject Genomics (q-bio.GN)
dc.subject Computer and information sciences
dc.subject Bacteria
dc.subject pathogenic bacteria
dc.subject Bayes Theorem
dc.subject Sequence Analysis, DNA
dc.subject Bacterial Typing Techniques
dc.subject FOS: Biological sciences
dc.subject probabilistic modelling
dc.subject /dk/atira/pure/subjectarea/asjc/2700/2700; name=General Medicine
dc.subject Genome, Bacterial
dc.subject Software
dc.subject Research Paper
dc.subject.fos 0301 basic medicine
dc.subject.fos 0206 medical engineering
dc.subject.fos 02 engineering and technology
dc.subject.fos 03 medical and health sciences
dc.title Bayesian identification of bacterial strains from sequencing data
dc.type publication

Files

Collections