<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="other" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">Wellcome Open Res</journal-id>
            <journal-title-group>
                <journal-title>Wellcome Open Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2398-502X</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/wellcomeopenres.14826.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Software Tool Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 2 approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Jolley</surname>
                        <given-names>Keith A.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-0751-0287</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Bray</surname>
                        <given-names>James E.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-3554-4254</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Maiden</surname>
                        <given-names>Martin C. J.</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Funding Acquisition</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-6321-5138</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Zoology, University of Oxford, Oxford, OX1 3PS, UK</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:keith.jolley@zoo.ox.ac.uk">keith.jolley@zoo.ox.ac.uk</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>24</day>
                <month>9</month>
                <year>2018</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2018</year>
            </pub-date>
            <volume>3</volume>
            <elocation-id>124</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>18</day>
                    <month>9</month>
                    <year>2018</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2018 Jolley KA et al.</copyright-statement>
                <copyright-year>2018</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://wellcomeopenresearch.org/articles/3-124/pdf"/>
            <abstract>
                <p>The 
                    <ext-link ext-link-type="uri" xlink:href="https://pubmlst.org/">PubMLST.org</ext-link> website hosts a collection of open-access, curated databases that integrate population sequence data with provenance and phenotype information for over 100 different microbial species and genera.  Although the PubMLST website was conceived as part of the development of the first multi-locus sequence typing (MLST) scheme in 1998 the software it uses, the Bacterial Isolate Genome Sequence database (BIGSdb, published in 2010), enables PubMLST to include all levels of sequence data, from single gene sequences up to and including complete, finished genomes.  Here we describe developments in the BIGSdb software made from publication to June 2018 and show how the platform realises microbial population genomics for a wide range of applications.  The system is based on the gene-by-gene analysis of microbial genomes, with each deposited sequence annotated and curated to identify the genes present and systematically catalogue their variation.  Originally intended as a means of characterising isolates with typing schemes, the synthesis of sequences and records of genetic variation with provenance and phenotype data permits highly scalable (whole genome sequence data for tens of thousands of isolates) means of addressing a wide range of functional questions, including: the prediction of antimicrobial resistance; likely cross-reactivity with vaccine antigens; and the functional activities of different variants that lead to key phenotypes.  There are no limitations to the number of sequences, genetic loci, allelic variants or schemes (combinations of loci) that can be included, enabling each database to represent an expanding catalogue of the genetic variation of the population in question.  In addition to providing web-accessible analyses and links to third-party analysis and visualisation tools, the BIGSdb software includes a RESTful application programming interface (API) that enables access to all the underlying data for third-party applications and data analysis pipelines.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Database</kwd>
                <kwd>population annotation</kwd>
                <kwd>evolution</kwd>
                <kwd>epidemiology</kwd>
                <kwd>public health</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1" xlink:href="http://dx.doi.org/10.13039/100004440">
                    <funding-source>Wellcome Trust</funding-source>
                    <award-id>104992</award-id>
                </award-group>
                <award-group id="fund-2" xlink:href="http://dx.doi.org/10.13039/100011199">
                    <funding-source>FP7 Ideas: European Research Council</funding-source>
                    <award-id>FP7-278864-2</award-id>
                </award-group>
                <funding-statement>Development of PubMLST and BIGSdb has been supported by a Wellcome Trust Biomedical Resource Grant (104992). Design and implementation of the RESTful API has been further supported by the European Community grant FP7-278864-2 (PathoNgenTrace, http://www.patho-ngen-trace.eu/).</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Our ability to study complex phenotypes, i.e. those that depend on the interactions of multiple components of an organism and its environment, have been enhanced during the past 20 years by the very large increases in our capacity to collect and analyse biological information.  Amongst the most important of these developments have been very high-throughput sequencing methods and the informatics approaches required to interpret the large volumes of data that they generate; however, at the time of writing, there remain major challenges in realising the potential of the opportunities presented by such developments
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>.  Specifically, these data must be stored, organised, curated, interpreted, analysed, and disseminated in a usable way.  Each of these steps requires a sustainable infrastructure and need to be achieved in line with appropriate standards of accuracy and openness
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>, while meeting ethical and legislative requirements of confidentiality and data ownership
                <sup>
                    <xref ref-type="bibr" rid="ref-3">3</xref>,
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>.   The 
                <ext-link ext-link-type="uri" xlink:href="https://pubmlst.org/">PubMLST.org</ext-link> databases
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>, which are powered by the 
                <ext-link ext-link-type="uri" xlink:href="https://pubmlst.org/software/database/bigsdb/">Bacterial Isolate Genome Sequence Database</ext-link> (BIGSdb) software
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup>, represent an approach to meeting these goals for the analysis of microorganisms, especially bacterial pathogens.</p>
            <p>The PubMLST.org databases employ a bacterial population genomics approach to this problem
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>,
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup>.   Population genomics combines the concepts of population genetics with genome-wide sequence data, to infer the links between phenotype and genotype synthesising evolutionary and functional analyses
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup>.  This powerful paradigm requires an ability to link population-wide information on genome sequence data with information on the provenance (time and place) and phenotype (behaviour) of the organism in question.  It is especially suited to resolving complex phenotypes
                <sup>
                    <xref ref-type="bibr" rid="ref-10">10</xref>
                </sup>, such as virulence and antibiotic resistance in bacterial pathogens, as many of these cannot be fully elucidated by 
                <italic toggle="yes">in vitro</italic> reductionist investigations that rely on single laboratory organisms
                <sup>
                    <xref ref-type="bibr" rid="ref-11">11</xref>
                </sup>.  One of the first practical implementations of bacterial population genomics was multi-locus sequence typing (MLST).  MLST indexed the sequences of multiple, but few (six or seven), housekeeping gene fragments to identify bacterial genotypes and associate them with biological properties, for example the propensity to cause invasive disease
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-14">14</xref>
                </sup>.  The approach was later complemented by the analysis of genes encoding particular functions, such as vaccine antigens
                <sup>
                    <xref ref-type="bibr" rid="ref-15">15</xref>
                </sup>, and was ultimately expanded to enable the inclusion of whole genome sequence (WGS) data by the development of the BIGSdb platform in 2010
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup> (
                <xref ref-type="fig" rid="f1">Figure 1</xref>).  The gene-by-gene approach exemplified by MLST is inherently scalable with respect to the number of loci and individual organisms included
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup> and the BIGSdb platform has been continually developed and extended to provide additional functionality.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>MLST comes of age: 21 years of population genomics.</title>
                    <p>The PubMLST website (
                        <ext-link ext-link-type="uri" xlink:href="https://pubmlst.org">https://pubmlst.org</ext-link>) has been running for 15 years, having been established under the pubmlst.org domain in 2003. Its immediate progenitor was the original MLST database set up to support the 
                        <italic toggle="yes">Neisseria</italic> scheme
                        <sup>
                            <xref ref-type="bibr" rid="ref-12">12</xref>,
                            <xref ref-type="bibr" rid="ref-13">13</xref>
                        </sup>, the first MLST scheme developed in 1998. The initial role of the site was to host the nomenclature and isolate collection records for typing schemes, but it was rapidly opened to the wider community, hosting schemes of other organisms
                        <sup>
                            <xref ref-type="bibr" rid="ref-14">14</xref>
                        </sup>. Shortly afterwards, other sites began hosting MLST schemes, the most prominent of which was mlst.net
                        <sup>
                            <xref ref-type="bibr" rid="ref-43">43</xref>
                        </sup>, at Imperial College, along with others at the University of Cork, Ireland later migrated to University of Warwick and subsumed within the Enterobase platform
                        <sup>
                            <xref ref-type="bibr" rid="ref-44">44</xref>
                        </sup>, and the Pasteur Institute, Paris, France.  Early generations of software developed to support the databases
                        <sup>
                            <xref ref-type="bibr" rid="ref-13">13</xref>&#x2013;
                            <xref ref-type="bibr" rid="ref-15">15</xref>
                        </sup> were limited to specific loci defined for a single typing scheme specified in their configuration. With extensive WGS data in prospect, in 2008 work started on a platform designed to flexibly handle genomic data utilizing any number of loci and typing schemes. The resulting Bacterial Isolate Genome Sequence Database (BIGSdb) platform
                        <sup>
                            <xref ref-type="bibr" rid="ref-6">6</xref>
                        </sup> has been used since then to host databases on PubMLST as well as being used for the databases hosted at the Pasteur Institute. It has been under constant development since. In 2016, the databases hosted on mlst.net were migrated to PubMLST, with the result that most MLST schemes are now hosted using the same platform (the major exceptions being 
                        <italic toggle="yes">Salmonella</italic> and 
                        <italic toggle="yes">Escherichia coli</italic>, hosted on Enterobase, although these schemes are mirrored on PubMLST).</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure1.gif"/>
            </fig>
            <p>The volume of sequence data stored in the PubMLST.org databases has increased greatly with the advent of affordable WGS determination employing &#x2018;next generation sequencing&#x2019; (NGS) platforms (
                <xref ref-type="fig" rid="f2">Figure 2</xref>)
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>
                </sup>.   At the time of writing, PubMLST hosted databases for over 100 species or genera, mainly bacteria, but also included some schemes for eukaryotes and plasmids.  In principle, the BIGSdb platform can be used for any organism or virus.  Many of these databases contain whole genome sequences linked to standard (seven locus) or higher resolution multilocus typing schemes such as ribosomal MLST (rMLST)
                <sup>
                    <xref ref-type="bibr" rid="ref-18">18</xref>
                </sup> or core genome MLST (cgMLST)
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup> (
                <xref ref-type="table" rid="T1">Table 1</xref>).  In total, there were more than 300,000 submitted isolate records and 100,000 genome assemblies in these databases (
                <xref ref-type="fig" rid="f2">Figure 2</xref>), with approximately 125 curators and 2000 active data submitters with submissions from across the world (
                <xref ref-type="fig" rid="f3">Figure 3</xref>). The site also hosts the rMLST databases
                <sup>
                    <xref ref-type="bibr" rid="ref-18">18</xref>
                </sup>, providing species identification and analysis tools, with approximately 15,000 unique visitors and over a million page views a month. The BIGSdb platform has been cited 990 times (August 2018, source Google Scholar).</p>
            <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                <label>Figure 2. </label>
                <caption>
                    <title>Submission of isolate records and genomes to the PubMLST species/genus-specific databases over time.</title>
                    <p>Prior to 2012, most submissions consisted of provenance metadata along with MLST results and a few antigen sequence designations. Since then, the proportion of submissions that include whole genome assemblies has continually increased. The apparent dip in isolates without genomes which occurred around 2014 was due to genome assemblies being added to existing records that had been submitted previously with just MLST results.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure2.gif"/>
            </fig>
            <table-wrap id="T1" orientation="portrait" position="anchor">
                <label>Table 1. </label>
                <caption>
                    <title>Curated data available in the ten largest databases within PubMLST (accessed July 2018; ordered by number of genomes).</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Database</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Isolates</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Genomes</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Loci</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Alleles</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Submitters</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">rMLST</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">223,292</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">223,292</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">53</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,350,997</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">-</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Staphylococcus aureus</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">33,670</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">25,762</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,215</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">593,996</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">547</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Campylobacter jejuni/coli</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">69,416</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">25,448</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,996</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">880,953</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">278</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Neisseria spp.</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">46,985</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">16,511</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,979</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,170,477</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">343</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Streptococcus pneumoniae</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">40,252</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">9,010</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">7</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">4,498</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">347</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Streptococcus agalactiae</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">4,228</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,844</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,072</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">115,266</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">66</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Pseudomonas aeruginosa</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">6,528</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,382</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">186</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,598</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">169</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Bordetella spp.</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,625</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,112</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1,460</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">35,885</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">12</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Acinetobacter baumannii</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3,903</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">936</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">16</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,527</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">166</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <italic toggle="yes">Burkholderia cepacia</italic>
                                <break/>
                                <italic toggle="yes">complex</italic>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2,791</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">651</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">7</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">4,443</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">61</td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
            <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                <label>Figure 3. </label>
                <caption>
                    <title>Global submissions to the PubMLST databases.</title>
                    <p>Isolate records submitted to individual databases represent almost every country in the world. There are approximately 125 curators handling submissions from over 2000 active data submitters across all the species- and genus-specific databases hosted on PubMLST.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure3.gif"/>
            </fig>
            <p>To facilitate the indexing of variation across thousands of loci within each deposited genome sequence (
                <xref ref-type="fig" rid="f4">Figure 4</xref>)
                <sup>
                    <xref ref-type="bibr" rid="ref-19">19</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>, BIGSdb consists of two distinct database structures: (i) an isolate (or &#x2018;specimen&#x2019;) database that hosts provenance and genome sequence information for each sample; and, (ii) a sequence definition database that contains allelic identifiers and profiles, which provides annotation and a genetic nomenclature
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup>.  Unified, curated nomenclature is essential for studies that involve very large numbers of specimens and multiple genetic loci.   This separation of roles was an early design decision, reflecting the gene and allele-based paradigm used and facilitates the use of BIGSdb as a nomenclature server. Consequently, it is possible to have individual BIGSdb isolate databases acting as clients to multiple sequence definition databases and these may, in turn, each serve multiple isolate databases, creating a federated network of interconnected data resources. </p>
            <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                <label>Figure 4. </label>
                <caption>
                    <title>Analysis pipeline for short read data.</title>
                    <p>PubMLST and BIGSdb, its underlying genomics platform, links provenance metadata with allelic sequence variation found in corresponding whole genome assemblies or sequences derived from Sanger sequencing reactions. Population annotation, the process of assigning precise variant information for loci across the genomes of large numbers of bacterial isolates, creates a structured dataset that can be used to address a range of biological questions beyond epidemiology.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure4.gif"/>
            </fig>
            <p>Each database in the system can have different access restrictions, with different users having roles and access rights defined by a fine-grained permissions system.  This enables migration between pre-publication analyses and published data.  Pre-publication data can remain private to the owner until publication but then made open-access, whilst retaining the same database identifier.  As the private and the open-access data reside in the same database, it is straightforward for pre-publication or confidential data to be analysed in the context of published data.  This is especially important where it may not be possible to make some or all of the data public for regulatory, legal, or public policy reasons, for example during the live investigation of disease outbreaks.  All information stored within the databases is accessible to third-party applications via a RESTful application programming interface (API), depending on access restrictions
                <sup>
                    <xref ref-type="bibr" rid="ref-27">27</xref>
                </sup>. </p>
            <p>The hosting of typing and nomenclature information remains a central role of the PubMLST.org databases
                <sup>
                    <xref ref-type="bibr" rid="ref-28">28</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-37">37</xref>
                </sup> and the principle curated schemes available include: (i) conventional seven-locus MLST schemes
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup>, implemented in all databases; (ii) cgMLST schemes, for example those available for 
                <italic toggle="yes">Neisseria meningitidis</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-38">38</xref>
                </sup>, 
                <italic toggle="yes">Campylobacter coli</italic> and 
                <italic toggle="yes">Campylobacter jejuni</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-39">39</xref>
                </sup>; and (iii) rMLST
                <sup>
                    <xref ref-type="bibr" rid="ref-18">18</xref>
                </sup>, available for over 223,000 bacterial specimens spanning more than 8500 species.  In addition, a number of schemes for particular vaccine formulations, for example the meningococcal Bexsero Antigen Sequence Type (BAST)
                <sup>
                    <xref ref-type="bibr" rid="ref-40">40</xref>
                </sup> or antibiotic resistance
                <sup>
                    <xref ref-type="bibr" rid="ref-19">19</xref>,
                    <xref ref-type="bibr" rid="ref-41">41</xref>,
                    <xref ref-type="bibr" rid="ref-42">42</xref>
                </sup> are available.  Although whole genome sequence data are increasingly used in the resolution of outbreaks, nomenclatures based on a subset of loci remain crucial for the development of stable hierarchical typing and nomenclature schemes used to categorise genotypes and compare them globally
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>,
                    <xref ref-type="bibr" rid="ref-45">45</xref>
                </sup>.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <sec>
                <title>Implementation</title>
                <p>PubMLST functionality is provided by the BIGSdb web application, which is written in object-oriented Perl and Javascript utilizing a PostgreSQL backend
                    <sup>
                        <xref ref-type="bibr" rid="ref-6">6</xref>,
                        <xref ref-type="bibr" rid="ref-46">46</xref>
                    </sup>. It runs under the Apache web server software on Linux.  The API is written as a Perl Dancer2 application interfacing with the same program libraries as the web application, run using the Starman high-performance PSGI/Plack web server
                    <sup>
                        <xref ref-type="bibr" rid="ref-27">27</xref>
                    </sup>.</p>
            </sec>
            <sec>
                <title>Operation</title>
                <p>The PubMLST website can be accessed from any platform using a modern web browser supporting Javascript. The underlying BIGSdb software can be locally installed on a Linux machine with as little as 4GB RAM and a single processor, but at least 4 processor cores and 16 GB RAM are recommended.</p>
                <p>
                    <bold>
                        <italic toggle="yes">Data access</italic>.</bold> Both manual and automated access to data are available.  Most end-users interact with PubMLST using the web interface (
                    <ext-link ext-link-type="uri" xlink:href="https://pubmlst.org">https://pubmlst.org</ext-link>), through which all functionality is available.  Querying of sequences can be performed by either pasting in data or uploading files for analysis via web forms. Complex queries can be constructed by combining search elements on a user-modifiable form and results further analysed using integrated plugins.  Context-sensitive help is available within the interface linking to specific sections within an online users&#x2019; guide (
                    <ext-link ext-link-type="uri" xlink:href="http://bigsdb.readthedocs.io">http://bigsdb.readthedocs.io</ext-link>).  All data hosted within PubMLST is also accessible via the API, which enables machine-to-machine direct data exchange without user input
                    <sup>
                        <xref ref-type="bibr" rid="ref-27">27</xref>
                    </sup>.  Most methods are discoverable from the root entry point (
                    <ext-link ext-link-type="uri" xlink:href="http://rest.pubmlst.org/">http://rest.pubmlst.org/</ext-link>) and are fully documented (
                    <ext-link ext-link-type="uri" xlink:href="http://bigsdb.readthedocs.io/en/latest/rest.html">http://bigsdb.readthedocs.io/en/latest/rest.html</ext-link>).  The API is frequently used to synchronize allele and profile definitions (nomenclature), but it also supports authenticated data submissions to curator queues, isolate dataset querying and extraction of typing designations from uploaded genome sequences. This latter functionality facilitates incorporating PubMLST allele calling or use of species identification functionality from the rMLST database, directly in to an analysis pipeline (
                    <xref ref-type="other" rid="SF1">Supplementary File 1</xref>).</p>
                <p>
                    <bold>
                        <italic toggle="yes">Annotation</italic>.</bold> BIGSdb works exclusively with assembled nucleotide sequences, although these need not be single contiguous sequences and can range from single gene sequences through MLST datasets and draft genome assemblies to complete, finished genomes present as single sequences.  Therefore, genomic data present in the sequence read archives must be assembled before deposition into the database.  Once these sequences have been deposited, the identification of genes at specific locations (loci) within the deposited sequence data and their precise allelic variant, is central to the BIGSdb approach to annotation. This process is referred to as allele calling and is divided into three stages.</p>
                <p>The first stage is the identification and cataloguing of the loci (i.e
                    <italic toggle="yes">.</italic> genes) present within the group of organisms that are being analysed in a given database.  A variety of means of identifying genes within assembled bacterial genomes are available, such as 
                    <ext-link ext-link-type="uri" xlink:href="http://www.vicbioinformatics.com/software.prokka.shtml">PROKKA</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-47">47</xref>
                    </sup>. Many bacterial species now have an annotated reference genome
                    <sup>
                        <xref ref-type="bibr" rid="ref-48">48</xref>
                    </sup> and this can be used to seed the database with genes known to be present in that species.  As the majority of bacteria have &#x2018;open genomes&#x2019;, a complete catalogue of the &#x2018;pan genome&#x2019;, that is all genes that are available to that group of bacteria
                    <sup>
                        <xref ref-type="bibr" rid="ref-49">49</xref>
                    </sup>, requires a continual process of annotation of novel genomes as they are deposited, with an ever-expanding catalogue of genes known to be present in that group of organisms.  To this end, BIGSdb has no limit on the number of loci that can be stored in a database other than the storage capacity of the host computer.  BIGSdb indexes each gene, which is normally synonymous with one locus in a bacterial genome, with a unique identifier that can be associated with any number of names that have been previously used.  This is essential as individual annotations in bacteria from the same or related species often have incompatible names.  For example, in the PubMLST 
                    <italic toggle="yes">Neisseria</italic> database a NEIS number is assigned in order of discovery to each unique gene in the database, which is linked to other names that have been used.  As novel genes are identified by the on-going process of population annotation, new NEIS numbers are defined.  In this way, the pan genome of the organisms in question (in this case the genus 
                    <italic toggle="yes">Neisseria</italic> as the PubMLST 
                    <italic toggle="yes">Neisseria</italic> database is genome wide) is catalogued, and a universal gene and locus nomenclature established and maintained.</p>
                <p>The second stage is identifying the presence and location of the known genes within deposited genome data (locus identification). BIGSdb employs different strategies for this, depending on the number of loci being annotated at a given time. The most straightforward method performs a BLAST
                    <sup>
                        <xref ref-type="bibr" rid="ref-50">50</xref>
                    </sup> query of the genome sequence against a database of all known alleles for each gene in turn. This is satisfactory if the results for only a few genes are being indexed, but if analysing more loci, for example for a pan-genome or cgMLST analysis where perhaps 1500 or more loci are being catalogued simultaneously, this is a bottleneck that is increasingly time-consuming as more alleles are defined. Therefore, for routine calling, the process is facilitated by reducing the BLAST search space using &#x2018;exemplar alleles&#x2019;.  These are defined such that every known allele for a given gene is within 10% sequence identity of an exemplar allele of the same length. The BLAST query is then performed against a database of exemplar alleles for all loci together, which identifies the location of the loci and allows the individual locus sequences to be extracted and their allelic identities to be determined by a database lookup. This method is much more efficient and allows PubMLST to scan more than 1000 bacterial genomes for a 2000 loci cgMLST scheme within one hour on a single server using 32 cores.</p>
                <p>The third stage of population annotation is identifying and defining the alleles present at each locus.  Most of this process is automated but it requires curator oversight if a new sequence is too different from an existing allele.  Again, two different strategies are employed: (i) using all existing alleles in the search; or (ii) using exemplar or type alleles to constrain the search space and prevent definition of new alleles that are sufficiently divergent to the original type allele to warrant the definition of a new gene. The first strategy is more sensitive for use when developing schemes and for research purposes whereas the second is likely to be employed routinely for schemes used for surveillance or public health purposes.  For cgMLST or pan-genome automated allele assignment, new allele sequences must be within 98% identity and 98% total length of a known allele, contain terminal in-frame start and stop codons and have no internal stop codons. With type alleles employed, the allowed percentage identity difference will be increased.</p>
                <p>As each database comprises multiple genetic loci, the BIGSdb software has the capacity to organise these loci into any number of &#x2018;schemes&#x2019;.  A scheme is a group of genes (loci) that are grouped together for a particular purpose.  Any locus can, in principle, be included in any scheme and there can be any number of schemes of any size.  Schemes can be: (i) purely &#x2018;typological&#x2019;, for example conventional seven-locus MLST schemes
                    <sup>
                        <xref ref-type="bibr" rid="ref-51">51</xref>
                    </sup>, or (ii) directed to a particular biological or biochemical function, for example the Entner-Doudoroff pathway scheme developed for 
                    <italic toggle="yes">Campylobacter</italic> species
                    <sup>
                        <xref ref-type="bibr" rid="ref-52">52</xref>
                    </sup>; or combine both functional and typing roles, such as the ribosomal MLST (rMLST) scheme
                    <sup>
                        <xref ref-type="bibr" rid="ref-18">18</xref>
                    </sup>.   Schemes enable hierarchical and functional analyses with easy access to defined genetic variation, providing a facile way of extracting specific sequence diversity information from large datasets rapidly and conveniently.</p>
                <p>
                    <bold>
                        <italic toggle="yes">Core genome MLST profile definitions and clustering</italic>.</bold> The bacterial core genome was originally conceived as those functional genes present in every member of a given group of related microorganisms
                    <sup>
                        <xref ref-type="bibr" rid="ref-53">53</xref>
                    </sup>; however, for practical purposes it is desirable to use a more relaxed definition.  This is because &#x2018;essential&#x2019; core genes may be absent from a given WGS data set for several reasons including: (i) the organism from which it is generated is a rare mutant lacking a gene normally present in all members of the species; and (ii) technical issues due to incomplete genome assembly. Both reasons would lead to an ever-decreasing core genome, as isolates without the full complement of &#x2018;core&#x2019; genes accumulate.  Consequently, bacterial cgMLST schemes, which need to be stable if they are to form the basis of nomenclature, commonly include genes present in 95% or more of WGSs
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>,
                        <xref ref-type="bibr" rid="ref-39">39</xref>,
                        <xref ref-type="bibr" rid="ref-54">54</xref>,
                        <xref ref-type="bibr" rid="ref-55">55</xref>
                    </sup>.  Most isolates analysed with a given cgMLST scheme will therefore have some missing loci. To accommodate this, cgMLST profiles are usually defined with some missing loci, i.e
                    <italic toggle="yes">.</italic> where a locus position is marked with an &#x2018;N&#x2019; instead of an allele number. In pairwise comparisons, this will match to any allele, effectively removing the locus from that comparison, so the number of allowed missing loci needs to be kept at a low level in order to maintain resolution.  The cgMLST schemes hosted on PubMLST commonly allow profile definition with up to 50 missing loci. Loci that are frequently absent from many assemblies, possibly due to them containing regions of low complexity longer than the sequence read length, are generally removed from a scheme to minimize missing data.</p>
                <p>Once allelic profiles have been defined from WGS data, they can be clustered to identify groups of similar isolates that are likely to share a common ancestor. BIGSdb supports clustering of cgMLST schemes using a single-linkage model with multiple defined thresholds of allelic differences. These thresholds are chosen empirically and depend on the organism and the number of loci used in the scheme, but may range from 200 or more, to identify major lineages, down to 5 or fewer, suitable for identifying point source outbreaks.</p>
                <p>
                    <bold>
                        <italic toggle="yes">Comparative genomics.</italic>
                    </bold> The BIGSdb platform incorporates a comparative genomics plugin called Genome Comparator. This performs rapid gene-by-gene pairwise comparisons of up to 1000 genomes using either: (i) loci defined in the database, generating MLST profiles consisting of the chosen loci, which can be any defined scheme or user-selected collection of loci; or (ii) using an annotated reference genome, or simply a FASTA file of sequences, as a source of comparator sequences, producing an 
                    <italic toggle="yes">ad hoc</italic> whole genome MLST analysis.  A pairwise distance matrix generated from these profiles is then used to generate a NeighborNet analysis
                    <sup>
                        <xref ref-type="bibr" rid="ref-56">56</xref>
                    </sup> using the SplitsTree software package
                    <sup>
                        <xref ref-type="bibr" rid="ref-57">57</xref>
                    </sup> to visualise the relationships among the isolates being analysed. This is a rapid process which is suitable for identifying clusters of related organisms, for example in a disease outbreak scenario, or to find shared genetic variants when performing functional studies.  Alternatively, with the alignment option selected, aligned sequences for each locus are concatenated to produce whole genome coding sequence alignments to be used for analysis in third party applications to investigate phylogenetic relationships among more distantly related isolates
                    <sup>
                        <xref ref-type="bibr" rid="ref-58">58</xref>
                    </sup>. As well as using genomes already in the database, it is also possible to include uploaded private genomes for analysis in context with public datasets.</p>
                <p>Minimum-spanning trees can be constructed from allelic profiles generated from any set of loci using a plugin that exports to an integrated GrapeTree implementation
                    <sup>
                        <xref ref-type="bibr" rid="ref-59">59</xref>
                    </sup>. GrapeTree was specifically designed to visualise tens of thousands of whole genome MLST (wgMLST) profiles and reconstruct relationships despite missing data.  Metadata, including provenance or scheme fields (such as ST or clonal complex) can be included in the analysis and used to colour nodes, producing publication quality output.</p>
                <p>
                    <bold>
                        <italic toggle="yes">Links to third-party analysis tools</italic>.</bold> PubMLST provides rich, structured datasets that can be combined with other data. Other web services provide visualisation tools and have implemented APIs so that data can be programmatically uploaded to them and various BIGSdb plugins are now available that can select datasets, link to appropriate metadata fields, and send them to these sites for analysis (
                    <xref ref-type="fig" rid="f5">Figure 5</xref>). As an alternative to the integrated GrapeTree implementation, minimum spanning trees can be analysed using a plugin that uploads to 
                    <ext-link ext-link-type="uri" xlink:href="https://online.phyloviz.net/">PHYLOViZ Online</ext-link>
                    <sup>
                        <xref ref-type="bibr" rid="ref-60">60</xref>
                    </sup>. Phylogenetic trees can be generated from concatenated aligned sequences from any selected loci or set of loci defined by a scheme, and visualised with metadata overlays in Interactive Tree of Life
                    <sup>
                        <xref ref-type="bibr" rid="ref-61">61</xref>
                    </sup> or combined with geographical and temporal data for visualisation in the 
                    <ext-link ext-link-type="uri" xlink:href="https://microreact.org/showcase">Microreact</ext-link> software
                    <sup>
                        <xref ref-type="bibr" rid="ref-62">62</xref>
                    </sup>.</p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>Figure 5. </label>
                    <caption>
                        <title>Data flow overview.</title>
                        <p>Individuals may have different and overlapping roles: Users query and analyse data; submitters upload data for nomenclature assignment and inclusion in databases; and curators assign allele and profile identifiers, check metadata, upload genomes and perform allele calling (largely automated with manual oversight). Interaction with the PubMLST BIGSdb databases is via the web interface or RESTful API. Analysis of datasets returned by a query can be performed using integrated tools or forwarded to third party sites using their APIs to upload results in their required formats.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure5.gif"/>
                </fig>
            </sec>
        </sec>
        <sec>
            <title>Use cases</title>
            <sec>
                <title>Case 1: Extracting typing information from a local genome file</title>
                <p>PubMLST has been used to identify sequence variants for molecular typing since its inception. When most sequencing was still performed on individual genes by Sanger methods, it was necessary to assemble forward and reverse trace files and then compare each locus assembly against known variants. This can still be done on a per-locus basis and if an exact match is not found then the most similar allele is identified along with a list of nucleotide differences, which can be manually checked to see whether the allele is novel. With whole genome data, the rapid identification and classification of microbes is considerably more streamlined. A locally assembled genome sequence can be queried against all loci of interest rapidly
                    <sup>
                        <xref ref-type="bibr" rid="ref-8">8</xref>
                    </sup>. This can be performed via the web interface by either pasting assembled contigs in to a search box, or by choosing to upload a FASTA file containing the contigs (
                    <xref ref-type="fig" rid="f6">Figure 6A</xref>). Any defined scheme, such as MLST, can be selected from a drop-down box, and the analysis run by clicking the submit button (
                    <xref ref-type="fig" rid="f6">Figure 6B</xref>). The analysis usually takes about a second to perform. In the case of MLST, individual allelic matches will be identified along with the ST if the combination of alleles has been previously defined. Alternatively, this analysis can be performed via the API allowing it to be incorporated directly in to a local analysis pipeline (
                    <xref ref-type="other" rid="SF1">Supplementary File 1</xref>).</p>
                <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                    <label>Figure 6. </label>
                    <caption>
                        <title>Extracting typing information from a local genome file.</title>
                        <p>Typing information can be readily extracted from whole genome sequence assemblies using the sequence query page. (
                            <bold>A</bold>) Genome assembly contigs are either pasted in to the sequence query form and the required scheme or locus (in this case, MLST) selected. (
                            <bold>B</bold>) Any locus exact matches are displayed and, if this corresponds to a defined combination of alleles, the profile definition (ST/clonal complex for MLST) is displayed.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure6.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Case 2: Investigating population structure</title>
                <p>Bacterial populations often comprise distinct lineages of related genotypes that exhibit specific phenotypic characteristics that persist over time. These are shaped by mutation, horizontal genetic transfer and selection from interactions with, for example, the host immune system. Understanding the population structure is therefore important for epidemiology and public health intervention as well as for addressing more fundamental questions concerning evolution, persistence, and adaptation. The population structure of a microorganism can be represented using a cgMLST scheme and the integrated GrapeTree plugin to generate a minimum-spanning tree that can be overlaid with any metadata stored in the database. This provides an intuitive means to investigate and present the clustering of genotypes and how phenotypic or provenance characteristics map to these clusters. For example, performing a query in the 
                    <italic toggle="yes">Neisseria</italic> database for any isolate record where the species is &#x2018;
                    <italic toggle="yes">Neisseria meningitidis</italic>&#x2019; with a sequence bin containing &gt;2 Mbp data (indicating a complete genome sequence), identifies just over 12,000 records. By selecting the GrapeTree plugin, and analysis by cgMLST, an interactive minimum-spanning tree can be produced with nodes coloured by user-selectable criteria such as country, year of isolation or defined clonal complex designation (
                    <xref ref-type="fig" rid="f7">Figure 7</xref>). Since the tree can be built using any combination of loci or schemes, and annotated with any set of metadata or field values determined by scheme (combinations of loci), datasets can be readily explored and relationships between genotype and phenotype identified. Analysis of a dataset of this size takes approximately 30 minutes to perform.</p>
                <fig fig-type="figure" id="f7" orientation="portrait" position="float">
                    <label>Figure 7. </label>
                    <caption>
                        <title>GrapeTree minimum-spanning tree of 
                            <italic toggle="yes">Neisseria meningitidis</italic> genomes (n=12,179) differentiated by cgMLST (1605 loci).</title>
                        <p>A minimum-spanning tree based on allelic profiles can be generated using isolate records returned from any query and any selected scheme or group of loci. The dataset was selected by searching for species &#x2018;
                            <italic toggle="yes">Neisseria meningitidis</italic>&#x2019; with an attached sequence bin size of &gt;2 Mbp, indicative of a complete genome. Nodes are coloured by clonal complex as defined by classical MLST (7 locus), indicating strong concordance between typing schemes. Branches shorter than 150 loci are collapsed.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure7.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Case 3: Surveillance of vaccine coverage over time</title>
                <p>Many vaccines target expressed proteins that exhibit natural sequence variation within the bacterial population as a result of interactions with the host immune system. Over time, this may result in vaccines becoming less effective as immune pressure selects against strains with cross-reactive antigens and new lineages or variants take their place. It is therefore important that ongoing surveillance is performed to detect such changes in the bacterial population. The BIGSdb platform is flexible in how loci are defined and it allows small antigenic peptide sequences, such as those expressed on surface-exposed loops of proteins, to be used in addition to the more commonly used nucleotide sequences of complete genes. Schemes can, therefore, be defined that include only the antigen sequences of the protein components of a vaccine formulation, and variants of these defined as for any other sequence. One such scheme is the meningococcal Bexsero Antigen Sequence Type (BAST) scheme
                    <sup>
                        <xref ref-type="bibr" rid="ref-40">40</xref>
                    </sup> that is being used to survey structured datasets, including carriage studies, in order to provide early warning of changes in the population of meningococci that might result in reduced vaccine efficacy
                    <sup>
                        <xref ref-type="bibr" rid="ref-63">63</xref>
                    </sup>.</p>
            </sec>
            <sec>
                <title>Case 4: Spatio-phylogenetic analysis</title>
                <p>The ability to plot the provenance of an isolate on a geographical map and relate this to its genotypic placement in a phylogenetic tree can be useful when investigating structuring of outbreaks or the global spread of clones. Microreact is a web tool developed to produce these visualisations and the site provides a means for data to be automatically uploaded from other resources, such as PubMLST
                    <sup>
                        <xref ref-type="bibr" rid="ref-62">62</xref>
                    </sup>. This can be demonstrated using data from a recent paper describing the development of a MLST scheme for 
                    <italic toggle="yes">Dichelobacter nodosus</italic>, the causative agent of ovine footrot, that indicated that the global bacterial population was geographically structured
                    <sup>
                        <xref ref-type="bibr" rid="ref-36">36</xref>
                    </sup>. All isolates that included a genome record in the 
                    <ext-link ext-link-type="uri" xlink:href="https://pubmlst.org/dnodosus/">
                        <italic toggle="yes">D. nodosus</italic> PubMLST database (n=171)</ext-link> were selected and the Microreact analysis run using cgMLST loci. This generated a concatenated genome-wide alignment from which a Neighbor-joining tree was reconstructed automatically and uploaded to the Microreact web site
                    <sup>
                        <xref ref-type="bibr" rid="ref-62">62</xref>
                    </sup> along with associated metadata. The geographical structuring of clades can be seen in the analysis (
                    <xref ref-type="fig" rid="f8">Figure 8</xref>).</p>
                <fig fig-type="figure" id="f8" orientation="portrait" position="float">
                    <label>Figure 8. </label>
                    <caption>
                        <title>Spatio-phylogenetic analysis of the global 
                            <italic toggle="yes">Dichelobacter nodosus</italic> population.</title>
                        <p>The geographical distribution of clades of 
                            <italic toggle="yes">Dichelobacter nodosus</italic> was demonstrated by analysing all genomes in the database (n=171) using the BIGSdb Microreact plugin with cgMLST loci. This created a concatenated alignment of core genes which was used to generate a Neighbor-joining tree that was automatically uploaded to the Microreact website with accompanying metadata for visualization.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure8.gif"/>
                </fig>
            </sec>
        </sec>
        <sec sec-type="discussion">
            <title>Discussion</title>
            <p>Population-scale WGS data for bacterial genomes has a wide range of applications, but is most powerful when combined with other data
                <sup>
                    <xref ref-type="bibr" rid="ref-64">64</xref>
                </sup>.  Whilst providing cost-effective access to very high volumes of sequence data
                <sup>
                    <xref ref-type="bibr" rid="ref-65">65</xref>
                </sup>, the explosion of bacterial genome data generated by NGS technologies has presented challenges, first in the volume of data generated
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>
                </sup> and second in the nature of those data, in that they usually result in incomplete &#x2018;high-quality draft&#x2019; genome sequences
                <sup>
                    <xref ref-type="bibr" rid="ref-66">66</xref>
                </sup>. The gene-by-gene approach enshrined in the PubMLST/BIGSdb platform enables a hierarchical, question-driven approach to the analysis of WGS data, founded on population genomic principles
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup>.  This has the additional advantage that it is backwards and forwards compatible with single or other multiple locus analyses and isolate characterisation schemes. In addition, isolates with partial sequence data (e.g. those that have been characterised with conventional MLST) are easily included by choosing the appropriate analysis scheme for the data available
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>.  The platform has proved to be highly popular, with a multitude of schemes published on the website.  Schemes can be readily established and maintained via the web interface by expert curators without the requirement of extensive bioinformatics expertise.</p>
            <p>A vital, if apparently mundane, application is the provision of harmonised nomenclatures by means of the nomenclature server function.  The BIGSdb software has been built from its inception to enable the cross-referencing of multiple nomenclatures, isolate names, and gene and allele descriptors and this has proved to be of increasing importance.  As each WGS dataset, gene, isolate and allelic variant has a unique identifier within a given BIGSdb database, the PubMLST.org website can be used as a single point of reference to integrate and link information such as sequence read archive reference, GenBank accession number, PubMed id, different isolate names and designations, and different gene and allele identifiers that have been assigned by distinct annotation exercises.  As this is a cross-referencing system, each of these existing nomenclatures remain in place but can be readily related to each other in combined analyses.  The API provides a means whereby this information can be automatically exchanged, enhancing the capacity for data harmonisation.  </p>
            <p>Disease epidemiology, especially but by no means exclusively for human infections, has been and remains a major application for PubMLST.org and related databases.  PubMLST is the largest host of bacterial molecular typing databases and is extensively queried and analysed by different target audiences (
                <xref ref-type="fig" rid="f9">Figure 9</xref>)
                <sup>
                    <xref ref-type="bibr" rid="ref-40">40</xref>,
                    <xref ref-type="bibr" rid="ref-63">63</xref>,
                    <xref ref-type="bibr" rid="ref-67">67</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-78">78</xref>
                </sup>.  Some of the larger databases now host thousands of genomes and, with the increasing implementation of cgMLST and wgMLST schemes for various organisms, offer a wide range of possibilities beyond typing and epidemiological analyses.  Because the sequence variants of thousands of loci are indexed on deposition, this provides population annotation where not only the positions and identity of loci are recorded, but also their exact variant. Although the obvious utility of this is high-resolution typing, this is also the start of being able to link phenotypes to variant combinations of loci whose biochemical role has been established.</p>
            <fig fig-type="figure" id="f9" orientation="portrait" position="float">
                <label>Figure 9. </label>
                <caption>
                    <title>Relationships of community, tasks, methods and tools supported by PubMLST.</title>
                    <p>PubMLST links structured bacterial isolate datasets with whole genome sequence data and molecular typing nomenclature to provide a rich resource that can be exploited for a wide range of tasks including surveillance, vaccine development, evolutionary analysis and functional studies.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://wellcomeopenresearch-files.f1000.com/manuscripts/16155/6c3cae11-1e2a-40be-918e-9f99cffad97d_figure9.gif"/>
            </fig>
            <p>The composition of the majority of databases is contingent, as they are dependent on submission by users, who have different motivations for data submission.  A principle aim of most database curator teams is to catalogue the known diversity of the target organism or group of organisms.  Hence, some users submit data only to obtain new nomenclature designations; however, other users submit representative or complete sets of samples, for example those associated with a given study or publication. To accommodate this PubMLST hosts specific projects and datasets within the public databases that can be queried separately. An example of this is the longitudinal 15-year survey of the molecular epidemiology of clinical 
                <italic toggle="yes">Campylobacter</italic> isolates in Oxfordshire from 2003&#x2013;2018 (
                <ext-link ext-link-type="uri" xlink:href="https://pubmlst.org/campylobacter/projects/Oxfordshire_Human_Surveillance/">https://pubmlst.org/campylobacter/projects/Oxfordshire_Human_Surveillance/</ext-link>).  The 3,300 
                <italic toggle="yes">Campylobacter</italic> isolated from 2003&#x2013;2009 were characterised with MLST with data deposited on PubMLST in near real-time
                <sup>
                    <xref ref-type="bibr" rid="ref-79">79</xref>
                </sup>, but since 2010, WGS high-quality draft sequences have been deposited
                <sup>
                    <xref ref-type="bibr" rid="ref-69">69</xref>
                </sup>. A further example is the Meningitis Research Foundation Meningococcus Genome Library, where a WGS assembly for every 
                <italic toggle="yes">Neisseria meningitidis</italic> isolate collected from cases of invasive meningococcal disease in the UK from 2010 onwards is deposited
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>. This has proved an invaluable resource for meningococcal research and surveillance, in particular with regard to investigations of vaccine efficacy at a time when vaccine policy with respect to this organism has been changing
                <sup>
                    <xref ref-type="bibr" rid="ref-22">22</xref>,
                    <xref ref-type="bibr" rid="ref-40">40</xref>,
                    <xref ref-type="bibr" rid="ref-80">80</xref>,
                    <xref ref-type="bibr" rid="ref-81">81</xref>
                </sup>.</p>
            <p>The BIGSdb platform supports the combination of private and public data, with authorised users able to upload private data that can be shared with specified groups as required. This facilitates the use of PubMLST as a host for international multi-agency epidemiological surveillance projects, such as the European Meningococcal Epidemiology in Real Time (EMERT) database. This database was established in 2008, under the auspices of The European Meningococcal and Haemophilus Disease Society (EMGM) and allows European reference laboratories to upload partial or complete meningococcal isolate data to share among themselves, with the ability to update records as more information becomes available (
                <ext-link ext-link-type="uri" xlink:href="http://emgm.eu/emert/">http://emgm.eu/emert/</ext-link>). This stand-alone system that collects serological, and MLST and antigen sequence data linked to minimal metadata, is being migrated to PubMLST shortly, allowing collection and analysis of whole genome data. Access to EMERT data will be restricted to reference laboratories and the European Centre for Disease Prevention and Control (ECDC).  EMERT II will reside as a private project within the public database so that any submitted data can be analysed in the context of global datasets but these data will be made available subject to release policies, avoiding the current duplication of data submission to multiple databases. The system will integrate with the European Surveillance System (TESSy) database
                <sup>
                    <xref ref-type="bibr" rid="ref-82">82</xref>
                </sup> 
                <italic toggle="yes">via</italic> the BIGSdb RESTful API
                <sup>
                    <xref ref-type="bibr" rid="ref-27">27</xref>
                </sup>, facilitating automated extraction and reporting of molecular typing summaries from genome data.</p>
            <p>PubMLST also supports international surveillance and is part of a developing ecosystem of independent third-party tools that make molecular typing nomenclatures readily available (
                <xref ref-type="table" rid="T2">Table 2</xref>)
                <sup>
                    <xref ref-type="bibr" rid="ref-83">83</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-88">88</xref>
                </sup>. The emergence of cgMLST as a method of choice for long-term and international epidemiology by a number of international agencies
                <sup>
                    <xref ref-type="bibr" rid="ref-89">89</xref>,
                    <xref ref-type="bibr" rid="ref-90">90</xref>
                </sup> means that this role continues to be essential.  PubMLST is well positioned to continue serving nomenclatures for this effort, along with extensive collections of structured isolate record data for a wide range of pathogenic and other bacterial species. These structured datasets coupled with extensive genomic data, complex query tools and analysis methods provide a platform for investigating a wide range of biological questions.</p>
            <table-wrap id="T2" orientation="portrait" position="anchor">
                <label>Table 2. </label>
                <caption>
                    <title>Web services and software tools that directly utilize data hosted by PubMLST.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Service/software (ref.)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Tool type</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">URL</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">MLST-CGE
                                <sup>
                                    <xref ref-type="bibr" rid="ref-85">85</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Web service</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://cge.cbs.dtu.dk/services/">https://cge.cbs.dtu.dk/services/</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">GoSeqIt</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Web service</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://www.goseqit.com/">https://www.goseqit.com/</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Enterobase
                                <sup>
                                    <xref ref-type="bibr" rid="ref-44">44</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Web service</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://enterobase.warwick.ac.uk/">https://enterobase.warwick.ac.uk/</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">MLSTcheck
                                <sup>
                                    <xref ref-type="bibr" rid="ref-86">86</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Open source</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://github.com/sanger-pathogens/mlst_check">https://github.com/sanger-pathogens/mlst_check</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">mlst</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Open source</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://github.com/tseemann/mlst">https://github.com/tseemann/mlst</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">SRST2
                                <sup>
                                    <xref ref-type="bibr" rid="ref-88">88</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Open source</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://github.com/katholt/srst2">https://github.com/katholt/srst2</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">stringMLST
                                <sup>
                                    <xref ref-type="bibr" rid="ref-87">87</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Open source</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://github.com/jordanlab/stringMLST">https://github.com/jordanlab/stringMLST</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">MOST
                                <sup>
                                    <xref ref-type="bibr" rid="ref-84">84</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Open source</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://github.com/phe-bioinformatics/MOST">https://github.com/phe-bioinformatics/MOST</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">MLSTar
                                <sup>
                                    <xref ref-type="bibr" rid="ref-83">83</xref>
                                </sup>
                            </td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Open source</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://github.com/iferres/MLSTar">https://github.com/iferres/MLSTar</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Krocus</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Open source</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://github.com/andrewjpage/krocus">https://github.com/andrewjpage/krocus</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Bionumerics</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Commercial</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="http://www.applied-maths.com/bionumerics">http://www.applied-maths.com/bionumerics</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">SeqSphere+</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Commercial</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://www.ridom.de/seqsphere/">https://www.ridom.de/seqsphere/</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">CLC</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Commercial</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://www.qiagenbioinformatics.com/">https://www.qiagenbioinformatics.com/</ext-link>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Smartgene</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">Commercial</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">
                                <ext-link ext-link-type="uri" xlink:href="https://www.smartgene.com/">https://www.smartgene.com/</ext-link>
                            </td>
                        </tr>
                    </tbody>
                </table>
            </table-wrap>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusions</title>
            <p>The PubMLST databases and BIGSdb software originated as part of the development of the MLST approach to the characterisation of bacterial strains in 1998
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup>.  Over the succeeding twenty years, the capacity to generate data and to interpret it greatly increased, but the fundamental requirements have remained the same, with open-access, curated, and interpreted data at the heart of the endeavour
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup>.  The population genomic framework, with its foundation in evolutionary biology
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>
                </sup>, provided an effective and powerful intellectual foundation for the structuring of the databases.  Importantly, the approach proved to be highly scalable, enabling a transition from twelve genes in just over 100 isolates in the first MLST paper
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup> to hundreds of thousands of isolates, characterised at thousands of loci
                <sup>
                    <xref ref-type="bibr" rid="ref-44">44</xref>
                </sup>.  Despite these advances, however, the power of this approach is yet to be fully realised.   The very rich data available at the time of writing has only been partially explored, especially with respect to population annotation, the cataloguing of variation across the genome with reference to biological function.  While advances in analysis approaches and data integration, especially those in artificial intelligence and machine learning techniques
                <sup>
                    <xref ref-type="bibr" rid="ref-91">91</xref>
                </sup>, are likely to substantially aid the realisation of the potential of these data, we contend that engagement of individuals expert in particular organisms or systems will remain the most important contribution to the exploration of these datasets, as will the continued integration of diverse data sources by automated means.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>All data underlying the results are available as part of the article and no additional source data are required.</p>
        </sec>
        <sec>
            <title>Software availability</title>
            <p>The source code for BIGSdb, including the RESTful API application, is available from: 
                <ext-link ext-link-type="uri" xlink:href="https://github.com/kjolley/BIGSdb">https://github.com/kjolley/BIGSdb</ext-link>.</p>
            <p>Archived source code at time of publication: 
                <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.1420943">https://doi.org/10.5281/zenodo.1420943</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-46">46</xref>
                </sup>.</p>
            <p>License: 
                <ext-link ext-link-type="uri" xlink:href="https://www.gnu.org/licenses/gpl-3.0.en.html">GNU General Public License v3.0</ext-link>.</p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgements</title>
            <p>The authors would like to thank the hundreds of database curators and data submitters who have helped make the PubMLST databases such a rich resource of structured data.</p>
        </ack>
        <sec id="SM1" sec-type="supplementary-material">
            <title>Supplementary material</title>
            <p id="SF1">
                <bold>Supplementary File 1. Using the API to query PubMLST databases with a local genome assembly file.</bold>
            </p>
            <p>Examples are provided to demonstrate how to query a local contigs file against an MLST scheme hosted on PubMLST and how to interact with the rMLST species identification tool via the command line.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://wellcomeopenresearch.s3.amazonaws.com/supplementary/14826/70e2acdf-a11d-4f76-8053-afa160b0270c_Supplementary_S1.docx">Click here to access the data</ext-link>.</p>
        </sec>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kyrpides</surname>
                            <given-names>NC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Eloe-Fadrosh</surname>
                            <given-names>EA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ivanova</surname>
                            <given-names>NN</given-names>
                        </name>
</person-group>:
                    <article-title>Microbiome Data Science: Understanding Our Microbial Planet.</article-title>
                    <source>

                        <italic toggle="yes">Trends Microbiol.</italic>
</source>
                    <year>2016</year>;<volume>24</volume>(<issue>6</issue>):<fpage>425</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">27197692</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.tim.2016.02.011</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kerasidou</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Sharing the Knowledge: Sharing Aggregate Genomic Findings with Research Participants in Developing Countries.</article-title>
                    <source>

                        <italic toggle="yes">Dev World Bioeth.</italic>
</source>
                    <year>2015</year>;<volume>15</volume>(<issue>3</issue>):<fpage>267</fpage>&#x2013;<lpage>74</lpage>.
                    <pub-id pub-id-type="pmid">25292263</pub-id>
                    <pub-id pub-id-type="doi">10.1111/dewb.12071</pub-id>
                    <pub-id pub-id-type="pmcid">4632193</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chassang</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>The impact of the EU general data protection regulation on scientific research.</article-title>
                    <source>

                        <italic toggle="yes">Ecancermedicalscience.</italic>
</source>
                    <year>2017</year>;<volume>11</volume>:<fpage>709</fpage>.
                    <pub-id pub-id-type="pmid">28144283</pub-id>
                    <pub-id pub-id-type="doi">10.3332/ecancer.2017.709</pub-id>
                    <pub-id pub-id-type="pmcid">5243137</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>O'Brien</surname>
                            <given-names>SJ</given-names>
                        </name>
</person-group>:
                    <article-title>Stewardship of human biospecimens, DNA, genotype, and clinical data in the GWAS era.</article-title>
                    <source>

                        <italic toggle="yes">Annu Rev Genomics Hum Genet.</italic>
</source>
                    <year>2009</year>;<volume>10</volume>:<fpage>193</fpage>&#x2013;<lpage>209</lpage>.
                    <pub-id pub-id-type="pmid">19630558</pub-id>
                    <pub-id pub-id-type="doi">10.1146/annurev-genom-082908-150133</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>
</person-group>:
                    <article-title>Using multilocus sequence typing to study bacterial variation: prospects in the genomic era. </article-title>
                    <source>

                        <italic toggle="yes">Future Microbiol.</italic>
</source>
                    <year>2014</year>;<volume>9</volume>(<issue>5</issue>):<fpage>623</fpage>&#x2013;<lpage>30</lpage>.
                    <pub-id pub-id-type="pmid">24957089</pub-id>
                    <pub-id pub-id-type="doi">10.2217/fmb.14.24</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>
</person-group>:
                    <article-title>BIGSdb: Scalable analysis of bacterial genome variation at the population level.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2010</year>;<volume>11</volume>(<issue>1</issue>):<fpage>595</fpage>.
                    <pub-id pub-id-type="pmid">21143983</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-11-595</pub-id>
                    <pub-id pub-id-type="pmcid">3004885</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hill</surname>
                            <given-names>DM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lucidarme</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gray</surname>
                            <given-names>SJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genomic epidemiology of age-associated meningococcal lineages in national surveillance: an observational cohort study.</article-title>
                    <source>

                        <italic toggle="yes">Lancet Infect Dis.</italic>
</source>
                    <year>2015</year>;<volume>15</volume>(<issue>12</issue>):<fpage>1420</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="pmid">26515523</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S1473-3099(15)00267-4</pub-id>
                    <pub-id pub-id-type="pmcid">4655307</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>
</person-group>:
                    <article-title>Automated extraction of typing information for bacterial pathogens from whole genome sequence data: 
                        <italic toggle="yes">Neisseria meningitidis</italic> as an exemplar.</article-title>
                    <source>

                        <italic toggle="yes">Euro Surveill.</italic>
</source>
                    <year>2013</year>;<volume>18</volume>(<issue>4</issue>):<fpage>20379</fpage>.
                    <pub-id pub-id-type="pmid">23369391</pub-id>
                    <pub-id pub-id-type="doi">10.2807/ese.18.04.20379-en</pub-id>
                    <pub-id pub-id-type="pmcid">3977036</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Luikart</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>England</surname>
                            <given-names>PR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tallmon</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The power and promise of population genomics: from genotyping to genome typing.</article-title>
                    <source>

                        <italic toggle="yes">Nat Rev Genet.</italic>
</source>
                    <year>2003</year>;<volume>4</volume>(<issue>12</issue>):<fpage>981</fpage>&#x2013;<lpage>94</lpage>.
                    <pub-id pub-id-type="pmid">14631358</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrg1226</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Vasemagi</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Primmer</surname>
                            <given-names>CR</given-names>
                        </name>
</person-group>:
                    <article-title>Challenges for identifying functionally important genetic variation: the promise of combining complementary research strategies.</article-title>
                    <source>

                        <italic toggle="yes">Mol Ecol.</italic>
</source>
                    <year>2005</year>;<volume>14</volume>(<issue>12</issue>):<fpage>3623</fpage>&#x2013;<lpage>42</lpage>.
                    <pub-id pub-id-type="pmid">16202085</pub-id>
                    <pub-id pub-id-type="doi">10.1111/j.1365-294X.2005.02690.x</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Earle</surname>
                            <given-names>SG</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>CH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Charlesworth</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Identifying lineage effects when controlling for population structure improves power in bacterial association studies.</article-title>
                    <source>

                        <italic toggle="yes">Nat Microbiol.</italic>
</source>
                    <year>2016</year>;<volume>1</volume>: 16041.
                    <pub-id pub-id-type="pmid">27572646</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmicrobiol.2016.41</pub-id>
                    <pub-id pub-id-type="pmcid">5049680</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bygraves</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Feil</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms.</article-title>
                    <source>

                        <italic toggle="yes">Proc Natl Acad Sci U S A.</italic>
</source>
                    <year>1998</year>;<volume>95</volume>(<issue>6</issue>):<fpage>3140</fpage>&#x2013;<lpage>5</lpage>.
                    <pub-id pub-id-type="pmid">9501229</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.95.6.3140</pub-id>
                    <pub-id pub-id-type="pmcid">19708</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chan</surname>
                            <given-names>MS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Spratt</surname>
                            <given-names>BG</given-names>
                        </name>
</person-group>:
                    <article-title>Database-driven multi locus sequence typing (MLST) of bacterial pathogens.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2001</year>;<volume>17</volume>(<issue>11</issue>):<fpage>1077</fpage>&#x2013;<lpage>83</lpage>.
                    <pub-id pub-id-type="pmid">11724739</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/17.11.1077</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chan</surname>
                            <given-names>MS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>
</person-group>:
                    <article-title>mlstdbNet - distributed multi-locus sequence typing (MLST) databases.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2004</year>;<volume>5</volume>(<issue>1</issue>):<fpage>86</fpage>.
                    <pub-id pub-id-type="pmid">15230973</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-5-86</pub-id>
                    <pub-id pub-id-type="pmcid">459212</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>
</person-group>:
                    <article-title>AgdbNet - antigen sequence database software for bacterial typing.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2006</year>;<volume>7</volume>:<fpage>314</fpage>.
                    <pub-id pub-id-type="pmid">16790057</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-7-314</pub-id>
                    <pub-id pub-id-type="pmcid">1543660</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jansen van Rensburg</surname>
                            <given-names>MJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bray</surname>
                            <given-names>JE</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MLST revisited: the gene-by-gene approach to bacterial genomics.</article-title>
                    <source>

                        <italic toggle="yes">Nat Rev Microbiol.</italic>
</source>
                    <year>2013</year>;<volume>11</volume>(<issue>10</issue>):<fpage>728</fpage>&#x2013;<lpage>36</lpage>.
                    <pub-id pub-id-type="pmid">23979428</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrmicro3093</pub-id>
                    <pub-id pub-id-type="pmcid">3980634</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Loman</surname>
                            <given-names>NJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Constantinidou</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chan</surname>
                            <given-names>JZ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>High-throughput bacterial genome sequencing: an embarrassment of choice, a world of opportunity.</article-title>
                    <source>

                        <italic toggle="yes">Nat Rev Microbiol.</italic>
</source>
                    <year>2012</year>;<volume>10</volume>(<issue>9</issue>):<fpage>599</fpage>&#x2013;<lpage>606</lpage>.
                    <pub-id pub-id-type="pmid">22864262</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrmicro2850</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bliss</surname>
                            <given-names>CM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bennett</surname>
                            <given-names>JS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Ribosomal multilocus sequence typing: universal characterization of bacteria from domain to strain.</article-title>
                    <source>

                        <italic toggle="yes">Microbiology.</italic>
</source>
                    <year>2012</year>;<volume>158</volume>(<issue>Pt 4</issue>):<fpage>1005</fpage>&#x2013;<lpage>15</lpage>.
                    <pub-id pub-id-type="pmid">22282518</pub-id>
                    <pub-id pub-id-type="doi">10.1099/mic.0.055459-0</pub-id>
                    <pub-id pub-id-type="pmcid">3492749</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Taha</surname>
                            <given-names>MK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hedberg</surname>
                            <given-names>ST</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Szatanik</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multicenter study for defining the breakpoint for rifampin resistance in 
                        <italic toggle="yes">Neisseria meningitidis</italic> by 
                        <italic toggle="yes">rpoB</italic> sequencing.</article-title>
                    <source>

                        <italic toggle="yes">Antimicrob Agents Chemother.</italic>
</source>
                    <year>2010</year>;<volume>54</volume>(<issue>9</issue>):<fpage>3651</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="pmid">20606072</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AAC.00315-10</pub-id>
                    <pub-id pub-id-type="pmcid">2934967</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Martino</surname>
                            <given-names>ME</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fasolato</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Montemurro</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Determination of microbial diversity of 
                        <italic toggle="yes">Aeromonas</italic> strains on the basis of multilocus sequence typing, phenotype, and presence of putative virulence genes.</article-title>
                    <source>

                        <italic toggle="yes">Appl Environ Microbiol.</italic>
</source>
                    <year>2011</year>;<volume>77</volume>(<issue>14</issue>):<fpage>4986</fpage>&#x2013;<lpage>5000</lpage>.
                    <pub-id pub-id-type="pmid">21642403</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AEM.00708-11</pub-id>
                    <pub-id pub-id-type="pmcid">3147379</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bla&#x017e;kov&#x00e1;</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Javurkova</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vlach</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Diversity of O Antigens within the Genus 
                        <italic toggle="yes">Cronobacter</italic>: from Disorder to Order.</article-title>
                    <source>

                        <italic toggle="yes">Appl Environ Microbiol.</italic>
</source>
                    <year>2015</year>;<volume>81</volume>(<issue>16</issue>):<fpage>5574</fpage>&#x2013;<lpage>82</lpage>.
                    <pub-id pub-id-type="pmid">26070668</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AEM.00277-15</pub-id>
                    <pub-id pub-id-type="pmcid">4510170</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Brehony</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hill</surname>
                            <given-names>DM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lucidarme</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Meningococcal vaccine antigen diversity in global databases.</article-title>
                    <source>

                        <italic toggle="yes">Euro Surveill.</italic>
</source>
                    <year>2015</year>;<volume>20</volume>(<issue>49</issue>): pii=30084.
                    <pub-id pub-id-type="pmid">26676305</pub-id>
                    <pub-id pub-id-type="doi">10.2807/1560-7917.ES.2015.20.49.30084</pub-id>
                    <pub-id pub-id-type="pmcid">4820634</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Harrison</surname>
                            <given-names>OB</given-names>
                        </name>
</person-group>:
                    <article-title>Population and Functional Genomics of 
                        <italic toggle="yes">Neisseria</italic> Revealed with Gene-by-Gene Approaches.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Microbiol.</italic>
</source>
                    <year>2016</year>;<volume>54</volume>(<issue>8</issue>):<fpage>1949</fpage>&#x2013;<lpage>55</lpage>.
                    <pub-id pub-id-type="pmid">27098959</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.00301-16</pub-id>
                    <pub-id pub-id-type="pmcid">4963508</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Stefanelli</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Neri</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tanabe</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Typing and surface charges of the variable loop regions of PorB from 
                        <italic toggle="yes">Neisseria meningitidis</italic>.</article-title>
                    <source>

                        <italic toggle="yes">IUBMB Life.</italic>
</source>
                    <year>2016</year>;<volume>68</volume>(<issue>6</issue>):<fpage>488</fpage>&#x2013;<lpage>95</lpage>.
                    <pub-id pub-id-type="pmid">27156582</pub-id>
                    <pub-id pub-id-type="doi">10.1002/iub.1508</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Versteeg</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bruisten</surname>
                            <given-names>SM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pannekoek</surname>
                            <given-names>Y</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genomic analyses of the 
                        <italic toggle="yes">Chlamydia trachomatis</italic> core genome show an association between chromosomal genome, plasmid type and disease.</article-title>
                    <source>

                        <italic toggle="yes">BMC Genomics.</italic>
</source>
                    <year>2018</year>;<volume>19</volume>(<issue>1</issue>):<fpage>130</fpage>.
                    <pub-id pub-id-type="pmid">29426279</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12864-018-4522-3</pub-id>
                    <pub-id pub-id-type="pmcid">5810182</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Borud</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>B&#x00e5;rnes</surname>
                            <given-names>GK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Brynildsrud</surname>
                            <given-names>OB</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genotypic and Phenotypic Characterization of the O-Linked Protein Glycosylation System Reveals High Glycan Diversity in Paired Meningococcal Carriage Isolates.</article-title>
                    <source>

                        <italic toggle="yes">J Bacteriol.</italic>
</source>
                    <year>2018</year>;<volume>200</volume>(<issue>16</issue>): pii: e00794-17.
                    <pub-id pub-id-type="pmid">29555702</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JB.00794-17</pub-id>
                    <pub-id pub-id-type="pmcid">6060354</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bray</surname>
                            <given-names>JE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MCJ</given-names>
                        </name>
</person-group>:
                    <article-title>A RESTful application programming interface for the PubMLST molecular typing and genome databases. </article-title>
                    <source>

                        <italic toggle="yes">Database (Oxford).</italic>
</source>
                    <year>2017</year>;<volume>2017</volume>:<fpage>bax060</fpage>.
                    <pub-id pub-id-type="pmid">29220452</pub-id>
                    <pub-id pub-id-type="doi">10.1093/database/bax060</pub-id>
                    <pub-id pub-id-type="pmcid">5550937</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Price</surname>
                            <given-names>EP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>MacHunter</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Spratt</surname>
                            <given-names>BG</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Improved multilocus sequence typing of Burkholderia pseudomallei and closely related species.</article-title>
                    <source>

                        <italic toggle="yes">J Med Microbiol.</italic>
</source>
                    <year>2016</year>;<volume>65</volume>(<issue>9</issue>):<fpage>992</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">27412128</pub-id>
                    <pub-id pub-id-type="doi">10.1099/jmm.0.000312</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Panda</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jena</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sharma</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Identification of Novel Sequence Types among 
                        <italic toggle="yes">Staphylococcus haemolyticus</italic> Isolated from Variety of Infections in India.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2016</year>;<volume>11</volume>(<issue>11</issue>):<fpage>e0166193</fpage>.
                    <pub-id pub-id-type="pmid">27824930</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0166193</pub-id>
                    <pub-id pub-id-type="pmcid">5100990</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Whatmore</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Koylass</surname>
                            <given-names>MS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Muchowski</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Extended Multilocus Sequence Analysis to Describe the Global Population Structure of the Genus 
                        <italic toggle="yes">Brucella</italic>: Phylogeography and Relationship to Biovars.</article-title>
                    <source>

                        <italic toggle="yes">Front Microbiol.</italic>
</source>
                    <year>2016</year>;<volume>7</volume>:<fpage>2049</fpage>.
                    <pub-id pub-id-type="pmid">28066370</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fmicb.2016.02049</pub-id>
                    <pub-id pub-id-type="pmcid">5174110</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gonzalez-Escalona</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Reed</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Defining a core genome multilocus sequence typing scheme for the global epidemiology of 
                        <italic toggle="yes">Vibrio parahaemolyticus</italic>.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Microbiol.</italic>
</source>
                    <year>2017</year>;<volume>55</volume>(<issue>6</issue>):<fpage>1682</fpage>&#x2013;<lpage>97</lpage>.
                    <pub-id pub-id-type="pmid">28330888</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.00227-17</pub-id>
                    <pub-id pub-id-type="pmcid">5442524</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-32">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Karah</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hall</surname>
                            <given-names>RM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Database for the 
                        <italic toggle="yes">ampC</italic> alleles in 
                        <italic toggle="yes">Acinetobacter baumannii</italic>.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2017</year>;<volume>12</volume>(<issue>5</issue>):<fpage>e0176695</fpage>.
                    <pub-id pub-id-type="pmid">28459877</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0176695</pub-id>
                    <pub-id pub-id-type="pmcid">5411055</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-33">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Joseph</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Forsythe</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Multilocus Sequence Typing (MLST) for 
                        <italic toggle="yes">Cronobacter</italic> spp.</article-title>
                    <source>

                        <italic toggle="yes">Methods Mol Biol.</italic>
</source>
                    <year>2017</year>;<volume>1616</volume>:<fpage>241</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="pmid">28600774</pub-id>
                    <pub-id pub-id-type="doi">10.1007/978-1-4939-7037-7_16</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-34">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Overesch</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kuhnert</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Persistence of 
                        <italic toggle="yes">Mycoplasma hyopneumoniae</italic> sequence types in spite of a control program for enzootic pneumonia in pigs.</article-title>
                    <source>

                        <italic toggle="yes">Prev Vet Med.</italic>
</source>
                    <year>2017</year>;<volume>145</volume>:<fpage>67</fpage>&#x2013;<lpage>72</lpage>.
                    <pub-id pub-id-type="pmid">28903876</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.prevetmed.2017.06.007</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-35">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Duquesne</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Houssin</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>S&#x00e9;vin</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Development of a multilocus sequence typing scheme for 
                        <italic toggle="yes">Rhodococcus equi</italic>.</article-title>
                    <source>

                        <italic toggle="yes">Vet Microbiol.</italic>
</source>
                    <year>2017</year>;<volume>210</volume>:<fpage>64</fpage>&#x2013;<lpage>70</lpage>.
                    <pub-id pub-id-type="pmid">29103698</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.vetmic.2017.08.010</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-36">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Blanchard</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MCJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The Applied Development of a Tiered Multilocus Sequence Typing (MLST) Scheme for 
                        <italic toggle="yes">Dichelobacter nodosus</italic>.</article-title>
                    <source>

                        <italic toggle="yes">Front Microbiol.</italic>
</source>
                    <year>2018</year>;<volume>9</volume>:<fpage>551</fpage>.
                    <pub-id pub-id-type="pmid">29628922</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fmicb.2018.00551</pub-id>
                    <pub-id pub-id-type="pmcid">5876313</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-37">
                <label>37</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Buj&#x00e1;n</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Balboa</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>L Romalde</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Population genetic and evolution analysis of controversial genus 
                        <italic toggle="yes">Edwardsiella</italic> by multilocus sequence typing.</article-title>
                    <source>

                        <italic toggle="yes">Mol Phylogenet Evol.</italic>
</source>
                    <year>2018</year>;<volume>127</volume>:<fpage>513</fpage>&#x2013;<lpage>521</lpage>.
                    <pub-id pub-id-type="pmid">29751087</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.ympev.2018.05.006</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-38">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bratcher</surname>
                            <given-names>HB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Corton</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A gene-by-gene population genomics platform: 
                        <italic toggle="yes">de novo</italic> assembly, annotation and genealogical analysis of 108 representative 
                        <italic toggle="yes">Neisseria meningitidis</italic> genomes.</article-title>
                    <source>

                        <italic toggle="yes">BMC Genomics.</italic>
</source>
                    <year>2014</year>;<volume>15</volume>(<issue>1</issue>):<fpage>1138</fpage>.
                    <pub-id pub-id-type="pmid">25523208</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2164-15-1138</pub-id>
                    <pub-id pub-id-type="pmcid">4377854</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-39">
                <label>39</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cody</surname>
                            <given-names>AJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bray</surname>
                            <given-names>JE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>KA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Core Genome Multilocus Sequence Typing Scheme for Stable, Comparative Analyses of 
                        <italic toggle="yes">Campylobacter jejuni</italic> and 
                        <italic toggle="yes">C. coli</italic> Human Disease Isolates.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Microbiol.</italic>
</source>
                    <year>2017</year>;<volume>55</volume>(<issue>7</issue>):<fpage>2086</fpage>&#x2013;<lpage>97</lpage>.
                    <pub-id pub-id-type="pmid">28446571</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.00080-17</pub-id>
                    <pub-id pub-id-type="pmcid">5483910</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-40">
                <label>40</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Brehony</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rodrigues</surname>
                            <given-names>CMC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Borrow</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Distribution of Bexsero&#x00ae; Antigen Sequence Types (BASTs) in invasive meningococcal disease isolates: Implications for immunisation.</article-title>
                    <source>

                        <italic toggle="yes">Vaccine.</italic>
</source>
                    <year>2016</year>;<volume>34</volume>(<issue>39</issue>):<fpage>4690</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">27521232</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.vaccine.2016.08.015</pub-id>
                    <pub-id pub-id-type="pmcid">5012890</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-41">
                <label>41</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hong</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thulin Hedberg</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Abad</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Target gene sequencing to define the susceptibility of 
                        <italic toggle="yes">Neisseria meningitidis</italic> to ciprofloxacin.</article-title>
                    <source>

                        <italic toggle="yes">Antimicrob Agents Chemother.</italic>
</source>
                    <year>2013</year>;<volume>57</volume>(<issue>4</issue>):<fpage>1961</fpage>&#x2013;<lpage>4</lpage>.
                    <pub-id pub-id-type="pmid">23357770</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AAC.02184-12</pub-id>
                    <pub-id pub-id-type="pmcid">3623314</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-42">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Taha</surname>
                            <given-names>MK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>V&#x00e1;zquez</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hong</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Target gene sequencing to characterize the penicillin G susceptibility of 
                        <italic toggle="yes">Neisseria meningitidis</italic>.</article-title>
                    <source>

                        <italic toggle="yes">Antimicrob Agents Chemother.</italic>
</source>
                    <year>2007</year>;<volume>51</volume>(<issue>8</issue>):<fpage>2784</fpage>&#x2013;<lpage>92</lpage>.
                    <pub-id pub-id-type="pmid">17517841</pub-id>
                    <pub-id pub-id-type="doi">10.1128/AAC.00412-07</pub-id>
                    <pub-id pub-id-type="pmcid">1932518</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-43">
                <label>43</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Aanensen</surname>
                            <given-names>DM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Spratt</surname>
                            <given-names>BG</given-names>
                        </name>
</person-group>:
                    <article-title>The multilocus sequence typing network: mlst.net.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2005</year>;<volume>33</volume>(<issue>Web Server issue</issue>):<fpage>W728</fpage>&#x2013;<lpage>33</lpage>.
                    <pub-id pub-id-type="pmid">15980573</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gki415</pub-id>
                    <pub-id pub-id-type="pmcid">1160176</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-44">
                <label>44</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Alikhan</surname>
                            <given-names>NF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhou</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sergeant</surname>
                            <given-names>MJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A genomic overview of the population structure of 
                        <italic toggle="yes">Salmonella</italic>.</article-title>
                    <source>

                        <italic toggle="yes">PLoS Genet.</italic>
</source>
                    <year>2018</year>;<volume>14</volume>(<issue>4</issue>):<fpage>e1007261</fpage>.
                    <pub-id pub-id-type="pmid">29621240</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pgen.1007261</pub-id>
                    <pub-id pub-id-type="pmcid">5886390</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-45">
                <label>45</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bletz</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mellmann</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rothg&#x00e4;nger</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Ensuring backwards compatibility: traditional genotyping efforts in the era of whole genome sequencing.</article-title>
                    <source>

                        <italic toggle="yes">Clin Microbiol Infect.</italic>
</source>
                    <year>2015</year>;<volume>21</volume>(<issue>4</issue>):<fpage>347.e1</fpage>&#x2013;<lpage>4</lpage>.
                    <pub-id pub-id-type="pmid">25658529</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.cmi.2014.11.005</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-46">
                <label>46</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jolley</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Quevillon</surname>
                            <given-names>E</given-names>
                        </name>
</person-group>:
                    <article-title>kjolley/BIGSdb: BIGSdb version 1.19.1 (Version v_1.19.1).</article-title>
                    <source>

                        <italic toggle="yes">Zenodo.</italic>
</source>
                    <year>2018</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.doi.org/10.5281/zenodo.1420943">http://www.doi.org/10.5281/zenodo.1420943</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-47">
                <label>47</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Seemann</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>Prokka: rapid prokaryotic genome annotation.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2014</year>;<volume>30</volume>(<issue>14</issue>):<fpage>2068</fpage>&#x2013;<lpage>9</lpage>.
                    <pub-id pub-id-type="pmid">24642063</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btu153</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-48">
                <label>48</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Markowitz</surname>
                            <given-names>VM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chen</surname>
                            <given-names>IM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Palaniappan</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The integrated microbial genomes system: an expanding comparative analysis resource.</article-title>
                    <source>

                        <italic toggle="yes">Nucleic Acids Res.</italic>
</source>
                    <year>2010</year>;<volume>38</volume>(<issue>Database issue</issue>):<fpage>D382</fpage>&#x2013;<lpage>90</lpage>.
                    <pub-id pub-id-type="pmid">19864254</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkp887</pub-id>
                    <pub-id pub-id-type="pmcid">2808961</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-49">
                <label>49</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Medini</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Donati</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tettelin</surname>
                            <given-names>H</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The microbial pan-genome.</article-title>
                    <source>

                        <italic toggle="yes">Curr Opin Genet Dev.</italic>
</source>
                    <year>2005</year>;<volume>15</volume>(<issue>6</issue>):<fpage>589</fpage>&#x2013;<lpage>94</lpage>.
                    <pub-id pub-id-type="pmid">16185861</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.gde.2005.09.006</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-50">
                <label>50</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Altschul</surname>
                            <given-names>SF</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gish</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Miller</surname>
                            <given-names>W</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Basic local alignment search tool.</article-title>
                    <source>

                        <italic toggle="yes">J Mol Biol.</italic>
</source>
                    <year>1990</year>;<volume>215</volume>(<issue>3</issue>):<fpage>403</fpage>&#x2013;<lpage>10</lpage>.
                    <pub-id pub-id-type="pmid">2231712</pub-id>
                    <pub-id pub-id-type="doi">10.1016/S0022-2836(05)80360-2</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-51">
                <label>51</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MC</given-names>
                        </name>
</person-group>:
                    <article-title>Multilocus Sequence Typing of Bacteria.</article-title>
                    <source>

                        <italic toggle="yes">Annu Rev Microbiol.</italic>
</source>
                    <year>2006</year>;<volume>60</volume>:<fpage>561</fpage>&#x2013;<lpage>88</lpage>.
                    <pub-id pub-id-type="pmid">16774461</pub-id>
                    <pub-id pub-id-type="doi">10.1146/annurev.micro.59.030804.121325</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-52">
                <label>52</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Vegge</surname>
                            <given-names>CS</given-names>
                        </name> 

                        <name name-style="western">
                            <surname>Jansen van Rensburg</surname>
                            <given-names>MJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rasmussen</surname>
                            <given-names>JJ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Glucose Metabolism via the Entner-Doudoroff Pathway in 
                        <italic toggle="yes">Campylobacter</italic>: A Rare Trait that Enhances Survival and Promotes Biofilm Formation in Some Isolates.</article-title>
                    <source>

                        <italic toggle="yes">Front Microbiol.</italic>
</source>
                    <year>2016</year>;<volume>7</volume>:<fpage>1877</fpage>.
                    <pub-id pub-id-type="pmid">27920773</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fmicb.2016.01877</pub-id>
                    <pub-id pub-id-type="pmcid">5118423</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-53">
                <label>53</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hey</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>The structure of genealogies and the distribution of fixed differences between DNA sequence samples from natural populations.</article-title>
                    <source>

                        <italic toggle="yes">Genetics.</italic>
</source>
                    <year>1991</year>;<volume>128</volume>(<issue>4</issue>):<fpage>831</fpage>&#x2013;<lpage>40</lpage>.
                    <pub-id pub-id-type="pmid">1916247</pub-id>
                    <pub-id pub-id-type="pmcid">1204556</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-54">
                <label>54</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Bennett</surname>
                            <given-names>JS</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Bentley</surname>
                            <given-names>SD</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Vernikos</surname>
                            <given-names>GS</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Independent evolution of the core and accessory gene sets in the genus 
                        <italic toggle="yes">Neisseria</italic>: insights gained from the genome of 
                        <italic toggle="yes">Neisseria lactamica</italic> isolate 020-06.</article-title>
                    <source>
				
                        <italic toggle="yes">BMC Genomics.</italic>
			</source>
                    <year>2010</year>;<volume>11</volume>:<fpage>652</fpage>.
                    <pub-id pub-id-type="pmid">21092259</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2164-11-652</pub-id>
                    <pub-id pub-id-type="pmcid">3091772</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-55">
                <label>55</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Pearce</surname>
                            <given-names>ME</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Alikhan</surname>
                            <given-names>NF</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Dallman</surname>
                            <given-names>TJ</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Comparative analysis of core genome MLST and SNP typing within a European 
                        <italic toggle="yes">Salmonella</italic> serovar Enteritidis outbreak.</article-title>
                    <source>
				
                        <italic toggle="yes">Int J Food Microbiol.</italic>
			</source>
                    <year>2018</year>;<volume>274</volume>:<fpage>1</fpage>&#x2013;<lpage>11</lpage>.
                    <pub-id pub-id-type="pmid">29574242</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.ijfoodmicro.2018.02.023</pub-id>
                    <pub-id pub-id-type="pmcid">5899760</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-56">
                <label>56</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Bryant</surname>
                            <given-names>D</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Moulton</surname>
                            <given-names>V</given-names>
                        </name>
			</person-group>:
                    <article-title>Neighbor-net: an agglomerative method for the construction of phylogenetic networks.</article-title>
                    <source>
				
                        <italic toggle="yes">Mol Biol Evol.</italic>
			</source>
                    <year>2004</year>;<volume>21</volume>(<issue>2</issue>):<fpage>255</fpage>&#x2013;<lpage>65</lpage>.
                    <pub-id pub-id-type="pmid">14660700</pub-id>
                    <pub-id pub-id-type="doi">10.1093/molbev/msh018</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-57">
                <label>57</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Huson</surname>
                            <given-names>DH</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Bryant</surname>
                            <given-names>D</given-names>
                        </name>
			</person-group>:
                    <article-title>Application of phylogenetic networks in evolutionary studies.</article-title>
                    <source>
				
                        <italic toggle="yes">Mol Biol Evol.</italic>
			</source>
                    <year>2006</year>;<volume>23</volume>(<issue>2</issue>):<fpage>254</fpage>&#x2013;<lpage>67</lpage>.
                    <pub-id pub-id-type="pmid">16221896</pub-id>
                    <pub-id pub-id-type="doi">10.1093/molbev/msj030</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-58">
                <label>58</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Lees</surname>
                            <given-names>JA</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Kendall</surname>
                            <given-names>M</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Parkhill</surname>
                            <given-names>J</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Evaluation of phylogenetic reconstruction methods using bacterial whole genomes: a simulation based study [version 2;  referees: 3 approved].</article-title>
                    <source>
				
                        <italic toggle="yes">Wellcome Open Res.</italic>
			</source>
                    <year>2018</year>;<volume>3</volume>:<fpage>33</fpage>.
                    <pub-id pub-id-type="pmid">29774245</pub-id>
                    <pub-id pub-id-type="doi">10.12688/wellcomeopenres.14265.2</pub-id>
                    <pub-id pub-id-type="pmcid">5930550</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-59">
                <label>59</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Zhou</surname>
                            <given-names>Z</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Alikhan</surname>
                            <given-names>NF</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Sergeant</surname>
                            <given-names>MJ</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens.</article-title>
                    <source>
				
                        <italic toggle="yes">Genome Res.</italic>
			</source>
                    <year>2018</year>;<volume>28</volume>(<issue>9</issue>):<fpage>1395</fpage>&#x2013;<lpage>1404</lpage>.
                    <pub-id pub-id-type="pmid">30049790</pub-id>
                    <pub-id pub-id-type="doi">10.1101/gr.232397.117</pub-id>
                    <pub-id pub-id-type="pmcid">6120633</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-60">
                <label>60</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Ribeiro-Goncalves</surname>
                            <given-names>B</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Francisco</surname>
                            <given-names>AP</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Vaz</surname>
                            <given-names>C</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>PHYLOViZ Online: web-based tool for visualization, phylogenetic inference, analysis and sharing of minimum spanning trees.</article-title>
                    <source>
				
                        <italic toggle="yes">Nucleic Acids Res.</italic>
			</source>
                    <year>2016</year>;<volume>44</volume>(<issue>W1</issue>):<fpage>W246</fpage>&#x2013;<lpage>51</lpage>.
                    <pub-id pub-id-type="pmid">27131357</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkw359</pub-id>
                    <pub-id pub-id-type="pmcid">4987911</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-61">
                <label>61</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Letunic</surname>
                            <given-names>I</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Bork</surname>
                            <given-names>P</given-names>
                        </name>
			</person-group>:
                    <article-title>Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees.</article-title>
                    <source>
				
                        <italic toggle="yes">Nucleic Acids Res.</italic>
			</source>
                    <year>2016</year>;<volume>44</volume>(<issue>W1</issue>):<fpage>W242</fpage>&#x2013;<lpage>5</lpage>.
                    <pub-id pub-id-type="pmid">27095192</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkw290</pub-id>
                    <pub-id pub-id-type="pmcid">4987883</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-62">
                <label>62</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Argim&#x00f3;n</surname>
                            <given-names>S</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Abudahab</surname>
                            <given-names>K</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Goater</surname>
                            <given-names>RJ</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Microreact: visualizing and sharing data for genomic epidemiology and phylogeography.</article-title>
                    <source>
				
                        <italic toggle="yes">Microb Genom.</italic>
			</source>
                    <year>2016</year>;<volume>2</volume>(<issue>11</issue>):<fpage>e000093</fpage>.
                    <pub-id pub-id-type="pmid">28348833</pub-id>
                    <pub-id pub-id-type="doi">10.1099/mgen.0.000093</pub-id>
                    <pub-id pub-id-type="pmcid">5320705</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-63">
                <label>63</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Bratcher</surname>
                            <given-names>HB</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Brehony</surname>
                            <given-names>C</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Heuberger</surname>
                            <given-names>S</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Establishment of the European meningococcal strain collection genome library (EMSC-GL) for the 2011 to 2012 epidemiological year.</article-title>
                    <source>
				
                        <italic toggle="yes">Euro Surveill.</italic>
			</source>
                    <year>2018</year>;<volume>23</volume>(<issue>20</issue>).
                    <pub-id pub-id-type="pmid">29790460</pub-id>
                    <pub-id pub-id-type="doi">10.2807/1560-7917.ES.2018.23.20.17-00474</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-64">
                <label>64</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Medini</surname>
                            <given-names>D</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Serruto</surname>
                            <given-names>D</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Parkhill</surname>
                            <given-names>J</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Microbiology in the post-genomic era.</article-title>
                    <source>
				
                        <italic toggle="yes">Nat Rev Microbiol.</italic>
			</source>
                    <year>2008</year>;<volume>6</volume>(<issue>6</issue>):<fpage>419</fpage>&#x2013;<lpage>30</lpage>.
                    <pub-id pub-id-type="pmid">18475305</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrmicro1901</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-65">
                <label>65</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Wetterstrand</surname>
                            <given-names>KA</given-names>
                        </name>
			</person-group>:
                    <article-title>Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP)</article-title>.
                    <ext-link ext-link-type="uri" xlink:href="http://www.genome.gov/sequencingcostsdata">www.genome.gov/sequencingcostsdata</ext-link>: National Institutes of Health;<year>2018</year>; [updated 25/4/2018;  cited 2018 27/08/2018].
                    <ext-link ext-link-type="uri" xlink:href="http://www.genome.gov/sequencingcostsdata">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-66">
                <label>66</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Chain</surname>
                            <given-names>PS</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Grafham</surname>
                            <given-names>DV</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Fulton</surname>
                            <given-names>RS</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Genomics. Genome project standards in a new era of sequencing.</article-title>
                    <source>
				
                        <italic toggle="yes">Science.</italic>
			</source>
                    <year>2009</year>;<volume>326</volume>(<issue>5950</issue>):<fpage>236</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">19815760</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.1180614</pub-id>
                    <pub-id pub-id-type="pmcid">3854948</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-67">
                <label>67</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Yu</surname>
                            <given-names>Y</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Hu</surname>
                            <given-names>W</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>B</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>
                        <italic toggle="yes">Vibrio parahaemolyticus</italic> isolates from southeastern Chinese coast are genetically diverse with circulation of clonal complex 3 strains since 2002.</article-title>
                    <source>
				
                        <italic toggle="yes">Foodborne Pathog Dis.</italic>
			</source>
                    <year>2011</year>;<volume>8</volume>(<issue>11</issue>):<fpage>1169</fpage>&#x2013;<lpage>76</lpage>.
                    <pub-id pub-id-type="pmid">21883006</pub-id>
                    <pub-id pub-id-type="doi">10.1089/fpd.2011.0865</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-68">
                <label>68</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Magri</surname>
                            <given-names>MM</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Gomes-Gouv&#x00ea;a</surname>
                            <given-names>MS</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>de Freitas</surname>
                            <given-names>VL</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Multilocus sequence typing of Candida tropicalis shows the presence of different clonal clusters and fluconazole susceptibility profiles in sequential isolates from candidemia patients in Sao Paulo, Brazil.</article-title>
                    <source>
				
                        <italic toggle="yes">J Clin Microbiol.</italic>
			</source>
                    <year>2013</year>;<volume>51</volume>(<issue>1</issue>):<fpage>268</fpage>&#x2013;<lpage>77</lpage>.
                    <pub-id pub-id-type="pmid">23152555</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.02366-12</pub-id>
                    <pub-id pub-id-type="pmcid">3536249</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-69">
                <label>69</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Cody</surname>
                            <given-names>AJ</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>McCarthy</surname>
                            <given-names>ND</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Jansen van Rensburg</surname>
                            <given-names>M</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Real-time genomic epidemiological evaluation of human 
                        <italic toggle="yes">Campylobacter</italic> isolates by use of whole-genome multilocus sequence typing.</article-title>
                    <source>
				
                        <italic toggle="yes">J Clin Microbiol.</italic>
			</source>
                    <year>2013</year>;<volume>51</volume>(<issue>8</issue>):<fpage>2526</fpage>&#x2013;<lpage>34</lpage>.
                    <pub-id pub-id-type="pmid">23698529</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.00066-13</pub-id>
                    <pub-id pub-id-type="pmcid">3719633</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-70">
                <label>70</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Hampson</surname>
                            <given-names>DJ</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>La</surname>
                            <given-names>T</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Phillips</surname>
                            <given-names>ND</given-names>
                        </name>
			</person-group>:
                    <article-title>Emergence of 
                        <italic toggle="yes">Brachyspira</italic> species and strains: reinforcing the need for surveillance.</article-title>
                    <source>
				
                        <italic toggle="yes">Porcine Health Manag.</italic>
			</source>
                    <year>2015</year>;<volume>1</volume>:<fpage>8</fpage>.
                    <pub-id pub-id-type="pmid">28694985</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s40813-015-0002-1</pub-id>
                    <pub-id pub-id-type="pmcid">5499009</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-71">
                <label>71</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Schulz</surname>
                            <given-names>J</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Dumke</surname>
                            <given-names>J</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Hinse</surname>
                            <given-names>D</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Organic Turkey Flocks: A Reservoir of 
                        <italic toggle="yes">Streptococcus gallolyticus subspecies gallolyticus</italic>.</article-title>
                    <source>
				
                        <italic toggle="yes">PLoS One.</italic>
			</source>
                    <year>2015</year>;<volume>10</volume>(<issue>12</issue>):<fpage>e0144412</fpage>.
                    <pub-id pub-id-type="pmid">26657757</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0144412</pub-id>
                    <pub-id pub-id-type="pmcid">4675529</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-72">
                <label>72</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Harrison</surname>
                            <given-names>OB</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Clemence</surname>
                            <given-names>M</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Dillard</surname>
                            <given-names>JP</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Genomic analyses of 
                        <italic toggle="yes">Neisseria gonorrhoeae</italic> reveal an association of the gonococcal genetic island with antimicrobial resistance.</article-title>
                    <source>
				
                        <italic toggle="yes">J Infect.</italic>
			</source>
                    <year>2016</year>;<volume>73</volume>(<issue>6</issue>):<fpage>578</fpage>&#x2013;<lpage>87</lpage>.
                    <pub-id pub-id-type="pmid">27575582</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.jinf.2016.08.010</pub-id>
                    <pub-id pub-id-type="pmcid">5127880</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-73">
                <label>73</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
				
                        <name name-style="western">
                            <surname>Ganesh</surname>
                            <given-names>K</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Allam</surname>
                            <given-names>M</given-names>
                        </name>
				
                        <name name-style="western">
                            <surname>Wolter</surname>
                            <given-names>N</given-names>
                        </name>
				
                        <etal/>
			</person-group>:
                    <article-title>Molecular characterization of invasive capsule null 
                        <italic toggle="yes">Neisseria meningitidis</italic> in South Africa.</article-title>
                    <source>
				
                        <italic toggle="yes">BMC Microbiol.</italic>
			</source>
                    <year>2017</year>;<volume>17</volume>(<issue>1</issue>):<fpage>40</fpage>.
                    <pub-id pub-id-type="pmid">28222677</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12866-017-0942-5</pub-id>
                    <pub-id pub-id-type="pmcid">5320719</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-74">
                <label>74</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Guerrero</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liz&#x00e1;rraga-Partida</surname>
                            <given-names>ML</given-names>
                        </name>

                        <name name-style="western">
                            <surname>G&#x00f3;mez Gil Rodr&#x00ed;guez</surname>
                            <given-names>B</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genetic Analysis of 
                        <italic toggle="yes">Vibrio parahaemolyticus</italic> O3:K6 Strains That Have Been Isolated in Mexico Since 1998.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2017</year>;<volume>12</volume>(<issue>1</issue>):<fpage>e0169722</fpage>.
                    <pub-id pub-id-type="pmid">28099500</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0169722</pub-id>
                    <pub-id pub-id-type="pmcid">5242489</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-75">
                <label>75</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Menghwar</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>He</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>H</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genotype distribution of Chinese 
                        <italic toggle="yes">Mycoplasma</italic> bovis isolates and their evolutionary relationship to strains from other countries.</article-title>
                    <source>

                        <italic toggle="yes">Microb Pathog.</italic>
</source>
                    <year>2017</year>;<volume>111</volume>:<fpage>108</fpage>&#x2013;<lpage>17</lpage>.
                    <pub-id pub-id-type="pmid">28826770</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.micpath.2017.08.029</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-76">
                <label>76</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tsang</surname>
                            <given-names>RSW</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ulanova</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>The changing epidemiology of invasive 
                        <italic toggle="yes">Haemophilus influenzae</italic> disease: Emergence and global presence of serotype a strains that may require a new vaccine for control.</article-title>
                    <source>

                        <italic toggle="yes">Vaccine.</italic>
</source>
                    <year>2017</year>;<volume>35</volume>(<issue>33</issue>):<fpage>4270</fpage>&#x2013;<lpage>5</lpage>.
                    <pub-id pub-id-type="pmid">28666758</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.vaccine.2017.06.001</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-77">
                <label>77</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yang</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yu</surname>
                            <given-names>X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhan</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multilocus sequence type profiles of 
                        <italic toggle="yes">Bacillus cereus</italic> isolates from infant formula in China.</article-title>
                    <source>

                        <italic toggle="yes">Food Microbiol.</italic>
</source>
                    <year>2017</year>;<volume>62</volume>:<fpage>46</fpage>&#x2013;<lpage>50</lpage>.
                    <pub-id pub-id-type="pmid">27889164</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.fm.2016.09.007</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-78">
                <label>78</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>El Bannah</surname>
                            <given-names>AMS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nawar</surname>
                            <given-names>NN</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hassan</surname>
                            <given-names>RMM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Molecular Epidemiology of Carbapenem-Resistant Acinetobacter baumannii in a Tertiary Care Hospital in Egypt: Clonal Spread of blaOXA-23.</article-title>
                    <source>

                        <italic toggle="yes">Microb Drug Resist.</italic>
</source>
                    <year>2018</year>;<volume>24</volume>(<issue>3</issue>):<fpage>269</fpage>&#x2013;<lpage>77</lpage>.
                    <pub-id pub-id-type="pmid">28783427</pub-id>
                    <pub-id pub-id-type="doi">10.1089/mdr.2017.0057</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-79">
                <label>79</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cody</surname>
                            <given-names>AJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>McCarthy</surname>
                            <given-names>NM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wimalarathna</surname>
                            <given-names>HL</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A longitudinal 6-year study of the molecular epidemiology of clinical 
                        <italic toggle="yes">campylobacter</italic> isolates in Oxfordshire, United kingdom.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Microbiol.</italic>
</source>
                    <year>2012</year>;<volume>50</volume>(<issue>10</issue>):<fpage>3193</fpage>&#x2013;<lpage>201</lpage>.
                    <pub-id pub-id-type="pmid">22814466</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.01086-12</pub-id>
                    <pub-id pub-id-type="pmcid">3457434</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-80">
                <label>80</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rodrigues</surname>
                            <given-names>CMC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Maiden</surname>
                            <given-names>MCJ</given-names>
                        </name>
</person-group>:
                    <article-title>A world without bacterial meningitis: how genomic epidemiology can inform vaccination strategy [version 1;  referees: 2 approved].</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2018</year>;<volume>7</volume>(<issue>F1000 Faculty Rev</issue>):<fpage>401</fpage>.
                    <pub-id pub-id-type="pmid">29636909</pub-id>
                    <pub-id pub-id-type="doi">10.12688/f1000research.13793.1</pub-id>
                    <pub-id pub-id-type="pmcid">5871810</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-81">
                <label>81</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rodrigues</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Brehony</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Borrow</surname>
                            <given-names>R</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Genomic surveillance and meningococcal group B vaccine coverage estimates after introduction of the vaccine into the national immunisation programme in the UK.</article-title>
                    <source>

                        <italic toggle="yes">Lancet.</italic>
</source>
                    <year>2017</year>;<volume>389</volume>:<fpage>S85</fpage>.
                    <pub-id pub-id-type="doi">10.1016/S0140-6736(17)30481-6</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-82">
                <label>82</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ammon</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Makela</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Integrated data collection on zoonoses in the European Union, from animals to humans, and the analyses of the data.</article-title>
                    <source>

                        <italic toggle="yes">Int J Food Microbiol.</italic>
</source>
                    <year>2010</year>;<volume>139 Suppl 1</volume>:<fpage>S43</fpage>&#x2013;<lpage>7</lpage>.
                    <pub-id pub-id-type="pmid">20395001</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.ijfoodmicro.2010.03.002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-83">
                <label>83</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ferr&#x00e9;s</surname>
                            <given-names>I</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Iraola</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>MLSTar: automatic multilocus sequence typing of bacterial genomes in R.</article-title>
                    <source>

                        <italic toggle="yes">PeerJ.</italic>
</source>
                    <year>2018</year>;<volume>6</volume>:<fpage>e5098</fpage>.
                    <pub-id pub-id-type="pmid">29922519</pub-id>
                    <pub-id pub-id-type="doi">10.7717/peerj.5098</pub-id>
                    <pub-id pub-id-type="pmcid">6005169</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-84">
                <label>84</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tewolde</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dallman</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Schaefer</surname>
                            <given-names>U</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MOST: a modified MLST typing tool based on short read sequencing.</article-title>
                    <source>

                        <italic toggle="yes">PeerJ.</italic>
</source>
                    <year>2016</year>;<volume>4</volume>:<fpage>e2308</fpage>.
                    <pub-id pub-id-type="pmid">27602279</pub-id>
                    <pub-id pub-id-type="doi">10.7717/peerj.2308</pub-id>
                    <pub-id pub-id-type="pmcid">4991843</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-85">
                <label>85</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Larsen</surname>
                            <given-names>MV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cosentino</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rasmussen</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multilocus sequence typing of total-genome-sequenced bacteria.</article-title>
                    <source>

                        <italic toggle="yes">J Clin Microbiol.</italic>
</source>
                    <year>2012</year>;<volume>50</volume>(<issue>4</issue>):<fpage>1355</fpage>&#x2013;<lpage>61</lpage>.
                    <pub-id pub-id-type="pmid">22238442</pub-id>
                    <pub-id pub-id-type="doi">10.1128/JCM.06094-11</pub-id>
                    <pub-id pub-id-type="pmcid">3318499</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-86">
                <label>86</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Page</surname>
                            <given-names>AJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Taylor</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Keane</surname>
                            <given-names>JA</given-names>
                        </name>
</person-group>:
                    <article-title>Multilocus sequence typing by blast from 
                        <italic toggle="yes">de novo</italic> assemblies against PubMLST.</article-title>
                    <source>

                        <italic toggle="yes">J Open Source Softw.</italic>
</source>
                    <year>2016</year>;<volume>1</volume>(<issue>8</issue>):<fpage>118</fpage>.
                    <pub-id pub-id-type="doi">10.21105/joss.00118</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-87">
                <label>87</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gupta</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jordan</surname>
                            <given-names>IK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rishishwar</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <article-title>stringMLST: a fast k-mer based tool for multilocus sequence typing.</article-title>
                    <source>

                        <italic toggle="yes">Bioinformatics.</italic>
</source>
                    <year>2017</year>;<volume>33</volume>(<issue>1</issue>):<fpage>119</fpage>&#x2013;<lpage>21</lpage>.
                    <pub-id pub-id-type="pmid">27605103</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btw586</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-88">
                <label>88</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Inouye</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dashnow</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Raven</surname>
                            <given-names>LA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>SRST2: Rapid genomic surveillance for public health and hospital microbiology labs.</article-title>
                    <source>

                        <italic toggle="yes">Genome Med.</italic>
</source>
                    <year>2014</year>;<volume>6</volume>(<issue>11</issue>):<fpage>90</fpage>.
                    <pub-id pub-id-type="pmid">25422674</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s13073-014-0090-6</pub-id>
                    <pub-id pub-id-type="pmcid">4237778</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-89">
                <label>89</label>
                <mixed-citation publication-type="journal">
                    <article-title>Expert Opinion on the introduction of next-generation typing methods for food- and waterborne diseases in the EU and EEA</article-title>. Stockholm, Sweden: European Centre for Disease Prevention and Control,<year>2015</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://ecdc.europa.eu/en/publications/Publications/food-and-waterborne-diseases-next-generation-typing-methods.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-90">
                <label>90</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nadon</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Van Walle</surname>
                            <given-names>I</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gerner-Smidt</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>PulseNet International: Vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance.</article-title>
                    <source>

                        <italic toggle="yes">Euro Surveill.</italic>
</source>
                    <year>2017</year>;<volume>22</volume>(<issue>23</issue>): pii: 30544.
                    <pub-id pub-id-type="pmid">28662764</pub-id>
                    <pub-id pub-id-type="doi">10.2807/1560-7917.ES.2017.22.23.30544</pub-id>
                    <pub-id pub-id-type="pmcid">5479977</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-91">
                <label>91</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Louren&#x00e7;o</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Watkins</surname>
                            <given-names>ER</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Obolski</surname>
                            <given-names>U</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Lineage structure of 
                        <italic toggle="yes">Streptococcus pneumoniae</italic> may be driven by immune selection on the groEL heat-shock protein.</article-title>
                    <source>

                        <italic toggle="yes">Sci Rep.</italic>
</source>
                    <year>2017</year>;<volume>7</volume>(<issue>1</issue>):<fpage>9023</fpage>.
                    <pub-id pub-id-type="pmid">28831154</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41598-017-08990-z</pub-id>
                    <pub-id pub-id-type="pmcid">5567354</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report33973">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/wellcomeopenres.16155.r33973</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Pouseele</surname>
                        <given-names>Hannes</given-names>
                    </name>
                    <xref ref-type="aff" rid="r33973a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r33973a1">
                    <label>1</label>Data Analytics, BioM&#x00e9;rieux/Applied Maths, Sint-Martens-Latem, East Flanders, Belgium</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>15</day>
                <month>10</month>
                <year>2018</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2018 Pouseele H</copyright-statement>
                <copyright-year>2018</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport33973" related-article-type="peer-reviewed-article" xlink:href="10.12688/wellcomeopenres.14826.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>I found the article generally well written, with a good balance between the history of gene-by-gene systems, the previous version of BIGSDb and the novelties that have been implemented lately. The use cases are clearly describing the utility of this resource and its databases for a microbiologist. The effort that was done to provide an API is a key element in allowing machine-to-machine exchange and integration in the broader digital microbiology space. As such, the article serves as a good introduction for novices and a good overview for experts of recent evolutions in the platform.</p>
            <p> Some more specific comments. 
                <list list-type="bullet">
                    <list-item>
                        <p>On page 1, the sentence &#x201c;Originally intended as a means &#x2026; &#x201c; is rather long, and its meaning is not immediately clear</p>
                    </list-item>
                    <list-item>
                        <p>On page 3, what do the authors mean by the &#x201c;federated&#x201d; in &#x201c;a federated network of interconnected data resources&#x201d;</p>
                    </list-item>
                    <list-item>
                        <p>On page 3, the authors discuss the data access model, focusing on the private/public distinction. However, there is a third player in the game: the people that host the service itself have by definition administrator access, and maybe also curator access to all private data as well. It would be good to specify what the data access policy is for these types of access.</p>
                    </list-item>
                    <list-item>
                        <p>On page 3, what do you mean by &#x201c;the principle curated schemes&#x201d;?</p>
                    </list-item>
                    <list-item>
                        <p>On page 7, the allele detection method is described in great precision. However, the last sentence is of the first paragraph is rather vague. To what will the allowed percentage identity difference be increased? When will this be done, the formulation implies this is a future change?</p>
                    </list-item>
                    <list-item>
                        <p>Also on page 7, it is somehow remarkable that these sequence identity related thresholds are independent of the organism. Maybe worth mentioning that these are universal thresholds, and why this can be the case?</p>
                    </list-item>
                    <list-item>
                        <p>On page 8, the authors describe how schemes are built. It strikes me that there is no mention of filtering loci based on the presence of repeated DNA. Does this imply that there is no repeat filtering? In my experience, this is an avoidable cause of noise in the analysis.</p>
                    </list-item>
                    <list-item>
                        <p>On page 8, first line of the second column has &#x201c;colour&#x201d; following British spelling. What is the journal&#x2019;s policy on this? Is the spelling consistent throughout the article? I suspect the same is going on with &#x201c;proved&#x201d;(page 11 and 12), I would tend to use &#x201c;proven&#x201d; but I will leave the choice to the native speakers.</p>
                    </list-item>
                </list>
            </p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Partly</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report33972">
        <front-stub>
            <article-id pub-id-type="doi">10.21956/wellcomeopenres.16155.r33972</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Brisse</surname>
                        <given-names>Sylvain</given-names>
                    </name>
                    <xref ref-type="aff" rid="r33972a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2516-2108</uri>
                </contrib>
                <aff id="r33972a1">
                    <label>1</label>Biodiversity and Epidemiology of Bacterial Pathogens, Institut Pasteur, &#x00a0;Paris, France</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>My group is using the BIGSdb platform to power the Pasteur MLST web site and databases; as such, I am benefiting from the collaboration of K. Jolley in supporting the deployment of the web application at Pasteur.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>8</day>
                <month>10</month>
                <year>2018</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2018 Brisse S</copyright-statement>
                <copyright-year>2018</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport33972" related-article-type="peer-reviewed-article" xlink:href="10.12688/wellcomeopenres.14826.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>General comments:</p>
            <p> </p>
            <p> The article describes the current status of the pubMLST web site and its underlying web application software tool, BIGSdb. The article is very clearly written, concise and extremely informative as to the novel functionalities developed since the previous publication on the same subject. The developments of the platform that were achieved in recent years are quite impressive and the platform now allows any number of genomes and their linked provenance data to be analyzed at any number of gene loci for any number of microbial species/genera. Perhaps the most important developments are the APIs that allow third party software to interconnect with the database contents, effectively placing the BIGSdb nomenclatures at the center of a complex ecosystem of bioinformatics tools and resources in the fields of microbial population biology, epidemiology, clinical diagnostics and public health surveillance of pathogens.</p>
            <p> </p>
            <p> The impressive community of curators and users that nourish the databases and benefit from the pubMLST resource is a testimony of its unique value for basic microbiology research as well as for public health and clinical applications.</p>
            <p> </p>
            <p> This large success in turns raises the important issue of the sustainability of the resource. As is apparent from the authors list of the manuscript, the BIGSdb resource was almost exclusively developed by a small team of academics. The impact of this team on the field is truly remarkable, but the huge success of the pubMLST resource creates a kind of anomaly in the sense that its fragile reliance on short-term academic engagement and grant-based financial support highlights the lack of long-term, institutional committment to secure the future of the resource. The large community adoption and the dependence of increasing numbers of public health surveillance networks on this resource calls for a collective reflection on how the resource could be supported by end users or other stakeholders. An engagement by the microbiology and public health communities to support the maintenance of the database infrastructure and future developments of novel functionality seem to be a logical consequence of the large recognition of the value of the pubMLST resource, but remains to be organized. One possible way forward might be to create a more structured community of users and supporters of the resource that could promote activities such as training or community-based software development, and would foster collective actions aiming at securing long-term financial support and sustainability of the resource. In this context, the authors might want to elaborate in the manuscript, on the way communities of users are currently structured.</p>
            <p> </p>
            <p> </p>
            <p> Specific comments</p>
            <p> </p>
            <p> The pubMLST home page is concise and focused on rapid access to main functionality or information and clearly seem to be designed for people familiar with the site purposes. It might be useful to add a short header text summarizing the purpose of the web site for non familiar users, with some links to key publications like the one under consideration.</p>
            <p> </p>
            <p> Figure 8: the tree is derived from a contatenation of alignments of gene sequences of a given scheme.&#x00a0; This is a very useful BIGSdb functionality as it allows capturing the entire sequence variation rather than just its summary based on allele identifiers. However in this case, the impact of homologous recombination on inferred strain relationships is not taken into account; a recombination purging step might be a useful addition.</p>
            <p> </p>
            <p> page 12: The composition of the majority of databases is contingent : what is meant by &#x201c;contigent&#x201d; here?</p>
            <p> page 12: A &#x201c;principle&#x201d; aim of most database curator teams is to catalogue the known diversity : is &#x201c;principal&#x201d; meant?</p>
            <p> Figure 9 is a useful overview of different ways the platform can be used by different user communities. Although it is largely self-explanatory, there is no discussion about this figure in the text.</p>
            <p>Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?</p>
            <p>Yes</p>
            <p>Is the rationale for developing the new software tool clearly explained?</p>
            <p>Yes</p>
            <p>Is the description of the software tool technically sound?</p>
            <p>Yes</p>
            <p>Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?</p>
            <p>Yes</p>
            <p>Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>bacterial population biology and public health microbiology</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
