Subscriber access provided by Northern Illinois University
Article
Automated High-Throughput Identification and Characterisation of Clinically Important Bacteria and Fungi using Rapid Evaporative Ionisation Mass Spectrometry (REIMS) Frances Bolt, Simon J S Cameron, Tamas Karancsi, Daniel Simon, Richard Schaffer, Tony Rickards, Kate Hardiman, Adam Burke, Zsolt Bodai, Alvaro Perdones-Montero, Monica Rebec, Julia Balog, and Zoltan Takats Anal. Chem., Just Accepted Manuscript • DOI: 10.1021/acs.analchem.6b01016 • Publication Date (Web): 25 Aug 2016 Downloaded from http://pubs.acs.org on August 30, 2016
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
Analytical Chemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
1
Automated High-Throughput Identification and Characterisation of Clinically Important Bacteria
2
and Fungi using Rapid Evaporative Ionisation Mass Spectrometry (REIMS)
3 4
Frances Bolt (1)†; Simon JS Cameron (1)†; Tamas Karancsi (2); Daniel Simon (2); Richard Schaffer (2);
5
Tony Rickards (3); Kate Hardiman (1); Adam Burke (1); Zsolt Bodai (1); Alvaro Perdones-Montero (1);
6
Monica Rebec (3); Julia Balog (2); Zoltan Takats (1)*.
7 8
(1) Section of Computational and Systems Medicine, Department of Surgery and Cancer, Imperial
9
College London, London, SW7 2AZ, United Kingdom; (2) Waters Research Centre, 7 Zahony Street,
10
Budapest, 1031, Hungary; (3) Department of Microbiology, Imperial College Healthcare NHS Trust,
11
Charing Cross Hospital, London, W6 8RF, United Kingdom
12 13
† Joint first authors / Contributed equally
14 15
* Corresponding Author: Professor Zoltan Takats, Division of Computational and Systems Medicine,
16
Department of Surgery and Cancer, Imperial College London, South Kensington Campus, Sir
17
Alexander
18
[email protected] Fleming
Building,
London,
SW7
2AZ.
Telephone:
19 20 21 22 23 24 25 26
1 Environment ACS Paragon Plus
020
7594
2760.
Email:
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
27
2 Environment ACS Paragon Plus
Page 2 of 38
Page 3 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
28
Abstract
29
Rapid Evaporative Ionisation Mass Spectrometry (REIMS) has been shown to quickly and accurately
30
speciate microorganisms based upon their species-specific lipid profile. Previous work by members
31
of this group showed that the use of a handheld bipolar probe allowed REIMS to analyse microbial
32
cultures directly from culture plates, without any prior preparation. However, this method of
33
analysis would likely be unsuitable for a high-throughput clinical microbiology laboratory. Here, we
34
report on the creation of a customised platform which enables automated, high-throughput REIMS
35
analysis, which requires minimal user input and operation; and suitable for use in clinical
36
microbiology laboratories. The ability of this high-throughput platform to speciate clinically
37
important microorganisms was tested through the analysis of 375 different clinical isolates, collected
38
from distinct patient samples, from 25 microbial species. After optimisation of our data analysis
39
approach, we achieved substantially similar results between the two REIMS approaches. For
40
handheld bipolar probe REIMS a speciation accuracy of 96.3% was achieved, whilst for high-
41
throughput REIMS, an accuracy of 93.9% was achieved. Thus, high-throughput REIMS offers an
42
alternative mass spectrometry based method for the rapid and accurate identification of clinically
43
important microorganisms in clinical laboratories without any pre-analysis preparative steps.
44 45 46 47 48 49 50 51 52 53
3 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 4 of 38
54
Introduction
55
The introduction of mass spectrometry based microbial identification has revolutionised clinical
56
microbiology laboratories. Historically, species delineations were often reliant on phenotypic and
57
biochemical characteristics, which were laborious, costly, and often extended the time required for
58
results by days
59
VITEK MS (bioMérieux) have reduced the time to identification, they do require the addition of a
60
matrix to assist in lysis and ionisation and, in some instances, additional extraction steps are
61
required for accurate species classification 2,4.
1-3
. Whilst mass spectrometry platforms including the MALDI Biotyper (Bruker) and
62 63
Rapid evaporative ionisation mass spectrometry (REIMS) provides accurate speciation of bacteria
64
and yeasts directly from colonies without the need for additional preparative steps 5,6. In contrast to
65
many of the commercially available instruments, which are based upon the fingerprinting of proteins
66
within the mass range 2−20 kDa 7, REIMS allows the lipidomic profile of bacteria and fungi to be
67
determined. The use of lipid analysis for microbial taxonomic classifications is well established and,
68
gas chromatography fatty acid profiling, has been widely used for over fifty years
69
also important in differentiating bacteria using Pyrolysis MS; however, in early studies this technique
70
was limited as only small molecules and fragments of larger molecules were observed due to the
71
high energy ionisation process 13-16. Fast atom bombardment (FAB-MS) based upon the desorption
72
of phospholipid molecules also allowed the differentiation of bacterial species
73
desorption electrospray ionization (DESI) provided strain level resolution on intact bacteria based
74
upon analysis within the lipid region 18.
17
8-12
. Lipids were
. More recently,
75 76
REIMS works by applying a radiofrequency electrical current to the microorganism, which causes
77
rapid heating of microbial biomass. The resulting vapour, containing gas phase ions of metabolites
78
and structural lipids, is channelled to a mass spectrometer for MS analysis. We have previously
4 Environment ACS Paragon Plus
Page 5 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
79
demonstrated that lipidomic spectral profiles obtained using REIMS could be used for the accurate
80
speciation of yeast and 28 clinically important bacteria 6.
81 82
To increase throughput and reduce costs, clinical microbiology laboratories have undergone rapid
83
automation. This has been the result of a range of factors, including increased financial pressures,
84
centralisation of laboratory diagnostics, and substantially increased number of specimens 19. Current
85
methods of automation are increasingly being paired with mass spectrometry platforms for
86
microbial identification and have been shown to reduce the time to identification by approximately
87
24 hours and improve antibiotic prescribing practices
88
clinical microbiology laboratories
89
customised TECAN Freedom EVO instrument integrated with a Pickolo system (SciRobotics, Israel) to
90
enable automated analysis. This provides a single platform for automated colony picking, REIMS
91
analysis, and molecular processing thereby minimising user related errors and thus making it highly
92
applicable to a modern clinical microbiology laboratory’s workflow. As described here, the method is
93
simple, requires minimal user input, and can easily be adapted to fit into a full laboratory automated
94
workflow
95
further reduce analysis times and improve patient outcomes.
20
21,22
20
. Due to the high-throughput nature of
, the REIMS platform has since been modified using a
. Furthermore, because the technique requires no pre-analysis preparation, it could
96 97 98 99 100 101 102 103 104
5 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
105
Experimental Section
106 107
Culturing of Microbial Isolates
108
A total of 375 isolates comprising 25 bacterial and fungal species, shown in Supplementary Table S1,
109
were collected from clinical samples routinely processed at the Imperial College Healthcare Trust
110
(London, UK) microbiology diagnostic laboratory. Prior to REIMS analysis, each isolate was cultivated
111
according to the conditions detailed in Table S1. Fifteen isolates were examined for each of the 25
112
species.
113 114
MALDI Biotyper Identification
115
Microbial species identifications were verified for all isolates using matrix assisted laser desorption
116
ionisation time of flight (MALDI-TOF) mass spectrometry on the Bruker Microflex LT (Bruker
117
Daltonics, Coventry, UK) system, according to the manufacturer’s instructions. For bacteria, a single
118
colony of culture was spotted onto a 24-spot steel plate (Bruker Daltonics). After drying, 1 µl of α-
119
cyano-4-hydroxycinnamic acid (HCCA) matrix, (Bruker Daltonics) dissolved in 50% acetonitrile, 47.5%
120
water, and 2.5% trifluoroacetic acid was applied. Once dried, the samples were analysed following
121
manufacturer’s instructions. For yeasts, one colony was suspended in 1 mL of 70% ethanol and
122
homogenised. The sample then underwent centrifugation at 15,000 x g for 2 minutes and the
123
supernatant was removed. After drying at room temperature for 30 minutes, between 10 µL and 60
124
µL of 70% formic acid, depending on the size of the culture pellet, was added and mixed through
125
pipetting. Pure acetonitrile was then added in an equal amount to formic acid and incubated at
126
room temperature for 3 minutes. The sample underwent centrifugation at 15,000 x g for 3 minutes
127
and 1 µL of supernatant was spotted onto an analysis plate. After drying, the spot was overlaid with
128
1 µL of HCCA matrix as previously described and allowed to dry before analysis following
129
manufacturer’s instructions. Both bacterial and yeast isolates were analysed using the Bruker
130
Biotyper software (version 3.0) and library (version 5.0) for taxonomic classification.
6 Environment ACS Paragon Plus
Page 6 of 38
Page 7 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
131 132
REIMS Analysis using Handheld Bipolar Probe
133
REIMS analysis with a handheld bipolar probe was performed using a similar protocol to that
134
described earlier6. In brief, handheld bipolar probes (Erbe Elektromedizin, Tübingen, Germany) were
135
combined with an ERBE IC 300 electrosurgical generator (Erbe Elektromedizin) used at a 70 W power
136
setting in biopolar mode as a radio frequency alternating current power supply (470 kHz sinusoid). A
137
ca. 1.5 m long (inner diameter 1.5 mm, outer diameter 3.2 mm) PTFE tube (Sigma-Aldrich, UK) was
138
used to connect the handheld bipolar probe to the inlet capillary of a Xevo G2-XS Q-ToF mass
139
spectrometer instrument (Waters Corporation, Wilmslow, UK), allowing transfer of the generated
140
aerosol. The vacuum of the mass spectrometer was used for aspiration of the aerosol and it was co-
141
aspirated, at a flow rate of 0.2 mL/s, with 2-propanol (HPLC Chromasolv grade, Sigma Aldrich)
142
containing Leu-enkephaline (VWR) at a concentration of 10 ng/µL. The aerosol and 2-propanol/leu-
143
enkephaline solution was introduced using a T-piece as shown in Supplementary Figure S1. For each
144
analytical repeat, between one and five colonies, was removed from the culture plate using one side
145
of the handheld bipolar probe. The handheld bipolar probe was then closed and the power supply
146
was triggered using a manual foot switch, which caused the biomass to rapidly evaporate and
147
produce an analyte-containing aerosol. Five individual measurements were performed for each
148
isolate.
149 150
High-Throughput REIMS Analysis
151
To allow for high-throughput REIMS analysis, a commercially available colony picker robot (Freedom
152
EVO 75 (TECAN, Switzerland) equipped with a Pickolo platform (SciRobotics, Israel)) was used. As
153
shown in Figure 1, the user places the agar plates in the rack, (a), and loads the probes in position,
154
(b). High-throughput REIMS uses a modified command script whereby the user initially selects the
155
number of culture plates to be analysed. The robotic arm of the instrument then picks up the first
156
plate and places it on the imaging platform, Figure 1c. The operator is then able to see microbial
7 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
157
colonies and select ones of interest, or the software is able to automatically detect individual
158
colonies for analysis. In this study, only the operator select mode was used for colony identification.
159
After the selection of colonies, a second arm picks up a stainless steel electrode probe as
160
demonstrated by Figure 1d. The bespoke stainless steel probe is a modified version of a 200 µL pure
161
tip (TECAN). Once the probe touches the microbial colony, an ERBE IC 300 electrosurgical generator,
162
set at a 17 W power setting in monopolar mode, applies an electrical current. The rapid heating of
163
the microbial biomass results in the formation of aerosol which is channelled from the probe
164
through a ca. 1.5 M long (inner diameter 1.5 mm, outer diameter 3.2 mm) PTFE tube (Sigma-Aldrich)
165
to a Xevo G2-XS Q-ToF instrument (Waters Corporation). The analyte aerosol is co-aspirated with a
166
10 ng/µL solution of leu-enkephaline in isopropanol as described for handheld bipolar probe REIMS
167
analysis. For each isolate, a total of five individual measurements, were performed with one
168
microbial colony used for each.
169 170
Operation of Xevo G2-XS Q-ToF Mass Spectrometer
171
The Xevo G2-XS Q-ToF instrument was operated under the instrument settings detailed in
172
Supplementary Table S2 for both handheld bipolar probe REIMS and high-throughput REIMS
173
analysis. Prior to its use each day, the mass spectrometer was calibrated using sodium formate in
174
negative electrospray ionisation mode, following the manufacturer’s standard instructions. All mass
175
spectrometry analysis was performed in negative ionisation mode across the 50 to 2500 m/z range.
176 177
Data Analysis
178
All acquired raw mass spectrometry files were imported into the Offline Model Builder (OMB)
179
software (Waters Research Center, Budapest, Hungary) for preprocessing and multivariate analysis.
180
First the data scans were combined into spectra, resulting in five individual replicate spectra of each
181
strain. The spectra were background subtracted, and lockmass corrected on external compound
182
Leucin-encephalin (negative ionisation = m/z 554.2516). After lockmass correction, spectra were TIC
8 Environment ACS Paragon Plus
Page 8 of 38
Page 9 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
183
normalised and rebinned to 0.1 Da. After preprocessing, the data was subjected to multiple
184
multivariate algorithms. First a principle component analysis (PCA) was calculated, and using the first
185
50 principle components, a supervised linear discriminant analysis (LDA) was calculated for
186
classification keeping the first 15 discriminant components. Both PCA and LDA attempt to identify
187
linear combinations of variables within a data set. PCA is an unsupervised data analysis approach
188
which does not take into account class seperation of samples
189
approach which explicity looks to identify linear relationships between variables taking into account
190
class groupings of samples 24. Here, we initially used PCA modelling to reduce the dimensionality of
191
our data set, and then used LDA to identify relationships between variables to form the basis of
192
classification models. The resulting mutivariate ’models’ were validated with leave-one isolate out
193
cross-validation, i.e. each data file containing spectra acquired from one isolate was left out of the
194
training set, a model was built on all other datasets and the data file left out was classified using the
195
training model. Each spectrum was classified to the closest class in the LDA space based on the
196
Mahalanobis distances between objects 25. The correct classification rate was calculated based on
197
the number of spectra classified correctly divided by all spectra of the full dataset. For all species
198
analysis, a model built using all isolates from 25 species was created and used in cross-validation. For
199
taxonomic groupings analysis, individual models were constructed for distinct groups at each
200
taxonomic hierarchy, namely phylum, class, order, family, genus, and species. A probability matrix
201
was then constructed to calculate the likely probability that an isolate would be correctly identified
202
to its taxonomic grouping at each level, shown on Supplementary Figure S2. For Random Forest
203
analyses, a total of 400 decision trees were constructed for each model based on a mass spectral
204
matrix exported from the OMB software after data normalisation, background removal, and lock-
205
massing to leu-enkaphaline. For each model, a mass range of 600 to 900 m/z was used, with mass
206
binning to 0.1.
23
. LDA however, is a supervised
207 208
Safety considerations
9 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
209
In this work, all microorganisms were treated as Hazard Group 2 organisms and were therefore
210
manipulated within containment level 2 facilities. All cultures and REIMS analyses were performed
211
within a class 2 biological safety cabinet. All solvents, such as 2-propanol and methanol, were
212
handled according to the material safety data sheet provided by their respective manufacturer.
213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238
10 Environment ACS Paragon Plus
Page 10 of 38
Page 11 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
239 240 241
11 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
242
Results and Discussion
243
The use of REIMS for the successful classification and characterisation of bacteria and yeasts has
244
previously been reported
245
(electrosurgical bipolar forceps) for analysis and thus may not be suitable for use in a high-
246
throughput, clinical microbiology laboratory. Here, an automated instrument incorporating a colony
247
picker robot and a mass spectrometer was developed to allow for high-throughput REIMS
248
processing. This study explored its use for the identification of microbes by analysing a taxonomically
249
diverse collection of bacteria and fungi. A total of 15 isolates from 25 species were analysed using
250
both REIMS approaches. This represents a total of 375 distinct clinical isolates of which 8% were
251
fungi, 52% were Gram-positive bacteria, and 40% Gram-negative bacteria. All of the species
252
examined are commonly encountered within a clinical diagnostic laboratory. Some species, including
253
Lactobacillus jensenii, are frequently isolated but rarely cause disease whilst others, including
254
Escherichia coli and Staphylococcus aureus, more frequently result in severe infections.
5,6
. However, this approach employed a handheld surgical tool
255 256
Comparison Between Handheld Bipolar and High-Throughput Monopolar Probe REIMS
257
Similar classification accuracies were obtained at Gram-stain, genus, and species level using both
258
manual and high-throughput automated REIMS approaches. However, high-throughput REIMS
259
showed a slightly lower (94% to 96%), genus and species level accuracy compared to handheld
260
bipolar probe REIMS. As Figure 2 demonstrates, this may result from differences in the temperature
261
gradient during the rapid heating of the bipolar (handheld) and monopolar (high-throughput) probe.
262
Because the handheld bipolar probe generates a radiofrequency electrical current between two
263
probe surfaces, the temperature is lower, but more consistent through the microbial biomass,
264
compared to the monopolar probe. The latter may result in reduced ionisation of some microbial
265
metabolites due to the higher localised temperature at the point of contact between the electrical
266
probe and microbial biomass. This may in part, explain the slightly reduced level of genus and
267
species level classification accuracy observed. Optimisation of the high-throughput REIMS’s
12 Environment ACS Paragon Plus
Page 12 of 38
Page 13 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
268
monopolar probe will be pursued to modify the temperature at the point of contact to improve
269
biomass ionisation and thus species-level accuracy.
270 271
Although the high-throughput REIMS approach has a slightly reduced level of genus and species-
272
level accuracy, it is capable of analysing substantially higher numbers of microbial colonies, with
273
reduced variability, and user input. At present, this method is potentially capable of analysing
274
approximately 3,000 to 4,000 microbial colonies over a 24 hour period. The increased analysis
275
throughput, reduced variability, and user input makes the high-throughput REIMS approach,
276
reported here, well suited to implementation in a clinical microbiology laboratory.
277 278
Effect of Mixing Analyte Aerosol with Solvent Matrix
279
The previously reported handheld bipolar REIMS method did not incorporate a solvent matrix within
280
the analyte aerosol. Here, we used 2-propanol as a solvent matrix based upon preliminary data
281
which revealed that it improves detectability (i.e. by the incremental improvement of MS response
282
factor) of spectral features. Furthermore, it reduces the contamination of the mass spectrometer by
283
removing all solid deposits from the capillary inlet of the instrument and allows an external lock
284
mass compound to be used so that mass drift correction can be applied. This ability would be
285
essential for a platform introduced into a clinical microbiology laboratory, since there is no universal
286
microbial marker compound which could be used as a lock mass reference similarly to human tissue
287
REIMS analysis 26.
288 289
Supplementary Figure S3 shows the Gram-stain, genus, and species-level cross-validation scores for
290
both handheld bipolar probe REIMS and high-throughput REIMS with and without 2-propanol
291
infusion. Although Gram-stain accuracy is comparable, there is a marked reduction in genus and
292
species-level accuracy when isolates are analysed using both handheld bipolar probe REIMS and
293
high-throughput REIMS without 2-propanol infusion.
13 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 14 of 38
294 295
Initial All Species Analysis
296
Initial data analysis evaluated the entire acquisition range of 50 to 2500 m/z, using a mass bin of 1.
297
An LDA model was created using all 375 isolates and CV scores were determined from this for both
298
REIMS approaches (shown in Figure 3). For handheld bipolar probe REIMS, cross-validation scores to
299
Gram-stain, genus, and species levels were 99.2%, 94.1%, and 88.0% respectively. Cross-validation
300
scores were 98.7%, 91.5%, and 85.6% for each taxonomic level for high-throughput REIMS.
301
Supplementary Table S2 shows the individual species cross-validation scores and suggests that the
302
overall cross-validation score is being reduced substantially by poor scores associated with closely
303
related species, such as members of the Staphylococcus genus, and members of the
304
Enterobacteriaceae family.
305 306
Comparison of Mass Spectral Ranges
307
The species-level accuracy of both handheld bipolar probe REIMS and high-throughput monopolar
308
REIMS using an all species analysis model was substantially lower than previously reported
309
accuracies for bipolar REIMS 5,6. To explore possible optimisations to our data analysis approach, we
310
created eight different all species models using restricted mass ranges of 50 to 2500 m/z, 150 to
311
2500 m/z, 250 to 2500 m/z, 500 to 2500 m/z, 600 to 900 m/z, 1000 to 2000 m/z, and 1200 to 1800
312
m/z. Figure 4 shows that through a reduction in mass range from 50 to 2500 m/z to 600 to 900 m/z,
313
species-level accuracy increases from 85% to 90% for handheld bipolar probe REIMS and from 86%
314
to 90% for high-throughput REIMS. This mass range commonly contains ionised lipids
315
prior to the introduction of advanced biochemical, genetic, and mass spectrometric analyses, have
316
historically been used for species-level identification
317
accuracy, analysis using a reduced mass range could also increase the method’s throughput rate. The
318
reduced mass range of 600 to 900 m/z equals approximately 12% of the entire acquisition mass
319
range of the Xevo G2-XS Q-ToF. This would correspond to a similar level of reduction in the
29-31
26-28
, which
. In addition to the improved species-level
14 Environment ACS Paragon Plus
Page 15 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
320
computational requirements of data analysis and allow for real-time species-level identification, as
321
has been shown in other applications of REIMS including intra-operative tissue assessment 26.
322 323
Effect of Mass Bin on Species Cross-Validation
324
We also assessed the effect of bin size on the classification performance of the statistical models.
325
For both REIMS approaches, the mass bin size appeared to have minimal effect on species-level
326
cross-validation accuracy, Figure 5. At mass resolutions below 1, namely 0.01, 0.05, 0.1, and 0.5, no
327
consistent result was visible between the two REIMS approaches. For example, the 0.05 mass bin
328
achieved the highest species accuracy for high-throughput REIMS out of all six bin sizes evaluated,
329
whilst for handheld bipolar probe REIMS it achieved the lowest. In agreement with our earlier
330
studies
331
good species level accuracy and computational time. For each tenfold decrease in bin size (for
332
example from 1 to 0.1) the number of variables within each spectrum is increased ten-fold, along
333
with the computational time required for analysis. To minimise the time required for bioinformatic
334
analysis and species-level identification, using a mass bin of one will substantially reduce the
335
computational time and resources required; allowing near real-time species-level identification in a
336
clinical laboratory.
5,6
, we utilised unit resolution for all subsequent analyses as it achieves a balance between
337 338
Species Cross-Validation Using Taxonomy Grouped Models
339
Members of closely-related taxonomic groups, such as the Enterobacteriaceae family or
340
Streptococcus genera, are likely to share substantial similarities in their mass spectral profiles
341
because of the lower level of evolutionary divergence. Therefore, determining species-level cross-
342
validation accuracy using an approach which combines all taxonomic species into one classification
343
model, as described above, may suffer from misclassifications between closely related species. Here,
344
we have explored an alternative approach whereby individual models are created for each
345
taxonomic group at the domain, phylum, class, order, family, genus, and species levels. As
15 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 16 of 38
346
Supplementary Figures S4 and S5 demonstrate, this allows for a probability matrix to be constructed
347
to calculate the likelihood that an isolate from each of the 25 species analysed in this study, would
348
be correctly classified within each model of its relevant taxonomic nodes. Using this approach,
349
species-level accuracy was increased from 90.40% to 96.17% for handheld bipolar probe REIMS and
350
from 89.87% to 94.03% for high-throughput REIMS. It has to be noted that hierarchical classification
351
systems have also been employed for species-level identification of bacterial mass spectrometry
352
fingerprints from MALDI-ToF based identification platforms 32.
353 354
Species Cross-Validation Using Random Forest Models
355
To this point, REIMS spectral fingerprints have been analysed using a combined PCA/LDA approach.
356
This approach is beneficial as it reduces the dimensionality of data sets and provides good predictive
357
power. However, it is more susceptible than other methods (e.g. Random Forest) to over training
358
and over fitting and has a poor ability to rank variables according to their discriminatory importance
359
33
360
was originally designed to handle high-dimensional, non-linear ecological data
361
considerable application to mass spectrometry data sets as well 35. Here. Random Forest constructs
362
decision making trees using mass spectral bins, at 0.1 Da, to construct classification models which
363
are internally validated through the splitting of samples into training and test data sets. Through the
364
construction of hundreds of decision making trees, the importance of individual variables is
365
determined; allowing for a final model using the most discriminatory variables to be constructed.
366
The classification accuracy of this model is then determined using leave one sample out cross-
367
validation.
. The Random Forest algorithm is a machine learning algorithm based on decision making trees and 34
, but has found
368 369
In the present study we used Random Forest to construct a total of 400 decision making trees to
370
assign species-level taxonomy to all 15 isolates from 25 species. Through this approach, we were
371
able to achieved species-level accuracy of 96.27% and 93.87% to handheld bipolar probe REIMS and
16 Environment ACS Paragon Plus
Page 17 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
372
monopolar high-throughput REIMS respectively. These results were comparable to those achieved
373
using taxonomy grouped models, albeit in substantially reduced computational time. The confusion
374
matrices for handheld bipolar probe REIMS, Figure 6a, and for high-throughput REIMS, Figure 6b,
375
show limited misclassification, predominately within the Enterobacteriaceae family and
376
Staphylococcus genus. Whilst the classification accuracies for some members of the Staphylococcus
377
genus and Enterobacteriaceae group are lower than those described using the Bruker Biotyper
378
system, further platform optimisations, targeting these taxonomic groups, will likely improve
379
classification accuracies substantially
380
throughput diagnostic environment due to its rapid computational ability. Table 1 gives classification
381
accuracy rates for each of the 25 microbial species analysed using both REIMS approaches using
382
three bioinformatic approaches.
36-38
. Random Forest may have particular utility in a high-
383 384
Intra-Sample Variation
385
To assess the intra-sample variability in handheld bipolar probe REIMS and monopolar high-
386
throughput REIMS analysis, we completed species-level cross-validation using each of the five
387
analysis points from each isolate for each REIMS approach. Results were compared against the
388
species-level cross-validation for the combined mass spectra for the five analysis points, as shown in
389
Supplementary Figure S6. Theoretically, the difference between the species-level accuracy of these
390
two approaches would be positively correlated to the level of intra-sample variation. Thus, the
391
greater the difference between the two cross-validation figures, the greater the level of intra-sample
392
variation. As shown in Supplementary Figure S7, minimal differences were observed between the
393
species-level accuracy of the cross-validation. Within the 50 to 2500 m/z range a difference of two
394
percentage points was observed for the handheld bipolar probe REIMS approach, and less than one
395
percentage point for the high-throughput REIMS approach. In regards to the 600 to 900 m/z range, a
396
difference of two percentage points was observed for handheld bipolar probe REIMS, and one
397
percentage point for high-throughput REIMS. Although not measured directly, the use of leucine
17 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
398
enkaphline as an external lock mass compound, allowing for correction of the mass drift associated
399
with time of flight MS, will minimise the extent of variation between analysis periods. In addition,
400
this work was completed over multiple analysis days, using two different instrument operators. As
401
evident with the reduced intra-sample variation of the high-throughput REIMS platform, the nature
402
of the analytical set up will also likely reduce any potential inter-sample variation introduced as a
403
result of different analysis day or operator.
404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427
18 Environment ACS Paragon Plus
Page 18 of 38
Page 19 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
428
19 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
429
Conclusions
430
Here, we have shown that the initial species-level identification and characterisation previously
431
reported by members of this group using bipolar REIMS technology can be comparably achieved
432
using an automated, high-throughput REIMS approach. Both bacterial and fungal isolates are
433
analysed directly from culture, without any prior preparative steps. The method described requires
434
minimal user input and can easily be integrated with automated clinical microbiology workflows.
435
Although not employed here, based on colony analysis time of 15 seconds, this high-throughput
436
workflows allows approximately 3,000 to 4,000 microbial colonies to be analysed over a 24 hour
437
period, including time required for instrument calibration, maintenance, and cleaning, with species-
438
level identifications given in near real-time. Although not directly comparable as the bioinformatic
439
pipeline for clinical microbiology laboratories is in development, this time period is substantially
440
lower than that required for commercial MALDI-ToF platforms, which take approximately six
441
minutes to analyse each isolate 3.
442 443
Here, we have discussed commercially available MS-based platforms which are currently used in
444
clinical microbiology laboratories, such as the Bruker BioTyper MALDI-ToF system. These references
445
were used to contextualise our REIMS based platform against familiar MS platforms. Although we
446
used the Bruker BioTyper platform for species level identification of microbial isolates, in line with
447
the standard operation procedures of our collaborating clinical laboratory, we have not drawn direct
448
comparisons between its identification accuracy and that of high-throughput REIMS. The Gram-stain,
449
genus, and species level classifications accuracies reported are used primarily as a tool to compare
450
the handheld and high-throughput REIMS approaches. They could however, be considered a
451
percentage conformity to the Bruker BioTyper platform. To determine the true comparable
452
classification accuracies of the two platforms, and other commercially available ones, a non-MS
453
species identification method, such as 16S/18S/ITS rRNA gene sequencing would be required; which
454
was not conducted.
20 Environment ACS Paragon Plus
Page 20 of 38
Page 21 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
455 456
Although we have focussed on the lipid region of the mass spectra, additional microbial metabolites
457
identified using the REIMS procedure could have utility in defining phenotypic features including
458
virulence and antibiotic resistance. Furthermore, work is currently underway to extend high-
459
throughput REIMS analysis from pure microbial cultures to analysis of mixed microbial cultures and
460
communities, and complex human bio-fluids including urine and blood using species-specific
461
features for identifications.
462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480
21 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
481
22 Environment ACS Paragon Plus
Page 22 of 38
Page 23 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
482
Supporting Information
483
Detailed information regarding bacterial and fungal species analysed in this study, and on additional
484
data analyses completed as referenced within this text is given within the supporting information
485
accompanying this manuscript. This material is available free of charge via the Internet at
486
http://pubs.acs.org.
487 488
Author Contributions
489
Study was planned by ZT, FB, and SJSC. Experiments were conducted by AB, TR, KH, FB, and SJSC.
490
Data was analysed and interpreted by SJSC, FB, ZT, and APM. Technical assistance and method
491
optimisation was provided by ZB, MR, JB, TK, DS, RS, and ZT. The manuscript was written by FB, SJSC,
492
and ZT with input from all authors. All authors have given approval to the final version of the
493
manuscript.
494 495
Conflict of Interest Statement
496
This work was funded by a Biotechnology and Biological Sciences Research Council (BBSRC) grant
497
(reference number BB/L020858/1). Additional support, both technical and financial, was provided by
498
Waters Corporation. TK, DS, RS, and JB are employed by the Waters Corporation. ZT consults for
499
Waters Corporation. The work detailed in this manuscript does not promote any available
500
commercial product from Waters Corporation.
501 502
Acknowledgments
503
The authors would like to thank Imperial College Healthcare Trust for providing isolates and their
504
technical input. We thank the anonymous reviewers for their constructive comments which have
505
made this a better manuscript.
506 507
23 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
508 509
References
510 511
(1) Clark, A. E.; Kaleta, E. J.; Arora, A.; Wolk, D. M. Clin. Microbiol. Rev. 2013, 26, 547-603.
512
(2) Tan, K. E.; Ellis, B. C.; Lee, R.; Stamper, P. D.; Zhang, S. X.; Carroll, K. C. J. Clin. Microbiol. 2012, 50,
513
3301-3308.
514
(3) Seng, P.; Drancourt, M.; Gouriet, F.; La Scola, B.; Fournier, P. E.; Rolain, J. M.; Raoult, D. Clin.
515
Infect. Dis. 2009, 49, 543-551.
516
(4) Cassagne, C.; Cella, A.-L.; Suchon, P.; Normand, A.-C.; Ranque, S.; Piarroux, R. Med. Mycol. 2013,
517
51, 371-377.
518
(5) Strittmatter, N.; Jones, E. A.; Veselkov, K. A.; Rebec, M.; Bundy, J. G.; Takats, Z. Chem. Commun.
519
2013, 49, 6188-6190.
520
(6) Strittmatter, N.; Rebec, M.; Jones, E. A.; Golf, O.; Abdolrasouli, A.; Balog, J.; Behrends, V.;
521
Veselkov, K. A.; Takats, Z. Anal. Chem. 2014, 86, 6555-6562.
522
(7) Ryzhov, V.; Fenselau, C. Anal. Chem. 2001, 73, 746-750.
523
(8) Purcaro, G.; Tranchida, P. Q.; Dugo, P.; Camera, E. L.; Bisignano, G.; Conte, L.; Mondello, L. J. Sep.
524
Sci. 2010, 33, 2334-2340.
525
(9) Buyer, J. S. J. Microbiol. Methods 2006, 67, 187-190.
526
(10) Abel, K.; deSchmertzing, H.; Peterson, J. I. J. Bacteriol. 1963, 85, 1039-1044.
527
(11) Miller, L. T. J. Clin. Microbiol. 1982, 16, 584-586.
528
(12) Welch, D. F. Clin. Microbiol. Rev. 1991, 4, 422-438.
529
(13) DeLuca, S.; Sarver, E. W.; Harrington, P. d. B.; Voorhees, K. J. Anal. Chem. 1990, 62, 1465-1472.
530
(14) Anhalt, J. P.; Fenselau, C. Anal. Chem. 1975, 47, 219-225.
531
(15) Basile, F.; Voorhees, K. J.; Hadfield, T. L. Appl. Environ. Microbiol. 1995, 61, 1534-1539.
532
(16) Basile, F.; Beverly, M. B.; Voorhees, K. J.; Hadfield, T. L. TrAC, Trends Anal. Chem. 1998, 17, 95-
533
109.
24 Environment ACS Paragon Plus
Page 24 of 38
Page 25 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
534
(17) Heller, D. N.; Cotter, R. J.; Fenselau, C.; Uy, O. M. Anal. Chem. 1987, 59, 2806-2809.
535
(18) Song, Y.; Talaty, N.; Tao, W. A.; Pan, Z.; Cooks, R. G. Chem. Commun. 2007, 61-63.
536
(19) Bourbeau, P. P.; Ledeboer, N. A. J. Clin. Microbiol. 2013, 51, 1658-1665.
537
(20) Mutters, N. T.; Hodiamont, C. J.; de Jong, M. D.; Overmeijer, H. P.; van den Boogaard, M.; Visser,
538
C. E. Ann. Lab. Med. 2014, 34, 111-117.
539
(21) Greub, G.; Prod’Hom, G. Clin. Microbiol. Infect. 2011, 17, 655-660.
540
(22) Novak, S. M.; Marlowe, E. M. Clin. Lab. Med. 2013, 33, 567-588.
541
(23) Wold, S.; Esbensen, K.; Geladi, P. Chemom. Intell. Lab. Syst. 1987, 2, 37-52.
542
(24) Fisher, R. A. Ann. Hum. Genet. 1938, 8, 376-386.
543
(25) Mahalanobis, P. C. Proc. Natl. Inst. Sci. India 1936, 2, 49-55.
544
(26) Balog, J.; Sasi-Szabó, L.; Kinross, J.; Lewis, M. R.; Muirhead, L. J.; Veselkov, K.; Mirnezami, R.;
545
Dezső, B.; Damjanovich, L.; Darzi, A. Sci. Transl. Med. 2013, 5, 194ra193-194ra193.
546
(27) Pulfer, M.; Murphy, R. C. Mass Spectrom. Rev. 2003, 22, 332-364.
547
(28) Gibson, B.; Melaugh, W.; Phillips, N. J.; Apicella, M.; Campagnari, A.; Griffiss, J. J. Bacteriol. 1993,
548
175, 2702-2712.
549
(29) Komagata, K.; Suzuki, K.-I. Methods Microbiol. 1987, 19, 162-207.
550
(30) Lüderitz, O.; Galanos, C.; Lehmann, V.; Nurminen, M.; Rietschel, E. T.; Rosenfelder, G.; Simon,
551
M.; Westphal, O. J. Infect. Dis. 1973, 128, S17-S29.
552
(31) van Baar, B. L. FEMS Microbiol. Rev. 2000, 24, 193-219.
553
(32) Simmuteit, S.; Schleif, F.-M.; Villmann, T.; Hammer, B. Knowl. Inf. Syst. 2010, 25, 327-343.
554
(33) Gromski, P. S.; Muhamadali, H.; Ellis, D. I.; Xu, Y.; Correa, E.; Turner, M. L.; Goodacre, R. Anal.
555
Chim. Acta 2015, 879, 10-23.
556
(34) Cutler, D. R.; Edwards Jr, T. C.; Beard, K. H.; Cutler, A.; Hess, K. T.; Gibson, J.; Lawler, J. J. Ecology
557
2007, 88, 2783-2792.
558
(35) Patterson, A. D.; Bonzo, J. A.; Li, F.; Krausz, K. W.; Eichler, G. S.; Aslam, S.; Tigno, X.; Weinstein, J.
559
N.; Hansen, B. C.; Idle, J. R. J. Biol. Chem. 2011, 286, 19511-19522.
25 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
560
(36) Dubois, D.; Leyssene, D.; Chacornac, J. P.; Kostrzewa, M.; Schmit, P. O.; Talon, R.; Bonnet, R.;
561
Delmas, J. J. Clin. Microbiol. 2010, 48, 941-945.
562
(37) Saffert, R. T.; Cunningham, S. A.; Ihde, S. M.; Jobe, K. E. M.; Mandrekar, J.; Patel, R. J. Clin.
563
Microbiol. 2011, 49, 887-892.
564
(38) Zhu, W.; Sieradzki, K.; Albrecht, V.; McAllister, S.; Lin, W.; Stuchlik, O.; Limbago, B.; Pohl, J.;
565
Rasheed, J. K. J. Microbiol. Methods 2015, 117, 14-17.
566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589
26 Environment ACS Paragon Plus
Page 26 of 38
Page 27 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
590
Analytical Chemistry
Figure Captions
591 592
Figure 1. Customised TECAN Freedom EVO automated sample handling machine and sampling
593
probe
594
A customised TECAN Freedom EVO automated sample handling machine is contained within a class
595
2 microbiological safety cabinet. Before use, agar culture plates to be placed into a plate rack (a),
596
sampling probes to be loaded into a 96 rack (b). Plates are then moved from the plate rack to the
597
Pickolo visualisation platform (c), which also acts as the return electrode, where the user or the
598
automated system can select colonies for analysis. A technical drawing of the stainless steel
599
sampling probe is given in (d).
600 601
Figure 2. Temperature gradient differences between REIMS approaches
602
The handheld bipolar probe (a) employed in this study generates a radiofrequency electrical current
603
between two probes, and thus has a lower, but more even, heating temperature throughout the
604
microbial biomass. However, the monopolar probe used in high-throughput REIMS (b) generates a
605
higher, localised heating temperature at the point of contact between probe and microbial biomass.
606
Counter electrode shown by grey bar at base of microbial biomass. Colour gradient of microbial
607
biomass indicates heating temperature, with red showing highest temperatures and green lowest
608
temperatures. The mass spectra beside each representative diagram is of the 600 to 900 m/z range
609
of the same Lactobacillus jensenii isolate after background removal for each REIMS approach. Figure
610
is not constructed to scale.
611 612
Figure 3. Gram-stain, genus, and species-level cross-validation accuracy for both REIMS
613
approaches
614
Cross-validation using a PCA/LDA model over the 50 to 2500 m/z range with a mass bin of 1 gives
615
similar Gram-stain level accuracy for both handheld bipolar probe REIMS and high-throughput
27 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
616
REIMS. However, at genus and species-level accuracy, handheld bipolar probe REIMS gives slightly
617
higher accuracy levels, by three and two percentage points respectively.
618 619
Figure 4. Species-level classification accuracy using restricted mass ranges
620
A total of eight PCA/LDA models were constructed using a mass bin of 1 for both handheld bipolar
621
probe REIMS and high-throughput REIMS using restricted mass ranges. For handheld bipolar probe
622
REIMS, the highest species-level accuracy was achieved through using the 500 to 1500 m/z range
623
and 600 to 900 m/z range, whilst for high-throughput REIMS the highest accuracy was achieved with
624
the 600 to 900 m/z range.
625 626
Figure 5. Effect of mass bin size on species-level accuracy
627
Species-level accuracy was measured through cross-validation of PCA/LDA models constructed using
628
the 600 to 900 m/z range for mass bins of 0.01, 0.05, 0.1, 0.5, 1.0, and 5.0. For both handheld bipolar
629
probe REIMS and high-throughput REIMS, minimal differences were evident between different mass
630
bin sizes. To achieve a balance between good species-level accuracy and reduced computational
631
time for further analyses, a mass bin size of 1 was subsequently used.
632 633
Figure 6. Species-Level confusion matrices from Random Forest analysis
634
Random Forest analysis was conducted on (a) handheld bipolar probe REIMS and (b) high-
635
throughput REIMS datasets within the 600 to 900 m/z range, using a mass bin of 0.1. Random Forest
636
models were constructed using 400 decision making trees. Confusion matrices suggest a marginally
637
higher level of misclassification using the high-throughput REIMS approach than with handheld
638
bipolar probe REIMS, with misclassifications occurring predominately within the Enterobacteriaceae
639
family. Colour scale gradient indicates strength of species-level accuracy with darker gradients
640
showing higher species-level accuracy. Abbreviations used within the True Class and Predicted Class
641
labels are given alongside full species names in Table 1.
28 Environment ACS Paragon Plus
Page 28 of 38
Page 29 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
642 643
Table 1. Individual species-level accuracy for all species, taxonomic groupings, and Random Forest
644
models
645
The species-level accuracy for each of the 25 microbial species analysed using both REIMS approach
646
is given for each of the three bioinformatic data analysis methods used in this study. For handheld
647
bipolar probe REIMS, the highest overall species-level accuracy was achieved using the Random
648
Forest approach, whilst for high-throughput REIMS the highest was achieved using the taxonomic
649
groupings approach. Although, in both cases, the differences between the taxonomic groupings and
650
Random Forest approaches is less than 0.2 percentage points.
651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670
29 Environment ACS Paragon Plus
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
671
30 Environment ACS Paragon Plus
Page 30 of 38
Page 31 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
672
Analytical Chemistry
Table 1 Handheld Bipolar Probe REIMS (% Accuracy)
High-Throughput REIMS (% Accuracy)
Isolate Number
All Species
Taxonomic Groupings
Random Forest
All Species
Taxonomic Groupings
Random Forest
Average
∑ 375
90.4
96.2
96.3
89.9
94.0
93.9
Candida albicans (CALB)
15
100.0
100.0
100.0
100.0
100.0
100.0
Candida parapsilosis (CALP)
15
100.0
100.0
100.0
93.3
100.0
100.0
Corynebacterium amycolatum (CAMY)
15
100.0
100.0
100.0
100.0
97.0
100.0
Micrococcus luteus (MLUT)
15
100.0
100.0
100.0
86.7
97.0
100.0
Enterococcus faecalis (EFAC)
15
60.0
91.2
100.0
86.7
77.6
87.0
Enterococcus faecium (EFAM)
15
66.7
91.2
83.0
86.7
84.4
80.0
Lactobacillus jensenii (LACJ)
15
86.7
98.0
100.0
100.0
93.0
86.0
Streptococcus agalactiae(SAGA)
15
93.3
91.2
100.0
93.3
86.5
87.0
Streptococcus pneumoniae (SPNE)
15
100.0
98.0
97.0
93.3
93.0
97.0
Streptococcus pyogenes (SPYO)
15
100.0
98.0
97.0
93.3
93.0
93.0
Staphylococcus aureus (SAUR)
15
100.0
98.0
93.0
93.3
93.0
85.0
Staphylococcus epidermidis (SEPI)
15
80.0
91.2
100.0
73.3
100.0
97.0
Staphylococcus haemolyticus (SHAM)
15
73.3
78.4
100.0
73.3
80.0
100.0
Staphylococcus hominis (SHOM)
15
86.7
91.2
97.0
73.3
100.0
97.0
Clostridium difficile (CDIF)
15
100.0
99.0
100.0
100.0
100.0
100.0
Enterobacter cloacae (ECLO)
15
100.0
99.0
93.0
86.7
92.1
100.0
Escherichia coli (ECOL)
15
46.7
92.1
97.0
60.0
79.2
93.0
Klebsiella oxytoca (KOXY)
15
100.0
99.0
97.0
86.7
99.0
97.0
Klebsiella pneumoniae (KPNE)
15
86.7
99.0
80.0
86.7
99.0
84.0 90.0
Morganella morganii (MMORG)
15
86.7
99.0
97.0
93.3
99.0
Proteus mirabilis (PMIR)
15
100.0
92.1
87.0
93.3
99.0
83.0
Serratia marcescens(SMAR)
15
93.3
99.0
100.0
93.3
92.1
100.0 100.0
Haemophilus influenza (HINF)
15
100.0
100.0
100.0
100.0
99.0
Pseudomonas aeruginosa (PAER)
15
100.0
100.0
93.0
100.0
99.0
93.0
Stenotrophomonas maltophilia (SMAL)
15
100.0
100.0
97.0
100.0
99.0
97.0
673
31 ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
674
Table of Contents Graphic (Actual Size)
675 676
32 Environment ACS Paragon Plus
Page 32 of 38
Page 33 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
Figure 1. Customised TECAN Freedom EVO automated sample handling machine and sampling probe. A customised TECAN Freedom EVO automated sample handling machine is contained within a class 2 microbiological safety cabinet. Before use, agar culture plates to be placed into a plate rack (a), sampling probes to be loaded into a 96 rack (b). Plates are then moved from the plate rack to the Pickolo visualisation platform (c), which also acts as the return electrode, where the user or the automated system can select colonies for analysis. A technical drawing of the stainless steel sampling probe is given in (d). 525x500mm (150 x 150 DPI)
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 2. Temperature gradient differences between REIMS approaches. The handheld bipolar probe (a) employed in this study generates a radiofrequency electrical current between two probes, and thus has a lower, but more even, heating temperature throughout the microbial biomass. However, the monopolar probe used in high-throughput REIMS (b) generates a higher, localised heating temperature at the point of contact between probe and microbial biomass. Counter electrode shown by grey bar at base of microbial biomass. Colour gradient of microbial biomass indicates heating temperature, with red showing highest temperatures and green lowest temperatures. The mass spectra beside each representative diagram is of the 600 to 900 m/z range of the same Lactobacillus jensenii isolate after background removal for each REIMS approach. Figure is not constructed to scale. 1109x499mm (115 x 115 DPI)
ACS Paragon Plus Environment
Page 34 of 38
Page 35 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
Figure 3. Gram-stain, genus, and species-level cross-validation accuracy for both REIMS approaches. Crossvalidation using a PCA/LDA model over the 50 to 2500 m/z range with a mass bin of 1 gives similar Gramstain level accuracy for both handheld bipolar probe REIMS and high-throughput REIMS. However, at genus and species-level accuracy, handheld bipolar probe REIMS gives slightly higher accuracy levels, by three and two percentage points respectively. 397x500mm (150 x 150 DPI)
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 4. Species-level classification accuracy using restricted mass ranges. A total of eight PCA/LDA models were constructed using a mass bin of 1 for both handheld bipolar probe REIMS and high-throughput REIMS using restricted mass ranges. For handheld bipolar probe REIMS, the highest species-level accuracy was achieved through using the 500 to 1500 m/z range and 600 to 900 m/z range, whilst for high-throughput REIMS the highest accuracy was achieved with the 600 to 900 m/z range.
842x500mm (150 x 150 DPI)
ACS Paragon Plus Environment
Page 36 of 38
Page 37 of 38
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
Figure 5. Effect of mass bin size on species-level accuracy. Species-level accuracy was measured through cross-validation of PCA/LDA models constructed using the 600 to 900 m/z range for mass bins of 0.01, 0.05, 0.1, 0.5, 1.0, and 5.0. For both handheld bipolar probe REIMS and high-throughput REIMS, minimal differences were evident between different mass bin sizes. To achieve a balance between good species-level accuracy and reduced computational time for further analyses, a mass bin size of 1 was subsequently used. 871x500mm (150 x 150 DPI)
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 6. Species-Level confusion matrices from Random Forest analysis. Random Forest analysis was conducted on (a) handheld bipolar probe REIMS and (b) high-throughput REIMS datasets within the 600 to 900 m/z range, using a mass bin of 0.1. Random Forest models were constructed using 400 decision making trees. Confusion matrices suggest a marginally higher level of misclassification using the high-throughput REIMS approach than with handheld bipolar probe REIMS, with misclassifications occurring predominately within the Enterobacteriaceae family. Colour scale gradient indicates strength of species-level accuracy with darker gradients showing higher species-level accuracy. Abbreviations used within the True Class and Predicted Class labels are given alongside full species names in Table 1. 312x155mm (150 x 150 DPI)
ACS Paragon Plus Environment
Page 38 of 38