Subscriber access provided by UNIV OF NEWCASTLE
Article
Food Targeting: Geographical origin determination of hazelnuts (Corylus avellana) by LC-QqQ-MS/MS based targeted metabolomics application Sven Klockmann, Eva Reiner, Nicolas Cain, and Markus Fischer J. Agric. Food Chem., Just Accepted Manuscript • DOI: 10.1021/acs.jafc.6b05007 • Publication Date (Web): 09 Jan 2017 Downloaded from http://pubs.acs.org on January 10, 2017
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
Journal of Agricultural and Food Chemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 36
Journal of Agricultural and Food Chemistry
1
Food Targeting: Geographical Origin Determination of Hazelnuts (Corylus
2
avellana) by LC-QqQ-MS/MS Based Targeted Metabolomics Application
3 4
Sven Klockmann, Eva Reiner, Nicolas Cain, and Markus Fischer*
5 6
HAMBURG SCHOOL OF FOOD SCIENCE; Institute of Food Chemistry, University of
7
Hamburg, Grindelallee 117, 20146 Hamburg, Germany, *Corresponding author: Tel.: +49-
8
40-428384357; Fax: +49-40-428384342; E-Mail:
[email protected] 9
1 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 2 of 36
10
ABSTRACT
11
A targeted metabolomics LC-ESI-QqQ-MS application for geographical origin discrimination
12
based on 20 non-polar key metabolites was developed, validated according to accepted
13
guidelines and used for quantitation via stable isotope labeled internal standards in 202 raw
14
authentic hazelnut samples from six countries (Turkey, Italy, Georgia, Spain, France and
15
Germany) out of harvest years 2014 and 2015. Multivariate statistics were used for detection
16
of significant variations in metabolite levels between countries and moreover, a prediction
17
model using support vector machine classification (SVM) was calculated yielding 100%
18
training accuracy and 97% cross-validation accuracy which was subsequently applied to 55
19
hazelnut samples for confectionary industry gaining up to 80% correct classifications
20
compared to declared origin. The present method demonstrates the great suitability for
21
targeted metabolomics applications in geographical origin determination of hazelnuts and
22
their applicability in routine analytics.
23 24
KEYWORDS
25
Targeted Metabolomics, Triple quadrupole, Hazelnut, Corylus avellana, Geographical origin,
26
Chemometrics, Metabolic profiling, Lipids
2 ACS Paragon Plus Environment
Page 3 of 36
Journal of Agricultural and Food Chemistry
27
INTRODUCTION
28
While 64% of the hazelnut (Corylus avellana L.) world crop (800,000 tons p.a.) is provided
29
by Turkey, Italy represents the second largest producer (13%) followed by the United States,
30
Georgia, Azerbaijan, Spain, France, Iran and China (less than 5%).1 Only 10% are dedicated
31
to direct consumption (mostly as in-shell nuts) but 90% of the crop is processed by food
32
industry, represented by chocolate, confectionary and bakery industry (predominantly in their
33
shelled and roasted form). With regard to demands of manufacturing industries, the quality of
34
nuts essentially depends on its shape and size, but also its chemical composition, represented
35
by aroma profile or blanching character.2,3 There are numerous varieties cultivated in
36
commercial orchards, each with its own characteristics and demands. However, besides the
37
genotype, cultural techniques, postharvest management and especially the geographical
38
location (soil, sun exposure, rainfall, temperature, height above sea level, etc.) of the hazelnut
39
plant affects the yield as well as morphological, physical and chemical characteristics and
40
thus, the ‘quality’ of hazelnuts.4-6 Consequently, sales prices may vary depending on - despite
41
of the harvest year - the geographical origin of products with highest values of Italian nuts
42
gaining 5,207 USD/t (with shell) in 2014, whereas nuts from Turkey had a 18% lower value,
43
followed by USA (-24%), Georgia (-31%) and Azerbaijan (-49%).1 In general, high price
44
differences are a potential incentive to food fraud for profit enhancement. Since there are no
45
existing methods in routine analytics for origin authentication of hazelnuts, no information
46
about the magnitude of deliberate or accidental false declarations are currently available. To
47
overcome this circumstance, this paper presents a targeted metabolomics approach for the
48
differentiation of hazelnuts from distinct countries based on 20 previously identified marker
49
substances. In this previous study, these key metabolites were determined in a non-targeted
50
approach using UPLC-ESI-QTOF-MS analysis of hazelnut samples from harvest years 2014
51
and 2015. Different prediction models were created and evaluated showing best results using
52
support vector machine classification (SVM) in combination with soft independent modelling 3 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 4 of 36
53
of class analogy (SIMCA). 100% of the training samples and 80% of the prediction samples
54
(hazelnuts from confectionary industry) could be predicted correct at no false positives.7
55
However, the following development of a targeted metabolomics application for absolute
56
quantitation purposes was conducted by LC-QqQ-MS instead. Triple quadrupole mass
57
spectrometry (QqQ-MS) provides greater linear dynamic range, higher precision, less matrix
58
interferences and better robustness compared to LC-QTOF-MS. Furthermore, it is one of the
59
most popular instruments for food-quality and -safety analysis and nowadays commonly
60
applied in most laboratories of industrial quality control and governmental food control,
61
because it offers high sensitivity, selectivity and specificity for identification and shows good
62
quantitation capabilities for the analysis of complex food samples when operated in multiple-
63
reaction monitoring (MRM) mode.8-10 This scanning technique has a unique capability for
64
simultaneous analysis of large number of compounds in complex mixtures and is able to
65
reduce chemical noise and increase selectivity and sensitivity by selection of specific mass
66
transitions from precursor to fragment ions.11,12 The obtained mini-fingerprints are
67
representative for each group of issue to be examined and thus, used for comparing and
68
differentiating them by use of commonly applied multivariate statistics such as analysis of
69
variances (ANOVA), principal component analysis (PCA), soft independent modeling of
70
class analogy (SIMCA), cluster analysis, linear discriminant analysis (LDA) or support vector
71
machine classification (SVM).6,13-15
72
Present studies concerning metabolomics based authentication of the geographical origin of
73
foods have been extensively reviewed by several authors, mostly dealing with NMR, IR,
74
stable isotope analysis or GC-MS as detection techniques.6,14,16,17 Furthermore, in this context,
75
current LC-MS approaches predominantly make use of non-targeted high resolution LC-ESI-
76
QTOF-MS applications while targeted approaches are still rare.13,18-20 Some studies used
77
metabolic profiling for investigating the geographic differences of certain substance classes,
78
like steviol glycosides in stevia and polyphenols in red wine or apple juices.21-23 Accordingly, 4 ACS Paragon Plus Environment
Page 5 of 36
Journal of Agricultural and Food Chemistry
79
the development of a target-oriented LC-ESI-QqQ-MS method for geographical origin
80
discrimination by only analyzing 20 previously identified key metabolites is unique. Besides
81
the already highlighted analytical benefits, the motivation for switching the analytical method
82
from non-targeted QTOF to targeted QqQ were the comparatively low costs and the widely
83
dissemination of such devices in modern laboratories, giving the great opportunity for a
84
consequent implementation of developed methods into routine analysis. Constantly growing
85
databases would ensure consideration of annual variation of hazelnut metabolome, caused by
86
mutable factors like climate, and improve statistical models by consecutive updating.
87
Otherwise, extraordinary phenomena such as extreme cold and rainy or hot and dry summers
88
may lead to misinterpretations caused by metabolic changes.
89 90
MATERIALS AND METHODS
91
Reagents and chemicals
92
Ultrapure water was obtained by purifying demineralized water in a Direct-Q® 3 UV-R
93
system (Merck Millipore, Darmstadt, Germany). LC-MS grade isopropanol, methanol, HPLC
94
grade chloroform, ammonium formiate ˃95%, formic acid >99% and butylated
95
hydroxytoluene (BHT) ≥99.8% were purchased from Carl Roth (Karlsruhe, Germany).
96
The reference standards 1,2-dilinoleoyl-sn-glycero-3-phosphocholine (PC(18:2/18:2)), 1-
97
palmitoyl-2-linoleoyl-sn-glycero-3-phosphocholine
(PC(16:0/18:2)),
98
glycero-3-phosphoethanolamine
and
99
(DG(16:0/18:1)) were acquired from Avanti Polar Lipids (Alabaster, AL, USA), 1,3-
(PE(18:2/18:2))
(DG(18:2/18:2)),
1,2-dilinoleoyl-sn-
1-palmitoyl-2-oleoyl-sn-glycerol
100
dilinoleoyl-rac-glycerol
1-palmitoyl-2-linoleoyl-rac-glycerol-3-
101
phosphoethanolamine (PE(16:0/18:2)) and 1-palmitoyl-2-oleoyl-3-linoleoyl-rac-glycerol
102
(TG(16:0/18:1/18:2)) from Cayman Chemical (Ann Arbor, MI, USA), 1,2-dioleoyl-sn-
103
glycerol (DG(18:1/18:1), 1,2-dioleoyl-sn-phosphoethanolamine (PE(18:1/18:1) and γ-
104
tocopherol from Sigma-Aldrich (Munich, Germany), 1,2-dioleoyl-3-palmitoyl-rac-glycerol 5 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 6 of 36
105
(TG(16:0/18:1/18:1)) from Toronto Research Chemicals (Toronto, ON, Canada) and 1-
106
linoleoyl-2-hydoxy-sn-glycero-3-phosphocholine (PC(18:2/0:0)) from Echelon Biosciences
107
(Salt Lake City, UT, USA).
108
The stable isotope labeled internal standards (IS), 1-pentadecanoyl-2-oleoyl(d7)-sn-glycero-3-
109
phosphocholine
110
phosphoethanolamine
111
phosphocholine
112
(DG(15:0/18:1(d7))) and 1,3-dipentadecanoyl-2-oleyol(d7)-glycerol (TG(15:0/18:1(d7)/15:0))
113
were obtained from Avanti Polar Lipids (Alabaster, AL, USA).
(PC(15:0/18:1(d7))), (PE(15:0/18:1(d7))), (PC(18:1(d7)/0:0)),
1-pentadecanoyl-2-oleoyl(d7)-sn-glycero-31-oleoyl(d7)-2-hydoxy-sn-glycero-31-pentadecanoyl-2-oleyol(d7)-sn-glycerol
114 115
Standard solutions and calibration
116
For internal standard calibration all five stable isotope labeled lipids were pooled to one mix-
117
solution containing 302.48 µM PC(18:1d7/0:0), 272.00 µM DG(15:0/18:1d7), 229.87 µM
118
PE(15:0/18:1d7), 201.19 µM TG(15:0/18:1d7/15:0) and 212.36 µM PC(15:0/18:1d7).
119
Standard solutions of individual lipids were prepared by dissolving the respective compound
120
separately in methanol/chloroform to a stock solution of approx. 10 mM. Validation solutions
121
were created by mixing all stock solutions and diluting to the following concentrations in µM
122
with methanol: 0.02, 0.04, 0.06, 0.08, 0.1, 0.2, 0.4, 0.6, 0.8, 1, 2, 4, 6, 8, 10, 20, 40, 60, 80,
123
100 (for DG(18:1/18:1) and DG(18:2/18:2) additionally 200, 400, 600, 800, 1000), equally
124
performed with internal standard mix-solution. Validation was realized either in absence (base
125
calibration) or presence (matrix calibration) of a representative hazelnut extract as matrix
126
(50% of the resulting standard solution, respectively). 10 blank samples containing only pure
127
extraction solution were created for determination of the limit of detection (LOD).
128 129
Hazelnut samples
6 ACS Paragon Plus Environment
Page 7 of 36
Journal of Agricultural and Food Chemistry
130
Overall 202 authentic raw hazelnut samples of different varieties, origins and producers from
131
harvest years 2014 (107) and 2015 (95) were obtained for analyses. The samples were
132
harvested in the respective commercial relevant regions of each country, represented by 115
133
French (mainly Midi-Pyrénées and Aquitaine), 35 German (mainly in Bavaria), 22 Italian
134
(Piedmont, Campania and Lazio), 14 Turkish (Ordu, Akçakoca and Samsun), ten Georgian
135
(Guria, Samegrelo and Imereti) and six Spanish (Tarragona) samples. In addition, 50 samples
136
of raw hazelnut kernels for confectionary industry (non-authentic samples) were provided by
137
industrial partners, including one Spanish, one French, six Georgian, nine Italian and 33
138
Turkish samples. Furthermore, five samples of roasted and blanched hazelnut kernels that
139
would have been used for production of confectionary products were obtained (four Turkish
140
and one Italian). Each sample comprises either 1000 grams hazelnut kernels with skin (testa)
141
or 1500 grams unshelled hazelnuts.
142 143
Sample treatment
144
Harvest and postharvest processing was executed by suppliers under realistic conditions
145
following common commercial practices in drying, storage, and sometimes cracking
146
conditions as well as time of harvest. Authentic hazelnut samples were shipped as quickly as
147
possible under common storage conditions after the crop. For non-authentic samples no
148
information about postharvest storage time could be obtained. All hazelnut samples were
149
equally handled during all analytical processes being stored at -40 °C in the unprocessed state
150
and at -20 °C as lyophilized powder or extracts. Only hazelnut kernels with skin were used for
151
further procedure (except of roasted ones), those samples obtained in-shell were previously
152
cracked manually. After freezing with liquid nitrogen hazelnut kernels were ground in
153
combination with dry ice at a ratio of 1:1 using a Grindomix GM 300 knife mill equipped
154
with a stainless steel grinding container and a full metal knife (Retsch, Haan, Germany). A
155
representative aliquot of the grist was freeze-dried using a Gamma 1-20 freeze-dryer (Martin 7 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 8 of 36
156
Christ Gefriertrocknungsanlagen, Osterode am Harz, Germany). 50 mg of each lyophilisate
157
was used for extraction adding 10 µL internal standard mixture, 990 µL extraction solvent
158
(chloroform/isopropanol 1/2 with 0.1% butylated hydroxytoluene) and two steel balls
159
followed by ball milling for 3 min at 3 m/s using a Bead Ruptor 24 equipped with a 1.5 mL
160
microtube carriage kit (Biolabproducts, Bebensee, Germany). The remaining suspension was
161
centrifuged for 10 min at 16,000 x g at 4 °C and the supernatant was used for LC-MS analysis
162
after filtration with a Rotilabo® PTFE syringe filter, 0.45 µm pore diameter (Carl Roth,
163
Karlsruhe, Germany).
164 165
HPLC-ESI-QqQ-MS/MS acquisition
166
Chromatographic separation was carried out by a 50 mm x 4.6 mm i.d., 2.7 µm, Poroshell 120
167
EC-C18 column (Agilent Technologies, Waldbronn, Germany) at 30 °C with a flow rate of
168
500 µL/min and elution solvents A water and B isopropanol, both containing 5 mM
169
ammonium formiate buffer at pH 3.5 using an Agilent 1260 Infinity Quaternary LC System
170
(Agilent Technologies, Waldbronn, Germany). The gradient elution started with 85% B for
171
2 min, linearly increased to 100% in 3 min and kept for 4 min, brought back to 85% in
172
0.5 min followed by 4 min of re-equilibration. Injection volume was set to 3 µL. For detection
173
a triple quadrupole-MS/MS API 2000 (Applied Biosystems, Darmstadt, Germany) equipped
174
with a turbo ion spray source was used with the following mass spectrometer settings in
175
positive ion mode: ion spray voltage = 5500 V; temperature = 450 °C; ion source
176
gas 1 = 30 psi; ion source gas 2 = 70 psi; curtain gas = 20 psi; collision gas = 5 psi; ion spray
177
probe vertical position = 3; ion spray probe horizontal position = 5. The MS acquisition was
178
divided into two periods (period 1 = 5.5 min; period 2 = 8.0 min) analyzing only compounds
179
eluting in the respective period to increase dwell-times at consistent cycle times for better
180
sensitivity and more data points. For each compound one mass transition for quantitation
181
purposes (quantifier) and one for additional identification (qualifier) was acquired. Dwell8 ACS Paragon Plus Environment
Page 9 of 36
Journal of Agricultural and Food Chemistry
182
time was set to 60 ms for quantifier in period 1, 100 ms for quantifier in period 2 and 20 ms
183
for all qualifiers. To optimize data acquisition parameters for multiple reaction monitoring
184
(MRM) ideal compound-dependent device voltages for each mass transition were determined
185
via automatic compound optimization using Analyst® Software (AB Sciex, version 1.6,
186
Foster City, CA, USA) injecting a 10 µM solution of each compound directly into the source
187
with a flow of 10 µL/min. For those metabolites no standard substance could be purchased the
188
acquisition parameters were extrapolated taking related structures as a basis. Various
189
transitions with different parameters were tested and quantifier and qualifier finally selected
190
according to highest signals achieved. To prevent bias due to instrumental drifts samples were
191
injected in a random order followed by a blank sample (pure extraction solvent) every 12
192
injections. Samples were acquired in triplicate analysis, each one in a separate batch.
193
Carryover effects were minimized by applying a needle wash step with methanol after each
194
injection.
195 196
Method validation
197
Linearity, lower limit of quantitation (LLOQ), precision and accuracy were assessed in
198
analogy to accepted guidelines.24 The limit of detection (LOD) was determined according to
199
DIN 32645.
200
Linearity was determined for all commercially available standards using a twenty-point (resp.
201
twenty-five-point for DG(18:1/18:1) and DG(18:2/18:2)) calibration curve (n = 5) with OLS
202
regression. Homoscedasticity was confirmed for all samples using Breusch-Pagan test and
203
linear range was calculated by means of Mandel’s fitting test.25,26 Furthermore, the LLOQ has
204
been set as the lowest standard on the calibration curve as it met the following conditions: The
205
analyte response was at least five times the response compared to a blank response and the
206
analyte peak is identifiable, discrete, and reproducible, and the back-calculated concentration
9 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 10 of 36
207
should have precision that does not exceed 20% of the coefficient of variation CV and
208
accuracy within 20% of the nominal concentration.
209
Precision and accuracy were obtained determining the CV, resp. the back-calculated
210
concentration of every analyte by replicate analysis of calibration standards (n = 5) at three
211
concentration levels (smallest, highest and medium concentration of the determined linear
212
range). The mean value for precision and accuracy should be within 15% of the nominal value
213
except at LLOQ, where it should not deviate by more than 20%.
214
LOD was calculated based on the standard deviation of the response and the slope of the
215
calibration curve by replicate analysis of ten independent blank samples (pure extraction
216
solvent).
217
To determine the required dilution factor for measuring hazelnut extracts being in linear range
218
for each metabolite a dilution array was built by mixing equal aliquots of a randomly chosen
219
representative extract from each country and diluting the mixture with following dilution
220
factors: 1.18, 1.43, 1.82, 2.5, 4, 10, 11.76, 14.29, 18.18, 25, 40, 100, 117.65, 142.86, 181.82,
221
250, 400, 1000.
222 223
Data processing and chemometrics
224
Peak integration of LC-MS data was performed with Analyst® Software (AB Sciex, version
225
1.6, Foster City, CA, USA). Metabolite concentration was calculated based on its
226
corresponding internal standard, represented their respective substance class, adding a
227
correction factor for each metabolite in its linear ranges considering the individual signal
228
responses for each mass transition. For those metabolites no standard substance could be
229
obtained, an estimated correction factor was extrapolated by averaging the values of its
230
related substance class. Since no triacylglycerol marker substance standards were available
231
TG(16:0/18:1/18:1) and TG(16:0/18:1/18:2) were used additionally instead. Multivariate
232
statistics was assessed in The Unscrambler X 10.3 (Camo Software, Oslo, Norway). Prior to 10 ACS Paragon Plus Environment
Page 11 of 36
Journal of Agricultural and Food Chemistry
233
statistical process the data set was scaled applying interquartile range scaling using the range
234
between the 75th and 25th percentile for each metabolite separately. Scaling parameters were
235
calculated based on authentic sample data and afterwards applied to non-authentic sample
236
data. Classification models for prediction of unknown samples were applied using support
237
vector machine classification (SVM) applying 4th grade polynomial kernel type with
238
offset = 3, C = 1 and Gamma = 0.1 as optimal parameters. Optimal kernel function and
239
related parameters were identified by exercising grid search function via cross-validation. The
240
data of triplicate analysis of the 150 authentic hazelnut samples was used as training data for
241
model creation. To validate the model, cross-validation was applied by dividing the training
242
set into six segments. The model was then used for prediction of the 50 non-authentic
243
hazelnut samples assuming that one sample (three measurements) is only attributed to a
244
particular country if at least two of three classifications predict the same. Furthermore, linear
245
discriminant analysis based on PCA scores (PCA-LDA) using quadratic method at 17
246
components as optimal parameters and soft independent modeling of class analogy (SIMCA)
247
(α = 0.05) were tested. Two-sided T-Test models for each country pair (e.g. Italy vs. Turkey)
248
were calculated employing unscaled values to test the equality of two means. Furthermore,
249
one-way analysis of variance (ANOVA) was calculated in ProfileAnalysis 2.1 (Bruker
250
Daltronics, Bremen, Germany) to test the equality of three or more means.
251 252
RESULTS AND DISSCUSION
253
LC-MS acquisition and validation
254
The metabolites analyzed in the presented work were previously identified as marker
255
substances for geographical origin discrimination of hazelnuts via non-targeted UPLC-ESI-
256
QTOF-MS considering samples of five countries (Germany, France, Italy, Georgia, Turkey).7
257
Subsequently, a targeted HPLC-ESI-QqQ-MS method for quantitation of the previously
258
identified 20 marker substances by usage of internal standards was developed and validated 11 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 12 of 36
259
(Table 1). For each substance class (phosphatidylcholines, phosphatidylethanolamines,
260
lysophosphatidylcholines, diacylglycerols and triacylglycerols) one representative deuterated
261
internal standard was used. With this method the great advantage over the already presented
262
non-targeted application is the possibility for determining absolute metabolite levels in mg/kg
263
hazelnut in only 13.5 min per sample by using a relatively cheap but robust high-throughput
264
HPLC-ESI-QqQ-MS application. Thus, the step from technically, financially and time
265
demanding high-resolution UPLC-ESI-QTOF-MS to this application is a reasonable
266
advancement and a great opportunity for routine analytics. Since origin discrimination
267
demands continual progression and improvement of statistical methods due to annual climatic
268
changes and perpetual developments in international trading, the comparability of results from
269
different acquisition times and laboratories is a crucial point. Thus, absolute quantitation via
270
internal standards using robust and commonly applied LC-ESI-QqQ-MS/MS provides
271
maximum comparability and the possibility for data fusion.
272
In contrast to UPLC-ESI-QTOF-MS acquisition, DG(16:0/16:1) could not be detected as its
273
sodium adduct. Instead, the ammonium adduct found during compound optimization was
274
used for quantitation, which is in compliance to the other di- and triacylglycerols the most
275
abundant ion. Validation of LC-MS method was performed for all commercially available
276
standards in accordance with guidelines for bioanalytical method validation of the FDA and
277
the German DIN 32645 either in absence (base calibration) or presence (matrix calibration) of
278
hazelnut matrix.24 Since none of the triacylglycerol marker substances could be purchased,
279
two structural similar triacylglycerols were used in method validation to demonstrate method
280
efficiency. In baseline calibration (Table 2) all compounds showed good linearity (R2 > 0.99)
281
in a broad dynamic linear range from lowest 0.03 µM to highest 106.46 µM. The limit of
282
detection (LOD) ranged from 0.19 nM to 978.49 nM. While phospholipids showed lowest
283
values, limits of detection for diacylglycerols were up to 22-fold higher and for
284
triacylglycerols up to 717-fold, expect of regarding at the internal standards, where all 12 ACS Paragon Plus Environment
Page 13 of 36
Journal of Agricultural and Food Chemistry
285
substance classes gained similar values. The obtained values for precision and accuracy at the
286
LLOQ were below 15% for 82% of the analytes, but all being less than 20%. 94% of the
287
values for expected concentrations ranged below 15% while none was above 20%.
288
To overcome matrix effects, linearity responses were studied in hazelnut matrix spiked with
289
mix-standard solution. The correlation coefficients of all the calibration curves showed good
290
linearity (R2 > 0.99) for 80% of the analytes and still acceptable linearity (0.98 < R2 < 0.99)
291
for 13%. Values for accuracy and precision at the LLOQ were below 15% for 80% of the
292
analytes, but all being less than 20%. 95% of the values for expected concentrations ranged
293
below 15% while none was above 20%, which demonstrated a minor influence of matrix
294
effects on the presented method.
295
For the remaining marker substances a dilution array of a representative mix of hazelnut
296
extracts from all six countries was used to estimate the required dilution factor for being in
297
linear range. Furthermore, this was used to determine the dilution factor for extracts being
298
used for quantitation of each metabolite. It turned out that all metabolites could be measured
299
in pure extracts except of TG(16:0/16:1/18:1), TG(14:0/16:0/18:1), TG(17:1/18:1/18:2) and
300
TG(18:2/18:2/18:3), which had to be quantitated in a 100-fold dilution. For those metabolites
301
that could be analyzed in both conditions the undiluted state was used. During first analyses
302
of hazelnut extracts it became apparent that the developed application was not sensitive
303
enough for quantitating γ-tocopherol since signal intensity was either below LOD or LLOQ.
304
Ensuing from LOD (81.4573 nM) and LLOQ (14.9995 µM), a γ-tocopherol content of
305
0.69 mg/kg should be theoretically detectable and 201.61 mg/kg quantifiable, respectively.
306
There are numerous publications about γ-tocopherol content in hazelnuts ranging from 2 to
307
240 mg/kg extracted hazelnut oil or not detectable, depending on variety, origin and analysis
308
method.27-32 Furthermore, values for mg/kg hazelnut range between 0.0038 and 20.8, or were
309
not detectable either.33-35 Although the development of a customized extraction procedure
310
including an enrichment step might overcome this problem, this was not contemplated as 13 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 14 of 36
311
extraction protocol was designed for maximum reproducibility. Changing the composition of
312
the extraction solvent to lower chloroform rate at higher methanol content resulting in higher
313
polarity for better tocopherol extraction would impair efficiency for the less polar tri- and
314
diacylglycerols. This should not be pursued since signal intensity for DG(16:0/16:1) and
315
TG(15:0/16:0/18:1) already is near the lower limit. Hence, extraction was kept as simple and
316
fast as possible with the downside of losing γ-tocopherol and thus, no internal standard for
317
tocopherols needed to be applied during validation process.
318
Since detectors response for different mass transitions varied within substance classes
319
between internal standard and metabolites, a correction factor for each metabolite was
320
calculated to overcome this variation. For those metabolites no standard could be achieved, a
321
value was extrapolated by average substance class correction factor. For triacylglycerol
322
marker substances TG(16:0/18:1/18:1) and TG(16:0/18:1/18:2) were used for extrapolation
323
resulting in limited accuracy for calculated metabolite levels, though.
324 325
Method application
326
19 of the previously identified marker substances were analyzed in 202 authentic hazelnut
327
samples. In contrast to non-targeted acquisition, samples were out of six commercially
328
relevant countries now (Turkey, Italy, Georgia, Spain, France, and Germany), additionally
329
including Spain. Every sample was measured as triplicate. As expected, level distributions
330
were similar to previous findings with decreasing contents of lipids with at least one
331
polyunsaturated fatty acid side chain from Germany over France and Italy to Georgia, Turkey
332
and Spain and lipids with only monounsaturated fatty acids in reverse order. (Figure 1)
333
Between Georgia, Turkey and Spain the order partially changes depending on key metabolite.
334
Average metabolite levels ranged from 1.03 mg/kg (DG(16:0/16:1)) to 4350.25 mg/kg
335
(DG(18:1/18:1)) at confidence intervals (α = 0.05) between 1.22% and 21.45% of the average
336
value showing up to 6-fold changed between countries (PC(18:2/18:2)). (Table 3) Principal 14 ACS Paragon Plus Environment
Page 15 of 36
Journal of Agricultural and Food Chemistry
337
component analysis was calculated visualizing the relations of samples against each other
338
(Figure 2). The overall variance explained by PC1-PC3 already yields 80%. Samples lying
339
close together have a similar metabolic profile, which is the case for all countries though
340
overlapping partially. While German samples are only overlapping with some French ones,
341
Turkish, Georgian, Spanish and Italian samples are more mixed. Nevertheless, clusters for
342
each country are visible. Only Spanish samples are divided stronger and two Georgian
343
samples are further away from the rest but both not divided by harvest years and neither for
344
any other country. It can be concluded that the harvest year does not cause much bias to the
345
metabolome, at least for the present samples. T-test for comparison of means was calculated
346
for each country-pair and metabolite to evaluate the significance of differences between
347
countries. (Figure 3) P-values between 0.001 and 0.05 indicate evidence against the null
348
hypothesis (metabolite levels of two groups are equal) and thus, that these two groups are
349
significantly different, while p-values < 0.001 represent very strong evidence for a highly
350
significant difference. A p-value above 0.05 signifies no evidence for rejecting the null
351
hypothesis.36 The amounts of metabolites comprising significant differences for each country-
352
pair could be used to estimate the similarity of metabolite contents and thus, the
353
discriminative ability of the mini-fingerprints. Although some metabolite ranges show strong
354
overlapping, Turkey-Georgia, Turkey-France, Italy-Georgia and Italy-Spain obtain 10 to 14
355
highly significant metabolites and moreover, the number for the remaining country pairs is
356
higher than 17 at maximum two non-significant ones. Difficulties may occur especially
357
discriminating Georgia and Spain, involving eight metabolites with coinciding levels at only
358
seven highly significant. Nevertheless, seven metabolites with highly significant changes still
359
may be sufficient for efficient discrimination. Out of the 19 marker substances PE(18:2/18:2)
360
overall has the best discriminative abilities according to one-way analysis of variance
361
(ANOVA), which is comparing the means of all six countries simultaneously, owing the
362
lowest p-value followed by PC(18:2/18:2) and DG(18:2/18:2). Because of the broad range of 15 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 16 of 36
363
metabolite contents a scaling step had to be applied to the data prior to multivariate statistics
364
to bring values to one scale which has also been done before PCA calculation. Here,
365
interquartile range scaling was utilized. Scaling parameters for each metabolite were
366
calculated based on authentic sample set and afterwards applied to non-authentic data. These
367
parameters can also be used for processing future measurements, so upcoming data can be
368
easily integrated into the present statistical models. With an appropriate scale data now can be
369
reasonably visualized without discriminating low values. (Figure 4) The bar charts can be
370
seen as a kind of barcodes or ‘mini-fingerprints’ being representative for each country. As
371
already appeared after calculating p-values in t-test, Georgia and Spain are very similar but
372
still differences can be overserved visually, while all other countries can be easily
373
differentiated by this chart. Because France’s metabolite levels are always located in the
374
middle of metabolite ranges, scaled values reckon around zero.
375 376
Sample prediction
377
For prediction of unknown samples SVM was used because of its great capabilities in
378
complex biological problems. SVM has some major advances over classical multivariate
379
approaches being able to separate even non-linear data by transformation of data into higher-
380
dimensional feature spaces and searching for optimal separating hyperplanes enabling
381
discrimination of complex cases and overlapping classes in original data.37,38 Several kernel
382
functions (sigmoid, radial basis, polynomial) give rise to classification with overlapping
383
groups mapping non-linear data into a linear separation case in feature space. In this case
384
polynomial function (with offset = 3, C = 1 and Gamma = 0.1 as optimal parameters) fits best
385
yielding 100% both for training accuracy as well as cross-validation accuracy. Cross-
386
validation was applied by multiple dividing training set into variable six segments, using 5/6
387
for model creation and subsequent prediction of the separated sixth. Even when using just two
388
segments, cross-validation accuracy still reached 97%, demonstrating the high robustness of 16 ACS Paragon Plus Environment
Page 17 of 36
Journal of Agricultural and Food Chemistry
389
the present model. Thus, the prediction model is able to differentiate all countries despite
390
overlapping metabolite ranges. This evinces the excellent suitability and strength of the
391
presented targeted metabolomics application for geographical origin discrimination of
392
hazelnuts.
393
In addition, 50 hazelnut samples for confectionary industry were analyzed in equal measure
394
and metabolite levels were calculated. Due to current market situation, mainly samples from
395
Turkey (33), Italy (nine) and Georgia (six) could be obtained and additionally one each from
396
France and Spain. Overlapping confidence intervals with authentic samples could be found in
397
77%. When only regarding countries with more than one confectionary sample, 93%
398
overlapping confidence intervals were obtained. By overlaying spider charts of scaled values
399
for authentic and confectionary samples these findings could be confirmed (Figure 5). Shapes
400
of spider diagrams reflect the similarities of the illustrated sample groups. While countries
401
among themselves have different shapes and can be easily differentiated by visual like bar
402
charts the contour of authentic and confectionary plots for each country closely resemble,
403
except of French ones. Nevertheless, the shape of its confectionary sample still coincides
404
most with the French’s authentic samples. Like in bar charts Georgia and Spain look very
405
similar, but characteristic differences in the markedness of spikes still can be perceived.
406
Employing confectionary samples on the SVM prediction model 80% were classified correct
407
at 20% wrong. One sample was defined as classified ‘correct’ if two or three single
408
measurements of triplicate analysis were predicted correct, ‘wrong’ if two or three predicted
409
the same but false country and ‘not classified’ if all three predictions were dissimilar.
410
Additionally, like in the non-targeted approach, PCA-LDA (98.4% training but only 24.0%
411
prediction accuracy) and SIMCA (multiple classifications, 0% false negative but only 25.3%
412
single correct attributions) were tested and the obtained results were again inferior to SVM.
413
Unlike previous findings, the combination of SVM with SIMCA as surveillance analysis
414
(because no sample was attributed false negative) did not improve the outcome dramatically 17 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 18 of 36
415
since only one sample could be assigned ‘not classified’ instead of being false predicted.
416
Looking at the data of SVM, four out of six Georgian samples could not be classified
417
correctly, indicating a poor suitability for this country. Nevertheless, 93% of the determined
418
metabolite levels were inside the respective range for authentic Georgian samples and 84%
419
for all ten false predicted samples, showing that there is not much deviation in metabolite
420
profiles themselves but rather blurred borders. Because of the partly strong overlapping
421
metabolite levels of countries the margins become indistinct in certain cases and properly
422
prediction is not easy. Concerning the French sample, despite the already mentioned deviation
423
from authentic set which could be observed visually by comparing both spider charts, it was
424
predicted correct though. As the metabolic profile of French hazelnuts differs relative strong
425
to other countries a higher variation could be tolerated for still correct prediction unlike it
426
would be the case for closer resembling countries. However, there is the possibility of being
427
different from authentic samples due to either biological variance or economic handling
428
resulting in poor predictability. This indicates that further improvements of prediction models
429
still have to be evaluated. By increasing the sample amount used for model creation this
430
problem may be overcome, since statistical reliability could be improved as larger sets of
431
samples would take much better account of natural extremes. This thesis could be confirmed,
432
when using confectionary and authentic samples together for model creation in SVM,
433
reaching again both 100% for training accuracy and cross-validation accuracy, respectively.
434
However, since the declared geographical origin of samples for confectionary industry could
435
not be entirely verified, one cannot rule out the possibility that conscious or accidental
436
adulterations have been taken place distorting the results and furthermore, some samples may
437
remain undistinguishable with this method because of too strong overlapping scopes.
438
Especially samples from border regions remain challenging, probably having very similar
439
metabolic profiles. This becomes critical if areas of cultivation within one country are more
440
spatially divided than the proximity of one of them to other country’s ones. Thus, for some 18 ACS Paragon Plus Environment
Page 19 of 36
Journal of Agricultural and Food Chemistry
441
cases, a classification into (transnational) geographical areas would be more sensible but
442
usually does not reflect commercial reality of declaration. Fortunately, harvest areas for the
443
present issue are not directly linked to areas of other countries.
444
In addition, five confectionary samples with roasted and blanched hazelnut kernels were
445
equally analyzed to estimate the effect of roasting to the predictability of the model.
446
Surprisingly again 80% could be attributed correct while only one sample was predicted
447
wrong. The metabolic profiles of correct predicted samples predominantly coincide with
448
authentic ones, lying inside the respective ranges in 93% of the cases. Further investigations
449
still have to proof these preliminary findings but the results are demonstrating that the general
450
applicability is given and that the presented marker substances seem to remain unaffected by
451
temperature impact of roasting process as well as blanching. Even though the high
452
temperatures of roasting would have a supposedly great influence on hazelnut metabolome,
453
the results are in accordance to a study of Locatelli et al. (2015) who inferred from
454
chemometric evaluation that the fatty acid composition of hazelnuts is not influenced by
455
roasting and thus, geographical origin discrimination capacity remains unaffected.39 The same
456
holds true for Alasalvar et al. (2010) who reported minor influences on fatty acid profile of
457
different Turkish hazelnut cultivars.40 Therefore, it can be concluded that the lipid fraction of
458
hazelnuts is not very sensitive to the temperature impact of roasting process.
459
In comparison to the non-targeted metabolomics application this method offers the great
460
opportunity for being relatively easy implemented in other laboratories using commonly
461
applied HPLC-ESI-QqQ-MS instruments while still yielding good results in discriminative
462
abilities. It offers a robust and fast quantitation of 19 key metabolites that are suitable for
463
origin discrimination of hazelnuts from six countries. Ongoing analyses of further samples
464
from other countries and harvest years still have to be executed for constant improvement of
465
statistical methods for better prediction of raw as well as roasted hazelnut kernels. Data
19 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 20 of 36
466
sharing and fusion could then be used to create a unique and comprehensive database
467
enabling greatest possible accuracy for prediction of unknown samples.
468 469
ABBREVIATIONS USED
470
MRM, multiple reaction monitoring; ANOVA, analysis of variances; SIMCA, soft
471
independent modelling of class analogy; PCA, principal component analysis; SVM, support
472
vector
473
phosphatidylcholine; PE, phosphatidylethanolamine; LOD, limit of detection; LLOQ, lower
474
limit of quantitation; FDA, Food and Drug Administration; Qnt, quantifier; Qal, qualifier;
475
IQR, interquartile range
machine
classification;
TG,
triacylglycerol;
DG,
diacylglycerol;
PC,
476 477
ACKNOWLEDGEMENTS
478
The authors are very grateful to SCA Unicoque, Erzeugerorganisation Deutscher
479
Haselnussanbauer UG, Schlüter&Maack GmbH, Amt für Ernährung, Landwirtschaft und
480
Forsten
481
AgroTeamConsulting, Institute of Biotechnology and Microbiology, University of Hamburg,
482
August Storck KG, Seeberger GmbH, Crisol de Frutos Secos, Azienda Agricola Cascina
483
Valcrosa, Basaran Entegre Gıda san. ve Tic. A.Ş, Alta Langa Azienda Agricola, Corilu
484
Societa Cooperativa Agricola, Coselva SCCL, Eganut LLC and Franken Genuss UG &
485
Co.KG for providing us with authentic hazelnut samples.
486
Furthermore we thank Ferrero OHG mbH, Heinrich Brüning GmbH, August Töpfer & Co.
487
(GmbH & Co.) KG, Lübecker Marzipan-Fabrik v. Minden & Bruhns GmbH & Co. KG, Carl
488
Wilhelm Clasen GmbH, Horst Walberg Trockenfrucht Import GmbH, Fratelli Caffa s.a.s.,
489
Ludwig Schokolade GmbH & Co. KG, Alfred Ritter GmbH & Co. KG, Kaufland
490
Omnichannel International GmbH & Co. KG, Stollwerck GmbH and Rapunzel Naturkost
491
GmbH for supporting us with raw hazelnut kernels for confectionary industry.
Fürth/Sortenversuchsanstalt
Gonnersdorf,
Stelma
SRL
Unipersonale,
20 ACS Paragon Plus Environment
Page 21 of 36
Journal of Agricultural and Food Chemistry
492
In addition, the authors thank René Bachmann and Thomas Hackl for excellent assistance in
493
sample preparation.
494 495 496
SUPPORTING INFORMATION
497
Detailed list with suppliers, provenance and cultivar information of all used authentic
498
hazelnut samples (Table S1) and confectionary samples (Table S2), metabolite concentrations
499
in validation solutions (Table S3), MRM acquisition parameters for each mass transition
500
(Table S4), scaling values and correction factors (Table S5), accuracy and precision for three
501
concentration levels in base calibration (Table S6) and matrix calibration (Table S8),
502
validation parameters for matrix calibration (Table S7), dilution array of a representative mix
503
of extract with regression equation, correlation coefficient and linear range (Table S9),
504
metabolite ranges of authentic samples for each country and metabolite (Table S10), p-values
505
for ANOVA calculations (Table S11), average and confidence intervals for confectionary
506
samples (Table S12), loadings plots of PCA calculations (Figure S1) and results of SVM
507
model predictions (Table S13) as well as equation for calculation of scaled values for each
508
metabolite (Equation S1).
21 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 22 of 36
509
REFERENCES
510
(1)
511
2015; http://faostat3.fao.org/browse/Q/QC/E (accessed Dez 20, 2016).
512
(2)
513
Image analysis techniques for automated hazelnut peeling determination. Food Bioprocess
514
Technol. 2009, 3, 155-159.
515
(3)
Mehlenbacher, S. A. Hazelnuts (Corylus). Acta Hortic. 1991, 290, 791-838.
516
(4)
Locatelli, M.; Coïsson, J. D.; Travaglia, F.; Cereti, E.; Garino, C.; D’Andrea, M.;
517
Martelli, A.; Arlorio, M. Chemotype and genotype chemometrical evaluation applied to
518
authentication and traceability of “Tonda Gentile Trilobata” hazelnuts from Piedmont (Italy).
519
Food Chem. 2011, 129, 1865-1873.
520
(5)
521
plant metabolomics strategies: from analytical platforms to data acquisition and processing.
522
Nat. Prod. Rep. 2014, 31, 784-806.
523
(6)
524
authentication. Food Res. Int. 2014, 60, 95-107.
525
(7)
526
Metabolomic approaches for geographical origin discrimination of hazelnuts (Corylus
527
avellana) by UPLC-QTOF-MS. J. Agric. Food Chem. 2016, 64, 9253-9262.
528
(8)
529
mass spectrometry and related techniques. J. Chromatogr. A. 2012, 1259, 3-15.
530
(9)
531
spectrometry in food-safety and quality analysis. TrAC, Trends Anal. Chem. 2013, 52, 170-
532
185.
533
(10)
534
flight mass spectrometry for the determination of pesticide residues in comparison with an
FAOstat Food and Agriculture Organization of the United Nations, Agricultural data
Pallottino, F.; Menesatti, P.; Costa, C.; Paglia, G.; De Salvador, F. R.; Lolletti, D.
Ernst, M.; Silva, D. B.; Silva, R. R.; Vencio, R. Z.; Lopes, N. P. Mass spectrometry in
Cubero-Leon, E.; Peñalver, R.; Maquet, A. Review on metabolomics for food
Klockmann, S.; Reiner, E.; Bachmann, R.; Hackl, T.; Fischer, M. Food fingerprinting:
Holcapek, M.; Jirasko, R.; Lisa, M. Recent developments in liquid chromatography-
Wang, X.; Wang, S.; Cai, Z. The latest developments and applications of mass
Picó, Y.; Blasco, C.; Farré, M.; Barcelo, D. Analytical utility of quadrupole time-of-
22 ACS Paragon Plus Environment
Page 23 of 36
Journal of Agricultural and Food Chemistry
535
optimized column high-performance liquid chromatography/tandem mass spectrometry
536
method. J. AOAC Int. 2009, 92, 734-744.
537
(11)
538
Dossena, A.; Drahos, L.; Vekey, K. Applications of liquid chromatography-mass
539
spectrometry for food analysis. J. Chromatogr. A. 2012, 1259, 74-85.
540
(12)
541
reaction monitoring for quantitative biomarker analysis in proteomics and metabolomics. J.
542
Chromatogr. B: Anal. Technol. Biomed. Life Sci. 2009, 877, 1229-1239.
543
(13)
544
Chemical Industry Press: Beijing, 2014.
545
(14)
546
fingerprinting for authentication of food in official control. Food Res. Int. 2014, 60, 189-204.
547
(15)
548
BioSyst. 2012, 8, 470-481.
549
(16)
550
spectrometry for food safety, quality and traceability. TrAC, Trends Anal. Chem. 2013, 52,
551
74-87.
552
(17)
553
safety and quality of plant-derived food. Food Res. Int. 2013, 54, 1172-1183.
554
(18)
555
metabolome analysis of fresh tobacco leaves in different geographical regions using liquid
556
chromatography-mass spectrometry. Anal. Bioanal. Chem. 2015, 407, 5009-5020.
557
(19)
558
orange origin discrimination by ultra-high performance liquid chromatography coupled to
559
quadrupole time-of-flight mass spectrometry. Food Chem. 2014, 157, 84-93.
Di Stefano, V.; Avellone, G.; Bongiorno, D.; Cunsolo, V.; Muccilli, V.; Sforza, S.;
Kitteringham, N. R.; Jenkins, R. E.; Lane, C. S.; Elliott, V. L.; Park, B. K. Multiple
Qi, X.; Chen, X.; Wang, Y., Plant Metabolomics: Methods and applications.
Esslinger, S.; Riedl, J.; Fauhl-Hassek, C. Potential and limitations of non-targeted
Zhou, B.; Xiao, J. F.; Tuli, L.; Ressom, H. W. LC-MS-based metabolomics. Mol.
Castro-Puyana, M.; Herrero, M. Metabolomics approaches based on mass
Oms-Oliu, G.; Odriozola-Serrano, I.; Martín-Belloso, O. Metabolomics for assessing
Li, L.; Lu, X.; Zhao, J.; Zhang, J.; Zhao, Y.; Zhao, C.; Xu, G. Lipidome and
Diaz, R.; Pozo, O. J.; Sancho, J. V.; Hernandez, F. Metabolomic approaches for
23 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 24 of 36
560
(20)
Bondia-Pons, I.; Savolainen, O.; Törrönen, R.; Martinez, J. A.; Poutanen, K.;
561
Hanhineva, K. Metabolic profiling of goji berry extracts for discrimination of geographical
562
origin by non-targeted liquid chromatography coupled to quadrupole time-of-flight mass
563
spectrometry. Food Res. Int. 2014, 63, 132-138.
564
(21)
565
according to variety and geographical origin based on polyphenolic profiles. J. Agric. Food
566
Chem. 2013, 61, 6949-6963.
567
(22)
568
MS/MS analysis of phenols for classification of red wine according to geographic origin,
569
grape variety and vintage. Food Chem. 2010, 122, 366-372.
570
(23)
571
M. Determination of six steviol glycosides of Stevia rebaudiana (Bertoni) from different
572
geographical origin by LC–ESI–MS/MS. Food Chem. 2013, 141, 745-753.
573
(24)
574
Health and Human Services, FDA, Center for Drug Evaluation and Research, Rockville, MD,
575
2001.
576
(25)
577
New York, 1964.
578
(26)
579
coefficient variation. Econometrica. 1979, 47, 1287-1294.
580
(27)
581
oils in hazelnuts (Corylus avellana L.) grown in New Zealand. J. Am. Oil Chem. Soc. 1997,
582
74, 755-759.
583
(28)
584
hazelnut oil composition with comparisons to other vegetable oils, particularly olive oil. Eur.
585
Food Res. Technol. 2003, 218, 13-19.
Guo, J.; Yue, T.; Yuan, Y.; Wang, Y. Chemometric classification of apple juices
Jaitz, L.; Siegl, K.; Eder, R.; Rak, G.; Abranko, L.; Koellensperger, G.; Hann, S. LC–
Montoro, P.; Molfetta, I.; Maldini, M.; Ceccarini, L.; Piacente, S.; Pizza, C.; Macchia,
U.S. FDA. Guidance for Industry: Bioanalytical method validation; US Department of
Mandel, J., The statistical analysis of experimental data. Dover Publications, INC.:
Breusch, T. S.; Pagan, A. R. A simple test for heteroscedasticity and random
Savage, G. P.; McNeil, D. L.; Dutta, P. C. Lipid composition and oxidative stability of
Benitez-Sánchez, P. L.; León-Camacho, M.; Aparicio, R. A comprehensive study of
24 ACS Paragon Plus Environment
Page 25 of 36
Journal of Agricultural and Food Chemistry
586
(29)
Alasalvar, C.; Amaral, J. S.; Satır, G.; Shahidi, F. Lipid characteristics and essential
587
minerals of native Turkish hazelnut varieties (Corylus avellana L.). Food Chem. 2009, 113,
588
919-925.
589
(30)
590
composition, tocols profile, and oxidative stability of some fully refined edible oils. Int. J.
591
Food Prop. 2015, 18, 2064-2076.
592
(31)
593
M. F. Determination of lipid and phenolic fraction in two hazelnut (Corylus avellana L.)
594
cultivars grown in Poland. Food Chem. 2015, 168, 615-622.
595
(32)
596
the most represented hazelnut varieties cultivated in Croatia. Ital. J. Food Sci. 2014, 26, 24-
597
30.
598
(33)
599
(Corylus avellana L.) varieties cultivated in Turkey. Food Chem. 2006, 99, 509-515.
600
(34)
601
and tocotrienol content of hazelnut cultivars grown in Portugal. J. Agric. Food Chem. 2006,
602
54, 1329-1336.
603
(35)
604
various tree nuts and peanuts. J. Agric. Food Chem. 2013, 61, 11636-11644.
605
(36)
606
and Purpose. Am. Stat. 2016, 70, 129-133.
607
(37)
608
classification in chemometrics. Crit. Rev. Anal. Chem. 2006, 36, 177-188.
609
(38)
610
dimension reduction and statistical analysis. TrAC, Trends Anal. Chem. 2011, 30, 827-841.
Ayyildiz, H. F.; Topkafa, M.; Kara, H.; Sherazi, S. T. H. Evaluation of fatty acid
Ciemniewska-Zytkiewicz, H.; Verardo, V.; Pasini, F.; Brys, J.; Koczon, P.; Caboni,
Vujević, P.; Petrović, M.; Vahčić, N.; Milinović, B.; Čmelik, Z. Lipids and minerals of
Köksal, A. Đ.; Artik, N.; Şimşek, A.; Güneş, N. Nutrient composition of hazelnut
Amaral, J. S.; Casal, S.; Alves, M. R.; Seabra, R. M.; Oliveira, B. P. P. Tocopherol
Esche, R.; Müller, L.; Engel, K.-H. Online LC-GC-based analysis of minor lipids in
Wasserstein, R. L.; Lazar, N. A. The ASA's Statement on p-Values: Context, Process,
Xu, Y.; Zomer, S.; Brereton, R. G. Support vector machines: A recent method for
Liland, K. H. Multivariate methods in metabolomics – from pre-processing to
25 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 26 of 36
611
(39)
Locatelli, M.; Coisson, J. D.; Travaglia, F.; Bordiga, M.; Arlorio, M. Impact of
612
roasting on identification of hazelnut (Corylus avellana L.) origin: A chemometric approach.
613
J. Agric. Food Chem. 2015, 63, 7294-7303.
614
(40)
615
composition of Turkish hazelnut varieties (Corylus avellana L.). Int. J. Food Sci. Nutr. 2010,
616
61, 630-642.
Alasalvar, C.; Pelvan, E.; Topal, B. Effects of roasting on oil and fatty acid
617
618
26 ACS Paragon Plus Environment
Page 27 of 36
Journal of Agricultural and Food Chemistry
FIGURE CAPTIONS Figure 1: Metabolite levels of authentic hazelnut samples in mg/kg Figure 2: PCA scores plots of authentic hazelnut samples, A) PC1 vs. PC2 and B) PC1 vs. PC3 Figure 3: P-values of T-test calculations between country pairs for each metabolite Figure 4: Bar charts with confidence intervals of scaled values for authentic hazelnut samples Figure 5: Spider diagrams of authentic and confectionary hazelnut samples for each country displayed as scaled values
27 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 28 of 36
TABLES Table 1: Chromatographic and mass spectrometric information of key metabolites for geographical origin discrimination of hazelnuts metabolite
sum formula
adduct ion
Rt [min]
precursor [Da]
Qnt [Da]
Qal [Da]
γ-tocopherol
C28H48O2
+H
2.60
417.646
151.1
69.1
PC(18:2/0:0)
C26H50NO7P
+H
1.19
520.353
184.2
104.2
PC(18:1d7/0:0)
C26H45D7NO7P
+H
1.28
529.358
184.2
104.2
DG(16:0/16:1)
C35H66O5
+NH4
3.49
584.525
311.4
313.4
DG(15:0/18:1d7)
C36H61D7O5
+NH4
3.81
605.431
570.6
299.3
DG(16:0/18:1)
C37H70O5
+NH4
4.26
612.607
339.4
313.5
DG(18:2/18:2)
C39H68O5
+NH4
3.38
634.520
377.3
95.2
DG(18:1/18:1)
C39H72O5
+NH4
4.43
638.640
399.5
83.2
TG(2:0/18:2/18:2)
C41H70O6
+NH4
4.12
676.551
379.4
599.5
TG(2:0/18:1/18:2)
C41H72O6
+NH4
4.73
678.570
381.3
379.3
PE(15:0/18:1d7)
C38H67D7NO8P
+H
2.44
711.571
570.6
57.1
PE(16:0/18:2)
C39H74NO8P
+H
2.41
716.581
575.6
81.2
PE(18:2/18:2)
C41H74NO8P
+H
2.26
740.647
599.5
95.2
PE(18:1/18:1)
C41H78NO8P
+H
2.82
744.470
603.6
69.0
PC(15:0/18:1d7)
C41H73D7NO8P
+H
2.43
753.592
184.2
86.2
PC(16:0/18:3)
C42H78NO8P
+H
2.13
756.550
184.2
86.1
PC(16:0/18:2)
C42H80NO8P
+H
2.41
758.635
184.3
86.2
PC(18:2/18:2)
C44H80NO8P
+H
2.25
782.596
184.2
86.1
PC(18:1/18:2)
C44H82NO8P
+H
2.48
784.596
184.2
86.1
TG(14:0/16:0/18:1)
C51H96O6
+NH4
8.19
822.755
549.5
577.5
TG(15:0/18:1d7/15:0)
C51H89D7O6
+NH4
8.15
829.782
570.6
523.5
TG(15:0/16:0/18:1)
C52H98O6
+NH4
6.76
836.771
563.5
537.5
TG(16:0/16:1/18:1)
C53H98O6
+NH4
8.23
848.770
575.5
81.1
TG(16:0/18:1/18:1)1
C55H102O6
+NH4
8.45
874.786
95.1
81.0
TG(16:0/18:1/18:2)1
C55H100O6
+NH4
8.26
876.801
81.0
95.2
TG(17:1/18:1/18:2)
C56H100O6
+NH4
8.20
886.787
587.5
589.5
TG(18:2/18:2/18:3)
C57H96O6
+NH4
7.76
894.756
597.5
599.5
Rt: retention time; Qnt: Quantifier; Qal: Qualifier; PC: Phosphatidylcholine; DG: Diacylglycerol; TG: Triacylglycerol; PE: Phosphatidylethanolamine 1 only used for method validation
28 ACS Paragon Plus Environment
Page 29 of 36
Journal of Agricultural and Food Chemistry
Table 2: Validation parameters for base calibration analyte
regression equation
R2
PC(18:2/0:0)
y = 2.61E+05 + 6.09E+04
0.9976 0.5773 – 9.6219
1.3634
DG(16:0/18:1)
y = 8.24E+04 – 1.78E+05
0.9999 3.3600 – 84.0000
3.5744
DG(18:2/18:2)
y = 8.98E+04 – 3.08E+04
0.9971 1.9449 – 58.3468
8.6852
DG(18:1/18:1)
y = 6.56E+04 – 4.21E+04
0.9961 2.3020 – 92.0800
30.2737
PE(16:0/18:2)
y = 9.99E+04 – 4.64E+04
0.9992 2.1487 – 21.4869
4.2465
PE(18:2/18:2)
y = 1.01E+05 + 3.21E+02
0.9967 0.0400 – 20.0028
2.4762
PE(18:1/18:1)
y = 7.93E+04 – 4.40E+03
0.9996 0.3978 – 59.6750
3.6945
PC(16:0/18:2)
y = 4.82E+05 + 5.18E+04
0.9989 0.3997 – 19.9872
1.6968
PC(18:2/18:2)
y = 4.68E+05 + 4.35E+04
0.9989 0.3996 – 19.9785
1.5106
γ-tocopherol
y = 1.37E+02 + 3.13E+03
0.9981 14.9995 – 74.9976 81.4573
TG(16:0/18:1/18:1)1
y = 5.85E+04 + 1.92E+05
0.9974 6.0187 - 20.0623
978.4892
1
TG(16:0/18:1/18:2)
y = 8.03E+04 + 1.63E+05
0.9963 4.2413 - 21.2063
733.9936
PC(18:1d7/0:0)
y = 4.13E+04 + 4.50E+03
0.9982 0.1184 - 23.6790
0.2000
DG(15:0/18:1d7)
y = 3.06E+04 - 2.64E+03
0.9979 0.1065 - 106.4625 0.4072
linear range [µM]
LOD [nM]
PE(15:0/18:1d7)
y = 3.28E+04 + 6.48E+03
0.9980 0.0900 - 17.9946
0.3735
PC(15:0/18:1d7)
y = 7.25E+04 - 2.45E+02
0.9986 0.0499 - 33.2476
0.1924
0.9982 0.0315 - 6.2999
0.3839
TG(15:0/18:1d7/15:0) y = 1.69E+05 - 2.15E+03 1
only used for method validation
29 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 30 of 36
Table 3: Average metabolite levels for authentic hazelnut samples with confidence intervals (α = 0.05) in mg/kg PC(18:2/0:0) Germany 5.00 ± 0.28
DG(16:0/16:1) 1.03 ± 0.05
DG(16:0/18:1) 470.10 ± 25.02
DG(18:2/18:2) 1633.95 ± 73.10
DG(18:1/18:1) 2052.64 ± 86.24
France
3.86 ± 0.09
1.34 ± 0.03
690.28 ± 13.93
1236.34 ± 29.41
3044.35 ± 50.27
Georgia
2.26 ± 0.19
1.98 ± 0.10
917.22 ± 33.51
500.45 ± 36.56
3911.33 ± 108.52
Italy
2.48 ± 0.12
1.59 ± 0.07
854.77 ± 29.74
780.03 ± 45.66
3779.80 ± 73.22
Spain
1.87 ± 0.23
1.87 ± 0.27
992.95 ± 117.71
966.65 ± 134.04
4295.52 ± 234.07
Turkey
1.76 ± 0.07
1.78 ± 0.08
953.89 ± 48.02
535.37 ± 33.34
4350.25 ± 267.33
TG(2:0/18:1/18:2) 1295.90 ± 57.96
PE(16:0/18:2) 127.12 ± 5.44
PE(18:2/18:2) 143.63 ± 6.12
PE(18:1/18:1) 491.01 ± 18.34
TG(2:0/18:2/18:2) Germany 394.27 ± 23.51 France
220.57 ± 9.76
893.69 ± 29.03
92.87 ± 1.64
86.09 ± 2.45
602.82 ± 10.68
Georgia
78.99 ± 13.25
423.13 ± 55.02
43.22 ± 4.89
23.11 ± 3.25
621.24 ± 30.50
Italy
71.79 ± 9.10
398.00 ± 35.82
58.98 ± 2.53
39.88 ± 2.45
650.24 ± 20.02
Spain
154.89 ± 29.79
711.24 ± 110.87
53.48 ± 8.77
32.69 ± 5.06
569.65 ± 31.05
Turkey
103.22 ± 15.61
505.25 ± 53.06
41.14 ± 2.41
22.62 ± 2.65
693.27 ± 31.34
PC(16:0/18:2) 112.98 ± 4.33
PC(18:2/18:2) 177.01 ± 7.90
PC(18:1/18:2) 258.52 ± 5.78
TG(14:0/16:0/18:1) 99.02 ± 4.66
PC(16:0/18:3) Germany 4.95 ± 0.24 France
3.74 ± 0.08
85.90 ± 1.47
110.08 ± 2.97
230.97 ± 2.84
132.87 ± 3.86
Georgia
1.64 ± 0.20
49.87 ± 5.16
34.54 ± 5.09
129.44 ± 8.58
192.40 ± 18.76
Italy
2.20 ± 0.11
60.17 ± 2.55
54.46 ± 2.85
168.63 ± 4.33
182.41 ± 12.35
Spain
1.82 ± 0.39
53.85 ± 8.92
45.21 ± 8.58
144.35 ± 14.13
208.90 ± 24.71
Turkey
1.41 ± 0.14
40.24 ± 2.41
29.06 ± 3.29
120.27 ± 5.92
119.70 ± 7.08
TG(15:0/16:0/18:1) TG(16:0/16:1/18:1) TG(17:1/18:1/18:2) TG(18:2/18:2/18:3) γ-tocopherol Germany 18.62 ± 0.99 3658.28 ± 150.83 572.76 ± 26.86 1082.30 ± 64.11 < LLOQ France
20.90 ± 0.40
3505.22 ± 78.31
439.32 ± 10.00
730.32 ± 18.78
< LLOQ
Georgia
13.31 ± 0.66
3708.43 ± 434.26
292.88 ± 24.51
459.13 ± 34.98
< LLOQ
Italy
15.84 ± 0.98
3198.68 ± 160.06
262.15 ± 15.82
460.93 ± 30.57
< LLOQ
Spain
14.76 ± 1.28
4233.26 ± 538.53
338.35 ± 47.09
688.94 ± 104.53
< LLOQ
Turkey
15.86 ± 0.80
2533.94 ± 124.89
346.11 ± 25.45
692.57 ± 56.16
< LLOQ
30 ACS Paragon Plus Environment
Page 31 of 36
Journal of Agricultural and Food Chemistry
FIGURES Figure 1
31 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 32 of 36
Figure 2
32 ACS Paragon Plus Environment
Page 33 of 36
Journal of Agricultural and Food Chemistry
Figure 3
33 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 34 of 36
Figure 4
34 ACS Paragon Plus Environment
Page 35 of 36
Journal of Agricultural and Food Chemistry
Figure 5
35 ACS Paragon Plus Environment
Journal of Agricultural and Food Chemistry
Page 36 of 36
TABLE OF CONTENT
36 ACS Paragon Plus Environment