Forensic Investigations of Diesel Oil Spills in the Environment Using

Dec 5, 2018 - Forensic investigations of oil spills aim to find the responsible source(s) of the spill. Oil weathering processes change the chemical c...
1 downloads 0 Views 2MB Size
Subscriber access provided by University of Winnipeg Library

Energy and the Environment

Forensic Investigations of Diesel Oil Spills in the Environment using Comprehensive Two-Dimensional Gas Chromatography – High Resolution Mass Spectrometry and Chemometrics: New perspectives in the Absence of Recalcitrant Biomarkers Guilherme Lionello Alexandrino, Giorgio Tomasi, Paul Kienhuis, Fabio Augusto, and Jan H. Christensen Environ. Sci. Technol., Just Accepted Manuscript • DOI: 10.1021/acs.est.8b05238 • Publication Date (Web): 05 Dec 2018 Downloaded from http://pubs.acs.org on December 6, 2018

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 29

Environmental Science & Technology

1

Forensic Investigations of Diesel Oil Spills in the Environment using

2

Comprehensive Two-Dimensional Gas Chromatography – High

3

Resolution Mass Spectrometry and Chemometrics: New perspectives in

4

the Absence of Recalcitrant Biomarkers

5

Guilherme L. Alexandrino*1,2, Giorgio Tomasi2, Paul G. M. Kienhuis3, Fabio Augusto1 and Jan. H.

6

Christensen2

7 8

1

9

970, Campinas – SP, Brazil.

Institute of Chemistry, State University of Campinas, Cidade Universitária Zeferino Vaz, 13083-

10

2

11

Thorvaldsensvej 40, DK-1871, Frederiksberg C, Denmark.

12

3

13

2, Lelystad, The Netherlands

Department of Plant and Environmental Sciences, Faculty of Science, University of Copenhagen,

Ministry of Infrastructure and Water Management, Rijkswaterstaat laboratory, Zuiderwagenplein

14 15

*Corresponding author:

16

Guilherme L. Alexandrino, Ph.D.

17

Analytical Chemistry, Department of Plant and Environment Sciences

18

Faculty of Sciences, University of Copenhagen

19

Thorvaldsensvej 40, 1871 Frederiksberg C, Denmark

20

Phone: +45 50 27 23 80

21

E-mail: [email protected]

22 23

1 ACS Paragon Plus Environment

Environmental Science & Technology

Page 2 of 29

24

Abstract

25

Forensic investigations of oil spills aim to find the responsible source(s) of the spill. Oil

26

weathering processes changes the chemical composition of the spilled oil and makes the matching of

27

oil spill samples to potential sources difficult. Diesel oil spill cases are more challenging, because

28

biomarkers recalcitrant to long-term weathering are absent. We developed and tested a new method

29

for the analysis and matching of diesel oil spills using two-dimensional gas chromatography – high

30

resolution mass spectrometry (GC×GC – HRMS) and 2D-CHEMSIC (2-Dimensional CHEMometric

31

analysis of Selected Ion Chromatograms), an extension of the CHEMSIC method to GC×GC data.

32

The 2D-CHEMSIC performs pixel-based analysis using chemometrics on concatenated sections of

33

2D extracted ions chromatograms to assess the overall chemical variability of the samples, with

34

potential applications for matching spill-source pairs in forensic investigations. The method was

35

tested on samples from a number of diesel oil spill cases, i) distinguishing chemically similar source

36

diesels, ii) investigating weathering effects on spill samples to determine type and degree of

37

weathering, and iii) to improve matching of diesel oil spill affected by weathering. Positive matches

38

for spill-source pairs were identified after excluding the signals from the hydrocarbons most

39

susceptible to evaporation, and photo-oxidized spills were also matched due to the presence of

40

unaffected hydrocarbons. Forensic diagnostics obtained by the 2D-CHEMSIC were validated by the

41

conventional CEN-Tr method.

42 43

Keywords: quadrupole – time of flight mass spectrometry, diesel oil spills, pixel-based analysis,

44

weathering, chemometrics

45 46

2 ACS Paragon Plus Environment

Page 3 of 29

Environmental Science & Technology

47

1. Introduction

48

Oil spill investigations demand large effort worldwide to identify the source of the pollution, to 1,2.

49

assess environmental risks and to mitigate potential damages

Weathering effects such as

50

evaporation, dissolution, photo-oxidation and microbial degradation affect the hydrocarbons profile

51

in spilled fuels, while the most weathering-resistant petroleum biomarkers steranes and hopanes,

52

which are contained exclusively in the heavier fractions, can persist

53

identifications of petroleum mid-distillates, such as diesel fuels, are more challenging compared to

54

cases involving heavy fuels or crude oils as the spill-source pairs matching can only be based on

55

lighter compounds that are more susceptible to weathering processes. Bicyclic sesquiterpanes is one

56

group of compounds that have been used to this end 4,5,6 as their isomer composition is unaffected by

57

short-term weathering effects 7,8. However, more diagnostic compound groups are sought for oil spill

58

identification of mid-distillates affected for both short-term but especially for long-term weathering.

59

Comprehensive two-dimensional gas chromatography (GC×GC) is a powerful tool for the

60

separation of highly complex samples such as oil products 9. GC×GC uses two capillary columns

61

with orthogonal separation properties (e.g.; non-polar × polar or vice versa) connected through a

62

modulator, which periodically concentrates a fraction of the eluate coming from the first column (first

63

dimension, 1D) and next reinjects this fraction as a narrower band into the head of the shorter second

64

column (second dimension, 2D). The analytical advantages of GC×GC have been demonstrated for

65

the analysis of crude oils and petroleum products and the more detailed chemical fingerprints of

66

different classes of hydrocarbons have already been investigated in crude oils for geochemistry

67

purposes 10,11,12, for the characterization of petroleum distillates 13,14,15 and for oil spill investigations

68

16.

69

HRMS), has been shown to provide a more accurate target analysis of key compounds and less

70

ambiguous identifications of the chromatographic peaks 17.

1,3.

Forensic oil spill

More recently, the coupling of GC×GC with high resolution mass spectrometry (GC×GC –

3 ACS Paragon Plus Environment

Environmental Science & Technology

Page 4 of 29

71

In oil spills forensics using conventional 1D GC-FID and GC-MS, the golden standard is a tiered

72

approach used for data analysis which encompasses the visual comparison of chromatograms

73

targeting the n-alkanes distribution and the calculation of diagnostic ratios from the relative peak

74

areas or concentrations of PAHs and saturated petroleum biomarkers 18. However, the conventional

75

approach is less suited for handling long-term weathered mid-distillates due to the absence of the

76

recalcitrant biomarkers that could provide successful oil-source correlations in forensic investigations

77

about oil spills. Pixel-based analysis of sections of chromatograms is an especially interesting

78

alternative to peak integration for forensic mid-distillate spill investigations, because the search for

79

spill-source matching is done using all the potential chromatographic pixels, instead of only few

80

selected compounds susceptible to weathering. Christensen et al. have already introduced the so-

81

called CHEMSIC method (CHEMometric analysis of Selected Ion Chromatograms) for forensic

82

investigations about crude oil environmental spills using 1D GC-MS

83

important advantages compared to the conventional approach, such as i) a more comprehensive

84

diagnostics as a larger fraction of the chromatographic data is analyzed instead of only pre-selected

85

target compounds, and ii) higher flexibility in forensic investigations as tens of different SICs can be

86

combined for the analysis 20.

19,20,21,22.

CHEMSIC provides

87

In this study, we developed and tested a new method for analysis and matching of diesel oil spills

88

using GC×GC – HRMS and 2D-CHEMSIC, an extension of the CHEMSIC method to GC×GC data.

89

In conventional 1D GC-MS, the forensic investigations are focused to the analysis of the compound

90

groups included in the CEN-guidelines

91

resolution MS detection. Conversely, GC×GC – HRMS provides enhanced chromatographic

92

separation and the HRMS detection provides accurate mass extracted ions chromatograms (EICs) for

93

less ambiguous identification of the group of compounds 25. This more powerful instrumentation is a

94

key advantage when analyzing highly complex mixture of naphthenes in mid-distillates that could

23,24

due to coelution of compounds group and the low-

4 ACS Paragon Plus Environment

Page 5 of 29

Environmental Science & Technology

95

contain a great diagnostic power for forensic investigations, because they are relatively recalcitrant

96

toward biodegradation, dissolution and photo-oxidation 8,26. The method was tested on samples from

97

a number of diesel oil spill cases. The sample set was split into; i) diesels spills containing the initial

98

suspected source(s) (Group A), and ii) diesel spills with no initial suspected source (Group B). The

99

samples were analyzed using a reversed phase columns set for the GC×GC, i.e. polar × non-polar,

100

because this column configuration has provided better chromatographic resolution between

101

linear/branched alkanes, naphthenes and monoaromatics than the normal phase columns set (i.e., non-

102

polar × polar) 15,27, in addition to the enhanced separation of compound classes due to the extraction

103

of the high resolution extracted ion chromatograms (EICs). 2D-CHEMSIC was applied to GC×GC–

104

HRMS data using EICs from the different groups of compounds contained in diesels that are already

105

investigated for diagnostic information about the spills. The data from all the spill cases were

106

preprocessed by retention time shift corrections, normalization and then submitted to pixel-based

107

analysis using Weighted Principal Component Analysis (WPCA), aiming to find the most probable

108

source(s) of the spilled diesels that were susceptible to different degrees of weathering. Because of

109

the inherent absence of recalcitrant biomarkers, the modeling strategy was focused on: i)

110

distinguishing different sources of similar diesels, ii) identifying the weathering effects in the diesel

111

spills, and iii) proposing modelling strategies to minimize or handle the influence of the weathering

112

effects in the chromatographic fingerprints. The matching criterion for overall chemical similarities

113

between spills and suspected sources were based on statistical hypothesis tests in the reduced PC

114

subspace 28,29. The 2D-CHEMSIC method was validated according to the diagnostics obtained by the

115

standardized CEN-Tr method for forensic investigation of oil spills, and the advantages of our method

116

were highlighted.

117 118

2. Materials and Methods

5 ACS Paragon Plus Environment

Environmental Science & Technology

Page 6 of 29

119 120

2.1.

Samples

121

The sample set included 32 diesel samples of 13 potential source diesel candidates and 19

122

samples of spills split into 12 real cases of diesel spills into marine environments (Table 1). The spills

123

and the source diesels were obtained from the collaboration within the Bonn Agreement Oil Spill

124

Identification Network Group (BonnOSInet). The spill samples were grouped according to the

125

presence or absence of the diesel source suspect(s) (A or B, respectively), the i-th case number and

126

the j-th spill sample from the i-th case, while the diesels source oils were coded for each k-th sample

127

(k = 1 to 13) A quality control (QC) sample was prepared by mixing equal volumes of seven sample

128

extracts (5 source diesels and 2 spill samples). The composition of the QC sample provided a good

129

target chromatogram for the retention time alignment step (Section 3.1. Preprocessing) and for daily

130

GC column performance quality assurance.

131 132

2.2.

Sample preparation

133

Samples from oil cases can arrive at the lab as pure gasoil from fuel tanks, as thick layers on

134

water, or as a small amount of oil taken with an EFTE net. Nets are used to collect oil from the surface

135

water, from surfaces on board of a ship and from fuel and bilge tanks. Pure gasoil samples were

136

diluted (15 µl into 10 ml DCM =1.26 mg/ml) and 1 µl was used for the analysis. The net samples

137

arrived in a pot. 30 ml of DCM were added and the pot was shaken for 5 min to extract the oil from

138

the net. Based on visual appearance, 1 µl of the extract was directly analyzed in a test run with GC-

139

FID or injected after dilution. Black samples were first cleaned over a small column filled with

140

sodium sulfate and Florisil before injection 30. Based on the GC-FID test run results, the concentration

141

of oil in the net samples was adjusted to a concentration between 1.0 to 1.5 mg/ml.

6 ACS Paragon Plus Environment

Page 7 of 29

Environmental Science & Technology

142 143

2.3.

GC×GC – (HR)QTOFMS

144

The sample set included the 32 sample extracts of diesel source oil and spill samples, analytical

145

replicates of 10 randomly selected sample extracts, 5 QC replicates and 3 solvent blanks. The runs

146

were divided into 3 batches that contained no more than 17 sample extracts, incl. analytical replicates,

147

2 QC samples and 1 solvent blank to check for cross-contamination. The extracts were analyzed on

148

an Agilent 7890B GC System modified with a secondary column oven and a Zoex ZX2 cryogenic

149

loop-type modulator (Zoex Corporation, Houston, TX – USA), which was interfaced with a 7200

150

Accurate Mass GC/QTOF mass spectrometer (Agilent Technologies, Palo Alto, CA – USA). The MS

151

system was operating in electron ionization mode. An inverse GC×GC column set was installed: 1D

152

column was an intermediate polarity ZB-50 column (60 m, 0.25 mm i.d., 0.25 μm) and the 2D column

153

was a non-polar ZB-5 column (1.5 m, 0.15 mm i.d., 0.15 μm) (Phenomenex, Torrance, CA – USA).

154

A SilTite™ µ-Union ferrule (SGE Analytical Science, Wetherill Park, NSW - Australia) was used

155

for the column connection. Aliquots of 1μL were injected in split mode 1:20 and injection temperature

156

of 300 oC, using the following oven temperature gradient: 60 oC held for 10 min, 5 oC min−1 to 300

157

oC

158

the secondary oven containing the 2D column. Helium (≥ 99.9999 % purity) was used as carrier gas

159

at a flow rate of 2.0 ml min-1. The modulation was performed with the Zoex ZX2 cryogenic loop-

160

type modulator using the final 0.7 m length of the first column as the loop. An independent cooling

161

system provided cold jets of N2(g) at approximately -70 oC, while the hot jets (pressure = 20 psi) were

162

obtained using N2(g) heated at a constant temperature offset of +60 oC from the primary oven. The

163

modulation period was adjusted to 7.0 s, holding the hot jet for 800 ms each cycle. The electron

164

ionization source was operated at 230 oC with emission current of 35 μA and 70 eV. The QTOF mass

165

analyzer was operated in the TOF mode with transfer line temperature of 300 oC and quadrupole

and held for 5 min. A constant temperature offset of +10 oC from the primary oven was used in

7 ACS Paragon Plus Environment

Environmental Science & Technology

Page 8 of 29

166

temperature of 150 oC. Mass calibration was performed before each run using perfluorotributylamine

167

as instrument internal standard. Data acquisition was performed in the m/z range of 60 – 500 Da with

168

25 spectra s-1. MassHunter ver. B.06.00 was used for instrument control (Agilent Technologies).

169 170

3. Data Treatment

171 172

3.1.

Preprocessing

173

A graphical scheme about the preprocessing steps can be seen in the upper part of Fig. 1. The

174

MassHunter data files were converted to netCDF files using the AIA File Translator program from

175

Agilent, which inevitably bins the m/z values according to an accuracy of ±0.025 Da. An in-house

176

script was written to import specific high-resolution (1rt × 2rt) 2D-EICs, where 1rt and 2rt are the

177

retention times in the 1D and 2D, respectively, into Matlab R2016a (Matworks, Natick, MA – USA).

178

A total of 54 2D-EICs that describe the most relevant groups of hydrocarbons found in diesels, i.e.

179

straight and branched alkanes, cyclic alkanes, olefins, monoaromatic hydrocarbons and two- to four

180

rings non- and alkylsubstituted PAHs were imported for each sample (Table S1 in the Supporting

181

Information). The 2D-EIC corresponding to the same type of compounds (e.g.; m/z = 113.15, 127.15,

182

183.20 and 197.25 for branched alkanes, see Table S1) were summed into a new single 2D - summed

183

EIC (2D-sEIC) per group, to reduce redundant signals that describe the same group of compounds.

184

The 2D-sEICs were phase-corrected for each sample to account for rigid retention time shifts due to

185

the synchronization gaps between the instrument and the starting of the modulation, and regions of

186

each 2D-sEIC containing only baseline noise were visually identified and excluded to reduce the data

187

size. The 2D correlation optimized warping algorithm (2D-COW)

188

sEICs to a common target. The optimal warping parameters: segment length and slack (how much

31

was used to align of the 2D-

8 ACS Paragon Plus Environment

Page 9 of 29

Environmental Science & Technology

189

each segment is allowed to change in the alignment) were obtained using an in-house script in Matlab

190

that performs grid search in the parameters space

191

dimensions. The 2D-COW aligns each 2D-sEIC individually, here using the chromatogram of a QC

192

replicate obtained in the middle of the analytical sequence as target. The aligned 2D-sEICs were

193

unfolded and combined row-wise for each sample, resulting in an augmented data matrix D (K × L),

194

where K are the objects (i.e. samples, replicates and QC samples) and L the variables containing the

195

aligned EICs. Some slight peak misalignments persisting only in the 2D was corrected using the

196

icoshift v. 3.0 algorithm 33 (downloaded from http://models.life.ku.dk). Finally, D was normalized to

197

unitary Euclidian norm to remove the differences in the injected oil concentrations between samples

198

and to reduce the effects of variations in the instrument sensitivity along the analysis of the batches.

32,

herein accounting for both chromatographic

199 200

3.2.

Data modeling using WPCA

201

The graphical scheme about the data modeling is shown in the lower part of Fig. 1. The pixel-

202

based analysis should facilitate the comparison of source oils and spill samples to distinguish oils

203

from different sources and to identify the source of the spilled oils. Chromatograms contain variations

204

that are unrelated to the chemical composition such as instrumental noise, residual retention time

205

shifts and chemical noise (e.g., column bleeding and peak saturation), which will negatively affect

206

the PC model quality. To mitigate these effects, WPCA was used where the weight vector w contains

207

the inverses of the pixel-by-pixel relative analytical standard deviations (RSDA), using only the EICs

208

from the analytical replicates and the QC samples (Fig. S1 in the Supplementary Material). Since all

209

elements in each column of D have the same weight, WPCA can be obtained via preprocessing

210

(namely, scaling each column by the corresponding element of w) 34,29. The extraction of information

211

from D was therefore improved by downscaling signals or pixels that contain little inherent chemical

9 ACS Paragon Plus Environment

Environmental Science & Technology

Page 10 of 29

212

information and up-scaling pixels with low analytical variation. PCA was performed on the scaled

213

and mean-centered D excluding sample replicates (‘training set’ consisting samples and QCs). The

214

10 analytical replicates were used for model validation (‘validation set’) to check for absence of

215

model overfitting, outliers during the chromatographic analysis and to calculate the accuracy of the

216

method (i.e. the confidence interval of the samples in the WPCA models). The number of PCs in the

217

models was chosen after considering only the PCs containing relevant chemical information,

218

neglecting the next PCs describing practically instrumental noise. The sample uncertainties in the

219

PCA models were estimated from the pooled average error of the scores (weighted by the explained

220

variance of each PC) from the QC samples and the predicted analytical replicates (validation set),

221

considering the respective degrees of freedom for their replicates. Initially, PCA models were fitted

222

considering each preprocessed (unfolded) 2D-EIC separately, and an F-test was performed to

223

statistically compare the overall variance of the models with the error variance. The 2D-sEICs with

224

low diagnostic power (i.e. when the overall variance and the error variance for that 2D-sEIC are not

225

significantly different, at the 95% confidence level) were excluded from the dataset. The relevant

226

unfolded 2D-sEICs were then re-combined into a new data matrix DN (K × N), where N contains only

227

the relevant signals, which was then modeled using WPCA for the forensic oil spill investigations.

228

The hypothesis about one spill potentially belonging to one suspected source was done in the PCs

229

subspace, after computing the averaged p-value (weighted by the explained variance of each PC)

230

from a two-sample hypothesis t-test performed using the corresponding scores (i.e. spill × source

231

diesel) from each relevant PC 28. The pooled standard deviation was calculated from the scores of the

232

analytical replicates and the QC samples. The null-hypothesis, i.e. the spill cannot be distinguished

233

from one suspected source with reasonable doubt, was accepted for p-value > 0.05 (i.e. confidence

234

level = 95 %). Due to the previous weighting of the variables in DN before WPCA, the

10 ACS Paragon Plus Environment

Page 11 of 29

Environmental Science & Technology

235

chromatographic loadings were re-scaled (i.e. element-wise multiplying by the corresponding

236

weights vector wN) for data interpretation 35.

237 238

4. Results and Discussions

239 240

4.1.

Identifying the evaporation weathered diesel spills

241

The first screening of the dataset was performed using WPCA on DN after excluding two non-

242

diagnostic EICs: m/z 180 (C1–Fluorenes) and m/z 184.05 (Dibenzothiophene), for which the overall

243

variance of the samples was not significantly larger than the error variance. Therefore, DN consisted

244

of a total of 30 unfolded and combined 2D-sEICs. The 4 component WPCA model explains 77.4%

245

of the variance, and, along PC1, spills are separated according to the degree of evaporation while

246

source diesels form three clusters characterized by different heaviness (boiling point region – Fig. 2).

247

The refolded PC1 2D-loadings (Fig. S2 – Supplementary material) are positive for naphthenes,

248

adamantanes and monoaromatics < C14, including the C0-, C1- and C2-naphtalenes, and C0-fluorene.

249

In contrast, compounds ≥ C14, incl. the targeted bicyclic sesquiterpanes (drimanes), and the higher

250

molecular weight PAHs have negative PC1 loadings (blue pixels). Therefore, the samples with the

251

largest negative PC1 scores (e.g., spills B-C7-s2, B-C8-s1 and B-C6-s1) have the lowest relative

252

concentration of the lighter compounds with < n-C15 and are the samples most affected by

253

evaporation. In contrast, samples with positive PC1 scores (e.g., B-C2-s2, B-C5-s2 and B-C5-s1)

254

have a higher than relative concentration of the lighter compounds and can therefore be concluded to

255

be less affected by evaporative weathering. The heaviness of spills not affected by evaporation can

256

also be described through their corresponding scores in this PC. For instance, the source diesels can

257

also be distinguished on this PC according to their relative abundance of light and heavy

258

hydrocarbons. No evidence of evaporation weathering is found for the diesels spills that are not 11 ACS Paragon Plus Environment

Environmental Science & Technology

Page 12 of 29

259

completely separated from the source diesels in the PC1 scores plot, e.g.; A-C3-s1, A-C3-s2 and A-

260

C4-s1. Therefore, two types of information are confounded in PC1, both are related to the difference

261

between the relative concentration of the heavier (> n-C15) and the lighter (≤ n-C15) hydrocarbons

262

in the diesel fingerprints: the most prominent regards the evaporation weathering of spill samples, the

263

other relates to the heaviness of the diesel oils.

264

PC2 describes chemical differences that can be used for source identification as samples with

265

highest negative PC2 scores have the highest relative concentration of PAHs (except for C0-

266

napthalene), mono-aromatic hydrocarbons > C16, and the lowest relative concentrations of FAMEs

267

(Fig. S3 in the Supplementary Material), vice-versa for samples with positive PC2 scores. FAMEs

268

are the main component of biodiesels blended with mineral diesels to reduce environmental pollution

269

36.

270

to the order; SD-3, SD-4, SD-5 < SD-6, SD-9, SD-11, SD-7, SD-12, SD-8 < SD-13, SD-1, SD-2, SD-

271

10.

The chemical fingerprints described by PC2 is ranking the environmental-friendly fuels according

272

The capability of 2D-CHEMSIC to sort the correlations among the chromatographic signals into

273

the PCs facilitated the identification of the main weathering effect and source information along the

274

combined 2D-sEICs simultaneously, without the need to evaluate the large amount of peaks obtained

275

by GC×GC individually.

276 277

4.2.

Handling evaporation weathering for spill-source correlations in diesels spills

278

To address evaporation in forensic spill-source correlation of diesel spills using 2D-CHEMSIC

279

the affected chromatographic sections were gradually excluded from the chromatograms according

280

to the number of carbons, fitting successive WPCA models after each step (Fig. S4 in the

281

Supplementary Material). The visual inspection of the samples in the PC1 and PC2 scores plot from

12 ACS Paragon Plus Environment

Page 13 of 29

Environmental Science & Technology

282

the models revealed the gradual mitigation of the evaporation weathering for all the affected spills.

283

The score plots of the model obtained after excluding chromatographic signals < n-C18, i.e. those

284

more susceptible to the evaporation weathering in the spills, are shown in Fig. 3. The chemical

285

similarity of the highly evaporated spills and the source diesels is enhanced after excluding the signals

286

from the most volatile hydrocarbons, because the distances between the corresponding scores in the

287

PCs subspace decreased. This result is very evident for the spills A-C2-s1, B-C1-s1, B-C6-s1, B-C7-

288

s1, B-C7-s2, B-C7-s3 and B-C8-s1, which were de-cluttered in the first WPCA model that was fitted

289

using the signals from the entire hydrocarbons range of the diesels.

290

Furthermore, source diesels from the same origin and spills from the same source and similar

291

degrees of weathering can be identified accordingly as their chemical similarities are still preserved

292

for the compounds eluting after n-C18; for example, source diesels SD-3, SD-4 and SD-5, and spills

293

B-C3-s1 and B-C3-s2 are still not statistically distinguished (confidence level of 95%) in the new

294

WPCA model (see Fig. 2 and Fig. 3). Interestingly, spill-source pairs that were different before the

295

mitigation of the evaporation weathering on the spills could be matched in this new WPCA model.

296

The potential matching of spill-source pairs was investigated while considering all the successive

297

WPCA models (see Fig. 3), and the individual spill cases are discussed in the sections 4.2.1 and 4.2.2.

298 299

4.2.1. Statistical spill-source forensic correlations for the diesel spills with potential suspects

300

The weathering effects on the spills for forensic oil-source correlations was initially evaluated

301

for the cases with specific diesel source suspects. The extent of the weathering was assessed based

302

on the distances between spill and source in the PCs subspace of the successive WPCA models, and

303

therefore without the necessity to evaluate individual peaks for each sample. In a positive match, the

304

scores distances between spills and sources in the WPCA models are expected to decrease whether

13 ACS Paragon Plus Environment

Environmental Science & Technology

Page 14 of 29

305

excluding more variables susceptible to weathering, until the spill – source pair becomes

306

indistinguishable for the models (i.e. the samples cannot be separated in the multivariate space

307

according to the analytical method’s accuracy). This trend will only be valid for highly-weathered

308

spills mostly affected by evaporation, after gradually excluding the chromatographic signals from the

309

most susceptible lighter hydrocarbons; conversely, spills subjected to the remaining weathering

310

effects that also affect heavier compounds, e.g.; biodegradation, dissolution and photo-oxidation, can

311

still be distinguishable from a true source in this variable-reduction approach. However, positive

312

matches for spill – source pairs can be obtained if the chemical changes of the spill can be explained

313

exclusively by weathering effects 1. The averaged scores distances between spills and the

314

corresponding suspected source(s) for these cases are depicted in Fig. 4. The mean distances of the

315

spill-source pairs decreased while excluding the signals up to n-C15 for spills A-C1-s1 (Fig. 4a) and

316

A-C2-s1 (Fig. 4b), but they are not statistically different along the successive WPCA models

317

according to the corresponding error bars. These profiles are coherent with evaporation weathering

318

in these spills affecting mostly the hydrocarbons up to n-C15, and a minor effect on the hydrocarbons

319

after n-C18. This diagnostic is corroborated with the increase of the chemical similarities of the

320

corresponding spill – source pairs when considering hydrocarbons > n-C18 exclusively. The two-

321

samples t-test statistics after considering only the hydrocarbons > n-C18 suggests both spills possibly

322

come from the respective suspected sources (p-value = 0.700 and 0.501 for A-C1-s1 and A-C2-s1,

323

respectively, confidence level 95%), see Fig. 4a) and Fig. 4b).

324

The case involving three spills (A-C3-s1, A-C3-s2 and A-C3-s3) and the similar source diesels

325

SD-3, SD-4 and SD-5 (i.e. they cannot be distinguished in the WPCA models) resulted that A-C3-s1

326

and A-C3-s2 spills differ from the source diesel suspects after n-C15, contrary to A-C3-s1 spill that

327

remains similar. The slight increase of the chemical similarity between A-C3-s1 and the suspect

328

source diesels after n-C17 (p-value > 0.05, at 95% confidence level) suggests the presence of only

14 ACS Paragon Plus Environment

Page 15 of 29

Environmental Science & Technology

329

short evaporation for this spill (Fig. 4c). Therefore, the positive match between this spill-source pair

330

cannot be confirmed without reasonable doubt, because the fingerprint of the hydrocarbons ≤ n-C18

331

(incl. pristane and phytane) are very different.

332

The scores distances computed for A-C4-s1 spill from the four suspected sources (i.e. SD-6, SD-

333

7, SD-8, SD-9) reveal that the chemical composition of the spill differs significantly from the diesels

334

sources after n-C19, except for SD-9, Fig. 4d). Contrary to the above-mentioned cases, there is no

335

evidence of remarkable evaporation weathering while comparing the scores distances along the

336

WPCA models. The hydrocarbon fingerprints of spill and the suspected sources cannot be

337

distinguished in the WPCA models until ≤ n-C17. In contrast, the corresponding spill – source pairs

338

differ for the SD-6, SD-7 and SD-8 (confidence level of 95%) for the WPCA models that considers

339

only the ≥ n-C17 hydrocarbons. The high chemical similarity between A-C4-s1 spill and the SD-9

340

diesel source along the entire hydrocarbon range indicates a positive spill-source match only for this

341

suspect (p-value = 0.299 ± 0.235). The drastic increase of the chemical similarity between the spill

342

and all four suspects after considering only the signals ≥ n-C23 is due to the low diagnostic power

343

occurring in this small hydrocarbon range when comparing the diesels.

344

While PC1 discriminates spills from different sources after excluding hydrocarbons most

345

susceptible to evaporation (i.e. ≤ n-C17), PC2 explains the photo-oxidation weathering in the spills.

346

The photo-oxidation effects on the spills can be mostly identified in the chromatograms by the

347

degradation of the pyrenes (yellow loadings for C0-, C1- and C2-pyrenes in PC2) and naphthenic

348

one-ring aromatics, and the differences between isomers distribution for the C2- and C3-

349

phenanthrenes 26, Fig. 5. The A-C1-s1 spill show a slight level of photo-oxidation in relation to SD1.

350

The A-C3-s2 spill shows the highest influence of photo-oxidation effects in case 3 (i.e. it contains the

351

most negative score value in PC2). Furthermore, the n-alkanes, isoprenoids, naphthenes and the one-

352

ring aromatics were more resistant to photo-oxidation (blue pixels in Fig. 5), which is in accordance

15 ACS Paragon Plus Environment

Environmental Science & Technology

Page 16 of 29

353

to laboratory experiments already published 37. Interestingly, the chromatographic loadings from the

354

one-ring aromatics showing interference of the naphthenic one-ring aromatics even in the HRMS data

355

demonstrate that GC×GC was crucial to separate these similar groups of hydrocarbons that may have

356

responded differently to photo-oxidation. Moreover, the more efficient analysis of the higher MW

357

hydrocarbons more resistant to photo-oxidation and to evaporation by GC×GC – HRMS is a

358

remarkable advantage when analyzing the whole fingerprint of the samples in oil spill investigations

359

of mid-distillates.

360

The profiles of score distances along the WPCA models is an interesting approach to simplify

361

the main sources of variance due to weathering that are affecting the chromatographic signals of the

362

spills for the individual cases. Additionally, the complex chromatographic fingerprints of the spills

363

can be compared more easily among the cases.

364 365

4.2.2. Validation of the cases with potential suspects according to the CEN/Tr method

366

The samples studied in this paper are derived from oil cases originally assessed with the

367

conventional CEN/Tr method 23 for oil spill identification. Similarly to the 2D-CHEMSIC method, a

368

match has been concluded between the samples A-C1-s1 and SD1, in which the spill showed some

369

evaporation up to n-C12, and pyrene and the C1- and C2-pyrenes are reduced to 70-80% by photo-

370

oxidation.

371

A non-match has been concluded between the samples A-C2-s1 and SD-2. Differences were

372

found above n-C30 for the alkanes and between the biomarkers present at a low concentration, which

373

suggests a small contamination of heavy oil in the spill sample. N-alkanes before n-C14 were reduced

374

by evaporation and compounds sensitive for photo-oxidation were slightly reduced. Because the

16 ACS Paragon Plus Environment

Page 17 of 29

Environmental Science & Technology

375

biomarkers were not studied with 2D-CHEMSIC the results are in accordance with the conclusion

376

based on Fig. 4b.

377

A match has been found between the diesel sources SD-3, -4 and -5. Samples were taken from

378

two diesel tanks and from the diesel daily tank. Compared with A-C3-s1, - s2 and –s3 a non-match

379

has been found for all spill samples. Fig. 4c shows clear differences for A-C3-s1 and –s2, but some

380

similarity with A-C3-s3. A-C3-s3 is not weathered by evaporation or photo-oxidation, but shows

381

clear differences for the isoprenoid ratios (n-C17/pristane, n-C18/ and pristane/phytane) and some other

382

compounds eluting around the same retention time section. This is confirmed by Fig. 4c in which

383

clear difference have been found above n-C14.

384

A match has been found between SD-9 and A-C4-s1. The spill sample showed some evaporation

385

up to n-C12, no indication of photo-oxidation and non-matches for the other 3 source samples. These

386

conclusions were also obtained with the 2D-CHEMSIC method.

387 388

4.2.3. Statistical spill-source correlations for the diesel spills without potential suspects

389

The spill cases without specific potential suspects were evaluated to find the diesel source(s)

390

candidate in our dataset that could be similar candidate in a forensic investigation. The diagnostics of

391

these spill cases without specific sources provided by 2D-CHEMSIC are supported by the previous

392

validation done with the CEN-Tr method for the cases with suspected sources. Diesel spill samples

393

from cases 2, 4, 5, and 7 (Group B) cluster in the scores plot of the WPCA model in Fig. 2. This may

394

indicate that the source of the spill samples are very similar and they have similar degrees of

395

evaporation weathering, e.g.; B-C2-s1 and B-C2-s2 (case 2), B-C3-s1 and B-C3-s2 (case 3), B-C5-

396

s1 and B-C5-s2 (case 5), B-C7-s1 and B-C7-s3 (case 7). Moreover, B-C1-s1 and B-C4-s1 cannot be

397

distinguished to the spill samples from cases 3 and 7, respectively, despite they belong to different

17 ACS Paragon Plus Environment

Environmental Science & Technology

Page 18 of 29

398

cases. The B-C2-s1 and B-C2-s2 spills most probably come from a diesel source similar to SD-6, and

399

no potential suspect could be identified for the spills in case 5 (i.e. B-C5-s1 and B-C5-s2).

400

Interestingly, the photo-oxidation weathering described by PC2 emphasizes the different photo-

401

oxidation levels between the spills in case 2, in which B-C2-s1 is more photo-oxidized than B-C2-s2

402

even they have similar evaporation levels. The spills from cases 1, 3, 4 and 7 are chemically similar

403

after minimizing the influence of the evaporation weathering, according to the MPCA models; they

404

most probably come from diesel source(s) similar to SD-9 or SD-12 (Fig. 2), with no evidence of

405

high photo-oxidation.

406

The highly evaporated B-C6-s1 spill most probably comes from a suspect diesel sources similar

407

to SD-10 (p-value = 0.251 ± 0.149 after excluding ≤ n-C17), while B-C8-s1 spill should be similar to

408

the SD-1 (p-value = 0.351 ± 0.261, after excluding ≥ n-C18) or SD-13 (p-value = 0.310 ± 0.247, after

409

excluding ≥ n-C18). There is no evidence of significant photo-oxidation effects on these spills.

410 411

Acknowledgments

412

The São Paulo Research Foundation (FAPESP, grants 2015/08201-0 and 2016/18453-0) is

413

acknowledged for the research funding.

414 415

Supporting Information

416

The supplementary material contains the diagnostic m/z from the main type of compounds in the

417

diesels, the weight vector for each chromatographic pixel, the loadings from the general WPCA

418

model, and the cutoffs used to mitigate the evaporation weathering effects in the diesels.

419

18 ACS Paragon Plus Environment

Page 19 of 29

5. References

420 421

(1)

Wang, Z.; Stout, S. A.; Fingas, M. Forensic Fingerprinting of Biomarkers for Oil Spill Characterization and Source Identification. Environ. Forensics 2006, 7 (2), 105–146.

422 423

Environmental Science & Technology

(2)

Bayona, J. M.; Domínguez, C.; Albaigés, J. Trends in Environmental Analytical Chemistry

424

Analytical Developments for Oil Spill Fingerprinting. Trends Environ. Anal. Chem. 2015, 5,

425

26–34.

426

(3)

Gros, J.; Reddy, C. M.; Aeppli, C.; Nelson, R. K.; Carmichael, C. A.; Arey, J. S. Resolving

427

Biodegradation Patterns of Persistent Saturated Hydrocarbons in Weathered Oil Samples from

428

the Deepwater Horizon Disaster. Environ. Sci. Technol. 2014, 48 (3), 1628–1637.

429

(4)

Using Light Biomarkers. Environmental Forensics. 2016, 17 (3), 244–252.

430 431

Malmborg, J.; Nordgaard, A. Forensic Characterization of Mid-Range Petroleum Distillates

(5)

Wang, Z.; Yang, C.; Fingas, M.; Hollebone, B.; Peng, X.; Hansen, A. B.; Christensen, J. H.

432

Characterization, Weathering, and Application of Sesquiterpanes to Source Identification of

433

Spilled Lighter Petroleum Products. Environ. Sci. Technol. 2005, 39 (22), 8700–8707.

434

(6)

Yang, C.; Wang, Z. D.; Hollebone, B.; Brown, C. E.; Landriault, M.; Fieldhouse, B.; Yang, Z.

435

Y. Application of Light Petroleum Biomarkers for Forensic Characterization and Source

436

Identification of Spilled Light Refined Oils. Environ. Forensics 2012, 13 (4), 298–311.

437

(7)

Drimane-Based Bicyclic Sesquiterpanes. Environ. Forensics 2005, 6 (3), 241–251.

438 439

(8)

(9)

Marriott, P.; Shellie, R. Principles and Applications of Comprehensive Two-Dimensional Gas Chromatography. Trends Anal. Chem. 2002, 21 (02), 573–583.

442 443

Malmborg, J. Examining the Weathering Susceptibility of Light Biomarkers. Environ. Forensics 2017, 18 (3), 197–206.

440 441

Stout, S. A.; Uhler, A. D.; McCarthy, K. J. Middle Distillate Fuel Fingerprinting Using

(10)

Mogollón, N. G. S.; Prata, P. S.; dos Reis, J. Z.; Neto, E. V. dos S.; Augusto, F.

444

Characterization of Crude Oil Biomarkers Using Comprehensive Two-Dimensional Gas

445

Chromatography Coupled to Tandem Mass Spectrometry. J. Sep. Sci. 2016, 39 (17), 3384–

446

3391.

447

(11)

Laakia, J.; Casilli, A.; Araújo, B. Q.; Gonçalves, F. T. T.; Marotta, E.; Oliveira, C. J. F.; 19 ACS Paragon Plus Environment

Environmental Science & Technology

Page 20 of 29

448

Carbonezi, C. A.; Loureiro, M. R. B.; Azevedo, D. A.; Aquino Neto, F. R. Characterization of

449

Unusual Tetracyclic Compounds and Possible Novel Maturity Parameters for Brazilian Crude

450

Oils Using Comprehensive Two-Dimensional Gas Chromatography-Time of Flight Mass

451

Spectrometry. Org. Geochem. 2017, 106, 93–104.

452

(12)

Casilli, A.; Silva, R. C.; Laakia, J.; Oliveira, C. J. F.; Ferreira, A. A.; Loureiro, M. R. B.;

453

Azevedo, D. A.; Aquino Neto, F. R. High Resolution Molecular Organic Geochemistry

454

Assessment of Brazilian Lacustrine Crude Oils. Org. Geochem. 2014, 68, 61–70.

455

(13)

Jennerwein, M. K.; Eschner, M.; Gröger, T.; Wilharm, T.; Zimmermann, R. Complete Group-

456

Type Quantification of Petroleum Middle Distillates Based on Comprehensive Two-

457

Dimensional Gas Chromatography Time-of-Flight Mass Spectrometry (GC×GC-TOFMS) and

458

Visual Basic Scripting. Energy & Fuels 2014, 28 (9), 5670–5681.

459

(14)

Shi, X.; Li, H.; Song, Z.; Zhang, X.; Liu, G. Quantitative Composition-Property Relationship

460

of Aviation Hydrocarbon Fuel Based on Comprehensive Two-Dimensional Gas

461

Chromatography with Mass Spectrometry and Flame Ionization Detector. Fuel 2017, 200,

462

395–406.

463

(15)

Vendeuvre, C.; Ruiz-Guerrero, R.; Bertoncini, F.; Duval, L.; Thiébaut, D.; Hennion, M. C.

464

Characterisation

465

Chromatography (GC x GC): A Powerful Alternative for Performing Various Standard

466

Analysis of Middle-Distillates. J. Chromatogr. A 2005, 1086 (1–2), 21–28.

467

(16)

of

Middle-Distillates

by

Comprehensive

Two-Dimensional

Gas

B. Gaines, R.; S. Frysinger, G.; M. Reddy, C.; K. Nelson, R. Oil Spill Source Identification by

468

Comprehensive Two-Dimensional Gas Chromatography (GC × GC). In Oil Spill

469

Environmental Forensics; Elsevier, 2007, pp 169–206.

470

(17) Yang, C.; Kim, M.; Brown, C. E.; Wang, Z.; Yang, Z.; Hollebone, B. P.; Lambert, P. Oil

471

Fingerprinting Analysis Using Gas Chromatography-Quadrupole Time-of-Flight (GC-QTOF).

472

In Standard Handbook Oil Spill Environmental Forensics: Fingerprinting and Source

473

Identification: Second Edition; Elsevier, 2016; pp 449–480.

474

(18)

369–411.

475 476

Wang, Z.; Fingas, M.; Page, D. S. Oil Spill Identification. J. Chromatogr. A 1999, 843 (1),

(19)

Christensen, J. H.; Tomasi, G.; Scofield, A. de L.; Meniconi, M. de F. G. A Novel Approach

20 ACS Paragon Plus Environment

Page 21 of 29

Environmental Science & Technology

477

for Characterization of Polycyclic Aromatic Hydrocarbon (PAH) Pollution Patterns in

478

Sediments from Guanabara Bay, Rio de Janeiro, Brazil. Environ. Pollut. 2010, 158 (10), 3290–

479

3297.

480

(20)

Gallotta, F. D. C.; Christensen, J. H. Source Identification of Petroleum Hydrocarbons in Soil

481

and Sediments from Iguaçu River Watershed, Paraná, Brazil Using the CHEMSIC Method

482

(CHEMometric Analysis of Selected Ion Chromatograms). J. Chromatogr. A 2012, 1235, 149–

483

158.

484

(21)

Lübeck, J. S.; Poulsen, K. G.; Knudsen, S. B.; Soleimani, M.; Furbo, S.; Tomasi, G.;

485

Christensen, J. H. Source Apportionment of Polycyclic Aromatic Hydrocarbons (PAHs) in

486

Sediments from Khuzestan Province, Iran. Mar. Pollut. Bull. 2016, 110 (1), 584–590.

487

(22)

Al-Kaabi, N. S.; Kristensen, M.; Zouari, N.; Solling, T. I.; Bach, S. S.; Al-Ghouti, M.;

488

Christensen, J. H. Source Identification of Beached Oil at Al Zubarah, Northwestern Qatar. J.

489

Pet. Sci. Eng. 2017, 149, 107–113.

490

(23)

CEN/TR15522-2: Oil Spill Identification. Waterborne Petroleum and Petroleum Products.

491

Analytical Methodology and Interpretation of Results Based on GC-FID and GC-MS Low

492

Resolution Analyses. 2012, 138.

493

(24)

Fingerprinting. Trends Environ. Anal. Chem. 2015, 5, 26–34.

494 495

Bayona, J. M.; Domínguez, C.; Albaigés, J. Analytical Developments for Oil Spill

(25)

Byer, J. D.; Siek, K.; Jobst, K. Distinguishing the C3 vs SH4 Mass Split by Comprehensive

496

Two-Dimensional Gas Chromatography-High Resolution Time-of-Flight Mass Spectrometry.

497

Anal. Chem. 2016, 88 (12), 6101–6104.

498

(26)

Albaigés, J.; Bayona, J. M.; Radović, J. R. Photochemical Effects on Oil Spill Fingerprinting.

499

In Standard Handbook Oil Spill Environmental Forensics: Fingerprinting and Source

500

Identification: Second Edition; Elsevier, 2016, 917-959.

501

(27) Li, S.; Cao, J.; Hu, S. Analyzing Hydrocarbon Fractions in Crude Oils by Two-Dimensional

502

Gas Chromatography/Time-of-Flight Mass Spectrometry under Reversed-Phase Column

503

System. Fuel 2015, 158, 191–199.

504 505

(28)

Christensen, J. H.; Hansen, A. B.; Tomasi, G.; Mortensen, J.; Andersen, O. Integrated Methodology for Forensic Oil Spill Identification. Environ. Sci. Technol. 2004, 38 (10), 2912–

21 ACS Paragon Plus Environment

Environmental Science & Technology

2918.

506 507

(29)

(30)

(31) Zhang, D.; Huang, X.; Regnier, F. E.; Zhang, M. Two-Dimensional Correlation Optimized Warping Algorithm for Aligning GC x GC-MS Data. Anal. Chem. 2008, 80 (8), 2664–2671.

512 513

(32)

(33)

Tomasi, G.; Savorani, F.; Engelsen, S. B. Icoshift: An Effective Tool for the Alignment of Chromatographic Data. J. Chromatogr. A 2011, 1218 (43), 7832–7840.

516 517

Skov, T.; Van Den Berg, F.; Tomasi, G.; Bro, R. Automated Alignment of Chromatographic Data. J. Chemom. 2006, 20, 484–497.

514 515

Albaigés, J.; Kienhuis, P. G. M.; Dahlmann, G. Oil Spill Identification. In Handbook of Oil Spill Science and Technology; John Wiley & Sons, 2015; pp 165–204.

510 511

Christensen, J. H.; Tomasi, G. Practical Aspects of Chemometrics for Oil Spill Fingerprinting. J. Chromatogr. A 2007, 1169 (1–2), 1–22.

508 509

Page 22 of 29

(34)

Christensen, J. H.; Hansen, A. B.; Karlson, U.; Mortensen, J.; Andersen, O. Multivariate

518

Statistical Methods for Evaluating Biodegradation of Mineral Oil. J. Chromatogr. A 2005,

519

1090 (1), 133–145.

520

(35)

(1), 16–33.

521 522

Bro, R.; Smilde, A. K. Centering and Scaling in Component Analysis. J. Chemom. 2003, 17

(36)

Yang, Z.; Hollebone, B. P.; Wang, Z.; Yang, C.; Brown, C.; Landriault, M. Forensic

523

Identification of Spilled Biodiesel and Its Blends with Petroleum Oil Based on Fingerprinting

524

Information. J. Sep. Sci. 2013, 36 (11), 1788–1796.

525 526

(37)

Dutta, T. K.; Harayama, S. Analysis of Long-Side-Chain Alkylaromatics in Crude Oil for Evaluation of Their Fate in the Environment. Environ. Sci. Technol. 2001, 35 (1), 102–107.

527 528 529 530

22 ACS Paragon Plus Environment

Page 23 of 29

Environmental Science & Technology

Table 1. Source diesel and spills samples. Sample code n-Ci-sj refers to the n-th group (A or B), i-th case within the same group and the j-th sample of the i-th case. SD-k refers to the k-th diesel source. Case Group

A

531

Sample A-C1-s1 SD-1 A-C2-s1 SD-2 A-C3-s1 A-C3-s2 A-C3-s3 SD-3 SD-4 SD-5 A-C4-s1 SD-6 SD-7 SD-8 SD-9

Type

Sampling*

Spill Source Diesel Spill Source Diesel Spill Spill Spill Source Diesel Source Diesel Source Diesel Spill Source Diesel Source Diesel Source Diesel Source Diesel

surface water as gasoil water using the EFTE net

water using the EFTE net

water using the EFTE net

SD-10 Source Diesel SD-11 Source Diesel SD-12 Source Diesel SD-13 Source Diesel B-C1-s1 Spill water using the EFTE net B-C2-s1 Spill water using the EFTE net B-C2-s2 Spill B-C3-s1 Spill water using the EFTE net B B-C3-s2 Spill bilge water from ship using the EFTE net B-C4-s1 Spill bilge water from ship using the EFTE net B-C5-s1 Spill water using the EFTE net B-C5-s2 Spill B-C6-s1 Spill water using the EFTE net B-C7-s1 Spill B-C7-s2 Spill water using the EFTE net B-C7-s3 Spill B-C8-s1 Spill water using the EFTE net * Source Diesels were obtained directly from tanks as gasoil.

23 ACS Paragon Plus Environment

Environmental Science & Technology

Graphic Abstract 84x42mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 24 of 29

Page 25 of 29

Environmental Science & Technology

Figure 1. Scheme of the data preprocessing (section 3.2) and next data modeling by pixel-based analysis using WPCA (section 3.3). The 2D-EIC (rt1D, rt2D) groups are described in Table S1. 2D-COW (2D – Correlation Optimized Warping) and icoshift (interval Co-Shifting) are algorithms used to correct retention time shifts among the samples chromatograms. D(K, L) is the data matrix for WPCA containing the unfolded and next row-wisely concatenated 2D-sEIC groups, which is plotted and colorized for each sample. The signals from the first three 2D-sEICs (i.e. I, II and III) and from all the PAH in D are also highlighted in the figure. The dimensions K and L are the total number of samples and variables (i.e. chromatographic pixels from all the 2D-sEICs), respectively. Data was normalized to the Euclidian norm = 1 (i.e. total area of the row-wise chromatograms) before the data analysis. w(1, L) is the weights vector computed as pixels-bypixel 1/RSD(QCs), RSD = relative standard deviation and QCs = (unfolded) 2D-sEICs from the QC samples. DN(K, N) is the data matrix containing only the 2D-sEICs with significant S/N ratio, N = number of variables. 136x128mm (300 x 300 DPI)

ACS Paragon Plus Environment

Environmental Science & Technology

Figure 2. PC1 and PC2 scores plot from the WPCA model while identifying the different degrees of weathering of the spills and distinguishing the source diesels. The explained variances of the PCs are expressed between the brackets. The samples were assigned as i-Cj-sk; i refers to the group of spill cases in which the suspected source(s) within each case is(are) present (A) or absent (B). j and k are the number of the case in the corresponding group and the number of the sample within the j-th case, respectively. SD-l refers to the l-th diesel source. 293x89mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 26 of 29

Page 27 of 29

Environmental Science & Technology

Figure 3. PC1 and PC2 scores plot from the WPCA model after excluding the chromatographic signals < nC18. The areas inside the solid rectangles were zoomed for clarification (dashed rectangles). The explained variances of the PCs are shown in the brackets. The samples were assigned as i-Cj-sk; i refers to the group of spill cases in which the suspected source(s) within each case is(are) present (A) or absent (B). j and k are the number of the case in the corresponding group and the number of the sample within the j-th case, respectively. SD-l refers to the l-th diesel source. 242x113mm (300 x 300 DPI)

ACS Paragon Plus Environment

Environmental Science & Technology

Figure 4. Averaged scores distances (weighted by the percentage of the variance explained by each PC) between spills and the suspected source(s) in the successive WPCA models computed after gradually excluding the chromatographic signals from the lighter hydrocarbons more susceptible to evaporation weathering. “all” represents WPCA models fitted using the entire 2D chromatograms. The values in the plot are the averaged p-value computed from a two-sample Student’s t-test for the spill belonging to the corresponding source. a) A-C1-s1 (spill) paired with SD-1 (source diesel), b) A-C2-s1 (spill) paired with SD2 (source diesel), c) A-C3-s1, A-C3-s2 and A-C3-s3 (spills) paired with SD-3, SD-4 or SD-5 (source diesels); here, the p-values are the averages from the 3 source diesels that cannot be distinguished by the method, i.e. they most probably come from a common source. d) A-C4-s1 (spill) paired with SD-6, SD-7, SD-8 or SD-9 (source diesels). The error bars in the plots are the averaged errors of the score values computed from the analytical replicates and QC samples (weighted by the percentage of the explained variance of each PC). The samples were assigned as i-Cj-sk; i refers to the group of spill cases in which the suspected diesel source(s) within each case is(are) present (A) or absent (B). j and k are the number of the case in the corresponding group and the number of the sample within the j-th case, respectively. SD-l refers to the l-th diesel source. 330x252mm (300 x 300 DPI)

ACS Paragon Plus Environment

Page 28 of 29

Page 29 of 29

Environmental Science & Technology

Figure 5. Two-dimensional chromatographic PC2 loadings highlighting the degradation of phenanthrene isomers and pyrenes by photo-oxidation (i.e. yellow pixels in the corresponding loadings) from WPCA after excluding chromatographic signals from hydrocarbons < n-C18, which were more susceptible to the evaporation weathering. 236x105mm (300 x 300 DPI)

ACS Paragon Plus Environment