Subscriber access provided by University of Newcastle, Australia
Article
A Fluorescence Approach for the Determination of Fluorescent Dissolved Organic Matter Chen Qian, Long-Fei Wang, Wei Chen, Yan-Shan Wang, Xiao-Yang Liu, Hong Jiang, and Han-Qing Yu Anal. Chem., Just Accepted Manuscript • DOI: 10.1021/acs.analchem.7b00324 • Publication Date (Web): 02 Mar 2017 Downloaded from http://pubs.acs.org on March 4, 2017
Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.
Analytical Chemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.
Page 1 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
A Fluorescence Approach for the Determination of Fluorescent Dissolved Organic Matter
Chen Qian, Long-Fei Wang, Wei Chen, Yan-Shan Wang, Xiao-Yang Liu, Hong Jiang, Han-Qing Yu* CAS Key Laboratory of Urban Pollutant Conversion, Department of Chemistry, University of Science and Technology of China, Hefei, 230026, China
1
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
1
Excitation-emission matrix (EEM) fluorescence spectroscopy coupled with parallel
2
factor analysis (PARAFAC) has been widely applied to characterize dissolved organic
3
matter (DOM) in aquatic and terrestrial systems. However, its application in
4
environmental samples is limited because PARAFAC is not able to handle
5
non-trilinear EEM data, leading to the overestimated number of components and
6
incorrect decomposition results. In this work, a new method, Parallel Factor
7
Framework-Clustering Analysis (PFFCA), is proposed to resolve this problem. First,
8
simulated data with different signal to noise ratios and intensities of non-trilinear
9
structure were tested to confirm the robustness of PFFCA. The residual sum of
10
squares (RSS) of PARAFAC was significantly higher than that of PFFCA (P0.92) closer to actual EEM than PARAFAC (R2>0.81).
14
Finally, to confirm the feasibility of PFFCA in analyzing natural samples,
15
DOM-containing samples collected from both a polluted lake and river were tested,
16
indicating that PFFCA provides a more precise estimation than PARAFAC. The
17
results clearly indicate that PFFCA offers a robust approach for the unique
18
decomposition of complex synthetic and natural samples, which is of great
19
significance in understanding the characteristics of DOM in aqueous systems.
20
2
ACS Paragon Plus Environment
Page 2 of 33
Page 3 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
21
INTRODUCTION
22 23
Dissolved organic matter (DOM) is a mixture of complex and heterogeneous
24
compounds with varying solubility and reactivity. Ubiquitous DOM plays critical
25
roles in global biochemical/geochemical cycling and significantly affects the fate and
26
transport of pollutants. Great efforts have been made to analyze and characterize
27
DOM using multiple methods and instruments including infrared spectroscopy,1,
28
nuclear magnetic resonance,1, 3, 4 and fluorescence spectroscopy.4-6 Among the various
29
techniques, excitation-emission matrix (EEM) fluorescence spectroscopy coupled with
30
parallel factor (PARAFAC) analysis can provide useful information regarding the
31
fluorescence characteristics, especially for complex samples.7-9 It has been used as a
32
powerful tool to characterize chromophoric DOM in environmentals due to its high
33
rapidity, selectivity and sensitivity.9-11 PARAFAC analysis is a tensor decomposition
34
method that decomposes the original data (intensity × emission × excitation) into
35
trilinear components, with each component consisting of one score vector and two
36
loading vectors (emission and excitation spectrums). The score vector of each
37
component corresponds to its relative concentration. By using PARAFAC analysis,
38
complex EEM fluorescence spectra can be separated into several components, usually
39
indicating the compositions of tested samples.12, 13 This mathematical approach is of
40
great significance because the overlapped spectroscopy of various contributors can be
41
well separated, facilitating understanding of the behaviors of each component in
3
ACS Paragon Plus Environment
2
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
42
environmental changes.9, 14
43
When data are approximately low-rank trilinear and the fluorescent groups are
44
independent, the general idea is that the components decomposed from the data
45
correspond to the actual components in PARAFAC analysis. However, this concept is
46
not valid in environmental samples because of the highly variant structure,
47
conformation, and heterogeneity, as well as intra- and inter-molecular interactions of
48
DOM.4, 15 Dissolved humic acid (HA), one of the most important types of DOM in
49
aquatic environments, is a mixture of many molecules, some of which are based on a
50
motif of aromatic nuclei with phenolic and carboxylic substituents linked together.16, 17
51
The fluorescent groups in HA are not independent, suggesting that PARAFAC analysis
52
may decompose HA into more than a single component, each representing merely a
53
proportion of HA. This phenomenon has been observed in many previous studies, in
54
which some components of HA derived from PARAFAC analysis showed a positive
55
linear correlation (R2 > 0.8).18-23 Evidently, it is inappropriate to treat the highly
56
correlated fluorophores as individual component. Component does not mean a certain
57
substance but an identifiable unit, which has different fates or behaviors with each
58
other. On one hand, if two substances come from the same source and change
59
synchronously, correlation could not be used to identify which one is responsible for
60
the changes. Although the impacts of substances can be inferred from their structure
61
or chemical compositions, UV/visible humic acid-like or marine humic acid-like
62
substances cannot provide useful information regarding their structure or chemical
4
ACS Paragon Plus Environment
Page 4 of 33
Page 5 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
63
composition. Therefore, it is unnecessary to distinguish two substances with highly
64
correlated concentrations. As a result, the number of components estimated by
65
PARAFAC analysis might not be the actual component number (two examples are
66
provided in Supporting Information to support this statement). The misestimate of the
67
component number inevitably leads to totally different decomposition results, which
68
remains one of the major defects of PARAFAC analysis.
69
For a given sample with fixed compositions, a unique decomposition of the
70
fluorophores is essential to make fair comparisons. Nevertheless, a unique
71
decomposition cannot always be acquired when performing PARAFAC on complex
72
environmental samples. The decomposition result of one sample may change when it is
73
decomposed with different sample series, which is another major limitation for
74
PARAFAC applications. A unique decomposition in PARAFAC analysis can be
75
acquired if all spectra and concentration profiles are linearly independent.14 A
76
non-unique result will result in misunderstanding sample compositions and
77
fluorophore changes, which may notably affect or alter some previous conclusions.
78
The main reason for these two barriers is that the PARAFAC approach can only
79
be applied in treating linearly dependent fluorophores, and the analysis of
80
non-trilinear structure is beyond the limits of PARAFAC. In other words, sloping peak
81
in the EEM landscape destroys the independence of excitation and emission spectra.
82
Therefore, the acquisition of a unique decomposition result from data sets that do not
83
conform to trilinear structure should be of great concern. It was observed that the use
5
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
84
of more components in Diltiazem contributed to a more correct and robust
85
concentration estimation.24 Consequently, a new separation method is required with an
86
aim at accurately determining the actual component number by eliminating the
87
PARAFAC limitation of requiring trilinear results. To date, there is no such
88
information about how to determine the number of components and combine the
89
collinear components to estimate the actual components in the samples.
90
In this work, a new method, called Parallel Factor Framework-Clustering
91
Analysis (PFFCA), is proposed, which includes two steps: data decomposition and
92
component formation. First, 24 simulated data sets with different intensities of
93
non-trilinear structure and signal to noise ratios (SNR) were tested to investigate the
94
robustness when handling non-trilinear data. Second, synthetic and environmental
95
samples were tested with the PFFCA approach. One mixture of HA and bovine serum
96
albumin (BSA) was used to test the feasibility of PFFCA in processing synthetic
97
samples. Finally, two environmental sample series from a polluted lake and river were
98
examined in an attempt to test whether the decomposition could provide a unique
99
result for environmental samples. For all of the above cases, PARAFAC analysis with
100
a core consistency diagnostic and split-half analysis was also examined for
101
comparison. The reason why the core consistency diagnostic is used is given in SI.
102
To avoid confusion of the components decomposed from the data and the actual
103
components in the samples, in the following sections, the components decomposed
104
from the data will be called “factors”, and the actual components in the samples will
6
ACS Paragon Plus Environment
Page 6 of 33
Page 7 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
105
be called “components”.
106 107
MATERIALS AND METHODS
108 109
Principles of PFFCA. Theoretically, excitation and emission wavelengths are
110
independent, and PARAFAC is thus a reasonable tool. However, when dealing with
111
complex samples such as DOM, the presupposition is no longer valid, and the only
112
way to distinguish different components is the derived scores. The PFFCA method
113
principally assumes that all of the components are independent and that the dataset is
114
decomposed into many factors. Afterwards, the factors whose scores are highly
115
correlated should be combined into one component because they individually
116
represent only a proportion of the component. Two steps make up the basic idea of
117
PFFCA showing in Supporting Information. PFFCA modeling was carried out in
118
MATLAB 7.10.0 (Mathworks, Natick, MA, USA) with the N-way toolbox 3.1.25
119
PFFCA is available on via the Internet at http://pubs.acs.org/.
120
Simulated Fluorescence Data Acquisition. Two peaks (Peaks A and B)
121
conformed to trilinear structure were defined to simulate fluorescent peaks (Figure 1a).
122
Peak C did not conform to trilinear structure and was designed to simulate the first or
123
second Rayleigh scatterings or complex fluorescent substances, e.g., HA. All of these
124
peaks were simulated by a two-dimensional Gaussian function:
125
f, = · exp −
+
7
ACS Paragon Plus Environment
(1)
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
126
where the coefficient M is the amplitude indicating the “concentration” of each peak,
127
(xo, yo) is the center, and σx and σy represent the standard deviations of the
128
concentrations in the x and y dimensions, respectively. The coefficients of Peaks A, B
129
and C are listed in Table S1. The data were composed of 20 EEM landscapes, each
130
with a size of (70 × 60). We presume that x and y represent the emission wavelength
131
(Em) from 300 nm to 359 nm and excitation wavelength (Ex) from 280 nm to 349 nm,
132
respectively. Peaks A and B were both Gaussion peaks with the amplitudes (M)
133
changing sinusoidally and cosinoidally, respectively, while the Peak C was a 45o
134
rotated Gaussian peak with a constant amplitude in each EEM. The amplitude of Peak
135
C was kept constant in the 20 samples to eliminate the influence of rate-of-change of
136
Peak C. To examine the influence of Peak C, the amplitude of Peak C was tested from
137
0.2 to 1.7 with a 0.3 step increase in different datasets. To simulate the actual EEM
138
data as closely as possible, random noise was added to the EEM spectra and SNR
139
values at 10, 15, 20 and 50 were tested. The EEM matrixes of these three peaks were
140
added together to generate the combination EEMs. Therefore, 24 datasets with
141
different Peak C intensities and SNR were simulated, each with a size of (20 ×70 × 60)
142
(Table S2).
143
Fluorescence Data of an HA and BSA Mixture. HA and BSA were purchased
144
from Sigma-Aldrich Co., USA. Deionized water with resistivity higher than 18.2
145
MΩ/cm at 25 °C was used. The tested concentrations were from 0 to 25 mg/L for HA
146
and from 0 to 50 mg/L for BSA. Three-dimensional EEM fluorescence spectra were
8
ACS Paragon Plus Environment
Page 8 of 33
Page 9 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
147
recorded using an Aqualog spectrometer (Horiba Co., Japan). An orthogonal testing
148
with a total number of 36 EEM landscapes was produced (Table S3). The measured
149
emission spectra ranged from 211 to 618 nm with a 3.8-nm step increase, while the
150
excitation wavelength increased sequentially from 230 to 600 nm with a 3-nm step.
151
Inner filter effect (IFE) was corrected by the Aqualog software. To test the impact of
152
second-order Rayleigh scattering, each sample was measured with and without a low
153
pass filter with a 290-nm cutoff. The resulting data cube had a size of (36 × 125 × 124).
154
The EEM landscape of a sample containing 15 mg/L HA and 40 mg/L BSA is shown in
155
Figure 1b.
156
Fluorescence Data of Natural Water Samples. Two sample sets from natural
157
sources with different contamination degrees were tested. Samples from Chaohu Lake,
158
Anhui Province, China, representing low-contamination natural water, were collected
159
monthly from East Chaohu Lake (31°32.081’ N, 117°38.845’ E) and West Chaohu
160
Lake (31°38.341’ N, 117°21.416’ E) from March 2014 to January 2015. The second
161
sample series was collected from the Nanfeihe River, Anhui Province, China (with a
162
spatial distribution from 31°53.178' N, 117°13.074' E to 31°43.050' N, 117°24.246' E)
163
in a single sampling day, and was selected to represent highly contaminated natural
164
water.26, 27 Each sample of 500 mL was collected with polyethylene terephthalate
165
bottle and filtered with 0.22-µm cellulose acetate membrane immediately after
166
sampling. All samples were kept in a 4 oC refrigerator and tested in 48 h. The pH
167
values were 7.45±0.28 for the Lake samples and 6.93±0.21 for the River samples.
9
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
168
Considering no significant difference in pH for each sample, the sample pH value was
169
not adjusted before measurements to reveal the spectroscopic characteristic of each
170
sample as close to natural environment as possible. The two sample series are
171
respectively referred to as the Lake sample and the River sample in the following
172
discussion (Figure S1). Each sample was tested with an emission spectrum ranging
173
from 211 to 618 nm with a 3.8-nm step increase and an excitation wavelength
174
increasing sequentially from 264 to 600 nm with a 3-nm step. IFE was corrected by
175
the Aqualog software. For each sample, the analysis was conducted in triplicate. The
176
resulting data cube of the East Lake sample (dataset E) and the West Lake sample
177
(dataset W) had a size of (10 × 125 × 113). A dataset (dataset EW) including both the
178
East and West Lake samples (20 × 125 × 113) was also tested to validate whether the
179
PFFCA method can provide a unique decomposition result when decomposing
180
different sample series. The resulting data cube of the River sample (25× 125 × 113)
181
was divided into two sub-datasets (one with a size of 12× 125 × 113 containing all of
182
the even-numbered samples and the other with a size of 13× 125 × 113 containing all
183
of the odd-number samples).
184 185
RESULTS AND DISCUSSION
186 187
Robustness in Treating a Non-trilinear Dataset. Simulated data (the amplitude of
188
Peak C and SNR were 0.8 and 15, respectively) were first analyzed to test the
10
ACS Paragon Plus Environment
Page 10 of 33
Page 11 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
189
feasibility of dealing with non-trilinear data. After decomposition by PFFCA analysis,
190
8 factors were determined. The scores of these factors and the locations of these
191
factors in the EEM landscape are respectively shown in Figures S2a and S2b. F2
192
represents Peak A, F1 represents Peak B, and the combination of F3 to F8 is the
193
representation of Peak C. PARAFAC was also checked as a comparison. As shown in
194
Figure S3a-b, the number of factors determined by the core consistency diagnostic
195
was 3. Split-half analysis indicates that 3-factor model is valid. A paired samples t-test
196
was used to reflect the significance level between the actual and estimated
197
concentrations (Table 1). The p values of both methods were over 0.05, suggesting
198
that the estimated concentrations were not significantly different from the actual
199
concentrations. However, the average residual of the estimated concentrations and the
200
actual concentrations of PFFCA were less than that in PARAFAC analysis for Peaks A,
201
B and C, and p values were less than 0.05, suggesting that the residual of the PFFCA
202
method was significantly less than that of PARAFAC.
203
Although the t-test results indicate that the estimated and actual concentrations
204
were not significantly different in the PARAFAC method, the estimated
205
concentrations versus the actual concentrations of Peak A seemed to be an oval,
206
indicating that the series of Peak A had a slight phase-shift (Figure S3c-d). This result
207
shows that two samples with the same actual concentration may generate different
208
estimated concentrations simply because the location in the test sequence is different.
209
This might result from the interference of the non-trilinear structure. Thus, when
11
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
210
PARAFAC is applied in quantitative analysis, the potential risk of the interference of
211
non-trilinear structure should be strongly noted.
212
The residuals of actual and estimated landscapes are shown in Figure 2. As more
213
factors were used in PFFCA, the color bar in Figure 2 shows that the residuals of
214
PFFCA were smaller than those of PARAFAC by five to ten times, suggesting a less
215
distortion with the PFFCA method (Figure 2a-f). The item “distortion” is commonly
216
used in image processing and communication engineering to describe the alteration of
217
the original shape, peak, image, or waveform. Since our work was focused on the
218
improvement of data processing, this term was thus selected to represent the alteration
219
of the EEMs and concentration sequence. The residuals (actual landscape minus
220
estimated landscape) at simulated Ex/Em 330/340 nm were significantly positive and
221
the left and top sides were significantly negative (Figure 2b), implying that Peak A
222
moved slightly to the top left corner. For Peak B (Figure 2d), a new peak was
223
generated at simulated Ex/Em 325/330 nm, implying that the actual component was
224
unimodal, while the estimated component was multimodal. The distortion of
225
landscapes might be confusing when recognizing the assignment of the peaks. The
226
result indicates that it is feasible to use a core consistency diagnostic in determining
227
the correct component number when handling non-trilinear data, but the
228
decomposition results may be distorted because they have to conform to the trilinear
229
structure.
230
The impacts of the amplitude of Peak C and the SNR are also examined (Figure
12
ACS Paragon Plus Environment
Page 12 of 33
Page 13 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
231
S4). The RSS (residual sum of squares) of the landscape and the RSS of the amplitude
232
of the PFFCA method were less than those in PARAFAC. With an increase in the
233
amplitude of Peak C, the residuals notably increased in PARAFAC while exhibiting
234
no clear increase in PFFCA. The SNR had no significant effect on the residuals for
235
Peaks A and B (Figure S4a-d). However, with an increase in SNR, the residuals of
236
Peak C exhibited a significant decrease (Figures S4e-f). This result is consistent with
237
the fact that the high noise level will increase the residuals during fitting. Therefore,
238
the reduction percentage is diluted when the number of factors is increased. The
239
increase in the SNR tends to choose fewer factors, as confirmed in Table S4. As a
240
result, with more factors involved, PFFCA can provide more precise results and
241
overcome the limitation of trilinear structures faced by PARAFAC. The robustness in
242
dealing with non-trilinear datasets will provide PFFCA with more extensive
243
applications compared to PARAFAC analysis.
244
Determination of Component Number in Synthetic Samples. The EEM
245
spectra of the HA and BSA mixture were decomposed by the PFFCA method. Fifteen
246
factors were determined by the residuals reduction rate and clustered into 3
247
components shown in Figure 3a-c. These 3 components corresponded to BSA, HA
248
and second-order Rayleigh scattering, respectively. PARAFAC analysis with a core
249
consistency diagnostic was also tested as a comparison. The PARAFAC model with
250
three factors had a consistency index of 96.41%, which decreased to 69.05% and
251
37.94%, respectively, when four and five factors were considered. The residual
13
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
252
analysis indicated that the 3-factor model was adequate (Figure 3d-f), with the first
253
factor corresponding to BSA. However, the second and third factors corresponded to
254
only part of HA. The correlation coefficient scores of the second factor and the third
255
factor were both 0.930, indicating that these two factors might correspond to the same
256
component in the samples. There are two possible explanations for this result. First,
257
the impact of second-order Rayleigh scatterings destroyed the trilinear structure of
258
HA. Second, as a complex mixture of many different moieties containing fluorescent
259
fractions such as quinonyl, amino and many other aromatic groups, HA has many
260
fluorescent groups that all correspond to its concentration. In other words, these
261
fluorescent groups are not independent in HA samples.
262
To find out why HA was separated into 2 factors, all of the samples were tested
263
with the 290-nm-cutoff long pass filter. The PFFCA determined 2 components
264
clustered by 5 factors. These two components corresponded to BSA and HA,
265
respectively. The EEM landscape of these 2 components was similar to the
266
corresponding components in the result with second-order Rayleigh scatterings (R2
267
values of HA and BSA were 0.993 and 0.952, respectively, in Table 2). This result
268
indicate that PFFCA is robust even when the peaks are highly overlapped with
269
second-order Rayleigh scatterings.
270
With regard to PARAFAC, the 2-factor model had a consistency index of 99.99%
271
and fell to 76.28 and 30.75%, respectively, when three and four factors were
272
considered. Compared with the dataset without the filter, the absence of the
14
ACS Paragon Plus Environment
Page 14 of 33
Page 15 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
273
second-order Rayleigh scatterings reduced the core consistency index. This
274
observation confirmed that the non-trilinear structure may affect the correctness of
275
factor number estimation. The two factors in the 2-factor model represented BSA and
276
HA, respectively. The R2 values of the model landscapes with the EEM landscapes of
277
pure HA and BSA were 0.814 and 0.932, respectively (Table 2). The 3-factor model
278
had one factor corresponding to BSA and two factors corresponding to part of HA,
279
similar to the result without the filter. This 3-factor model was also validated by the
280
split-half analysis. The correlation coefficient of the score between the second factor
281
and the third factor was 0.970, higher than that without the filter. This difference
282
resulted from the impact of second-order Rayleigh scattering. Referring to the score of
283
the second and the third factors as weights, the weighted sum of their model
284
landscapes was calculated. The R2 values of these landscapes with the EEM
285
landscapes of pure HA and BSA were 0.911 and 0.935, respectively, which were
286
higher than those in the 2-factor model. This result indicates that HA may not conform
287
to trilinear structure. Taking the complexity of HA into account, the non-independence
288
of the fluorescent groups in HA might be a possible explanation of this observation.
289
Therefore, the factors whose scores are highly correlated should not be
290
considered as independent components, but only as part of one component, which is
291
the basic principle behind the PFFCA method. To reduce the potential risk of the
292
impact of the non-trilinear structure in the dataset, the first step decomposes the
293
dataset into many factors to reduce the RSS in an attempt to interpret the dataset more
15
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
294
correctly. Nevertheless, it increases the risk of overestimating the number of
295
components if each factor is treated as an independent component. In the second step,
296
the risk of mis-estimation can be eliminated because the factors that are highly
297
correlated will be clustered during the process. As a result, a better interpretation of
298
the dataset without overestimation of the components number can be achieved, and
299
the non-trilinear dataset can also be well handled using the PFFCA approach.
300
Unique Decomposition Results in Natural Water Samples. Three datasets of
301
the Lake samples were analyzed first. Although the factor numbers in Datasets E, W,
302
and EW were 11, 8, and 18, respectively, all of the datasets were decomposed into 2
303
components with PFFCA analysis. One of the components represented an HA-like
304
substance, and the other represented a protein-like substance.28 PARAFAC analysis
305
was also evaluated as a comparison, and two components were determined in all
306
datasets using a core consistency diagnostic and validated by the split-half analysis.
307
However, the difference between the components decomposed by the two methods
308
was remarkable (Figure S5).
309
In natural water samples, the EEM landscapes and concentrations of the HA-like
310
and protein-like substances were unknown, and therefore we cannot decide which
311
decomposition approach is superior by comparing the RSS of the concentrations or
312
the EEM landscapes between estimated values and real samples. Meanwhile, because
313
the concentrations and components are fixed in one sample, the decomposition result
314
should be the same, whether it is decomposed with other samples or not. Thus, the
16
ACS Paragon Plus Environment
Page 16 of 33
Page 17 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
315
decomposition results of East Lake samples from Dataset E should be equal to those
316
decomposed from Dataset EW. However, the decomposition results of the three
317
datasets were slightly different (Figures S5e-f and S6). The average residuals of
318
corresponding components of Dataset E and Dataset EW (or Dataset W and Dataset
319
EW) in each sample are shown in Figure 4. The residuals of PARAFAC (Figure 4e-h)
320
show some notable positive and negative peaks in the EEM landscape, implying that
321
the landscape of the peak was shifted as the dataset changed. Conversely, the residuals
322
of PFFCA (Figure 4a-d) were far less than those of PARAFAC, suggesting that
323
PFFCA outperformed PARAFAC analysis. These results demonstrated that samples in
324
PFFCA analysis did not affect each other during decomposition.
325
For the River samples, a 2-factor model was determined by a core consistency
326
diagnostic and validated using the split-half analysis when performing PARAFAC
327
analysis. One component represented a HA-like substance, and the other represented a
328
protein-like substance.4, 29 As shown in Figure S7e-h, the residuals of each sub-dataset
329
and corresponding samples in the total dataset showed a pair of evident positive and
330
negative peaks, indicating the shift of the peaks. For the protein-like substance, the
331
emission wavelength of the odd sub-dataset had a blue shift (Figure S7e), while a red
332
shift (Figure S7g) was observed for the even sub-dataset. For the HA-like substance, a
333
red shift was noticed for the odd dataset, and a blue shift in the even dataset was
334
clearly observed. Unlike in PARAFAC analysis, no notable peaks were found in the
335
residuals of the sub-dataset and corresponding samples in the total dataset (Figure
17
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
336
S7a-d), suggesting that the samples in PFFCA analysis did not affect each other
337
during decomposition.
338
One possible reason for this observation is that when the dataset was combined
339
with some new samples with slight differences in composition, the landscape of the
340
factors will shift to reduce the RSS (the reason for the peaks shift for PARAFAC when
341
decomposing with new samples is provide in SI). As a consequence, the estimates of
342
the original dataset will be inevitably changed, which may mislead our understanding
343
of the samples. However, by using adequate factors determined by the RSS reduction
344
rate, the PFFCA method can acquire better performance in reducing the RSS. When
345
combining with new samples, the original dataset will not be affected because it can
346
always be well decomposed. Although the landscape and even the numbers of the
347
factors are different, the subsequent clustering analysis will integrate them into one
348
single component, which avoids generating too many components that cannot be
349
interpreted.
350
Advantages of PFFCA over PARAFAC. The main differences between
351
PARAFAC and PFFCA are listed in Table S5. PARAFAC analysis extracts
352
components under the assumption that fluorescent substances are linearly independent.
353
In other words, it couples the data decomposition process and components
354
determination by simply making the number of components equal to the number of
355
linear independent factors. It is reasonable in simple situations, but in more
356
complicated environmental cases, the assumption is no longer valid because of the
18
ACS Paragon Plus Environment
Page 18 of 33
Page 19 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
357
complexity of fluorescent substance such as HA. In contrast, the PFFCA method
358
extracts the components by following a different assumption—that the fluorescent
359
groups whose intensities are highly correlated come from the same substance or are
360
affected by the same impact. It is unreasonable to separate them as independent
361
components before more information is known. Therefore, these factors should be
362
clustered together as a single component. Taking advantage of this novel strategy,
363
more factors can be involved to provide a better fitting and accordingly a unique
364
decomposition result.
365
Before performing conventional PARAFAC analysis, it is essential to remove
366
Rayleigh and Raman scatterings.30-32 For PFFCA analysis, it is not necessary to
367
remove scatterings before decomposition because the strategy can robustly treat
368
non-trilinear structures. The PFFCA method can effectively recognize and separate
369
them automatically after decomposition even when they are highly overlapped.
370
Significance of This Work. In the present study, a new method, PFFCA, is
371
proposed with a focus on non-trilinear EEM dataset decomposition. By using this
372
novel data-handling strategy, a robust, unique and interpretable decomposition result
373
can be approached even when the data are not linearly independent and overlap highly
374
with scatterings. Because components of natural water, such as DOM and HA, are
375
complex and may contain several fluorophores, PFFCA may be a better performing
376
alternative method for accurately revealing their internal components. Our work
377
yields new insights into the decomposition process and decouples the data
19
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
378
decomposition and components determination to treat synthetic and environmental
379
samples. In addition, as a framework, the two steps can be replaced by other suitable
380
methods. For example, PARAFAC can be replaced by other decomposition methods
381
such as alternating trilinear decomposition or self-weighted alternating trilinear
382
decomposition. Clustering analysis can be replaced by support vector machine,
383
artificial neural network or other machine learning algorithms. The replaceability of
384
this framework may help us to achieve a faster and more precise decomposition
385
method in DOM-related issues. Moreover, this robust method may find potential
386
applications in engineering fields. For example, chromophoric substances in soluble
387
microbial products and extracellular polymeric substances from biological wastewater
388
treatment plants could be better separated and interpreted by using PFFCA.
389 390
Nomenclature BSA
bovine serum albumin
DOM
dissolved organic matter
EEM
excitation-emission matrix
HA
humic acids
IFE
Inner filter effect
PARAFAC
parallel factor
PCA
principal component analysis
PFFCA
parallel factor framework-clustering analysis
RSS
residual sum of squares
SNR
signal to noise ratio
391
20
ACS Paragon Plus Environment
Page 20 of 33
Page 21 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
392 393 394
AUTHOR INFORMATION *Corresponding author: Prof. Han-Qing Yu, Fax: +86-551-63601592; E-mail:
[email protected] 395 396
ACKNOWLEDGEMENTS
397
We thank the National Natural Science Foundation of China (51538011), the
398
Collaborative Innovation Center of Suzhou Nano Science and Technology of the
399
Ministry of Education of China for the support of this study.
400 401
ASSOCIATED CONTENT
402
Supporting Information Available. Detailed descriptions about data processing
403
procesedures, Coefficients of simulated peaks (Table S1), The levels of SNR and
404
Amplitude (M) in 24 Samples (Table S2), The Concentration of HA and BSA in 36
405
Samples (Table S3), Impact of SNR and amplitude of Peak C to the number of factor
406
(Table S4), Comparison between PARAFAC and PFFCA (Table S5), Sampling points
407
of Chaohu Lake and Nanfeihe River (Figure S1), Decomposition result of the dataset
408
with the amplitude of Peak C at 0.2 and SNR of 15 (Figure S2), RSS of different
409
factors, core consistency of different factors and the estimated amplitude versus actual
410
amplitude of Peaks A and B (Figure S3), RSS of the amplitude and landscape of Peaks
411
A, B and C (Figure S4), the decomposition result of PFFCA and PARAFAC for
412
dataset EW and river samples (Figure S5), the decomposition results of dataset E and
21
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
413
W using PARAFAC (Figure S6), the average residuals landscape of the protein-like
414
and HA-like substances between the odd sub-dataset and the total dataset decomposed
415
by PARAFAC and PFFCA (Figure S7), Residual of simulated data with one and two
416
factors. Residual of pure HA samples with one and two factors (Figure S8), EEM with
417
the amplitude of HA and one gauss peak, estimated sequence of three factors
418
decomposed by PARAFAC, actual amplitude versus estimated relative amplitude of
419
two factors corresponding to HA and gauss peak (Figure S9), simulated peak with the
420
angle is 45 degree, the RSS of different angles and factors number from one to eight,
421
the relationship between angle and validated factor number (Figure S10), and the
422
average residuals landscape of Peak A and Peak B between sub-dataset and dataset
423
(Figure S11). PFFCA (PFFCA.zip) is available on at http://pubs.acs.org/. This
424
information is available free of charge via the Internet at http://pubs.acs.org/.
425 426
REFERENCES
427 428
1. Abdulla, H. A. N.; Minor, E. C.; Hatcher, P. G., Using Two-Dimensional
429
Correlations of C-13 NMR and FTIR To Investigate Changes in the Chemical
430
Composition of Dissolved Organic Matter along an Estuarine Transect. Environ Sci
431
Technol 2010, 44, (21), 8044-8049.
432
2. Chen, W.; Qian, C.; Liu, X. Y.; Yu, H. Q., Two-Dimensional Correlation
433
Spectroscopic Analysis on the Interaction between Humic Acids and TiO2
434
Nanoparticles. Environ Sci Technol 2014, 48, (19), 11119-11126.
435 436
3. Peuravuori, J., NMR spectroscopy study of freshwater humic material in light of supramolecular assembly. Environ Sci Technol 2005, 39, (15), 5541-5549. 22
ACS Paragon Plus Environment
Page 22 of 33
Page 23 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
437
4. Chen, W.; Westerhoff, P.; Leenheer, J. A.; Booksh, K., Fluorescence excitation -
438
Emission matrix regional integration to quantify spectra for dissolved organic matter.
439
Environ Sci Technol 2003, 37, (24), 5701-5710.
440
5. Cory, R. M.; McKnight, D. M., Fluorescence spectroscopy reveals ubiquitous
441
presence of oxidized and reduced quinones in dissolved organic matter. Environ Sci
442
Technol 2005, 39, (21), 8142-8149.
443
6. Murphy, K. R.; Butler, K. D.; Spencer, R. G. M.; Stedmon, C. A.; Boehme, J. R.;
444
Aiken, G. R., Measurement of Dissolved Organic Matter Fluorescence in Aquatic
445
Environments: An Interlaboratory Comparison. Environ Sci Technol 2010, 44, (24),
446
9405-9412.
447 448 449 450
7. Bro, R., PARAFAC. Tutorial and applications. Chemometr Intell Lab 1997, 38, (2), 149-171. 8. Nebbioso, A.; Piccolo, A., Molecular characterization of dissolved organic matter (DOM): a critical review. Anal Bioanal Chem 2013, 405, (1), 109-124.
451
9. Ishii, S. K. L.; Boyer, T. H., Behavior of Reoccurring PARAFAC Components in
452
Fluorescent Dissolved Organic Matter in Natural and Engineered Systems: A
453
Critical Review. Environ Sci Technol 2012, 46, (4), 2006-2017.
454
10. Yamashita, Y.; Jaffe, R., Characterizing the interactions between trace metals and
455
dissolved organic matter using excitation-emission matrix and parallel factor
456
analysis. Environ Sci Technol 2008, 42, (19), 7374-7379.
457
11. Murphy, K. R.; Hambly, A.; Singh, S.; Henderson, R. K.; Baker, A.; Stuetz, R.;
458
Khan, S. J., Organic Matter Fluorescence in Municipal Water Recycling Schemes:
459
Toward a Unified PARAFAC Model. Environ Sci Technol 2011, 45, (7), 2909-2916.
460
12. Li, W. H.; Sheng, G. P.; Liu, X. W.; Yu, H. Q., Characterizing the extracellular and
461
intracellular fluorescent products of activated sludge in a sequencing batch reactor.
462
Water Res 2008, 42, (12), 3173-3181.
463
13. Stedmon, C. A.; Bro, R., Characterizing dissolved organic matter fluorescence
464
with parallel factor analysis: a tutorial. Limnol Oceanogr-Meth 2008, 6, 572-579.
465
14. Andersen, C. M.; Bro, R., Practical aspects of PARAFAC modeling of fluorescence 23
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
466
excitation-emission data. J Chemometr 2003, 17, (4), 200-215.
467
15. Andrade-Eiroa, A.; Canle, M.; Cerda, V., Environmental Applications of
468
Excitation-Emission Spectrofluorimetry: An In-Depth Review I. Appl Spectrosc Rev
469
2013, 48, (1), 1-49.
470 471 472 473
16. Stevenson, F. J., Humus chemistry: genesis, composition, reactions. John Wiley & Sons: 1994. 17. Sutton, R.; Sposito, G., Molecular structure in soil humic substances: The new view. Environ Sci Technol 2005, 39, (23), 9009-9015.
474
18. Hambly, A. C.; Arvin, E.; Pedersen, L. F.; Pedersen, P. B.; Seredynska-Sobecka, B.;
475
Stedmon, C. A., Characterising organic matter in recirculating aquaculture systems
476
with fluorescence EEM spectroscopy. Water Res 2015, 83, 112-120.
477
19. McKnight, D. M.; Boyer, E. W.; Westerhoff, P. K.; Doran, P. T.; Kulbe, T.;
478
Andersen, D. T., Spectrofluorometric characterization of dissolved organic matter
479
for indication of precursor organic material and aromaticity. Limnol Oceanogr 2001,
480
46, (1), 38-48.
481
20. Baghoth, S. A.; Sharma, S. K.; Amy, G. L., Tracking natural organic matter (NOM)
482
in a drinking water treatment plant using fluorescence excitation-emission matrices
483
and PARAFAC. Water Res 2011, 45, (2), 797-809.
484
21. He, W.; Hur, J., Conservative behavior of fluorescence EEM-PARAFAC
485
components in resin fractionation processes and its applicability for characterizing
486
dissolved organic. Water Res 2015, 83, 217-226.
487
22. Osburn, C. L.; Handsel, L. T.; Mikan, M. P.; Paerl, H. W.; Montgomery, M. T.,
488
Fluorescence Tracking of Dissolved and Particulate Organic Matter Quality in a
489
River-Dominated Estuary. Environ Sci Technol 2012, 46, (16), 8628-8636.
490
23. Sanchez, N. P.; Skeriotis, A. T.; Miller, C. M., A PARAFAC-Based Long-Term
491
Assessment of DOM in a Multi-Coagulant Drinking Water Treatment Scheme.
492
Environ Sci Technol 2014, 48, (3), 1582-1591.
493
24. Leitao, J. M. M.; Simoes, E. F. C.; da Silva, J. C. G. E., PARAFAC based methods
494
for the analysis of Diltiazem drug excitation emission matrices of fluorescence 24
ACS Paragon Plus Environment
Page 24 of 33
Page 25 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
495
obtained by a derivatization reaction. Anal Methods-Uk 2011, 3, (12), 2758-2769.
496
25. Andersson, C. A.; Bro, R., The N-way Toolbox for MATLAB. Chemometr Intell
497
Lab 2000, 52, (1), 1-4.
498
26. Li, G. L.; Liu, G. J.; Zhou, C. C.; Chou, C. L.; Zheng, L. G.; Wang, J. Z., Spatial
499
distribution and multiple sources of heavy metals in the water of Chaohu Lake,
500
Anhui, China. Environ Monit Assess 2012, 184, (5), 2763-2773.
501
27. Liu, E. F.; Shen, J.; Yang, X. D.; Zhang, E. L., Spatial distribution and human
502
contamination quantification of trace metals and phosphorus in the sediments of
503
Chaohu Lake, a eutrophic shallow lake, China. Environ Monit Assess 2012, 184, (4),
504
2105-2118.
505
28. Sheng, G. P.; Yu, H. Q., Characterization of extracellular polymeric substances of
506
aerobic and anaerobic sludge using three-dimensional excitation and emission
507
matrix fluorescence spectroscopy. Water Res 2006, 40, (6), 1233-1239.
508
29. Stedmon, C. A.; Markager, S.; Bro, R., Tracing dissolved organic matter in aquatic
509
environments using a new approach to fluorescence spectroscopy. Mar Chem 2003,
510
82, (3-4), 239-254.
511
30. Bahram, M.; Bro, R.; Stedmon, C.; Afkhami, A., Handling of Rayleigh and Raman
512
scatter for PARAFAC modeling of fluorescence data using interpolation. J
513
Chemometr 2006, 20, (3-4), 99-105.
514 515
31. Eilers, P. H. C.; Kroonenberg, P. M., Modeling and correction of Raman and Rayleigh scatter in fluorescence landscapes. Chemometr Intell Lab 2014, 130, 1-5.
516
32. Li, Y.; Wu, H. L.; Yu, Y. J.; Zhang, S. R.; Chen, Y.; Tu, D. Z.; Nie, C. C.; Xu, H.; Yu,
517
R. Q., A novel method to handle Rayleigh scattering in three-way
518
excitation-emission fluorescence data. Anal Methods-Uk 2012, 4, (12), 3987-3996.
25
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Page 26 of 33
Table 1. T-test of the Estimates and Actual Amplitudes, and Residuals of PFFCA and PARAFAC P-values of comparing estimate average residuals and actual concentration PFFCA PARAFAC PFFCA PARAFAC P-values of comparing two methods peak A
0.973
0.563
0.004
0.034
1.05E-05
peak B
0.899
0.559
0.003
0.004
0.037
peak C
0.935
0.823
0.006
0.167
9.96E-08
26
ACS Paragon Plus Environment
Page 27 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
Analytical Chemistry
Table 2. R2 Values of Pure HA/BSA and Decomposition Components pure without filter with filter HA/BSA PFFCA PARAFAC,N=3 PFFCA without PFFCA 0.922/0.922 filter PARAFAC,N=3* 0.875/0.922 0.9/0.934 with filter
PFFCA PARAFAC,N=2 PARAFAC,N=3
0.932/0.972 0.814/0.932 0.911/0.935
0.993/0.952 0.874/0.908 0.969/0.927
0.913/0.896 0.867/0.962 0.946/0.964
*: N represents the number of factor.
27
ACS Paragon Plus Environment
0.94/0.949 0.988/0.962
PARAFAC,N=2
0.952/0.997
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure captions
Figure 1. (a) EEM landscape of the simulated data; and (b) a mixture of 15 mg/LHA and 40 mg/L BSA.
Figure 2. EEM residual landscape of (a) Peak A with PFFCA; (b) Peak A with PARAFAC; (c) Peak B with PFFCA; (d) Peak B with PARAFAC; (e) Peak C with PFFCA; and (f) Peak C with PARAFAC for the simulated dataset.
Figure 3. (a)–(c) 3 components decomposed by PFFCA; (d)–(f) 3 components decomposed by PARAFAC.
Figure 4. The average residuals landscape of the HA-like substance (a, e) and protein-like substance (b, f) between datasets E and WE. The average residuals landscape of HA-like substance (c, g) and protein-like substance (d, h) between datasets W and WE. (a)–(d) were decomposed by PFFCA, and (e)–(h) were decomposed by PARAFAC.
28
ACS Paragon Plus Environment
Page 28 of 33
Page 29 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
Figure 1
29
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 2
30
ACS Paragon Plus Environment
Page 30 of 33
Page 31 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
Figure 3
31
ACS Paragon Plus Environment
Analytical Chemistry
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Figure 4
32
ACS Paragon Plus Environment
Page 32 of 33
Page 33 of 33
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Analytical Chemistry
Table of Contents (TOC) Art
33
ACS Paragon Plus Environment