Pretreatment and Integrated Analysis of Spectral Data Reveal

Feb 3, 2015 - *E-mail: [email protected]. ... Application of Market Basket Analysis for the Visualization of Transaction Data Based on Human Lifest...
1 downloads 0 Views 1MB Size
Subscriber access provided by - Access paid by the | UCSF Library

Article

Pretreatment and Integrated Analysis of Spectral Data Reveal Seaweed Similarities Based on Chemical Diversity Feifei Wei, Kengo Ito, Kenji Sakata, Yasuhiro Date, and Jun Kikuchi Anal. Chem., Just Accepted Manuscript • DOI: 10.1021/ac504211n • Publication Date (Web): 03 Feb 2015 Downloaded from http://pubs.acs.org on February 11, 2015

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a free service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are accessible to all readers and citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

Analytical Chemistry is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

1

Pretreatment and Integrated Analysis of Spectral

2

Data Reveal Seaweed Similarities Based on

3

Chemical Diversity

4

Feifei Wei†, Kengo Ito‡, Kenji Sakada†, Yasuhiro Date†, ‡ and Jun Kikuchi*,†,‡,§,¶

5 6 7 8 9 10 11 12 13 14



RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku,

Yokohama 235-0045, Japan ‡

Graduate School of Medical Life Science, Yokohama City University, 1-7-29

Suehirocho, Tsurumi-ku, Yokohama 230-0045, Japan §

Biomass Engineering Research Program, RIKEN Research Cluster for Innovation, 2-1

Hirosawa, Wako 351-0198, Japan ¶

Graduate School of Bioagricultural Sciences and School of Agricultural Sciences,

Nagoya University, 1 Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan

15 16

[email protected]; [email protected]; [email protected];

17

[email protected]; [email protected]

18 19

* To whom correspondence should be addressed.

20

Tel: +49(89)63641603. Fax: +49(89)63646881. E-mail: [email protected]

21 22 1

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

23

Page 2 of 27

ABSTRACT

24

Extracting useful information from high dimensionality and large data sets is a major

25

challenge for data-driven approaches. The present study was aimed at developing novel

26

integrated analytical strategies for comprehensively characterizing seaweed similarities

27

based on chemical diversity. The chemical compositions of 107 seaweed and 2 seagrass

28

samples were analyzed using multiple techniques, including Fourier transform infrared

29

(FT-IR) and solid- and solution-state nuclear magnetic resonance (NMR) spectroscopy,

30

thermogravimetry-differential thermal analysis (TG-DTA), inductively coupled

31

plasma-optical emission spectrometry (ICP-OES), CHNS/O total elemental analysis, and

32

isotope ratio mass spectrometry (IR-MS). The spectral data were preprocessed using

33

non-negative matrix factorization (NMF) and NMF combined with multivariate curve

34

resolution-alternating least-squares (MCR-ALS) methods in order to separate individual

35

component information from the overlapping and/or broad spectral peaks. Integrated

36

analysis of the preprocessed chemical data demonstrated distinct discrimination of

37

differential seaweed species. Further network analysis revealed a close correlation

38

between the heavy metal elements and characteristic components of brown algae such as

39

cellulose, alginic acid, and sulfated mucopolysaccharides, providing a componential

40

basis for its metal-sorbing potential. These results suggest that this integrated analytical

41

strategy is useful for extracting and identifying the chemical characteristics of diverse

42

seaweeds based on large chemical data sets, particularly complicated overlapping

43

spectral data.

44 45

KEYWORDS

46

spectral data preprocessing, integrated analysis, seaweed diversity, peak separation 2

ACS Paragon Plus Environment

Page 3 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

47

INTRODUCTION

48

With the technological advances in high-throughput analysis, it is becoming

49

increasingly essential to understand complicated global biological and ecological systems

50

in both a hypothesis-driven and a data-driven manner.1,2 Data-driven approaches based

51

on bioinformatics tools and computational technologies enable researchers to

52

prospectively assess variables simultaneously rather than independently without explicit

53

knowledge.3-14 One of the main problems with data-driven research, however, is the need

54

to understand very large amounts of different data types (called “big data”) from a holistic

55

view at a high abstraction level. Big data means more than just mountains of data.

56

Therefore, it becomes particularly important to develop novel methodologies, such as

57

dimension reduction, to mine hidden patterns, unknown correlations, and other useful

58

information in high dimensionality and large data sets.

59

Seaweed has been used as a traditional food and an important source of fiber, minerals,

60

vitamins, polysaccharides, and iodine in eastern Asia for several centuries. Increasing

61

attention has recently been focused on the biomedical and pharmaceutical applications of

62

seaweed in drug development, because it is rich in bioactive metabolites with

63

anticoagulant, antiviral, antioxidant, antiallergic, anti-inflammatory, antiobesity, and

64

anticancer properties.15,16 Seaweed has also been widely used in the removal of

65

contaminants from industrial effluents due to its biosorptive capacity and metal-sorbing

66

potential.17 On the other hand, accumulation of toxic substances such as arsenic (As)

67

makes seaweed consumption a potential risk for human health.18 It seems likely that

68

further comprehensive characterization of seaweed will enhance our understanding of the

69

functional biological properties of seaweed and shed light on the development of natural

70

health foods. 3

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 4 of 27

71

Several measurement instruments have been applied to the chemical composition

72

analysis of seaweed, such as Fourier transform infrared (FT-IR) and solid- and

73

solution-state

74

thermogravimetry-differential thermal analysis (TG-DTA), inductively coupled

75

plasma-optical emission spectrometry (ICP-OES), CHNS/O total elemental analysis. Our

76

group recently investigated the temporal changes in the chemical composition of the

77

brown algae Sargassum fusiforme (S. fusiforme) during seasonal fluctuations based on a

78

variety of heterogeneous measurements of organic and inorganic chemical data.19-22 To

79

perform this analysis, an integrated analytical strategy was devised by combining a data

80

processing step for isolation of pure peaks via removal of noise and the separation of

81

overlapping signals with a pool of statistical analyses, including principle component

82

analysis (PCA), self-organizing maps (SOMs), and correlation network analysis, in order

83

to extract temporal signatures for the composition of S. fusiforme and explore multiple

84

biological interactions between components and the environment within aquatic

85

ecosystems.

nuclear

magnetic

resonance

(NMR)

spectroscopy,

86

We are also interested in the comprehensive characterization and evaluation of

87

seaweed biodiversity using integrated analytical approaches. Compared to the tracking of

88

temporal variations in the chemical composition of a single species, composition analysis

89

of diverse seaweed species is more complicated. The differentiation in species

90

composition may result in serious overlapping of spectral signals originally derived from

91

differential components, which may lead to erroneous data and misinterpretation in the

92

integrated data analysis. Therefore, the present study was aimed at further developing

93

novel data preprocessing techniques for integrated data analysis using various

94

measurement techniques in order to characterize huge data sets collected for seaweeds 4

ACS Paragon Plus Environment

Page 5 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

95

with species diversity.

96 97

MATERIALS AND METHODS

98

Samples. The 107 seaweed samples, including red, brown, and green algae, as well as

99

2 seagrass samples used in this study were collected from an intertidal area at Aburatsubo

100

in Miura City (35°16′ N, 139°62′ E) and Tenjin Island in Yokosuka City (35°22′ N,

101

139°60′ E), Kanagawa, Japan between August 2010 and April 2012, as shown in Table 1.

102

The pretreatment conditions for the samples have been described previously.19

103

Biomass Measurements. As a follow-up study, all of the samples were analyzed in the

104

same manner as previously reported.19,20,23-26 With more details, the 13C solid-state NMR

105

spectra were observed using a DRX-500 spectrometer (500 MHz Bruker-BioSpin,

106

Billerica, MA) operating at 125 MHz with a Bruker MAS VTN 500SB BL4 probe, using

107

a 4 mm cross polarization magic angle spinning (CP-MAS) probe head; the 1H-NMR of

108

Watergate spectra were acquired at 298 K on a 700 MHz Bruker Biospin NMR

109

instrument (AVENCEII-700) equipped with an inverse (proton coils closest to the

110

sample) gradient 5 mm Cryo 1H/13C/15N probe (Bruker Biospin, Rheinstetten, Germany);

111

the FT-IR spectra (650-4000 cm−1) were obtained using a Nicolet 6700 FT-IR

112

spectrometer (Thermo Fisher Scientific Inc., Waltham, MA) with KBr disks; the

113

thermogravimetric analysis was conducted using an EXSTAR TG/DTA 6300 (SII

114

Nanotechnology Inc., Tokyo, Japan) instrument; the ICP-OES analysis was conducted

115

using an SPS 5510 (SII Nanotechnology Inc., Tokyo, Japan) instrument with CCD

116

detector, with a range of wavelengths from 167 to 785 nm and 74 applicable elements; the

117

elemental analysis was performed with a CHNS/O analyzer (Vario Micro cube,

118

Elementar Analysensysteme GmbH, Hanau, Germany) using helium as the carrier gas; 5

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 6 of 27

119

the isotope ratio mass spectrometry (IR-MS) analysis was performed on an IsoPrime 100

120

(Jasco international Co., Ltd., Tokyo, Japan) in combination with an elemental analyzer

121

(Vario MICRO cube) in “CN mode,” and isotopic ratios of carbon and nitrogen in the

122

samples were measured as CO2 and N2 gases using IR-MS.

123

Multivariate Spectral Decomposition. The 1H and 13C CP-MAS NMR spectra were

124

manually phased and baseline collected. The 1H NMR spectra were normalized to the

125

DSS (4,4-dimethyl-4-silapentane-1-sulfonic acid) intensity. The

126

spectra were normalized to the total integral area. Intensity scores for the NMR spectra

127

were extracted from relative peaks using Topspin software (Bruker Biospin). The

128

non-negative matrix factorization (NMF) method was then applied in order to separate

129

overlapping spectral peaks, followed by multivariate curve resolution-alternating

130

least-squares (MCR-ALS) analysis using “R” software (http://www.r-project.org/). The

131

FT-IR spectra were normalized to the intensity of the 650 cm−1 peak and then produced

132

using the NMF and MCR-ALS methods. The TG-DTA data set was processed using the

133

NMF method for spectral decomposition.

13

C CP-MAS NMR

134

Statistical Analyses of the Integrated Data. PCA and partial least squares

135

discriminant analysis (PLS-DA) were used to explore any biomass component clustering

136

of R-based on intrinsic biochemical similarities between the seaweed species. Pearson’s

137

correlations for the integrated data were calculated using R software, and high correlation

138

coefficients (|r| > 0.95 between seaweed samples; |r| > 0.7 between multiple

139

measurements) were collected from all of the correlation coefficients and transformed

140

into a matrix of connections between sources and targets. The transformed data matrix

141

was imported to the network analysis software Gephi (https://gephi.org/). SOMs are

142

considered to be a nonlinear mapping technique that identifies clusters in an unsupervised 6

ACS Paragon Plus Environment

Page 7 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

143

way within data sets without the rigid assumptions of linearity or normality associated

144

with traditional statistical techniques. In this study, all SOM calculations were performed

145

using the Kohonen library on the R platform.

146 147

RESULTS AND DISCUSSION

148

Spectral Data Pretreatment. The solid-state NMR and FT-IR data were used to

149

evaluate intact biomass components; the TG-DTA data were used to characterize the

150

thermal decomposition profiles; the solution-state NMR data were used to evaluate the

151

metabolic profiles; and the ICP-OES, CHNS/O analysis, and IR-MS data were used to

152

examine the elemental contents of the samples.27-31 Before integrated data analysis, data

153

pre-processing was performed on the derivative thermogravimetry (DTG), FT-IR,

154

CP-MAS, and 1H-NMR spectral data to separate individual component information.

155

Spectral-editing techniques have been used as an effective method to detect different

156

functional chemical groups with overlapping chemical shifts.32,33 In addition to this, it

157

has been discussed that editing pulse sequence in solid-state NMR experiment allows to

158

be possible for absolute quantification of spectral components.34 However, it should be

159

noted that the absolute quantification is not definitely needed to identify the chemical

160

characteristics of diverse seaweeds based on large chemical data sets. In contrast, the

161

relative quantification based on the entire carbon information with less information loss

162

will lead to more comprehensive understanding of the complicated biological diversity

163

of the ecological systems.19,20,29-31 Therefore, in the present study, entire carbon

164

information, but not partial structure information from selective pulse sequences, was

165

obtained by decomposing peaks using NMF or NMF combined with MCR-ALS

166

methods. 7

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 8 of 27

167

Regarding the wave-fitting methods to separate spectral peaks, MCR-ALS with a

168

Gaussian shape constraint was previously applied for the multivariate spectral

169

decomposition of FT-IR, DTG, and CP-MAS data in order to monitor the temporal

170

variation in the chemical composition of the brown algae S. fusiforme.19,35 However, in a

171

complicated system including diverse seaweed species, MCR-ALS with a Gaussian

172

shape exhibited poor performance since the peak centers of the same component derived

173

from different seaweed species shifted obviously in different spectra (data not shown). In

174

addition, the independent components analysis (ICA) was also considered as a peak

175

separation method.36 However, a problem of ICA is that it will lead to negative values in

176

the decomposed results, which in fact have no meaning in the interpretation of NMR or

177

FT-IR spectral data. In the present study, the seaweed species diversity resulted in serious

178

overlapping of the signals originally derived from multiple measurements, which

179

required reasonable and effective data pretreatment. The NMF method is an algorithm

180

that decomposes multivariate data into a smaller number of basis functions and

181

encodings using non-negative constraints.37 It was first introduced by Paatero and

182

Tapper as a positive matrix factorization concept for estimating errors in widely varying

183

environmental data.38,39 Lee and Seung represented parts-based objects using NMF as

184

an effective multiplicative algorithm.40 Due to the non-negativity and sparseness

185

constraints, NMF has also been widely used in multidimensional data analyses.41-45

186

Here, we applied the parts-based NMF method for the first time to spectral data

187

pretreatment for the integrated analysis of a huge data set representing biodiversity.

188

For NMF models, it is important to determine the number of components in order to

189

capture the true underlying trends in big data. Several approximate and heuristic

190

techniques are available for obtaining the number of components, although in the 8

ACS Paragon Plus Environment

Page 9 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

191

present study, parameters residual sum of squares (RSS) and Durbin–Watson (DW)

192

criterion were calculated.36,46 As the name suggests, RSS is the sum of squared residuals

193

between an original and a reconstructed matrix for each model. Therefore, the model

194

with the lowest RSS value is expected to be the most representative. Meanwhile, the

195

DW criterion has been proposed as a measure of the signal/noise ratio, the value of

196

which approaches 0 when there is no noise in the signal and 2 when the signal contains

197

only noise.

198

The spectral decomposition of the DTG data is shown in Figure 1(A). The original

199

DTG spectrum appeared to be simple, but indeed contained both a large number of

200

peaks and noise due to the differential components in the complex mixture of seaweeds.

201

After spectral decomposition using NMF, the number of components were determined

202

to be 18 (DW ≤ 1.4, see Figure S1 in the Supporting Information), which indicated that

203

18 peaks were extracted and only a little noise was left, as revealed by RSS value. These

204

results confirmed the superiority of the NMF approach for extracting individual

205

component information from broad spectral peaks, which may be important but often

206

overlooked and unappreciated in data pretreatment using methods such as binning.

207

Furthermore, NMF coupled with MCR-ALS exhibited good performance for peak

208

separation of the FT-IR, CP-MAS, and 1H-NMR spectral data, which were much more

209

complicated than that of the DTG. As shown in Figures 1(B), (C), and (D), 92

210

component peaks, including broad peaks, were extracted from the FT-IR spectra, while

211

the CP-MAS and 1H-NMR spectra were decomposed into 62 and 54 peaks, respectively.

212

These spectral decomposition results obtained using NMF or NMF followed by

213

MCR-ALS indicated a thorough extraction of the peaks due to the main components,

214

and the spectral data were converted in a part-based manner. The component 9

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 10 of 27

215

information from the broad spectral peaks thus became acquirable, which provided

216

informative and appropriate input data for the following integrated data analyses.

217

Integrated Analysis of Seaweed Variety. The present study is aimed to develop

218

novel integrated analytical strategies to extract useful information from not only broad

219

signals but also overlapped peaks in the spectrum. Until recently, multivariate analysis

220

such as PCA has been widely applied using binning data without spectral decomposition.

221

However, if two components varied in opposite directions in overlapped peaks, no matter

222

how narrow the bin size was set they would be treated as one bin ignoring the inside

223

variation. In contrast, overlapped signals would be decomposed from the overlapping

224

parts as the wave manner, which is similar to their original way, and be treated as

225

different components by using NMF and/or MCR-ALS method. As shown in Figure S2,

226

a better discrimination of all seaweed species was observed in the score plots based on

227

the integrated data matrix with NMF/MCR-ALS peak decomposition (Figure S2(B)) than

228

that using unprocessed data matrix (bin data) (Figure S2(A)). Furthermore, the partial

229

least squares-discriminant analysis (PLS-DA) were performed on the preprocessed

230

chemical composition data matrix for all of the seaweed samples. As shown in Figure

231

S2 (C) in the Supporting Information, clear discrimination of the seaweed samples was

232

observed in the score plots, indicating that pretreatment of each spectral data set enabled

233

extraction and identification of the chemical characteristics of each sample. Pearson's

234

correlation coefficients between all of the seaweed samples were then calculated in

235

order to evaluate their biological similarity. As shown in Figure 2, seaweed samples

236

from the same species showed high correlation coefficients (|r| ≥ 0.95). For example, as

237

a species of brown algae, samples of S. fusiforme (aqua) together showed relatively high

238

correlation coefficients with the other brown algae samples (gray). The present results 10

ACS Paragon Plus Environment

Page 11 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

239

indicated that the proposed integrated analytical strategy using a variety of chemical

240

data can be effective for evaluating the biological similarity of diverse seaweeds.

241

Comprehensive Feature Extraction for Seaweed Similarity Based on Chemical

242

Diversity. The mathematical modeling in the present study provides a powerful approach

243

for promoting a deep understanding and comprehensive characterization of the

244

biodiversity and biological similarity of seaweeds. Because elemental stoichiometry may

245

be related to metabolomic changes,47 the CP-MAS, 1H-NMR, and ICP data were

246

expanded and highlighted in Figure 3. As shown in this figure, the correlation network

247

analysis for all of the brown seaweed components identified using CP-MAS, 1H-NMR,

248

and ICP analyses indicated that Cd had strong relationships with the 1H-NMR peaks at

249

3.14 and 3.78 ppm and the CP-MAS NMR peaks at 72.92, 73.06, 73.21, 73.35, 73.50,

250

73.79, and 73.94 ppm (|r| > 0.7). According to our previous study, these NMR spectral

251

peaks are possibly derived from characteristic components of brown algae such as

252

cellulose, alginic acid, and sulfated mucopolysaccharides.20 These results suggested that

253

the polysaccharide components of brown algae are closely related to its selective

254

biosorption capability for heavy metals.

255

Biosorption is a term that describes the removal of heavy metals via the passive

256

binding to nonliving biomass from an aqueous solution.15 Selective adsorption of toxic

257

heavy metals by brown algae has gained increasing interest due to its high efficiency.

258

Mechanical studies have emphasized the cell wall properties of brown algae such as

259

alginate and fucoidan, because both electrostatic attraction and complexation can

260

contribute to heavy metal chelation.20,48 Specific attention has been focused on the

261

Fucales, a major order of brown algae, because it is abundant in nature and includes the

262

most structurally complex seaweeds.49 As shown in Figure 4(A), brown algae samples 11

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 12 of 27

263

belonging to the order Fucales (labels 1, 3, 7, 9, 10, and 12) were assigned with similar

264

colors, indicating high values in the SOM plot based on their relationships with the heavy

265

metal cadmium (Cd). These data suggest that brown algae belonging to the order Fucales

266

have similar tendencies with respect to the selective biosorption of Cd, which is

267

consistent with the conclusions of previous studies.15

268

Furthermore, PCA and PLS-DA analyses demonstrated a very clear distinction

269

between the chemical composition of S. fusiforme and that of other brown algae species

270

(data shown in Figures S3 (A) and (B) of the Supporting Information). The loading plots

271

(see Figures S3 (C), (D) and (E)) revealed that mannitol, laminaran, fucoidan, alginate,

272

As, and Cd were identified as characteristic components of S. fusiforme, suggesting a

273

higher biosorption capability for Cd and As. Indeed, as can be seen in Figure 4(B), the

274

SOM plot based on the relationship with As indicated that the order Fucales, particularly

275

the species S. fusiforme (label 1), has a high biosorption capability for As. More

276

specifically, alginate is a family of linear polysaccharides containing 1,4-linked

277

β-D-mannuronic and α-L-guluronic acid residues arranged in a nonregular, blockwise

278

order along the chain.50 The unique composition of the alginates present in S. fusiforme

279

may represent a distinct advantage over other brown algae with respect to the binding of

280

divalent heavy metal ions.51 According to these structural characteristics, this seaweed

281

can concentrate more than 80% natural inorganic As and thus can be used as an effective

282

heavy metal sorbent.52,53

283 284

CONCLUSIONS AND OUTLOOK

285

Spectroscopic data such as NMR and FT-IR spectra have the advantage of enabling

286

fully quantitative analyses that provide quick, direct, and comprehensive observations, 12

ACS Paragon Plus Environment

Page 13 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

287

and these large quantities of information lead to a better understanding of complicated

288

global biological and ecological systems. However, one major problem is that

289

overlapping and broad spectral peaks derived from diverse components in a chemical

290

mixture lead to erroneous data and misinterpretations in integrated data analyses. The

291

current spectral partitioning process (the so called binning method) often overlooks

292

these complicated peaks, and thus much useful information is lost. In the present study,

293

a novel data preprocessing approach was developed based on NMF and MCR-ALS

294

methods for the extraction of valuable chemical information from spectral data in a

295

parts-based manner. Further multivariate statistical analysis and correlation network

296

analysis of the preprocessed data revealed seaweed similarities based on chemical

297

diversity. These findings suggest that the proposed analytical strategy is useful for

298

extracting and identifying individual component features from large chemical data sets,

299

which will shed light on the links between chemical data and the biological

300

consequences.

301 302

ASSOCIATED CONTENT

303

Supporting Information

304

Additional information as noted in the text. This material is available free of charge via

305

the Internet at http://pubs.acs.org.

306 307

AUTHOR INFORMATION

308

Corresponding Author

309

*E-mail: [email protected]. Tel: +49(89)63641603. Fax: +49(89)63646881.

310

Author Contributions 13

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 14 of 27

311

The manuscript was written with equal contribution from all of the authors. All the

312

authors have approved the final version of the manuscript.

313

Funding

314

This research was supported in part by Grants-in-Aid for Scientific Research (Grant No.

315

25513012) (to J.K.), and also partially supported by Council for Science, Technology and

316

Innovation (CSTI), Cross-ministerial Strategic Innovation Promotion Program (SIP),

317

“Technologies for creating next-generation agriculture, forestry and fisheries” funded

318

from Bio-oriented Technology Research Advancement Institution, NARO)

319

Notes

320

The authors have no conflict of interest to declare.

321 322

ACKNOWLEDGMENTS

323

The authors wish to thank Drs. Jiro Tanaka (Tokyo University of Marine Science and

324

Technology) and Yuji Omori (Yokosuka City Museum) for their valuable advice on

325

identification of a variety of seaweed species. The 252 numerical data used for the

326

integrated analysis can be provided on request.

327

14

ACS Paragon Plus Environment

Page 15 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

328

REFERENCES

329 330

(1) Pe'er, D.; Hacohen, N. Cell 2011, 2011 144, 864-873.

331

(2) Ogata, Y.; Chikayama, E.; Morioka, Y.; Everroad, R. C.; Shino, A.; Matsushima, A.;

332

Haruna, H.; Moriya, S.; Toyoda, T.; Kikuchi, J. PloS one 2012, 2012 7, e30263.

333

(3) Southam, A. D.; Lange, A.; Hines, A.; Hill, E. M.; Katsu, Y.; Iguchi, T.; Tyler, C. R.; Viant,

334

M. R. Environ Sci Technol 2011, 2011 45, 3759-3767.

335

(4) Kwon, Y. K.; Bong, Y. S.; Lee, K. S.; Hwang, G. S. Food Chem 2014, 2014 161, 168-175.

336

(5) Kwon, Y. K.; Jung, Y. S.; Park, J. C.; Seo, J.; Choi, M. S.; Hwang, G. S. Mar Pollut Bull

337

2012, 2012 64, 1874-1879.

338

(6) Kim, J.; Jung, Y.; Song, B.; Bong, Y. S.; Ryu do, H.; Lee, K. S.; Hwang, G. S. Food Chem

339

2013, 2013 137, 68-75.

340

(7) Ellis, R. P.; Spicer, J. I.; Byrne, J. J.; Sommer, U.; Viant, M. R.; White, D. A.; Widdicombe,

341

S. Environ Sci Technol 2014, 2014 48, 7044-7052.

342

(8) Lewis, S. J.; Foltynie, T.; Blackwell, A. D.; Robbins, T. W.; Owen, A. M.; Barker, R. A.

343

Journal of neurology, neurosurgery, and psychiatry 2005, 2005 76, 343-348.

344

(9) Zhang, G. F.; Sadhukhan, S.; Tochtrop, G. P.; Brunengraber, H. J Biol Chem 2011, 2011 286,

345

23631-23635.

346

(10) Asakura, T.; Date, Y.; Kikuchi, J. Anal Chem 2014, 2014 86, 5425-5432.

347

(11) Ogawa, D. M.; Moriya, S.; Tsuboi, Y.; Date, Y.; Prieto-da-Silva, A. R.; Radis-Baptista, G.;

348

Yamane, T.; Kikuchi, J. PloS one 2014, 2014 9, e110723.

349

(12) Larive, C. K. Anal Bioanal Chem 2007, 2007 387, 523.

350

(13) Barding, G. A., Jr.; Salditos, R.; Larive, C. K. Anal Bioanal Chem 2012, 2012 404, 1165-1179.

351

(14) Orr, D. J.; Barding, G. A., Jr.; Tolley, C. E.; Hicks, G. R.; Raikhel, N. V.; Larive, C. K.

352

Methods Mol Biol 2014, 2014 1056, 225-239.

353

(15) Davis, T. A.; Volesky, B.; Mucci, A. Water Res 2003, 2003 37, 4311-4330.

354

(16) MacArtain, P.; Gill, C. I. R.; Brooks, M.; Campbell, R.; Rowland, I. R. Nutr Rev 2007, 2007 65,

355

535-543.

356

(17) Sheng, P. X.; Ting, Y. P.; Chen, J. P.; Hong, L. Journal of colloid and interface science

357

2004, 2004 275, 131-141.

358

(18) Nakamura, Y.; Narukawa, T.; Yoshinaga, J. Journal of agricultural and food chemistry

359

2008, 2008 56, 2536-2540.

360

(19) Ito, K.; Sakata, K.; Date, Y.; Kikuchi, J. Anal Chem 2014, 2014 86, 1098-1105.

361

(20) Date, Y.; Sakata, K.; Kikuchi, J. Polym J 2012, 2012 44, 888-894.

362

(21) Leal, D.; Matsuhiro, B.; Rossi, M.; Caruso, F. Carbohyd Res 2008, 2008 343, 308-316. 15

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 16 of 27

363

(22) Larrea-Marin, M. T.; Pomares-Alfonso, M. S.; Gomez-Juaristi, M.; Sanchez-Muniz, F. J.;

364

de la Rocha, S. R. J Food Compos Anal 2010, 2010 23, 814-820.

365

(23) Sekiyama, Y.; Kikuchi, J. Phytochemistry 2007, 2007 68, 2320-2329.

366

(24) Sekiyama, Y.; Chikayama, E.; Kikuchi, J. Anal Chem 2010, 2010 82, 1643-1652.

367

(25) Komatsu, T.; Kikuchi, J. J Phys Chem Lett 2013, 2013 4, 2279-2283.

368

(26) Mori, T.; Chikayama, E.; Tsuboi, Y.; Ishida, N.; Shisa, N.; Noritake, Y.; Moriya, S.;

369

Kikuchi, J. Carbohyd Polym 2012, 2012 90, 1197-1203.

370

(27) Lattao, C.; Cao, X.; Li, Y.; Mao, J.; Schmidt-Rohr, K.; Chappell, M. A.; Miller, L. F.; dela

371

Cruz, A. L.; Pignatello, J. J. Environ Sci Technol 2012, 2012 46, 12814-12822.

372

(28) Mao, J. D.; Johnson, R. L.; Lehmann, J.; Olk, D. C.; Neves, E. G.; Thompson, M. L.;

373

Schmidt-Rohr, K. Environ Sci Technol 2012, 2012 46, 9571-9576.

374

(29) Okushita, K.; Chikayama, E.; Kikuchi, J. Biomacromolecules 2012, 2012 13, 1323-1330.

375

(30) Yamazawa, A.; Iikura, T.; Morioka, Y.; Shino, A.; Ogata, Y.; Date, Y.; Kikuchi, J.

376

Metabolites 2013, 2013 4, 36-52.

377

(31) Yamazawa, A.; Iikura, T.; Shino, A.; Date, Y.; Kikuchi, J. Molecules 2013, 2013 18, 9021-9033.

378

(32) Schmidt-Rohr, K.; Mao, J. D. J Am Chem Soc 2002, 2002 124, 13938-13948.

379

(33) Mao, J. D.; Schmidt-Rohr, K. J Magn Reson 2005, 2005 176, 1-6.

380

(34) Mao, J. D.; Schmidt-Rohr, K. Environmental Science & Technology 2004, 2004 38, 2680-2684.

381

(35) Karakach, T. K.; Knight, R.; Lenz, E. M.; Viant, M. R.; Walter, J. A. Magn Reson Chem

382

2009, 2009 47 Suppl 1, S105-117.

383

(36) Bouveresse, D. J. R.; Moya-Gonzalez, A.; Ammari, F.; Rutledge, D. N. Chemometr Intell

384

Lab 2012, 2012 112, 24-32.

385

(37) Guimet, F.; Boque, R.; Ferre, J. Chemometr Intell Lab 2006, 2006 81, 94-106.

386

(38) Paatero, P.; Tapper, U. Environmetrics 1994, 1994 5, 111-126.

387

(39) Paatero, P. Chemometr Intell Lab 1997, 1997 37, 23-35.

388

(40) Lee, D. D.; Seung, H. S. Nature 1999, 1999 401, 788-791.

389

(41) Hoyer, P. O. J Mach Learn Res 2004, 2004 5, 1457-1469.

390

(42) Pauca, V. P.; Piper, J.; Plemmons, R. J. Linear Algebra Appl 2006, 2006 416, 29-47.

391

(43) Wang, Y. X.; Zhang, Y. J. Ieee T Knowl Data En 2013, 2013 25, 1336-1353.

392

(44) Tikole, S.; Jaravine, V.; Rogov, V.; Dotsch, V.; Guntert, P. Bmc Bioinformatics 2014, 2014 15.

393

(45) Snyder, D. A.; Zhang, F.; Robinette, S. L.; Bruschweiler-Li, L.; Bruschweiler, R. J Chem

394

Phys 2008, 2008 128.

395

(46) Durbin, J.; Watson, G. S. Biometrika 1950, 1950 37, 409-428.

396

(47) Rivas-Ubach, A.; Sardans, J.; Perez-Trujillo, M.; Estiarte, M.; Penuelas, J. P Natl Acad

397

Sci USA 2012, 2012 109, 4181-4186.

398

(48) Bertagnolli, C.; Uhart, A.; Dupin, J. C.; da Silva, M. G.; Guibal, E.; Desbrieres, J. 16

ACS Paragon Plus Environment

Page 17 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

399

Bioresource technology 2014, 2014 164, 264-269.

400

(49) Surif, M. B.; Raven, J. A. Oecologia 1989, 1989 78, 97-105.

401

(50) Haug, A.; Larsen, B.; Smidsrod, O. Acta Chem Scand 1966, 1966 20, 183-&.

402

(51) Haug, A. Acta Chem Scand 1961, 1961 15, 1794-&.

403

(52) Perales-Vela, H. V.; Pena-Castro, J. M.; Canizares-Villanueva, R. O. Chemosphere 2006, 2006

404

64, 1-10.

405

(53) Sanders, J. G. Estuar Coast Mar Sci 1979, 1979 9, 95-99.

406

17

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

407

Page 18 of 27

Table 1. Algae Samples Used in the Study. sample ID

a

species (class No.)

b

order

sampling position

sampling date

F1

Sargassum fusiforme (1)

Fucales

Aburatsubo

2011.05.20

F2

S. fusiforme (1)

Fucales

Aburatsubo

2011.06.01

F3

S. fusiforme (1)

Fucales

Aburatsubo

2011.06.01

F4

S. fusiforme (1)

Fucales

Aburatsubo

2011.06.16

F5

S. fusiforme (1)

Fucales

Aburatsubo

2011.07.05

F6

S. fusiforme (1)

Fucales

Aburatsubo

2011.07.14

F7

S. fusiforme (1)

Fucales

Aburatsubo

2011.08.01

F8

S. fusiforme (1)

Fucales

Aburatsubo

2011.08.15

F9

S. fusiforme (1)

Fucales

Aburatsubo

2011.08.30

F10

S. fusiforme (1)

Fucales

Aburatsubo

2011.09.30

F11

S. fusiforme (1)

Fucales

Aburatsubo

2011.10.12

F12

S. fusiforme (1)

Fucales

Aburatsubo

2011.11.24

F13

S. fusiforme (1)

Fucales

Aburatsubo

2011.12.14

F14

S. fusiforme (1)

Fucales

Aburatsubo

2012.01.24

F15

S. fusiforme (1)

Fucales

Aburatsubo

2012.02.20

F16

S. fusiforme (1)

Fucales

Aburatsubo

2012.03.21

F17

S. fusiforme (1)

Fucales

Aburatsubo

2012.04.09

B1

Sargassum ringgoldianum (3)

Fucales

Aburatsubo

2011.06.16

B2

Sargassum ringgoldianum (3)

Fucales

Aburatsubo

2011.07.05

B3

Sargassum patens (7)

Fucales

Aburatsubo

2011.07.14

B4

Sargassum ringgoldianum (3)

Fucales

Aburatsubo

2011.07.14

B5

Sargassum hemiphyllum (9)

Fucales

Aburatsubo

2011.08.01

B6

Sargassum fulvellum (10)

Fucales

Aburatsubo

2011.08.01

B7

Sargassum ringgoldianum (3)

Fucales

Aburatsubo

2011.08.01

B8

Sargassum thunbergii (12)

Fucales

Tenjin Island

2010.08.11

B9

Padina arborescens (4)

Dictyotales

Aburatsubo

2011.07.05

B10

Dictyopteris undulata (6)

Dictyotales

Aburatsubo

2011.07.05

B11

Padina arborescens (4)

Dictyotales

Aburatsubo

2011.07.14

B12

Padina arborescens (4)

Dictyotales

Aburatsubo

2011.08.01

B13

Dictyopteris undulata (6)

Dictyotales

Aburatsubo

2011.08.01

B14

Dictyopteris undulata (6)

Dictyotales

Aburatsubo

2011.08.01

(to be continued)

18

ACS Paragon Plus Environment

Page 19 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

(continued) B15

Dictyopteris prolifera (11)

Dictyotales

Aburatsubo

2011.08.01

B16

Padina arborescens (4)

Dictyotales

Aburatsubo

2011.08.01

B17

Padina arborescens (4)

Dictyotales

Aburatsubo

2011.06.16

B18

Ishige okamurae (2)

Ishigeales

Aburatsubo

2011.06.16

B19

Ishige okamurae (2)

Ishigeales

Aburatsubo

2011.07.05

B20

Ishige okamurae (2)

Ishigeales

Aburatsubo

2011.07.14

B21

Ishige okamurae (2)

Ishigeales

Aburatsubo

2011.08.01

B22

Eisenia bicyclis (5)

Laminariales

Aburatsubo

2011.07.05

B23

Eisenia bicyclis (5)

Laminariales

Aburatsubo

2011.07.14

B24

Eisenia bicyclis (5)

Laminariales

Aburatsubo

2011.08.01

B25

Ecklonia cava (13)

Laminariales

Aburatsubo

2011.06.16

B26

Colpomenia sinuosa (8)

Scytosiphonales

Aburatsubo

2011.07.14

B27

Colpomenia sinuosa (8)

Scytosiphonales

Aburatsubo

2011.06.16

R1

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.06.16

R2

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.06.16

R3

Chondrus ocellatus

Gigartinales

Aburatsubo

2011.06.16

R4

Chondrus ocellatus

Gigartinales

Aburatsubo

2011.06.16

R5

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.06.16

R6

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.07.05

R7

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.07.05

R8

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.07.05

R9

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.07.05

R10

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.08.01

R11

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.08.01

R12

Chondrus verrucosus

Gigartinales

Aburatsubo

2011.07.14

R13

Prionitis cornea

Gigartinales

Aburatsubo

2011.06.16

R14

Prionitis cornea

Gigartinales

Aburatsubo

2011.06.16

R15

Prionitis cornea

Gigartinales

Aburatsubo

2011.07.05

R16

Prionitis cornea

Gigartinales

Aburatsubo

2011.07.05

R17

Grateloupia asiatica

Gigartinales

Aburatsubo

2011.07.05

R18

Grateloupia elliptica

Gigartinales

Aburatsubo

2011.07.05

R19

Prionitis cornea

Gigartinales

Aburatsubo

2011.07.14

R20

Grateloupia chiangii

Gigartinales

Aburatsubo

2011.07.14

(to be continued)

19

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 20 of 27

(continued) R21

Prionitis cornea

Gigartinales

Aburatsubo

2011.08.01

R22

Prionitis cornea

Gigartinales

Aburatsubo

2011.08.01

R23

Meristotheca papulosa

Gigartinales

Aburatsubo

2011.07.14

R24

Meristotheca papulosa

Gigartinales

Aburatsubo

2011.07.14

R25

Gelidium elegans

Gelidiales

Aburatsubo

2011.07.14

R26

Gelidium elegans

Gelidiales

Aburatsubo

2011.08.01

R27

Gelidium elegans

Gelidiales

Tenjin Island

2010.08.11

R28

Gelidium elegans

Gelidiales

Aburatsubo

2011.08.01

R29

Gracilaria textorii

Gracilariales

Aburatsubo

2011.06.16

R30

Gracilaria textorii

Gracilariales

Aburatsubo

2011.06.16

R31

Gracilaria textorii

Gracilariales

Aburatsubo

2011.07.05

R32

Gracilaria textorii

Gracilariales

Aburatsubo

2011.07.05

R33

Amphiroa zonata

Corallinales

Aburatsubo

2011.07.05

R34

Amphiroa zonata

Corallinales

Aburatsubo

2011.08.01

R35

Amphiroa zonata

Corallinales

Aburatsubo

2011.06.16

R36

Martensia jejuensis

Ceramiales

Aburatsubo

2011.07.14

R37

Martensia jejuensis

Ceramiales

Aburatsubo

2011.08.01

R38

Laurencia intermedia

Ceramiales

Aburatsubo

2011.08.01

R39

Lomentaria catenata

Rhodymeniales

Aburatsubo

2011.06.16

G1

Ulva pertusa

Ulvales

Aburatsubo

2011.06.16

G2

Ulva pertusa

Ulvales

Aburatsubo

2011.07.05

G3

Ulva pertusa

Ulvales

Aburatsubo

2011.07.14

G4

Ulva pertusa

Ulvales

Aburatsubo

2011.07.14

G5

Ulva pertusa

Ulvales

Aburatsubo

2011.08.01

G6

Ulva pertusa

Ulvales

Aburatsubo

2011.08.01

G7

Codium cylindricum

Codiales

Aburatsubo

2011.06.16

G8

Codium fragile

Codiales

Aburatsubo

2011.06.16

G9

Codium latum

Codiales

Aburatsubo

2011.07.05

G10

Codium fragile

Codiales

Aburatsubo

2011.07.05

G11

Codium cylindricum

Codiales

Aburatsubo

2011.07.05

G12

Codium fragile

Codiales

Aburatsubo

2011.07.14

G13

Codium fragile

Codiales

Aburatsubo

2011.07.14

G14

Codium fragile

Codiales

Aburatsubo

2011.07.14

(to be continued)

20

ACS Paragon Plus Environment

Page 21 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

(continued)

408 409 410

a

G15

Codium cylindricum

Codiales

Aburatsubo

2011.07.14

G16

Codium subtubulosum

Codiales

Aburatsubo

2011.07.14

G17

Codium cylindricum

Codiales

Aburatsubo

2011.08.01

G18

Codium fragile

Codiales

Aburatsubo

2011.08.01

G19

Codium fragile

Codiales

Aburatsubo

2011.06.16

G20

Codium latum

Codiales

Aburatsubo

2011.06.16

G21

Codium cylindricum

Codiales

Aburatsubo

2011.07.05

G22

Chaetomorpha crassa

Cladophorales

Aburatsubo

2011.07.14

G23

Chaetomorpha crassa

Cladophorales

Aburatsubo

2011.08.01

G24

Codium lucasii

Bryopsidales

Aburatsubo

2011.06.16

S1

Zostera marina

Helobiales

Aburatsubo

2011.07.05

S2

Zostera marina

Helobiales

Aburatsubo

2011.07.14

“F”, “B”, “R”, “G”, and “S” mean S. fusiforme, brown algae, brown algae, red algae,

green algae, and seagrass, respectively. b

the class No. of only brown algae is consistent with Figure 4.

411

21

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

412

Page 22 of 27

FIGURE LEGENDS

413 414

Figure 1. Multivariate spectral decomposition of (A) DTG, (B) FT-IR, (C) CP-MAS, and

415

(D) 1H-NMR spectra of S. fusiforme using NMF or NMF/MCR-ALS methods. Lines in

416

black: original data; lines in red: basis functions (loading plots) of the reconstructed

417

NMF or NMF/MCR-ALS models; lines in blue: RSS values for the original and

418

reconstructed models.

419 420

Figure 2. Similarity network for various seaweeds based on Pearson’s correlation

421

coefficients (|r| > 0.95).

422 423 424

Figure 3. Correlation network for brown algae components obtained using CP-MAS, 1

H-NMR, and ICP data (|r| > 0.7).

425 426

Figure 4. SOM plots of brown algae components based on their relationships with (A) Cd

427

and (B) As. The class number of brown algae samples corresponds to Table 1.

428

22

ACS Paragon Plus Environment

Page 23 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

429 430

Figure 1. Wei et al. 23

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 24 of 27

431 432 433

434 435

Figure 2. Wei et al.

436

24

ACS Paragon Plus Environment

Page 25 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

437 438 439

440 441

Figure 3. Wei et al.

442

25

ACS Paragon Plus Environment

Analytical Chemistry

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Page 26 of 27

443 444

445 446

Figure 4. Wei et al.

447 448

26

ACS Paragon Plus Environment

Page 27 of 27

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Analytical Chemistry

449

For TOC only

450 451 452

27

ACS Paragon Plus Environment