Screening Priority Factors Determining and ... - ACS Publications

forest model, the priority factors determining reproductive toxicity were successfully. 25 screened from highly heterogeneous data. Among ten factors ...
0 downloads 0 Views 2MB Size
Subscriber access provided by NEW MEXICO STATE UNIV

Characterization of Natural and Affected Environments

Screening Priority Factors Determining and Predicting the Reproductive Toxicity of Various Nanoparticles Zhan Ban, Li Mu, Qixing Zhou, Anqi Sun, and Xiangang Hu Environ. Sci. Technol., Just Accepted Manuscript • DOI: 10.1021/acs.est.8b02757 • Publication Date (Web): 30 Jul 2018 Downloaded from http://pubs.acs.org on July 30, 2018

Just Accepted “Just Accepted” manuscripts have been peer-reviewed and accepted for publication. They are posted online prior to technical editing, formatting for publication and author proofing. The American Chemical Society provides “Just Accepted” as a service to the research community to expedite the dissemination of scientific material as soon as possible after acceptance. “Just Accepted” manuscripts appear in full in PDF format accompanied by an HTML abstract. “Just Accepted” manuscripts have been fully peer reviewed, but should not be considered the official version of record. They are citable by the Digital Object Identifier (DOI®). “Just Accepted” is an optional service offered to authors. Therefore, the “Just Accepted” Web site may not include all articles that will be published in the journal. After a manuscript is technically edited and formatted, it will be removed from the “Just Accepted” Web site and published as an ASAP article. Note that technical editing may introduce minor changes to the manuscript text and/or graphics which could affect content, and all legal disclaimers and ethical guidelines that apply to the journal pertain. ACS cannot be held responsible for errors or consequences arising from the use of information contained in these “Just Accepted” manuscripts.

is published by the American Chemical Society. 1155 Sixteenth Street N.W., Washington, DC 20036 Published by American Chemical Society. Copyright © American Chemical Society. However, no copyright claim is made to original U.S. Government works, or works produced by employees of any Commonwealth realm Crown government in the course of their duties.

Page 1 of 45

Environmental Science & Technology

TOC 84x47mm (300 x 300 DPI)

ACS Paragon Plus Environment

Environmental Science & Technology

1

Screening Priority Factors Determining and Predicting the Reproductive

2

Toxicity of Various Nanoparticles

3

Zhan Ban†, Qixing Zhou†, Anqi Sun†, Li Mu‡, Xiangang Hu†,*

4



5

Education)/Tianjin Key Laboratory of Environmental Remediation and Pollution

6

Control, College of Environmental Science and Engineering, Nankai University,

7

Tianjin 300350, China

8



9

Laboratory for Environmental Factors Control of Agro-Product Quality and Safety

10

(Ministry of Agriculture), Institute of Agro-Environmental Protection, Ministry of

11

Agriculture, Tianjin 300191, China

Key Laboratory of Pollution Processes and Environmental Criteria (Ministry of

Tianjin Key Laboratory of Agro-Environment and Agro-Product Safety, Key

12 13

Corresponding author: Xiangang Hu, [email protected]

14

Fax: 0086-022-23507800

15

Tel.: 0086-022-23507800

16 17 18 19 20 1

ACS Paragon Plus Environment

Page 2 of 45

Page 3 of 45

Environmental Science & Technology

21

ABSTRACT

22

Due to the numerous factors (e.g., nanoparticle [NP] properties and experimental

23

conditions) influencing nanotoxicity, it is difficult to identify the priority factors

24

dominating nanotoxicity. Herein, by integrating data from the literature and a random

25

forest model, the priority factors determining reproductive toxicity were successfully

26

screened from highly heterogeneous data. Among ten factors from more than eighteen

27

different NPs, the NP type and the exposure pathway were found to dominantly

28

determine NP accumulation. The reproductive toxicity of various NPs primarily

29

depended on the NP type and the toxicity indicators. Nanoparticles containing major

30

elements (e.g., Zn and Fe) tended to accumulate in rats but induced lower toxicity

31

than NPs containing noble elements. Compared with other exposure pathways, i.p.

32

injection posed significantly higher risks for NP accumulation. By combining

33

similarity network analysis and hierarchical clustering, the sources of highly

34

heterogeneous data were identified, the factor-toxicity dependencies were extracted

35

and visualized, and the prediction of nanotoxicity was then achieved based on the

36

screened priority factors. The present work provides insights for the design of animal

37

experiments and the illustration and prediction of nanotoxicity.

38 39

Keywords: nanotoxicology, carbon nanotube, graphene, Ag, TiO2, rat

40 2

ACS Paragon Plus Environment

Environmental Science & Technology

41

INTRODUCTION

42

With unique and advanced properties, engineered nanoparticles (ENPs) have been

43

widely applied in human activities (e.g., approximately 760 nano-enabled consumer

44

products in the health and fitness category, with an increasing number of

45

nano-enabled products for industry, agriculture and environmental protection).1 With

46

the rapid development of NPs, nanosafety has attracted increasing attention

47

worldwide due to environmental contamination or direct human exposure.2, 3 However,

48

the numerous physicochemical properties (e.g., size, shape and surface chemistry) of

49

ENPs and the various experimental conditions (e.g., various exposure pathways)

50

affect nanotoxicity.4 For example, suspended graphene showed toxic effects through

51

direct physical interaction with the cell membrane, generating reactive oxygen species

52

(ROS) and trapping cells via aggregation.5-9 The chemical state of graphene (e.g., GO

53

and rGO) also affected its interaction with cells.10 Based on the hypothesis that the

54

physicochemical properties of ENPs and the experimental conditions determine

55

nanotoxicology, the priority factors determining nanotoxicity were identified and

56

nanotoxicity was predicted using models in the present work. In addition, the cost of

57

animal experiments is very high, and the mechanisms of nanotoxicity are very

58

complicated, especially in experiments involving reproductive toxicity. As a result,

59

arguments concerning nanotoxicity are frequently reported.11-13 It is becoming

60

increasingly important to effectively screen priority factors determining and 3

ACS Paragon Plus Environment

Page 4 of 45

Page 5 of 45

Environmental Science & Technology

61

predicting the toxicity of various NPs using other methods to supplement animal

62

experiments, which is critical for the design of animal experiments and for the

63

illustration and prediction of nanotoxicity.

64

Multidimensional datasets (e.g., the physicochemical properties of ENPs and the

65

various experimental conditions) challenge traditional tools (e.g., quantitative

66

structure-activity relationships and qualitative classification-based models) for the

67

assessment and prediction of nanotoxicity.14-16 Machine learning is a discipline that

68

can be used to build models to explain observed data through experience.17, 18 With its

69

robust capacity to recognize meaningful patterns, machine learning has a wide range

70

of applications, including controlling robots,18 predicting toxicity19 and predicting

71

synthetic reactions.19, 20 In the above approach, numerous data points were used to

72

overcome the heterogeneous data. Thus, machine learning may solve the

73

aforementioned problems of assessing nanosafety.

74

However, publications and data regarding reproductive toxicity are relatively

75

limited due to the high cost and complexity of performing such studies. To evaluate

76

the reproductive toxicity of ENPs, the reported data from animal experiments are

77

valuable sources of toxicological information for meta-analysis, which could produce

78

general answers to questions regarding diverse data.19-23 Given the highly

79

heterogeneous data, it is difficult to reveal the relationships between the

80

multidimensional factors of ENPs and nanotoxicity using traditional models. 4

ACS Paragon Plus Environment

Environmental Science & Technology

81

Therefore, accurately extracting the hidden multidimensional factor-toxicity

82

dependencies through machine learning is critical for toxicity assessment and

83

prediction19, 24 and remains a challenge for current workflows.25

84

As a new interdisciplinary technique, machine learning can be used to build models

85

to explain observed data through experience.17, 18 Random forest (RF)26 is a powerful

86

algorithm based on the decision tree method and is suitable for dealing with big

87

data.19, 27 In the present work, RF was used to examine heterogeneous data from a

88

meta-analysis. To explore the heterogeneity of the datasets, similarity network

89

analysis was performed to visualize the similarity between data points with the

90

proximity matrix in RF. The proximity matrix, a by-product of RF, records the

91

frequency of two data points appearing in the same nodes.19 The priority factors for

92

determining reproductive toxicity were screened by RF and are critical to the

93

scientific design of experiments, as well as the accurate illustration and prediction of

94

nanotoxicity. Finally, multiple methods were integrated to predict nanotoxicity by

95

revealing factor-toxicity dependencies.

96

MATERIALS AND METHODS

97

Data extraction

98 99 100

Data regarding the female reproductive toxicity induced by NPs (e.g., those related to the uterus, ovaries and sex hormones) are highly heterogeneous.28, 29 In contrast, the indexes of male reproductive toxicity showed a high representation of fertility (e.g., 5

ACS Paragon Plus Environment

Page 6 of 45

Page 7 of 45

Environmental Science & Technology

101

the sperm count and the sperm abnormality rate).30 Therefore, the present work

102

focused on reproductive toxicity in male rodents. The tested data were obtained from

103

the ISI Web of Knowledge, PubMed and Scopus databases; the searched topics are

104

presented in the Supporting Information (search time, July 23, 2017). To obtain a

105

comprehensive understanding of the relationships between different factors (e.g., ENP

106

types) and reproductive toxicity, the present work collected data from animal

107

experiments related to nanotoxicity in the literature, as shown in Tables S1-S3. The

108

chosen NPs, including carbon nanotubes, graphene, and Ag and TiO2 NPs, have been

109

widely applied in various fields and have high potential for environmental or human

110

exposure; for example, Ag and TiO2 NPs have been detected in various surface

111

waters.31, 32 More importantly, the chosen NPs cause obvious reproductive toxicity by

112

reducing sperm parameters and accumulating in the testicles.33, 34 The present work

113

intended to study the relationships between different ENP types (e.g., carbon

114

nanotubes and Ag and Au NPs) and reproductive toxicity in the testes. All NPs

115

causing reproductive toxicity in male rats were analyzed. A summary of toxicity

116

information from each publication collected in the present work is provided in Table

117

S3 to ensure the reliability of the results of the model. Ten factors, including five

118

physicochemical properties (ENP size, shape, type, surface ligand and element) and

119

five experimental conditions (rodent age, dosage and exposure time, duration and

120

method), and two adverse biological impacts (reproductive toxicity and accumulation 6

ACS Paragon Plus Environment

Environmental Science & Technology

121

of ENPs) were extracted from each publication, as shown in Tables S1 and S2. The

122

organ tested for the accumulation of ENPs was the testis. The tested reproductive

123

toxicity indicators were sperm parameters, blood testosterone concentration and testis

124

index. Among the above three indicators, sperm parameters were the priority

125

parameters for measuring the male reproductive toxicity of ENPs. The analyzed

126

sperm parameters included the sperm count (106/ml), rate of motile sperm (%),

127

progressive motility (%), sperm abnormalities (%) and total sperm production (106/g

128

testis). Among the above sperm parameters, the sperm count (106/ml) was set as a

129

priority sperm parameter.

130

Treatment of special data

131

Approximately one hundred data samples did not include the age of the tested

132

rodents. The curve-fitting method was used to calculate the missing ages based on the

133

age-weight relationships of different species (details are presented in the

134

Supplementary Information and Table S4). In a few publications without information

135

regarding rodent age or weight, the median age of overall rodents was used to replace

136

the missing age values. For nanowires or nanorods, the widths were recorded in the

137

present analysis. According to respiratory rates in previous research,35 μg/݉ଷ or

138

mg/݉ଷ was converted to mg/݇݃ for rodents based on the formula described by

139

Nouwen.36 The unit of exposure dosage was normalized to mg/(݇݃ ∙ ݀ܽ‫)ݕ‬. To

7

ACS Paragon Plus Environment

Page 8 of 45

Page 9 of 45

Environmental Science & Technology

140

normalize the diverse toxicity indicators, the ratios of data from treated groups to data

141

from control groups were used in the present analysis.

142

RF regression and validation

143

In RF, each tree is built by a bootstrap sample from the overall data and the best

144

partitions among a subset of attributes that are randomly selected for each node. The

145

RF algorithm performs the predictions by aggregating the predictions of each tree,

146

using the majority vote for classification and the average for regression. To measure

147

the relative importance of factors, the percent increase in the RF mean square error

148

(MSE) was calculated using the R package randomForest. To find the sources of

149

heterogeneity and to identify the priority factors determining the reproductive toxicity

150

of ENPs, the original datasets were classified into subsets for further analysis

151

according to the above priority factors. Given that the type of ENP was crucial for

152

ENP toxicity and accumulation, metallic ENPs vs metal-free ENPs and carbonaceous

153

ENPs vs carbon-free ENPs were set as two important classifications. Regarding the

154

exposure method in accumulation studies, the data were divided according to injection

155

(e.g., intravenous injection, intraperitoneal injection, abdominal subcutaneous

156

injection and intratracheal injection) and noninjection (e.g., gavage and inhalation)

157

exposure methods.

158

RF, as a data-driven model, includes two important parameters: ntree, the number

159

of trees in the forest; and mtry, the number of attributes randomly selected for a subset 8

ACS Paragon Plus Environment

Environmental Science & Technology

160

at each node.26 The values of ntree and mtry for each study were adjusted to obtain

161

the best predictive accuracy. During tree construction, approximately 63% of the raw

162

data were used to build the trees in each RF bootstrap sample, and the remaining

163

out-of-bag (OOB) data (not in the bootstrap samples) were used to validate the

164

performance of the model.37 As the RF algorithm is inherently tolerant to

165

overfitting,37 the OOB predictive accuracy was selected to represent the performance

166

and predictive accuracy of the model. To measure the performance, the correlation

167

coefficient (R2) and root mean square error (RMSE) between the predictions and

168

observations were calculated as predictive accuracy metrics.24 Then, to evaluate the

169

predictive accuracy of the RF model, linear regression of quantitative factors was

170

performed to predict nanotoxicity (accumulation and toxicity of ENPs). To evaluate

171

the performance of the model with different data, training datasets (from 20% to

172

100%) were randomly selected from original datasets, with ten repetitions, and the R2

173

and RMSE were then calculated.

174

Similarity network analysis

175

The similarity network was drawn using the Kamada-Kawai layout algorithm.38

176

Similarity network nodes with higher than average values were linked in the

177

proximity matrix using the R package igraph. Subsequently, to find tight-knit

178

subgroups with intrinsic commonality and to visualize the heterogeneity of the

179

datasets, the walktrap community algorithm,39 a hierarchical clustering method, was 9

ACS Paragon Plus Environment

Page 10 of 45

Page 11 of 45

Environmental Science & Technology

180

applied to the similarity network. Then, two important measurements, the density and

181

clustering coefficient (transitivity), reflecting the kernel density estimation and the

182

probability of a vertex connecting to adjacent vertices, were applied to measure the

183

tendency of the data to cluster.40, 41 Given the heterogeneity in the datasets, the first to

184

the third quartiles and the most frequent categories were applied to represent the

185

distribution of quantitative and qualitative data in each cluster, respectively.

186

Factor-toxicity dependence analysis and nanotoxicity prediction

187

Two methods were used to analyze the complex factor-toxicity dependencies. The

188

first method was to visualize the average partial relationships between the adverse

189

impacts (accumulation and toxicity) of ENPs and factors by partial dependence plots42,

190

43

191

was then used to estimate statistically significant differences (p < 0.05) between

192

observations (p1) and predictions (p2). The above method was suitable for visualizing

193

low-dimensional (one or two) parameters. Similarity network analysis with

194

hierarchical clustering could identify tight-knit clusters with intrinsic similarity and

195

common properties. Similarity network analysis was also employed to visualize

196

high-dimensional parameters. The classification analysis of RF was used to extract

197

reliable factor-toxicity dependencies shared in clusters. The prediction analysis used

198

data with a uniform distribution over a certain range for each factor. The median

199

values or most frequent categories were used to describe nonsignificant attributes

and individual conditional expectation (ICE) plots.44 Analysis of variance (ANOVA)

10

ACS Paragon Plus Environment

Environmental Science & Technology

200

(rodent age, ENP shape, exposure duration, surface ligands for accumulation studies

201

and indicators for toxicity studies). The sperm parameters were selected for measuring

202

reproductive toxicity due to their low heterogeneity.

203

To validate the analysis of multidimensional factor-toxicity dependence, the

204

over-80% vote rate and the classification rate were calculated to measure the data

205

matching the cluster and the RF classification performance. To estimate the predictive

206

performance, the predictions (TP and CP) were compared with the observations (TO

207

and CO) of data meeting the conditions of the six important attributes (ENP type, size,

208

exposure method, exposure time, dosage, element for accumulation studies and

209

surface ligand for toxicity studies) screened by variable importance, shown in Figure

210

2, and the observations in the original cluster (TC and CC). ANOVA with Tukey’s

211

multiple comparison test was performed using IBM SPSS Statistics 20. In the RF

212

analysis, partial dependence plots and ICE plots were generated using the

213

randomForest package and the pdp package in R software version 3.3.2, respectively.

214 215

RESULTS AND DISCUSSION

216

Analysis of highly heterogeneous data

217

Mining data regarding the reproductive toxicity of ENPs followed the process

218

described in the Methods section (Tables S1 and S2). The data from eighty-two

219

publications directly related to the male reproductive toxicity of ENPs were analyzed. 11

ACS Paragon Plus Environment

Page 12 of 45

Page 13 of 45

Environmental Science & Technology

220

Regarding the accumulation and the reproductive toxicity of ENPs, 242 and 250 data

221

points were obtained, respectively, as summarized in Table 1. The quantitative

222

attributes (rodent age, ENP size, dosage, exposure time and duration) and qualitative

223

attributes (ENP type, shape, surface ligand, exposure method, and element) were

224

tested in the RF analysis (Table 1). The aforementioned ten qualitative and

225

quantitative attributes covered the main issues related to the reproductive impacts of

226

ENPs.45 The distribution of the extracted data points is presented in Figure S1. No

227

particular studies dominating the databases were observed, and further analysis was

228

performed.

229

However, the data heterogeneity was still a challenge for further analysis. Table 1

230

provides an overview of the heterogeneity of databases regarding the accumulation

231

and toxicity of ENPs. The large standard deviations for the accumulation

232

concentration (e.g., 1437 ± 6201 ng/g) and the toxicity (e.g., 0.12 ± 0.25) of ENPs

233

demonstrated the heterogeneity of the data (Table 1), leading to difficulty in drawing

234

an unbiased conclusion. In addition to the influence of the physicochemical properties

235

of ENPs and the experimental conditions, the method of exposure to NPs can lead to

236

disruption of the cellular redox status and activation of compensatory mechanisms

237

(e.g., changes in starch granules, lysosomes and autophagy), resulting heterogeneous

238

toxicological data.46 First, linear regression was used to fit the heterogeneous data and

239

explore the connections of quantitative factors with the accumulation concentration 12

ACS Paragon Plus Environment

Environmental Science & Technology

240

and toxicity of ENPs. As shown in Figure S2, the small R2 (less than 0.05 for

241

accumulation studies and less than 0.12 for toxicity studies) and the large RMSE

242

(greater than 6200 ng/g for accumulation studies and greater than 0.25 for

243

reproductive toxicity) values indicated that the traditional linear regression model had

244

no capacity to handle the database heterogeneity. To compare with the above linear

245

regression model, the R2 and RMSE were also used to evaluate the performance of the

246

RF model (Table 2 and Figure 1). Higher R2 (greater than 0.62) and smaller RMSE

247

(less than 6000 ng/g) values indicated that the RF model was better for analyzing the

248

heterogeneous data than the linear regression model. Moreover, the RMSE and R2

249

gradually decreased and increased with increasing training data, respectively (Figure

250

1), indicating that the RF model would exhibit better performance with more data.

251

These measurements ensured the reliability of the meta-analysis machine learning

252

workflow and the results of the model. Regarding the above heterogeneous data, the

253

RF model combined with similarity network and hierarchical clustering analyses was

254

used to handle and visualize data heterogeneity, to screen priority factors determining

255

ENP toxicity and to reveal factor-toxicity dependencies, as presented below.

256

Screening priority factors determining reproductive toxicity and contributing to

257

heterogeneity

258

The present work screened priority factors determining toxicity by measuring the

259

increase in factor importance with increasing MSE. According to the MSE increase 13

ACS Paragon Plus Environment

Page 14 of 45

Page 15 of 45

Environmental Science & Technology

260

illustrated in Figure 2, two top priority factors with the highest MSE increases (greater

261

than 32%) for NP accumulation were screened: the exposure method (e.g., i.p., i.v.

262

and ga.) and ENP type (e.g., MWCNT, ZnO and Ag). The above two factors should

263

be well considered in the design of animal experiments and the assessment of

264

nanomaterial accumulation. In contrast, toxicity indicators (e.g., sperm parameters,

265

testosterone level and testis index) and the ENP type (e.g., Ag, MWCNT and TiO2)

266

with the highest MSE increase (greater than 15%) were identified as the two top

267

priority factors for reproductive toxicity. Therefore, the determination of nanomaterial

268

reproductive toxicity depended on the toxicity indicators and ENP type used in the

269

experiments.

270

To find the sources of heterogeneity, the present work classified the priority factors

271

into subsets and evaluated the heterogeneity in these subsets using an RF regression

272

model. As shown in Table 2, the RF model had good predictive performance in all of

273

the subsets, and the R2 values were greater than 0.6. The differences between the

274

observations and predictions modeled by the RMSE were obvious in the accumulation

275

studies due to the presence of few data sources with large predictive errors (Figure

276

S3a). In other words, the RMSE could be used to find the sources of large predictive

277

errors and the contributors to heterogeneity. In the toxicity studies (Table 2), the small

278

RMSE (e.g., 0.091 and 0.111) reflected the high predictive accuracy in those subsets.

279

The limited data points (n = 37 and n = 56) of the subsets with carbonaceous ENPs 14

ACS Paragon Plus Environment

Environmental Science & Technology

280

and the testis index contributed to the high heterogeneity, indicated by small R2 values

281

(0.489 and 0.383, respectively), in the prediction of reproductive toxicity (Table 2).

282

However, compared with the accumulation studies (Figure S3a), the toxicity studies

283

did not contain data that dominantly affected the regression performance or reduced

284

the predictive accuracy (Figure S3b).

285

To further identify sources of heterogeneity, the present work built similarity

286

networks (Figures 3 and 4) to visualize the construction and heterogeneity of the

287

datasets based on the RF proximity matrix. To analyze the major constructions of the

288

similarity network, hierarchical clustering was performed to find tight-knit clusters,

289

which are labeled with different colors in Figure 3. Table S5 clearly lists the densities

290

and clustering coefficients, which were provided as measures of the aggregation of

291

each cluster in the similarity network for NP accumulation and reproductive toxicity.

292

With few connections, sparse clusters demonstrated the heterogeneity of accumulation

293

or toxicity study-related data in terms of factor-toxicity dependencies. For example,

294

the sparse cluster 8 (C8), with low density (0.692) and clustering coefficient (0.782)

295

values, indicates high heterogeneity in terms of the properties of ENPs and the

296

experimental conditions, consistent with the wide range (0-14191 ng/g) of ENP

297

accumulation concentrations. Additionally, tight clusters tended to present lower

298

accumulation concentrations (e.g., C1, C2 and C3 in accumulation studies) and higher

299

toxicity levels (e.g., C7, C8 and C9 in toxicity studies) than sparse clusters. The 15

ACS Paragon Plus Environment

Page 16 of 45

Page 17 of 45

Environmental Science & Technology

300

accumulation studies contained more separate clusters (e.g., C1 and C3) with specific

301

commonalities than the toxicity studies, indicating the diverse factor-accumulation

302

dependencies in the accumulation studies. To assess the heterogeneity of the priority

303

factors (i.e., ENP type and exposure method for accumulation studies; ENP type and

304

indicator for toxicity studies), Figure 4 presents similarity networks colored by the

305

categories of the priority factors. The data for some categories (e.g., Au in Figure 4a,

306

i.v. in Figure 4b and sperm parameters in Figure 4c) were separated from neighbor

307

clusters and gathered in tight clusters with great homogeneity. For example, the Au

308

NPs gathered in two tight clusters for NP accumulation. Compared with the clusters in

309

Figure 4b, the two clusters of Au NPs were separated by different exposure methods

310

(i.v. and i.p.). The data also suggested that the exposure method was the priority

311

factor that more dominantly determined ENP accumulation than the ENP type, which

312

was verified by the variable importance in Figure 2. Furthermore, in Figure 4d,

313

compared with the cluster of sperm parameters, the testosterone and testis index

314

clusters contained sparse connections, indicating the high heterogeneity of the data

315

regarding testis index-toxicity and testosterone-toxicity. Compared with Figure 4a,

316

Figure 4c, with nodes of diverse types of ENPs clustering closely together, shows the

317

high homogeneity of the relationships between ENP type and ENP toxicity. The

318

figure also confirms that the exposure methods with tight clusters play more important

319

roles in ENP toxicity than the ENP types in Figure 2. The employed similarity 16

ACS Paragon Plus Environment

Environmental Science & Technology

320

networks identified the heterogeneity and homogeneity of factor-toxicity

321

dependencies from sparse and tight clusters, respectively. The above analysis of

322

factor-toxicity dependence provided a reliable foundation from which to predict the

323

reproductive toxicity of ENPs, as described below.

324

Predicting the reproductive toxicity of ENPs

325

Visualizing multidimensional factor-toxicity dependencies is effective for

326

interpreting the relationship between factors and toxicity hidden in the RF model and

327

contributes to the prediction of nanotoxicity. The partial dependence plots shown in

328

Figure 5 provided an easy way to deeply understand the relationships between factors

329

(ENP properties and exposure conditions) and biological impacts (ENP accumulation

330

and reproductive toxicity) (more details are presented in Figure S4). As shown in

331

Figure 5a, sperm parameters exhibited higher sensitivity for indicating reproductive

332

toxicity risk than did other reproductive toxicity indicators (e.g., testosterone and the

333

testis index) (p1, p2 < 0.0001). The above result was consistent with the better

334

predictive performance of RF in the sperm parameter indicator subset than other

335

subsets of reproductive indicators (Table 2). Compared with other types of ENPs,

336

CdTe quantum dots and Si NPs were likely to induce significantly high reproductive

337

toxicity (p1 = 0.023, p2 = 0.007). Previous studies have demonstrated that CdTe

338

quantum dots can result in acute reproductive toxicity by destroying the structure of

339

testicular tissue.47 As shown in Figure 5b, Fe3O4, Fe2O3 and ZnO NPs exhibited 17

ACS Paragon Plus Environment

Page 18 of 45

Page 19 of 45

Environmental Science & Technology

340

significantly higher accumulation concentrations than the other types of ENPs. Figure

341

S4 also illustrates that the ENPs with major elements (e.g., Zn and Fe) tended to reach

342

significantly high accumulation concentrations in the testes (p1, p2 < 0.0001). As

343

demonstrated by animal experiments,48, 49 essential elements (e.g., Fe and Zn) with

344

high absorptivity in spermatogenesis could cross the blood-testis barrier and easily

345

accumulate in the testes. Interestingly, these NPs (e.g., Fe and Zn ENPs) reached

346

higher accumulation concentrations, but with less reproductive toxicity, as shown in

347

Figures 5a and 5b. Such inconsistency between the accumulation concentration and

348

reproductive toxicity of ENPs deserves attention in the assessment and control of

349

nanotoxicity.50, 51 Figure 5b shows that i.p. injection posed a significantly higher risk

350

of ENP accumulation in the testes than other exposure pathways (p1, p2 < 0.0001),

351

probably because i.p. injection could effectively transport ENPs to the testes.48 The

352

above result suggests that assessments of nanotoxicity must consider the exposure

353

pathway: i.p. injection may overestimate the nanotoxicity or pose a higher risk.

354

To reveal any multidimensional factor-toxicity dependencies, the present work

355

applied data mining technology to the similarity network (Figure 3), a platform for

356

predicting the accumulation and reproductive toxicity of ENPs under specific

357

conditions. The tight clusters in the similarity networks shared commonalities in terms

358

of ENP properties and experimental conditions. Extracting the reliable commonalities

359

shared in the clusters could visualize the multidimensional factor-toxicity 18

ACS Paragon Plus Environment

Environmental Science & Technology

360

dependencies. The present work applied RF classification analysis to achieve

361

excellent classification accuracy based on hierarchical clustering with high vote rates

362

(greater than 95%) and small error rates (less than 4%), as listed in Table S6. The high

363

classification accuracy demonstrated the good affiliation of the extracted

364

factor-dependencies (Table 3) with their original clusters.

365

The specific dependencies of significant factors (e.g., ENP type and size) on the

366

accumulation concentration and reproductive toxicity of ENPs are listed in Table 3

367

(details are presented in the Methods section). The RF classification obtained high

368

vote rates (greater than 80%) for each factor dependency, as shown in Table 3,

369

indicating the good affiliation of each extracted factor dependency with their original

370

clusters. The predictions (CP and TP) and observations (CP and TP) were compared to

371

measure the predictive performance. In the accumulation studies, the similar

372

prediction and observation ranges (e.g., C1, 4-15 ng/g vs 0.9-18.9 ng/g; C3, 177-252

373

ng/g vs 183-287 ng/g) verified the acceptable predictive performance. However, there

374

was remarkable predictive error (1562-12110 ng/g vs 200 ng/g) in C7 accumulation

375

studies, indicating the difficulty in predicting the accumulation concentrations of

376

ENPs with essential elements (e.g., Zn and Fe). In the toxicity studies, there were

377

slight differences between the predicted and observed ranges, and comparable toxicity

378

levels were achieved in the predictions and observations (e.g., C8: 0.26-0.33 vs

379

0.14-0.42). As denoted for the C9 toxicity studies, the model performed well in 19

ACS Paragon Plus Environment

Page 20 of 45

Page 21 of 45

Environmental Science & Technology

380

predicting the reproductive toxicity of three ENP types in terms of a high vote rate

381

(0.89). However, there was a slight difference between the predicted and observed

382

ranges (0.14-0.19 vs 0.01-0.15), which indicated the complicated nature of

383

reproductive toxicity, especially for carbon nanomaterials. There are some works

384

related to the complicated biocompatibility of graphene, for example, the roles of

385

surface modification in the growth, proliferation and differentiation of cells.52, 53 In

386

addition to cytotoxic materials, graphene materials also showed genotoxic effects,

387

which were dependent on size, shape and concentration.5-10 As a result, the toxicity

388

study predictions presented low predictive accuracy in terms of greater error between

389

predictions and observations due to the high heterogeneity of the toxicity studies, and

390

the accumulation study predictions exhibited higher accuracy in comparison.

391

According to the prediction results, Ag NPs could attain high accumulation

392

concentrations (177-252 ng/g) at small sizes (15-40 nm), medium-term exposure

393

times (14-28 days) and wide dosage ranges (1-100 mg/kg) by ga. or i.v. exposure. The

394

above results indicated that dosage over a certain range was not a sensitive factor for

395

the accumulation of Ag NPs in the testes. Furthermore, compared with Ag NPs in two

396

dependencies (C4 vs C7 of toxicity studies in Table 3), a citrate coating could reduce

397

the reproductive toxicity of Ag NPs (0.04-0.06 vs 0.06-0.08). This result supported

398

the idea that the surface ligands of Ag NPs should be taken into consideration for

399

controlling and reducing Ag nanotoxicity. 20

ACS Paragon Plus Environment

Environmental Science & Technology

400

Perspectives of machine learning in nanotoxicity assessment and prediction

401

The rapid development and application of ENPs in various fields have attracted

402

much attention to nanosafety worldwide.2, 54 There are two challenges to the scientific

403

assessment of nanosafety: the numerous attributes (various ENP properties and

404

experimental conditions) related to nanosafety and the high heterogeneity of the data.

405

There are some special biological end points and mechanisms for the toxicological

406

study of nanomaterials compared with common pollutants or xenobiotics, such as

407

cellular uptake, cell membrane and cell wall damage by sharp edges and the release of

408

monomers and ions with stronger toxicity than the parent nanomaterials.55, 56 The

409

above issues contributed the high heterogeneity of the data. With the robust capacity

410

to recognize meaningful patterns in large and heterogeneous datasets, machine

411

learning can be used in a wide range of applications, such as cellular toxicity

412

prediction,19 synthetic reaction prediction24 and personality judgment.57 Thus,

413

machine learning may solve the above two problems in nanosafety assessment and

414

prediction. The present work applied a workflow combining machine learning (RF)

415

with similarity network analysis to provide a roadmap to robustly handle highly

416

heterogeneous data, reveal multidimensional factor-toxicity dependencies and predict

417

the nanotoxicity of various ENPs (more than eighteen types). However, the data

418

quality influences the performance of data-driven RF models.19 As stated above, the

419

lack of ENP characterization and the scarcity of data for some ENPs in the analyzed 21

ACS Paragon Plus Environment

Page 22 of 45

Page 23 of 45

Environmental Science & Technology

420

references have influenced the performance of the model. Integrating RF with

421

similarity network analysis is an option for analyzing heterogeneous data from the

422

literature (Table 3 and Figure 3) for other nanotoxicological studies (including those

423

of female reproductive toxicity). Machine learning also reduces the use of animals in

424

experiments and will be a powerful tool in the assessment and prediction of complex

425

nanosafety issues. In addition, it is well known that reproductive toxicity evaluations

426

are very complicated and require detailed mechanistic studies. The method applied in

427

this research revealed the complex relationships between each piece of data in the RF

428

model and allowed the visualization of hidden complex factor-toxicity dependencies

429

via combination with a similarity network model. Animal experiments provide data

430

and information to support such analyses; thus, animal experiments, especially those

431

investigating toxicological mechanisms, are irreplaceable in assessing the

432

reproductive toxicity of ENPs. Notably, the data or databases are the most important

433

issue in machine learning. The intentional establishment of big data or databases

434

determines the quality of machine learning applications. In the present work, the

435

priority factors determining reproductive toxicity were successfully screened from

436

highly heterogeneous data by integrating data mined from the literature and an RF

437

model. By combining similarity network analysis and hierarchical clustering, the

438

sources of highly heterogeneous data were identified. Then, the factor-toxicity

439

dependencies were extracted and visualized. Subsequently, the prediction of 22

ACS Paragon Plus Environment

Environmental Science & Technology

Page 24 of 45

440

nanotoxicity was achieved based on the screened priority factors. The above

441

workflow and used models are not specific to male toxicity; they are also suitable for

442

examine female reproductive toxicity, among others.

443 444

ASSOCIATED CONTENT

445

Supporting Information Available

446

Methods of data extraction, data treatment and ENP characterization and related

447

results are provided in Tables S1-S6 and Figures S1-S5.

448 449

AUTHOR INFORMATION

450

Corresponding author

451

*Email:

452

86-22-23507800.

453

The authors declare no competing financial interests.

[email protected]

(X.H.).

Tel.:

86-22-23507800;

fax:

454 455

ACKNOWLEDGMENTS

456

This work was financially supported by the National Natural Science Foundation of

457

China (grant nos. 21577070 and 21307061).

458 459 23

ACS Paragon Plus Environment

Page 25 of 45

Environmental Science & Technology

460

REFERENCES

461

1.

462

D.; Hull, M. S., Nanotechnology in the real world: Redeveloping the nanomaterial

463

consumer products inventory. Beilstein J. Nanotech. 2015, 6, 1769-80.

464

2.

465

113 (5), 3815-3835.

466

3.

467

Maurizi, L.; Jochum, W.; Krug, H. F.; Buerkithurnherr, T.; Mandach, U. V.,

468

Bidirectional Transfer Study of Polystyrene Nanoparticles across the Placental Barrier

469

in an ex Vivo Human Placental Perfusion Model. Environ. Health Perspect. 2015,

470

123 (12), 1280-6.

471

4.

472

Perez-Potti, A.; Cookman, J.; Garry, D.; P. Silveira, C.; Boselli, L.; Pelaz, B.; Serchi,

473

T.; Cambier, S.; Gutleb, A. C.; Feliu, N.; Yan, Y.; Salmona, M.; Parak, W. J.;

474

Dawson, K. A.; Bigini, P., Influence of Size and Shape on the Anatomical

475

Distribution of Endotoxin-Free Gold Nanoparticles. ACS Nano 2017, 11 (6),

476

5519-5529.

477

5.

478

A., Cyto and genotoxicities of graphene oxide and reduced graphene oxide sheets on

479

spermatozoa. RSC Adv. 2014, 4 (52), 27213-27223.

Vance, M. E.; Kuiken, T.; Vejerano, E. P.; Mcginnis, S. P.; Jr, M. F. H.; Rejeski,

Hu, X.; Zhou, Q., Health and Ecosystem Risks of Graphene. Chem. Rev. 2013,

Grafmueller, S.; Manser, P.; Diener, L.; Diener, P. A.; Maederalthaus, X.;

Talamini, L.; Violatto, M. B.; Cai, Q.; Monopoli, M. P.; Kantner, K.; Krpetić, Ž.;

Hashemi, E.; Akhavan, O.; Shamsara, M.; Rahighi, R.; Esfandiar, A.; Tayefeh,

24

ACS Paragon Plus Environment

Environmental Science & Technology

480

6.

Kaner, E. F.; Lock, C. A.; Mcavoy, B. R.; Heather, N.; Gilvarry, Wrapping

481

bacteria by graphene nanosheets for isolation from environment, reactivation by

482

sonication, and inactivation by near-infrared irradiation. J. Phys. Chem. B 2011, 115

483

(19), 6279-6288.

484

7.

485

Protein corona-mediated mitigation of cytotoxicity of graphene oxide. ACS Nano

486

2011, 5 (5), 3693-3700.

487

8.

488

Cytotoxicity effects of graphene and single-wall carbon nanotubes in neural

489

phaeochromocytoma-derived PC12 cells. ACS Nano 2010, 4 (6), 3181-3186.

490

9.

491

against bacteria. ACS Nano 2010, 4 (10), 5731-5736.

492

10. Akhavan, O.; Ghaderi, E., Escherichia coli bacteria reduce graphene oxide to

493

bactericidal graphene in a self-limiting manner. Carbon 2012, 50 (5), 1853-1860.

494

11. Semmler-Behnke, M.; Lipka, J.; Wenk, A.; Hirn, S.; Schäffler, M.; Tian, F.;

495

Schmid, G.; Oberdörster, G.; Kreyling, W. G., Size dependent translocation and fetal

496

accumulation of gold nanoparticles from maternal blood in the rat. Part. Fibre

497

Toxicol. 2014, 11 (1) 33.

498

12. Yang, X.; Gondikas, A. P.; Marinakos, S. M.; Auffan, M.; Liu, J.; Hsu-Kim, H.;

499

Meyer, J. N., Mechanism of Silver Nanoparticle Toxicity Is Dependent on Dissolved

Hu, W.; Peng, C.; Lv, M.; Li, X.; Zhang, Y.; Chen, N.; Fan, C.; Huang, Q.,

Zhang, Y.; Ali, S. F.; Dervishi, E.; Xu, Y.; Li, Z.; Casciano, D.; Biris, A. S.,

Akhavan, O.; Ghaderi, E., Toxicity of graphene and graphene oxide nanowalls

25

ACS Paragon Plus Environment

Page 26 of 45

Page 27 of 45

Environmental Science & Technology

500

Silver and Surface Coating in Caenorhabditis elegans. Environ. Sci. Technol. 2012, 46

501

(2), 1119-1127.

502

13. Kataoka, C.; Nakahara, K.; Shimizu, K.; Kowase, S.; Nagasaka, S.; Ifuku, S.;

503

Kashiwada, S., Salinity-dependent toxicity of water-dispersible, single-walled carbon

504

nanotubes to Japanese medaka embryos. J. Appl. Toxicol. 2017, 37 (4), 408-416.

505

14. Gajewicz, A., How to judge whether QSAR/read-across predictions can be

506

trusted: a novel approach for establishing a model's applicability domain. Environ.

507

Sci-Nano 2018, 5 (2), 408-421.

508

15. Wang, D.; Shi, J.; Xiong, Y.; Hu, J.; Lin, Z.; Qiu, Y.; Cheng, J., A QSAR-based

509

mechanistic study on the combined toxicity of antibiotics and quorum sensing

510

inhibitors against Escherichia coli. J. Hazard. Mater. 2018, 341, 438-447.

511

16. Tropsha, A., Best Practices for QSAR Model Development, Validation, and

512

Exploitation. Qsar Comb. Sci. 2010, 29 (6-7), 476-488.

513

17. Ghahramani, Z., Probabilistic machine learning and artificial intelligence. Nature

514

2015, 521 (7553), 452.

515

18. Jordan, M. I.; Mitchell, T. M., Machine learning: Trends, perspectives, and

516

prospects. Science 2015, 349 (6245), 255-260.

517

19. Oh, E.; Liu, R.; Nel, A.; Gemill, K. B.; Bilal, M.; Cohen, Y.; Medintz, I. L.,

518

Meta-analysis of cellular toxicity for cadmium-containing quantum dots. Nat.

519

Nanotechnol. 2016, 11 (5), 479-486. 26

ACS Paragon Plus Environment

Environmental Science & Technology

520

20. Gernand, J. M.; Casman, E. A., A meta-analysis of carbon nanotube pulmonary

521

toxicity studies--how physical dimensions and impurities affect the toxicity of carbon

522

nanotubes. Risk Analysis 2014, 34 (3), 583-597.

523

21. Wang, X.; Zou, C.; Gao, X.; Guan, X.; Zhang, W.; Zhang, Y.; Shi, X.; Chen, X.,

524

Nitrous oxide emissions in Chinese vegetable systems: A meta-analysis. Environ.

525

Pollut. 2018, 239, 375-383.

526

22. Stringhini, S.; Carmeli, C.; Jokela, M.; Avendano, M.; Muennig, P.; Guida, F.;

527

Ricceri, F.; D'Errico, A.; Barros, H.; Bochud, M., Socioeconomic status and the 25 x

528

25 risk factors as determinants of premature mortality: a multicohort study and

529

meta-analysis of 1.7 million men and women. Lancet 2017, 389(10075), 1229-1237.

530

23. Ettehad, D.; Emdin, C. A.; Kiran, A.; Anderson, S. G.; Callender, T.; Emberson,

531

J.; Chalmers, J.; Rodgers, A.; Rahimi, K., Blood pressure lowering for prevention of

532

cardiovascular disease and death: a systematic review and meta-analysis. Lancet

533

2015, 387 (10022), 957–967.

534

24. Ahneman, D. T.; Estrada, J. G.; Lin, S.; Dreher, S. D.; Doyle, A. G., Predicting

535

reaction performance in C-N cross-coupling using machine learning. Science 2018,

536

360 (6385), 186-190.

537

25. Rauber, P. E.; Fadel, S. G.; Falcao, A. X.; Telea, A. C., Visualizing the Hidden

538

Activity of Artificial Neural Networks. IEEE T. Vis. Comput. Gr. 2017, 23 (1),

539

101-110. 27

ACS Paragon Plus Environment

Page 28 of 45

Page 29 of 45

Environmental Science & Technology

540

26. Breiman, L., Random Forest. Mach. Learn. 2001, 45, 5-32.

541

27. Eze, I. C.; Hemkens, L. G.; Bucher, H. C.; Hoffmann, B.; Schindler, C.; Kuenzli,

542

N.; Schikowski, T.; Probst-Hensch, N. M., Association between Ambient Air

543

Pollution and Diabetes Mellitus in Europe and North America: Systematic Review

544

and Meta-Analysis. Environ. Health Perspect. 2015, 123 (5), 381-389.

545

28. Ema, M.; Hougaard, K. S.; Kishimoto, A.; Honda, K., Reproductive and

546

developmental toxicity of carbon-based nanomaterials: A literature review.

547

Nanotoxicology 2016, 10 (4), 391-412.

548

29. Tian, X.; Zhu, M.; Du, L.; Wang, J.; Fan, Z.; Liu, J.; Zhao, Y.; Nie, G.,

549

Intrauterine inflammation increases materno-fetal transfer of gold nanoparticles in a

550

size-dependent manner in murine pregnancy. Small 2013, 9 (14), 2432-2439.

551

30. Hong, F.; Zhao, X.; Si, W.; Ze, Y.; Wang, L.; Zhou, Y.; Hong, J.; Yu, X.; Sheng,

552

L.; Liu, D.; Xu, B.; Zhang, J., Decreased spermatogenesis led to alterations of

553

testis-specific gene expression in male mice following nano-TiO2 exposure. J.

554

Hazard. Mater. 2015, 300, 718-728.

555

31. Schaumann, G. E.; Baumann, T.; Lang, F.; Metreveli, G.; Vogel, H. J.,

556

Engineered nanoparticles in soils and waters. Sci. Total Environ. 2015, 535, 1-2.

557

32. Loosli, F.; Le, C. P.; Stoll, S., Effect of electrolyte valency, alginate concentration

558

and pH on engineered TiO₂ nanoparticle stability in aqueous solution. Sci. Total

559

Environ. 2015, 535, 28-34. 28

ACS Paragon Plus Environment

Environmental Science & Technology

560

33. Balasubramanian, S. K.; Jittiwat, J.; Manikandan, J.; Ong, C. N.; Yu, L. E.; Ong,

561

W. Y., Biodistribution of gold nanoparticles and gene expression changes in the liver

562

and spleen after intravenous administration in rats. Biomaterials 2010, 31 (8),

563

2034-2042.

564

34. Khorsandi, L.; Orazizadeh, M.; Moradi-Gharibvand, N.; Hemadi, M.; Mansouri,

565

E., Beneficial effects of quercetin on titanium dioxide nanoparticles induced

566

spermatogenesis defects in mice. Environ Sci Pollut Res Int 2016.

567

35. Creutzenberg, O.; Kock, H.; Schaudien, D., Translocation and biokinetic

568

behavior of nanoscaled europium oxide particles within 5 days following an acute

569

inhalation in rats. J. Appl. Toxicol. 2016, 36 (3), 474-478.

570

36. Nouwen, J.; Cornelis, C.; De, F. R.; Wevers, M.; Viaene, P.; Mensink, C.; Patyn,

571

J.; Verschaeve, L.; Hooghe, R.; Maes, A., Health risk assessment of dioxin emissions

572

from municipal waste incinerators: the Neerlandquarter (Wilrijk, Belgium).

573

Chemosphere 2001, 43 (4), 909-923.

574

37. Bylander, T., Estimating Generalization Error on Two-Class Datasets Using

575

Out-of-Bag Estimates. Mach. Learn. 2002, 48 (1-3), 287-297.

576

38. Kamada, T.; Kawai, S., An algorithm for drawing general undirected graphs.

577

Inform. Process. Lett. 1989, 31 (1), 7-15.

578

39. Ilany, A.; Akçay, E., Social inheritance can explain the structure of animal social

579

networks. Nat. Commun. 2016, 7, 12084. 29

ACS Paragon Plus Environment

Page 30 of 45

Page 31 of 45

Environmental Science & Technology

580

40. Newman, M. E. J., The Structure and Function of Complex Networks. Siam

581

Review 2003, 45 (2), 167-256.

582

41. Opsahl, T.; Panzarasa, P., Clustering in weighted networks. Social Networks

583

2009, 31 (2), 155-163.

584

42. Cutler, D. R.; Edwards, T. C., Jr.; Beard, K. H.; Cutler, A.; Hess, K. T., Random

585

forests for classification in ecology. Ecology 2007, 88 (11), 2783-2792.

586

43. Allen, T.; Murray, K. A.; Zambrana-Torrelio, C.; Morse, S. S.; Rondinini, C.; Di

587

Marco, M.; Breit, N.; Olival, K. J.; Daszak, P., Global hotspots and correlates of

588

emerging zoonotic diseases. Nat. Commun. 2017, 8 (1), 1124.

589

44. Goldstein, A.; Kapelner, A.; Bleich, J.; Pitkin, E., Peeking Inside the Black Box:

590

Visualizing Statistical Learning With Plots of Individual Conditional Expectation. J.

591

Comput. Graph. Stat. 2013, 24 (1), 44-65.

592

45. Ema, M.; Kobayashi, N.; Naya, M.; Hanai, S.; Nakanishi, J., Reproductive and

593

developmental toxicity studies of manufactured nanomaterials. Reprod. Toxicol. 2010,

594

30 (3), 343-352.

595

46. Hu, X.; Li, D.; Gao, Y.; Mu, L.; Zhou, Q., Knowledge gaps between

596

nanotoxicological research and nanomaterial safety. Environ. Int. 2016, 94, 8-23.

597

47. Li, X.; Yang, X.; Yuwen, L.; Yang, W.; Weng, L.; Teng, Z.; Wang, L.,

598

Evaluation of toxic effects of CdTe quantum dots on the reproductive system in adult

599

male mice. Biomaterials 2016, 96, 24-32. 30

ACS Paragon Plus Environment

Environmental Science & Technology

600

48. Li, C. H.; Shen, C. C.; Cheng, Y. W.; Huang, S. H.; Wu, C. C.; Kao, C. C.; Liao,

601

J. W.; Kang, J. J., Organ biodistribution, clearance, and genotoxicity of orally

602

administered zinc oxide nanoparticles in mice. Nanotoxicology 2012, 6 (7), 746-756.

603

49. Sundarraj, K.; Manickam, V.; Raghunath, A.; Periyasamy, M.; Viswanathan, M.

604

P.; Perumal, E., Repeated exposure to iron oxide nanoparticles causes testicular

605

toxicity in mice. Environ. Toxicol. 2017, 32 (2), 594-608.

606

50. Choi, J.; Kim, H.; Kim, P.; Jo, E.; Kim, H. M.; Lee, M. Y.; Jin, S. M.; Park, K.,

607

Toxicity of zinc oxide nanoparticles in rats treated by two different routes: Single

608

intravenous injection and single oral administration. J. Toxicol. Environ. Health. A

609

2015, 78 (4), 226-243.

610

51. Wang, X.; Zhang, J.; Yang, X.; Tang, Z.; Hu, Y.; Chen, B.; Tang, J., In vivo

611

assessment of hepatotoxicity, nephrotoxicity and biodistribution using

612

3-aminopropyltriethoxysilane-coated magnetic nanoparticles (APTS-MNPs) in ICR

613

mice. Chin. Sci. Bull. 2014, 59 (16), 1800-1808.

614

52. Akhavan, O.; Ghaderi, E.; Emamy, H.; Akhavan, F., Genotoxicity of graphene

615

nanoribbons in human mesenchymal stem cells. Carbon 2013, 54 (2), 419-431.

616

53. Akhavan, O.; Ghaderi, E.; Akhavan, A., Size-dependent genotoxicity of graphene

617

nanoplatelets in human stem cells. Biomaterials 2012, 33 (32), 8017-8025.

31

ACS Paragon Plus Environment

Page 32 of 45

Page 33 of 45

Environmental Science & Technology

618

54. Mu, Q.; Jiang, G.; Chen, L.; Zhou, H.; Fourches, D.; Tropsha, A.; Yan, B.,

619

Chemical Basis of Interactions Between Engineered Nanoparticles and Biological

620

Systems. Chem. Rev. 2014, 114 (15), 7740-7781.

621

55. Li, M.; Yue, G.; Hu, X., l -Cysteine: A biocompatible, breathable and beneficial

622

coating for graphene oxide. Biomaterials 2015, 52 (1), 301-311.

623

56. Laycock, A.; Diezortiz, M.; Larner, F.; Dybowska, A.; Spurgeon, D.;

624

Valsamijones, E.; Rehkämper, M.; Svendsen, C., Earthworm Uptake Routes and

625

Rates of Ionic Zn and ZnO Nanoparticles at Realistic Concentrations, Traced Using

626

Stable Isotope Labeling. Environ. Sci. Technol. 2015, 50 (1), 412-419.

627

57. Youyou, W.; Kosinski, M.; Stillwell, D., Computer-based personality judgments

628

are more accurate than those made by humans. P. Natl. Acad. Sci. USA 2015, 112 (4),

629

1036-1040.

630 631 632 633 634 635 636 637

32

ACS Paragon Plus Environment

Environmental Science & Technology

Page 34 of 45

638

Table 1. Data analysis of ENP accumulation (242 data points) and reproductive

639

toxicity (250 data points). Five qualitative factors (rodent age, ENP size, and exposure

640

time, duration and dosage) and five quantitative factors (ENP type, surface ligand,

641

shape, exposure method, and element) were analyzed for each study.

642

“#”, the elements of the ENPs are major elements in organisms. “★”, the types of

643

ENPs include CeO2, Fe2O3, Fe3O4, VO, nano-alloy particles (Au and Cu), CoCr,

644

Eu2O3, MWCNTs and PbS NPs. “⚐”, special surface ligands, including

645

3-aminopropyltriethoxysilane (ATPS), carboxyl, amino, hydrophilic polymer, citrate,

646

polyvinyl pyrrolidone (PVP) and peptide. “⸙ ”, special ENP shapes, including colloid,

647

cluster, crystalline, rod, pentacle and line. “☼”, ENP types, including Se, CdTe,

648

CdSe/ZnS, Fe2O3, Si, PbS, Mn2O3, Ni and CoCr NPs. “⸙ ”, the surface ligands

649

polyethylene glycol-amino (PEG-NH2), polyethylene glycol and PVP. “♫”, ENP

650

shapes (line, rod and cube). MWCNTs, multiwalled carbon nanotubes; i.v.,

651

intravenous injection; i.p., intraperitoneal injection; i.t., intratracheal injection; ga.,

652

gavage; i.h., inhalation; sc., abdominal subcutaneous injection. Factor

Age (weeks) Mean (SD) Range Type of ENP (%) Au Ag TiO2 ZnO CdTe

Accumulation of ENPs Factor (n = 242) Age (weeks) 7.4 (2.3) Mean (SD) 1.1 to 13.0 Range Type of ENP (%) 32.2 Ag 21.9 TiO2 10.7 Au 7.0 MWCNT 6.6 ZnO 33

ACS Paragon Plus Environment

Toxicity of ENPs (n = 250)

8.0 (3.9) 1.1 to 19.3 30.8 18.8 8.4 7.6 7.2

Page 35 of 45

Environmental Science & Technology

CdS …★ Surface ligand (%) None Glutathione Polyethylene glycol …⚐ Shape (%) Sphere …⸙ Exposure method (%) i.v. ga. i.p. i.h. i.t. Elements (%)#

3.3 18.1 62.4 8.3 8.3

21.1 77.7 22.3

50 26.9 14.5 7.0 1.7

Yes

28.9

No

71.1

Size (nm) Mean (SD) Range

GO …☼ Surface ligand (%) None Citrate Hydrophilic Polymer …₂ Shape (%) Sphere …♫ Exposure method (%) ga. i.v. i.p. i.h. sc. Indicator (%) Sperm Parameters Testosterone Testis index Size (nm) Mean (SD) Range Exposure time (days) Mean (SD) Range Duration time (days) Mean (SD) Range Dosage (mg/kg) Mean (SD) Range

37.7 (51.2) 1.9 to 323

Exposure time (days)

Mean (SD) Range

15.1 (34.0) 0 to 224

Duration time (days) Mean (SD) Range Dosage (mg/kg) Mean (SD) Range Accumulation Concentration Mean (SD) Range

17.4 (33.2) 0 to 180 233 (862) 0 to 5000

6.8 19.6 81.6 6.8 4.8

6.4 91.1 8.9

43.6 29.2 20.8 3.6 2.4 66.8 10.4 22.4 41.9 (48.5) 1.4 to 245

32.3 (38.9) 1 to 224 12.0 (37.7) 0 to 365 112 (519) 0 to 5000

Toxicity 1437 (6201) -3280 to 69856

Mean (SD) Range

653 654

34

ACS Paragon Plus Environment

0.12 (0.25) -1.3 to 0.95

Environmental Science & Technology

Page 36 of 45

655

Table 2. Assessments of the performance of the random forest (RF) model using the

656

R2 correlation coefficient and root mean square error (RMSE). The subset of

657

metal-free engineering nanoparticles (ENPs) in NP accumulation studies was not

658

analyzed due to the limited number of samples (n = 3). For other subsets, the number

659

of tested samples was not less than 37. a, Number of data points in the subset. b,

660

Injection exposure methods, including intramuscular, intravenous, intraperitoneal,

661

intra-articular and abdominal subcutaneous injection. c, Noninjection exposure

662

methods, including inhalation and gavage. Accumulation of ENPs Priority Factor Overall Type of ENP Exposure method Priority Factor Overall Type of ENP Toxicity indicator

Nd a

R2

242 135 104 156 86

0.744 0.634 0.757 0.769 0.758

RMSE (ng/g) 4218 392 6177 5033 851

Subset

Nd a

R2

RMSE

Metallic Metal-free Carbonaceous Carbon-free Sperm parameters Testosterone Testis index

250 194 56 37 213 168 26 56

0.624 0.605 0.553 0.485 0.619 0.543 0.471 0.383

0.198 0.215 0.131 0.091 0.212 0.181 0.424 0.111

Subset Single-metallic element Multiple-metallic element Injectionb Noninjectionc Toxicity of ENPs

663

35

ACS Paragon Plus Environment

Page 37 of 45

Environmental Science & Technology

664

Table 3. Predictions of nanomaterial accumulation and reproductive toxicity. The factors are normalized for each study as follows: rodent age, 8

665

weeks; ENP shape, particles; duration, 14 days; ENP surface ligands, none; toxicity indicator, sperm parameters. ⚐, cluster number; *, duration,

666

0 to 4 weeks; $, lowest vote rates of data points. Cp and Tp represent the predictions of the extracted data points in the RF model for the

667

accumulation and toxicity studies, respectively. Co and To represent the observations of actual data points meeting the conditions of the six

668

important attributes (ENP type, size, exposure method, exposure time, dosage, and element for accumulation studies and surface ligand for

669

toxicity studies) for the accumulation and reproductive toxicity of ENPs, respectively. Nd and Np represent the number of actual data points and

670

publications meeting the conditions of the six attributes, respectively. Cc and Tc represent the accumulation concentration and toxicity

671

distribution of each cluster, respectively. The first to third quartiles were used as the main range.

C⚐

ENP

Element

Size Exposure (nm) Method

1 3 3

Au TiO2 Ag ZnO, Fe2O3

N Y N

10-40 i.v., i.h. 5-100 ga., i.v. 15-40 ga., i.v.

Y

20-40

7*

ga.

Predictions for Accumulation Studies Exposure Dosage Time Cp (ng/g) Nd (mg/kg) (days) 1 0.01-1 4-15 31 1-90 10-200 405-1123 5 14-28 1-100 177-252 6 10-40

30-200

1562-12110

36

ACS Paragon Plus Environment

2

Np

Co (ng/g)

4 3 3

0.9-18.9 230-3860 183-287

Vote Rate$ (%) 96.3 80.0 87.0

2

200

82.5

Cc (ng/g)

2-18 3-174 3-174 130-3080

Environmental Science & Technology



C

4 7 8

ENP

Ag Ag TiO2 MWCNT, 9 GO, Ag

Predictions for Toxicity Studies Exposure Dosage Surface Size Exposure Time Tp (mg/kg) Ligand (nm) Method (days) citrate 5-40 ga. 90-365 10-150 0.04-0.06 n 10-15 i.v. 10-15 0.1-5 0.06-0.08 n 20-70 ga. 30-60 20-200 0.26-0.33 n

3-20

i.p.

1-15

0.5-70

0.14-0.19

672

37

ACS Paragon Plus Environment

Page 38 of 45

Nd

Np

To

2 3 6

1 1 4

0.00-0.01 0.00-0.03 0.14-0.42

Vote Rate$ (%) 80.0 86.0 87.0

24

3

0.01-0.15

89.0

Tc

-0.03-0.04 0.00-0.18 0.04-0.35 0.08-0.36

Page 39 of 45

Environmental Science & Technology

673

Figure Captions

674

Figure 1. Performance of the RF model. The training sets were randomly selected

675

from original datasets, and each training set was repeated ten times. a and b,

676

accumulation of ENPs; b and d, reproductive toxicity of ENPs.

677 678

Figure 2. Importance of attributes to the reproductive toxicity and accumulation

679

concentration of ENPs analyzed by the RF model. The increase in the MSE was used

680

to measure the importance of the attributes.

681 682

Figure 3. Similarity network analysis based on the proximity matrix of the RF model.

683

The present work connected adjacent nodes with higher than average values in the

684

proximity matrix, which measures the probability of two data points occurring in the

685

same node of a tree in the RF model, and vice versa. The tight connections in the

686

similarity network indicate the high homogeneity of the clusters. The clusters with

687

different colors were classified by hierarchical clustering.

688 689

Figure 4. Similarity network analysis of priority factors based on the proximity

690

matrix of the RF model. The nodes were colored according to the categories of the

691

priority factors.

38

ACS Paragon Plus Environment

Environmental Science & Technology

692

Figure 5. Bivariate partial dependence plots of ENP types, exposure methods and

693

toxicity indicators. a, Reproductive toxicity of ENPs; b, accumulation concentration

694

of ENPs.

695 696 697 698 699 700 701 702 703 704 705 706

39

ACS Paragon Plus Environment

Page 40 of 45

Page 41 of 45

Environmental Science & Technology

707 708

Figure 1.

709

40

ACS Paragon Plus Environment

Environmental Science & Technology

710 711

Figure 2.

712 713 714 715 716 717 718 719 41

ACS Paragon Plus Environment

Page 42 of 45

Page 43 of 45

Environmental Science & Technology

720 721

Figure 3. 42

ACS Paragon Plus Environment

Environmental Science & Technology

722 723

724 725

Figure 4.

43

ACS Paragon Plus Environment

Page 44 of 45

Page 45 of 45

Environmental Science & Technology

726 727

Figure 5.

728

44

ACS Paragon Plus Environment